72 Matching Annotations

Nov 2021
en.wikipedia.org en.wikipedia.org

Golden-section search - Wikipedia

1
1. motivic 12 Nov 2021
  
  in Public
  
  squared absolute error in $f(x)$ in typical cases
  
  Because it's unimodal? So a quadratic approximation works well in typical cases?
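One way to read the quoted claim: near a smooth interior minimum $x^*$ with positive second derivative, $f(x) - f(x^*) \approx \frac{1}{2} f''(x^*)(x - x^*)^2$, so an error of size $\epsilon$ in $x$ shows up as an error of order $\epsilon^2$ in $f(x)$. The sketch below is a plain golden-section search; the test function `f`, the bracket `[0, 5]`, and the tolerance are illustrative assumptions, not taken from the article.

```python
import math

INV_PHI = (math.sqrt(5) - 1) / 2  # 1/phi, roughly 0.618


def golden_section_search(f, a, b, tol=1e-8):
    """Return an approximate minimizer of a unimodal f on [a, b]."""
    c = b - INV_PHI * (b - a)
    d = a + INV_PHI * (b - a)
    fc, fd = f(c), f(d)
    while (b - a) > tol:
        if fc < fd:
            # Minimum lies in [a, d]; the old c is reused as the new d.
            b, d, fd = d, c, fc
            c = b - INV_PHI * (b - a)
            fc = f(c)
        else:
            # Minimum lies in [c, b]; the old d is reused as the new c.
            a, c, fc = c, d, fd
            d = a + INV_PHI * (b - a)
            fd = f(d)
    return (a + b) / 2


def f(x):
    # Illustrative unimodal function with minimum f(2) = 1.
    return (x - 2.0) ** 2 + 1.0


x_hat = golden_section_search(f, 0.0, 5.0, tol=1e-4)
x_err = abs(x_hat - 2.0)     # error in the argument
f_err = abs(f(x_hat) - 1.0)  # error in the function value
```

For this particular `f` the function-value error is exactly the square of the argument error, which is the quadratic behaviour the note asks about.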
Visit annotations in context

Annotators

motivic

URL

en.wikipedia.org/wiki/Golden-section_search
Sep 2021
www.csie.ntu.edu.tw www.csie.ntu.edu.tw

paper.dvi

1
1. motivic 10 Sep 2021
  
  in Public
  
  In total, this means that the dot product (i.e. the interaction) of the factor vectors of Alice and Star Trek will be similar to the one of Alice and Star Wars – which also makes intuitively sense
  
  This illustrates the power of collaborative filtering.
Visit annotations in context

Annotators

motivic

URL

csie.ntu.edu.tw/~b97053/paper/Rendle2010FM.pdf
Jul 2021
dl.acm.org dl.acm.org

Explicit or Implicit Feedback? Engagement or Satisfaction? A Field Experiment on Machine-Learning-Based Recommender Systems

3
1. motivic 15 Jul 2021
  
  in Public
  
  a
  
  $$u$$
  
  typo
2. motivic 14 Jul 2021
  
  in Public
  
  targeting user return as the objective does not significantly affect user engagement (e.g., the actual future user return and churning risk) and shows a trend of hurting perception metrics compared with the baseline
  
  Very interesting and somewhat counter-intuitive. Increased return should suggest more user engagement, no?
3. motivic 14 Jul 2021
  
  in Public
  
  user return
  
  "sessions"
Visit annotations in context

Tags

typo

Annotators

motivic

URL

dl.acm.org/doi/pdf/10.1145/3167132.3167275
Jun 2021
arxiv.org arxiv.org

Untitled document

7
1. motivic 29 Jun 2021
  
  in Public
  
  Gramian trick
  
  aka Kernel trick
2. motivic 29 Jun 2021
  
  in Public
  
  $m\,|I_c|\,q(j|c)\,\tilde{\alpha}(c,j)$
  
  Why multiply by the sample size and $I_c$?
  
  m samples in algorithm 1, but why $I_c$...
3. motivic 28 Jun 2021
  
  in Public
  
  samples $m$ negative
  
  Sample multiple negatives in order not to under-represent the negative examples?
4. motivic 28 Jun 2021
  
  in Public
  
  For example, if the batch contains positive observations (c1,i1),(c2,i2),..., then {i1,i2,...} are treated as the negatives for this batch, e.g., for c1, the negatives would be (c1,i2),(c1,i3)
  
  Interesting, how can you be sure that i2 is not relevant for c1?
5. motivic 28 Jun 2021
  
  in Public
  
  a ground truth set of relevant items {i1,i2,...}
  
  So we do not consider the order of the items.
6. motivic 28 Jun 2021
  
  in Public
  
  weak negatives are derived from all the remaining items
  
  Remaining items that the user had a chance to select.
7. motivic 28 Jun 2021
  
  in Public
  
  These embedding matrices, $W \in \mathbb{R}^{C \times d}$ and $H \in \mathbb{R}^{I \times d}$, are the model parameters $\theta$
  
  In particular, the number of items and context dimensions are fixed per model. This makes it difficult to apply the model to new items or in new contexts (e.g. recommending to new users).
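The in-batch negatives scheme quoted in annotation 4 can be sketched as follows. The function name `in_batch_negatives` and the optional `known_positives` filter are hypothetical; the filter is one possible answer to the question "how can you be sure that i2 is not relevant for c1?" and is an assumption, not something the paper is known to prescribe.

```python
def in_batch_negatives(batch, known_positives=None):
    """Build negatives for each positive (context, item) pair in a batch.

    batch: list of (context, item) positive observations.
    known_positives: optional dict mapping a context to a set of items
        already known to be relevant; those are skipped (an assumption).
    Returns a list aligned with `batch`; entry k holds the negative pairs
    for the k-th positive observation.
    """
    known_positives = known_positives or {}
    items = [item for _, item in batch]
    result = []
    for context, positive in batch:
        # Never use the positive itself, or any known positive, as a negative.
        excluded = known_positives.get(context, set()) | {positive}
        result.append([(context, item) for item in items if item not in excluded])
    return result


batch = [("c1", "i1"), ("c2", "i2"), ("c3", "i3")]
plain = in_batch_negatives(batch)
filtered = in_batch_negatives(batch, known_positives={"c1": {"i2"}})
```

Without the filter, `("c1", "i2")` is treated as a negative even if `i2` happens to be relevant for `c1`; such false negatives are simply tolerated by the plain scheme.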
Visit annotations in context

Annotators

motivic

URL

arxiv.org/pdf/2101.08769.pdf
papers.nips.cc papers.nips.cc

Learning to Rank by Optimizing NDCG Measure

7
1. motivic 25 Jun 2021
  
  in Public
  
  This modeling choice is consistent with the idea of ranking the documents with largest scores first; intuitively, the more documents in a permutation are in the decreasing order of score, the bigger the probability of the permutation is.
  
  In particular, scores themselves matter (not just the ranking).
2. motivic 25 Jun 2021
  
  in Public
  
  Proof.
  
  Jensen's Inequality
  
  Also $\langle \bullet \rangle_F$ is the expectation with respect to $\mathrm{Pr}(\pi^k | F, q^k)$.
3. motivic 25 Jun 2021
  
  in Public
  
  $Z_k$ is the normalization factor
  
  Is this the "ideal DCG" (documents ranked by relevance score), computed up to a position $p$ (a parameter)? See https://en.wikipedia.org/wiki/Discounted_cumulative_gain#Normalized_DCG
4. motivic 24 Jun 2021
  
  in Public
  
  F(d, q) the ranking function that takes a document-query pair (d, q)
  
  So F scores just one document d at a time? That is, a univariate scoring function.
5. motivic 17 Jun 2021
  
  in Public
  
  The listwise approaches can be classified into two categories
  
  Two categories of listwise approaches:
  
  Those that directly optimize the IR evaluation metrics
  
  Those that define a listwise loss function as an indirect way to optimize the IR evaluation metrics.
6. motivic 17 Jun 2021
  
  in Public
  
  main difficulty in optimizing these evaluation metrics is that they are dependent on the rank position of documents induced by the ranking function, not the numerical values output by the ranking function.
  
  The point being that the order of the updates is not what's learned directly.
7. motivic 17 Jun 2021
  
  in Public
  
  We propose a probabilistic framework that addresses this challenge by optimizing the expectation of NDCG over all the possible permutations of documents
  
  Does the model need to output the probability used to compute the NDCG?
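Annotation 3 asks whether $Z_k$ is the ideal DCG. Under the common definition of NDCG (an assumption here; the paper's exact gain, discount, and truncation may differ), the normalizer is the DCG of the relevance-sorted list, so a perfect ranking scores exactly 1. A minimal sketch with hypothetical function names:

```python
import math


def dcg(relevances, p=None):
    """Discounted cumulative gain of relevances listed in ranked order."""
    rels = relevances if p is None else relevances[:p]
    # Gain 2^r - 1, discounted by log2(1 + position), positions from 1.
    return sum((2 ** r - 1) / math.log2(k + 2) for k, r in enumerate(rels))


def ndcg(relevances, p=None):
    """DCG divided by the DCG of the ideal (relevance-sorted) ordering."""
    ideal = dcg(sorted(relevances, reverse=True), p)
    if ideal == 0:
        return 0.0  # no relevant documents: define NDCG as 0
    return dcg(relevances, p) / ideal


perfect = ndcg([3, 2, 1, 0])
reversed_order = ndcg([0, 1, 2, 3])
```

The value depends only on the order induced by the scores, which is why the metric is flat almost everywhere as a function of the scores and is hard to optimize directly.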
Visit annotations in context

Annotators

motivic

URL

papers.nips.cc/paper/2009/file/b3967a0e938dc2a6340e258630febd5a-Paper.pdf
cacm.acm.org cacm.acm.org

Deep Learning for AI

3
1. motivic 24 Jun 2021
  
  in Public
  
  learning to learn, or meta-learning
  
  Interestingly two of the three papers referenced date back to the previous millennium.
2. motivic 23 Jun 2021
  
  in Public
  
  assign an energy (that is, a badness)
  
  The term "energy" is often used in early deep learning (e.g. "energy based models"), apparently borrowed from statistical physics.
3. motivic 22 Jun 2021
  
  in Public
  
  Its meaning resides in its relationships to other symbols which can be represented by a set of symbolic expressions or by a relational graph
  
  Something akin to expert systems or Bayesian networks.
Visit annotations in context

Annotators

motivic

URL

cacm.acm.org/magazines/2021/7/253464-deep-learning-for-ai/fulltext
Jan 2021
netflixtechblog.com netflixtechblog.com

Artwork Personalization at Netflix

2
1. motivic 20 Jan 2021
  
  in Public
  
  how artwork performs in relation to other artwork we select in the same page or session
  
  This is the slate optimization problem.
2. motivic 20 Jan 2021
  
  in Public
  
  when presenting a specific piece of artwork for a title influenced a member to play (or not to play) a title and when a member would have played a title (or not) regardless of which image we presented
  
  So to establish some causality between the thumbnail shown and the user's viewing of the show.
Visit annotations in context

Annotators

motivic

URL

netflixtechblog.com/artwork-personalization-c589f074ad76
Nov 2020
iwww.corp.linkedin.com iwww.corp.linkedin.com

GLMix for Jobs Relevance - Engineering - LinkedIn Corporate Wiki

2
1. motivic 06 Nov 2020
  
  in Public
  
  even two members characterized by the similar sets of features described above may still have different preferences over jobs, due to the fact that some intrinsic difference between those two members may not be well captured by the features
  
  So this raises the question of how GLMix can capture this difference if the data does not already capture it.
2. motivic 06 Nov 2020
  
  in Public
  
  such individual-specific effects among different populations (e.g., members, jobs) are ignored in GLM
  
  Because they are drowned out by the broader pattern?
Visit annotations in context

Annotators

motivic

URL

iwww.corp.linkedin.com/wiki/cf/display/ENGS/GLMix+for+Jobs+Relevance
Apr 2020
ai.googleblog.com ai.googleblog.com

An Optimistic Perspective on Offline Reinforcement Learning

1
1. motivic 30 Apr 2020
  
  in Public
  
  based on logged experiences of a DQN agent
  
  So a different DQN agent is used to generate the logging data?
Visit annotations in context

Annotators

motivic

URL

ai.googleblog.com/2020/04/an-optimistic-perspective-on-offline.html
Aug 2019
en.wikipedia.org en.wikipedia.org

GloVe (machine learning) - Wikipedia

1
1. motivic 12 Aug 2019
  
  in Public
  
  As log-bilinear regression model for unsupervised learning of word representations, it combines the features of two model families, namely the global matrix factorization and local context window methods
  
  What does "log-bilinear regression" mean exactly?
  
  question, machine learning
Visit annotations in context

Tags

question

machine learning

Annotators

motivic

URL

en.wikipedia.org/wiki/GloVe_(machine_learning)
Jul 2019
en.wikipedia.org en.wikipedia.org

Oblivious data structure - Wikipedia

1
1. motivic 17 Jul 2019
  
  in Public
  
  An Oblivious Tree is a rooted tree with the following property: All the leaves are in the same level. All the internal nodes have degree at most 3. Only the nodes along the rightmost path in the tree may have degree of one.
  
  Note this is not the definition of the oblivious decision trees in the CatBoost paper.
  
  There, an oblivious decision tree means a tree in which every internal node within a given level uses the same split (feature and threshold), and all leaves are at the same level.
  
  See: https://stats.stackexchange.com/questions/353172/what-is-oblivious-decision-tree-and-why
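A minimal sketch of prediction with an oblivious decision tree in the CatBoost sense described in the note above. Because every level shares one split, the tree reduces to a list of `(feature, threshold)` pairs plus a table of `2 ** depth` leaf values; the names and the example numbers are hypothetical.

```python
def oblivious_tree_predict(x, splits, leaf_values):
    """Evaluate an oblivious decision tree on one feature vector.

    splits: list of (feature_index, threshold), one entry per tree level.
    leaf_values: list of length 2 ** len(splits).
    """
    index = 0
    for feature, threshold in splits:
        # Each level contributes one bit: 1 if the feature exceeds the threshold.
        index = (index << 1) | int(x[feature] > threshold)
    return leaf_values[index]


splits = [(0, 0.5), (2, 1.0)]          # level 0 tests x[0], level 1 tests x[2]
leaf_values = [10.0, 20.0, 30.0, 40.0]  # indexed by the bit pattern of answers

pred_a = oblivious_tree_predict([0.7, 0.0, 0.3], splits, leaf_values)
pred_b = oblivious_tree_predict([0.1, 0.0, 2.0], splits, leaf_values)
```

The leaf lookup is a bit pattern rather than a pointer walk, which is one reason this tree shape is fast to evaluate.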
Visit annotations in context

Annotators

motivic

URL

en.wikipedia.org/wiki/Oblivious_data_structure
Jun 2019
arxiv.org arxiv.org

1803.05170.pdf

5
1. motivic 26 Jun 2019
  
  in Public
  
  features (sparse)
  
  are these feature values or actual features?
  
  question
2. motivic 26 Jun 2019
  
  in Public
  
  Note that the scalar multiple does not mean $x_k$ is linear with $x_0$
  
  x_k is not a linear function of x_0
  
  correction
3. motivic 26 Jun 2019
  
  in Public
  
  We argue that the CrossNet learns a special type of high-order feature interactions, where each hidden layer in the CrossNet is a scalar multiple of $x_0$
  
  In that case CrossNet doesn't really learn anything?
  
  question
4. motivic 26 Jun 2019
  
  in Public
  
  multivalent,
  
  takes on more than one value
  
  definition
5. motivic 26 Jun 2019
  
  in Public
  
  univalent,
  
  takes on a unique value
  
  definition
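Annotations 2 and 3 above concern the claim that every CrossNet layer is a scalar multiple of $x_0$. A small numerical check, assuming the bias-free cross layer $x_{k+1} = x_0 (x_k^T w_{k+1}) + x_k$: by induction $x_k = \alpha_k x_0$ with $\alpha_{k+1} = \alpha_k (x_0^T w_{k+1} + 1)$, so the scalar itself depends on $x_0$ and on the learned weights. The dimensions and random weights below are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
d, depth = 5, 3
x0 = rng.normal(size=d)
weights = [rng.normal(size=d) for _ in range(depth)]


def cross_layers(x0, weights):
    """Bias-free cross layers: x_{k+1} = x_0 * (x_k . w_{k+1}) + x_k."""
    xk = x0
    outputs = []
    for w in weights:
        xk = x0 * (xk @ w) + xk
        outputs.append(xk)
    return outputs


def scale_relative_to(x0, xk):
    """Least-squares alpha such that xk is approximately alpha * x0."""
    return (xk @ x0) / (x0 @ x0)


outs = cross_layers(x0, weights)
alphas = [scale_relative_to(x0, xk) for xk in outs]

# Doubling the input does not simply double the output: the map is not linear.
doubled = cross_layers(2 * x0, weights)[-1]
```

So the layers do learn something, but only a scalar that varies with the input, which limits the kind of interactions the bias-free cross network can express.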
Visit annotations in context

Tags

definition

question

correction

Annotators

motivic

URL

arxiv.org/pdf/1803.05170.pdf
Mar 2019
docs.python.org docs.python.org

8.5. heapq — Heap queue algorithm — Python 3.6.7 documentation

1
1. motivic 28 Mar 2019
  
  in Public
  
  heap.sort() maintains the heap invariant
  
  Sorting may move nodes to different positions, but a sorted array still satisfies the min-heap invariant.
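A quick check of the quoted statement. `is_min_heap` is a hypothetical helper that tests the invariant `a[(i - 1) // 2] <= a[i]` used by `heapq` for a zero-based list; the random data is illustrative.

```python
import heapq
import random


def is_min_heap(a):
    """True if every element is >= its parent in the implicit binary tree."""
    return all(a[(i - 1) // 2] <= a[i] for i in range(1, len(a)))


random.seed(0)
data = [random.randint(0, 100) for _ in range(50)]

heap = list(data)
heapq.heapify(heap)
valid_before_sort = is_min_heap(heap)

heap.sort()  # a sorted list has parent <= child everywhere
valid_after_sort = is_min_heap(heap)

# heapq operations keep working on the sorted list.
heapq.heappush(heap, -1)
smallest = heapq.heappop(heap)
```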
Visit annotations in context

Annotators

motivic

URL

docs.python.org/3/library/heapq.html
medium.com medium.com

Google Interview Problems: Synonymous Queries – Alex Golec – Medium

1
1. motivic 08 Mar 2019
  
  in Public
  
  The most common error I see is a subconscious assumption that each word can have at most one synonym
  
  Use sets as the value.
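A minimal sketch of the "sets as the value" idea, so that a word can have any number of synonyms. The function names are hypothetical, and the sketch assumes synonymy is symmetric but not transitive; whether transitivity is wanted depends on the problem statement.

```python
from collections import defaultdict


def build_synonym_map(pairs):
    """Map each word to the set of all words declared synonymous with it."""
    synonyms = defaultdict(set)
    for a, b in pairs:
        synonyms[a].add(b)
        synonyms[b].add(a)
    return synonyms


def are_synonymous(a, b, synonyms):
    return a == b or b in synonyms.get(a, set())


def queries_match(q1, q2, synonyms):
    """Two queries match if they align word by word up to synonyms."""
    w1, w2 = q1.split(), q2.split()
    return len(w1) == len(w2) and all(
        are_synonymous(a, b, synonyms) for a, b in zip(w1, w2)
    )


syn = build_synonym_map([("rate", "ratings"), ("rate", "score")])
```

With a plain `dict[str, str]`, the second pair would overwrite the first and `"rate"` would lose `"ratings"` as a synonym.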
Visit annotations in context

Annotators

motivic

URL

medium.com/@alexgolec/google-interview-problems-synonymous-queries-36425145387c
Oct 2018
en.wikipedia.org en.wikipedia.org

Perplexity - Wikipedia

1
1. motivic 29 Oct 2018
  
  in Public
  
  The perplexity of the model q is defined as $b^{-\frac{1}{N}\sum_{i=1}^{N}\log_{b} q(x_{i})}$
  
  The perplexity formula is missing the probability distribution $p$
  
  typos
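One possible reading of the quoted formula: the test samples $x_1, \dots, x_N$ are drawn from $p$, so the average over samples is an empirical estimate of the expectation under $p$, and $p$ enters through the sample rather than explicitly. A minimal sketch, with a hypothetical function name and illustrative probabilities:

```python
import math


def perplexity(model_probs, base=2.0):
    """Perplexity of a model given q(x_i) for each test sample x_i.

    model_probs: the probabilities q(x_i) the model assigns to the
        observed samples (all must be > 0).
    """
    n = len(model_probs)
    avg_log = sum(math.log(q, base) for q in model_probs) / n
    return base ** (-avg_log)


uniform_over_four = perplexity([0.25] * 8)
mixed = [0.5, 0.25, 0.125, 0.125]
ppl_base2 = perplexity(mixed, base=2.0)
ppl_base_e = perplexity(mixed, base=math.e)
```

A model that assigns probability 1/4 to every sample has perplexity 4, and the result does not depend on the base $b$.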
Visit annotations in context

Tags

typos

Annotators

motivic

URL

en.wikipedia.org/wiki/Perplexity
en.wikipedia.org en.wikipedia.org

Matrix factorization (recommender systems) - Wikipedia

3
1. motivic 19 Oct 2018
  
  in Public
  
  It has been demonstrated that this formulation is almost equivalent to a SLIM model,[9] which is an item-item model based recommender
  
  So a pre-trained item model can be used to make such recommendations.
  
  recommender systems, sequence model
2. motivic 19 Oct 2018
  
  in Public
  
  The user's latent factors represent the preference of that user for the corresponding item's latent factors
  
  The higher the value of the dot product between the two, the higher the preference.
  
  recommender systems, inner product
3. motivic 19 Oct 2018
  
  in Public
  
  two lower dimensional matrices
  
  Not necessarily (in fact, usually not) square. Typically each user is represented by a vector of dimension strictly less than the number of items, and vice versa.
  
  recommender systems
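The shapes discussed in annotations 2 and 3 can be made concrete with a small sketch. The sizes and random factors are illustrative assumptions; in practice `U` and `V` would be learned from the rating matrix.

```python
import numpy as np

rng = np.random.default_rng(0)
n_users, n_items, k = 6, 8, 3  # k is the latent dimension, k < n_users, n_items

U = rng.normal(size=(n_users, k))  # one latent vector per user (tall, not square)
V = rng.normal(size=(n_items, k))  # one latent vector per item (tall, not square)

# Predicted preference of every user for every item: dot products of factors.
scores = U @ V.T

# A higher dot product means a higher predicted preference.
top_item_for_user0 = int(np.argmax(scores[0]))
```

The full `n_users x n_items` score matrix has rank at most `k`, which is the sense in which the two factor matrices are "lower dimensional".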
Visit annotations in context

Tags

sequence model

recommender systems

inner product

Annotators

motivic

URL

en.wikipedia.org/wiki/Matrix_factorization_(recommender_systems)
karpathy.github.io karpathy.github.io

The Unreasonable Effectiveness of Recurrent Neural Networks

1
1. motivic 17 Oct 2018
  
  in Public
  
  are are
  
  *are
  
  typos
Visit annotations in context

Tags

typos

Annotators

motivic

URL

karpathy.github.io/2015/05/21/rnn-effectiveness/
redis.io redis.io

Redis cluster tutorial – Redis

1
1. motivic 10 Oct 2018
  
  in Public
  
  it will not try to start a failover if the master link was disconnected for more than the specified amount of time
  
  Why would it exhibit this behavior? Is it because a slave that's disconnected from the master for too long has stale data? Or is it because the slave may be failing as well?
  
  question
Visit annotations in context

Tags

question

Annotators

motivic

URL

redis.io/topics/cluster-tutorial
mp.weixin.qq.com mp.weixin.qq.com

机器之心

2
1. motivic 05 Oct 2018
  
  in Public
  
  会话
  
  Session
2. motivic 05 Oct 2018
  
  in Public
  
  Python
  
  typos
Visit annotations in context

Tags

typos

Annotators

motivic

URL

mp.weixin.qq.com/s
Sep 2018
docs.pymc.io docs.pymc.io

Marginal Likelihood Implementation — PyMC3 3.5 documentation

2
1. motivic 25 Sep 2018
  
  in Public
  
  conditional distribution for individual components can be constructed
  
  So the conditional distribution is conditioned on other components?
  
  bayesian, gaussian processes
2. motivic 24 Sep 2018
  
  in Public
  
  p(y∣x)=∫p(y∣f,x)p(f∣x)df
  
  $y$ is the data, $f$ is the model, $x$ is the input variable
  
  bayesian, marginal likelihood
Visit annotations in context

Tags

marginal likelihood

gaussian processes

bayesian

Annotators

motivic

URL

docs.pymc.io/notebooks/GP-Marginal.html
am207.github.io am207.github.io

Inference for GPs

3
1. motivic 24 Sep 2018
  
  in Public
  
  marginaly
  
  *marginal
  
  typos
2. motivic 24 Sep 2018
  
  in Public
  
  corvariance
  
  *covariance
  
  typos
3. motivic 24 Sep 2018
  
  in Public
  
  y=y1,…,yn=m
  
  $n = m$
  
  typos
Visit annotations in context

Tags

typos

Annotators

motivic

URL

am207.github.io/2017/wiki/gp3.html
am207.github.io am207.github.io

Gaussian Processes and 'Non-parametric' Bayes

9
1. motivic 22 Sep 2018
  
  in Public
  
  must store an amount of information which increases with the size of the data
  
  Or you can use MCMC.
2. motivic 21 Sep 2018
  
  in Public
  
  some
  
  *sum
  
  typos
3. motivic 19 Sep 2018
  
  in Public
  
  calculation once again involves inverting a NxN matrix as in the kernel space representation of regression
  
  this is why we use MCMC or other distribution sampling technique instead
  
  gaussian processes
4. motivic 19 Sep 2018
  
  in Public
  
  $f(x_*)$ for a test vector input $x_*$, given a training set X with values y for the GP is once again a gaussian given by equation C with a mean vector $m_*$ and covariance matrix $k_*$:
  
  ...$f(x)$ for a test vector input $x$, given a training set $X$ with values $y$ for the GP is once again a gaussian given by equation C with a mean vector $m$ and covariance matrix $k$:
  
  clarification
5. motivic 18 Sep 2018
  
  in Public
  
  corvariance
  
  *covariance
  
  typos
6. motivic 18 Sep 2018
  
  in Public
  
  in equation B for the marginal of a gaussian, only the covariance of the block of the matrix involving the unmarginalized dimensions matters! Thus “if you ask only for the properties of the function (you are fitting to the data) at a finite number of points, then inference in the Gaussian process will give you the same answer if you ignore the infinitely many other points, as if you would have taken them all into account!” (Rasmussen)
  
  key insight into Gaussian processes
  
  insights, machine learning, bayesian, gaussian processes
7. motivic 18 Sep 2018
  
  in Public
  
  they
  
  *the
  
  typos
8. motivic 18 Sep 2018
  
  in Public
  
  im
  
  *in
  
  typos
9. motivic 18 Sep 2018
  
  in Public
  
  Notice now that the features only appear in the combination $\kappa(x,x') = x^T \Sigma x'$, thus leading to writing the posterior predictive as $$p(f(x_*) | x_* , X, y) = N\left(\kappa(x_*,X) \left(\kappa(X^T,X) + \sigma^2 I\right)^{-1}y,\,\,\, \kappa(x_*,x_*) - \kappa(x_*,X^T)\left(\kappa(X^T,X) + \sigma^2 I\right)^{-1} \kappa(X^T,x_*) \right)$$ The function $\kappa$ is called the kernel
  
  how the kernel came about?
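The posterior predictive quoted in annotation 9 only ever touches the data through kernel evaluations, which is what allows any valid kernel to be swapped in for $x^T \Sigma x'$. A minimal NumPy sketch under stated assumptions: the RBF kernel, its length scale, the noise variance, and the toy data are illustrative choices, and the function names are hypothetical. It uses a Cholesky factorization in place of the explicit $N \times N$ inverse mentioned in annotation 3.

```python
import numpy as np


def rbf_kernel(A, B, length_scale=1.0):
    """Squared-exponential kernel between the rows of A and the rows of B."""
    sq_dist = (
        np.sum(A ** 2, axis=1)[:, None]
        + np.sum(B ** 2, axis=1)[None, :]
        - 2 * A @ B.T
    )
    return np.exp(-0.5 * sq_dist / length_scale ** 2)


def gp_posterior(X, y, X_star, noise_var=1e-2, kernel=rbf_kernel):
    """Posterior predictive mean and covariance of f at the rows of X_star."""
    K = kernel(X, X) + noise_var * np.eye(len(X))  # kappa(X, X) + sigma^2 I
    K_s = kernel(X_star, X)                        # kappa(x_*, X)
    K_ss = kernel(X_star, X_star)                  # kappa(x_*, x_*)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))  # K^{-1} y
    mean = K_s @ alpha
    v = np.linalg.solve(L, K_s.T)
    cov = K_ss - v.T @ v                           # K_ss - K_s K^{-1} K_s^T
    return mean, cov


X = np.linspace(0, 5, 8)[:, None]  # toy training inputs
y = np.sin(X).ravel()              # toy training targets
X_star = np.array([[2.5]])         # a single test input
mean, cov = gp_posterior(X, y, X_star)
```

The cost is dominated by factorizing the $N \times N$ matrix, which is the scaling issue the earlier annotations point at.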
Visit annotations in context

Tags

typos

insights

machine learning

bayesian

gaussian processes

clarification

Annotators

motivic

URL

am207.github.io/2017/wiki/GP2.html
am207.github.io am207.github.io

The idea behind the GP

2
1. motivic 18 Sep 2018
  
  in Public
  
  generate
  
  more like "sample"
2. motivic 18 Sep 2018
  
  in Public
  
  f∗
  
  $f^*$ denotes the model
Visit annotations in context

Annotators

motivic

URL

am207.github.io/2017/wiki/GP1.html
research.fb.com research.fb.com

Efficient tuning of online systems using Bayesian optimization

1
1. motivic 18 Sep 2018
  
  in Public
  
  Bayesian approach to handling observation noise
  
  One core contribution of this work.
Visit annotations in context

Annotators

motivic

URL

research.fb.com/efficient-tuning-of-online-systems-using-bayesian-optimization/
mp.weixin.qq.com mp.weixin.qq.com

腾讯AI实验室

1
1. motivic 17 Sep 2018
  
  in Public
  
  通道剪枝算法
  
  channel pruning algorithm
Visit annotations in context

Annotators

motivic

URL

mp.weixin.qq.com/s/qWKZpb95pQgezGTY3UE4xQ
Aug 2018
en.wikipedia.org en.wikipedia.org

Law of large numbers - Wikipedia

1
1. motivic 24 Aug 2018
  
  in Public
  
  expected values change during the series
  
  So no longer identically distributed
Visit annotations in context

Annotators

motivic

URL

en.wikipedia.org/wiki/Law_of_large_numbers
www.fast.ai www.fast.ai

An Opinionated Introduction to AutoML and Neural Architecture Search · fast.ai

1
1. motivic 06 Aug 2018
  
  in Public
  
  To learn a network for Cifar-10, DARTS takes just 4 GPU days, compared to 1800 GPU days for NASNet and 3150 GPU days for AmoebaNet
  
  What about in comparison to ENAS?
Visit annotations in context

Annotators

motivic

URL

fast.ai/2018/07/16/auto-ml2/
Jul 2018
spark.apache.org spark.apache.org

Spark Programming Guide - Spark 2.1.1 Documentation

1
1. motivic 19 Jul 2018
  
  in Public
  
  partitioner
  
  How to define a partitioner?
Visit annotations in context

Annotators

motivic

URL

spark.apache.org/docs/2.1.1/programming-guide.html
mp.weixin.qq.com mp.weixin.qq.com

SigAI

3
1. motivic 10 Jul 2018
  
  in Public
  
  极大极小(Max-min)博弈
  
  Choose D to maximally discriminate D vs G and at the same time learn the real data; choose G to best "confuse" D.
2. motivic 10 Jul 2018
  
  in Public
  
  交叉熵
  
  Cross entropy
3. motivic 10 Jul 2018
  
  in Public
  
  零和博弈
  
  Zero-sum game
Visit annotations in context

Annotators

motivic

URL

mp.weixin.qq.com/s/e9wMKj8SgjtEWB9U7MM-9w
Jun 2018
people.cs.bris.ac.uk people.cs.bris.ac.uk

ICML'04 tutorial on ROC analysis

1
1. motivic 12 Jun 2018
  
  in Public
  
  isometrics in ROC space
  
  What does this mean exactly?
Visit annotations in context

Annotators

motivic

URL

people.cs.bris.ac.uk/~flach/ICML04tutorial//
Local file Local file

Neural Network FAQ, part 1 of 7: Introduction

1
1. motivic 12 Jun 2018
  
  in Public
  
  you will have to do a lot of work to select appropriate input data and to code the data as numeric values
  
  Not really anymore with the advent of convolutional neural networks.
Annotators

motivic
ai.intel.com ai.intel.com

Guest Post (Part I): Demystifying Deep Reinforcement Learning - Intel AI

1
1. motivic 12 Jun 2018
  
  in Public
  
  next state s’
  
  Is the next state s' the state reached by taking the action with the highest reward?
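In standard tabular Q-learning, $s'$ is simply whatever state the environment returns after the agent takes action $a$ in state $s$; the maximum is then taken over the actions available in $s'$, not used to choose $s'$. A minimal sketch of one update, with hypothetical state and action names and illustrative learning rate and discount:

```python
def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """One Q-learning update for the observed transition (s, a, r, s_next).

    Q: dict mapping state -> dict mapping action -> value.
    s_next is the state the environment actually moved to; it is observed,
    not chosen by maximizing anything.
    """
    # Best value achievable from the next state (0 if it has no actions).
    best_next = max(Q[s_next].values()) if Q[s_next] else 0.0
    target = r + gamma * best_next
    Q[s][a] += alpha * (target - Q[s][a])
    return Q[s][a]


Q = {
    "s0": {"left": 0.0, "right": 0.0},
    "s1": {"left": 1.0, "right": 2.0},
}
new_value = q_update(Q, "s0", "right", r=1.0, s_next="s1")
```

Here the target is `1.0 + 0.9 * 2.0`, and the stored value moves a fraction `alpha` of the way toward it.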
Visit annotations in context

Annotators

motivic

URL

ai.intel.com/demystifying-deep-reinforcement-learning/
May 2018
blog.cloudera.com blog.cloudera.com

How-to: Tune Your Apache Spark Jobs (Part 2) - Cloudera Engineering Blog

1
1. motivic 24 May 2018
  
  in Public
  
  The number of tasks is the single most important parameter.
Visit annotations in context

Annotators

motivic

URL

blog.cloudera.com/blog/2015/03/how-to-tune-your-apache-spark-jobs-part-2/

Johnson

Mathematician, Data Scientist, Hacker, Triathlete

Annotations: 72

Joined: September 19, 2016

Location: Mountain View, CA

Link: linkedin.com/in/johnsonjia/

ORCID: 0000-0001-8451-564X
