- Aug 2016
-
-
Code on GitHub: https://github.com/mzhai2/aaai16
-
- Jul 2016
-
web.stanford.edu
-
relational meanings
"to capture linguistic regularities as relations between vectors", IMHO
-
meanings
add full stop
-
different
typo - difference
-
- Jun 2016
-
aclweb.org
-
Neural Word Embedding Methods
-
Thus, we basically need to re-train
... in order to achieve what? The statement doesn't seem complete.
Perhaps "when we need lower dimensional embeddings with d = D', we can't obtain them from higher dimensional embeddings with d = D."?
However, it is possible, to a certain extent, to obtain lower dimensional embeddings from higher dimensional ones, e.g. via PCA (see the sketch below).
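For illustration, a minimal numpy sketch (the embedding matrix and the `reduce_embeddings` helper are made up, not from the paper): project D-dimensional embeddings onto their top D' principal components instead of re-training.

```python
import numpy as np

def reduce_embeddings(emb: np.ndarray, d_new: int) -> np.ndarray:
    """Project D-dimensional embeddings onto their top d_new principal components."""
    centered = emb - emb.mean(axis=0)
    # Rows of Vt are the principal directions of the centered embedding matrix.
    _, _, Vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ Vt[:d_new].T        # shape: (vocab, d_new)

# Made-up data: 10k "words", 300 -> 50 dimensions.
emb = np.random.default_rng(0).normal(size=(10_000, 300))
small = reduce_embeddings(emb, 50)
print(small.shape)                        # (10000, 50)
```

Of course, this only preserves the directions of largest variance; whether the reduced vectors are still useful for a given task has to be checked empirically.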
-
dimension of embedding vectors strongly depends on applications and uses, and is basically determined based on the performance and memory space (or calculation speed) trade-off
-
-
arxiv.org
-
remove second-order dependencies
What is meant by this? (Presumably the pairwise correlations captured by the covariance matrix, which PCA removes by decorrelating the data.)
-
it reveals simple underlying structures in complex data sets using analytical solutions from linear algebra
-
the third definition
The definitions are not numbered. It would be nice to have them numbered.
-
ûᵢ · ûⱼ
These u vectors are orthogonal.
-
(XᵀX) v̂ᵢ = λᵢ v̂ᵢ
Which means that transforming the vector v̂ᵢ with the matrix XᵀX gives a vector with the same direction, only scaled by λᵢ. The direction does not change after the transformation, so v̂ᵢ is an eigenvector.
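A quick numpy check of this on made-up data (nothing here comes from the paper beyond the equation itself):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))               # 100 samples, 5 features (toy data)

# X^T X is symmetric, so eigh applies; columns of eigvecs are the v̂_i.
eigvals, eigvecs = np.linalg.eigh(X.T @ X)
v, lam = eigvecs[:, 0], eigvals[0]

# (X^T X) v̂ = λ v̂ : same direction, only scaled.
assert np.allclose(X.T @ X @ v, lam * v)
```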
-
PCA and in the process, find that PCA is closely related to
-
subtracting off the mean
Actually, this is a requirement for computing the covariance matrix Cx.
-
entails
"entails" is an unfortunate choice of words. "implies" / "includes" / "requires" perhaps?
-
It is evident that the choice of P diagonalizes C_Y
That is, we have found that, by selecting the rows of P to be the eigenvectors of C_X (i.e. P = Eᵀ), we get what we wanted: the matrix C_Y is diagonal.
-
C_Y
We want this to be a diagonal matrix, which would mean that the matrix Y is decorrelated (see the sketch below).
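A small numpy sketch of this step on random data, following the paper's convention that rows are measurement types and columns are samples (the sizes and the mixing matrix are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(1)
Z = rng.normal(size=(5, 1000))
A = rng.normal(size=(5, 5))
X = A @ Z                                    # 5 correlated measurement types, 1000 samples
X = X - X.mean(axis=1, keepdims=True)        # subtract off the mean

C_X = X @ X.T / (X.shape[1] - 1)             # covariance matrix of X
_, E = np.linalg.eigh(C_X)                   # columns of E: eigenvectors of C_X
P = E.T                                      # rows of P: the new basis vectors

Y = P @ X                                    # re-expressed data
C_Y = Y @ Y.T / (Y.shape[1] - 1)

# C_Y is (numerically) diagonal: Y is decorrelated.
print(np.max(np.abs(C_Y - np.diag(np.diag(C_Y)))))
```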
-
orthonormal matrix
-
the number of measurement types
That is, the number of features.
-
judicious
prudent, sensible.
-
bely
What does "bely" mean?
-
by a simple algorithm
-
normalized direction
A vector (direction vector) with norm = 1.
-
Y is decorrelated
The features of the output matrix Y are not correlated. Building a covariance matrix for it would yield a diagonal matrix.
-
variance
Elements on the diagonal of the matrix.
-
covariance
The off-diagonal elements of the matrix.
-
large values correspond to interesting structure
Features with high variance. Directions of major spread.
-
arises from estimation theory
-
measurement types
aka features
-
The covariance measures the degree of the linear relationship between two variables
-
Because one can calculate r1 from r2
Because there is a simple (almost linear in our case) relationship between the two variables.
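A tiny numpy illustration with made-up r1, r2 (r2 is almost exactly a linear function of r1):

```python
import numpy as np

rng = np.random.default_rng(2)
r1 = rng.normal(size=1000)
r2 = 2.0 * r1 + 0.01 * rng.normal(size=1000)   # r2 is recoverable from r1

print(np.cov(r1, r2)[0, 1])        # large covariance
print(np.corrcoef(r1, r2)[0, 1])   # correlation ~ 1: recording both is redundant
```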
-
is in meters and x̃_A is in inches.
Again, this might be so, but it is quite an ambiguous statement, since we see a decreasing function on the plot.
-
nearby
"nearby" would make sense if the right-most plot of Fig. 3 shows the first diagonal, which it doesn't.
Or perhaps "nearby", but one of the cameras is upside down.
All in all, quite ambiguous statement.
-
correlated
There is a nice image on Wikipedia, in the article about correlation.
-
Figure 3
The redundancy example is not (or at least does not seem to be) in the context of the spring-and-ball example. Since there is no clear separation between the examples, this might be confusing to readers.
-
multiple sensors record the samedynamic information
More features refer to the same (or almost the same) thing.
-
best-fit line
But not as in linear regression / ordinary least squares (the principal direction minimizes perpendicular distances, not vertical ones).
Nice animation on stats.
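A rough numpy comparison on synthetic 2D data: the OLS slope (minimizing vertical distances) and the slope of the first principal direction (the largest-variance direction) come out close here, but they are not the same line.

```python
import numpy as np

rng = np.random.default_rng(3)
x = rng.normal(size=200)
y = 0.5 * x + rng.normal(scale=0.3, size=200)
xc, yc = x - x.mean(), y - y.mean()

ols_slope = (xc @ yc) / (xc @ xc)            # ordinary least squares

X = np.column_stack([xc, yc])
_, _, Vt = np.linalg.svd(X, full_matrices=False)
pca_slope = Vt[0, 1] / Vt[0, 0]              # direction of largest variance

print(ols_slope, pca_slope)                  # similar, but not equal
```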
-
Maximizing the variance (and by assumption the SNR) corresponds to finding the appropriate rotation of the naive basis
PCA relates to rotation.
-
the dynamics of interest exist along directions with largest variance and presumably highest SNR
-
directions with largest variances in our measurement space contain the dynamics of interest
We seek new features (new directions) which best contain the information (variance) of interest.
Amount of variance -> amount of information.
-
rotation and a stretch
-
how do we get from this data
How to reduce the 6D data set to a 1D data set? How to discover the regularities in the data set and achieve dimensionality reduction?
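A toy numpy sketch of this (the "camera" directions and the noise level are invented): 6 measured features, but only one underlying degree of freedom, which PCA recovers as a dominant first component.

```python
import numpy as np

rng = np.random.default_rng(4)
t = np.linspace(0, 10, 720)
z = np.cos(2 * np.pi * t)                    # the single underlying motion (1D)

directions = rng.normal(size=6)              # how each of the 6 features sees the motion
X = np.outer(z, directions) + 0.05 * rng.normal(size=(len(t), 6))   # samples x features

Xc = X - X.mean(axis=0)
_, s, _ = np.linalg.svd(Xc, full_matrices=False)
print(s**2 / np.sum(s**2))                   # first component carries nearly all the variance
```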
-
our measurements might not even be 90°
The features are not orthogonal. Information brought by distinct measurements is overlapping.
-
non-parametric method
It does not make any assumptions about the distribution of the data.
-
ball’s position in a three-dimensional space
ball's position = a data sample
three-dimensional space = the physical space the ball moves in; the measurement (feature) space has 3 x 2 = 6 features, because each camera records in 2D. The time dimension is not recorded, since it is, actually, the index of a data sample.
Some of these features (dimensions) are not necessary (they are redundant).
-
does not lie along the basis of the recording (x_A, y_A) but rather along the best-fit line
Ambiguous statement. A "direction" cannot lie along a "basis". Perhaps "basis vectors"?
Also, "best-fit line" usually refers to a line found via least-squares regression, which is not the case here (PCA versus linear regression).
-
largest direction of variance
"direction of largest variance" perhaps?
-
are a set of new basis vectors
This means that P is an orthogonal matrix.
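A quick toy check that such a P (rows = eigenvectors of C_X) is orthonormal, i.e. a rotation/reflection that preserves lengths:

```python
import numpy as np

rng = np.random.default_rng(5)
X = rng.normal(size=(4, 500))
_, E = np.linalg.eigh(np.cov(X))
P = E.T                                      # rows of P: the new basis vectors

assert np.allclose(P @ P.T, np.eye(4))       # orthonormal rows
x = X[:, 0]
assert np.allclose(np.linalg.norm(P @ x), np.linalg.norm(x))   # length preserved
```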
-
new representation of that data set
The original data, expressed in a different basis.
-
basis
New basis, right?
-
Thus our original basis reflects the method we measured our data
-
some orthonormal basis
PCA will uncover a smaller, better, orthonormal basis.
-
the number of measurement types
That is, the number of features.
-
72000 of these vectors
The data matrix. We apply PCA on this.
-
structure
And, hopefully, the structure can be expressed in a lower-dimensional space (1D in our case).
-
noise
AFAIK PCA works well when the noise is Gaussian.
-
variable x
Unfortunate labelling of the variable; x would be time, actually.
To do: don't name the variable, it's not necessary.
-