2 Matching Annotations
  1. Apr 2017
    1. $J^{(t)}_{\text{NEG}} = \log Q_\theta(D=1 \mid \text{the}, \text{quick}) + \log\big(Q_\theta(D=0 \mid \text{sheep}, \text{quick})\big)$

      Objective used to learn θ: it is maximized when the model assigns high probability to real context words and low probability to noise words. The expression reads as the log-probability of predicting 'the' (a real context word) from 'quick' (the target word), plus the log-probability of *not* predicting 'sheep' (a sampled noise word) from 'quick'.
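
      A minimal sketch of this objective for a single training pair, written in plain NumPy rather than the tutorial's TensorFlow code; the toy embedding vectors and the sigmoid scoring of dot products are assumptions for illustration:

      ```python
      # Sketch of the per-example negative-sampling objective J_NEG,
      # assuming toy embeddings (not the tutorial's implementation).
      import numpy as np

      def sigmoid(x):
          return 1.0 / (1.0 + np.exp(-x))

      rng = np.random.default_rng(0)
      dim = 8
      quick = rng.normal(size=dim)  # embedding of the target word 'quick'
      the = rng.normal(size=dim)    # embedding of the real context word 'the'
      sheep = rng.normal(size=dim)  # embedding of the noise word 'sheep'

      # Q_theta(D=1 | w, quick): binary logistic score of the dot product.
      q_real = sigmoid(the @ quick)     # should be pushed toward 1
      q_noise = sigmoid(sheep @ quick)  # should be pushed toward 0

      # J_NEG = log Q(D=1 | the, quick) + log Q(D=0 | sheep, quick)
      j_neg = np.log(q_real) + np.log(1.0 - q_noise)
      print(j_neg)  # training adjusts theta to maximize this quantity
      ```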

    2. Algorithmically, these models are similar, except that CBOW predicts target words (e.g. 'mat') from source context words ('the cat sits on the'), while the skip-gram does the inverse and predicts source context-words from the target words. This inversion might seem like an arbitrary choice, but statistically it has the effect that CBOW smoothes over a lot of the distributional information (by treating an entire context as one observation)
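
      A minimal sketch of this inversion, assuming a toy window of one word on each side; the pair-generation loop is illustrative, not the tutorial's data pipeline:

      ```python
      # Build CBOW and skip-gram training examples from one sentence,
      # assuming a context window of one word on each side.
      sentence = "the quick brown fox jumped over the lazy dog".split()

      cbow_pairs = []      # (context words) -> target word
      skipgram_pairs = []  # target word -> each context word
      for i in range(1, len(sentence) - 1):
          context = (sentence[i - 1], sentence[i + 1])
          target = sentence[i]
          # CBOW treats the whole context as one observation of the target,
          # which is what smooths over the distributional information.
          cbow_pairs.append((context, target))
          # Skip-gram inverts this: one example per (target, context) pair.
          for c in context:
              skipgram_pairs.append((target, c))

      print(cbow_pairs[0])       # (('the', 'brown'), 'quick')
      print(skipgram_pairs[:2])  # [('quick', 'the'), ('quick', 'brown')]
      ```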