 Mar 2019
 Oct 2018

www.slideshare.net www.slideshare.net

Do neural networks dream of semantics?
Neural networks in visual analysis, linguistics Knowledge graph applications
 Data integration,
 Visualization
 Exploratory search
 Question answering
Future goals: neurosymbolic integration (symbolic reasoning and machine learning)

 Aug 2017

arxiv.org arxiv.org

This is a very easy paper to follow, but it looks like their methodology is a simple way to improve performance on limited data. I'm curious how well this is reproduced elsewhere.


cs231n.github.io cs231n.github.io

The takeaway is that you should not be using smaller networks because you are afraid of overfitting. Instead, you should use as big of a neural network as your computational budget allows, and use other regularization techniques to control overfitting
What about the rule of thumb stating that you should have roughly 510 times as many data points as weights in order to not overfit?

 Apr 2017

www.tensorflow.org www.tensorflow.org

If we write that out as equations, we get:
It would be easier to understand what are x and y and W here if the actual numbers were used, like 784, 10, 55000, etc. In this simple example there are 3 x and 3 y, which is misleading. In reality there are 784 x elements (for each pixel) and 55,000 such x arrays and only 10 y elements (for each digit) and then 55,000 of them.

 Nov 2016

roachsinai.github.io roachsinai.github.io

Softmax分类器所做的就是最小化在估计分类概率（就是 Li=efyi/∑jefjLi=efyi/∑jefjL_i =e^{f_{y_i}}/\sum_je^{f_j}）和“真实”分布之间的交叉熵.
而这样的好处，就是如果样本误分的话，就会有一个非常大的梯度。而如果使用逻辑回归误分的越严重，算法收敛越慢。比如，\(t_i=1\) 而 \(y_i=0.0000001\)，cost function 为 \(E=\frac{1}{2}(ty)^2\) 那么，\(\frac{dE}{dw_i}=(ty)y(1y)x_i\).

 Jan 2016

thinkingmachines.mit.edu thinkingmachines.mit.edu

n and d
What do n and d mean?

 Jul 2015

neuralnetworksanddeeplearning.com neuralnetworksanddeeplearning.com

Neural Networks and Deep Learning is a free online book. The book will teach you about: Neural networks, a beautiful biologicallyinspired programming paradigm which enables a computer to learn from observational data Deep learning, a powerful set of techniques for learning in neural networks


Tags
Annotators
URL

 Jun 2015

www.technologyreview.com www.technologyreview.com

Enter the Daily Mail website, MailOnline, and CNN online. These sites display news stories with the main points of the story displayed as bullet points that are written independently of the text. “Of key importance is that these summary points are abstractive and do not simply copy sentences from the documents,” say Hermann and co.
Someday, maybe projects like Hypothesis will help teach computers to read, too.
