Hypothesis

12 Matching Annotations

Dec 2016
colah.github.io colah.github.io

Neural Networks, Manifolds, and Topology -- colah's blog

2
1. RoachSinai 20 Dec 2016
  
  in Public
  
  Either each layer is a homeomorphism, or the layer’s weight matrix has determinant 0. If it is a homemorphism, AAA is still surrounded by BBB, and a line can’t separate them. But suppose it has a determinant of 0: then the dataset gets collapsed on some axis. Since we’re dealing with something homeomorphic to the original dataset, AAA is surrounded by BBB, and collapsing on any axis means we will have some points of AAA and BBB mix and become impossible to distinguish between.
  
  拿着样是不是神经网络每层神经元的个数都是一个：第一个隐层神经元个数最多，然后依次下降？并且第一层最多。是这样么？
  
  神经网络
2. RoachSinai 20 Dec 2016
  
  in Public
  
  A linear transformation by the “weight” matrix WWW A translation by the vector bbb Point-wise application of tanh.
  
  先进行线性变换（拉伸、旋转、平移），然后用非线性激活函数实现：线性不可分->线性可分！
  
  神经网络
Visit annotations in context

Tags

神经网络

Annotators

RoachSinai

URL

colah.github.io/posts/2014-03-NN-Manifolds-Topology/
Nov 2016
www.kuanshi.me www.kuanshi.me

精油的使用方法如何用精油去黑头_保龄肤精油是什么方 - pony的美瞳是什么款式

1
1. RoachSinai 22 Nov 2016
  
  in Public
  
  1.和其他方法一样，精油去黑头也是要先清洁的；然后打开毛孔，一般我们用热毛巾敷脸就行了，如果有美容院的蒸面器，就更好了。2.可以开始了，在鼻子上涂抹精油，然后轻轻按摩至少十分钟，这样才能让肌肤充分吸收精油。3.最后用清水洗净，然后拍上爽肤水，防止毛孔粗大。
  
  精油去黑头。
Visit annotations in context

Annotators

RoachSinai

URL

kuanshi.me/精油的使用方法-如何用精油去黑头.html
bbs.pinggu.org bbs.pinggu.org

什么是统计量的size和power？ - 计量经济学与统计软件 - 经管之家(原人大经济论坛)

1
1. RoachSinai 20 Nov 2016
  
  in Public
  
  什么是统计量的size和power？ size是指size of the test，就是置信水平（1 - 阿尔法）里面的那个“阿尔法”，又称“检验水平”。 power是指power of test statistic，是统计量的“统计检验力”。在有限样本时，即使当N和T分别小于50和2时，该检验统计量仍然拥有合理的size, 特别当T大于等于10时，该检验拥有良好的power.——这句话怎样理解？ "合理的size就是能够满足合理的置信水平条件，也就是犯I类错误的概率很低。良好的power是指犯II类错误的概率很低，也就是H0为假时拒绝H0的概率很高。"
  
  在试验样本确定的情况下，$\alpha$越小，$\beta$就越大。
  
  Statistic
Visit annotations in context

Tags

Statistic

Annotators

RoachSinai

URL

bbs.pinggu.org/thread-944750-1-1.html
roachsinai.github.io roachsinai.github.io

广义线性模型与Softmax - Roach's Blog

1
1. RoachSinai 07 Nov 2016
  
  in Public
  
  Softmax分类器所做的就是最小化在估计分类概率（就是 Li=efyi/∑jefjLi=efyi/∑jefjL_i =e^{f_{y_i}}/\sum_je^{f_j}）和“真实”分布之间的交叉熵.
  
  而这样的好处，就是如果样本误分的话，就会有一个非常大的梯度。而如果使用逻辑回归误分的越严重，算法收敛越慢。比如，$t_i=1$ 而 $y_i=0.0000001$，cost function 为 $E=\frac{1}{2}(t-y)^2$ 那么，$\frac{dE}{dw_i}=-(t-y)y(1-y)x_i$.
  
  neural-networks
Visit annotations in context

Tags

neural-networks

Annotators

RoachSinai

URL

roachsinai.github.io/2016/05/16/1Softmax_GLM/
Jul 2016
en.wikipedia.org en.wikipedia.org

Ensemble learning - Wikipedia, the free encyclopedia

1
1. RoachSinai 15 Jul 2016
  
  in Public
  
  following equation
  
  $$ y={argmax} _{c_{j}\in C}\sum _{h_{i}\in H}{P(c_{j}|h_{i})P(T|h_{i})P(h_{i})} $$
  
  $$ ={argmax} _{c_{j}\in C}\sum _{h_{i}\in H}{P(c_{j}|h_{i})P(T,h_{i})} $$
  
  $$= {argmax}_{c_{j}\in C}\sum _{h_{i}\in H}{P(c_{j}|h_{i})P(h_{i}|T)}$$
  
  \propto doesn't work well.
  
  Bayes optimal
Visit annotations in context

Tags

Bayes optimal

Annotators

RoachSinai

URL

en.wikipedia.org/wiki/Ensemble_learning
cs231n.github.io cs231n.github.io

CS231n Convolutional Neural Networks for Visual Recognition

1
1. RoachSinai 05 Jul 2016
  
  in Public
  
  For example, a large gradient flowing through a ReLU neuron could cause the weights to update in such a way that the neuron will never activate on any datapoint again.
  
  ReLU函数在输入(wx)大于0的情况下激活。如果在后向传播的过程中，ReLU Unit 接收了一个大的梯度，使得某些w变成较大的负数。那么可能导致这个单元之后的输出都是0，它后向传播的梯度值也是0.
Visit annotations in context

Annotators

RoachSinai

URL

cs231n.github.io/neural-networks-1/
zhuanlan.zhihu.com zhuanlan.zhihu.com

CS231n课程笔记翻译：神经网络笔记1（上） - 智能单元 - 知乎专栏

1
1. RoachSinai 05 Jul 2016
  
  in Public
  
  z字型的下降
  
  锯齿形状的突变而非缓慢变化。
Visit annotations in context

Annotators

RoachSinai

URL

zhuanlan.zhihu.com/p/21462488
Jun 2016
cs231n.github.io cs231n.github.io

CS231n Convolutional Neural Networks for Visual Recognition

2
1. RoachSinai 30 Jun 2016
  
  in Public
  
  Backpropagation can thus be thought of as gates communicating to each other (through the gradient signal) whether they want their outputs to increase or decrease (and how strongly), so as to make the final output value higher.
  
  后向传播可以视为门单元之间的通信，只要各单元值随着梯度信号方向变化（单元局部梯度为负，单元输入值就降低，反之，增加），神经网络最后的输出值就会增加。
2. RoachSinai 30 Jun 2016
  
  in Public
  
  The derivative on each variable tells you the sensitivity of the whole expression on its value.
  
  函数对某变量的偏导表明该变量的变化对函数变化的影响。
Visit annotations in context

Annotators

RoachSinai

URL

cs231n.github.io/optimization-2/
jeremykun.com jeremykun.com

Probably Approximately Correct — a Formal Theory of Learning

2
1. RoachSinai 28 Jun 2016
  
  in Public
  
  fine solution is .
  
  So, if samples is enough, PAC-learnable, machine could learn!
  
  PAC theory
2. RoachSinai 28 Jun 2016
  
  in Public
  
  concept class
  
  Concept class(maybe, version space) is a set of concepts, which satisfy the map between given samples and their corresponding given labels(i.e. target concept).
  
  Means concept is just a mapping function. But, every concept belong to concept class is a target class.
  
  PAC theory
Visit annotations in context

Tags

PAC theory

Annotators

RoachSinai

URL

jeremykun.com/2014/01/02/probably-approximately-correct-a-formal-theory-of-learning/

Tags

Annotators

URL

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Tags

Annotators

URL