 Jun 2017

arxiv.org

adding these computational blocks never hurts performance, even on tasks they were not designed for
It is interesting that domain-specific mechanisms do not hurt performance on other domains. I wonder how performance would change if each mechanism were applied on its own to cross-domain tasks.

 May 2017

arxiv.org

reset to their random initial values
Are the values reset to the original random initial values, or to new random values?

A maximum of N distinct modules per layer are permitted in a pathway (typically N = 3 or 4).
Why would this be beneficial? Obviously this limit prevents pathways from using entire layers, but how does that help the agents find a suitable pathway? Does this make the agent less localized?
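
A minimal sketch of how such a cap could work, assuming a pathway is encoded as N module indices drawn per layer, so that duplicates collapse and each layer ends up with between 1 and N distinct modules. The function name, the layer and module counts, and the seed are illustrative assumptions, not taken from the paper's code.

```python
import random

def sample_pathway(num_layers, modules_per_layer, max_active, rng):
    """Draw `max_active` module indices per layer; duplicate draws collapse."""
    pathway = []
    for _ in range(num_layers):
        picks = {rng.randrange(modules_per_layer) for _ in range(max_active)}
        pathway.append(sorted(picks))
    return pathway

rng = random.Random(0)  # fixed seed so the sketch is repeatable
pathway = sample_pathway(num_layers=3, modules_per_layer=10, max_active=3, rng=rng)
for layer_index, active in enumerate(pathway):
    print(f"layer {layer_index}: active modules {active}")
```

Because a pathway can never cover a whole layer, several pathways can coexist in the same network while touching mostly different parameters.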

Async Advantage Actor-Critic (A3C)

Agents are pathways (views) through the network which determine the subset of parameters that are used and updated by the forwards and backwards passes of the backpropagation algorithm
Agents find pathways in the giant neural network that would be useful for other tasks.

 Mar 2017

www.fractal.org

many natural phenomena are better described using a dimension between two whole numbers
This is a nice definition of non-integer dimension.
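
One way to make a dimension "between two whole numbers" concrete is the similarity dimension, D = log(copies) / log(scale factor). The sketch below assumes that definition and uses the Koch curve (4 copies, each 1/3 the size) as the example; neither appears in the quoted text.

```python
import math

def similarity_dimension(copies, scale_factor):
    """D = log(copies) / log(scale_factor) for an exactly self-similar shape."""
    return math.log(copies) / math.log(scale_factor)

line = similarity_dimension(2, 2)    # a segment splits into 2 halves
square = similarity_dimension(4, 2)  # a square splits into 4 half-size squares
koch = similarity_dimension(4, 3)    # Koch curve: 4 copies at 1/3 scale
print(f"line={line:.3f} square={square:.3f} koch={koch:.3f}")
```

The Koch curve lands strictly between the line and the square, which is the sense in which its dimension is non-integer.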

If you look carefully at a fern leaf, you will notice that every little leaf (part of the bigger one) has the same shape as the whole fern leaf. You can say that the fern leaf is self-similar. The same is with fractals: you can magnify them many times and after every step you will see the same shape, which is characteristic of that particular fractal.
This is a nice example of self-similarity.


www.nature.com

This probability, the probability that a score at least as large as the observed score would occur in data drawn according to the null hypothesis, is called the P-value.
A good description of the p-value.
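
As a small worked case of that definition, assume the null hypothesis is a fair coin and the "score" is the number of heads. The p-value is then the probability of seeing at least the observed number of heads. The coin example and the function name are illustrative assumptions.

```python
from math import comb

def upper_tail_p_value(observed_heads, flips):
    """P(at least `observed_heads` heads in `flips` fair coin flips)."""
    favourable = sum(comb(flips, k) for k in range(observed_heads, flips + 1))
    return favourable / 2 ** flips

p = upper_tail_p_value(observed_heads=8, flips=10)
print(f"P(score >= 8 | null) = {p:.4f}")
```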

 Feb 2017

Local file

Like families, tidy datasets are all alike but every messy dataset is messy in its own way.
There are many ways to do something wrong, but only one way to do it right! When adhering to a standard, there is only one way that you do it correctly, but many ways to do it incorrectly.


bmcgenomics.biomedcentral.com

https://sourceforge.net/projects/aafphylogeny/
The code is now kept up to date on GitHub at this repository.


goldenhelix.com

Very large sample sizes may be required to achieve such significance levels, especially for rare disease alleles and alleles with small effect sizes
The irony is that large sample sizes are needed for rare disease alleles and for alleles with small effect sizes.

Manhattan Plot
It is helpful to note what the axes of a Manhattan Plot represent. As stated, the x-axis is the locus (location of a nucleotide) of the SNP in the genome, and the y-axis is the negative base-10 logarithm of the p-value. This means that the highest points in the Manhattan Plot have the lowest p-values, and are therefore the most statistically significant SNPs.
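
A short sketch of the y-axis transform, using made-up SNP records (the chromosome names, positions, and p-values below are hypothetical). It shows that the smallest p-value becomes the tallest point.

```python
import math

# Hypothetical (chromosome, position, p-value) records, made up for illustration.
snps = [
    ("chr1", 1_204_500, 0.04),
    ("chr1", 5_880_120, 3e-9),
    ("chr2", 742_000, 0.5),
]

def manhattan_height(p_value):
    """y-coordinate of a SNP on a Manhattan plot: -log10(p)."""
    return -math.log10(p_value)

heights = [manhattan_height(p) for _, _, p in snps]
tallest = snps[heights.index(max(heights))]
print("tallest point:", tallest)
```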

GLM
General Linear Model (GLM): a statistical linear model that generalizes multiple linear regression to allow multiple dependent variables.

ANOVA
ANOVA compares the means of one measurement variable across the groups of one nominal variable. In this case the measurement variable would be the genotype of an individual, and the nominal variable is whether that individual is a case or a control.

consanguinity
Consanguinity is the property of being related to someone, in essence having the same ancestor as someone else.

The specific set of alleles observed together on a single chromosome, or part of a chromosome, is called a haplotype.
This is a good definition of a haplotype.


bmcbioinformatics.biomedcentral.com

Method
It seems that no weighting scheme/method consistently performs best for every organism and every validation criterion, except perhaps SIMATN.

pLSA performances are always improved by the NTN schema
This is not the case for the Bos taurus or the Danio rerio cmp groups.

APrate
APrate: Annotation Predicted, equivalent to a false positive.

ACrate
ACrate: Annotation Confirmed, equivalent to a true positive.

Receiver Operating Characteristic (ROC) curves
Receiver Operating Characteristic (ROC) curves: these curves plot the true positive rate against the false positive rate as the decision threshold varies.
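
A minimal sketch of how the points on such a curve can be computed: sort predictions by score and lower the threshold one prediction at a time. The scores and labels are made up, and the sketch assumes all scores are distinct and both classes are present.

```python
def roc_points(scores, labels):
    """Return (false positive rate, true positive rate) pairs, one per threshold."""
    positives = sum(labels)
    negatives = len(labels) - positives
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    true_pos = false_pos = 0
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    for _, label in ranked:
        if label:
            true_pos += 1
        else:
            false_pos += 1
        points.append((false_pos / negatives, true_pos / positives))
    return points

# Hypothetical prediction scores and ground-truth labels (1 = confirmed annotation).
points = roc_points(scores=[0.9, 0.8, 0.7, 0.6], labels=[1, 0, 1, 0])
print(points)
```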

−102,118
Why were there 102,118 fewer annotations after 4 years?

the new values for P(ft) as:
missing equation

we can interpret each of those vectors as multinomial distributions of probabilities over the set of topics
I think that this is very similar to a softmax layer in a neural network.
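
To illustrate the comparison, the sketch below puts plain normalization of non-negative weights next to a softmax over arbitrary scores. Both yield non-negative entries that sum to 1, so both can be read as a distribution over topics. The function names and input values are illustrative assumptions, not the paper's procedure.

```python
import math

def normalize(weights):
    """Scale non-negative weights so they sum to 1."""
    total = sum(weights)
    return [w / total for w in weights]

def softmax(scores):
    """Exponentiate and normalize; subtracting the max avoids overflow."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

topic_weights = normalize([1, 1, 2])
topic_softmax = softmax([1.0, 2.0, 3.0])
print(topic_weights, topic_softmax)
```

The difference is that softmax accepts negative scores and exaggerates gaps between them, while normalization keeps the original proportions.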

overcome this issue, by adding a gene clustering step and defining a specific model for each cluster,
I still see a limitation with a bias towards genes that have few or no annotations. How can you cluster based on annotation terms if it has no annotation terms? What about if there are only a few terms for a gene?

orthonormal
Orthonormal matrix: a square matrix whose rows and columns are orthogonal unit vectors.
Orthogonal: two vectors are orthogonal when they are perpendicular.
Unit vector: a vector of length one.
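
These definitions can be checked directly: every row must have a dot product of 1 with itself and 0 with every other row. The sketch assumes a square matrix given as a list of rows; the helper name and the example matrices are illustrative.

```python
import math

def is_orthonormal(matrix, tol=1e-9):
    """True if the rows of a square matrix are mutually orthogonal unit vectors."""
    size = len(matrix)
    for i in range(size):
        for j in range(size):
            dot = sum(matrix[i][k] * matrix[j][k] for k in range(size))
            expected = 1.0 if i == j else 0.0
            if abs(dot - expected) > tol:
                return False
    return True

angle = math.pi / 6
rotation = [[math.cos(angle), -math.sin(angle)],
            [math.sin(angle), math.cos(angle)]]
shear = [[1.0, 1.0],
         [0.0, 1.0]]
print(is_orthonormal(rotation), is_orthonormal(shear))
```

For a square matrix, orthonormal rows imply orthonormal columns, so checking the rows is enough.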

For each function term f it provides an estimation of the importance of an annotation to that term, decreasing the relevance of the annotations to common terms, such as the ones close to the ontology root
The inverse gene frequency (IGF) function quantifies how influential a term is based on how rarely it is annotated: common terms, such as those close to the ontology root, get less weight.
Basically, the terms that are more specific are more important than the general terms.
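
A rough sketch of the idea, assuming an IDF-style form, log(total genes / genes annotated to the term). The paper's exact formula may differ; the gene names, term names, and function name below are made up for illustration.

```python
import math

# Hypothetical gene-to-term annotations, made up for illustration.
annotations = {
    "geneA": {"root", "metabolism", "kinase activity"},
    "geneB": {"root", "metabolism"},
    "geneC": {"root"},
    "geneD": {"root", "metabolism"},
}

def inverse_gene_frequency(term, annotations):
    """Assumed IDF-style weight: log(total genes / genes annotated to `term`)."""
    annotated = sum(1 for terms in annotations.values() if term in terms)
    return math.log(len(annotations) / annotated)

weights = {
    term: inverse_gene_frequency(term, annotations)
    for term in ("root", "metabolism", "kinase activity")
}
print(weights)
```

A term annotated to every gene gets weight 0, and the weight grows as the term becomes rarer.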

the relevance of a function term for a given gene is proportional to the number of descendants of that term that are annotated to the gene and (b) if a term is rare (i.e. it is annotated only to a small subset of G), it is a better discriminator among the set of genes than common function terms
This is the basis for the weighted matrix:
 The relevance of a function term to a gene increases with the number of its descendant terms annotated to that gene.
 The rarer a term is, the better it discriminates among genes.

 Jan 2017

www.biostathandbook.com

There are three things you can do with this kind of data.
 Hypothesis test: use a t-test or something similar to see if your hypothesis is supported by the data.
 Find how tightly the variables are associated: calculate r. The higher the r, the stronger the relationship between the two variables (i.e. a large r means that one variable can be accurately predicted from the other).
 Determine the equation of a line that generalizes the data, allowing for predictions given only one variable.
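
The second and third items above can be sketched with a few lines of arithmetic: Pearson's r measures the strength of association, and the least-squares line gives the prediction equation. The speed and pulse values are made up for illustration, and the function name is an assumption.

```python
import math

def pearson_r_and_line(xs, ys):
    """Return (r, slope, intercept) for the least-squares line y = slope*x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    r = sxy / math.sqrt(sxx * syy)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return r, slope, intercept

# Hypothetical measurements: exercise speed vs. pulse rate.
speeds = [0, 2, 4, 6, 8]
pulses = [58, 70, 79, 93, 101]
r, slope, intercept = pearson_r_and_line(speeds, pulses)
predicted_pulse = slope * 5 + intercept  # prediction for an unmeasured speed
print(f"r={r:.3f}, pulse = {slope:.2f} * speed + {intercept:.2f}")
```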

Graph of my pulse rate vs. speed on an elliptical exercise machine.
I believe that the labels for the x- and y-axes should be switched.


neuralnetworksanddeeplearning.com

Their most successful network had hidden layers containing 2,500, 2,000, 1,500, 1,000, and 500 neurons, respectively. They used ideas similar to Simard et al. to expand their training data. But apart from that, they used few other tricks, including no convolutional layers: it was a plain, vanilla network, of the kind that, with enough patience, could have been trained in the 1980s
I find it interesting that even simple neural networks can achieve good results. However, I'm not sure this network counts as simple: with so many parameters, it is hardly a simple function.
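
To put a number on "so many parameters", the sketch below counts weights and biases for a fully connected network with the quoted hidden layers. The 784 inputs and 10 outputs are an assumption (28x28 MNIST images and ten digit classes); the quoted passage gives only the hidden layer sizes.

```python
def count_parameters(layer_sizes):
    """Weights plus biases for a fully connected network with these layer sizes."""
    total = 0
    for fan_in, fan_out in zip(layer_sizes, layer_sizes[1:]):
        total += fan_in * fan_out  # weight matrix
        total += fan_out           # one bias per neuron
    return total

# Assumed input/output sizes (784, 10) around the quoted hidden layers.
layer_sizes = [784, 2500, 2000, 1500, 1000, 500, 10]
total_parameters = count_parameters(layer_sizes)
print(f"{total_parameters:,} trainable parameters")
```

Under those assumptions the network has roughly 12 million trainable parameters, so "plain" describes the architecture rather than the size of the function.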


www.biostathandbook.com

Personally, I don't see how treating values of a Likert item as a measurement variable will cause any statistical problems.
I agree that there wouldn't be any problems, but one issue could be in comparing studies that use different intervals for their Likert items.

Converting measurement variables to nominal variables
This is an important concept in Data Science: it lets you coarsen the data to get a high-level picture of it. It is important to keep the original measured values so that you can do more precise testing later.

You might plot 52.3% on a graph as a simple way of summarizing the data, but you should use the 34 female and 31 male numbers in all statistical tests.
The best way to display data can differ fundamentally from the right way to analyze it. Keep in mind to use raw values, not percentages, averages, etc., for statistical tests.
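
A quick sketch of why this matters, using a chi-square goodness-of-fit test against an even split. The choice of test is an assumption made for illustration; the quoted passage does not name one. Feeding in the percentages silently treats the sample size as 100 instead of 65 and changes the statistic.

```python
import math

def chi_square_equal_split(observed):
    """Chi-square statistic against the null of equal expected values."""
    expected = sum(observed) / len(observed)
    return sum((o - expected) ** 2 / expected for o in observed)

def p_value_one_df(chi_square):
    """Upper-tail probability of a chi-square statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi_square / 2))

from_counts = chi_square_equal_split([34, 31])           # correct: raw counts
from_percentages = chi_square_equal_split([52.3, 47.7])  # wrong: pretends n = 100
print(f"counts: chi2={from_counts:.4f}, p={p_value_one_df(from_counts):.3f}")
print(f"percentages: chi2={from_percentages:.4f}, p={p_value_one_df(from_percentages):.3f}")
```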


genome.cshlp.org

orthologous groups
Orthologous groups are groups of genes that descended from a common ancestral gene. Orthologous genes typically have the same (or similar) function, but may vary in sequence.
