 Jan 2019

awspntest.apa.org awspntest.apa.org

the strongest first factor accounted for 86.3% of observed variable variance
I suspect that this factor was so strong because it consisted of only four observed variables, and three of them were written measures of verbal content. All of the verbal cariables correlated r = .72 to .89. Even the "nonverbal" variable (numerical ability) correlates r = .72 to .81 with the other three variables (Rehna & Hanif, 2017, p. 25). Given these strong correlations, a very strong first factor is almost inevitable.

The weakest first factor accounted for 18.3% of variance
This factor may be weak because the sample consists of Sudanese gifted children, which may have restricted the range of correlations in the dataset.

 Dec 2018

www.anthropicprinciple.com www.anthropicprinciple.com

The Doomsday argument
The Doomsday argument (DA) is a probabilistic argument that claims to predict the number of future members of the human species given only an estimate of the total number of humans born so far. Simply put, it says that supposing that all humans are born in a random order, chances are that any one human is born roughly in the middle.
From Wikipedia, Doomsday argument
Tags
Annotators
URL

 Nov 2018

www.coursera.org www.coursera.org

Basic Statistics
part of specialization
Tags
Annotators
URL


www.insidehighered.com www.insidehighered.com

Online Options Give Adults Access, but Outcomes Lag
In this article, drivers that increase and improve online learning success in adults are explored. State by state data along with federal stats contribute to the conclusions presented.
Roughly 13% of all undergraduates are fulltime online students and between 2012 and 2017 online students grew y 11 percent, about 2.25 million. The article presents a map showing state by state stats and the information provided can assist in growing individual school needs.
RATING: 4/5 (rating based upon a score system 1 to 5, 1= lowest 5=highest in terms of content, veracity, easiness of use etc.)

 Sep 2018

stackoverflow.com stackoverflow.com

Doublecentering a matrix M


stats.stackexchange.com stats.stackexchange.com

Relationship between ridge regression and PCA regression


Local file Local file

Lack of metrics
Develop or strengthen statistical information of new industry developments, ups and downs of existing industries.

LDLC workers and their needs
Develop or strengthen statistics regarding LDLC workers and their needs


www.statisticssolutions.com www.statisticssolutions.com

predictive analysis
Predictive analytics encompasses a variety of statistical techniques from data mining, predictive modelling, and machine learning, that analyze current and historical facts to make predictions about future or otherwise unknown events.

 Apr 2018

www.madinamerica.com www.madinamerica.com

Excellent example of badly used statistics.

 Mar 2018

www.rbloggers.com www.rbloggers.com

Boxplots and Beyond – Part II: Asymmetry


eurekastatistics.com eurekastatistics.com

Using the Median Absolute Deviation to Find Outliers


www.ncbi.nlm.nih.gov www.ncbi.nlm.nih.gov

Power analysis and sample size estimation for RNASeq differential expression
Tags
Annotators
URL


www.ncbi.nlm.nih.gov www.ncbi.nlm.nih.gov

Polyester: simulating RNAseq datasets with differential transcript expression

 Feb 2018

hypothes.is hypothes.is

My daughter will be brought up to understand her true value. That’s a promise. As for all the little girls to be born around the world, the creation of these ads is an effort to show how imagination can change the conversation around their lives.


achri.blogspot.com achri.blogspot.com

Combining data from multiple RNASeq experiments: release the Kruskal! (...Wallis test)

 Jan 2018

www.laurenbcollister.com www.laurenbcollister.com

ThiswasthenumberofpagesbeingsearchedbyGooglewhenwewereputtingthebooktogether.HavealookatthefigurepublishedonthebottomoftheGooglehomepagetoseewhatitisnow.
This cute service is no longer supported. However, at the end of 2016, the estimate was over 130 trillion pages.

 Dec 2017

www.sciencedirect.com www.sciencedirect.com

Neural correlates of interspecies perspective taking in the postmortem Atlantic Salmon: an argument for multiple comparisons correction


rsos.royalsocietypublishing.org rsos.royalsocietypublishing.org

The natural selection of bad science

 Nov 2017

rafalab.github.io rafalab.github.ioharvardx1

HarvardX Biomedical Data Science Open Online Training


www.sciencedirect.com www.sciencedirect.com

pairwise overlaps using Fisher’s test and mutual exclusion (Leiserson et al., 2016xA weighted exact test for mutually exclusive mutations in cancer. Leiserson, M.D.M., Reyna, M.A., and Raphael, B.J. Bioinformatics. 2016; 32: i736–i745Crossref  PubMed  Scopus (4)See all ReferencesLeiserson et al., 2016)


www.ncbi.nlm.nih.gov www.ncbi.nlm.nih.gov

Gene Set Enrichment Analysis Made Simple
using aggregate t or chi^2 statistic to test if a set of genes is on aggregate differentially expressed

 Oct 2017

www.restore.ac.uk www.restore.ac.uk

An Introduction to Odds, Odds Ratios and Exponents


tedunderwood.com tedunderwood.com

One of the main ways computers are changing the textual humanities is by mediating new connections to social science. The statistical models that help sociologists understand social stratification and social change haven’t in the past contributed much to the humanities, because it’s been difficult to connect quantitative models to the richer, looser sort of evidence provided by written documents.
DH as moving English more toward the statistical...

 May 2017

fivethirtyeight.com fivethirtyeight.com

Analysis of a subreddit for Trump supporters, based on comparisons of the users of various subreddits.


static1.squarespace.com static1.squarespace.com

s. If we want to understand the effects of global warming or whether the economy is headed for a recessio
The class on The Rhetorical Situation brought up discussion on the evolving notion of "weather" as a changeable, even rhetorical, thing. Moving to integrate database and narrative as symbionts makes a connection between data and delivery/appeal.

 Apr 2017

bangordailynews.com bangordailynews.com

The annual drop in Maine wood demand since 2014 would fill that imaginary 1,770mile caravan. The loss equals about 350 fewer truckloads of wood a day, every day of the year.

 Mar 2017

bangordailynews.com bangordailynews.com

The state has pumped more than 100 million pounds of low bush fruit into the frozen market each year for the last three growing cycles.

A typical acre of blueberry barrens will yield about 2,000 to 4,000 pounds of berries, depending on pollination and other factors.


tachesdesens.blogspot.com tachesdesens.blogspot.com

I never regret the eleven months which hardened my resolve, to go beyond 98 'Nos' to get to the precious, unexpected 'Yes's'. I was nobody, I was selling nothing, I could be nobody selling anything.
Numbers
Statistics
Alienation

 Feb 2017

static1.squarespace.com static1.squarespace.com

That two dice marked in the common way will tum up seven, is thrice as probable as that they will tum up eleven, and six times as probable as that they will tum up twelve
D&D has made me embarrassingly good at estimating probable outcomes of platonic die in my head.

In moral reasoning we ascend from pos~ibility, by an insensible gradation, to probability, and thence, in the same manner, to the summit of moral certainty.
I believe Campbell addresses some of the uncertainty of Inductive Reasoning here. The phrase "insensible gradation" seems meaningfulhow we go from a possibility to moral certainty is something fundamentally difficult in a manner Hume cannot accept. But Campbell explains in this section many of the difficulties of this, and how it's still useable, for moral judgment.
On the same side, I come back to Bayesian Probabilities, wondering if Campbell knew about them, and how they transfer statistical, mathematical knowledge towards determining if a hypothesis is true. Once again, I'm hesitant that I'll exceed my grasp of stats if I talk to much about it, though.

The course of nature will be the same tomorrow that it is today; or, the future will resemble the past"
Apparently, this is a surprisingly successful rationale for meteorology. If you just assume "tomorrow's weather will resemble today's," you'll end up more right than not, and can actually beat some meteorologists. Then again, Jim Flowers and the KMTV AccuWeather Forecast might have just been terrible.

 Oct 2016

www.businessinsider.com www.businessinsider.com

With figures like those, it's clear that the education system isn't going away anytime soon.
How so?

 Sep 2016

www.thelocal.se www.thelocal.se

According to the language periodical Språktidningen, ‘hen’ was by 2014 used once in the Swedish media for every 300 used of ‘hon’ or ‘han’, up from one in every 13,000 in 2011
Increasing rate of usage of hen vs. hon or han: 1/13,000 in 2011; 1/300 in 2014.

 May 2016

www.propublica.org www.propublica.org

the algorithm was somewhat more accurate than a coin flip
In machine learning it's also important to evaluate not just against random, but against how well other methods (e.g. parole boards) do. That kind of analysis would be nice to see.


medium.com medium.com

A study of housing cost in San Francisco from the 1950s to 2016.

 Mar 2016

amstat.tandfonline.com amstat.tandfonline.com

American Statistical Association statement on pvalues

 Feb 2016


3,068 adults in August 2014, found that 72 percent of Americans reported feeling stressed about money at least some of the time during the past month. Twentytwo percent said that they experienced extreme stress about money during the past month (an 8, 9 or 10 on a 10point scale, where 1 is “little or no stress” and 10 is “a great deal of stress”). For the majority of Americans (64 percent), money is a somewhat or very significant source of stress, but especially for parents and younger adults (77 percent of parents, 75 percent of millennials [18 to 35 years old] and 76 percent of Gen Xers [36 to 49 years old]).
Along the lines of the first paragraph except putting some percentages into it. Almost three quarters of Americans (out of a 3,000 person survey) feels some kind of "extreme stress about money" each month, the majority coming from parents, adults and young adults (1835). I'll incorporate this into my paper by using statistics to show how money is a huge reason for stress in adults.


bangordailynews.com bangordailynews.com

He expects that the logging project near Quimby’s land will likely generate about $755,250 at the state’s average sale price, $50.35 per cord of wood. The land has about 1,500 harvestable acres that contain about 30 cords of wood per acre, or 45,000 cords, but only about a third of that will be cut because the land is environmentally sensitive, Denico said. The Bureau of Parks and Lands expects to generate about $6.6 million in revenue this year selling about 130,000 cords of wood from its lots, Denico said. Last year, the bureau generated about $7 million harvesting about 139,000 cords of wood. The Legislature allows the cutting of about 160,000 cords of wood on state land annually, although the LePage administration has sought to increase that amount.


www.psychologytoday.com www.psychologytoday.com

From 1926 until the early 1950s, US military aircraft relied on a "one size fits all" design based on average measurements of hundreds of male pilots.
But a 1950 study by Lt. Gilbert Daniels showed that out of 4,063 airmen, not even one was average in all ten measurements. They started designing cockpits and controls to be adjustable. Accidents decreased, and pilot performance increased.
Standardized education makes the same mistake.

The science of the individual relies on dynamic systems theory rather than group statistics. Its research methodology is characterized by “analyze, then aggregate” (analyze each subject separately, then combine individual patterns into collective understanding) rather than “aggregate, then analyze” (derive group statistics based on aggregate data, then use these statistics to evaluate and understand individuals).
A mathematical psychologist at Penn State University, Molenaar extended ergodic theory (link is external) to prove that it was not mathematically permissible to use assessment instruments based on group averages to evaluate individuals.
A Manifesto on Psychology as Idiographic Science, Peter Molenaar


leanpub.com leanpub.com

Books on data science and R programming by Roger D. Peng of Johns Hopkins.


blog.cloudera.com blog.cloudera.com

Great explanation of 15 common probability distributions: Bernouli, Uniform, Binomial, Geometric, Negative Binomial, Exponential, Weibull, Hypergeometric, Poisson, Normal, Log Normal, Student's t, ChiSquared, Gamma, Beta.

 Jan 2016

courses.csail.mit.edu courses.csail.mit.edu

50 Years of Data Science, David Donoho<br> 2015, 41 pages
This paper reviews some ingredients of the current "Data Science moment", including recent commentary about data science in the popular media, and about how/whether Data Science is really different from Statistics.
The nowcontemplated field of Data Science amounts to a superset of the fields of statistics and machine learning which adds some technology for 'scaling up' to 'big data'.


blogs.scientificamerican.com blogs.scientificamerican.com

P(BE) = P(B) X P(EB) / P(E), with P standing for probability, B for belief and E for evidence. P(B) is the probability that B is true, and P(E) is the probability that E is true. P(BE) means the probability of B if E is true, and P(EB) is the probability of E if B is true.

The probability that a belief is true given new evidence equals the probability that the belief is true regardless of that evidence times the probability that the evidence is true given that the belief is true divided by the probability that the evidence is true regardless of whether the belief is true. Got that?

Initial belief plus new evidence = new and improved belief.



This criterion is not based on any specific shape of the doseresponse relationship.
I would expect that the relationship must be monotonic to support the causal hypothesis.


phys.org phys.org

paradox of unanimity  Unanimous or nearly unanimous agreement doesn't always indicate the correct answer. If agreement is unlikely, it indicates a problem with the system.
Witnesses who only saw a suspect for a moment are not likely to be able to pick them out of a lineup accurately. If several witnesses all pick the same suspect, you should be suspicious that bias is at work. Perhaps these witnesses were cherrypicked, or they were somehow encouraged to choose a particular suspect.


rpy2.readthedocs.org rpy2.readthedocs.org

Python interface to the R programming language.<br> Use R functions and packages from Python.<br> https://pypi.python.org/pypi/rpy2

 Oct 2015

cms.whittier.edu cms.whittier.edu

In 1930 its population was 112,000. Today it is 36,000. The halcyon talk of “interracial living” is dead. The neighborhood is 92 percent black. Its homicide rate is 45 per 100,000—triple the rate of the city as a whole. The infantmortality rate is 14 per 1,000—more than twice the national average.
These are some intense statistics.. It'd be interesting to compare them to other cities in the area..

 Aug 2015

www.vox.com www.vox.com

What's really being measured is heterogeneity of opinion, not centrism.

 Jul 2015

www.eblida.org www.eblida.org

European Bureau of Libraries in Europe Public libraries statistics


localhost:8080 localhost:8080

analyses
another, with tag

 Feb 2015

en.wikipedia.org en.wikipedia.org

The use of the term n − 1 is called Bessel's correction, and it is also used in sample covariance and the sample standard deviation (the square root of variance)
Why in \(\sigma^2\) is not equal to \(s^2\)

Sample variance can also be applied to the estimation of the variance of a continuous distribution from a sample of that distribution.


www.emathzone.com www.emathzone.com

Suppose the value of for wages is 10% and the values of for kilograms of meat is 25%. This means that the wages of workers are consistent. Their wages are close to the overall average of their wages. But the families consume meat in quite different quantities. Some families use very small quantities of meat and some others use large quantities of meat. We say that there is greater variation in their consumption of meat. The observations about the quantity of meat are more dispersed or more variant.
Interpretation of Relative Deviation Coefficient

 Nov 2013

cran.rproject.org cran.rproject.org

n its spacetime representation (Ogata, 1998), the ETASmodel is a temporal marked point process model, and a special case of marked Hawks process, withconditional intensity function(t;x;yjHt) =(x;y) +Xti<tk(mi)g(tti)f(xxi;yyijmi)
Testing out PDF annotation that also include LaTeX rendered formulas.


www.plosone.org www.plosone.org

two cadavers
why two? For comparative purposes? Limited sample size?

 Sep 2013

rhetoric.eserver.org rhetoric.eserver.org

Hence the man who makes a good guess at truth is likely to make a good guess at probabilities
At first, I didn't like this quote, then I thought back to good ol' Oakley's stats class. We make scientific theories based on what idea is most likely to happen (we reject/do not reject the null hypothesis, but we do not say we accept the null hypothesis). Science: putting me in my place since I had a place to be put.
