Hypothesis

579 Matching Annotations

Jul 2020
www.youtube.com www.youtube.com

Communicating statistics, risk and uncertainty in the age of Covid - Prof. David Spiegelhalter

1
1. ErikStuchly 13 Jul 2020
  
  in BehSci
  
  Communicating statistics, risk and uncertainty in the age of Covid—Prof. David Spiegelhalter. (2020, June 30). https://www.youtube.com/watch?v=Dq7W1l7RptQ&feature=youtu.be
  
  lang:en webinar COVID-19 communication misinformation uncertainty emotional response scientific evidence quality appraisal statistics transparency trustworthiness video is:youtube
Visit annotations in context

Tags

trustworthiness

is:youtube

COVID-19

emotional response

communication

transparency

uncertainty

scientific evidence

quality appraisal

misinformation

video

lang:en

webinar

statistics

Annotators

ErikStuchly

URL

youtube.com/watch
osf.io osf.io

COVerAGE-JP: COVID-19 Deaths by Age and Sex in Japan

1
1. ErikStuchly 12 Jul 2020
  
  in BehSci
  
  Uchikoshi, F. (2020). COVerAGE-JP: COVID-19 Deaths by Age and Sex in Japan [Preprint]. SocArXiv. https://doi.org/10.31235/osf.io/cpqrt
  
  is:preprint lang:en COVID-19 risk factor age sex region date database Japan mortality statistics information gathering case reporting
Visit annotations in context

Tags

sex

Japan

COVID-19

mortality

age

is:preprint

region

database

information gathering

case reporting

lang:en

statistics

date

risk factor

Annotators

ErikStuchly

URL

osf.io/preprints/socarxiv/cpqrt/
projecteuclid.org projecteuclid.org

To Explain or to Predict?

1
1. ErikStuchly 11 Jul 2020
  
  in BehSci
  
  Shmueli, G. (2010). To Explain or to Predict? Statistical Science, 25(3), 289–310.
  
  is:article lang:en modeling statistics theory testing causality prediction explanatory power predictive power conflation distinction philosophy of science
Visit annotations in context

Tags

modeling

causality

explanatory power

distinction

is:article

theory testing

predictive power

philosophy of science

lang:en

statistics

prediction

conflation

Annotators

ErikStuchly

URL

projecteuclid.org/euclid.ss/1294167961
www.theguardian.com www.theguardian.com

Risks, R numbers and raw data: how to interpret coronavirus statistics

1
1. ErikStuchly 06 Jul 2020
  
  in BehSci
  
  Spiegelhalter, D. (2020, July 5). Risks, R numbers and raw data: How to interpret coronavirus statistics. The Observer. https://www.theguardian.com/world/2020/jul/05/risks-r-numbers-and-raw-data-how-to-interpret-coronavirus-statistics
  
  is:news lang:en COVID-19 R number risk data interpretation statistics confusion lockdown cessation clarity transmission epidemiology
Visit annotations in context

Tags

COVID-19

R number

clarity

lockdown cessation

data interpretation

risk

confusion

transmission

is:news

lang:en

statistics

epidemiology

Annotators

ErikStuchly

URL

theguardian.com/world/2020/jul/05/risks-r-numbers-and-raw-data-how-to-interpret-coronavirus-statistics
www.jclinepi.com www.jclinepi.com

Missing data should be handled differently for prediction than for description or causal explanation

1
1. ErikStuchly 04 Jul 2020
  
  in BehSci
  
  Sperrin, M., Martin, G. P., Sisk, R., & Peek, N. (2020). Missing data should be handled differently for prediction than for description or causal explanation. Journal of Clinical Epidemiology, 0(0). https://doi.org/10.1016/j.jclinepi.2020.03.028
  
  is:article lang:en missing data epidemiology statistics scientific practice prediction modeling optimization scientific method causality
Visit annotations in context

Tags

modeling

causality

optimization

is:article

missing data

scientific method

scientific practice

lang:en

statistics

prediction

epidemiology

Annotators

ErikStuchly

URL

jclinepi.com/article/S0895-4356(19)30766-8/fulltext
jasp-stats.org jasp-stats.org

Introducing JASP 0.13 - JASP - Free and User-Friendly Statistical Software

1
1. ErikStuchly 04 Jul 2020
  
  in BehSci
  
  Introducing JASP 0.13. (2020, July 2). JASP - Free and User-Friendly Statistical Software. https://jasp-stats.org/?p=6483
  
  is:webpage lang:en JASP statistics software modeling Bayesian data analysis update R module improvement bug fix is:blog
Visit annotations in context

Tags

modeling

Bayesian

data analysis

improvement

update

bug fix

is:blog

R module

JASP

software

is:webpage

lang:en

statistics

Annotators

ErikStuchly

URL

jasp-stats.org/
Jun 2020
psyarxiv.com psyarxiv.com

The Practical Alternative to the P-value is the Correctly Used P-value - Lakens.pdf

1
1. ErikStuchly 29 Jun 2020
  
  in BehSci
  
  Lakens, D. (2019). The practical alternative to the p-value is the correctly used p-value [Preprint]. PsyArXiv. https://doi.org/10.31234/osf.io/shm8v
  
  is:preprint lang:en p value metascience usefulness statistics alternative equivalence test minimum-effect test misinterpretation
Visit annotations in context

Tags

usefulness

alternative

is:preprint

equivalence test

minimum-effect test

misinterpretation

lang:en

p value

statistics

metascience

Annotators

ErikStuchly

URL

psyarxiv.com/shm8v
psyarxiv.com psyarxiv.com

Reliability Multiverse

1
1. katietaylor_99 27 Jun 2020
  
  in BehSci
  
  Parsons, Sam. ‘Reliability Multiverse’, 26 June 2020. https://doi.org/10.31234/osf.io/y6tcz.
  
  is:preprint lang:en analytic flexibility statistics effect sizes p values data processing decision making reliability measur multiverse analysis splithalf accuracy response time stroop task flanker task internal consistency test-retest arbitrary unpredictable heterogeneity error
Visit annotations in context

Tags

heterogeneity

decision making

is:preprint

unpredictable

flanker task

accuracy

stroop task

statistics

test-retest

lang:en

analytic

reliability

flexibility

data processing

effect sizes

measur

p values

arbitrary

response time

error

multiverse analysis

internal consistency

splithalf

Annotators

katietaylor_99

URL

psyarxiv.com/y6tcz/
twitter.com twitter.com

Amy Perfors on Twitter: "I’ve been having a difficult time lately — partly because of [insert frantic gesturing at the state of the world], partly personal — but one thing has been a real bright light for me in the last few months. I think it has some broader lessons that might give some hope, so THREAD" / Twitter

1
1. ErikStuchly 26 Jun 2020
  
  in BehSci
  
  Amy Perfors on Twitter: “I’ve been having a difficult time lately — partly because of [insert frantic gesturing at the state of the world], partly personal — but one thing has been a real bright light for me in the last few months. I think it has some broader lessons that might give some hope, so THREAD” / Twitter. (n.d.). Twitter. Retrieved June 26, 2020, from https://twitter.com/amyperfors/status/1275931919897595904
  
  is:tweet lang:en COVID-19 struggle academia student higher learning exam initiative perseverance statistics cooperation inspiration
Visit annotations in context

Tags

COVID-19

perseverance

initiative

cooperation

struggle

is:tweet

exam

inspiration

academia

student

higher learning

lang:en

statistics

Annotators

ErikStuchly

URL

twitter.com/amyperfors/status/1275931919897595904
www.lshtm.ac.uk www.lshtm.ac.uk

Causal inference isn't what you think it is | LSHTM

1
1. ErikStuchly 26 Jun 2020
  
  in BehSci
  
  Causal inference isn’t what you think it is. (n.d.). LSHTM. Retrieved June 26, 2020, from https://www.lshtm.ac.uk/newsevents/events/causal-inference-isnt-what-you-think-it
  
  is:webpage lang:en causal inference statistics statistical tool counterfactual reasoning potential response graph statistical decision theory assisted decision making
Visit annotations in context

Tags

potential response

statistical tool

counterfactual reasoning

statistical decision theory

causal inference

graph

is:webpage

lang:en

statistics

assisted decision making

Annotators

ErikStuchly

URL

lshtm.ac.uk/newsevents/events/causal-inference-isnt-what-you-think-it
Local file Local file

Human dorsal anterior cingulate neurons signal conflict by amplifying task-relevant information

7
1. sven.wientjes 24 Jun 2020
  
  in Public
  
  higher when Ericksen conflict was present (Figure 2A)
  
  Yeah, in single neurons you can show the detection of general conflict this way, and it was not partitionable into different responses...
  
  Interpreting the results Confusing statistics
2. sven.wientjes 24 Jun 2020
  
  in Public
  
  G)
  
  Very clear effect! suspicious? how exactly did they even select the pseudo-populations, its not clear exactly from the methods to me
  
  Confusing statistics
3. sven.wientjes 24 Jun 2020
  
  in Public
  
  pseudotrial vector x
  
  one trial for all different neurons in the current pseudopopulation matrix?
  
  Confusing statistics
4. sven.wientjes 24 Jun 2020
  
  in Public
  
  The separating hyperplane for each choice i is the vector (a) that satisfies: 770 771 772 773 Meaning that βi is a vector orthogonal to the separating hyperplane in neuron-774 dimensional space, along which position is proportional to the log odds of that correct 775 response: this is the the coding dimension for that correct response
  
  Makes sense: If Beta is proportional to the log-odds of a correct response, a is the hyperplane that provides the best cutoff, which must be orthogonal. Multiplying two orthogonal vectors yields 0.
  
  Statistics
5. sven.wientjes 24 Jun 2020
  
  in Public
  
  X is the trials by neurons pseudopopulation matrix of firing rates
  
  So these pseudopopulations were random agglomerates of single neurons that were recorded, so many fits for random groups, and the best were kept?
  
  Confusing statistics Population coding
6. sven.wientjes 24 Jun 2020
  
  in Public
  
  Within each neuron, 719 we calculated the expected firing rate for each task condition, marginalizing over 720 distractors, and for each distractor, marginalizing over tasks.
  
  Distractor = specific stimulus / location (e.g. '1' or 'left')?
  
  Task = conflict condition (e.g. Simon or Ericksen)?
  
  Residuals Ephiphenomenal hypothesis Confusing statistics
7. sven.wientjes 24 Jun 2020
  
  in Public
  
  condition-averaged within neurons (9 data points per 691 neuron, reflecting all combinations of the 3 correct response, 3 Ericksen distractors, and 3 692 Simon distractors)
  
  How do all combinations of 3 responses lead to only 9 data points per neuron? 3x2x2 = 12.
  
  Condition labeling Confusing statistics
Tags

Ephiphenomenal hypothesis

Statistics

Interpreting the results

Residuals

Condition labeling

Population coding

Confusing statistics

Annotators

sven.wientjes
twitter.com twitter.com

(6) Uživatel JASP Statistics na Twitteru: „How to bootstrap confidence intervals of regression coefficients in JASP. #stats #openSource https://t.co/wNwdyHim9a“ / Twitter

1
1. tadedvorak 22 Jun 2020
  
  in BehSci
  
  Twitter. (n.d.). Twitter. Retrieved June 22, 2020, from https://twitter.com/JASPStats/status/1274764017752592384
  
  citation is:tweet is:video JASP statistics confidence interval regression coefficients bootstrap psychology behavioral science prediction
Visit annotations in context

Tags

citation

is:video

bootstrap

is:tweet

psychology

behavioral science

JASP

confidence interval

regression

statistics

prediction

coefficients

Annotators

tadedvorak

URL

twitter.com/JASPStats/status/1274764017752592384
twitter.com twitter.com

Prof Shamika Ravi on Twitter: "1) ACTIVE cases...shows which countries have 1) Peaked: Germany, S Korea, Japan, Italy, Spain... 2) Plateaued: France 3) Yet to peak: US, UK, Brazil, India...active cases still rising. 4) Second wave: Iran and.... Spain (?) https://t.co/C5c3gAhINc" / Twitter

1
1. Marlene_Wulf 19 Jun 2020
  
  in BehSci
  
  Prof Shamika Ravi on Twitter: “1) ACTIVE cases...shows which countries have 1) Peaked: Germany, S Korea, Japan, Italy, Spain... 2) Plateaued: France 3) Yet to peak: US, UK, Brazil, India...active cases still rising. 4) Second wave: Iran and.... Spain (?) https://t.co/C5c3gAhINc” / Twitter. (n.d.). Twitter. Retrieved June 2, 2020, from https://twitter.com/ShamikaRavi/status/1267664491040440322
  
  lang:en is:twitter COVID-19 graph statistics active cases peaked plateaued yet to peak second wave Germany South Korea Japan Italy Spain France UK USA Brazil India Iran
Visit annotations in context

Tags

Japan

USA

COVID-19

active cases

France

India

second wave

yet to peak

graph

Spain

lang:en

statistics

Brazil

Iran

is:twitter

UK

Germany

plateaued

South Korea

Italy

peaked

Annotators

Marlene_Wulf

URL

twitter.com/ShamikaRavi/status/1267664491040440322
iebh.bond.edu.au iebh.bond.edu.au

2 week systematic reviews (2weekSR)

1
1. edampf 16 Jun 2020
  
  in BehSci
  
  Institute for Evidence-Based Healthcare. (n.d.) 2 week systematic reviews (2weekSR). https://iebh.bond.edu.au/education-services/2-week-systematic-reviews-2weeksr
  
  is:webpage lang:en systematic review expert statistics epidemiology research project support science
Visit annotations in context

Tags

expert

systematic review

science

support

is:webpage

lang:en

statistics

research project

epidemiology

Annotators

edampf

URL

iebh.bond.edu.au/education-services/2-week-systematic-reviews-2weeksr
medium.com medium.com

Power and precision

1
1. ErikStuchly 16 Jun 2020
  
  in BehSci
  
  Morey, R. D. (2020, June 12). Power and precision. Medium. https://medium.com/@richarddmorey/power-and-precision-47f644ddea5e
  
  is:blog lang:en statistics power precision basic concept error trade-off frequentist concept theory scientific practice
Visit annotations in context

Tags

is:blog

frequentist concept

theory

error trade-off

scientific practice

basic concept

lang:en

statistics

precision

power

Annotators

ErikStuchly

URL

medium.com/@richarddmorey/power-and-precision-47f644ddea5e
www.r-bloggers.com www.r-bloggers.com

Interactive exploration of COVID-19 exit strategies

1
1. edampf 15 Jun 2020
  
  in BehSci
  
  Dablander, F. (2020, June 11). Interactive exploration of COVID-19 exit strategies. R-Bloggers. https://www.r-bloggers.com/interactive-exploration-of-covid-19-exit-strategies/
  
  is:blog lang:en COVID-19 exit strategy interactive immunity modeling extension multidisciplinary assessment Shiny app R statistics
Visit annotations in context

Tags

interactive

modeling

COVID-19

Shiny app

R statistics

exit strategy

is:blog

extension

multidisciplinary assessment

lang:en

immunity

Annotators

edampf

URL

r-bloggers.com/interactive-exploration-of-covid-19-exit-strategies/
www.aeaweb.org www.aeaweb.org

A Proposed Specification Check for p-Hacking

1
1. ErikStuchly 15 Jun 2020
 
 in BehSci
 
 Brodeur, A., Cook, N., & Heyes, A. (2020). A Proposed Specification Check for p-Hacking. AEA Papers and Proceedings, 110, 66–69. https://doi.org/10.1257/pandp.20201078
 
 is:article lang:en p-hacking bad science statistics standardization effect size significativity causality
Visit annotations in context

Tags

causality

p-hacking

is:article

bad science

significativity

lang:en

statistics

effect size

standardization

Annotators

ErikStuchly

URL

aeaweb.org/articles
rviews.rstudio.com rviews.rstudio.com

An R View into Epidemiology

1
1. ErikStuchly 14 Jun 2020
  
  in BehSci
  
  Views, R. (2020, May 20). An R View into Epidemiology. /2020/05/20/some-r-resources-for-epidemiology/
  
  is:blog is:webpage lang:en R epidemiology package modeling COVID-19 data analysis search statistics
Visit annotations in context

Tags

modeling

data analysis

R

COVID-19

is:blog

package

is:webpage

lang:en

search

statistics

epidemiology

Annotators

ErikStuchly

URL

rviews.rstudio.com/2020/05/20/some-r-resources-for-epidemiology/
psyarxiv.com psyarxiv.com

The Extended Moral Foundations Dictionary (eMFD): Development and Applications of a Crowd-Sourced Approach to Extracting Moral Intuitions from Text

1
1. gailelhalaby 13 Jun 2020
  
  in BehSci
  
  Hopp, F. R., Fisher, J. T., Cornell, D., Huskey, R., & Weber, R. (2020). The Extended Moral Foundations Dictionary (eMFD): Development and Applications of a Crowd-Sourced Approach to Extracting Moral Intuitions from Text [Preprint]. PsyArXiv. https://doi.org/10.31234/osf.io/924gq
  
  lang:en is:preprint computational communication moral intuitions open data politics statistics moral foundation theory global influence psychology social psychology
Visit annotations in context

Tags

social psychology

moral intuitions

is:preprint

computational communication

moral foundation theory

psychology

politics

global influence

open data

lang:en

statistics

Annotators

gailelhalaby

URL

psyarxiv.com/924gq/
www.tandfonline.com www.tandfonline.com

Prediction, Estimation, and Attribution

1
1. ErikStuchly 08 Jun 2020
  
  in BehSci
  
  Efron, B. (2020). Prediction, Estimation, and Attribution. Journal of the American Statistical Association, 115(530), 636–655. https://doi.org/10.1080/01621459.2020.1762613
  
  is:article lang:en prediction estimation attribution statistics scientific method algorithm regression big data comparison
Visit annotations in context

Tags

algorithm

is:article

big data

comparison

scientific method

estimation

regression

attribution

lang:en

statistics

prediction

Annotators

ErikStuchly

URL

tandfonline.com/doi/full/10.1080/01621459.2020.1762613
twitter.com twitter.com

Adam Kucharski on Twitter: "I'm getting asked more about the 'k' parameter that describes variation in the reproduction number, R (i.e. describes superspreading). But what does this parameter actually mean? A short statistical thread... 1/" / Twitter

1
1. Marlene_Wulf 04 Jun 2020
  
  in BehSci
  
  Adam Kucharski on Twitter: “I’m getting asked more about the ‘k’ parameter that describes variation in the reproduction number, R (i.e. describes superspreading). But what does this parameter actually mean? A short statistical thread... 1/” / Twitter. (n.d.). Twitter. Retrieved June 4, 2020, from https://twitter.com/AdamJKucharski/status/1267737631481364480
  
  is:twitter lang:en statistics k parameter variation reproduction number COVID-19
Visit annotations in context

Tags

reproduction number

is:twitter

COVID-19

lang:en

variation

k parameter

statistics

Annotators

Marlene_Wulf

URL

twitter.com/AdamJKucharski/status/1267737631481364480
psyarxiv.com psyarxiv.com

JASP (Software)

1
1. Marlene_Wulf 04 Jun 2020
  
  in BehSci
  
  Han, H., & Dawson, K. J. (2020). JASP (Software) [Preprint]. PsyArXiv. https://doi.org/10.31234/osf.io/67dcb
  
  is:preprint lang:en Bayesian analysis quantitative psychology Bayesian statistics statistical analysis quantitative method JSAP open-source free statistical software package statistical testing psychological science APA
Visit annotations in context

Tags

open-source

Bayesian analysis

Bayesian statistics

psychological science

APA

quantitative psychology

statistical testing

is:preprint

statistical analysis

free

JSAP

lang:en

statistical software package

quantitative method

Annotators

Marlene_Wulf

URL

psyarxiv.com/67dcb/
journals.sagepub.com journals.sagepub.com

StatBreak: Identifying “Lucky” Data Points Through Genetic Algorithms - Hannes Rosenbusch, Leon P. Hilbert, Anthony M. Evans, Marcel Zeelenberg,

1
1. ErikStuchly 03 Jun 2020
  
  in BehSci
  
  Rosenbusch, H., Hilbert, L. P., Evans, A. M., & Zeelenberg, M. (2020). StatBreak: Identifying “Lucky” Data Points Through Genetic Algorithms. Advances in Methods and Practices in Psychological Science, 2515245920917950. https://doi.org/10.1177/2515245920917950
  
  is:article lang:en scientific practice scientific method psychology statistics algorithm data analysis replication R package
Visit annotations in context

Tags

data analysis

algorithm

is:article

R package

psychology

scientific method

replication

scientific practice

lang:en

statistics

Annotators

ErikStuchly

URL

journals.sagepub.com/doi/full/10.1177/2515245920917950
twitter.com twitter.com

Probability Fact on Twitter: "Random phenomena are not obligated to follow one of the few dozen distributions that humans have given names to." / Twitter

1
1. ErikStuchly 02 Jun 2020
  
  in BehSci
  
  Probability Fact on Twitter: “Random phenomena are not obligated to follow one of the few dozen distributions that humans have given names to.” / Twitter. (n.d.). Twitter. Retrieved June 2, 2020, from https://twitter.com/probfact/status/1267204212972236808
  
  is:tweet lang:en statistics distribution data
Visit annotations in context

Tags

distribution

data

lang:en

statistics

is:tweet

Annotators

ErikStuchly

URL

twitter.com/probfact/status/1267204212972236808
link.aps.org link.aps.org

Thresholding normally distributed data creates complex networks

1
1. ErikStuchly 02 Jun 2020
  
  in BehSci
  
  Cantwell, G. T., Liu, Y., Maier, B. F., Schwarze, A. C., Serván, C. A., Snyder, J., & St-Onge, G. (2020). Thresholding normally distributed data creates complex networks. Physical Review E, 101(6), 062302. https://doi.org/10.1103/PhysRevE.101.062302
  
  is:article lang:en statistics network data distribution complexity
Visit annotations in context

Tags

distribution

complexity

data

lang:en

statistics

is:article

network

Annotators

ErikStuchly

URL

link.aps.org/doi/10.1103/PhysRevE.101.062302
May 2020
twitter.com twitter.com

🔥Kareem Carr🔥 on Twitter: "I want to talk about bugs in statistical analyses. I think many data analysts worry unnecessarily about this. I do think it's important to put a good faith effort into avoiding bugs, but I know data analysts that live in terror of hearing there's a bug in published work. 1/6" / Twitter

1
1. ErikStuchly 30 May 2020
  
  in BehSci
  
  🔥Kareem Carr🔥 on Twitter: “I want to talk about bugs in statistical analyses. I think many data analysts worry unnecessarily about this. I do think it’s important to put a good faith effort into avoiding bugs, but I know data analysts that live in terror of hearing there’s a bug in published work. 1/6” / Twitter. (n.d.). Twitter. Retrieved May 30, 2020, from https://twitter.com/kareem_carr/status/1266029701392412673
  
  is:tweet lang:en statistics bug data analysis coding research
Visit annotations in context

Tags

data analysis

research

bug

lang:en

statistics

coding

is:tweet

Annotators

ErikStuchly

URL

twitter.com/kareem_carr/status/1266029701392412673
www.theguardian.com www.theguardian.com

Coronavirus statistics: what can we trust and what should we ignore?

1
1. Marlene_Wulf 29 May 2020
  
  in BehSci
  
  Richardson, S., & Spiegelhalter, D. (2020, April 12). Coronavirus statistics: What can we trust and what should we ignore? The Observer. https://www.theguardian.com/world/2020/apr/12/coronavirus-statistics-what-can-we-trust-and-what-should-we-ignore
  
  is:news lang:en COVID-19 statistics figure graph projection source news reliable
Visit annotations in context

Tags

source

COVID-19

news

figure

reliable

projection

graph

is:news

lang:en

statistics

Annotators

Marlene_Wulf

URL

theguardian.com/world/2020/apr/12/coronavirus-statistics-what-can-we-trust-and-what-should-we-ignore
www.nytimes.com www.nytimes.com

Putting the Risk of Covid-19 in Perspective

1
1. Marlene_Wulf 25 May 2020
  
  in BehSci
  
  Roberts, D. C. (2020, May 22). Putting the Risk of Covid-19 in Perspective. The New York Times. https://www.nytimes.com/2020/05/22/well/live/putting-the-risk-of-covid-19-in-perspective.html
  
  is:news lang:en COVID-19 risk perspective statistics personal risk rate data risk perception
Visit annotations in context

Tags

COVID-19

data

risk perception

rate

perspective

personal risk

risk

is:news

lang:en

statistics

Annotators

Marlene_Wulf

URL

nytimes.com/2020/05/22/well/live/putting-the-risk-of-covid-19-in-perspective.html
psyarxiv.com psyarxiv.com

Noise resistance in communication: Quantifying uniformity and optimality

1
1. edampf 25 May 2020
  
  in BehSci
  
  Cuskley, C., & Wallenberg, J. (2020, May 14). Noise resistance in communication: Quantifying uniformity and optimality. https://doi.org/10.31234/osf.io/wpvq4
  
  is:preprint lang:en linguistics information theory language language evolution smooth signal redundancy hypothesis utterance simulation descriptive statistics
Visit annotations in context

Tags

lang:en

language evolution

is:preprint

information theory

smooth signal redundancy hypothesis

utterance

language

descriptive statistics

simulation

linguistics

Annotators

edampf

URL

psyarxiv.com/wpvq4/
www.nature.com www.nature.com

Evolution of cooperation on temporal networks

1
1. Marlene_Wulf 12 May 2020
  
  in BehSci
  
  Li, A., Zhou, L., Su, Q., Cornelius, S. P., Liu, Y.-Y., Wang, L., & Levin, S. A. (2020). Evolution of cooperation on temporal networks. Nature Communications, 11(1), 1–9. https://doi.org/10.1038/s41467-020-16088-w
  
  is:article lang:en evolution cooperation temporal network population research statistics population structure static structure
Visit annotations in context

Tags

population

population structure

temporal

cooperation

is:article

evolution

network

static structure

research

lang:en

statistics

Annotators

Marlene_Wulf

URL

nature.com/articles/s41467-020-16088-w
www.estimationstats.com www.estimationstats.com

Estimation Stats

3
1. pyxelr 09 May 2020
  
  in Public
  
  For comparisons between 3 or more groups that typically employ analysis of variance (ANOVA) methods, one can use the Cumming estimation plot, which can be considered a variant of the Gardner-Altman plot.
  
  Cumming estimation plot
  
  DataScience datavisualisation statistics
2. pyxelr 09 May 2020
  
  in Public
  
  Efron developed the bias-corrected and accelerated bootstrap (BCa bootstrap) to account for the skew whilst obtaining the central 95% of the distribution.
  
  Bias-corrected and accelerated bootstrap (BCa boostrap) deals with skewed sample distributions. However; it must be noted that it "may not give very accurate coverage in a small-sample non-parametric situation" (simply said, take caution with small datasets)
  
  DataScience statistics
3. pyxelr 09 May 2020
  
  in Public
  
  We can calculate the 95% CI of the mean difference by performing bootstrap resampling.
  
  Bootstrap - simple but powerful technique that creates multiple resamples (with replacement) from a single set of observations, and computes the effect size of interest on each of these resamples. It can be used to determine the 95% CI (Confidence Interval).
  
  We can use bootstrap resampling to obtain measure of precision and confidence about our estimate. It gives us 2 important benefits:
  
  Non-parametric statistical analysis - no need to assume normal distribution of our observations. Thanks to Central Limit Theorem, the resampling distribution of the effect size will approach normality
  
  Easy construction of the 95% CI from the resampling distribution. For 1000 bootstrap resamples of the mean difference, 25th value and 975th value can be used as boundaries of the 95% CI.
  
  Bootstrap resampling can be used for such an example:
  
  Computers can easily perform 5000 resamples:
  
  DataScience statistics
Visit annotations in context

Tags

datavisualisation

statistics

DataScience

Annotators

pyxelr

URL

estimationstats.com/
www.nass.usda.gov www.nass.usda.gov

United States

1
1. SamRose 07 May 2020
  
  in Public
  
  food21 surveys probability nonprobability food abundance index statistics
Visit annotations in context

Tags

probability

surveys

food21

nonprobability

food abundance index

statistics

Annotators

SamRose

URL

nass.usda.gov/Education_and_Outreach/Understanding_Statistics/Statistical_Aspects_of_Surveys/survey_is_survey.pdf
www.nass.usda.gov www.nass.usda.gov

United States

1
1. SamRose 07 May 2020
  
  in Public
  
  food21 surveys sampling statistics coefficient of variation
Visit annotations in context

Tags

surveys

sampling

food21

statistics

coefficient of variation

Annotators

SamRose

URL

nass.usda.gov/Education_and_Outreach/Understanding_Statistics/Statistical_Aspects_of_Surveys/sampling_errors.pdf
psyarxiv.com psyarxiv.com

Analyzing nonresponse in longitudinal surveys using Bayesian additive regression trees: A nonparametric event history analysis

1
1. edampf 07 May 2020
  
  in BehSci
  
  Zinn, S., & Gnambs, T. (2020, April 18). Analyzing nonresponse in longitudinal surveys using Bayesian additive regression trees: A nonparametric event history analysis. https://doi.org/10.31234/osf.io/82c3w
  
  is:preprint lang:en Bayesian panel study nonresponse regression tree Bayes longitudinal BART statistics survey dropout modeling
Visit annotations in context

Tags

panel study

Bayesian

nonresponse

longitudinal

BART

Bayes

survey

modeling

is:preprint

regression tree

dropout

lang:en

statistics

Annotators

edampf

URL

psyarxiv.com/82c3w/
github.com github.com

rmcelreath/statrethinking_winter2019

1
1. edampf 07 May 2020
  
  in BehSci
  
  McElreath, R. Statistical Rethinking: A Bayesian Course Using R and Stan Github.com. https://github.com/rmcelreath/statrethinking_winter2019
  
  Entire course with materials online.
  
  is:other lang:en Bayesian Stan R statistics MPI code Github lecture course
Visit annotations in context

Tags

Bayesian

R

code

Github

course

lecture

MPI

Stan

is:other

lang:en

statistics

Annotators

edampf

URL

github.com/rmcelreath/statrethinking_winter2019
statmodeling.stat.columbia.edu statmodeling.stat.columbia.edu

New analysis of excess coronavirus mortality; also a question about poststratification « Statistical Modeling, Causal Inference, and Social Science

1
1. edampf 05 May 2020
  
  in BehSci
  
  Statistical Modeling, Causal Inference, and Social Science. (2020 April 22). Blog Post: New analysis of excess coronavirus mortality; also a question about poststratification. https://statmodeling.stat.columbia.edu/2020/04/22/analysis-of-excess-coronavirus-mortality-also-a-question-about-poststratification/
  
  is:blog COVID-19 lang:en statistics Italy Gaussian Process data analysis mortality death rate Lombardia counterfactual poststratification synthetic control prediction
Visit annotations in context

Tags

synthetic control

data analysis

counterfactual

COVID-19

mortality

Lombardia

death rate

is:blog

Italy

lang:en

statistics

prediction

Gaussian Process

poststratification

Annotators

edampf

URL

statmodeling.stat.columbia.edu/2020/04/22/analysis-of-excess-coronavirus-mortality-also-a-question-about-poststratification/
Apr 2020
towardsdatascience.com towardsdatascience.com

RIP correlation. Introducing the Predictive Power Score

10
1. pyxelr 30 Apr 2020
  
  in Public
  
  the limitations of the PPS
  
  Limitations of the PPS:
  
  Slower than correlation
  
  Score cannot be interpreted as easily as the correlation (it doesn't tell you anything about the type of relationship). PPS is better for finding patterns and correlation is better for communicating found linear relationships
  
  You cannot compare the scores for different target variables in a strict math way because they're calculated using different evaluation metrics
  
  There are some limitations of the components used underneath the hood
  
  You've to perform forward and backward selection in addition to feature selection
  
  DataScience statistics
2. pyxelr 30 Apr 2020
  
  in Public
  
  How to use the PPS in your own (Python) project
  
  Using PPS with Python
  
  Download ppscore: pip install ppscoreshell
  
  Calculate the PPS for a given pandas dataframe:
  import ppscore as pps pps.score(df, "feature_column", "target_column")
  
  Calculate the whole PPS matrix:
  pps.matrix(df)
  
  DataScience statistics Python
3. pyxelr 30 Apr 2020
  
  in Public
  
  The PPS clearly has some advantages over correlation for finding predictive patterns in the data. However, once the patterns are found, the correlation is still a great way of communicating found linear relationships.
  
  PPS:
  
  good for finding predictive patterns
  
  can be used for feature selection
  
  can be used to detect information leakage between variables
  
  interpret PPS matrix as a directed graph to find entity structures Correlation:
  
  good for communicating found linear relationships
  
  DataScience statistics
4. pyxelr 30 Apr 2020
  
  in Public
  
  Let’s compare the correlation matrix to the PPS matrix on the Titanic dataset.
  
  Comparing correlation matrix and the PPS matrix of the Titanic dataset:
  
  findings about the correlation matrix:
  
  Correlation matrix is smaller because it doesn't work for categorical data
  
  Correlation matrix shows a negative correlation between TicketPrice and Class. For PPS, it's a strong predictor (0.9 PPS), but not the other way Class to TicketPrice (ticket of 5000-10000$ is most likely the highest class, but the highest class itself cannot determine the price)
  
  findings about the PPS matrix:
  
  First row of the matrix tells you that the best univariate predictor of the column Survived is the column Sex (Sex was dropped for correlation)
  
  TicketID uncovers a hidden pattern as well as it's connection with the TicketPrice
  
  DataScience statistics
5. pyxelr 30 Apr 2020
  
  in Public
  
  Let’s use a typical quadratic relationship: the feature x is a uniform variable ranging from -2 to 2 and the target y is the square of x plus some error.
  
  In this scenario:
  
  we can predict y using x
  
  we cannot predict x using y as x might be negative or positive (for y=4, x=2 or -2
  
  the correlation is 0. Both from x to y and from y to x because the correlation is symmetric (more often relationships are assymetric!). However, the PPS from x to y is 0.88 (not 1 because of existing error)
  
  PPS from y to x is 0 because there's no relationship that y can predict if it only knows its own value
  
  DataScience statistics
6. pyxelr 30 Apr 2020
  
  in Public
  
  how do you normalize a score? You define a lower and an upper limit and put the score into perspective.
  
  Normalising a score:
  
  you need to put a lower and upper limit
  
  upper limit can be F1 = 1, and a perfect MAE = 0
  
  lower limit depends on the evaluation metric and your data set. It's the value that a naive predictor achieves
  
  DataScience statistics
7. pyxelr 30 Apr 2020
  
  in Public
  
  For a classification problem, always predicting the most common class is pretty naive. For a regression problem, always predicting the median value is pretty naive.
  
  What is a naive model:
  
  predicting common class for a classification problem
  
  predicting median value for a regression problem
  
  DataScience statistics
8. pyxelr 30 Apr 2020
  
  in Public
  
  Let’s say we have two columns and want to calculate the predictive power score of A predicting B. In this case, we treat B as our target variable and A as our (only) feature. We can now calculate a cross-validated Decision Tree and calculate a suitable evaluation metric.
  
  If the target (B) variable is:
  
  numeric - we can use a Decision Tree Regressor and calculate the Mean Absolute Error (MAE)
  
  categoric - we can use a Decision Tree Classifier and calculate the weighted F1 (or ROC)
  
  DataScience statistics
9. pyxelr 30 Apr 2020
  
  in Public
  
  More often, relationships are asymmetric
  
  a column with 3 unique values will never be able to perfectly predict another column with 100 unique values. But the opposite might be true
  
  DataScience statistics
10. pyxelr 30 Apr 2020
  
  in Public
  
  there are many non-linear relationships that the score simply won’t detect. For example, a sinus wave, a quadratic curve or a mysterious step function. The score will just be 0, saying: “Nothing interesting here”. Also, correlation is only defined for numeric columns.
  
  Correlation:
  
  doesn't work with non-linear data
  
  doesn't work for categorical values
  
  Examples:
  
  DataScience statistics
Visit annotations in context

Tags

Python

statistics

DataScience

Annotators

pyxelr

URL

towardsdatascience.com/rip-correlation-introducing-the-predictive-power-score-3d90808b9598
math.stackexchange.com math.stackexchange.com

The expected payoff of a dice game

1
1. pyxelr 29 Apr 2020
  
  in Public
  
  Suppose you have only two rolls of dice. then your best strategy would be to take the first roll if its outcome is more than its expected value (ie 3.5) and to roll again if it is less.
  
  Expected payoff of a dice game:
  
  Description: You have the option to throw a die up to three times. You will earn the face value of the die. You have the option to stop after each throw and walk away with the money earned. The earnings are not additive. What is the expected payoff of this game?
  
  Rolling twice: $$\frac{1}{6}(6+5+4) + \frac{1}{2}3.5 = 4.25.$$
  
  Rolling three times: $$\frac{1}{6}(6+5) + \frac{2}{3}4.25 = 4 + \frac{2}{3}$$
  
  statistics math
Visit annotations in context

Tags

statistics

math

Annotators

pyxelr

URL

math.stackexchange.com/questions/179534/the-expected-payoff-of-a-dice-game
math.stackexchange.com math.stackexchange.com

Expected Number of Coin Tosses to Get Five Consecutive Heads

1
1. pyxelr 29 Apr 2020
  
  in Public
  
  Therefore, En=2n+1−2=2(2n−1)
  
  Simplified formula for the expected number of tosses (e) to get n consecutive heads (n≥1):
  
  $$e_n=2(2^n-1)$$
  
  For example, to get 5 consecutive heads, we've to toss the coin 62 times:
  
  $$e_n=2(2^5-1)=62$$
  
  We can also start with the longer analysis of the 5 scenarios:
  
  If we get a tail immediately (probability 1/2) then the expected number is e+1.
  
  If we get a head then a tail (probability 1/4), then the expected number is e+2.
  
  If we get two head then a tail (probability 1/8), then the expected number is e+2.
  
  If we get three head then a tail (probability 1/16), then the expected number is e+4.
  
  If we get four heads then a tail (probability 1/32), then the expected number is e+5.
  
  Finally, if our first 5 tosses are heads, then the expected number is 5.
  
  Thus:
  
  $$e=\frac{1}{2}(e+1)+\frac{1}{4}(e+2)+\frac{1}{8}(e+3)+\frac{1}{16}\\(e+4)+\frac{1}{32}(e+5)+\frac{1}{32}(5)=62$$
  
  We can also generalise the formula to:
  
  $$e_n=\frac{1}{2}(e_n+1)+\frac{1}{4}(e_n+2)+\frac{1}{8}(e_n+3)+\frac{1}{16}\\(e_n+4)+\cdots +\frac{1}{2^n}(e_n+n)+\frac{1}{2^n}(n) $$
  
  statistics math probability
Visit annotations in context

Tags

probability

statistics

math

Annotators

pyxelr

URL

math.stackexchange.com/questions/364038/expected-number-of-coin-tosses-to-get-five-consecutive-heads
psyarxiv.com psyarxiv.com

Priors in a Bayesian Audit: How Integration of Existing Information into the Prior Distribution Can Increase Transparency, Efficiency, and Quality

1
1. Marlene_Wulf 28 Apr 2020
  
  in BehSci
  
  Derks, K., de swart, j., van Batenburg, P., Wagenmakers, E., & wetzels, r. (2020, April 28). Priors in a Bayesian Audit: How Integration of Existing Information into the Prior Distribution Can Increase Transparency, Efficiency, and Quality. Retrieved from psyarxiv.com/8fhkp
  
  is:preprint lang:en Bayesian statistics audit financial statement prior distribution testing phase analysis
Visit annotations in context

Tags

distribution

Bayesian

prior

financial

phase

is:preprint

audit

statement

analysis

testing

lang:en

statistics

Annotators

Marlene_Wulf

URL

psyarxiv.com/8fhkp/
stats.stackexchange.com stats.stackexchange.com

Difference between replication and repeated measurements

1
1. pyxelr 27 Apr 2020
  
  in Public
  
  Repeated measures involves measuring the same cases multiple times. So, if you measured the chips, then did something to them, then measured them again, etc it would be repeated measures. Replication involves running the same study on different subjects but identical conditions. So, if you did the study on n chips, then did it again on another n chips that would be replication.
  
  Difference between repeated measures and replication
  
  DataScience statistics
Visit annotations in context

Tags

statistics

DataScience

Annotators

pyxelr

URL

stats.stackexchange.com/questions/62312/difference-between-replication-and-repeated-measurements
psyarxiv.com psyarxiv.com

COVID-19 Knowledge and Perceptions in Nigeria

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Olapegba, P. O., Ayandele, O., Kolawole, S. O., Oguntayo, R., Gandi, J. C., Dangiwa, A. L., … Iorfa, S. K. (2020, April 12). COVID-19 Knowledge and Perceptions in Nigeria. https://doi.org/10.31234/osf.io/j356x
  
  is:preprint COVID-19 lang:en Nigeria infection prevention knowledge media perception general public lockdown questionnaire data collection descriptive statistics China misinformation transmission symptom news health information information public health precaution behavior misconception
Visit annotations in context

Tags

information

COVID-19

media

lockdown

is:preprint

behavior

misinformation

lang:en

symptom

knowledge

data collection

perception

news

misconception

precaution

infection

China

questionnaire

general public

public health

Nigeria

health information

descriptive statistics

transmission

prevention

Annotators

edampf

URL

psyarxiv.com/j356x/
arxiv.org arxiv.org

1907.11162.pdf

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Taleb, N. N. (2019). On the Statistical Differences between Binary Forecasts and Real World Payoffs. ArXiv:1907.11162 [Physics, q-Fin]. http://arxiv.org/abs/1907.11162
  
  is:preprint forecast prediction statistics mathematics payoff decision science economics psychology pandemic lang:en
Visit annotations in context

Tags

payoff

economics

is:preprint

psychology

mathematics

pandemic

forecast

lang:en

statistics

prediction

decision science

Annotators

edampf

URL

arxiv.org/pdf/1907.11162.pdf
doi.org doi.org

Is the spread of COVID-19 across countries influenced by environmental, economic and social factors?

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Hossain, M. A. (2020). Is the spread of COVID-19 across countries influenced by environmental, economic and social factors? [Preprint]. Epidemiology. https://doi.org/10.1101/2020.04.08.20058164
  
  is:preprint COVID-19 lang:en environment economics social science statistics weather temperature economic openness democracy government analysis infection transmission
Visit annotations in context

Tags

temperature

economics

environment

COVID-19

weather

government

is:preprint

infection

economic openness

democracy

analysis

transmission

lang:en

statistics

social science

Annotators

edampf

URL

doi.org/10.1101/2020.04.08.20058164
users.ox.ac.uk users.ox.ac.uk

Untitled document

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Bird, S., Nielsen, B. (2020 April 20). Now-casting of Covid-19 deaths in English Hospitals. http://users.ox.ac.uk/~nuff0078/Covid/index.htm
  
  is:webpage COVID-19 lang:en UK England now-casting hospitalization mortality statistics plot data interpretation report death distribution prediction data data collection
Visit annotations in context

Tags

distribution

COVID-19

data

England

now-casting

is:webpage

lang:en

statistics

prediction

plot

data collection

death

mortality

UK

report

data interpretation

hospitalization

Annotators

edampf

URL

users.ox.ac.uk/~nuff0078/Covid/index.htm
doi.org doi.org

Perceptions and behavioural responses of the general public during the COVID-19 pandemic: A cross-sectional survey of UK Adults

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Atchison, C. J., Bowman, L., Vrinten, C., Redd, R., Pristera, P., Eaton, J. W., & Ward, H. (2020). Perceptions and behavioural responses of the general public during the COVID-19 pandemic: A cross-sectional survey of UK Adults [Preprint]. Public and Global Health. https://doi.org/10.1101/2020.04.01.20050039
  
  is:preprint COVID-19 lang:en UK cross-sectional survey response behavior perception adult risk perception government data collection statistics demographics social distancing lockdown self-isolation quarentine prevention handwashing face mask minority policy economy transmission dynamics modeling
Visit annotations in context

Tags

modeling

COVID-19

survey

government

economy

lockdown

demographics

social distancing

is:preprint

behavior

minority

transmission dynamics

lang:en

statistics

handwashing

policy

data collection

face mask

response

cross-sectional

perception

risk perception

self-isolation

UK

quarentine

adult

prevention

Annotators

edampf

URL

doi.org/10.1101/2020.04.01.20050039
twitter.com twitter.com

ReconfigBehSci en Twitter: "an interesting source of statistics, both on COVID-19 and other issues that help provide some context to numbers https://t.co/T0wBZIlCfR" / Twitter

1
1. Marlene_Wulf 23 Apr 2020
  
  in BehSci
  
  ReconfigBehSci en Twitter: “an interesting source of statistics, both on COVID-19 and other issues that help provide some context to numbers https://t.co/T0wBZIlCfR” / Twitter. (n.d.). Twitter. Retrieved April 17, 2020, from https://twitter.com/SciBeh/status/1246714565850734592
  
  is:twitter lang:en COVID-19 statistics number worldometer
Visit annotations in context

Tags

worldometer

is:twitter

COVID-19

lang:en

statistics

number

Annotators

Marlene_Wulf

URL

twitter.com/SciBeh/status/1246714565850734592
www.ons.gov.uk www.ons.gov.uk

Deaths involving COVID-19, England and Wales - Office for National Statistics

1
1. Marlene_Wulf 23 Apr 2020
  
  in BehSci
  
  Deaths involving COVID-19, England and Wales—Office for National Statistics. (n.d.). Retrieved April 20, 2020, from https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/bulletins/deathsinvolvingcovid19englandandwales/deathsoccurringinmarch2020
  
  is:article lang:en COVID-19 death rate England Wales March 2020 age sex region statistics
Visit annotations in context

Tags

sex

COVID-19

age

death rate

is:article

England

March

Wales

2020

lang:en

statistics

region

Annotators

Marlene_Wulf

URL

ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/bulletins/deathsinvolvingcovid19englandandwales/deathsoccurringinmarch2020
Mar 2020
datagenetics.com datagenetics.com

Toilet Paper: How long does it last?

3
1. pyxelr 18 Mar 2020
  
  in Public
  
  This volume of paper should be the same as the coaxial plug of paper on the roll.
  
  Calculating volume of the paper roll: $$\mathbf{Lwt = \pi w(R^2 - r^2)} \~\ L = \text{length of the paper} \ w = \text{width of the paper} \ t = \text{thickness} \ R = \text{outer radius} \ r = \text{inner radius}$$ And that simplifies into a formula for R: $$\color{red} {\bf R = \sqrt{\frac{Lt}{\pi}+r^2}}$$
  
  statistics math
2. pyxelr 18 Mar 2020
  
  in Public
  
  This shows the nonlinear relationship and how the consumption accelerates. The first 10% used makes just a 5% change in the diameter of the roll. The last 10% makes an 18.5% change.
  
  Consumption of a toilet paper roll has a nonlinear relationship between the:
  
  y-axis (outer Radius of the roll (measured as a percentage of a full roll))
  
  x-axis (% of the roll consumed)
  
  statistics
3. pyxelr 18 Mar 2020
  
  in Public
  
  Toilet paper is typically supplied in rolls of perforated material wrapped around a central cardboard tube. There’s a little variance between manufacturers, but a typical roll is approximately 4.5” wide with an 5.25” external diameter, and a central tube of diameter 1.6” Toilet paper is big business (see what I did there?) Worldwide, approximately 83 million rolls are produced per day; that’s a staggering 30 billion rolls per year. In the USA, about 7 billion rolls a year are sold, so the average American citizen consumes two dozen rolls a year (two per month). Americans use 24 rolls per capita a year of toilet paper Again, it depends on the thickness and luxuriousness of the product, but the perforations typically divide the roll into approximately 1,000 sheets (for single-ply), or around 500 sheets (for double-ply). Each sheet is typically 4” long so the length of a (double-ply) toilet roll is approximately 2,000” or 167 feet (or less, if your cat gets to it).
  
  Statistics on the type and use of toilet paper in the USA.
  
  1" (inch) = 2.54 cm
  
  statistics
Visit annotations in context

Tags

statistics

math

Annotators

pyxelr

URL

datagenetics.com/blog/march52020/index.html
intellspot.com intellspot.com

10 Interval Data Examples: Interval Scale Definition & Meaning

2
1. pyxelr 15 Mar 2020
  
  in Public
  
  In the interval scale, there is no true zero point or fixed beginning. They do not have a true zero even if one of the values carry the name “zero.” For example, in the temperature, there is no point where the temperature can be zero. Zero degrees F does not mean the complete absence of temperature. Since the interval scale has no true zero point, you cannot calculate Ratios. For example, there is no any sense the ratio of 90 to 30 degrees F to be the same as the ratio of 60 to 20 degrees. A temperature of 20 degrees is not twice as warm as one of 10 degrees.
  
  Interval data:
  
  show not only order and direction, but also the exact differences between the values
  
  the distances between each value on the interval scale are meaningful and equal
  
  no true zero point
  
  no fixed beginning
  
  no possibility to calculate ratios (only add and substract)
  
  e.g.: temperature in Fahrenheit or Celsius (but not Kelvin) or IQ test
  
  DataScience statistics
2. pyxelr 15 Mar 2020
  
  in Public
  
  As the interval scales, Ratio scales show us the order and the exact value between the units. However, in contrast with interval scales, Ratio ones have an absolute zero that allows us to perform a huge range of descriptive statistics and inferential statistics. The ratio scales possess a clear definition of zero. Any types of values that can be measured from absolute zero can be measured with a ratio scale. The most popular examples of ratio variables are height and weight. In addition, one individual can be twice as tall as another individual.
  
  Ratio data is like interval data, but with:
  
  absolute zero
  
  possibility to calculate ratio (e.g. someone can be twice as tall)
  
  possibility to not only add and subtract, but multiply and divide values
  
  e.g.: weight, height, Kelvin scale (50K is 2x hot as 25K)
  
  DataScience statistics
Visit annotations in context

Tags

statistics

DataScience

Annotators

pyxelr

URL

intellspot.com/interval-data-examples/
towardsdatascience.com towardsdatascience.com

Understanding AUC - ROC Curve – Towards Data Science

5
1. pyxelr 02 Mar 2020
  
  in Public
  
  when AUC is 0.5, it means model has no class separation capacity whatsoever.
  
  If AUC = 0.5
  
  DataScience statistics
2. pyxelr 02 Mar 2020
  
  in Public
  
  ROC is a probability curve and AUC represents degree or measure of separability. It tells how much model is capable of distinguishing between classes.
  
  ROC & AUC
  
  DataScience statistics
3. pyxelr 02 Mar 2020
  
  in Public
  
  In multi-class model, we can plot N number of AUC ROC Curves for N number classes using One vs ALL methodology. So for Example, If you have three classes named X, Y and Z, you will have one ROC for X classified against Y and Z, another ROC for Y classified against X and Z, and a third one of Z classified against Y and X.
  
  Using AUC ROC curve for multi-class model
  
  DataScience statistics
4. pyxelr 02 Mar 2020
  
  in Public
  
  When AUC is approximately 0, model is actually reciprocating the classes. It means, model is predicting negative class as a positive class and vice versa
  
  If AUC = 0
  
  DataScience statistics
5. pyxelr 02 Mar 2020
  
  in Public
  
  AUC near to the 1 which means it has good measure of separability.
  
  If AUC = 1
  
  DataScience statistics
Visit annotations in context

Tags

statistics

DataScience

Annotators

pyxelr

URL

towardsdatascience.com/understanding-auc-roc-curve-68b2303cc9c5
victorzhou.com victorzhou.com

A Simple Explanation of the Softmax Function - victorzhou.com

1
1. pyxelr 02 Mar 2020
  
  in Public
  
  Softmax turns arbitrary real values into probabilities
  
  Softmax function -
  
  outputs of the function are in range [0,1] and add up to 1. Hence, they form a probability distribution
  
  the calcualtion invloves e (mathematical constant) and performs operation on n numbers: $$s(x_i) = \frac{e^{xi}}{\sum{j=1}^n e^{x_j}}$$
  
  the bigger the value, the higher its probability
  
  lets us answer classification questions with probabilities, which are more useful than simpler answers (e.g. binary yes/no)
  
  DataScience statistics
Visit annotations in context

Tags

statistics

DataScience

Annotators

pyxelr

URL

victorzhou.com/blog/softmax/
www.linkedin.com www.linkedin.com

LinkedIn

1
1. pyxelr 02 Mar 2020
  
  in Public
  
  1. Logistic regression IS a binomial regression (with logit link), a special case of the Generalized Linear Model. It doesn't classify anything *unless a threshold for the probability is set*. Classification is just its application. 2. Stepwise regression is by no means a regression. It's a (flawed) method of variable selection. 3. OLS is a method of estimation (among others: GLS, TLS, (RE)ML, PQL, etc.), NOT a regression. 4. Ridge, LASSO - it's a method of regularization, NOT a regression. 5. There are tens of models for the regression analysis. You mention mainly linear and logistic - it's just the GLM! Learn the others too (link in a comment). STOP with the "17 types of regression every DS should know". BTW, there're 270+ statistical tests. Not just t, chi2 & Wilcoxon
  
  5 clarifications to common misconceptions shared over data science cheatsheets on LinkedIn
  
  DataScience statistics
Visit annotations in context

Tags

statistics

DataScience

Annotators

pyxelr

URL

linkedin.com/posts/adrianolszewski_biostatistics-statistics-rstats-activity-6621020198185111552-jw12/
www.linkedin.com www.linkedin.com

LinkedIn

1
1. pyxelr 02 Mar 2020
  
  in Public
  
  An exploratory plot is all about you getting to know the data. An explanatory graphic, on the other hand, is about telling a story using that data to a specific audience.
  
  Exploratory vs Explanatory plot
  
  DataScience statistics visualisation
Visit annotations in context

Tags

visualisation

statistics

DataScience

Annotators

pyxelr

URL

linkedin.com/feed/update/urn:li:activity:6632255681946767360/
Feb 2020
www.magasinetparagraf.se www.magasinetparagraf.se

SD vill införa ”värdefostrande” uppfostringsanstalter

1
1. pivic 27 Feb 2020
  
  in Public
  
  (Återkommande forskning visar att 85-90 procent av tonårskillar begår brott. Allt från snatteri upp till rån och mord. Och det oavsett om de har invandrarbakgrund eller inte. Cirka 97-98 procent av de här killarna blir sedan skötsamma arbetande vuxna medborgare – som beklagar sig över ungdomsbrottsligheten.)
  
  statistik brott sverige sweden crime statistics
Visit annotations in context

Tags

brott

statistik

crime

sweden

sverige

statistics

Annotators

pivic

URL

magasinetparagraf.se/nyheter/kronikor/197170-sd-vill-ha-vardefostrande-uppfostringsanstalter/
www.lesswrong.com www.lesswrong.com

Your intuitions are not magic - LessWrong 2.0

1
1. TylerRick 19 Feb 2020
  
  in Public
  
  false assumptions rationality statistics intuition
Visit annotations in context

Tags

rationality

false assumptions

statistics

intuition

Annotators

TylerRick

URL

lesswrong.com/posts/Psp8ZpYLCDJjshpRb/your-intuitions-are-not-magic
Jan 2020
www.theglobeandmail.com www.theglobeandmail.com

In the dark: The cost of Canada’s data deficit

1
1. kodourra 07 Jan 2020
  
  in Public
  
  In the dark: the cost of Canada's data deficit
  
  data data deficit canada statistics canada census data cihi rdc
Visit annotations in context

Tags

data deficit

cihi

canada

census data

data

rdc

statistics canada

Annotators

kodourra

URL

theglobeandmail.com/canada/article-in-the-dark-the-cost-of-canadas-data-deficit/
Dec 2019
www.ncbi.nlm.nih.gov www.ncbi.nlm.nih.gov

How can we boost IQs of “dull children”?: A late adoption study

1
1. Drugs_and_Ethics 19 Dec 2019
  
  in Public
  
  The average IQs of adopted children in lower and higher socioeconomic status (SES) families were 85 (SD = 17) and 98 (SD = 14.6), respectively, at adolescence (mean age = 13.5 years)
  
  I'm looking for the smallest standard deviation in an adopted sample to compare the average difference to that of identical twins. This study suggests that the SD in adoption is identical to the SD in the general population. This supports the idea that lower SD in adopted identical twins is entirely down to genes (or, in principal, prenatal environment).
  
  Note that this comment is referring to this Reddit inquiry.
  
  IQ adoption statistics
Visit annotations in context

Tags

statistics

adoption

IQ

Annotators

Drugs_and_Ethics

URL

ncbi.nlm.nih.gov/pmc/articles/PMC17595/
andrewmaclachlan.github.io andrewmaclachlan.github.io

Chapter 9 GWR and spatially lagged regression | CASA0005 Geographic Information Systems and Science

1
1. fufu 10 Dec 2019
  
  in Public
  
  If you are running a regression model on data that do not have explicit space or time dimensions, then the standard test for autocorrelation would be the Durbin-Watson test.
  
  Durbin-Watson test?
  
  statistics
Visit annotations in context

Tags

statistics

Annotators

fufu

URL

andrewmaclachlan.github.io/CASA0005repo/gwr-and-spatially-lagged-regression.html
blog.cyphermox.net blog.cyphermox.net

ss: another way to get socket statistics

1
1. almereyda 03 Dec 2019
  
  in Public
  
  ss socket statistics iproute2 ip iptables filter
Visit annotations in context

Tags

socket statistics

iptables

iproute2

ip

ss

filter

Annotators

almereyda

URL

blog.cyphermox.net/2017/05/ss-another-way-to-get-socket-statistics.html
Nov 2019
users.hist.umn.edu users.hist.umn.edu

Untitled document

1
1. cpsupolicyresearch 29 Nov 2019
  
  in Public
  
  For most of the twentieth century, Census Bureau administrators resisted private-sector intrusion into data capture and processing operations, but beginning in the mid-1990s, the Census Bureau increasingly turned to outside vendors from the private sector for data captureand processing. Thisprivatization led to rapidly escalating costs, reduced productivity, near catastrophic failures of the 2000 and 2010 censuses, and high risks for the 2020 census.
  
  Parallels to ABS in Australia
  
  Census Australian Bureau of Statistics privatisation outsourcing ICT
Visit annotations in context

Tags

privatisation

outsourcing

Census

Australian Bureau of Statistics

ICT

Annotators

cpsupolicyresearch

URL

users.hist.umn.edu/~ruggles/Articles/Ruggles_Magnuson_JAH.pdf
a-little-book-of-r-for-time-series.readthedocs.io a-little-book-of-r-for-time-series.readthedocs.io

Using R for Time Series Analysis — Time Series 0.2 documentation

1
1. udaybhaskar 07 Nov 2019
  
  in Public
  
  This booklet itells you how to use the R statistical software to carry out some simple analyses that are common in analysing time series data.
  
  what is time series?
  
  time series Statistics data analysis
Visit annotations in context

Tags

data analysis

time series

Statistics

Annotators

udaybhaskar

URL

a-little-book-of-r-for-time-series.readthedocs.io/en/latest/src/timeseries.html
docdrop.org docdrop.org

“Dear Future President of the United States”: Analyzing Youth Civic Writing Within the 2016 Letters to the Next President Project

1
1. yedaye 01 Nov 2019
  
  in Public
  
  Top 20 topic categories.
  
  Immigration, Guns, Education, that exactly what I choose for my three letters comments. I think this result is also influenced by media. Every day these three areas are the main subject developed on media. 10 years ago the result will show different areas.
  
  statistics
Visit annotations in context

Tags

statistics

Annotators

yedaye

URL

docdrop.org/static/drop-pdf/0002831219870129-tuHH9.pdf
Aug 2019
ec.europa.eu ec.europa.eu

Category:Passengers - Statistics Explained

1
1. Aliaksei_ 30 Aug 2019
  
  in Public
  
  Pages in category "Passengers"
  
  Statistics for transportation in EU
  
  Statistics
Visit annotations in context

Tags

Statistics

Annotators

Aliaksei_

URL

ec.europa.eu/eurostat/statistics-explained/index.php
www.uitp.org www.uitp.org

Key statistics

1
1. Aliaksei_ 30 Aug 2019
  
  in Public
  
  On public transport ridership in the EU
  
  A screenshot is needed
  
  Statistics
Visit annotations in context

Tags

Statistics

Annotators

Aliaksei_

URL

uitp.org/key-statistics
www.sanalabs.com www.sanalabs.com

Sana Labs - AI for Education

1
1. markus22 06 Aug 2019
  
  in Public
  
  ie. decision tree split, entropy minimum or information max at 0.5:0.5 split
  
  statistics
Visit annotations in context

Tags

statistics

Annotators

markus22

URL

sanalabs.com/
Jul 2019
www.jwilber.me www.jwilber.me

Permutation Test: Visual Explanation

1
1. markus22 24 Jul 2019
 
 in Public
 
 In statistical testing, we structure experiments in terms of null & alternative hypotheses. Our test will have the following hypothesis schema: Η0: μtreatment <= μcontrol ΗA: μtreatment > μcontrol Our null hypothesis claims that the new shampoo does not increase wool quality. The alternative hypothesis claims the opposite; new shampoo yields superior wool quality.
 
 hypothesis schema; statistics
 
 statistics
Visit annotations in context

Tags

statistics

Annotators

markus22

URL

jwilber.me/permutationtest/
Jun 2019
pib.nic.in pib.nic.in

Press Note : Clarification regarding the Statistical reforms and the existing GDP series

1
1. manga555 11 Jun 2019
  
  in Public
  
  Ministries will be involved in close monitoring and supervision of the field work to ensure data quality and good coverage. This is the first time that the rigours of monitoring and supervision of field work exercised in NSS will be leveraged for the Economic Census so that results of better quality would be available for creation of a National Statistical Business Register. This process has been catalysed by the establishment of a unified National Statistical Office (NSO).
  
  pib statistics ministry
Visit annotations in context

Tags

statistics ministry

pib

Annotators

manga555

URL

pib.nic.in/PressReleaseIframePage.aspx
May 2019
www.theatlantic.com www.theatlantic.com

A Waste of 1,000 Research Papers

1
1. chrisaldrich 18 May 2019
  
  in Public
  
  It’s as if they’d been “describing the life cycle of unicorns, what unicorns eat, all the different subspecies of unicorn, which cuts of unicorn meat are tastiest, and a blow-by-blow account of a wrestling match between unicorns and Bigfoot,” Alexander wrote.
  
  research statistics
Visit annotations in context

Tags

research

statistics

Annotators

chrisaldrich

URL

theatlantic.com/science/archive/2019/05/waste-1000-studies/589684/
en.wikipedia.org en.wikipedia.org

Parametric statistics - Wikipedia

1
1. Maciej_Motyka 15 May 2019
  
  in Public
  
  Parametric statistics is a branch of statistics which assumes that sample data comes from a population that can be adequately modelled by a probability distribution that has a fixed set of parameters.[1] Conversely a non-parametric model differs precisely in that the parameter set (or feature set in machine learning) is not fixed and can increase, or even decrease, if new relevant information is collected.[2] Most well-known statistical methods are parametric.[3] Regarding nonparametric (and semiparametric) models, Sir David Cox has said, "These typically involve fewer assumptions of structure and distributional form but usually contain strong assumptions about independencies".[4]
  
  Non-parametric vs parametric stats
  
  statistics
Visit annotations in context

Tags

statistics

Annotators

Maciej_Motyka

URL

en.wikipedia.org/wiki/Parametric_statistics
en.wikipedia.org en.wikipedia.org

Nonparametric statistics - Wikipedia, the free encyclopedia

2
1. Maciej_Motyka 15 May 2019
  
  in Public
  
  Statistical hypotheses concern the behavior of observable random variables.... For example, the hypothesis (a) that a normal distribution has a specified mean and variance is statistical; so is the hypothesis (b) that it has a given mean but unspecified variance; so is the hypothesis (c) that a distribution is of normal form with both mean and variance unspecified; finally, so is the hypothesis (d) that two unspecified continuous distributions are identical. It will have been noticed that in the examples (a) and (b) the distribution underlying the observations was taken to be of a certain form (the normal) and the hypothesis was concerned entirely with the value of one or both of its parameters. Such a hypothesis, for obvious reasons, is called parametric. Hypothesis (c) was of a different nature, as no parameter values are specified in the statement of the hypothesis; we might reasonably call such a hypothesis non-parametric. Hypothesis (d) is also non-parametric but, in addition, it does not even specify the underlying form of the distribution and may now be reasonably termed distribution-free. Notwithstanding these distinctions, the statistical literature now commonly applies the label "non-parametric" to test procedures that we have just termed "distribution-free", thereby losing a useful classification.
  
  Non-parametric vs parametric statistics
  
  statistics
2. Maciej_Motyka 15 May 2019
  
  in Public
  
  Non-parametric methods are widely used for studying populations that take on a ranked order (such as movie reviews receiving one to four stars). The use of non-parametric methods may be necessary when data have a ranking but no clear numerical interpretation, such as when assessing preferences. In terms of levels of measurement, non-parametric methods result in ordinal data. As non-parametric methods make fewer assumptions, their applicability is much wider than the corresponding parametric methods. In particular, they may be applied in situations where less is known about the application in question. Also, due to the reliance on fewer assumptions, non-parametric methods are more robust. Another justification for the use of non-parametric methods is simplicity. In certain cases, even when the use of parametric methods is justified, non-parametric methods may be easier to use. Due both to this simplicity and to their greater robustness, non-parametric methods are seen by some statisticians as leaving less room for improper use and misunderstanding. The wider applicability and increased robustness of non-parametric tests comes at a cost: in cases where a parametric test would be appropriate, non-parametric tests have less power. In other words, a larger sample size can be required to draw conclusions with the same degree of confidence.
  
  Non-parametric vs parametric statistics
  
  statistics
Visit annotations in context

Tags

statistics

Annotators

Maciej_Motyka

URL

en.wikipedia.org/wiki/Nonparametric_statistics
en.wikipedia.org en.wikipedia.org

Statistical data type - Wikipedia

1
1. Maciej_Motyka 15 May 2019
  
  in Public
  
  The concept of data type is similar to the concept of level of measurement, but more specific: For example, count data require a different distribution (e.g. a Poisson distribution or binomial distribution) than non-negative real-valued data require, but both fall under the same level of measurement (a ratio scale).
  
  statistics
Visit annotations in context

Tags

statistics

Annotators

Maciej_Motyka

URL

en.wikipedia.org/wiki/Statistical_data_type
outline.com outline.com

Muslims are not terrorists! And neither are most terrorists Muslims!

12
1. jakepalandri 14 May 2019
  
  in Outline
  
  Even if Muslims were hypothetically behind every single one of the 140,000 terror attacks committed worldwide since 1970, those terrorists would represent barely 0.009 percent of global Islam
  
  This is a veryyy relevant statistic, thank god.
  
  Statistics
2. jakepalandri 14 May 2019
  
  in Outline
  
  That is, deaths from terrorism account for 0.025 of the total number of murders, or 2.5%
  
  Irrelevant statistics IMO
  
  Bad Statistics
3. jakepalandri 14 May 2019
  
  in Outline
  
  American Muslims have killed less than 0.0002 percent of those murdered in the USA during this period
  
  selection of detail
  
  Statistics Selection of Detail
4. jakepalandri 14 May 2019
  
  in Outline
  
  How many people did toddlers kill in 2013? Five, all by accidentally shooting a gun
  
  selection of detail of outlandish statistic to emphasise main point
  
  Statistics Selection of Detail
5. jakepalandri 14 May 2019
  
  in Outline
  
  you actually have a better chance of being killed by a refrigerator falling on you
  
  selection of detail of outlandish statistic to emphasise main point
  
  Statistics Selection of Detail
6. jakepalandri 14 May 2019
  
  in Outline
  
  Since 9/11, Muslim-American terrorism has claimed 37 lives in the United States, out of more than 190,000 murders during this period
  
  stats
  
  Statistics
7. jakepalandri 14 May 2019
  
  in Outline
  
  pproximately 60 were carried out by Muslims. In other words, approximately 2.5% of all terrorist attacks on US soil between 1970 and 2012 were carried out by Muslims.
  
  stats
  
  Statistics
8. jakepalandri 14 May 2019
  
  in Outline
  
  94 percent of the terror attacks were committed by non-Muslims
  
  stats
  
  Statistics
9. jakepalandri 14 May 2019
  
  in Outline
  
  Muslim terrorists were responsible for a meagre 0.3 percent of EU terrorism during those years.
  
  stats
  
  Statistics
10. jakepalandri 14 May 2019
  
  in Outline
  
  in 2013, there were 152 terrorist attacks in Europe. Only two of them were “religiously motivated”, while 84 were predicated on ethno-nationalist or separatist beliefs
  
  stats
  
  Statistics
11. jakepalandri 14 May 2019
  
  in Outline
  
  in the 4 years between 2011 and 2014 there were 746 terrorist attacks in Europe. Of these, only eight were religiously-inspired, which is 1% of the total
  
  stats
  
  Statistics
12. jakepalandri 14 May 2019
  
  in Outline
  
  official data from Europol
  
  Stats
  
  Statistics
Visit annotations in context

Tags

Bad

Statistics

Selection of Detail

Annotators

jakepalandri

URL

outline.com/phxAAk
bengoldhaber.com bengoldhaber.com

Why aren’t there more hacks?

2
1. ghabs 12 May 2019
  
  in Public
  
  info-request
  
  What is the current price of cyber insurance? Has it gone up in price?
  
  statistics
2. ghabs 12 May 2019
  
  in Public
  
  info-request
  
  Looking for statistics on the number of cybercrime prosecutions over time.
  
  statistics
Visit annotations in context

Tags

statistics

Annotators

ghabs

URL

bengoldhaber.com//2019/05/10/em-hackers.html
Apr 2019
mp.weixin.qq.com mp.weixin.qq.com

果壳

4
1. Sauchin_chen 09 Apr 2019
  
  in Public
  
  要保持谦逊：兼容性评估的前提是用于计算区间的统计假设是正确的
  
  應翻為確認統計假設的正確性。這點看出他們的立論基於估計的參數，而非實在的科學理論。統計假設是科學理論推理的延伸，只用推理合乎有效的邏輯形式，有效結果與無效結果都會是科學理論的證據。
  
  statistics
2. Sauchin_chen 09 Apr 2019
  
  in Public
  
  在给定假设的情况下，区间内数值与研究数据的兼容性并不完全相同
  
  原文“not all values inside are equally compatible with the data, given the assumptions. ” 這裡的assumption是指估計的參數，還是科學理論對現實狀況的預測，並沒有明確說明。如果是估計的參數，Amrhein等人也許將P(D|theta)當成P(theta)。
  
  statistics
3. Sauchin_chen 09 Apr 2019
  
  in Public
  
  我们看到了大量具有“统计学显著性”的结果；而不具有“统计学显著性”的结果则被显著低估
  
  豈止低估。不顯著的研究結果經常被鎖起來不見天日。
  
  statistics
4. Sauchin_chen 09 Apr 2019
  
  in Public
  
  在置信区间包含风险显著增高的情况下，仅因为结果不具有统计学显著性就推论药物与房颤发生“无关”十分可笑；据此就认为前后两项研究矛盾——即便风险比完全一致——同样非常荒谬。这些常见情况表明我们依赖的统计学显著性阈值有可能误导我们。
  
  Amrhein 等人以此例子顯示confidence interval能突顯不一致的研究之間，評估要測量的效應其實一致的資訊。
  
  statistics
Visit annotations in context

Tags

statistics

Annotators

Sauchin_chen

URL

mp.weixin.qq.com/s/Jm9AarE30ydtooeqXvNnEQ
Mar 2019
medium.economist.com medium.economist.com

Mistakes, we’ve drawn a few – The Economist

1
1. pivic 28 Mar 2019
  
  in Public
  
  At The Economist, we take data visualisation seriously. Every week we publish around 40 charts across print, the website and our apps. With every single one, we try our best to visualise the numbers accurately and in a way that best supports the story. But sometimes we get it wrong. We can do better in future if we learn from our mistakes — and other people may be able to learn from them, too.
  
  This is, factually and literally speaking, laudable in the extreme.
  
  Anybody can make mistakes; the best one can do is to admit that one does, and publicly learn from them - if one is a magazine. This is beauteously done.
  
  the economist learning statistics visualisation
Visit annotations in context

Tags

visualisation

the economist

statistics

learning

Annotators

pivic

URL

medium.economist.com/mistakes-weve-drawn-a-few-8cdd8a42d368
www.nature.com www.nature.com

Scientists rise up against statistical significance

1
1. mlenc 27 Mar 2019
  
  in Public
  
  statistics controversy pvalues p-values
Visit annotations in context

Tags

pvalues

controversy

statistics

p-values

Annotators

mlenc

URL

nature.com/articles/d41586-019-00857-9
Jan 2019
inst-fs-iad-prod.inscloudgate.net inst-fs-iad-prod.inscloudgate.net

Spearman’s g found in 31 non-Western nations: Strong evidence that g is a universal phenomenon.

2
1. rwarne 23 Jan 2019
 
 in APA Publishing Open
 
 the strongest first factor accounted for 86.3% of observed variable variance
 
 I suspect that this factor was so strong because it consisted of only four observed variables, and three of them were written measures of verbal content. All of the verbal cariables correlated r = .72 to .89. Even the "non-verbal" variable (numerical ability) correlates r = .72 to .81 with the other three variables (Rehna & Hanif, 2017, p. 25). Given these strong correlations, a very strong first factor is almost inevitable.
 
 intelligence statistics factor analysis
2. rwarne 23 Jan 2019
 
 in APA Publishing Open
 
 The weakest first factor accounted for 18.3% of variance
 
 This factor may be weak because the sample consists of Sudanese gifted children, which may have restricted the range of correlations in the dataset.
 
 intelligence statistics restriction of range gifted students factor analysis
Visit annotations in context

Tags

gifted students

restriction of range

factor analysis

statistics

intelligence

Annotators

rwarne

URL

inst-fs-iad-prod.inscloudgate.net/files/d6655cf3-b382-4fc8-98b3-fe58fe4a4a1e/Warne & Burningham (2019) - Spearman's g found in 31 non-western nations, g as universal phenomenon (uh what).pdf
Dec 2018
www.anthropic-principle.com www.anthropic-principle.com

A Primer on the Doomsday Argument | anthropic-principle.com

1
1. andycyca 05 Dec 2018
  
  in Public
  
  The Doomsday argument
  
  The Doomsday argument (DA) is a probabilistic argument that claims to predict the number of future members of the human species given only an estimate of the total number of humans born so far. Simply put, it says that supposing that all humans are born in a random order, chances are that any one human is born roughly in the middle.
  
  From Wikipedia, Doomsday argument
  
  doomsday argument statistics
Visit annotations in context

Tags

doomsday argument

statistics

Annotators

andycyca

URL

anthropic-principle.com/
Nov 2018
www.coursera.org www.coursera.org

Basic Statistics | Coursera

1
1. mglotov 15 Nov 2018
  
  in Public
  
  Basic Statistics
  
  part of specialization
  
  statistics math education
Visit annotations in context

Tags

education

statistics

math

Annotators

mglotov

URL

coursera.org/learn/basic-statistics
www.insidehighered.com www.insidehighered.com

Online education gives adults access, but student outcomes lag | Inside Higher Ed

1
1. beachboundhawaii 02 Nov 2018
  
  in Public
  
  Online Options Give Adults Access, but Outcomes Lag
  
  In this article, drivers that increase and improve online learning success in adults are explored. State by state data along with federal stats contribute to the conclusions presented.
  
  Roughly 13% of all undergraduates are full-time online students and between 2012 and 2017 online students grew y 11 percent, about 2.25 million. The article presents a map showing state by state stats and the information provided can assist in growing individual school needs.
  
  RATING: 4/5 (rating based upon a score system 1 to 5, 1= lowest 5=highest in terms of content, veracity, easiness of use etc.)
  
  etcnau etc556 online success online education growth online classroom enrollment digital enrollment online learning statistics online learning success
Visit annotations in context

Tags

online success

online learning success

digital enrollment

etc556

etcnau

online classroom enrollment

online education growth

online learning statistics

Annotators

beachboundhawaii

URL

insidehighered.com/digital-learning/article/2018/06/20/online-education-gives-adults-access-student-outcomes-lag
Sep 2018
nces.ed.gov nces.ed.gov

Student Enrollment - How many students enroll in postsecondary institutions annually?

1
1. jeremydean 14 Sep 2018
  
  in Public
  
  education statistics
Visit annotations in context

Tags

education

statistics

Annotators

jeremydean

URL

nces.ed.gov/ipeds/trendgenerator/
stackoverflow.com stackoverflow.com

Double centering in R

1
1. rschulz 14 Sep 2018
  
  in Public
  
  Double-centering a matrix M
  
  linear-algebra matrix centering double-centering statistics
Visit annotations in context

Tags

double-centering

statistics

linear-algebra

centering

matrix

Annotators

rschulz

URL

stackoverflow.com/questions/43639063/double-centering-in-r
stats.stackexchange.com stats.stackexchange.com

Relationship between ridge regression and PCA regression

1
1. rschulz 14 Sep 2018
  
  in Public
  
  Relationship between ridge regression and PCA regression
  
  statistics PCA regression ridge SVD
Visit annotations in context

Tags

SVD

regression

ridge

PCA

statistics

Annotators

rschulz

URL

stats.stackexchange.com/questions/81395/relationship-between-ridge-regression-and-pca-regression
Local file Local file

Untitled document

2
1. morenita 05 Sep 2018
  
  in Public
  
  Lack of metrics
  
  Develop or strengthen statistical information of new industry developments, ups and downs of existing industries.
  
  Greg Halseth t-statistics t-economy d-federal d-provincial d-municipal t-community
2. morenita 05 Sep 2018
  
  in Public
  
  LDLC workers and their needs
  
  Develop or strengthen statistics regarding LDLC workers and their needs
  
  Greg Halseth t-statistics t-labour market d-federal d-municipal d-provincial t-community
Tags

d-provincial

t-economy

t-labour market

Greg Halseth

t-community

d-federal

t-statistics

d-municipal

Annotators

morenita
www.statisticssolutions.com www.statisticssolutions.com

Conduct and Interpret a Canonical Correlation - Statistics Solutions

1
1. ashutosh_pathak 01 Sep 2018
  
  in Public
  
  predictive analysis
  
  Predictive analytics encompasses a variety of statistical techniques from data mining, predictive modelling, and machine learning, that analyze current and historical facts to make predictions about future or otherwise unknown events.
  
  statistics machine learning data mining facts
Visit annotations in context

Tags

machine learning

statistics

facts

data mining

Annotators

ashutosh_pathak

URL

statisticssolutions.com/testing-assumptions-of-linear-regression-in-spss/
Apr 2018
www.madinamerica.com www.madinamerica.com

Scientists Fight Against the Myth of the Normal or Optimal Brain

1
1. Perig 06 Apr 2018
  
  in Public
  
  Excellent example of badly used statistics.
  
  neurodiversity bad science statistics
Visit annotations in context

Tags

statistics

bad science

neurodiversity

Annotators

Perig

URL

madinamerica.com/2018/04/scientists-fight-myth-normal-optimal-brain/
Mar 2018
www.r-bloggers.com www.r-bloggers.com

Boxplots and Beyond – Part II: Asymmetry

1
1. rschulz 17 Mar 2018
  
  in Public
  
  Boxplots and Beyond – Part II: Asymmetry
  
  R ggplot outlier statistics boxplot IQD
Visit annotations in context

Tags

boxplot

ggplot

IQD

R

outlier

statistics

Annotators

rschulz

URL

r-bloggers.com/boxplots-and-beyond-–-part-ii-asymmetry/
eurekastatistics.com eurekastatistics.com

Using the Median Absolute Deviation to Find Outliers

1
1. rschulz 17 Mar 2018
  
  in Public
  
  Using the Median Absolute Deviation to Find Outliers
  
  statistics MAD median eureka R outlier
Visit annotations in context

Tags

R

outlier

MAD

statistics

median

eureka

Annotators

rschulz

URL

eurekastatistics.com/using-the-median-absolute-deviation-to-find-outliers/
www.ncbi.nlm.nih.gov www.ncbi.nlm.nih.gov

Power analysis and sample size estimation for RNA-Seq differential expression. - PubMed - NCBI

1
1. rschulz 05 Mar 2018
  
  in Public
  
  Power analysis and sample size estimation for RNA-Seq differential expression
  
  RNAseq power statistics
Visit annotations in context

Tags

statistics

power

RNAseq

Annotators

rschulz

URL

ncbi.nlm.nih.gov/pubmed/25246651
www.ncbi.nlm.nih.gov www.ncbi.nlm.nih.gov

Polyester: simulating RNA-seq datasets with differential transcript expression. - PubMed - NCBI

1
1. rschulz 05 Mar 2018
  
  in Public
  
  Polyester: simulating RNA-seq datasets with differential transcript expression
  
  RNAseq simulation power statistics transcriptomics
Visit annotations in context

Tags

simulation

RNAseq

transcriptomics

statistics

power

Annotators

rschulz

URL

ncbi.nlm.nih.gov/pubmed/25926345
Feb 2018
hypothes.is hypothes.is

Hypothesis

1
1. CaraBell17 27 Feb 2018
  
  in Public
  
  My daughter will be brought up to understand her true value. That’s a promise. As for all the little girls to be born around the world, the creation of these ads is an effort to show how imagination can change the conversation around their lives.
  
  The article lists interesting statistics but the most disapointing information is that so many girls are not given a chance. I wish she mentioned how she would bring up her own daughter to understand her value. Hopefully new efforts will bring about a change of opinion faster.
Visit annotations in context

Tags

The article lists interesting statistics but the most disapointing information is that so many girls are not given a chance. I wish she mentioned how she would bring up her own daughter to understand her value. Hopefully

new efforts will bring about a change of opinion faster.

Annotators

CaraBell17

URL

hypothes.is/groups/AAEwBqQZ/eng-111-spring-2018
achri.blogspot.com achri.blogspot.com

Combining data from multiple RNASeq experiments: release the Kruskal! (...Wallis test)

1
1. rschulz 09 Feb 2018
  
  in Public
  
  Combining data from multiple RNASeq experiments: release the Kruskal! (...Wallis test)
  
  RNAseq statistics Kruskal Wallis batch sleuth kallisto
Visit annotations in context

Tags

Kruskal

batch

Wallis

sleuth

RNAseq

statistics

kallisto

Annotators

rschulz

URL

achri.blogspot.com/2017/03/combining-rnaseq-experiments-to-find.html
Dec 2017
www.sciencedirect.com www.sciencedirect.com

Neural correlates of interspecies perspective taking in the post-mortem Atlantic Salmon: an argument for multiple comparisons correction

1
1. rschulz 04 Dec 2017
  
  in Public
  
  Neural correlates of interspecies perspective taking in the post-mortem Atlantic Salmon: an argument for multiple comparisons correction
  
  statistics multiple-testing salmon dead-salmon imaging MRI fMRI
Visit annotations in context

Tags

salmon

fMRI

MRI

imaging

multiple-testing

statistics

dead-salmon

Annotators

rschulz

URL

sciencedirect.com/science/article/pii/S1053811909712029
royalsocietypublishing.org royalsocietypublishing.org

The natural selection of bad science

1
1. rschulz 04 Dec 2017
  
  in Public
  
  The natural selection of bad science
  
  journal-club science statistics publication impact
Visit annotations in context

Tags

publication

impact

journal-club

science

statistics

Annotators

rschulz

URL

royalsocietypublishing.org/doi/10.1098/rsos.160384
Nov 2017
rafalab.github.io rafalab.github.io

harvardx

1
1. rschulz 14 Nov 2017
  
  in Public
  
  HarvardX Biomedical Data Science Open Online Training
  
  teaching statistics R analysis-tool data-analysis youtube
Visit annotations in context

Tags

youtube

R

analysis-tool

data-analysis

statistics

teaching

Annotators

rschulz

URL

rafalab.github.io/pages/harvardx.html
www.sciencedirect.com www.sciencedirect.com

Genetic and Functional Drivers of Diffuse Large B Cell Lymphoma

1
1. rschulz 06 Nov 2017
  
  in Public
  
  pairwise overlaps using Fisher’s test and mutual exclusion (Leiserson et al., 2016xA weighted exact test for mutually exclusive mutations in cancer. Leiserson, M.D.M., Reyna, M.A., and Raphael, B.J. Bioinformatics. 2016; 32: i736–i745Crossref | PubMed | Scopus (4)See all ReferencesLeiserson et al., 2016)
  
  cancer statistics statistical-test Fisher
Visit annotations in context

Tags

cancer

statistics

statistical-test

Fisher

Annotators

rschulz

URL

sciencedirect.com/science/article/pii/S0092867417311212
www.ncbi.nlm.nih.gov www.ncbi.nlm.nih.gov

Gene Set Enrichment Analysis Made Simple

1
1. rschulz 02 Nov 2017
  
  in Public
  
  Gene Set Enrichment Analysis Made Simple
  
  using aggregate t or chi^2 statistic to test if a set of genes is on aggregate differentially expressed
  
  statistics gene-expression gene-set-enrichment GO aggregate
Visit annotations in context

Tags

GO

gene-set-enrichment

statistics

aggregate

gene-expression

Annotators

rschulz

URL

ncbi.nlm.nih.gov/pmc/articles/PMC3134237/
Oct 2017
www.restore.ac.uk www.restore.ac.uk

4.2 An Introduction to Odds, Odds Ratios and Exponents

1
1. rschulz 24 Oct 2017
  
  in Public
  
  An Introduction to Odds, Odds Ratios and Exponents
  
  statistics logit regression logistic
Visit annotations in context

Tags

logistic

statistics

logit

regression

Annotators

rschulz

URL

restore.ac.uk/srme/www/fac/soc/wie/research-new/srme/modules/mod4/2/index.html
tedunderwood.com tedunderwood.com

Seven ways humanists are using computers to understand text.

1
1. jeremydean 15 Oct 2017
  
  in Public
  
  One of the main ways computers are changing the textual humanities is by mediating new connections to social science. The statistical models that help sociologists understand social stratification and social change haven’t in the past contributed much to the humanities, because it’s been difficult to connect quantitative models to the richer, looser sort of evidence provided by written documents.
  
  DH as moving English more toward the statistical...
  
  Digital Humanities statistics social science English
Visit annotations in context

Tags

English

social science

statistics

Digital Humanities

Annotators

jeremydean

URL

tedunderwood.com/2015/06/04/seven-ways-humanists-are-using-computers-to-understand-text/
May 2017
fivethirtyeight.com fivethirtyeight.com

Dissecting Trump’s Most Rabid Online Following

1
1. daveh70 31 May 2017
  
  in Public
  
  Analysis of a subreddit for Trump supporters, based on comparisons of the users of various subreddits.
  
  social media statistics data analysis
Visit annotations in context

Tags

social media

statistics

data analysis

Annotators

daveh70

URL

fivethirtyeight.com/features/dissecting-trumps-most-rabid-online-following/
static1.squarespace.com static1.squarespace.com

Narrative and Database: Natural Symbionts

1
1. gilmanhernandez 01 May 2017
  
  in Public
  
  s. If we want to understand the effects of global warming or whether the economy is headed for a recessio
  
  The class on The Rhetorical Situation brought up discussion on the evolving notion of "weather" as a changeable, even rhetorical, thing. Moving to integrate database and narrative as symbionts makes a connection between data and delivery/appeal.
  
  #STEM #Statistics
Visit annotations in context

Tags

#STEM

#Statistics

Annotators

gilmanhernandez

URL

static1.squarespace.com/static/53713bf0e4b0297decd1ab8b/t/588810c417bffc6698ada4e4/1485312199068/hayles_narrative_and_database_natural_symbionts.pdf
Apr 2017
bangordailynews.com bangordailynews.com

As paper mills die, here’s how Maine’s loggers hope to survive

1
1. mshook 05 Apr 2017
  
  in Public
  
  The annual drop in Maine wood demand since 2014 would fill that imaginary 1,770-mile caravan. The loss equals about 350 fewer truckloads of wood a day, every day of the year.
  
  maine wood forest industry business factoid stats statistics april 2017
Visit annotations in context

Tags

stats

forest

wood

2017

maine

industry

april

business

statistics

factoid

Annotators

mshook

URL

bangordailynews.com/2017/04/03/the-point/as-paper-mills-die-heres-how-maines-loggers-hope-to-survive/
Mar 2017
bangordailynews.com bangordailynews.com

Maine growers struggle with surplus blueberries, plummeting prices

2
1. mshook 27 Mar 2017
  
  in Public
  
  The state has pumped more than 100 million pounds of low bush fruit into the frozen market each year for the last three growing cycles.
  
  maine business politics statistics bdn march 2017
2. mshook 27 Mar 2017
  
  in Public
  
  A typical acre of blueberry barrens will yield about 2,000 to 4,000 pounds of berries, depending on pollination and other factors.
  
  maine stats statistics business agriculture blue march 2017 bdn
Visit annotations in context

Tags

stats

march

blue

2017

agriculture

statistics

bdn

maine

politics

business

Annotators

mshook

URL

bangordailynews.com/2017/03/25/business/maine-growers-struggle-with-surplus-blueberries-plummeting-prices/
tachesdesens.blogspot.com tachesdesens.blogspot.com

Out of the box.

1
1. Sensor63 25 Mar 2017
  
  in Public
  
  I never regret the eleven months which hardened my resolve, to go beyond 98 'Nos' to get to the precious, unexpected 'Yes's'. I was nobody, I was selling nothing, I could be nobody selling anything.
  
  Numbers
  
  Statistics
  
  Alienation
  
  alienation numbers statistics story science
Visit annotations in context

Tags

story

alienation

numbers

science

statistics

Annotators

Sensor63

URL

tachesdesens.blogspot.com/2014/09/out-of-box.html
Feb 2017
static1.squarespace.com static1.squarespace.com

campbell_from_the_philosophy_of_rhetoric.pdf

3
1. gilmanhernandez 08 Feb 2017
  
  in Public
  
  That two dice marked in the common way will tum up seven, is thrice as probable as that they will tum up eleven, and six times as probable as that they will tum up twelve
  
  D&D has made me embarrassingly good at estimating probable outcomes of platonic die in my head.
  
  #Nerd #Shame #Statistics
2. gilmanhernandez 08 Feb 2017
  
  in Public
  
  In moral reasoning we ascend from pos-~ibility, by an insensible gradation, to probabil-ity, and thence, in the same manner, to the sum-mit of moral certainty.
  
  I believe Campbell addresses some of the uncertainty of Inductive Reasoning here. The phrase "insensible gradation" seems meaningful--how we go from a possibility to moral certainty is something fundamentally difficult in a manner Hume cannot accept. But Campbell explains in this section many of the difficulties of this, and how it's still useable, for moral judgment.
  
  On the same side, I come back to Bayesian Probabilities, wondering if Campbell knew about them, and how they transfer statistical, mathematical knowledge towards determining if a hypothesis is true. Once again, I'm hesitant that I'll exceed my grasp of stats if I talk to much about it, though.
  
  #Statistics #Induction
3. gilmanhernandez 08 Feb 2017
  
  in Public
  
  The course of nature will be the same tomorrow that it is today; or, the future will resemble the past"
  
  Apparently, this is a surprisingly successful rationale for meteorology. If you just assume "tomorrow's weather will resemble today's," you'll end up more right than not, and can actually beat some meteorologists. Then again, Jim Flowers and the KMTV Accu-Weather Forecast might have just been terrible.
  
  #Childhood #Statistics
Visit annotations in context

Tags

#Statistics

#Childhood

#Shame

#Induction

#Nerd

Annotators

gilmanhernandez

URL

static1.squarespace.com/static/53713bf0e4b0297decd1ab8b/t/586d26126b8f5b6deb443c43/1483548199126/campbell_from_the_philosophy_of_rhetoric.pdf
Oct 2016
www.businessinsider.com www.businessinsider.com

How IoT in Education is Changing the Way We Learn

1
1. Enkerli 18 Oct 2016
  
  in Public
  
  With figures like those, it's clear that the education system isn't going away anytime soon.
  
  How so?
  
  Business Models for Education statistics quants
Visit annotations in context

Tags

quants

statistics

Business Models for Education

Annotators

Enkerli

URL

businessinsider.com/internet-of-things-education-2016-9
Sep 2016
www.thelocal.se www.thelocal.se

Swedish head reported for using 'hon' not 'hen'

1
1. daniel.odonnell 26 Sep 2016
  
  in Public
  
  According to the language periodical Språktidningen, ‘hen’ was by 2014 used once in the Swedish media for every 300 used of ‘hon’ or ‘han’, up from one in every 13,000 in 2011
  
  Increasing rate of usage of hen vs. hon or han: 1/13,000 in 2011; 1/300 in 2014.
  
  grammar usage statistics newspapers hen swedish
Visit annotations in context

Tags

newspapers

swedish

hen

usage

statistics

grammar

Annotators

daniel.odonnell

URL

thelocal.se/20160402/swedish-school-head-reported-for
May 2016
www.propublica.org www.propublica.org

Machine Bias: There’s Software Used Across the Country to Predict Future Criminals. And it’s Biased Against Blacks.

1
1. mikarv 23 May 2016
  
  in Public
  
  the algorithm was somewhat more accurate than a coin flip
  
  In machine learning it's also important to evaluate not just against random, but against how well other methods (e.g. parole boards) do. That kind of analysis would be nice to see.
  
  machine learning statistics
Visit annotations in context

Tags

machine learning

statistics

Annotators

mikarv

URL

propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing/
medium.com medium.com

A guy just transcribed 30 years of for-rent ads. Here’s what it taught us about housing prices — Medium

1
1. daveh70 18 May 2016
  
  in Public
  
  A study of housing cost in San Francisco from the 1950s to 2016.
  
  economics statistics housing cost rent control
Visit annotations in context

Tags

housing cost

statistics

economics

rent control

Annotators

daveh70

URL

medium.com/@andersem/a-guy-just-transcribed-30-years-of-for-rent-ads-heres-what-it-taught-us-about-sf-housing-prices-bd61fd0e4ef9
Mar 2016
amstat.tandfonline.com amstat.tandfonline.com

The ASA's statement on p-values: context, process, and purpose

1
1. daveh70 07 Mar 2016
  
  in Public
  
  American Statistical Association statement on p-values
  
  statistics data analysis science
Visit annotations in context

Tags

data analysis

science

statistics

Annotators

daveh70

URL

amstat.tandfonline.com/doi/abs/10.1080/00031305.2016.1154108
Feb 2016
www.apa.org www.apa.org

APA Survey Shows Money Stress Weighing on Americans’ Health Nationwide

1
1. vegetabile 29 Feb 2016
  
  in Public
  
  3,068 adults in August 2014, found that 72 percent of Americans reported feeling stressed about money at least some of the time during the past month. Twenty-two percent said that they experienced extreme stress about money during the past month (an 8, 9 or 10 on a 10-point scale, where 1 is “little or no stress” and 10 is “a great deal of stress”). For the majority of Americans (64 percent), money is a somewhat or very significant source of stress, but especially for parents and younger adults (77 percent of parents, 75 percent of millennials [18 to 35 years old] and 76 percent of Gen Xers [36 to 49 years old]).
  
  Along the lines of the first paragraph except putting some percentages into it. Almost three quarters of Americans (out of a 3,000 person survey) feels some kind of "extreme stress about money" each month, the majority coming from parents, adults and young adults (18-35). I'll incorporate this into my paper by using statistics to show how money is a huge reason for stress in adults.
  
  Money statistics
Visit annotations in context

Tags

Money statistics

Annotators

vegetabile

URL

apa.org/news/press/releases/2015/02/money-stress.aspx
bangordailynews.com bangordailynews.com

State forester: Wood harvesting near Quimby’s land part of larger plan

1
1. mshook 23 Feb 2016
  
  in Public
  
  He expects that the logging project near Quimby’s land will likely generate about $755,250 at the state’s average sale price, $50.35 per cord of wood. The land has about 1,500 harvestable acres that contain about 30 cords of wood per acre, or 45,000 cords, but only about a third of that will be cut because the land is environmentally sensitive, Denico said. The Bureau of Parks and Lands expects to generate about $6.6 million in revenue this year selling about 130,000 cords of wood from its lots, Denico said. Last year, the bureau generated about $7 million harvesting about 139,000 cords of wood. The Legislature allows the cutting of about 160,000 cords of wood on state land annually, although the LePage administration has sought to increase that amount.
  
  forest maine wood business february 2016 bdn lepage park stats statistics
Visit annotations in context

Tags

stats

forest

park

2016

wood

bdn

lepage

maine

february

business

statistics

Annotators

mshook

URL

bangordailynews.com/2016/02/23/news/penobscot/state-forester-wood-harvesting-near-quimbys-land-part-of-larger-plan/
www.psychologytoday.com www.psychologytoday.com

The Faulty Foundation of Higher Education

2
1. daveh70 20 Feb 2016
  
  in Public
  
  From 1926 until the early 1950s, US military aircraft relied on a "one size fits all" design based on average measurements of hundreds of male pilots.
  
  But a 1950 study by Lt. Gilbert Daniels showed that out of 4,063 airmen, not even one was average in all ten measurements. They started designing cockpits and controls to be adjustable. Accidents decreased, and pilot performance increased.
  
  Standardized education makes the same mistake.
  
  education statistics
2. daveh70 20 Feb 2016
  
  in Public
  
  The science of the individual relies on dynamic systems theory rather than group statistics. Its research methodology is characterized by “analyze, then aggregate” (analyze each subject separately, then combine individual patterns into collective understanding) rather than “aggregate, then analyze” (derive group statistics based on aggregate data, then use these statistics to evaluate and understand individuals).
  
  A mathematical psychologist at Penn State University, Molenaar extended ergodic theory (link is external) to prove that it was not mathematically permissible to use assessment instruments based on group averages to evaluate individuals.
  
  A Manifesto on Psychology as Idiographic Science, Peter Molenaar
  
  science statistics psychology
Visit annotations in context

Tags

education

science

statistics

psychology

Annotators

daveh70

URL

psychologytoday.com/blog/the-science-the-individual/201602/the-faulty-foundation-higher-education
leanpub.com leanpub.com

Roger D. Peng

1
1. daveh70 02 Feb 2016
  
  in Public
  
  Books on data science and R programming by Roger D. Peng of Johns Hopkins.
  
  statistics data science data analysis data visualization
Visit annotations in context

Tags

data analysis

data science

statistics

data visualization

Annotators

daveh70

URL

leanpub.com/u/rdpeng
blog.cloudera.com blog.cloudera.com

Common Probability Distributions: The Data Scientist's Crib Sheet - Cloudera Engineering Blog

1
1. daveh70 01 Feb 2016
  
  in Public
  
  Great explanation of 15 common probability distributions: Bernouli, Uniform, Binomial, Geometric, Negative Binomial, Exponential, Weibull, Hypergeometric, Poisson, Normal, Log Normal, Student's t, Chi-Squared, Gamma, Beta.
  
  statistics probability data science
Visit annotations in context

Tags

probability

data science

statistics

Annotators

daveh70

URL

blog.cloudera.com/blog/2015/12/common-probability-distributions-the-data-scientists-crib-sheet/
Jan 2016
courses.csail.mit.edu courses.csail.mit.edu

50YearsDataScience.pdf

1
1. daveh70 31 Jan 2016
 
 in Public
 
 50 Years of Data Science, David Donoho 2015, 41 pages
 
 This paper reviews some ingredients of the current "Data Science moment", including recent commentary about data science in the popular media, and about how/whether Data Science is really different from Statistics.
 
 The now-contemplated field of Data Science amounts to a superset of the fields of statistics and machine learning which adds some technology for 'scaling up' to 'big data'.
 
 data science data analysis statistics science big data
Visit annotations in context

Tags

data analysis

data science

science

statistics

big data

Annotators

daveh70

URL

courses.csail.mit.edu/18.337/2015/docs/50YearsDataScience.pdf
blogs.scientificamerican.com blogs.scientificamerican.com

Bayes's Theorem: What's the Big Deal?

3
1. mshook 28 Jan 2016
  
  in Public
  
  P(B|E) = P(B) X P(E|B) / P(E), with P standing for probability, B for belief and E for evidence. P(B) is the probability that B is true, and P(E) is the probability that E is true. P(B|E) means the probability of B if E is true, and P(E|B) is the probability of E if B is true.
  
  bayes bayesian math stats statistics january 2016 critique probability
2. mshook 28 Jan 2016
  
  in Public
  
  The probability that a belief is true given new evidence equals the probability that the belief is true regardless of that evidence times the probability that the evidence is true given that the belief is true divided by the probability that the evidence is true regardless of whether the belief is true. Got that?
  
  bayes bayesian stats statistics belief math kalman critique january 2016
3. mshook 28 Jan 2016
  
  in Public
  
  Initial belief plus new evidence = new and improved belief.
  
  bayes bayesian stats statistics critique kalman january 2016
Visit annotations in context

Tags

probability

stats

kalman

2016

january

bayes

critique

statistics

belief

math

bayesian

Annotators

mshook

URL

blogs.scientificamerican.com/cross-check/bayes-s-theorem-what-s-the-big-deal/
www.cdc.gov www.cdc.gov

SGR Report 2004 - Health Consequences of Smoking - Chapter 1

1
1. filip.bartek 10 Jan 2016
  
  in Public
  
  This criterion is not based on any specific shape of the dose-response relationship.
  
  I would expect that the relationship must be monotonic to support the causal hypothesis.
  
  statistics biologic gradient
Visit annotations in context

Tags

statistics

biologic gradient

Annotators

filip.bartek

URL

cdc.gov/tobacco/data_statistics/sgr/2004/pdfs/chapter1.pdf
phys.org phys.org

Why too much evidence can be a bad thing

1
1. daveh70 10 Jan 2016
  
  in Public
  
  paradox of unanimity - Unanimous or nearly unanimous agreement doesn't always indicate the correct answer. If agreement is unlikely, it indicates a problem with the system.
  
  Witnesses who only saw a suspect for a moment are not likely to be able to pick them out of a lineup accurately. If several witnesses all pick the same suspect, you should be suspicious that bias is at work. Perhaps these witnesses were cherry-picked, or they were somehow encouraged to choose a particular suspect.
  
  science statistics data analysis probability
Visit annotations in context

Tags

probability

data analysis

science

statistics

Annotators

daveh70

URL

phys.org/news/2016-01-evidence-bad.html
rpy2.readthedocs.org rpy2.readthedocs.org

Documentation for rpy2 — rpy2 2.7.6 documentation

1
1. daveh70 08 Jan 2016
 
 in Public
 
 Python interface to the R programming language. Use R functions and packages from Python. https://pypi.python.org/pypi/rpy2
 
 statistics data analysis data visualization machine learning
Visit annotations in context

Tags

data analysis

statistics

machine learning

data visualization

Annotators

daveh70

URL

rpy2.readthedocs.org/en/version_2.7.x/
Oct 2015
cms.whittier.edu cms.whittier.edu

Coates The Case for Reparations.pdf

1
1. lovingjoy 14 Oct 2015
  
  in Public
  
  In 1930 its population was 112,000. Today it is 36,000. The halcyon talk of “interracial living” is dead. The neighborhood is 92 percent black. Its homicide rate is 45 per 100,000—triple the rate of the city as a whole. The infant-mortality rate is 14 per 1,000—more than twice the national average.
  
  These are some intense statistics.. It'd be interesting to compare them to other cities in the area..
  
  social statistics ANTH330F15 poverty
Visit annotations in context

Tags

ANTH330F15

social statistics

poverty

Annotators

lovingjoy

URL

cms.whittier.edu/pluginfile.php/335790/mod_resource/content/0/Coates The Case for Reparations.pdf
Aug 2015
www.vox.com www.vox.com

Tech nerds are smart. But they can't seem to get their heads around politics.

1
1. tilgovi 28 Aug 2015
  
  in Public
  
  What's really being measured is heterogeneity of opinion, not centrism.
  
  statistics polling politics in America politics
Visit annotations in context

Tags

polling

statistics

politics in America

politics

Annotators

tilgovi

URL

vox.com/2015/8/27/9214015/tech-nerds-politics
Jul 2015
www.eblida.org www.eblida.org

Public Libraries - Statistics - European Bureau of Library Information and Documentation Associations (EBLIDA)

1
1. mavery 22 Jul 2015
  
  in Public
  
  European Bureau of Libraries in Europe Public libraries- statistics
  
  EBLIDA Public libraries statistics Europe ty: web resource
Visit annotations in context

Tags

EBLIDA

Europe

ty: web resource

Public libraries

statistics

Annotators

mavery

URL

eblida.org/activities/kic/public-libraries-statistics.html
localhost:8080 localhost:8080

Consilience - Document Viewer

1
1. ekraffmiller 20 Jul 2015
  
  in Public
  
  analyses
  
  another, with tag
  
  statistics
Visit annotations in context

Tags

statistics

Annotators

ekraffmiller

URL

localhost:8080/text/DocumentViewIFrame/5589b1313004fb6be70e469e
Feb 2015
en.wikipedia.org en.wikipedia.org

Variance - Wikipedia, the free encyclopedia

2
1. mehu 16 Feb 2015
  
  in Public
  
  The use of the term n − 1 is called Bessel's correction, and it is also used in sample covariance and the sample standard deviation (the square root of variance)
  
  Why in $\sigma^2$ is not equal to $s^2$
  
  statistics
2. mehu 16 Feb 2015
  
  in Public
  
  Sample variance can also be applied to the estimation of the variance of a continuous distribution from a sample of that distribution.
  
  statistics
Visit annotations in context

Tags

statistics

Annotators

mehu

URL

en.wikipedia.org/wiki/Variance
www.emathzone.com www.emathzone.com

Coefficient of Standard Deviation and Variation

1
1. mehu 15 Feb 2015
  
  in Public
  
  Suppose the value of for wages is 10% and the values of for kilograms of meat is 25%. This means that the wages of workers are consistent. Their wages are close to the overall average of their wages. But the families consume meat in quite different quantities. Some families use very small quantities of meat and some others use large quantities of meat. We say that there is greater variation in their consumption of meat. The observations about the quantity of meat are more dispersed or more variant.
  
  Interpretation of Relative Deviation Coefficient
  
  statistics
Visit annotations in context

Tags

statistics

Annotators

mehu

URL

emathzone.com/tutorials/basic-statistics/coefficient-of-standard-deviation-and-variation.html
Nov 2013
cran.r-project.org cran.r-project.org

Untitled document

1
1. aculich 03 Nov 2013
 
 in Public
 
 n its space-time representation (Ogata, 1998), the ETASmodel is a temporal marked point process model, and a special case of marked Hawks process, withconditional intensity function(t;x;yjHt) =(x;y) +Xti<tk(mi)g(tti)f(xxi;yyijmi)
 
 Testing out PDF annotation that also include LaTeX rendered formulas.
 
 ETAS statistics stat157 formula LaTeX
Visit annotations in context

Tags

stat157

formula

LaTeX

statistics

ETAS

Annotators

aculich

URL

cran.r-project.org/web/packages/ETAS/ETAS.pdf
www.plosone.org www.plosone.org

Untitled document

1
1. rdhyee 01 Nov 2013
  
  in Public
  
  two cadavers
  
  why two? For comparative purposes? Limited sample size?
  
  statistics
Visit annotations in context

Tags

statistics

Annotators

rdhyee

URL

plosone.org/article/info:doi/10.1371/journal.pone.0077733
Sep 2013
rhetoric.eserver.org rhetoric.eserver.org

Book I - Chapter 1 : Aristotle's Rhetoric

1
1. alex_hinerman 30 Sep 2013
  
  in Public
  
  Hence the man who makes a good guess at truth is likely to make a good guess at probabilities
  
  At first, I didn't like this quote, then I thought back to good ol' Oakley's stats class. We make scientific theories based on what idea is most likely to happen (we reject/do not reject the null hypothesis, but we do not say we accept the null hypothesis). Science: putting me in my place since I had a place to be put.
  
  3860 science statistics JokeyJokeMaker
Visit annotations in context

Tags

3860

science

statistics

JokeyJokeMaker

Annotators

alex_hinerman

URL

rhetoric.eserver.org/aristotle/rhet1-1.html

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL