Hypothesis

246 Matching Annotations

Apr 2020
psyarxiv.com psyarxiv.com

Use of Internet data to track Chinese behavior and interest in COVID-19

1
1. edampf 27 Apr 2020
  
  in BehSci
  
  Wang, T., Chen, X., Zhang, Q., & Jin, X. (2020, April 26). Use of Internet data to track Chinese behavior and interest in COVID-19. https://doi.org/10.31234/osf.io/j6m8q
  
  is:preprint COVID-19 lang:en China Baidu index internet data public health prevention information real-time trend analysis search geographic information demographics Wuhan tracking online behavior concern decision making education policy behavioral science
Visit annotations in context

Tags

behavioral science

online behavior

concern

is:preprint

demographics

policy

real-time

public health

Wuhan

decision making

internet

China

COVID-19

Baidu index

education

lang:en

data

search

prevention

geographic information

trend analysis

information

tracking

Annotators

edampf

URL

psyarxiv.com/j6m8q/
sciencebusiness.net sciencebusiness.net

University of Amsterdam scientists launch website that seeks ideal COVID-19 exit strategy

1
1. edampf 24 Apr 2020
  
  in BehSci
  
  University of Amsterdam scientists launch website that seeks ideal COVID-19 exit strategy. (2020 April 21) Science|Business. https://sciencebusiness.net/network-updates/university-amsterdam-scientists-launch-website-seeks-ideal-covid-19-exit-strategy
  
  is:webpage press release COVID-19 lang:en University of Amsterdam exit strategy data science psychology economics programming resources project epidemiology behavior intervention the Netherlands collaboration crowdsourcing strategy policy science
Visit annotations in context

Tags

economics

is:webpage

programming

behavior

collaboration

science

exit strategy

resources

policy

University of Amsterdam

project

press release

data science

strategy

epidemiology

COVID-19

lang:en

intervention

the Netherlands

crowdsourcing

psychology

Annotators

edampf

URL

sciencebusiness.net/network-updates/university-amsterdam-scientists-launch-website-seeks-ideal-covid-19-exit-strategy
arxiv.org arxiv.org

Survey Data and Human Computation for Improved Flu Tracking

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Wojcik, S., et al. (2020 March 30). Survey data and human computation for improved flu tracking. Cornell University. arXiv:2003.13822
  
  is:preprint lang:en citation influenza tracking behavioral science data computation influenza
Visit annotations in context

Tags

behavioral science

lang:en

computation

data

influenza

is:preprint

citation

tracking

Annotators

edampf

URL

arxiv.org/abs/2003.13822
www.medrxiv.org www.medrxiv.org

Contacts in context: large-scale setting-specific social mixing matrices from the BBC Pandemic project

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Klepac, P., Kucharski, A. J., Conlan, A. J., Kissler, S., Tang, M., Fry, H., & Gog, J. R. (2020). Contacts in context: Large-scale setting-specific social mixing matrices from the BBC Pandemic project [Preprint]. Epidemiology. https://doi.org/10.1101/2020.02.16.20023754
  
  lang:en is:article social mixing contact transmission infection infectious disease survey education school closure social distancing working from home data BBC Pandemic science public health intervention UK
Visit annotations in context

Tags

social distancing

school closure

science

contact

UK

working from home

public health

is:article

BBC Pandemic

social mixing

education

lang:en

data

intervention

infection

infectious disease

survey

transmission

Annotators

edampf

URL

medrxiv.org/content/10.1101/2020.02.16.20023754v2
www.cell.com www.cell.com

COVID-19 Is a Data Science Issue

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Callaghan, S. (2020). COVID-19 Is a Data Science Issue. Patterns, 100022. https://doi.org/10.1016/j.patter.2020.100022
  
  is:article COVID-19 lang:en data science public health health system healthcare mobility data collection data interpretation modeling prediction data visualization
Visit annotations in context

Tags

data visualization

data collection

data interpretation

lang:en

modeling

healthcare

mobility

public health

prediction

data science

is:article

health system

COVID-19

Annotators

edampf

URL

cell.com/patterns/fulltext/S2666-3899(20)30022-2
www.cell.com www.cell.com

A Primer on Biodefense Data Science for Pandemic Preparedness

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Perakslis, E. (2020). A Primer on Biodefense Data Science for Pandemic Preparedness. Patterns, 1(1), 100018. https://doi.org/10.1016/j.patter.2020.100018
  
  is:article lang:en COVID-19 opinion science biodefense data science policy USA biosafety prevention resilience containment response minimizing recovery
Visit annotations in context

Tags

science

policy

USA

biosafety

minimizing

is:article

data science

COVID-19

lang:en

opinion

containment

prevention

response

biodefense

resilience

recovery

Annotators

edampf

URL

cell.com/patterns/fulltext/S2666-3899(20)30018-0
www.thelancet.com www.thelancet.com

Multidisciplinary research priorities for the COVID-19 pandemic: a call for action for mental health science

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Holmes, E. A., O’Connor, R. C., Perry, V. H., Tracey, I., Wessely, S., Arseneault, L., Ballard, C., Christensen, H., Cohen Silver, R., Everall, I., Ford, T., John, A., Kabir, T., King, K., Madan, I., Michie, S., Przybylski, A. K., Shafran, R., Sweeney, A., … Bullmore, E. (2020). Multidisciplinary research priorities for the COVID-19 pandemic: A call for action for mental health science. The Lancet Psychiatry, S2215036620301681. https://doi.org/10.1016/S2215-0366(20)30168-1
  
  is:article COVID-19 lang:en research mental health multidisciplinary physical health psychology effects priority UK data collection collaboration vulnerable groups cognitive science social media consumption emotional health strategy resources
Visit annotations in context

Tags

data collection

cognitive science

physical health

collaboration

vulnerable groups

resources

UK

social media

is:article

strategy

consumption

priority

effects

COVID-19

lang:en

research

emotional health

multidisciplinary

psychology

mental health

Annotators

edampf

URL

thelancet.com/pdfs/journals/lanpsy/PIIS2215-0366(20)30168-1.pdf
twitter.com twitter.com

ReconfigBehSci en Twitter: "“proper science without the drag” – Move to the medical model of journal review: “Yes/No” decision. We suggest the temporary adoption of this model for crisis-relevant material by journals. [happening already, but potentially even better models: @Meta_psy and @F1000Research?]" / Twitter

1
1. Marlene_Wulf 23 Apr 2020
  
  in BehSci
  
  ReconfigBehSci en Twitter: “‘Proper science without the drag’ – Move to the medical model of journal review: ‘Yes/No’ decision. We suggest the temporary adoption of this model for crisis-relevant material by journals. [happening already, but potentially even better models: @Meta_psy and @F1000Research?]” / Twitter. (n.d.). Twitter. Retrieved April 15, 2020, from https://twitter.com/scibeh/status/1242094075312046082
  
  is:twitter lang:en COVID-19 policy behavioral science funding data code modeling commonality database think tank
Visit annotations in context

Tags

behavioral science

database

lang:en

modeling

data

think tank

is:twitter

policy

code

funding

commonality

COVID-19

Annotators

Marlene_Wulf

URL

twitter.com/scibeh/status/1242094075312046082
sciencebusiness.net sciencebusiness.net

Viewpoint: COVID-19, open science, and a ‘red alert’ health indicator

1
1. Marlene_Wulf 23 Apr 2020
  
  in BehSci
  
  Viewpoint: COVID-19, open science, and a ‘red alert’ health indicator. (n.d.). Science|Business. Retrieved April 17, 2020, from https://sciencebusiness.net/viewpoint/viewpoint-covid-19-open-science-and-red-alert-health-indicator
  
  is:webpage lang:en COVID-19 science policy health disaster collaboration government cooperation data open science leadership viewpoint
Visit annotations in context

Tags

open science

lang:en

is:webpage

data

science

collaboration

leadership

cooperation

policy

government

viewpoint

health

COVID-19

disaster

Annotators

Marlene_Wulf

URL

sciencebusiness.net/viewpoint/viewpoint-covid-19-open-science-and-red-alert-health-indicator
www.reddit.com www.reddit.com

r/BehSciResearch - Behavioural science research for guiding societies out of lockdown

1
1. Marlene_Wulf 23 Apr 2020
  
  in BehSci
  
  r/BehSciResearch—Behavioural science research for guiding societies out of lockdown. (n.d.). Reddit. Retrieved April 20, 2020, from https://www.reddit.com/r/BehSciResearch/comments/g2bm09/behavioural_science_research_for_guiding/
  
  is:blog lang:en COVID-19 behavioral science research lockdown gradual exit policy society data reporting system tracing testing medical protective equipment
Visit annotations in context

Tags

behavioral science

policy

equipment

system

testing

society

COVID-19

medical

reporting

lockdown

lang:en

gradual

data

research

protective

tracing

exit

is:blog

Annotators

Marlene_Wulf

URL

reddit.com/r/BehSciResearch/comments/g2bm09/behavioural_science_research_for_guiding/
trello.com trello.com

Collective Intelligence and COVID-19 | Trello

1
1. Marlene_Wulf 23 Apr 2020
  
  in BehSci
  
  Collective Intelligence and COVID-19 | Trello. (n.d.). Retrieved April 20, 2020, from https://trello.com/b/STdgEhvX/collective-intelligence-and-covid-19
  
  is:webpage lang:en COVID-19 collective intelligence modeling crowdprediction dataset data analysis science project collaboration mapping crowdsourcing symptom self-assessment contact tracing community network hackathon repository
Visit annotations in context

Tags

modeling

is:webpage

science

collaboration

contact

community

project

self-assessment

dataset

COVID-19

symptom

intelligence

lang:en

data

crowdprediction

repository

mapping

collective

analysis

crowdsourcing

tracing

hackathon

network

Annotators

Marlene_Wulf

URL

trello.com/b/STdgEhvX/collective-intelligence-and-covid-19
en.wikipedia.org en.wikipedia.org

Data visualization - Wikipedia

1
1. TylerRick 17 Apr 2020
  
  in Public
  
  Data visualization is both an art and a science
  
  art science intersection data visualization
Visit annotations in context

Tags

science

intersection

art

data visualization

Annotators

TylerRick

URL

en.wikipedia.org/wiki/Data_visualization
Jan 2020
www.nwo.nl www.nwo.nl

NWO to update its data management protocol in January 2020

1
1. mlenc 06 Jan 2020
  
  in Public
  
  dmp data management plan open science academic policy
Visit annotations in context

Tags

academic policy

open science

dmp

data management plan

Annotators

mlenc

URL

nwo.nl/en/news-and-events/news/2019/12/nwo-to-update-its-data-management-protocol-in-january-2020.html
Nov 2019
rstudio-pubs-static.s3.amazonaws.com rstudio-pubs-static.s3.amazonaws.com

Manipulating Time Series Data in R with xts & zoo

1
1. udaybhaskar 30 Nov 2019
  
  in Public
  
  1. Introduction to eXtensible Time Series, using xts and zoo for time series Introducing xts and
  
  question?
  
  data analysis data science
Visit annotations in context

Tags

data analysis

data science

Annotators

udaybhaskar

URL

rstudio-pubs-static.s3.amazonaws.com/288218_117e183e74964557a5da4fc5902fc671.html
Jun 2019
Local file Local file

Practical Data Science with R, Second Edition MEAP V05

1
1. intelligence.refinery 23 Jun 2019
  
  in Public
  
  Success ina data science project comes not from access to any one exotic tool, but from having quantifiablegoals, good methodology, crossdiscipline interactions, and a repeatable workflow.
  
  Data science
Tags

Data science

Annotators

intelligence.refinery
Apr 2019
www.go-fair.org www.go-fair.org

Discovery - GO FAIR

1
1. mlenc 18 Apr 2019
  
  in Public
  
  open science fair data strategy academic strategies research outputs
Visit annotations in context

Tags

research outputs

open science

data strategy

academic strategies

fair

Annotators

mlenc

URL

go-fair.org/implementation-networks/overview/discovery/
blog.socialcops.com blog.socialcops.com

Technology Archives - SocialCops

1
1. d3vr 13 Apr 2019
  
  in Public
  
  Interesting data science / development / technology blog from an Indian Start up
  
  programming data science development
Visit annotations in context

Tags

programming

data science

development

Annotators

d3vr

URL

blog.socialcops.com/category/technology/
Dec 2018
bid.berkeley.edu bid.berkeley.edu

Main Page - CS 294-1 Spring 2012

1
1. ildar 03 Dec 2018
  
  in Public
  
  data science @course
Visit annotations in context

Tags

@course

data science

Annotators

ildar

URL

bid.berkeley.edu/cs294-1-spring12/index.php/Main_Page
bcourses.berkeley.edu bcourses.berkeley.edu

Introduction to Data Science Fall 2015

1
1. ildar 03 Dec 2018
  
  in Public
  
  data science @course
Visit annotations in context

Tags

@course

data science

Annotators

ildar

URL

bcourses.berkeley.edu/courses/1377158/
Nov 2018
multithreaded.stitchfix.com multithreaded.stitchfix.com

Engineers Shouldn’t Write ETL: A Guide to Building a High Functioning Data Science Department | Stitch Fix Technology – Multithreaded

1
1. IanMulvany 08 Nov 2018
  
  in Public
  
  Unless you need to push the boundaries of what these technologies are capable of, you probably don’t need a highly specialized team of dedicated engineers to build solutions on top of them. If you manage to hire them, they will be bored. If they are bored, they will leave you for Google, Facebook, LinkedIn, Twitter, … – places where their expertise is actually needed. If they are not bored, chances are they are pretty mediocre. Mediocre engineers really excel at building enormously over complicated, awful-to-work-with messes they call “solutions”. Messes tend to necessitate specialization.
  
  data-science data-enginerring engineering
Visit annotations in context

Tags

data-science

data-enginerring

engineering

Annotators

IanMulvany

URL

multithreaded.stitchfix.com/blog/2016/03/16/engineers-shouldnt-write-etl/
Oct 2018
www.springboard.com www.springboard.com

Data Science Career Paths: Different Roles in the Industry - Springboard Blog

1
1. tgrrr 19 Oct 2018
  
  in Public
  
  tl;dr: data engineer = software, coding, cleaning data sets data architects = structure the technology to manage data models and database admin data scientist = stats + math models business analysts = communication and domain expertise
  
  data science data engineer data architects
Visit annotations in context

Tags

data engineer

data architects

data science

Annotators

tgrrr

URL

springboard.com/blog/data-science-career-paths-different-roles-industry/
May 2018
www.audit.vic.gov.au www.audit.vic.gov.au

Improving Victoria’s Air Quality

1
1. equivalentideas 21 May 2018
  
  in Public
  
  Negative values included when assessing air quality In computing average pollutant concentrations, EPA includes recorded values that are below zero. EPA advised that this is consistent with NEPM AAQ procedures. Logically, however, the lowest possible value for air pollutant concentrations is zero. Either it is present, even if in very small amounts, or it is not. Negative values are an artefact of the measurement and recording process. Leaving negative values in the data introduces a negative bias, which potentially under represents actual concentrations of pollutants. We noted a considerable number of negative values recorded. For example, in 2016, negative values comprised 5.3 per cent of recorded hourly PM2.5 values, and 1.3 per cent of hourly PM10 values. When we excluded negative values from the calculation of one‐day averages, there were five more exceedance days for PM2.5 and one more for PM10 during 2016.
  
  air quality monitoring citizen science data validation westconnex research
Visit annotations in context

Tags

citizen science

westconnex research

air quality monitoring

data validation

Annotators

equivalentideas

URL

audit.vic.gov.au/sites/default/files/2018-03/20180308-Improving-Air-Quality.pdf
Sep 2017
blog.dmptool.org blog.dmptool.org

NSF EAGER Grant for Actionable DMPs

1
1. IanMulvany 27 Sep 2017
  
  in Public
  
  We’re delighted to announce that the California Digital Library has been awarded a 2-year NSF EAGER grant to support active, machine-actionable data management plans (DMPs).
  
  open science research data data
Visit annotations in context

Tags

open science

research data

data

Annotators

IanMulvany

URL

blog.dmptool.org/2017/09/18/nsf-eager-grant-for-making-dmps-actionable/
Mar 2017
cs231n.github.io cs231n.github.io

CS231n Convolutional Neural Networks for Visual Recognition

1
1. ksagou 05 Mar 2017
  
  in Public
  
  Great course
  
  CNN Neural nets Data Science
Visit annotations in context

Tags

CNN

Neural nets

Data Science

Annotators

ksagou

URL

cs231n.github.io/
Feb 2017
wiki.dbpedia.org wiki.dbpedia.org

DBpedia

1
1. JanosHaits 28 Feb 2017
  
  in Public
  
  DBpedia data database Semantic web computer science
Visit annotations in context

Tags

Semantic web

database

data

computer science

DBpedia

Annotators

JanosHaits

URL

wiki.dbpedia.org/
semanticweb.org semanticweb.org

semanticweb.org.edu

1
1. JanosHaits 28 Feb 2017
  
  in Public
  
  semantic SemWeb Web3.0 data computer science
Visit annotations in context

Tags

Web3.0

semantic

SemWeb

data

computer science

Annotators

JanosHaits

URL

semanticweb.org/wiki/Main_Page.html
demo.dbpedia-spotlight.org demo.dbpedia-spotlight.org

DBpedia Spotlight

1
1. JanosHaits 28 Feb 2017
  
  in Public
  
  DBpedia Open data data SemWeb Semantic web Web3.0 computer science demo
Visit annotations in context

Tags

demo

data

computer science

Semantic web

Web3.0

SemWeb

Open data

DBpedia

Annotators

JanosHaits

URL

demo.dbpedia-spotlight.org/
lod-cloud.net lod-cloud.net

The Linking Open Data cloud diagram

1
1. JanosHaits 28 Feb 2017
  
  in Public
  
  Semantic web data Open data computer science
Visit annotations in context

Tags

Semantic web

computer science

Open data

data

Annotators

JanosHaits

URL

lod-cloud.net/
cognonto.com cognonto.com

Cognonto - Knowledge Graph

1
1. JanosHaits 28 Feb 2017
  
  in Public
  
  SemWeb Semantic web Web3.0 computer science data Knowledge Graph knowledge
Visit annotations in context

Tags

Semantic web

Web3.0

Knowledge Graph

SemWeb

data

computer science

knowledge

Annotators

JanosHaits

URL

cognonto.com/knowledge-graph/
en.lodlive.it en.lodlive.it

LodLive - browsing the Web of Data

1
1. JanosHaits 26 Feb 2017
  
  in Public
  
  SemWeb semantic Web3.0 computer science RDF Linked Data data
Visit annotations in context

Tags

Web3.0

Linked Data

RDF

semantic

data

SemWeb

computer science

Annotators

JanosHaits

URL

en.lodlive.it/
www.nytimes.com www.nytimes.com

In Age of Trump, Scientists Show Signs of a Political Pulse

2
1. heatherstaines 07 Feb 2017
  
  in Public
  
  After a brief training session, participants spent six hours archiving environmental data from government websites, including those of the National Oceanic and Atmospheric Administration and the Interior Department.
  
  A worthwhile effort.
  
  science data
2. heatherstaines 07 Feb 2017
  
  in Public
  
  An anonymous donor has provided storage on Amazon servers, and the information can be searched from a website at the University of Pennsylvania called Data Refuge. Though the Federal Records Act theoretically protects government data from deletion, scientists who rely on it say would rather be safe than sorry.
  
  Data refuge.
  
  data science
Visit annotations in context

Tags

science

data

Annotators

heatherstaines

URL

nytimes.com/2017/02/06/science/donald-trump-scientists-politics.html
Oct 2016
m.pnas.org m.pnas.org

PNAS | Mobile

1
1. awakenting 06 Oct 2016
  
  in Public
  
  (courses.csail.mit.edu/18.337/2015/docs/50YearsDataScience.pdf)
  
  nice reference !
  
  data science
Visit annotations in context

Tags

data science

Annotators

awakenting

URL

m.pnas.org/content/113/34/9384.long
Sep 2016
www.sr.ithaka.org www.sr.ithaka.org

Untitled document

1
1. Enkerli 08 Sep 2016
  
  in Public
  
  Activities such as time spent on task and discussion board interactions are at the forefront of research.
  
  Really? These aren’t uncontroversial, to say the least. For instance, discussion board interactions often call for careful, mixed-method work with an eye to preventing instructor effect and confirmation bias. “Time on task” is almost a codeword for distinctions between models of learning. Research in cognitive science gives very nuanced value to “time spent on task” while the Malcolm Gladwells of the world usurp some research results. A major insight behind Competency-Based Education is that it can allow for some variance in terms of “time on task”. So it’s kind of surprising that this summary puts those two things to the fore.
  
  Learning Analytics learner data measurability Time on task #CompetencyBasedEducation Cognitive Science Malcolm Gladwell Discourse Analysis #ConfirmationBias Instructor Effect
Visit annotations in context

Tags

learner data

Cognitive Science

Discourse Analysis

Malcolm Gladwell

#CompetencyBasedEducation

measurability

Learning Analytics

#ConfirmationBias

Time on task

Instructor Effect

Annotators

Enkerli

URL

sr.ithaka.org/publications/student-data-in-the-digital-era/
Jul 2016
books.google.ca books.google.ca

The Data Revolution

1
1. daniel.odonnell 27 Jul 2016
  
  in Public
  
  p. 141
  
  Initially, the digital humanities consisted of the curation and analysis of data that were born digital, and the digitisation and archiving projects that sought to render analogue texts and material objects into digital forms that could be organised and searched and be subjects to basic forms of overarching, automated or guided analysis, such as summary visualisations of content or connections between documents, people or places. Subsequently, its advocates have argued that the field has evolved to provide more sophisticated tools for handling, searching, linking, sharing and analysing data that seek to complement and augment existing humanities methods, and facilitate traditional forms of interpretation and theory building, rather than replacing traditional methods or providing an empiricist or positivistic approach to humanities scholarship.
  
  summary of history of digital humanities
  
  Kitchin 2014 data humanities data digital humanities history history of science history of ideas
Visit annotations in context

Tags

Kitchin 2014

digital humanities

history of ideas

history

humanities data

data

history of science

Annotators

daniel.odonnell

URL

books.google.ca/books/about/The_Data_Revolution.html
Apr 2016
mitpress.mit.edu mitpress.mit.edu

Great Principles of Computing

1
1. daveh70 23 Apr 2016
  
  in Public
  
  Great Principles of Computing<br> Peter J. Denning, Craig H. Martell
  
  This is a book about the whole of computing—its algorithms, architectures, and designs.
  
  Denning and Martell divide the great principles of computing into six categories: communication, computation, coordination, recollection, evaluation, and design.
  
  "Programmers have the largest impact when they are designers; otherwise, they are just coders for someone else's design."
  
  computer science technology programming data science electronics
Visit annotations in context

Tags

technology

data science

electronics

programming

computer science

Annotators

daveh70

URL

mitpress.mit.edu/books/great-principles-computing
Mar 2016
amstat.tandfonline.com amstat.tandfonline.com

The ASA's statement on p-values: context, process, and purpose

1
1. daveh70 07 Mar 2016
  
  in Public
  
  American Statistical Association statement on p-values
  
  statistics data analysis science
Visit annotations in context

Tags

statistics

data analysis

science

Annotators

daveh70

URL

amstat.tandfonline.com/doi/abs/10.1080/00031305.2016.1154108
Feb 2016
leanpub.com leanpub.com

Roger D. Peng

1
1. daveh70 02 Feb 2016
  
  in Public
  
  Books on data science and R programming by Roger D. Peng of Johns Hopkins.
  
  statistics data science data analysis data visualization
Visit annotations in context

Tags

statistics

data analysis

data visualization

data science

Annotators

daveh70

URL

leanpub.com/u/rdpeng
blog.cloudera.com blog.cloudera.com

Common Probability Distributions: The Data Scientist's Crib Sheet - Cloudera Engineering Blog

1
1. daveh70 01 Feb 2016
  
  in Public
  
  Great explanation of 15 common probability distributions: Bernouli, Uniform, Binomial, Geometric, Negative Binomial, Exponential, Weibull, Hypergeometric, Poisson, Normal, Log Normal, Student's t, Chi-Squared, Gamma, Beta.
  
  statistics probability data science
Visit annotations in context

Tags

statistics

probability

data science

Annotators

daveh70

URL

blog.cloudera.com/blog/2015/12/common-probability-distributions-the-data-scientists-crib-sheet/
f1000research.com f1000research.com

Software Carpentry: lessons learned

1
1. daveh70 01 Feb 2016
  
  in Public
  
  Since its start in 1998, Software Carpentry has evolved from a week-long training course at the US national laboratories into a worldwide volunteer effort to improve researchers' computing skills. This paper explains what we have learned along the way, the challenges we now face, and our plans for the future.
  
  http://software-carpentry.org/lessons/<br> Basic programming skills for scientific researchers.<br> SQL, and Python, R, or MATLAB.
  
  http://www.datacarpentry.org/lessons/<br> Managing and analyzing data.
  
  programming science data analysis
Visit annotations in context

Tags

programming

data analysis

science

Annotators

daveh70

URL

f1000research.com/articles/3-62/v2
Jan 2016
courses.csail.mit.edu courses.csail.mit.edu

50YearsDataScience.pdf

1
1. daveh70 31 Jan 2016
  
  in Public
  
  50 Years of Data Science, David Donoho<br> 2015, 41 pages
  
  This paper reviews some ingredients of the current "Data Science moment", including recent commentary about data science in the popular media, and about how/whether Data Science is really different from Statistics.
  
  The now-contemplated field of Data Science amounts to a superset of the fields of statistics and machine learning which adds some technology for 'scaling up' to 'big data'.
  
  data science data analysis statistics science big data
Visit annotations in context

Tags

data analysis

data science

statistics

big data

science

Annotators

daveh70

URL

courses.csail.mit.edu/18.337/2015/docs/50YearsDataScience.pdf
quoracast.quora.com quoracast.quora.com

Dima Korolev: Engineering, Entrepreneurship, an... - The Quoracast - Quora

1
1. johngravesdm 12 Jan 2016
  
  in Public
  
  "A friend of mine said a really great phrase: 'remember those times in early 1990's when every single brick-and-mortar store wanted a webmaster and a small website. Now they want to have a data scientist.' It's good for an industry when an attitude precedes the technology."
  
  data science
Visit annotations in context

Tags

data science

Annotators

johngravesdm

URL

quoracast.quora.com/Dima-Korolev-Engineering-Entrepreneurship-and-Big-Data
phys.org phys.org

Why too much evidence can be a bad thing

1
1. daveh70 10 Jan 2016
  
  in Public
  
  paradox of unanimity - Unanimous or nearly unanimous agreement doesn't always indicate the correct answer. If agreement is unlikely, it indicates a problem with the system.
  
  Witnesses who only saw a suspect for a moment are not likely to be able to pick them out of a lineup accurately. If several witnesses all pick the same suspect, you should be suspicious that bias is at work. Perhaps these witnesses were cherry-picked, or they were somehow encouraged to choose a particular suspect.
  
  science statistics data analysis probability
Visit annotations in context

Tags

science

probability

data analysis

statistics

Annotators

daveh70

URL

phys.org/news/2016-01-evidence-bad.html
Dec 2015
code.facebook.com code.facebook.com

Facebook to open-source AI hardware design

1
1. daveh70 11 Dec 2015
  
  in Public
  
  Big Sur is our newest Open Rack-compatible hardware designed for AI computing at a large scale. In collaboration with partners, we've built Big Sur to incorporate eight high-performance GPUs
  
  ai artificial intelligence machine learning data science
Visit annotations in context

Tags

artificial intelligence

ai

data science

machine learning

Annotators

daveh70

URL

code.facebook.com/posts/1687861518126048/facebook-to-open-source-ai-hardware-design/
Nov 2015
www.randalolson.com www.randalolson.com

Introducing TPOT, the Data Science Assistant

1
1. daveh70 16 Nov 2015
  
  in Public
  
  TPOT is a Python tool that automatically creates and optimizes machine learning pipelines using genetic programming. Think of TPOT as your “Data Science Assistant”: TPOT will automate the most tedious part of machine learning by intelligently exploring thousands of possible pipelines, then recommending the pipelines that work best for your data.
  
  https://github.com/rhiever/tpot TPOT (Tree-based Pipeline Optimization Tool) Built on numpy, scipy, pandas, scikit-learn, and deap.
  
  machine learning artificial intelligence data science
Visit annotations in context

Tags

artificial intelligence

data science

machine learning

Annotators

daveh70

URL

randalolson.com/2015/11/15/introducing-tpot-the-data-science-assistant/
Apr 2015
dmm.biologists.org dmm.biologists.org

Shining a light on dark data

1
1. judell 03 Apr 2015
  
  in Public
  
  Wouldn’t it be useful, both to the scientific community or the wider world, to increase the publication of negative results?
  
  science dark-data
Visit annotations in context

Tags

science

dark-data

Annotators

judell

URL

dmm.biologists.org/content/2/11-12/521