Hypothesis

8 Matching Annotations

Jun 2024
nih-r25-modelersandstorytellers.github.io nih-r25-modelersandstorytellers.github.io

Data Science and R

4
1. DrHuaZhou 24 Jun 2024
  
  in Public
  
  following steps
  
  I can demo these steps if needed.
2. DrHuaZhou 24 Jun 2024
  
  in Public
  
  many
  
  Again don't worry about syntaxes. Focus on concepts of data wrangling, which are universal among many languages (SQL, Python, Julia).
3. DrHuaZhou 24 Jun 2024
  
  in Public
  
  Tidyverse
  
  Tidyverse is not the only choice. data.table package is a popular framework for data wrangling as well.
4. DrHuaZhou 24 Jun 2024
  
  in Public
  
  the life cycle of a data science project
  
  Don't be overwhelmed by syntax. GenAI tools such as GitHub Copilot and ChatGPT alleviate lots of programming details. More important to grasp the tasks and workflow.
Visit annotations in context

Annotators

DrHuaZhou

URL

nih-r25-modelersandstorytellers.github.io/2024/data-science-tutorials/01-dsintro/dsintro.html
nih-r25-modelersandstorytellers.github.io nih-r25-modelersandstorytellers.github.io

Policy Evaluation by Double Machine Learning

1
1. DrHuaZhou 24 Jun 2024
  
  in Public
  
  Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W., and Robins, J. (2018). Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1), C1-C68.
  
  Thousands of citations already. Called "the monster" in big tech. Save billions of $$$ at Amazon by applying DML to online experimentation such as A/B testing.
Visit annotations in context

Annotators

DrHuaZhou

URL

nih-r25-modelersandstorytellers.github.io/2024/data-science-tutorials/05-dml/dml.html
nih-r25-modelersandstorytellers.github.io nih-r25-modelersandstorytellers.github.io

Predictive Modeling - Tree-Based Models

1
1. DrHuaZhou 24 Jun 2024
  
  in Public
  
  CART
  
  Tree-based methods such as random forest and boosting have been one of the most successful out-of-box machine learning methods for structured/tabular data.
Visit annotations in context

Annotators

DrHuaZhou

URL

nih-r25-modelersandstorytellers.github.io/2024/data-science-tutorials/04-rf/rf.html
nih-r25-modelersandstorytellers.github.io nih-r25-modelersandstorytellers.github.io

Current Population Survey Food Security Supplement - Ingest, Wrangle, Visualize

2
1. DrHuaZhou 24 Jun 2024
  
  in Public
  
  tidycensus
  
  Last year's R25 program had many examples of using tidycensus to explore the Census and ACS data.
2. DrHuaZhou 24 Jun 2024
  
  in Public
  
  as_tibble() |>
  
  Optional.
Visit annotations in context

Annotators

DrHuaZhou

URL

nih-r25-modelersandstorytellers.github.io/2024/data-science-tutorials/02-wrangle/wrangle.html

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL