Hypothesis

109 Matching Annotations

Nov 2016
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - Technical appendix

1
1. efosse 20 Nov 2016
  
  in Public
  
  In the main text, I discussed making causal claims from non-experimental data using natural experiments and matching. In this appendix, I will introduce the potential outcomes model, and define more precisely the conditions that are required for causal inference from observational data. This chapter will draw on Morgan and Winship (2014) and Imbens and Rubin (2015).
  
  My preference would be for a discussion that includes Pearl's DAGs as well as Rubin's potential outcomes framework.
  
  Edit: My take is that Rubin's framework is rooted in a 20th century Fisherian orientation (which is why it's especially popular among statisticians), while Pearl's framework in part reflects new insights on probabilistic graphical models (which is why it's popular among computer scientists). The future, I suspect, will entail both approaches.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/observing-technical/
Sep 2016
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Mass collaboration - 5.6 Conclusion

3
1. efosse 22 Sep 2016
  
  in Public
  
  mass collaboration projects also have democratizing potential
  
  This is a great point and I think one at least sociologists will be sympathetic towards. I'm thinking here of Howard Becker's "hierarchy of credibility" principle.
2. efosse 22 Sep 2016
  
  in Public
  
  but I am optimistic
  
  Perhaps include a few sentences on why you're optimistic? I'm optimistic but I (or other readers) could have different reasons.
3. efosse 22 Sep 2016
  
  in Public
  
  enables mass collaboration
  
  I initially expected a discussion of mass collaboration in terms of researchers with other researchers. The lone scholar working alone (e.g., Einstein) is replaced with a team of researchers across continents and disciplines (e.g., Large Hadron Collider). However, this kind of mass collaboration may be beyond the scope of your book.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/mass-collaboration/collab-conclusion/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Mass collaboration - 5.5.6 Final design advice

1
1. efosse 22 Sep 2016
  
  in Public
  
  5
  
  five
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/mass-collaboration/design/design-advice/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Mass collaboration - 5.4 Distributed data collection

2
1. efosse 22 Sep 2016
  
  in Public
  
  show
  
  show,
2. efosse 22 Sep 2016
  
  in Public
  
  respondent’s
  
  respondents'?
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/mass-collaboration/dist-data-collection/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Mass collaboration - 5.3 Open calls

2
1. efosse 22 Sep 2016
  
  in Public
  
  In open call projects, the researcher poses a problem, solicits solutions from other people, and then picks the best.
  
  An analogue age version of this is the "Delphi Method" developed by RAND in the 1950s. It's has problems and it's about predicting the future, but I see some similarities since often open calls today entail predicting some outcomes. The problem with the "Delphi Method" is that it (a) relies on pre-selected experts, (b) has no clear criterion for what's "best", and (c) presumes consensus-building is truth.
2. efosse 09 Sep 2016
  
  in Public
  
  open call project
  
  Italicize?
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/mass-collaboration/open-calls/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Mass collaboration - 5.5 Designing your own

1
1. efosse 22 Sep 2016
  
  in Public
  
  5 general principles
  
  Depends on the style guide, but it seems most style guides suggest writing out numbers 1 to 9. E.g., "There are five general principles..."
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/mass-collaboration/design/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Running experiments - 4.2 What are experiments?

2
1. efosse 22 Sep 2016
  
  in Public
  
  get an estimate of the causal effect
  
  To nitpick: you can still get an estimate of a causal effect without randomization (or adjustment), but it's likely to be a lousy estimate (unless particular assumptions are met).
  
  minor
2. efosse 09 Sep 2016
  
  in Public
  
  recruitment, randomization, intervention, and outcomes
  
  How about controlling?
Visit annotations in context

Tags

minor

Annotators

efosse

URL

bitbybitbook.com/en/running-experiments/what-exp/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.4 Research strategies

3
1. efosse 22 Sep 2016
  
  in Public
  
  could not lead to interesting research, but that’s not the case
  
  Consider rewording: "...could lead to uninteresting research, but that's not the case."
2. efosse 08 Sep 2016
  
  in Public
  
  nowcasting
  
  Italicize, perhaps?
3. efosse 08 Sep 2016
  
  in Public
  
  data
  
  Do you mean observational big data or observational data in general?
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/designs/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Mass collaboration - 5.3.4 Conclusion

2
1. efosse 09 Sep 2016
  
  in Public
  
  Which of these approaches would work better? We don’t know, and in the process of finding out we might learn something important about families, neighborhoods, education, and social inequality. Further, these predictions might be used to guide future data collection.
  
  This is a really, really great idea!
2. efosse 09 Sep 2016
  
  in Public
  
  analogues
  
  Elsewhere you use the spelling "analogs."
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/mass-collaboration/open-calls/open-call-conclusion/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Mass collaboration - 5.2 Human computation

1
1. efosse 09 Sep 2016
  
  in Public
  
  there are actually many situations where social researchers want to code, classify, or label images or texts.
  
  E.g., using Google maps and humans to code for "broken windows" in various neighborhoods.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/mass-collaboration/human-computation/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Running experiments - Further commentary

2
1. efosse 09 Sep 2016
  
  in Public
  
  comparison
  
  comparisons?
2. efosse 09 Sep 2016
  
  in Public
  
  Hawthorn
  
  Hawthorne
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/running-experiments/exp-further/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Running experiments - 4.6 Advice

1
1. efosse 09 Sep 2016
  
  in Public
  
  you should try to design a series of experiments that reinforce each other.
  
  It'd be incredibly helpful if you could briefly discuss an example of experiments reinforcing each other.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/running-experiments/exp-advice/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Running experiments - 4.5.2 Partner with the powerful

1
1. efosse 09 Sep 2016
  
  in Public
  
  Figure 4.17:
  
  This figure would be clearer to me if the pictures were below the text "Info" and "Info + social."
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/running-experiments/making/partner/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Running experiments - 4.5.1.3 Build your own product

1
1. efosse 09 Sep 2016
  
  in Public
  
  I can’t find any other examples of success,
  
  If you have examples of failure, that may be informative as well.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/running-experiments/making/just-do-it/build-your-own-product/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Running experiments - 4.5.1.1 Use existing environments

1
1. efosse 09 Sep 2016
  
  in Public
  
  cumulative advantage.
  
  Or what Merton called the "Matthew effect."
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/running-experiments/making/just-do-it/use-existing/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Running experiments - 4.4.3 Mechanisms

1
1. efosse 09 Sep 2016
  
  in Public
  
  incredibly important.
  
  You can specify here why mechanisms are incredibly important. My take is that often experiments have a "black box" approach and that we don't actually understand a causal effect until we understand the mechanisms.
  
  Conversely, understanding the mechanisms helps strengthen the case for a causal effect. The findings in psychology on supposed "psi" effects are weakened because there are no plausible mechanisms. Likewise, we knew smoking caused cancer back in the 1950s because we had a pretty good idea of the mechanism (e.g., tar) from qualitative data and simple observational studies.
  
  Edit: As well, mechanisms could be used to identify causal effects (e.g., Pearl's "front-door" criterion).
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/running-experiments/beyond-simple/mechanisms/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Running experiments - 4.4.1 Validity

2
1. efosse 09 Sep 2016
  
  in Public
  
  ensures
  
  Perhaps too strong?
2. efosse 09 Sep 2016
  
  in Public
  
  question
  
  questions
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/running-experiments/beyond-simple/validity/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Running experiments - 4.3 Two dimensions of experiments: lab-field and analog-digital

2
1. efosse 09 Sep 2016
  
  in Public
  
  experiments
  
  experiments'
2. efosse 09 Sep 2016
  
  in Public
  
  Figure 4.1
  
  Should this figure have some data points or text in the middle of the plot?
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/running-experiments/lab-field/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Running experiments - 4.1 Introduction

1
1. efosse 09 Sep 2016
  
  in Public
  
  In many situations, you just cannot measure and adjust for all the possible confounders.
  
  And you may condition on a pre-treatment collider variable that induces a back-door path (!).
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/running-experiments/exp-intro/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - Further commentary

1
1. efosse 09 Sep 2016
  
  in Public
  
  There is deep skepticism of certain types of stated preferences data in economics (Hausman 2012).
  
  Also among some social psychologists (e.g., Banaji's work), although still probably not as skeptical as economists.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/asking-further/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.7 Conclusion

2
1. efosse 09 Sep 2016
  
  in Public
  
  Big data sources and surveys are complements not substitutes so as the amount of big data increases, I expect that the value of surveys will increases as well.
  
  I think you need to spell this out more clearly here. Will the value of surveys increase because of the decline of traditional landline surveys, so any plausibly reliable survey data will be more valuable? Or will the value increase because of the growth of big data, which can be combined with survey techniques?
2. efosse 09 Sep 2016
  
  in Public
  
  increases
  
  increase
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/sampling-conclusion/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.6.1 Amplified asking

4
1. efosse 09 Sep 2016
  
  in Public
  
  persons
  
  person
2. efosse 09 Sep 2016
  
  in Public
  
  ,
  
  Remove comma
3. efosse 09 Sep 2016
  
  in Public
  
  with 10-fold cross-validation
  
  Consider including a sentence discussing cross-validation for social scientists. I suspect many are not familiar with cross-validation.
4. efosse 09 Sep 2016
  
  in Public
  
  third-party
  
  third party
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/linking/amplified-asking/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.6 Surveys linked to other data

1
1. efosse 09 Sep 2016
  
  in Public
  
  There is just too much to be gained by linking survey data to other data sources, such as the digital trace data discussed in Chapter 2.
  
  Perhaps mention "data fusion"?
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/linking/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.4.2 Forecasting and nowcasting

4
1. efosse 09 Sep 2016
  
  in Public
  
  forecasting.
  
  Perhaps explain how forecasting is different from prediction? (I view prediction as a more general category than forecasting.)
2. efosse 09 Sep 2016
  
  in Public
  
  seem
  
  seems
3. efosse 09 Sep 2016
  
  in Public
  
  simple
  
  simpler?
4. efosse 09 Sep 2016
  
  in Public
  
  searchers
  
  searches
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/designs/forecasting/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.5.3 Gamification

1
1. efosse 09 Sep 2016
  
  in Public
  
  More generally, with some creativity and design work, it is possible to improve the user experience for survey participants.
  
  My informal experience is that people find open-ended survey questions (i.e., text responses) more enjoyable to answer than closed-ended survey questions. I have not seen any research on this, however.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/how/gamification/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.5.2 Wiki surveys

1
1. efosse 09 Sep 2016
  
  in Public
  
  This domination is not because closed questions have been proven to provide better measurement, rather it is because they are much easier to use; the process of coding open-ended questions is complicated and expensive.
  
  I agree, although there's some psychological work suggesting that closed-ended questions are more predictive of human behavior (i.e., quickly answering closed-ended questions is akin to a quasi-implicit bias that affects behavior).
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/how/wiki/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.5.1 Ecological momentary assessments

1
1. efosse 09 Sep 2016
  
  in Public
  
  asking
  
  asking questions
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/how/ema/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.5 New ways of asking questions

2
1. efosse 09 Sep 2016
  
  in Public
  
  how we ask
  
  how we ask questions
2. efosse 09 Sep 2016
  
  in Public
  
  analogue
  
  Elsewhere you use the spelling "analog" rather than "analogue."
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/how/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.4.3 Non-probability samples: sample matching

1
1. efosse 09 Sep 2016
  
  in Public
  
  Of course, it would be better to do perfectly executed probability sampling, but that no longer appears to be a realistic option.
  
  Was a "perfectly executed probability sampling" ever a realistic option?
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/who/sample-matching/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.4.2 Non-probability samples: weighting

4
1. efosse 09 Sep 2016
  
  in Public
  
  Figure 3.4:
  
  The y-axis needs a label.
2. efosse 09 Sep 2016
  
  in Public
  
  ,
  
  Remove comma
3. efosse 09 Sep 2016
  
  in Public
  
  Remove space
  
  very minor
4. efosse 09 Sep 2016
  
  in Public
  
  weighing
  
  weighting?
Visit annotations in context

Tags

very minor

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/who/nonprobability-estimation/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.4.1 Probability sampling: data collection and data analysis

2
1. efosse 09 Sep 2016
  
  in Public
  
  that you are less likely to learn about.
  
  I find this phrasing somewhat confusing.
2. efosse 09 Sep 2016
  
  in Public
  
  given
  
  giving
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/who/probability-sampling/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.4 Who to ask

1
1. efosse 09 Sep 2016
  
  in Public
  
  the main text will be explained below
  
  this chapter will be explained
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/who/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.3.3 Cost

1
1. efosse 09 Sep 2016
  
  in Public
  
  Remove space
  
  very minor
Visit annotations in context

Tags

very minor

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/total-survey-error/cost/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.3 The total survey error framework

2
1. efosse 09 Sep 2016
  
  in Public
  
  ,
  
  Remove comma?
2. efosse 09 Sep 2016
  
  in Public
  
  scienitsts
  
  scientists
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/total-survey-error/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Introduction - 1.1 An ink blot

1
1. efosse 09 Sep 2016
  
  in Public
  
  1.1 An ink blot
  
  I like the Blumenstock et al. example, but I think the introduction would show the immense change going on with a parallel example from the analog age. E.g., compare Blau and Duncan's work on the American Occupational Structure, which required specifying hypotheses weeks in advance and entailed slow computation with punch cards.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/introduction/ink-blot/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.2 Asking vs. observing

1
1. efosse 09 Sep 2016
  
  in Public
  
  Internal states exist only inside people’s heads, and sometimes the best way to learn about internal states is to ask.
  
  Cf. Implicit Association Test
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/asking-vs-observing/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Asking questions - 3.1 Introduction

3
1. efosse 09 Sep 2016
  
  in Public
  
  appears
  
  appear
2. efosse 09 Sep 2016
  
  in Public
  
  Many researchers
  
  Who?
3. efosse 09 Sep 2016
  
  in Public
  
  area probability sampling
  
  Perhaps mention what prompted the widespread use of probability-based sampling (to parallel the next paragraph, which explains why RDD was used)?
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/asking-questions/asking-introduction/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - Activities

1
1. efosse 09 Sep 2016
  
  in Public
  
  Some people
  
  Who?
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/observing-activities/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.4.3.2 Matching

4
1. efosse 09 Sep 2016
  
  in Public
  
  The growth of always-on, big data systems increases our ability to effectively use two existing methods: natural experiments and matching.
  
  There's a third approach, too. Causal discovery algorithms (i.e., computational improvements) and large amounts of diverse observational data (i.e., always-on big data systems) are enabling researchers to create and evaluate complex DAGs from observational data.
  
  Edit: There aren't many examples in the social sciences using these algorithms, but I think they have a lot of potential if used judiciously.
2. efosse 09 Sep 2016
  
  in Public
  
  together(Einav et al. 2015, Table 11).
  
  Add a space.
  
  very minor
3. efosse 09 Sep 2016
  
  in Public
  
  ,
  
  Remove this comma, perhaps?
4. efosse 09 Sep 2016
  
  in Public
  
  within
  
  from?
Visit annotations in context

Tags

very minor

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/designs/approximating-experiments/matching/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.5 Conclusion

1
1. efosse 09 Sep 2016
  
  in Public
  
  estimating causal effects with natural experiments and matching.
  
  See my previous point about causal discovery algorithms and large volumes of data.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/observing-conclusion/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.4.3.1 Natural experiments

2
1. efosse 09 Sep 2016
  
  in Public
  
  As Table 2.3 makes clear, natural experiments are everywhere if you just know how to look for them.
  
  Or a critic might say "what people think are natural experiments are everywhere." Might be worthwhile to mention the criticisms of natural experiments (e.g., Rosenzweig and Wolpin 2000).
  
  Also, I think you can be more forceful in what I think you're claiming -- that we have more opportunities to find plausibly natural experiments in the digital age.
2. efosse 09 Sep 2016
  
  in Public
  
  mechanism
  
  Replace with "the mechanism" or "mechanisms".
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/designs/approximating-experiments/natural-experiments/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.4.1.3 Censorship of social media by the Chinese government

3
1. efosse 09 Sep 2016
  
  in Public
  
  First, in a step typically called pre-processing, the researchers converted the social media posts into a document-term matrix (see Grimmer and Stewart (2013) for more information). Second, the researchers hand-coded the sentiment of a small sample of posts. Third, the researchers trained a supervised learning model to classify the sentiment of posts. Fourth, the researchers used the supervised learning model to estimate the sentiment of all the posts.
  
  I think the figure would be clearer if you numbered the steps in the figure.
2. efosse 09 Sep 2016
  
  in Public
  
  post
  
  posts
3. efosse 09 Sep 2016
  
  in Public
  
  in
  
  is
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/designs/counting-things/china-censor/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.4.1.1 Taxis in New York City

3
1. efosse 08 Sep 2016
  
  in Public
  
  not incomplete
  
  complete
2. efosse 08 Sep 2016
  
  in Public
  
  not non-representative
  
  representative
3. efosse 08 Sep 2016
  
  in Public
  
  100perday—andworkuntilthattargetismet,thendriverswouldendupworkingfewerhoursondaysthattheyareearningmore.Forexample,ifyouwereatargetearner,youmightendupworking4hoursonagoodday(
  
  Check this text formatting. It's off on my computer.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/designs/counting-things/taxis/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.4.1 Counting things

1
1. efosse 08 Sep 2016
  
  in Public
  
  Generally, people have a pretty good sense of what is important.
  
  This statement does not seem obvious to me. People's values (i.e., sense of what is important) can differ greatly.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/designs/counting-things/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - Further commentary

7
1. efosse 08 Sep 2016
  
  in Public
  
  by Jon Kleinberg in a talk
  
  Perhaps provide some context on Jon Kleinberg. E.g., "by the computer scientist Jon Kleinberg in a talk on X at Y."
2. efosse 08 Sep 2016
  
  in Public
  
  run
  
  running?
3. efosse 08 Sep 2016
  
  in Public
  
  proceed
  
  process
4. efosse 08 Sep 2016
  
  in Public
  
  practical significance rather than statistical significance
  
  Consider adding a sentence defining the difference between these two kinds of significance.
5. efosse 08 Sep 2016
  
  in Public
  
  ,
  
  Remove this comma.
6. efosse 08 Sep 2016
  
  in Public
  
  There is no single consensus definition of “big data”, but many definitions seem to focus on the 3 Vs: volume, variety, and velocity (e.g., Japec et al. (2015)). Rather than focusing on the characteristics of the data, my definition focuses more on why the data was created.
  
  This definition is so common I'm thinking you should place this earlier when you discuss your definition of "big data."
7. efosse 08 Sep 2016
  
  in Public
  
  difference
  
  differences
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/observing-further/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Introduction - 1.3 Research design

2
1. efosse 08 Sep 2016
  
  in Public
  
  abstract
  
  abstruse?
  
  (You mention abstractions in a positive light elsewhere in the book.)
2. efosse 08 Sep 2016
  
  in Public
  
  but I will call them data scientists
  
  Where would you place statisticians? Are they an audience for your book?
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/introduction/research-design/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - 7.2.1 The blending of Readymades and Custommades

1
1. efosse 08 Sep 2016
  
  in Public
  
  either going to sacrifice quality by using ugly Readymades, or they are going to spend lots of time looking for the perfect urinal.
  
  Consider rephrasing. I understand what you mean here, but a perfect urinal is indeed an ugly readymade.
  
  very minor
Visit annotations in context

Tags

very minor

Annotators

efosse

URL

bitbybitbook.com/en/the-future/future-themes/blending/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - 7.3 Back to the beginning

2
1. efosse 08 Sep 2016
  
  in Public
  
  This study combines what we have done with in the past with what we can do in the present.
  
  Consider including another example or two to further support your point. (Although it won't strictly be "back to the beginning" if you include other examples.)
2. efosse 08 Sep 2016
  
  in Public
  
  The future of social research will be a combination of social science and data science.
  
  I understand why it's a good idea to combine social with data science, but this statement makes it seem like it's a near-inevitability. I'm thinking you could add more material in this section on barriers to combining social with data science, and how we can overcome them.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/the-future/back/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - 7.1 Looking foward

1
1. efosse 08 Sep 2016
  
  in Public
  
  transition
  
  Consider replacing with "one". E.g., "...in the process of making a transition like the one from photography to cinematography."
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/the-future/looking-forward/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.3.2.3 Non-representative

1
1. efosse 08 Sep 2016
  
  in Public
  
  also flip
  
  Remove "also".
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/characteristics/bad/non-representative/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.3.2.2 Inaccessible

3
1. efosse 08 Sep 2016
  
  in Public
  
  researchers
  
  Consider adding: "as well as companies."
2. efosse 08 Sep 2016
  
  in Public
  
  And, eBay was also.
  
  Consider incorporating into the previous sentence.
3. efosse 08 Sep 2016
  
  in Public
  
  Research
  
  Researchers?
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/characteristics/bad/inaccessible/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.3.2.1 Incomplete

1
1. efosse 08 Sep 2016
  
  in Public
  
  the theoretical constructs in many existing theories.
  
  Consider rephrasing.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/characteristics/bad/incomplete/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.3.1.3 Non-reactive

2
1. efosse 08 Sep 2016
  
  in Public
  
  search engine queries
  
  Search engine personalization presents some ambiguity, however, to the idea that these queries are non-reactive.
2. efosse 08 Sep 2016
  
  in Public
  
  researcher
  
  researchers
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/characteristics/good/non-reactive/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.3.1.2 Always-on

3
1. efosse 08 Sep 2016
  
  in Public
  
  always-on data systems enable researchers to study unexpected events and provide real-time information to policy makers.
  
  An admirable but flawed attempt at this was Argentina's Project Cybersyn in the early 1970s.
2. efosse 08 Sep 2016
  
  in Public
  
  For example, social media data can be used to guide responses to natural disasters (Castillo 2016).
  
  Perhaps you could be more specific here in the example? It will clarify your point more for readers, I think.
3. efosse 08 Sep 2016
  
  in Public
  
  ex-post panel
  
  Perhaps parenthetically define this phrase?
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/characteristics/good/always-on/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.3 Common characteristics of big data

1
1. efosse 08 Sep 2016
  
  in Public
  
  etc
  
  The period is missing from "etc." In general, I tend to prefer "and so on" or "and so forth" instead of "etc."
  
  minor
Visit annotations in context

Tags

minor

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/characteristics/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Observing behavior - 2.2 Big data

1
1. efosse 08 Sep 2016
  
  in Public
  
  often called digital traces
  
  It seems that you've already defined digital traces several times earlier, so I would consider removing the phrase "are often called digital traces, and".
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/observing-behavior/data/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Introduction - 1.5 Outline of the book

1
1. efosse 07 Sep 2016
  
  in Public
  
  doing survey research
  
  I think understand what you mean here, but I'm not sure all readers would understand how surveys entail an interaction with people.
Visit annotations in context

Annotators

efosse

URL

bitbybitbook.com/en/introduction/outline/
www.bitbybitbook.com www.bitbybitbook.com

Bit By Bit - Introduction - 1.2 Welcome to the digital age

5
1. efosse 07 Sep 2016
  
  in Public
  
  enough
  
  Might be more specific here. E.g., "enough to fully map the wealth distribution in Rwanda."
2. efosse 07 Sep 2016
  
  in Public
  
  transition
  
  You use the word "transition" a lot in the first few sentences. I'd consider replacing this word with "change" or "switch" (or something similar).
  
  minor
3. efosse 07 Sep 2016
  
  in Public
  
  the principles of social research in the past will inform the social research of the future.
  
  Do you mean that the principles of analog age social research will inform those for digital age social research?
4. efosse 07 Sep 2016
  
  in Public
  
  to run innovative surveys and to create mass collaboration
  
  Another innovation is that physical distance becomes less important. Arthur C. Clarke predicted back in the 1970s that these new forms of communication would render physical travel obsolete. He said that people in the future (i.e., today) would "communicate, not commute."
5. efosse 07 Sep 2016
  
  in Public
  
  These trends—increasing digital information and increasing computing—show no sign of slowing down.
  
  I generally agree with this view, although there has been some discussion regarding a slowdown in Moore's law. E.g., https://www.technologyreview.com/s/601102/intel-puts-the-brakes-on-moores-law/
Visit annotations in context

Tags

minor

Annotators

efosse

URL

bitbybitbook.com/en/introduction/digital-age/

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Tags

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Tags

Annotators

URL

Annotators

URL

Annotators

URL

Tags

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Annotators

URL

Tags