32 Matching Annotations
  1. Apr 2019
    1. List them in codebook form.

    2. upload your self testing data on PsyToolkit.

    3. This code is sufficient.

    4. Give the detailed description on this part.

    5. Inconsistency with replication: participants

    6. Error rate of lures and distraction Accuracy of Fillers

    7. Properties of words to be recognized: Lures (Critical words of memory lists) Distraction (Elementary words out of memory lists) Fillers (Elementary words in memory lists)

    8. effect size > 1.5; Optional stop by Bayes Factor

    9. 1.5 minutes?

  2. mp.weixin.qq.com mp.weixin.qq.com
    1. 要保持谦逊:兼容性评估的前提是用于计算区间的统计假设是正确的


    2. 和0.05的P值阈值一样,95%置信区间中的95%本身也是约定俗成的。其基础观点就存在问题,即计算出的区间有95%的可能性包含真值;并且95%这个数字让人有一种模糊的感觉——根据这个区间可以得出可靠自信的结论。


    3. 在给定假设的情况下,区间内数值与研究数据的兼容性并不完全相同

      原文“not all values inside are equally compatible with the data, given the assumptions. ” 這裡的assumption是指估計的參數,還是科學理論對現實狀況的預測,並沒有明確說明。如果是估計的參數,Amrhein等人也許將P(D|theta)當成P(theta)。

    4. 在给定假设的情况下,区间覆盖了与研究数据最兼容的数值,并不意味着区间外的数值一定与研究数据不兼容,只是兼容性不那么高


    5. 兼容区间


    6. 我们看到了大量具有“统计学显著性”的结果;而不具有“统计学显著性”的结果则被显著低估


    7. 在置信区间包含风险显著增高的情况下,仅因为结果不具有统计学显著性就推论药物与房颤发生“无关”十分可笑;据此就认为前后两项研究矛盾——即便风险比完全一致——同样非常荒谬。这些常见情况表明我们依赖的统计学显著性阈值有可能误导我们。

      Amrhein 等人以此例子顯示confidence interval能突顯不一致的研究之間,評估要測量的效應其實一致的資訊。

    1. Merely reporting likelihood ratios does not produce meaningful control of errors; nor do likelihood ratios mean the same thing in different contexts. So N-P consider the probability distribution of Λ(X), and they want to ensure the probability of the event {Λ(X) ≥ kα} is sufficiently small under H0

      NP started design the hypothesis testing from the likelihood ratios, a.k.a. Bayes Factor.

  3. Aug 2018
    1. social network information from the general pop-ulation of psychologists

      Third method: "a network scale-up estimate, where the general population (within research psychology) will be asked how many people they know who have used QRPs in the last 12 months."

    2. the unmatchedcount technique

      Second method: "an unmatched count estimate, where population size will be measured indirectly via item counts."

    3. We simply asked researchers toreport their own QRP use.

      First method: "a direct estimate, where participants are asked directly if they have used a QRP within the last 12 months."

    4. it addressed the larger issue of “prevalence”, by definingbehaviors performed within a specified time period.

      Second goal of this study. This study investigated the QRPs happened in the particular timing.

    5. replication crisis

      First goal of this study.

    6. 47.9%

      Third summary of QRPs. Italian psychologists.

    7. Fiedler and Schwarz (2016) found less than10% prevalence

      Second summary of QRPs survey. German psychologists

    8. John et al. (2012) found 63%

      First summary of QRPs survey. American psychologists.

    9. Questionable Research Practices of interest with examples

      Included 9 QRPs of John et al.(2012).

    1. The software you develop is part of the methods section, and it is the easiest part to share.

      This should be the core principle in every course of research method.

    2. others can learn from what we have done, and revealing how you carried out an experiment is at the heart of any publication.

      Every workshop for scientists should teach the latest open methodology.

    3. We should consider the data to be the main publication, and the paper a secondary, less important part

      I'm imaging there will be no human language as the primary publication language; or every human language is the primary language to express the scientific knowledge.

    4. imagine if open science was considered normal, and closed science considered weird. Wouldn’t the world be a better place?

      In every aspect of science, making "Open" mainstream.