18 Matching Annotations
  1. Apr 2021
    1. This protocol uses a window of 1500 variants, shifted by 10% for each new round of comparisons, and a threshold of R 2  > 0.2. The window size of 1500 variants corresponds to the large, high LD chromosome 8 inversion, while the shift of 10% represents a trade-off between efficiency and thoroughness

      测试过,pure LD后没有关联位点了。可能是假阳性?

    2. It is necessary to remove rare variants from GWAS because the certainty of the genotype call is reduced by their low minor allele count. Even in common variants, however, genotyping and genotype recalling are subject to technical error, with the result that a proportion of variants and samples are of low quality, and should be removed from the analysis.

      稀有变异的检出率不是很可靠?

    3. For the smallest studies, where fewer than 1000 individuals are investigated, a cut-off of 5% should be considered—this is in line with the analysis program GenAbel, for example, which uses a minor allele count of 5 as its cut-off [ 18 ].

      1000个样本以下推荐5%的maf,有空要测试一下

  2. Mar 2021
    1. The expectation is that IBD = 1 for duplicates or monozygotic twins, IBD = 0.5 for first-degree relatives, IBD = 0.25 for second-degree relatives and IBD = 0.125 for third-degree relatives. Due to genotyping error, LD and population structure there is often some variation around these theoretical values and it is typical to remove one individual from each pair with an IBD > 0.1875, which is halfway between the expected IBD for third- and second-degree relatives. For these same reasons an IBD > 0.98 identifies duplicates.

      IBD过滤标准.IBD即plink结果中的PI_HAT。

    2. The method works best when only independent SNPs are included in the analysis. To achieve this, regions of extended linkage disequilibrium (LD) (such as the HLA) are entirely removed from the dataset8 and remaining regions are typically pruned so that no pair of SNPs within a given window (say, 50kb) is correlated (typically taken as r2>0.2)

      LD过滤,但实际发现过滤后位点很少,关联效应很弱

    1. Our primary aim was to generate a cohort large enough to examine the heritability of prognostic therapy outcomes. However, the meta-analysis estimate of SNP heritability was low and non-significant (h2SNP = 0.09, SE = 0.17). A sample size of 2724 has 80% and 99% power to detect a SNP-heritability of 33% and 50%, respectively94. To achieve 80% power to detect a heritability of 20%, a sample of 4500 individuals will be required. A meta-analysis of 2 799 individuals was sufficient to detect a significant heritability estimate for therapy outcome to antidepressant drugs (h2SNP = 0.42, SE = 0.18) and this was the first evidence of a genetic component for treatments outcome of any kind

      统计方法值得学习

    2. The meta-analysis sample (n = 2724) had 80% power to detect variants explaining 1.5% of the variance and 42% power to detect variants explaining 1% of the variance. Therefore, it is not especially surprising that we do not detect any variants at genome-wide significance. Typically, GWAS of psychological traits have required tens of thousands of participants to detect SNPs at genome-wide significance

      心理疾病GWAS研究艰难,2724个样本的meta分析结果还是阴性

  3. Mar 2020
    1. which is the basis of our planned second release (PLINK 2.0).

      See the homepage for updates taking it towards PLINO 2.0 alpha https://www.cog-genomics.org/plink/2.0/

      We also have phased and annotated data for use in plink2.0 worked examples in GigaDB http://dx.doi.org/10.5524/100516

  4. Jun 2018
  5. May 2018
  6. Nov 2017
  7. Oct 2017
  8. Jun 2016