29 Matching Annotations
  1. Last 7 days
    1. The variants detected in the plasma were characterized by comparingthe plasma and buffy coat VF and sequencing results. A variant ischaracterized as being of tumor origin if the lower 95% confidenceinterval (CI) of the plasma VF is greater than the upper bound of the95% CI of the buffy coat VF. The variant is characterized as being ofCH origin if the buffy coat VF is less than 20% and the 95% CI overlapswith the plasma VF or if the buffy coat VF is less than 20% and this VFis greater than the plasma VF. It is characterized as germline if the buffycoat VF is 30% or higher and the plasma VF is less than the buffy coatVF. The variant source is considered unknown if the buffy coat VF wasbetween 20% and 30% to account for potential postzygotic mosaicismresulting in the presence of certain variants only in a subset of cells (13).Variants detected in genes that are known to commonly harbor CHmutations (e.g., ASXL1, JAK2, DNMT3A, and TET2) are characterizedas CH if the buffy coat VF is greater than the plasma VF, as outlinedabove and depicted in Fig. 1.

      Same as in Abraham et al. 2025

    Annotators

    1. we interro-gated 15 mutation hot spots in blood DNA from 4,219individuals using ultra-deep sequencing.

      Healthy individuals (no blood disorder) only

    Annotators

  2. Oct 2025
    1. For a sample, they calculate the number of total genomes using QCTs. Then, they calculate the fetal fraction using polymorphic loci. Based on the fetal fraction and the number of total genomes, they estimate the expected number of fetal antigen molecules (AEM) if the fetus is positive. If the AEM is lower than the threshold, they discard.

      Then, they calculate the number of antigen molecules detected (ADM) using the read counts and QCTs. Then, they calculate the detected fetal antigen fraction (CFAF) as the number of detected molecules over the number of expected molecules. Across the 5 RhD loci, they take the second highest CFAF value and they compare it to the detection ranges to make the "antigen detected" call.

    Annotators

    Annotators

    1. We observed distinct transcriptional con-tributions from solid tissue-specific cell types from the intestine,liver, lungs, pancreas, heart, and kidney (Fig. 1d and Extended DataFig. 4).

      They showed that cfRNA data included transcriptional signatures which did not match type-specific transcriptional signatures from hematopoietic cells but did match signatures from intestine, liver, lungs, pancreas, heart, & kidney cells.

    2. We used this matrix to deconvolve the cell types of origin inthe plasma cell-free transcriptome

      Then, they quantified each marker transcript in the public cfRNA data from healthy patients, and they used them to deconvolve how many of each cell type were present in each sample the cfRNA data.

    3. We used Tabula Sapiens ver-sion 1.0 (TSP)12, a multiple-donor whole-body cell atlas spanning24 tissues and organs, to define a basis matrix whose gene set accu-rately and simultaneously resolved the distinct cell types in TSP.

      Then, they took public scRNA-seq data (Tabula Sapiens) from cells with labeled types and used SVM to identify the smallest setof transcripts in those samples which discriminated cells of each type from each other.

    4. Marker genes for blood, brain, and liver cell types were readilydetected, as previously observed at tissue level 1,3,4,6,7, as well as thekidney, gastrointestinal tract, and pancreas (Fig. 1b).

      Basically, they took public cfRNA data from healthy patients and identified transcripts. Then they showed that the transcripts they detected included marker transcripts for certain cell types.

    Annotators

    1. Barcoded nanopore sequencing libraries were created using a custom low-input protocol for theOxford Nanopore Technologies SQK-NBD-114.24 and LSK-114 kits. Briefly, cDNA was end-repaired and A-tailed as specified, but incubated for 30 minutes, followed by a 30-minutedeactivation. Barcodes from the SQK-NBD-114.24 kit were incubated at 20 C for 4.5 hours.

      No UMIs

    2. Novel transcript discovery

      Quote GPT: "Let’s break this down step by step and look carefully at the computational procedure they used, assessing both its strengths and weaknesses.


      What They Did Right

      1. Choice of Reference and Annotation

      2. They used hg38 as the reference genome and the GENCODE v39 comprehensive transcript annotation set.

      3. This is good practice, since hg38 is the current gold-standard reference, and GENCODE is the most complete and high-quality human transcript annotation resource.

      4. Tool for Isoform Discovery (IsoQuant v3.3.1)

      5. IsoQuant is designed for long-read transcriptome analysis and is well-suited to Oxford Nanopore data, which has unique error profiles compared to Illumina.

      6. They set -d nanopore, which ensures that IsoQuant applies the correct error model and alignment heuristics.

      7. Quality Control (SQANTI3)

      8. They validated the assembled transcript models with SQANTI3, which is one of the standard tools for assessing structural and quality metrics of novel isoforms (junction accuracy, coverage, splice motif analysis, etc.).

      9. This helps filter out false positives and adds confidence to reported novel isoforms.

      10. Quantification Strategy

      11. Using --gene quantification all and --transcript quantification all ensures they didn’t limit themselves to annotated transcripts, which is important for discovery.


      Potential Weaknesses / Flaws

      1. Batch Analysis of All FASTQs Together

      2. They ran IsoQuant on all FASTQs in one batch.

      3. This risks pooling artifacts across samples: rare or sample-specific isoforms may look more convincing when aggregated, but they might not be real if not supported in individual samples.
      4. Ideally, one should assemble transcripts per sample first and then merge, to track sample-specific expression and minimize false discoveries.

      5. Strandedness Setting (-stranded none)

      6. For cfRNA, strandedness is often important to distinguish sense from antisense transcription.

      7. By setting it to "none," they lose orientation information, which could inflate the number of apparent novel transcripts or misclassify antisense transcription as novel.

      8. Loose Matching Strategy (-matching_strategy loose)

      9. This increases sensitivity but at the cost of specificity.

      10. Nanopore reads have higher error rates, and loose matching risks aligning spurious sequences as novel isoforms when they might be alignment artifacts.

      11. Novel Transcript Reporting (--report_novel_unspliced true)

      12. Allowing unspliced transcripts to be reported as novel might overcount pre-mRNA fragments or degraded cfRNA, which are abundant in plasma.

      13. Without careful filtering, many of these could be biologically irrelevant.

      14. SQANTI3 with Default Parameters

      15. While SQANTI is robust, running it with defaults only may not be enough. For cfRNA (which is fragmented and low-abundance), more stringent filters (on junction support, read coverage, canonical splice motifs) are usually needed.

      16. Otherwise, false positives could pass QC.

      Overall Reliability

      • Strengths: They used well-regarded tools (IsoQuant + SQANTI), a proper reference (hg38 + GENCODE), and applied the Nanopore-specific model. That’s a solid foundation.
      • Weaknesses: They pooled samples, did not use strandedness, allowed loose matching, and reported unspliced transcripts—all of which tend to inflate the number of “novel” transcripts and may misclassify artifacts as biology. For cfRNA, which is already fragmented and noisy, these issues are magnified.

      Conclusion

      Their method is adequate for generating hypotheses about novel cfRNA transcripts, but the reliability of their discoveries is questionable without stronger filtering and sample-level validation. The key weaknesses (pooling, unstranded analysis, permissive matching, unspliced reporting) suggest their reported “novel transcripts” are likely overestimates.

      If I were evaluating this, I’d say:

      • Good first pass, but high false discovery rate.
      • They should have assembled per-sample, used stranded protocols if possible, tightened alignment criteria, and reported only spliced, reproducible isoforms across samples."
    3. ptimallambda values for the logistic regression model were identified using 5-fold cross validation, andthe value 1 standard error away from the minimum cross-validated value of lambda was selectedfor final model training. The final model was trained utilizing the entire dataset, and furtherfeature reduction was performed using LASSO regression.

      They didn't do any validation or testing on a validation/test set, they just trained, subset features, and reported sens/spec

    Annotators

  3. Sep 2025

    Annotators

    1. We compared 48 cases of Alphapapillomavirus detection in WGSdata against the current gold standard test of mRNA PCR high-risk/tumorigenic subtypes of HPV. The performance using WGS datawas excellent, with only one sample not matching the gold standard(n = 48; sensitivity = 100%, specificity = 97.3%; Fig. 3A). This sam-ple had high HPV burden as detected by WGS and was likely a false-negative result for the PCR-based test.

      Somehow alphapapillomavirus leads to head and neck cancer

    Annotators

    Annotators

    1. while robustly preserving biological features42. Instead, in many cases, key biological featuresare inadvertently removed from datasets, while contaminants and artifacts are kept36

      discovery of low-abundance biosignatures

    Annotators

    1. sparse1-3, noisy4,5 contain low-abundancefeatures2,6 and come from disparate datasets7,8

      However, SoA approaches do not work in many critical applications

    2. directly quantifying information loss/gain by SOA approaches and MIRACLe using well-established information-theoretic, divergence, and distance metrics26,
      • Third, we will perform preliminary experiments to quantify the actual amount of
    3. currently impossible25 training of predictive AI models on low abundancebiomarkers

      However, SoA approaches do not work in many critical applications

    Annotators

    1. Long-probe quantitative amplifiedsignal (LQAS) assay master

      LQAS is often used as a way to do qPCR that is methylation-specific besides methylation-specific qPCR.

    Annotators

  4. Aug 2025
    1. Duplex recovery is assessed by the final number of dsDNA bases sequenced (in Gb)and total dsDNA coverage (in genome equivalents) across total bases sequenced

      Basically, "duplex recovery" is calculated as: identify every dsDNA read (a read which has at least 1 read from the opposite strand); recovery is "the number of bases on all dsDNA reads divided by the number of bases on all reads". Max is 100%.

    Annotators

    1. Quantitative real-time PCR (RT-qPCR) was used for quantification ofcfRNA. An RNA-specific primer was designed to cover a 97-bp ampli-con spanning 2 exon boundaries in the housekeeping gene GAPDH(forward 5′-GATCATCAGCAATGCCTCCT-3′, reverse 5′-TGTGGTCATGAGTCCTTCCA-3′). A DNA-specific primer was designed to covera 78-bp transcriptionally silent region of chromosome 12 (forward5′-TACGGTTGGTCCTTTCTTCG-3′, reverse 5′-TTTCCTTTGGGTCTGAATGC-3′). Reverse transcription was first performed using the High-Capacity cDNA Reverse Transcription kit (Applied Biosystems). qPCRwas then run using 2X Power SYBR Green PCR Master Mix (ThermoFisher Scientific) on an Applied Biosystems 7500 Fast Real-Time PCRor QuantStudio 7 Pro instruments. Universal Human Reference RNA(Thermo Fisher Scientific) was run in parallel to generate a standardcurve, and cfRNA concentrations were calculated by comparing theRNA-specific Ct value of the sample to the standard curve.

      They quant cfRNA after extraction but before library prep & capture. They use RT-qPCR of a GAPDH region vs a standard curve made using Universal Human Reference RNA. Therefore their actual RNA quant might be off - it's more like a rough overall expression estimate.

    Annotators