19 Matching Annotations
  1. Oct 2024
    1. When we only look at the viruses specified above, we find that plasma contains 18 out of the 21 viruses, whereas whole blood contains 17 out of the 21 viruses. Specifically, plasma uniquely contains HBV, whereas whole blood uniquely contains HTLV-1 and HTLV-2.

      I would also give the total number of matches for some of these viruses. If there are frequently only one match, than that could be spurious (BLASTing them could help in making the matching more confident)

    2. Figure 4: Relative abundance of selected human-infecting viruses in each dataset. Top: without controlling for human reads. Bottom: controlling for human reads.

      I would pick either the controlled for human RA or the not controlled for human RA numbers here.

      I would add alpha here (i.e., make the symbols slightly transaprent).

      I find it quite hard to quickly match the different symbols with the different studies.

      I would add more y tick labels

      The horizontal grid lines are slightly confusing, as they do not help in matching the symbols to the labels on the left.

      I would cut down on the number of viruses covered. E.g., you could have a cut off of ten samples minimum for each virus.

    3. We can try to get a better understanding of the performance of whole blood by controlling for human reads (Table 3). The number of viral reads becomes much higher when we control for human reads, making it comparable to the plasma datasets.

      simplify wording

    4. Even comparing viral enrichment methods, we can see that Cebria-Mendoza has a much higher fraction of viral reads compared to Thijssen.

      Easier to double-check this by having enriched studyies marked accordingly

    5. Choice of viral enrichment tends to have an impact on kingdom composition, as we can see that Cebria-Mendoza and Thijssen, both of which did viral enrichment, have significanlty lower human read fractions than Mengyi, despite all datasets coming from plasma.

      Improve wording

    6. This approach provides comprehensive genetic information but precludes separate analysis of DNA and RNA from individual samples.

      What kind of sequencing did they do (single-end or paired-end, what bp length)

    7. This analysis will help us understand the last piece missing from the puzzle

      For the final blog post, I'd change this wording. Don't know what puzzle you are talking about here.

  2. Aug 2024