10,000 Matching Annotations
  1. Last 7 days
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. Point-by-point description of the revisions


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this study, the authors investigated the effect of nutritional stress (HSD and HFD) on cardiac function by assessing multiple parameters on adult flies. They next identified the adaptive transcriptomic changes in the heart in response to these nutritional stresses and screened for their roles under ND, HSD and HFD. They identified fit gene, encoding a satiety gene, expressed by cardiomyocytes and pericardial cells.

      I think the characterisation is thorough; however, the conclusion is not well supported by the evidence. My main concern is that in many graphs, the difference between control and experiment is subtle, and, secondly, the authors showed some conflicting results (e.g. one RNAi showed a reduction of one parameter, however, the other independent RNAi did not. In this case, I believe the authors shouldn't conclude that the RNAi is functionally required, since the RNAis are meant to confirm each other.

      First, we thank the reviewer for her/his constructive comments and suggestions. We obtained new results presented in the last version of the manuscript, which consistently support our conclusions and improve the study.

      High-Sugar and High-Fat Diets modified cardiac performance

      They assessed how HSD and HFD affect Adult fly heart performance. Instead of performing 3 weeks of dietary manipulation as has been done before by other groups, they put adult flies on HSD for 7 days and HFD for only 3 days.

      We would like to clarify the nutritional challenge used. Cardiac function of flies was assessed at 10 days after emergence. Flies were put either in ND or HSD during these 10 days (ND and HSD conditions), or in ND for 7 days then transferred on HFD for 3days (HFD condition). Finally, all the females spent 10 days in a diet before being imaged or before hearts/brains dissection.

      They found: HSD increases HP and SI, and reduces AI. The difference is too small and not consistent between different control lines. Also, when the difference is this small, p value does not tell much!

      They probably intentionally induced a milder effect so that they could assess adaptive transcriptomic changes to this nutritional stress. In Fig. 1D SI is increased under HSD with control-KK, In Fig. S1C, SI is not changed under HSD with control-GD and control-GFP. Instead, DI is increased, which is also opposite to what they showed in Fig. 1 C. HFD increased ESD, EDD, SV, FS and CO.(Hypertrophy). This is not true with control-GD and control-GFP lines though! Comments: They have assessed many parameters in live animals with many different control lines, which is thorough. However, it is hard to draw any conclusions based on these conflicting results. Are these effect KK line specific?

      Globally, we agree with the reviewer that the results, presented in the first version of the manuscript, for the control lines were difficult to understand due to the inconsistency of the phenotypes. In this revised version, we performed new results in Figure 1 and __S1 __regarding the effect of 10 days HSD and 3 days HFD exposure vs ND.

      105 to 187 flies were imaged for the 3 control conditions, in the 3 diets concomitantly, to increase the power of our analysis. As mentioned in the main text (page 3, line 30-35; page 4, line 1-5), both diets deteriorate cardiac function with HFD leading to consistent phenotypes on heart diameters and rhythm and HSD milder effects. Indeed, the 3 control lines were uniformly affected by HFD after 3 days exposure, whereas 10 days in HSD was not sufficient to quantify a significant effect despite consistent the trends on several phenotypes (EDD, ESD, DI, AI and CO. These results revealed a different sensitivity of the cardiac performance when exposed to sugar and fat.

      As described in the text, we were nevertheless confident that our approach would be good to investigate the early molecular dysregulations induced by sugar. This was the purpose of our analysis, presented in the follow-up of the manuscript.

      Regarding the small differences measured in the phenotypes in HSD and HFD compared to ND, we would like to outline that the values presented are normalized values to control. Normalization is done for every independent experiment, performed at different dates, and permits the graphical representation of pooled values. Statistical analysis is performed using non-parametric Kruskal-Wallis test accordingly. Values are presented with the X axis cutting the Y axis at 0, this graphical representation also contributes to flattening the differences and p-values indicate their significance.

      Analysis of the fly cardiac transcriptome upon nutritional stress

      RNA seq to detect differentially expressed genes under HSD and HFD vs ND. Most DE genes are downregulated, which prompts them to assess how the downregulation of these genes adapts the animals to this nutritional stress.

      High Sugar Diet downregulated 1c-metabolism and Leloir galactose pathways.

      In this revised manuscript, we first present RT-qPCR validating the downregulation of Gnmt, Sardh and Galk expressions in the heart of 10days old HSD-fed females compared to ND-fed ones (Figure S3A).

      We apologize for the confused explanations in the first version of the manuscript. We show new results in Figure 3 and __S3 __on the cardiac function of both Gnmt and Sardh, where following reviewer’s suggestion, both genes were knocked down in the heart in ND and Gnmt overexpressed in HSD. No available tools allowed us to test Sardh overexpression in HSD and we could not get some for Galk.

      GNMT is downregulated under HSD and HFD.

      In ND, GNMT knockdown increased ESD, EDD and CO. Sardh knockdown did the same? However, Sardh knockdown did not affect ESD significantly.

      We reanalyze our first data and added new ones, comparing only knockdown or overexpression to the corresponding controls performed in concomitant experiments. Results are now shown in Figure 3C-E; S3C-H. Knocking down Gnmt in the heart increased HP, EDD, ESD and CO, Sardh knockdown in ND resulted in milder phenotypes but inducing significant hypertrophy in ND as Gnmt does. In both cases, FS was not impacted.

      Both genes have been previously shown as beneficial to muscular function in time-restricted feeding context (Livelo et al., 2023, Nat.Comm.), illustrating that, even if both enzymes are involved in opposite reaction, their function has the same effect on organ/tissue function, as they did for heart diameters. The text corresponding to results and discussion were updated accordingly (pages 5, 11).

      The conclusion here is: GNMT knockdown induces hypertrophy, similar to the effect of HFD.

      In HSD, further knockdown of GNMT reduced (rescued) HP, suggesting downregulation of GNMT under HSD is adaptive. Should overexpress GNMT under HSD to see if this manipulation further increases HP, to claim GNMT downregulation is an adaptive change to high sugar stress.

      We thank the reviewer for her/his suggestion. We now used UAS-GnmtWT (from FlyORF) to assess the role of Gnmt on cardiac function in HSD.

      As shown in (Figure 3C-E; S3C,F), overexpressing Gnmt in the heart in HSD was sufficient to rescue some sugar induced phenotypes or to induce other dysfunctions, when compared to corresponding controls evaluated in the same experiments in ND and HSD. Notably, HP increase and CO decrease are rescued by Gnmt cardiac overexpression in HSD. Interestingly, the cardiac diastolic constriction induced by HSD is associated to increased FS and CO in this genotype in sugar diet. These new results strengthen the positive effect of Gnmt on cardiac function, improving it in HSD and preventing its deterioration in this diet.

      Of note, Gnmt overexpression in ND did not trigger cardiac dysfunctions (data not shown).

      The results and conclusions have been corrected.

      Interestingly, HSD itself tends to decrease AI, a further knockdown of GNMT further decreases AI. This indicates GNMT downregulation under HSD contributes to AI reduction. Together, GNMT downregulation under HSD prevents HP from going higher, while its downregulation causes AI going down.

      In the manscript, the authors claim that " Gnmt KD led reduced HP and AI, suggesting that it is able to counteract the effect of HSD observed in control flies on these phenotypes". This is not true according to the logic in Results section 1. As in section 1, the effect of HSD on AI is not significant, so the authors shouldn't say" HS tended to reduce AI".

      Our reanalyzes and new results showed no Gnmt impact on AI, so these Figure panels were removed.

      Why GNMT knockdown reduced FS under ND (Fig. S3C), while increasing FS under HSD (Fig. 3F)? If GNMT knockdown induces hypertrophy, I would expect it to increase FS.

      Gnmt overexpression did not affect cardiac diameters in HSD, but it nevertheless led to an increased contractile efficacy compared to HSD controls (Figure S3F).

      These new results strengthen the positive effect of Gnmt on cardiac function, preventing its deterioration in sugar diet. The text was modified accordingly.

      High Fat Diet modulated CD36-scavenger receptor and Glut8 orthologues

      In this revised manuscript, we present RT-qPCR validating the downregulation of Snmp1 expression and the slight upregulation of nebu’s in the heart of 10days old HFD-fed females compared to ND-fed ones (Figure S3B).

      HFD: Snmp1 gene is downregulated, however, both overexpression and knockdown of Snmp1 in ND induced some phenotypes.

      Indeed, as mentioned in the revised manuscript (page 6, lines 21-24), in heart of females fed ND, both Snmp1 knockdown (Snmp1KK) and overexpression (Snmp1WT) showed a reduction of EDD and ESD (Figure 3J; S3J) but FS is increased accordingly only in Snmp1KK.

      As notified in the text, both downregulation and overexpression of Snmp1 led to side-phenotypes (page 6, lines 24-28): Snmp1KK exhibited abdominal fat increase (Figure S3K) and ostial cells seemed clearly malformed in Snmp1WT (Figure 3M). This may explain why the heart shows the same type of functional impairment in both genotypes.

      We now discussed the hypothesis that these similar cardiac dysfunctions may result from Snmp1 being a regulator of organismal or cardiac lipid homeostasis. Indeed, increasing body fat content is deleterious as is increasing the import of fat in the cardiomyocytes. Finally, both affects cardiac cells’ health and functioning.

      HFD: nebu has a role in regulating cardiac function under ND.

      HSD and HFD revealed the secretory function of the heart

      They identified diet-regulated secreted proteins that are required for cardiac dysfunction.

      Cardiac Fit expression impacted Cardiac performance.

      The author used Hand-G4 to knock down Fit using KK and GD lines, KK line showed a reduction in HP (Fig. 5A), but not GD line (Fig. S5D). How did the author conclude that Fit is required for cardiac function? Also, with the positive data, the difference is too subtle.

      We apologize and agree that the contradictory or inconsistent results obtained with the two RNAi lines were confusing.

      For this revised version, we first assess the effect of the two RNAi lines (KK and GD) on fit expression in the dissected hearts. RT-qPCR for KK line is presented in Figure S5A. GD line did not show a significant reduction of fit expression when expressed in the heart with Hand>, which can explain the former results presented (not shown but data are available). So, we removed all results obtained with the GD line in this revised version.

      To confirm the KK effects, we used fit KO allele (fit81) and truncated version of fit, without its signal peptide (fitDeltaSP), which has a dominant negative effect, both previously published and validated (Sun et al. 2017, Nat. Comm.). These two mutants were used to investigate the cardiac function of fit in our analysis. Results presented in Figure 5 and S5 confirm the phenotypes already observed with the KK line when expressed with Hand> in the heart and with Lsp2> in the fat body.

      Our results validate the effect of fit decrease on rhythmicity and contractility, the reverse effects being consistently observed in fit overexpression. In conclusion, we are confident in the requirement of Fit in the regulation of cardiac performance.

      These new data are now included in the results section “Cardiac Fit expression impacted Cardiac performance” (pages 8-9)

      **Referee cross-commenting**

      i agree with the experiments proposed by reviewer 2.

      Reviewer #1 (Significance (Required)):

      The study aims to examine the effect of diet on cardiac function.

      The strength is that a lot of characterisations were done.

      the weakness is the functional data regarding fit could not be validated in two different RNAis, thus the evidence is not strong to support the conclusions.

      We again would like to thank the reviewer for her/his remarks and suggestions. She/He highlights the weakness of the first analysis and this was an important and constructive feedbacks for us. We strengthened our results by increasing samples, reanalyzing data and performing mandatory new experiments that are now included in this revised version.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Khamvongsa-Charbonnier et al. reported a RNA-seq analysis and RNA interference screening on high-fat and high-sugar-induced cardiomyopathy in Drosophila. The authors uncovered novel genes in 1C-metabolism, galactose metabolism, CD36-scavenger receptor and glucose transporter, as adaptative factors of cardiac function under high-fat and high-sugar treatment. The authors also identified a satiety hormone, Fit, as a cardiokine to control food intake and , expressed by dilp5 secretion. In summary, this study leverages the powerful genetic model Drosophila to uncover a number of new factors in regulating cardiac function under nutritional stresses and potentially offers new insights into molecular mechanisms underlying diet-related cardiac diseases. I have a few concerns, as listed below.

      First, we would like to thank the reviewer for her/his comments and suggestions that deeply help us to improve the take-home messages of our manuscript. Following her/his recommendations and suggestions, we can now present a revised and stronger version of our manuscript.

      1. Quantitative RT-PCR is required to validate the expression patterns of candidate genes identified from the RNAseq analysis.

      RT-qPCR have been performed on hearts dissected from 10 days old females fed ND, HSD or HFD. Gnmt, Sardh and Galk validated downregulation are presented in Figure S3A, Snmp1 downregulation and nebu upregulation (trend but non-significant) in Figure S3B, fit downregulation in Figure S5A.

      The authors state that the dysregulated gene expression patterns reflect acute adaptation to HSD and HFD stresses. Most of the candidate genes in this study were downregulated upon HSD and HFD. However, it is recommended that overexpression of these genes, rather than knockdown, is needed to confirm whether the downregulation of these candidate genes upon stresses is an adaptative response.

      We agree with the reviewer and followed her/his recommendation when tools were accessible for our analysis.

      For example, HSD feeding induces the heart period. Knocking down Gnmt, specifically in the heart, under the HSD feeding changes can reduce the heart period. This evidence is insufficient to suggest the protective role of Gnmt under the HSD diet. Gnmt has already been downregulated under the HSD. Further knockdown of Gnmt, instead of returning the Gnmt expression to normal levels, to protect cardiac contractile performance complicates the model.

      We thank the reviewer for her/his suggestion. We used UAS-*GnmtWT * (from FlyORF) to perform these experiments.

      As shown in (Figure 3C-E; S3C,F), knocking down Gnmt in the heart increased HP, EDD, ESD and CO. In the same Figure panels and in Figure S3F, we showed that overexpressing Gnmt with Hand> in HSD was sufficient to rescue some sugar induced phenotypes or to induce some, when compared to corresponding controls evaluated in the same experiments in ND and HSD. Gnmt overexpression in ND did not trigger cardiac dysfunctions (data not shown).

      HP increase and CO decrease are rescued by Gnmt cardiac overexpression in HSD. Interestingly, the cardiac constriction induced by HSD is not rescued by Gnmt overexpression, but it is enough to increase FS and CO in sugar diet. These new results strengthen the positive effect of Gnmt on cardiac function, improving it in HSD and preventing its deterioration in this diet.

      Sardh knockdown in ND, resulted in milder phenotypes but induced significant hypertrophy in ND as Gnmt does. No available tools allowed us to test its overexpression in HSD.

      Nevertheless, as mentioned and discussed in the manuscript (page 5, line 27-30; page 11, lines 11-14), such protective role of muscular function and integrity has already been characterized in fly IFM in time-restricted feeding experiments for Gnmt and Sardh (Livelo et al., 2023, Nat.Comm.). Our experiments show that both genes encounter the same role in cardiac function upon nutritional stresses. The text was modified accordingly.

      The authors suggest that the effect of nebu on heart contractility is not dependent on diet. However, based on the result from Figure 3O-P, the HFD treatment blocks the effect of nebu knockdown on heart contractility. The authors need to further explain these results and modify their conclusions accordingly.

      We completely agree with the reviewer. We did not correctly analyze these results. We reanalyze our data, taking into account only the experiments of nebu knockdown that were performed in ND and in HFD concomitantly. Results are shown in Figure 3O-P; S3L-N.

      As mentioned in the manuscript (page 7, lines 3-8), nebu knockdown led to identical HP decrease in both diets but its constrictive effect (reduction of heart diameters) in ND is abrogated by fat diet.

      We modified the text accordingly in the results and discussion (page 7, lines 8-11; page 12, lines 7-12).

      It is a bit confusing that knockdown of fit using Hand-Gal4 induced food intake, but knockdown of fit using tin-Gal4 or Dot-Gal4 did not significantly induce food intake (Fig 6A). The author did not provide any explanation of these results. What is even more confusing is that overexpressing fit using Dot-Gal4 decreased food intake, but overexpressing fit using Hand-Gal4 or tin-Gal4 did not significantly decrease food intake (Fig 6B). Why was the strong food intake phenotype not observed using Hand-Gal4 in both experiments? These confusing results lead to a question, which cell type is responsible for the production of cardiokine, Fit?

      We apologize for the misleading results presented in the initial manuscript. We hope that our revised version will clarify Fit function regarding its remote impact.

      Concerning the requirement of Fit function and the cell types that produces Fit, the results we obtained when evaluating cardiac performance strongly suggest that both cardiomyocytes and pericardial cells are important and recapitulate the effect of Hand> (Figure 5A-C; S5G-H).

      In the case of food intake measurements, we now present results with newly performed food intake experiments for the Hand>fitWT (Figure 6D). They show a significant reduction of food intake in this condition, corroborating the results obtained with Dot>. We add a clarification in the manuscript for this point (page 10, lines 11-16).

      When testing the role of cardiac Fit in Dilp5 secretion, the authors subjected flies to starvation stress. However, the main focus of the present study is on HSD and HFD. The RNAseq analysis showed that Fit expression was downregulated by both HSD and HFD. Can the authors show that Dilp5 secretion is reduced by both HSD and HFD? Most importantly, can the authors test whether overexpression of cardiac Fit blocks HSD- or HFD-reduced Dilp5 secretion?

      We understand the point raised by the reviewer. First of all, we wanted to correlate the measured impact on food intake, when manipulating fit expression in the heart, to the level of Dilp release, as it has been used and validated in (Sun et al. 2017, Nat. Comm.). In this purpose, we used the same approach and protocol and results are shown in Figure 6 E-F.

      As mentioned by the reviewer, fit expression is downregulated in both HSD and HFD (which we confirmed by RT-qPCR in Figure S5A). As suggested by the reviewer, we performed Dilp5 immunostaining on CNS from females that were fed HSD of HFD for 10 days. Our results, in Figure 6B (left panels) and corresponding quantifications in Figure 6C, show that both diets strongly induce a decrease in Dilp5 amount in the IPCs and that it was not due to an altered Dilp2 or Dilp5 expression in the CNS (Figure S6A). In this condition, overexpressing fit, which has a promoting effect on Dilp secretion (Figure 6B, right panels ND), may only have an additive effect. This is shown in Figure 6B-C.

      Reviewer #2 (Significance (Required)):

      In summary, this study leverages the powerful genetic model Drosophila to uncover a number of new factors in regulating cardiac function under nutritional stresses and potentially offers new insights into molecular mechanisms underlying diet-related cardiac diseases.

      We again would like to thank the reviewer for her/his remarks and suggestions. Her/His important and constructive feedbacks helped us to improve and strengthen our study. Despite the weak points of the first version, she/he had supportive feedback and we deeply thank her/him. This revised version had improved results and analysis, thanks to the use of new genetic tools that strengthen this analysis.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Khamvongsa-Charbonnier et al. reported a RNA-seq analysis and RNA interference screening on high-fat and high-sugar-induced cardiomyopathy in Drosophila. The authors uncovered novel genes in 1C-metabolism, galactose metabolism, CD36-scavenger receptor and glucose transporter, as adaptative factors of cardiac function under high-fat and high-sugar treatment. The authors also identified a satiety hormone, Fit, as a cardiokine to control food intake and , expressed by dilp5 secretion. In summary, this study leverages the powerful genetic model Drosophila to uncover a number of new factors in regulating cardiac function under nutritional stresses and potentially offers new insights into molecular mechanisms underlying diet-related cardiac diseases. I have a few concerns, as listed below.

      1. Quantitative RT-PCR is required to validate the expression patterns of candidate genes identified from the RNAseq analysis.
      2. The authors state that the dysregulated gene expression patterns reflect acute adaptation to HSD and HFD stresses. Most of the candidate genes in this study were downregulated upon HSD and HFD. However, it is recommended that overexpression of these genes, rather than knockdown, is needed to confirm whether the downregulation of these candidate genes upon stresses is an adaptative response. For example, HSD feeding induces the heart period. Knocking down Gnmt, specifically in the heart, under the HSD feeding changes can reduce the heart period. This evidence is insufficient to suggest the protective role of Gnmt under the HSD diet. Gnmt has already been downregulated under the HSD. Further knockdown of Gnmt, instead of returning the Gnmt expression to normal levels, to protect cardiac contractile performance complicates the model.
      3. The authors suggest that the effect of nebu on heart contractility is not dependent on diet. However, based on the result from Figure 3O-P, the HFD treatment blocks the effect of nebu knockdown on heart contractility. The authors need to further explain these results and modify their conclusions accordingly.
      4. It is a bit confusing that knockdown of fit using Hand-Gal4 induced food intake, but knockdown of fit using tin-Gal4 or Dot-Gal4 did not significantly induce food intake (Fig 6A). The author did not provide any explanation of these results.

      What is even more confusing is that overexpressing fit using Dot-Gal4 decreased food intake, but overexpressing fit using Hand-Gal4 or tin-Gal4 did not significantly decrease food intake (Fig 6B). Why was the strong food intake phenotype not observed using Hand-Gal4 in both experiments?

      These confusing results lead to a question, which cell type is responsible for the production of cardiokine, Fit? 5. When testing the role of cardiac Fit in Dilp5 secretion, the authors subjected flies to starvation stress. However, the main focus of the present study is on HSD and HFD. The RNAseq analysis showed that Fit expression was downregulated by both HSD and HFD. Can the authors show that Dilp5 secretion is reduced by both HSD and HFD? Most importantly, can the authors test whether overexpression of cardiac Fit blocks HSD- or HFD-reduced Dilp5 secretion?

      Significance

      In summary, this study leverages the powerful genetic model Drosophila to uncover a number of new factors in regulating cardiac function under nutritional stresses and potentially offers new insights into molecular mechanisms underlying diet-related cardiac diseases.

    1. Reviewer #3 (Public review):

      Summary:

      Prudhomme et al report a detailed analysis of the role of vinculin in maintaining neuroepithelial integrity during cranial neurulation.

      Strengths:

      The authors use complementary experiments involving super-resolution microscopy, laser ablation, and live imaging of conditional knockout and ESC-derived embryos to demonstrate that loss of vinculin produces wide gaps between the adherens junctions of neuroepithelial cells at later stages of cranial neural fold elevation. The data presented are of extremely high quality, logically presented in a compelling story, and represent a very substantial contribution.

      Weaknesses:

      The authors are invited to consider the largely minor questions recommended below.

      (1) The laser ablations reported are a correlate of cell border, or 'junctional' tension. Please avoid broad statements such as 'mechanical forces are upregulated' (abstract), which invoke gene-like regulation of tissue-level forces (in Newtons). Changes in junctional tension are likely to relate to changes in force generated, but their relationship is not simple: higher tensile stress withstood by the shorter length of junctions in cells with smaller apical surfaces does not necessarily translate into greater force being produced by that cell. The junctional tension readout measured is perfectly relevant to the paper, more so than tissue-level forces would have been.

      (2) What is the mechanical mechanism by which loss of vinculin prevents neural fold elevation? The authors present exciting findings about the cellular consequences of losing Vcl at the late elevation stages when the tissue is quantifiably dysmorphic. A clear argument of how Vcl loss could lead to this dysmorphology would strengthen the paper, particularly given that junctional tension defects are excluded and apical non-constriction at the late stage is only mild.

      (3) Can the authors comment on the likely impacts of Vcl deletion on the basal domain of the cell? For example, they could cite live-imaging of distinct behaviours in Williams et al Dev Cell 2014, and the NTD phenotypes of some integrin/focal adhesion mutant mice.

      (4) The apparent uncoupling of apical area (larger in Vcl KO) from junctional tension (equivalent) in this model is noteworthy. Can the authors speculate on its potential basis?

      (5) Live imaging in Figure 7C appears to show a marked reduction in apical area before cleavage furrow formation (T0-18min), suggesting a large apical constriction event (post-mitotic?), as previously reported (e.g., Ampartzidis et al Dev Biol 2023). Do junctional gaps appear during these constrictions?

      (6) The live imaging setup used is clearly sufficient to identify differences between genotypes, so this is only a minor point. The gassing conditions listed in the methods specify 5% CO2, but E8.5 embryos also need low O2 to complete cranial closure. Was the O2 level controlled? Was tissue-level shape change observed to be consistent with ongoing neurulation during live-imaging?

      (7) Neither the multi-cell laser ablations in the pre-print by De La O cited here, nor the narrower junctional ablations in Bocanegra-Moreno et al., Nat Phys, (2023), identified differences in recoil between developmental stages. Why might those results be different from the findings reported here (e.g., analysis region - not specified in the latter paper)? Limitations to interpreting junctional ablations between cells with different junction lengths include more of the recoil being dissipated by retraction of the longer ablated border.

      (8) Is a truncated Vcl expressed in the ESC model, which could bind catenin without an F-actin anchor? The very high-contrast western shown is cropped so it is not clear whether the catenin-binding N-terminus is present. Does the antibody used recognise the head domain (this reviewer could not readily find the information)?

    1. Rapport de Synthèse : Analyse du Traitement Judiciaire de l'Inceste en France

      Résumé Exécutif

      Ce document synthétise l'audition de Sihem Ghars, fondatrice du collectif Incesticide France, devant la commission d'enquête sur le traitement judiciaire des violences sexuelles incestueuses.

      Le constat dressé est celui d'une défaillance systémique de l'institution judiciaire française, qualifiée de « torture institutionnelle » par le Comité contre la torture de l'ONU en 2025.

      Le témoignage met en lumière une impunité quasi totale (moins de 1 % de condamnations) et un processus de « désenfantement » frappant les mères qui dénoncent les violences.

      Au-delà des chiffres, le document expose les mécanismes de silenciation des victimes et propose des mesures d'urgence, affirmant que le cadre légal existant est suffisant mais que sa mise en œuvre est entravée par un déni institutionnel profond.

      --------------------------------------------------------------------------------

      1. État des Lieux et Données Chiffrées

      Le collectif Incesticide France, créé en 2020, documente la situation de centaines de familles.

      Les données présentées révèlent l'ampleur d'une crise qualifiée de « sanitaire » et de « pédophilique ».

      • Représentativité du collectif : Plus de 600 familles suivies.

      Une enquête citoyenne a recueilli 1 076 témoignages complets et 2 019 pièces justificatives en deux semaines.

      • Impunité judiciaire :

        • Moins de 1 % de condamnations pour viols sur mineurs dans le cadre de l'inceste.
      • Environ 3 % de condamnations pour les autres types de violences sexuelles.

      • Parmi le 1 % de condamnés, la moitié bénéficie d'un sursis.

      • Données médicales d'urgence : Les Unités Médico-Judiciaires (UMJ) ont recensé 614 bébés examinés pour viols en une seule année, soit environ 50 nourrissons par mois.

      • Estimation de la menace : Sur la base d'études internationales, le témoignage estime à 2 millions le nombre d'auteurs de violences sexuelles sur mineurs en liberté en France.

      --------------------------------------------------------------------------------

      2. Défaillances Systémiques et « Torture Institutionnelle »

      Le concept de « torture institutionnelle » s'appuie sur le rapport de mai 2025 du Comité contre la torture de l'ONU.

      La France y est pointée pour de graves violations des droits fondamentaux.

      Une chaîne d'acteurs en faillite

      Le témoignage souligne l'échec successif de tous les intervenants de la chaîne de protection :

      • Enquêteurs et police : Classements sans suite fréquents malgré des preuves matérielles (fissures anales constatées à l'hôpital, ADN, aveux sur PV).

      • Experts psychiatres et psychologues : Manque de formation en psychotrauma.

      Utilisation de rapports biaisés ou incompétents guidant les décisions des juges.

      • Services sociaux (ASE) : Accusés de collusion ou d'incompétence.

      Le terme « SS » est utilisé par certaines familles pour décrire des interventions perçues comme des « prises d'otages » d'enfants.

      • Magistrats : Surcharge de travail empêchant une étude approfondie des dossiers (environ 10 dossiers par jour, soit des milliers de pages à traiter).

      Le mensonge de l'État

      La délégation française auprès de l'ONU aurait affirmé qu'aucune mère n'est poursuivie sans vérification préalable des signalements de viol.

      Le témoignage dément formellement cette assertion, affirmant que les mères sont systématiquement poursuivies pour « non-représentation d'enfant » alors que l'enquête pénale est toujours en cours.

      --------------------------------------------------------------------------------

      3. La Situation des « Mères Empêchées » de Protéger

      Le document rejette le terme de « mères protectrices », lui préférant celui de « mères empêchées » par une justice destructrice.

      | Type de mesure | Conséquences pour la mère | | --- | --- | | Pénale | Poursuites pour non-représentation d'enfant, gardes à vue, incarcérations. | | Civile (JAF) | Astreintes financières massives (ex: 250 € à 600 € par jour de non-représentation). | | Éducationnelle | Accusations de « manipulation », de « syndrome d'aliénation parentale » (même si non reconnu) ou de « vengeance ». | | Droits parentaux | Retrait de l'autorité parentale, visites médiatisées (1h/mois sous surveillance) ou transfert de résidence chez l'agresseur présumé. |

      --------------------------------------------------------------------------------

      4. Les Stratégies des Agresseurs et la « Civilisation Parallèle »

      Le témoignage décrit l'inceste non comme un acte isolé, mais comme une pratique ancrée dans une « civilisation parallèle » utilisant des mécanismes sophistiqués pour garantir l'impunité.

      • Stratégies de silenciation :

        • Culpabilisation : Utilisation des réactions physiologiques de l'enfant (orgasme) pour induire une complicité.
      • Initiation : Forcer l'enfant à agresser un tiers pour l'empêcher de dénoncer.

      • Manipulation institutionnelle : Les agresseurs devancent souvent la mère en saisissant le Juge aux Affaires Familiales (JAF) pour instaurer un climat conflictuel avant que l'enfant ne parle.

      • Profil des agresseurs : Des individus souvent insérés socialement (médecins, banquiers, magistrats) qui « hackent » l'enfance par une volonté de domination.

      --------------------------------------------------------------------------------

      5. Critiques des Dispositifs Existants

      • Le FIGS (Fichier judiciaire des auteurs d'infractions sexuelles) : Qualifié de « grosse blague », car l'inscription nécessite généralement une condamnation à 5 ans de prison ferme, ce qui exclut la majorité des agresseurs bénéficiant de sursis ou de requalifications délictuelles (correctionnalisation).

      • Certificat d'honorabilité : Inefficace tant qu'il repose sur un FIGS incomplet et tant que l'accès au fichier est interdit aux parents (contrairement aux États-Unis).

      • Lois nouvelles : Le témoignage affirme qu'aucune nouvelle loi n'est nécessaire.

      L'interdiction de violer un enfant est claire ; c'est le manque de volonté politique et judiciaire de l'appliquer qui fait défaut.

      --------------------------------------------------------------------------------

      6. Propositions de Mesures d'Urgence et Recommandations

      Face à l'urgence, plusieurs axes d'action immédiate sont proposés :

      • Utilisation des Ordonnances de Placement Provisoire (OPP) : Les procureurs devraient utiliser systématiquement leur pouvoir pour placer les enfants en sécurité dans la famille protectrice dès le signalement (mesure de 8 jours renouvelable).

      • Recensement National : Création d'une cellule de crise pour répertorier et examiner les dossiers de toutes les familles ayant déposé plainte pour inceste et se sentant trahies par la justice.

      • Amnistie/Immunité pour l'état de nécessité : Accorder l'immunité aux mères en exil (« en cavale ») ayant fui pour protéger leurs enfants.

      • Priorité au JAF sur le Juge des Enfants (JE) : Centraliser les décisions de garde chez le JAF pour éviter les placements en foyer (ASE) souvent perçus comme des lieux de sur-victimisation.

      • Ordonnance de « Levée du Doute » : En cas de classement pour « faits insuffisamment caractérisés », imposer un suivi psychiatrique obligatoire à l'agresseur présumé pour évaluation sur le long terme.

      • Transparence des dossiers : Garantir aux parents un accès total et immédiat aux pièces de leur dossier pénal et aux rapports de l'ASE pour assurer le respect du contradictoire.

      Citations Clés

      « Le père violeur a un permis de violer son enfant par ordonnance de justice. »

      « Ce n'est pas une injustice, ce sont des crimes, ce sont des viols par ordonnance de justice. »

      « Le désespoir est l'ennemi de la justice. »

      « On ne naît pas violeur d'enfants, on le devient, bien souvent en ayant été soi-même violé. C'est une impunité à vie qu'ils recherchent. »

  2. social-media-ethics-automation.github.io social-media-ethics-automation.github.io
    1. Caroline Delbert. Some People Think 2+2=5, and They’re Right. Popular Mechanics, October 2023. URL: https://www.popularmechanics.com/science/math/a33547137/why-some-people-think-2-plus-2-equals-5/ (visited on 2023-11-24).

      I read this article about how numbers may seem clear and objective, even though they do not always reflect the real things they are supposed to measure. What stood out to me most was the part about sentiment ratings, IQ, and aggression scales. The article shows that even something as simple as 2+2=4 can become more complicated when the context changes. This made me think about social media too, because numbers like likes, views, and ratings often seem trustworthy even though they only show part of the picture.

    2. Caroline Delbert. Some People Think 2+2=5, and They’re Right. Popular Mechanics, October 2023

      This reminds of of 1984 by George Orwell, where the government brainwashed everyone into believing that 2+2=5 and that is the objective truth. But this article makes me think that 2+2=4 may be the false objective truth that we were all brain washed to believe and that we aren't opening our minds to the subjective truth of reality.

    1. Introduction

      After re-reading the Introduction, I'm finding it fragmented and hard to follow. You start with definition, then quickly jump to multiple algorithms, introducing representations (grids, NavMesh, probabilistic maps) mid-flow. “Issues with pathfinding” comes late, even though it should frame the need for PathMaker. And details about PathMaker come after a long technical buildup. I suggest you organize this chapter as follows:

      1 Introduction

      1.1 Pathfinding: Context and Challenges

      1.1.1 Common Pathfinding Approaches
      
      1.1.2 Challenges in Pathfinding
      

      1.2 PathMaker Overview and Contributions

      1.3 Design Choices (Brief Overview)

      1.4 Limitations of Existing Tools

      1.5 Contributions Summary

      1.6 Ethical Implications

      **What is Pathfinding? ** Keep: Define pathfinding Enhance: Give 2–3 application examples (GPS, games, robotics) Briefly introduce grid-based focus (move this up earlier)

      Move out / reduce: Detailed algorithm explanations (A*, Dijkstra) - shorten or defer emphasis Long discussion of representations - keep minimal here

      A*, Dijkstra, Weighted Grids - combine & condense to provide minimal technical grounding

      Keep very concise summaries (2–4 sentences each) Emphasize differences (optimality, cost handling, use cases) Avoid deep mechanics (no step-by-step descriptions)

      Issues with Pathfinding - Move earlier

      This is the main motivation Briefly connect to representations (grids vs NavMesh vs probabilistic) Keep probabilistic maps/NavMesh as examples, not deep dives

      Project Overview - Move earlier

      Refocus this section to explicitly answer:

      What is PathMaker? What problem does it solve? Why is it different from existing tools?

      Tighten: Avoid repeating motivation language Clearly list capabilities: map creation algorithm execution benchmarking metrics

      Implementation Details - Keep (But de-emphasize in intro)

      Keep short explanations of Rust + SDL2 Frame as: “lightweight, cross-platform, low-overhead” Avoid deep technical detail (belongs in methods)

      Current State of the Art - need to connect to your project to show gaps Structure it as: Visualizers - lack benchmarking Game engines - too complex APIs - too low-level Benchmark libraries - lack usability/integration

      End this section with a clear gap statement

      Motivation - Connect gap -> need for your tool

      Reduce repetition Focus on: - difficulty of evaluating algorithms in practice - need for controlled experimentation

      Goals of the Project Convert into a clean list of features: Custom map creation Algorithm implementation support Automated benchmarking Visualization + analysis

      Avoid repeating earlier explanations ** Ethical Implications**

    1. Reviewer #1 (Public review):

      Summary:

      Using electron microscopy, the authors report discontinuities in the plasma membrane of C. elegans embryos. They associate these discontinuities with cell division and speculate that membrane rupture and subsequent resealing contribute to cytokinesis. They further discuss the proximity of these sites to vesicles and propose a role for vesicle-mediated membrane extension.

      Weaknesses:

      (1) The possibility that the membrane discontinuity is an artifact

      Although the authors focus on discontinuities in the plasma membrane, similar discontinuities are also observed in mitochondria, the nuclear envelope, and yolk granules. This raises concerns about whether the electron micrographs presented are suitable for assessing membrane continuity.

      Electron micrographs result from a lengthy sample preparation process, including high-pressure freezing, freeze substitution in acetone containing OsO4, gradual warming, uranyl acetate staining, resin embedding, and ultrathin sectioning. In general, lipids are soluble in acetone at temperatures above −30 {degree sign}C, and preservation of membrane structures relies heavily on efficient OsO4 fixation. Insufficient OsO4 treatment would be expected to reduce membrane contrast.

      C. elegans embryos are encapsulated by an eggshell that forms at fertilization and gradually develops during the first few cell divisions. It is unclear how efficiently OsO4 in acetone penetrates the eggshell during freeze substitution, raising further concern about plasma membrane preservation under the conditions used.

      (2) Lack of evidence linking membrane discontinuity to cell division

      The reported plasma membrane discontinuities are not specific to mitotic cells. If this were a physiological process playing an important role in cytokinesis, it should occur in a temporally and spatially coordinated manner with nuclear division. However, it remains unclear at what stage of the cell cycle the membrane rupture occurs and where it is located relative to chromosomes and the mitotic spindle.

      (3) Lack of evidence for extension of the separated membrane

      Although the authors speculate that resealing of the ruptured membrane occurs via extension of the separated membrane, no direct evidence supporting this mechanism is presented. Proximity to vesicles alone does not demonstrate that membrane extension occurs through vesicle fusion. More direct evidence is required to support this claim.

      (4) Inconsistency with published work

      Numerous studies have examined cell division in developing C. elegans embryos using the GFP::PH(PLC1δ1) marker expressed from the ltIs38 transgene [pAA1; pie-1::GFP::PH(PLC1δ1) + unc-119(+)], generated by the Oegema lab (https://wormbase.org/species/c_elegans/transgene/WBTransgene00000911#01--10 ). To date, no study has reported membrane ruptures of the magnitude described here. The complexity of cell surface morphology from the 8- to 12-cell stages onward has been well documented, for example, by Fu et al. (2016) using light-sheet microscopy and 3D reconstruction (doi:10.1038/ncomms11088).

      Supplementary Movies 5, 6, and 10 of this paper illustrate how single-plane images can easily produce apparent membrane discontinuities, for example, due to membrane orientations nearly parallel to the imaging plane.

      The three single-plane images from only three embryos presented in Figure 6 are insufficient to support the authors' strong conclusions. Raw 3D data should be provided.

    2. Reviewer #2 (Public review):

      Summary:

      Liang et al. explore an unusual observation of membrane discontinuities in dividing C. elegans embryonic cells. This report is the first to demonstrate that, instead of the classical invagination of membranes during cytokinesis, cells in the early embryos of C. elegans exhibit separation of sister membranes that extend independently. TEM images of high-pressure-frozen samples provide strong evidence for the presence of Membrane Openings (MOs) in cells at various stages of the cell cycle, predominantly during mitosis. High-resolution images (x 30,000) clearly show the wrinkled plasma membrane and smooth MOs.<br /> The electron microscopy data are supported by the live cell imaging of strains with fluorescently tagged membrane markers. This study opens up the possibility of tracking MOs at other stages of C. elegans development, and also asks if it might be a common phenomenon in other species that exhibit rapid embryonic growth and divisions.

      Strengths:

      (1) Thorough verification of Membrane Openings (MO) by several methods:

      (a) 4 independent sample batches.

      (b) Examined historical collections.

      (c) Analysed embryos at different stages of development. The absence of MOs in later stages (comma) serves as a negative control and gives confidence that MOs are genuine and not technical artifacts.

      (2) Live cell imaging of strain with fluorescently labelled membranes provides real-time dynamics of membrane rupture.

      (3) After observing the membrane rupture, the next obvious question is - what prevents the cytosol from leaking out? The EM images showing PBL and PEL - extracellular matrix serving as barriers for the cytosol are convincing.

      Weakness:

      (1) The association of membrane discontinuities with cell division is not convincing, as there are 159 cells out of 425 showing MOs, but it is not mentioned clearly how many of these are undergoing cell division. Also, it's not clear whether the 20 dividing cells analysed for MOs are a part of the 159 cells or a separate dataset. A graphical representation of the number of samples and observed frequencies would be helpful to understand the data collection workflow.

      (2) In Figures 3A and 3B, the resolution of the images is not enough to verify 3A as classical membrane invagination and 3B as detached sister membranes.

      (3) Figure 6 lacks controls. How does the classical invagination look in this strain? Also, adding nuclear dye would be informative, in order to correlate the nuclear division with membrane rupture, as claimed.

    3. Author response:

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      Using electron microscopy, the authors report discontinuities in the plasma membrane of C. elegans embryos. They associate these discontinuities with cell division and speculate that membrane rupture and subsequent resealing contribute to cytokinesis. They further discuss the proximity of these sites to vesicles and propose a role for vesicle-mediated membrane extension. 

      Weaknesses:

      (1) The possibility that the membrane discontinuity is an artifact

      Although the authors focus on discontinuities in the plasma membrane, similar discontinuities are also observed in mitochondria, the nuclear envelope, and yolk granules. This raises concerns about whether the electron micrographs presented are suitable for assessing membrane continuity.

      Electron micrographs result from a lengthy sample preparation process, including high-pressure freezing, freeze substitution in acetone containing OsO4, gradual warming, uranyl acetate staining, resin embedding, and ultrathin sectioning. In general, lipids are soluble in acetone at temperatures above −30 {degree sign}C, and preservation of membrane structures relies heavily on efficient OsO4 fixation.

      Insufficient OsO4 treatment would be expected to reduce membrane contrast.

      C. elegans embryos are encapsulated by an eggshell that forms at fertilization and gradually develops during the first few cell divisions. It is unclear how efficiently OsO4 in acetone penetrates the eggshell during freeze substitution, raising further concern about plasma membrane preservation under the conditions used.

      We thank the reviewer for raising this important technical concern. We have taken this question seriously since first observing membrane discontinuities six years ago, and we have since conducted extensive controls to rule out fixation artifacts. Below, we present multiple lines of evidence—ranging from technical reproducibility to orthogonal imaging approaches—that collectively demonstrate the biological reality of these structures.

      (1) Technical expertise and standard protocols

      Our laboratory has extensive experience with electron microscopy across diverse biological systems, including neurons, muscle cells, and hypodermis in C. elegans, as well as tissues from Drosophila, mouse, bacteria, and cultured cells (Chen et al., 2013; Ding et al., 2018; Guan et al., 2022; Y. Li et al., 2018; Miao et al., 2024; Qin et al., 2014; Wang et al., 2026; J. Xu et al., 2022; M. Xu et al., 2021; L. Yang et al., 2020; X. Yang et al., 2019; Zhu et al., 2022). Importantly, we did not introduce any novel or unconventional steps in our EM preparation; all protocols were standard and well-established. Thus, the observed membrane discontinuities are unlikely to stem from technical inexperience or idiosyncratic methods.

      In addition to membrane discontinuities, we would like to emphasize that a large number of single plasma membranes separating adjacent cytoplasmic domains were also detected under EM (Figure 1, 3 and 4, for instance). This observation is particularly significant because the invagination model cannot generate single plasma membrane barriers between adjacent cytoplasmic domains. Instead, independent extension of detached sister membranes could explain the formation of cytoplasm-enclosed membranes. Furthermore, as the morphology and continuity of these single cytoplasm-immersed membrane structures are well preserved, this indicates successful EM processing and argues against inefficient fixation or other technical issues.

      (2) Reproducibility across independent preparations and techniques

      To test whether the discontinuities were preparation-specific, we examined four independent sample batches collected in the lab over the years. Membrane discontinuities, as well as cytoplasm-immersed membranes, on embryonic cells were consistently observed across all batches, indicating that the phenomenon is not dependent on a single preparation method. Furthermore, we validated our findings using two EM techniques: transmission electron microscopy (HPF-TEM) and dualbeam scanning electron microscopy (SEM). Membrane discontinuities were clearly identifiable with both techniques, further supporting their robustness.

      (3) Validation using an independent public dataset

      We examined the publicly available C. elegans embryo EM collection (WormAtlas). In several instances, particularly at the embryonic periphery where plasma membrane discontinuities are more readily visualized (https://www.wormimage.org/image.php?id=140265&page=1), we identified similar structures. The presence of these features in an independent dataset generated by different researchers confirms that they are not artifacts unique to our sample preparation.

      (4) Developmental regulation of membrane discontinuities

      We analyzed embryos across multiple developmental stages. Membrane discontinuities were observed in both intrauterine and laid embryos at early stages. However, as embryos reached the comma stage—a period marked by the onset of elongation and reduced cell proliferation—the incidence of discontinuities dropped dramatically (0/13, 0/17, and 0/30 cells examined). This developmental specificity argues strongly against a general fixation artifact, which would be expected to occur randomly across stages. Additionally, the eggshell is present throughout the embryonic stage of C. elegans; therefore, the dramatic reduction of membrane discontinuities in comma-stage of embryo argues against the possibility that the eggshell poses a fixation problem.

      (5) Rigorous criteria for identifying membrane discontinuities

      To ensure unbiased analysis, we systematically collected images from early embryonic cells using the following criteria:

      (1) Random section selection: For each sample, we randomly selected one section containing the largest number of embryos or cells (Sup Figure 2) for initial analysis. We found membrane discontinuities in 159 cells distributed across 57 embryos, representing 95% of the total sampled embryos This portion of the data is summarized in Figure 1.

      (2) Whole-membrane examination: Each putative membrane discontinuity was identified only after examining the entire plasma membrane of the cell on a given section. Importantly, aside from the discontinuity, the remainder of the plasma membrane remained intact. Moreover, in most cells, only a single discontinuity was present per section, arguing against random, widespread membrane tearing during preparation.

      (3) Neighboring section verification: Because EM preparation yields serial sections, we verified nearly all membrane discontinuities by examining adjacent sections. Again, the same membrane discontinuity was confirmed only after inspecting the entire plasma membrane on those neighboring sections as well. We will include this verification protocol in the revised Methods and additional imaging of consecutive sections would be provided if needed.

      (4) Serial section reconstruction: To further determine whether a dividing cell indeed contains one membrane rupture, we performed two serial reconstruction experiments.

      First, we used HPF-TEM to analyze 105 consecutive sections of a metaphase cell, reconstructing the entire plasma membrane and chromosome configuration. We found that one membrane rupture largely encircled the chromosomal disc (Figure 2 and Video S1), spatially aligning with the future segregation zone. Second, we used AutoCUTS-SEM to collect approximately 600 sections covering ~95% of a telophase cell containing three nuclei sharing a common cytoplasm. This tri-nucleated cell was enclosed by three distinct plasma membranes, each harboring a single rupture site. These three ruptures converged to form a Y-shaped exposed cytoplasmic region spanning >351 sections (Figure 5). Collectively, these reconstructions demonstrate that each cell contains only one discontinuity from a 3D point of view, further supporting that the phenomenon is not due to random sample preparation damage.

      (6) Orthogonal validation by live imaging: In addition to EM, we performed live imaging of plasma membrane dynamics. While live imaging provides important temporal context, we recognize its limitations in resolving membrane ultrastructure. The rapid kinetics of membrane extension (approximately 20–30 seconds for metaphase and less than 3 minutes for cytokinesis), combined with embryo motility, introduces spatiotemporal ambiguities. To capture dynamic membrane events, our live imaging using the GFP::PH membrane marker was performed at 4-second intervals, approaching the practical limit for single-section scanning of the embryo. With single-plane live imaging, nevertheless, both membrane ruptures and free-ended sister membrane structures could be detected (Figures 6), providing additional evidence that membrane rupture and independent extension of detached sister membranes underlie cytokinesis in C. elegans embryos. Notably, 3D membrane dynamics analysis using light-sheet microscopy (Fu et al. Imaging multicellular specimens with real-time optimized tiling light-sheet selective plane illumination microscopy. Nature Communications. 2016. DOI:10.1038/ncomms11088) revealed membrane ruptures in dividing early C. elegans embryonic cells, including during telophase or metaphase. Therefore, live imaging further validates the membrane rupture phenomena in dividing embryonic cells in C. elegans

      While future advances in imaging technology may enable real-time visualization at near-EM resolution, our extensive, multi-year effort to test the artifact hypothesis has convinced us that these membrane discontinuities are genuine biological features of dividing C. elegans embryonic cells.

      We are confident that the cumulative evidence presented here addresses the reviewer's concerns and demonstrates that the observed membrane discontinuities, as well as cytoplasm-immersed membranes, are not procedural artifacts but rather reflect a previously underappreciated aspect of plasma membrane dynamics during embryonic cell division.

      (2) Lack of evidence linking membrane discontinuity to cell division 

      The reported plasma membrane discontinuities are not specific to mitotic cells. If this were a physiological process playing an important role in cytokinesis, it should occur in a temporally and spatially coordinated manner with nuclear division. However, it remains unclear at what stage of the cell cycle the membrane rupture occurs and where it is located relative to chromosomes and the mitotic spindle.

      Thank you for this insightful comment. We agree that establishing a direct link between plasma membrane discontinuities and mitotic progression is critical, and we appreciate the opportunity to clarify this point.

      In C. elegans embryos, the early stages of development are characterized by rapid and extensive cell division. Within approximately 100 minutes, a two-cell embryo develops into an embryo containing nearly 30 cells. The majority of the electron microscopy analyses in our study were performed on embryos at stages with fewer than 30 cells, where most cells are actively dividing. Thus, it is reasonable to infer that the cells exhibiting membrane discontinuities are predominantly mitotic cells.

      Supporting this notion, as embryos reached the comma stage—a period marked by the onset of elongation and reduced cell proliferation—the incidence of membrane discontinuities dropped dramatically (0/13, 0/17, and 0/30 cells examined). This developmental specificity strongly suggests that membrane discontinuities are tightly linked to cell division.

      Importantly, mitotic features such as metaphase chromosomes aligned at the equatorial plane or two (or more) nuclei sharing common cytoplasm can be identified in EM images. In our single random EM section analysis, we captured membrane discontinuities in cells at metaphase, anaphase (characterized by fewer than 10 chromosomal clumps), and telophase (defined by two nuclei sharing cytoplasm). Hence, membrane discontinuities are indeed present on mitotic cells. In addition, a published work by Fu et al (Fu et al. Imaging multicellular specimens with real-time optimized tiling light-sheet selective plane illumination microscopy. Nature Communications. 2016. DOI:10.1038/ncomms11088) using light-sheet microscopy captured similar membrane discontinuities in cells displaying classical mitotic features, including anaphase or telophase.

      To further investigate the spatial relationship between membrane ruptures and chromosome organization, we performed three-dimensional reconstructions on a metaphase cell. As shown in Figure 2 and Video S1, the membrane discontinuities largely encircled the condensed chromosome disc and were spatially aligned with the future segregation zone, further revealing the relative location of membrane discontinuities to chromosomes, at least at metaphase.

      We further collected 3D information for a telophase cell containing three nuclei. This tri-nucleated cell was enclosed by three distinct plasma membranes, each harboring a single rupture site that merged to form a single rupture. The observation that membrane ruptures are present in a tri-nucleated cell is particularly informative. The tri-nucleated feature indicates that this cell underwent two rounds of cell division and that both divisions were at telophase. The presence of a single membrane rupture suggests that membrane discontinuities may persist throughout the cell cycle, as the second cell cycle began from a mother cell that still shared cytoplasm with its sister cell and already had one membrane rupture. Therefore, in addition to the mitotic phase, membrane discontinuities—at least in this context—also exist during the DNA synthesis stage.

      (3) Lack of evidence for extension of the separated membrane 

      Although the authors speculate that resealing of the ruptured membrane occurs via extension of the separated membrane, no direct evidence supporting this mechanism is presented. Proximity to vesicles alone does not demonstrate that membrane extension occurs through vesicle fusion. More direct evidence is required to support this claim.

      Thank you for raising this important point. We appreciate the opportunity to clarify our conclusion.

      In our study, EM analysis revealed the presence of cellular vesicles in close proximity to both free membrane edges and the already separated sister plasma membranes (Figure 4). However, we acknowledge that without advanced live-cell imaging, it is not possible to conclusively determine whether the extension of these separated sister membranes occurs through vesicle fusion.

      We realize that a statement in the Discussion section—“The expansion of the plasma membrane is generally driven by vesicle fusion”(page 16)—may have inadvertently led the reviewer to interpret this as our own conclusion regarding the mechanism of membrane extension in this context. In fact, that statement was intended to reflect the current general understanding of membrane expansion, not to imply that we had demonstrated such a mechanism for the free-ended sister membranes. As we subsequently noted, “However, this remains speculative and requires further experimental validation.”

      To avoid any misunderstanding, we will revise this section to clearly state that the mechanism by which the separated sister membranes extend remains unknown and that further investigation is needed to determine how existing models of membrane expansion may apply to or be adapted for this novel context.

      We thank the reviewer again for their thoughtful comment, which has helped us improve the clarity of our manuscript

      (4) Inconsistency with published work

      Numerous studies have examined cell division in developing C. elegans embryos using the GFP::PH(PLC1δ1) marker expressed from the ltIs38 transgene [pAA1; pie-1::GFP::PH(PLC1δ1) + unc-119(+)], generated by the Oegema lab (https://wormbase.org/species/c_elegans/transgene/WBTransgene00000911#01--10 ). To date, no study has reported membrane ruptures of the magnitude described here. The complexity of cell surface morphology from the 8- to 12-cell stages onward has been well documented, for example, by Fu et al. (2016) using light-sheet microscopy and 3D reconstruction (doi:10.1038/ncomms11088).

      Supplementary Movies 5, 6, and 10 of this paper illustrate how single-plane images can easily produce apparent membrane discontinuities, for example, due to membrane orientations nearly parallel to the imaging plane.

      The three single-plane images from only three embryos presented in Figure 6 are insufficient to support the authors' strong conclusions. Raw 3D data should be provided.

      Thank you for this important comment. We fully agree that the GFP::PH(PLC1δ1) marker, generated by the Oegema lab, has been widely and effectively used to study various aspects of C. elegans embryonic development. In fact, we also employed this same marker in our study to assess membrane integrity.

      However, while live imaging provides invaluable temporal resolution, its limitations in resolving membrane ultrastructure are substantial. In C. elegans embryos, early development is marked by rapid and extensive cell divisions. Within approximately 100 minutes, a two-cell embryo develops into one containing nearly 30 cells. During this fast-dividing stage, the rapid kinetics of membrane extension—approximately 20–30 seconds during metaphase and less than 3 minutes during cytokinesis— combined with embryo motility, introduce considerable spatiotemporal ambiguities. Furthermore, the longstanding invagination model of cytokinesis has shaped interpretations in the field, which may have led to ambiguous structures such as free-ended extensions being dismissed as potential artifacts rather than recognized as alternative morphological features. Theoretical and computational models have largely been built upon invagination-centric assumptions, which may have further constrained conceptual frameworks. Therefore, fluorescence protein-based live imaging analysis alone could not serve as a convincing approach to challenge the current dogma of cell division, nor did we intend it to.

      However, when reexamined in light of our findings, previous studies using this same GFP marker have in fact revealed membrane discontinuities that went unnoticed. For example, Fu et al (Fu et al. Imaging multicellular specimens with real-time optimized tiling light-sheet selective plane illumination microscopy. Nature Communications. 2016. DOI:10.1038/ncomms11088) using light-sheet microscopy and 3D reconstruction, captured membrane discontinuities in cells undergoing mitotic phases such as anaphase or telophase. Similarly, an earlier study by Harrell and Goldstein (Harrell and Goldstein. 2011. Internalization of multiple cells during C. elegans gastrulation depends on common cytoskeletal mechanisms but different cell polarity and cell fate regulators. Developmental Biology. DOI:10.1016/j.ydbio.2010.09.012) showed regions where the GFP::PH signal appeared fuzzy and discontinuous.

      Nevertheless, given the inherent limitations of fluorescence microscopy in resolving membrane ultrastructure, high-resolution electron microscopy—supported by rigorous controls and serial section analysis—remains the gold standard for definitively identifying such membrane discontinuities.

      We acknowledge that our findings are surprising. We did not set out to challenge the long-held view of membrane integrity during cell division. In fact, this study began when our dedicated EM technician, Jingjing Liang, first observed membrane discontinuity phenomena in control samples—wild-type embryos. Had she not come across this observation, we likely would never have pursued this line of inquiry.

      We appreciate the opportunity to clarify these points and thank the reviewer for thoughtful engagement with our work.

      Reviewer #2 (Public review):

      Summary:

      Liang et al. explore an unusual observation of membrane discontinuities in dividing C. elegans embryonic cells. This report is the first to demonstrate that, instead of the classical invagination of membranes during cytokinesis, cells in the early embryos of C. elegans exhibit separation of sister membranes that extend independently. TEM images of high-pressure-frozen samples provide strong evidence for the presence of Membrane Openings (MOs) in cells at various stages of the cell cycle, predominantly during mitosis. High-resolution images (x 30,000) clearly show the wrinkled plasma membrane and smooth MOs.

      The electron microscopy data are supported by the live cell imaging of strains with fluorescently tagged membrane markers. This study opens up the possibility of tracking MOs at other stages of C. elegans development, and also asks if it might be a common phenomenon in other species that exhibit rapid embryonic growth and divisions. 

      Strengths:

      (1) Thorough verification of Membrane Openings (MO) by several methods: 

      (a) 4 independent sample batches.

      (b) Examined historical collections.

      (c) Analysed embryos at different stages of development. The absence of MOs in later stages (comma) serves as a negative control and gives confidence that MOs are genuine and not technical artifacts. 

      (2) Live cell imaging of strain with fluorescently labelled membranes provides realtime dynamics of membrane rupture.

      (3) After observing the membrane rupture, the next obvious question is - what prevents the cytosol from leaking out? The EM images showing PBL and PEL - extracellular matrix serving as barriers for the cytosol are convincing.

      Thanks to the reviewer for the encouragement. Highly appreciated.

      Weakness:

      (1) The association of membrane discontinuities with cell division is not convincing, as there are 159 cells out of 425 showing MOs, but it is not mentioned clearly how many of these are undergoing cell division. Also, it's not clear whether the 20 dividing cells analysed for MOs are a part of the 159 cells or a separate dataset. A graphical representation of the number of samples and observed frequencies would be helpful to understand the data collection workflow.

      We sincerely thank the reviewer for raising this important question and appreciate the opportunity to clarify these points.

      (1) Relationship between membrane discontinuities and cell division

      In C. elegans embryos, early development is characterized by rapid and extensive cell division: within approximately 100 minutes, a two-cell embryo develops into one containing nearly 30 cells. Most of our electron microscopy (EM) analyses were performed on embryos at stages with fewer than 30 cells, in which the majority of cells are actively dividing. Therefore, it is reasonable to infer that the cells exhibiting membrane discontinuities (MOs) are predominantly mitotic. Supporting this, as embryos reached the comma stage—when cell proliferation declines and elongation begins—the incidence of MOs dropped sharply (0/13, 0/17, and 0/30 cells examined. This developmental specificity strongly links MOs to cell division.

      Moreover, in single random EM sections, we observed MOs in cells displaying clear mitotic features, such as metaphase chromosomes aligned at the equatorial plate, or anaphase/telophase configurations (fewer than 10 chromosomal clumps or two nuclei sharing common cytoplasm). Thus, MOs are indeed present in mitotic cells.

      From our 3D reconstruction (Figure 5), we identified a telophase cell containing three nuclei, each enclosed by its own plasma membrane, with each membrane harboring a single rupture that converged into a single opening. This tri-nucleated configuration indicates that the cell had undergone two rounds of division and was at telophase in both. The presence of a single membrane rupture in this context suggests that MOs can persist beyond mitosis, as the second cell cycle initiated from a mother cell that already shared cytoplasm with its sister and already contained a rupture. Thus, in this case, MOs were also present during DNA synthesis stage.

      (2) Clarification of sample numbers and datasets

      In Figure 1, we present results from a single EM section per embryonic cell, with sections randomly selected per embryo as detailed in Sup Figure 2. This initial dataset (425 cells) forms the basis of Figure 1.

      From the same pool of 425 cells, we used additional EM sections—distinct from those shown in Sup Figure 2—to locate 20 dividing cells for analysis of membrane discontinuities. Thus, while these 20 cells originated from the same set of embryos, they were not derived from the sections used in Figure 1 or Sup Figure 2.

      A graphical summary of sample numbers from the single-section analysis is already provided in Figure 1. Notably, cells with two clearly visible nuclei are more likely to be sectioned through or near their maximal diameter. In contrast, the randomly selected sections used for Figure 1 captured cells at variable planes, reducing the likelihood of observing MOs. Consistent with this, in the three embryos where no MOs were detected (one example is Sup Figure 2N), the sections likely passed through peripheral regions of the cells. Consequently, the frequency of MOs in randomly sectioned cells (Figure 1) is not directly comparable to that observed in the 20 dividing cells, which were analyzed using sections more likely to capture cells near their maximal diameter. These 20 dividing cells should therefore be considered a separate analysis. We will add detailed explanations in the Methods section to ensure this distinction is clearly understood.

      We are grateful for the reviewer’s thoughtful feedback and believe these clarifications will improve the clarity and rigor of the manuscript.

      (2) In Figures 3A and 3B, the resolution of the images is not enough to verify 3A as classical membrane invagination and 3B as detached sister membranes.

      Thank you for your valuable comment. In the revised manuscript, we will provide additional images at higher magnification to better illustrate the classical membrane invagination in Figure 3A and the detached sister membranes in Figure 3B.

      (3) Figure 6 lacks controls. How does the classical invagination look in this strain? Also, adding nuclear dye would be informative, in order to correlate the nuclear division with membrane rupture, as claimed. 

      Thank you for this important comment. As we addressed how we correlated nuclear division with membrane rupture in response to weakness (1), below we will focus on how we may distinguish classical invagination from membrane rupture.

      While live imaging provides invaluable temporal resolution, its limitations in resolving membrane ultrastructure are substantial. In C. elegans embryos, early development is marked by rapid and extensive cell divisions. Within approximately 100 minutes, a two-cell embryo develops into one containing nearly 30 cells. During this fast-dividing stage, the rapid kinetics of membrane extension—approximately 20–30 seconds during metaphase and less than 3 minutes during cytokinesis— combined with embryo motility, introduce considerable spatiotemporal ambiguities. Furthermore, the longstanding invagination model of cytokinesis has shaped interpretations in the field, which may have led to ambiguous structures such as free-ended extensions being dismissed as potential artifacts rather than recognized as alternative morphological features. Theoretical and computational models have largely been built upon invagination-centric assumptions, which may have further constrained conceptual frameworks. Therefore, fluorescence protein-based live imaging analysis alone could not serve as a convincing approach to challenge the current dogma of cell division, nor did we intend it to.

      However, when reexamined in light of our findings, previous studies using GFP::PH or similar markers have in fact revealed membrane discontinuities that went unnoticed. For example, using light-sheet microscopy and 3D reconstruction, Fu et al captured membrane discontinuities in cells undergoing division such as anaphase or telophase (Fu et al. Imaging multicellular specimens with real-time optimized tiling light-sheet selective plane illumination microscopy. Nature Communications. 2016.DOI:10.1038/ncomms11088)

      Similarly, an earlier study by Goldstein et al. (Harrell and Goldstein. 2011. Internalization of multiple cells during C. elegans gastrulation depends on common cytoskeletal mechanisms but different cell polarity and cell fate regulators. Developmental Biology. DOI:10.1016/j.ydbio.2010.09.012) showed regions where the GFP::PH signal appeared fuzzy and discontinuous.

      Here, to capture dynamic membrane events, our live imaging using the GFP::PH membrane marker was performed at 4-second intervals, approaching the practical limit for single-section scanning of the embryo. With single-plane live imaging, both membrane ruptures and free-ended sister membrane structures (Figures 6) could be detected, providing additional evidence that membrane rupture and independent extension of detached sister membranes underlie cytokinesis in C. elegans embryos.

      However, given the inherent limitations of fluorescence microscopy in resolving membrane ultrastructure, high-resolution electron microscopy—supported by rigorous controls and serial section analysis—remains the gold standard for definitively distinguishing invagination from membrane discontinuities.

      While future advances in imaging technology may enable real-time visualization at near-EM resolution, our extensive, multi-year effort to test the artifact hypothesis has convinced us that these membrane discontinuities are genuine biological features of dividing C. elegans embryonic cells.

      Reviewer #3 (Public review):

      Summary:

      In this manuscript, the authors challenge a dogma in cell biology, namely that cells are at any time point engulfed by a continuous plasma membrane. Liang et al. find that during C elegans embryogenesis, a high number of cells are not entirely surrounded by a plasma membrane but show membrane openings (MOs). These openings are enriched at the embryo's periphery, towards the eggshell. The authors propose that plasma membrane discontinuities emerge during metaphase of mitosis and that independent extension of "sister membranes" engulfs the daughter cells.

      Strengths:

      On the positive side, the authors find plasma membrane discontinuities not only by electron microscopy but also by fluorescence microscopy and provide information about the dynamics of membrane openings and their emergence. While this is assuring, the authors conclude that MOs emerge during metaphase. From what the authors show, this particular information cannot be deduced, as there is no dynamic capture of a membrane scission event together with a chromatin marker that would indicate mitosis. The authors could, however, attempt to find such events in live movies, given the high incidence of MOs reported from their EM data.

      Thanks to the reviewer for the encouragement. Highly appreciated.

      Weaknesses:

      In order to convincingly demonstrate the absence of any plasma membrane in the respective regions of the embryonic periphery or between cells of the embryo, the authors would have to show consecutive serial TEM sections where MOs are detected over more z-planes, beyond the mere 3D reconstructions. Although the authors state in the methods section that continuous ultrathin sections were cut for the metaphase sample (page 21, line 472), consecutive sections are never shown in TEM. While we do see the 3D reconstructions, better documentation of the underlying TEM data is missing. It would be necessary to show a membrane opening in consecutive z sections. Alternatively, the authors could seek the possibility to convincingly back up their claims with volume imaging by focused ion beam scanning EM (FIBSEM), where cellular volumes can be sectioned in almost isotropic resolution

      We Thank the reviewer for raising these important technical concerns. We have taken this question seriously since first observing membrane discontinuities six years ago, and we have since conducted extensive controls to rule out fixation artifacts.

      First of all, in addition to membrane discontinuities, we would like to highlight that a large number of single plasma membranes separating adjacent cytoplasmic domains were detected by EM (Figure 1, 3 and 4). This observation is particularly significant because the invagination model cannot account for the formation of single plasma membrane barriers between adjacent cytoplasmic domains. Instead, independent extension of detached sister membranes offers a plausible explanation for the generation of cytoplasm-immersed membranes. Furthermore, the morphology and continuity of these single cytoplasm-immersed membrane structures are well preserved, indicating successful EM processing and arguing against potential issues such as inadequate fixation or other technical limitations.

      Second, we applied rigorous criteria for identifying membrane discontinuities:

      (1) To test whether the discontinuities were preparation specific, we examined four independent sample batches and validated our findings using two EM techniques: transmission electron microscopy (HPF-TEM) and dual-beam scanning electron microscopy (SEM).

      (2) We analyzed embryos across multiple developmental stages. Membrane discontinuities were observed in both intrauterine and laid embryos at early stages. However, as embryos reached the comma stage—a period marked by the onset of elongation and reduced cell proliferation—the incidence of discontinuities dropped dramatically (0/13, 0/17, and 0/30 cells examined). This developmental specificity argues strongly against a general fixation artifact, which would be expected to occur randomly across stages. Additionally, the eggshell is present throughout the embryonic stage of C. elegans; therefore, the dramatic reduction of membrane discontinuities in comma-stage of embryo argues against the possibility that the eggshell poses a fixation problem.

      (3) Each putative membrane discontinuity was identified only after examining the entire plasma membrane of the cell on a given section. Importantly, aside from the discontinuity, the remainder of the plasma membrane remained intact. Moreover, in most cells, only a single discontinuity was present per section, arguing against random, widespread membrane tearing during preparation. Because EM preparation yields serial sections, we verified nearly all membrane discontinuities by examining adjacent sections. Again, the same membrane discontinuity was confirmed only after inspecting the entire plasma membrane on those neighboring sections as well. We will include this verification protocol in the revised Methods and additional imaging of consecutive sections would be provided if needed.

      To further determine whether a dividing cell indeed contains one membrane rupture, we performed two serial reconstruction experiments using consecutive sections, as the reviewer suggested. First, we used HPF-TEM to analyze 105 consecutive sections of a metaphase cell, reconstructing the entire plasma membrane and chromosome configuration. We found that one membrane rupture largely encircled the chromosomal disc (Figure 2 and Video S1), spatially aligning with the future segregation zone. Second, we used AutoCUTS-SEM to collect approximately 600 sections covering ~95% of a telophase cell containing three nuclei sharing a common cytoplasm. This tri-nucleated cell was enclosed by three distinct plasma membranes, each harboring a single rupture site. These three ruptures converged to form a Yshaped exposed cytoplasmic region spanning >351 sections (Figure 5). Collectively, these reconstructions demonstrate that each cell contains only one discontinuity from a 3D point of view, further supporting that the phenomenon is not due to random sample preparation damage.

      (4) In addition to EM, we performed live imaging of plasma membrane dynamics. While live imaging provides important temporal context, we recognize its limitations in resolving membrane ultrastructure. The rapid kinetics of membrane extension (approximately 20–30 seconds for metaphase and less than 3 minutes for cytokinesis), combined with embryo motility, introduces spatiotemporal ambiguities. To capture dynamic membrane events, our live imaging using the GFP::PH membrane marker was performed at 4-second intervals, approaching the practical limit for single-section scanning of the embryo. With single-plane live imaging, nevertheless, both putative membrane ruptures (Figure 6A) and free-ended sister membrane structures could be detected (Figures 6B and 6C), providing additional evidence that membrane rupture and independent extension of detached sister membranes underlie cytokinesis in C. elegans embryos. Notably, 3D membrane dynamics analysis using light-sheet microscopy (Fu et al. Imaging multicellular specimens with real-time optimized tiling light-sheet selective plane illumination microscopy. Nature Communications. 2016. DOI:10.1038/ncomms11088). revealed membrane ruptures in dividing early C. elegans embryonic cells, including during telophase and metaphase. Therefore, live imaging further validates the membrane rupture phenomena in dividing embryonic cells in C. elegans

      We are confident that the cumulative evidence presented here addresses the reviewer's concerns and demonstrates that the observed membrane discontinuities, as well as cytoplasm-immersed membranes, are not procedural artifacts but rather reflect a previously underappreciated aspect of plasma membrane dynamics during embryonic cell division.

      Another critical issue concerns the detection of the membrane discontinuities in electron micrographs, which, in my opinion, is ambiguous. How do the authors reliably discriminate in their TEM images whether there is a plasma membrane or not? The absence - or weak appearance - of the stain of the electron dense material at membranes, which seems to be their criterion for MOs, is also apparent at other, intracellular membranes, like at the NE or at the ER (for example, see Figure 1C). Also, the plasma membrane itself appears unevenly stained in regions that the authors delineate as intact (for example, Figure 1C, 2B/1).

      We thank the reviewer for raising this important concern.

      First, our laboratory has extensive experience with electron microscopy across diverse biological systems, including neurons, muscle cells, and hypodermis in C. elegans, as well as tissues from Drosophila, mouse, bacteria, and cultured cells (Chen et al., 2013; Ding et al., 2018; Guan et al., 2022; Y. Li et al., 2018; Miao et al., 2024; Qin et al., 2014; Wang et al., 2026; J. Xu et al., 2022; M. Xu et al., 2021; L. Yang et al., 2020; X. Yang et al., 2019; Zhu et al., 2022). Importantly, we did not introduce any novel or unconventional steps in our EM preparation; all protocols were standard and well established. Thus, the observed membrane discontinuities are unlikely to result from technical inexperience or idiosyncratic methods.

      Second, because EM preparation yields serial sections, we verified nearly all membrane discontinuities by examining adjacent sections. Specifically, a membrane discontinuity was confirmed only after inspecting the entirety of the plasma membrane in neighboring sections. We will include this verification protocol in the revised Methods section, and additional images of consecutive sections can be provided if needed.

      Third, in addition to membrane discontinuities, a large number of single plasma membranes separating adjacent cytoplasmic domains were detected by EM (Figure 1, 3 and 4). This observation is particularly significant because the invagination model cannot account for the formation of single plasma membrane barriers between adjacent cytoplasmic domains. Instead, independent extension of detached sister membranes offers a plausible explanation for the generation of cytoplasm-immersed membranes. Furthermore, the morphology and continuity of these single cytoplasm-immersed membrane structures are well preserved, indicating successful EM processing and arguing against potential issues such as inadequate fixation or other technical limitations.

      EM-related publications by Jingjing Liang:

      Chen D, Jian Y, Liu X, Zhang Y, Liang J, Qi X, Du H, Zou W, Chen L, Chai Y, Ou G, Miao L, Wang Y, Yang C. 2013. Clathrin and AP2 Are Required for Phagocytic Receptor-Mediated Apoptotic Cell Clearance in Caenorhabditis elegans. PLoS Genetics 9:e1003517. DOI: https://doi.org/10.1371/journal.pgen.1003517

      Ding L, Yang X, Tian H, Liang J, Zhang F, Wang G, Wang Y, Ding M, Shui G, Huang X. 2018. Seipin regulates lipid homeostasis by ensuring calcium‐dependent mitochondrial metabolism. The EMBO Journal 37:e97572. DOI: https://doi.org/10.15252/embj.201797572

      Guan L, Yang Y, Liang J, Miao Y, Shang A, Wang B, Wang Y, Ding M. 2022. ERGIC2 and ERGIC3 regulate the ER‐to‐Golgi transport of gap junction proteins in metazoans. Traffic 23:140–157. DOI: https://doi.org/10.1111/tra.12830

      Li Y, Zhang Y, Gan Q, Xu M, Ding X, Tang G, Liang J, Liu K, Liu X, Wang X, Guo L, Gao Z, Hao X, Yang C. 2018. C . elegans -based screen identifies lysosome-damaging alkaloids that induce STAT3-dependent lysosomal cell death. Protein & Cell 9:1013–1026. DOI: https://doi.org/10.1007/s13238-018-0520-0

      Miao Y, Du Y, Wang B, Liang J, Liang Y, Dang S, Liu J, Li D, He K, Ding M. 2024. Spatiotemporal recruitment of the ubiquitin-specific protease USP8 directs endosome maturation. eLife 13:RP96353. DOI: https://doi.org/10.7554/eLife.96353

      Qin J, Liang J, Ding M. 2014. Perlecan Antagonizes Collagen IV and ADAMTS9/GON-1 in Restricting the Growth of Presynaptic Boutons. Journal of Neuroscience 34:10311–10324. DOI: https://doi.org/10.1523/JNEUROSCI.5128-13.2014

      Wang Z, Zhang L, Zhou B, Liang J, Tian Y, Jiang Z, Tao J, Yin C, Chen S, Zhang W, Zhang J, Wei W. 2026. A single MYB transcription factor GmMYB331 regulates seed oil accumulation and seed size/weight in soybean. Journal of Integrative Plant Biology 68:470– 485. DOI: https://doi.org/10.1111/jipb.70101

      Xu J, Chen S, Wang W, Man Lam S, Xu Y, Zhang S, Pan H, Liang J, Huang Xiahe, Wang Yu, Li T, Jiang Y, Wang Yingchun, Ding M, Shui G, Yang H, Huang Xun. 2022. Hepatic CDP-diacylglycerol synthase 2 deficiency causes mitochondrial dysfunction and promotes rapid progression of NASH and fibrosis. Science Bulletin 67:299–314. DOI: https://doi.org/10.1016/j.scib.2021.10.014

      Xu M, Ding L, Liang J, Yang X, Liu Y, Wang Y, Ding M, Huang X. 2021. NAD kinase sustains lipogenesis and mitochondrial metabolism through fatty acid synthesis. Cell Reports 37:110157. DOI: https://doi.org/10.1016/j.celrep.2021.110157

      Yang L, Liang J, Lam SM, Yavuz A, Shui G, Ding M, Huang X. 2020. Neuronal lipolysis participates in PUFA‐mediated neural function and neurodegeneration. EMBO reports 21:e50214. DOI: https://doi.org/10.15252/embr.202050214

      Yang X, Liang J, Ding L, Li X, Lam S-M, Shui G, Ding M, Huang X. 2019. Phosphatidylserine synthase regulates cellular homeostasis through distinct metabolic mechanisms. PLOS Genetics 15:e1008548. DOI: https://doi.org/10.1371/journal.pgen.1008548

      Zhu J, Lam SM, Yang L, Liang J, Ding M, Shui G, Huang X. 2022. Reduced phosphatidylcholine synthesis suppresses the embryonic lethality of seipin deficiency. Life Metabolism 1:175–189. DOI: https://doi.org/10.1093/lifemeta/loac02

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      In their paper entitled "Alpha-Band Phase Modulates Perceptual Sensitivity by Changing Internal Noise and Sensory Tuning," Pilipenko et al. investigate how pre-stimulus alpha phase influences near-threshold visual perception. The authors aim to clarify whether alpha phase primarily shifts the criterion, multiplicatively amplifies signals, or changes the effective variance and tuning of sensory evidence. Six observers completed many thousands of trials in a double-pass Gabor-in-noise detection task while an EEG was recorded. The authors combine signal detection theory, phase-resolved analyses, and reverse correlation to test mechanistic predictions. The experimental design and analysis pipeline provide a clear conceptual scaffold, with SDT-based schematic models that make the empirical results accessible even for readers who are not specialists in classification-image methods.

      Strengths:

      The study presents a coherent and well-executed investigation with several notable strengths. First, the main behavioral and EEG results in Figure 2 demonstrate robust pre-stimulus coupling between alpha phase and d′ across a substantial portion of the pre-stimulus interval, with little evidence that the criterion is modulated to a comparable extent. The inverse phasic relationship between hit and false-alarm rates maps clearly onto the variance-reduction account, and the response-consistency analysis offers an intuitive behavioral complement: when two identical stimuli are both presented at the participant's optimal phase, responses are more consistent than when one or both occur at suboptimal phases. The frontal-occipital phase-difference result suggests a coordinated rather than purely local phase mechanism, supporting the central claim that alpha phase is linked to changes in sensitivity that behave like changes in internal variability rather than simple gain or criterion shifts. Supplementary analyses showing that alpha power has only a limited relationship with d′ and confidence reassure readers that the main effects are genuinely phase-linked rather than a recasting of amplitude differences.

      Second, the reverse-correlation results in Figure 3 extend this story in a satisfying way. The classification images and their Gaussian fits show that at the optimal phase, the weighting of stimulus energy is more sharply concentrated around target-relevant spatial frequencies and orientations, and the bootstrapped parameter distributions indicate that the suboptimal phase is best described by broader tuning and a modest change in gain rather than a pure criterion account. The authors' interpretation that optimal-phase perception reflects both reduced effective internal noise and sharpened sensory tuning is reasonable and well-supported. Overall, the data and figures largely achieve the stated aims, and the work is likely to have an impact both by clarifying the interpretation of alpha-phase effects and by illustrating a useful analytic framework that other groups can adopt.

      Weaknesses:

      The weaknesses are limited and relate primarily to framing and presentation rather than to the substance of the work. First, because contrast was titrated to maintain moderate performance (d′ between 1.2 and 1.8), the phase-linked changes in sensitivity appear modest in absolute terms, which could benefit from explicit contextualization. Second, a coding error resulted in unequal numbers of double-pass stimulus pairs across participants, which affects the interpretability of the response-consistency results. Third, several methodological details could be stated more explicitly to enhance transparency, including stimulus timing specifications, electrode selection criteria, and the purpose of phase alignment in group averaging. Finally, some mechanistic interpretations in the Discussion could be phrased more conservatively to clearly distinguish between measurement and inference, particularly regarding the relationship between reduced internal noise and sharpened tuning, and the physiological implementation of the frontal-occipital phase relationship.

      We appreciate the reviewer’s thoughtful and constructive feedback, particularly regarding clarity and framing. In response, we have made several revisions to improve transparency and contextualization throughout the manuscript.

      First, we now explicitly contextualize the relatively modest change in sensitivity by adding discussion of the contrast-titration procedure and its implications for effect size interpretation. Second, we address the coding error that led to unequal numbers of double-pass stimulus pairs across participants sooner in the manuscript by reporting the average number of pairs per participant in the Results (as well as the Methods), allowing for readers to interpret the results more appropriately. Third, we have provided additional detail, including precise stimulus timing parameters, electrode selection criteria, and a clearer explanation of the rationale for phase alignment in the Results (in addition to the Methods) section. Finally, we have revised portions of the Discussion to adopt more conservative language when interpreting our results, which more clearly distinguishes between empirical observations and mechanistic inferences, along with offering additional interpretations for the frontal-occipital phase relationship.

      We believe these revisions substantially improve the clarity, transparency, and interpretability of the manuscript.

      Reviewer #2 (Public review):

      Summary:

      The study of Pilipenko et al evaluated the role of alpha phase in a visual perception paradigm using the framework of signal detection theory and reverse correlation. Their findings suggest that phase-related modulations in perception are mediated by a reduction in internal noise and a moderate increase in tuning to relevant features of the stimuli in specific phases of the alpha cycle. Interestingly, the alpha phase did not affect the criterion. Criterion was related to modulations in alpha power, in agreement with previous research.

      Strengths:

      The experiment was carefully designed, and the analytical pipeline is original and suited to answer the research question. The authors frame the research question very well and propose several models that account for the possible mechanisms by which the alpha phase can modulate perception. This study can be very valuable for the ongoing discussion about the role of alpha activity in perception.

      Weaknesses:

      The sample size collected (N = 6) is, in my opinion, too small for the statistical approach adopted (group level). It is well known that small sample sizes result in an increased likelihood of false positives; even in the case of true positives, effect sizes are inflated (Button et al., 2013; Tamar and Orban de Xivry, 2019), negatively affecting the replicability of the effect.

      Although the experimental design allows for an accurate characterization of the effects at the single-subject level, conclusions are drawn from group-level aggregated measures. With only six subjects, the estimation of between-subject variability is not reliable. The authors need to acknowledge that the sample size is too small; therefore, results should be interpreted with caution.

      Conclusion:

      This study addresses an important and timely question and proposes an original and well-thought-out analytical framework to investigate the role of alpha phase in visual perception. While the experimental design and theoretical motivation are strong, the very limited sample size substantially constrains the strength of the conclusions that can be drawn at the group level.

      Bibliography:

      Button, K., Ioannidis, J., Mokrysz, C. et al. Power failure: why small sample size undermines the reliability of neuroscience. Nat Rev Neurosci 14, 365-376 (2013). https://doi.org/10.1038/nrn3475

      Tamar R Makin, Jean-Jacques Orban de Xivry (2019) Science Forum: Ten common statistical mistakes to watch out for when writing or reviewing a manuscript eLife 8:e48175 https://doi.org/10.7554/eLife.48175

      We thank the reviewer for their supportive remarks on our design and analysis, and for raising this important statistical concern about our sample size (n=6). Our choice of a small sample size was driven by methodological considerations. Specifically, our reverse correlation analysis requires a large number of trials per participant, as it estimates perceptual tuning by regressing behavioral responses against fluctuations in the energy of stimulus features (orientation and spatial frequency). This approach, as well as the computation of signal detection theory (SDT) metrics such as d′ and criterion, depends on high trial counts to obtain reliable estimates, particularly given that our analysis further subdivides trials across eight phase bins. For this reason, we prioritized collecting a large number of trials per participant (∼5,000), which is consistent with established practices in psychophysical research.

      Importantly, our approach means that our design is reliable on the individual level, which motivated us to include a new binomial probability testing in our revised paper. This binomial test helps address concerns about the generalizability of our results. Binomial testing considers each participant as an independent replication of the effect and then computes the p-value associated with the probability of having observed the given number of statistically significant participants by chance, with a false positive rate of 0.05. In our data, 3 out of 6 participants showed significant effects, which corresponds to a probability of 0.002 of having observed these effects by chance alone. We believe this converging evidence supports the replicability and generalizability of our results. To improve the transparency of the single-subject data, we have included single-participant results in the Supplemental Materials to allow readers to directly assess the consistency of effects across individuals and to better contextualize between-subject variability.

      Thank you again for your suggestions, we believe that these additions have greatly improved our manuscript by demonstrating the robustness of our findings and increasing the transparency of our single-subject results.

      Recommendations for the authors:

      Reviewing Editor Comments:

      The issue of generalizability arose during the review process, as your results are based on a small sample of participants who undertook a very large number of trials. In the revised version, it would be useful to discuss why this approach is valid, especially in the context of linking EEG with modeling (i.e., why it is more powerful than having many participants with fewer trials), and the extent to which your results can generalize to the population.

      We sincerely appreciate all of the helpful comments provided by the reviewers and hope we can address the concerns of our experimental approach. In the introduction, we have emphasized the importance of our current small sample size design, which allows us to reliably compute our signal detection theory metrics across 8 phase bins in addition to including the reverse correlation analysis. In the methods section, we have added a description of the binomial probability statistical framework, which addresses the generalizability of our results. In this framework, each participant is viewed as an independent replication and the p-value reflects the probability of having observed the number of individually significant subjects from the total sample size by chance. In this regard, observing a significant effect in 3 out of 6 participants (as in our study) from chance alone has a 0.002 probability, which we believe is unlikely and instead reflects a true effect present in the general population.

      Below I have copied our changes in the introduction and methods sections.

      “... in a large number of trials (6,020 per observer, n = 6) across multiple EEG sessions. This approach ensures a sufficient number of trials in order to reliably compute signal detection theory (SDT) metrics across multiple alpha phase bins while also affording enough statistical power for reverse correlation analysis (Xue et al., 2024), making it preferred over having a larger sample size with fewer trials.”

      “Additionally, we used a binomial probability testing framework that is designed for small sample sizes and treats each participant as an independent replication. As such, it computes the probability of having observed the number of statistically significant outcomes by chance given our sample size (Schwarzkopf & Huang, 2024).”

      Reviewer #1 (Recommendations for the authors):

      My suggestions are intended to be light-touch and focused on strengthening the clarity and durability of the Reviewed Preprint rather than on additional experimentation or major new analyses.

      (1) Limitation statement for the double-pass coding error:

      Add a short statement in the Methods or Results acknowledging that the coding error led to markedly fewer repeated stimulus pairs for the first three participants than for the last three. For the response-consistency result in Figure 2E, a simple acknowledgement that the available evidence is stronger for some participants than others will help readers calibrate their confidence without detracting from the main story.

      Thank you for this suggestion, we have now added a statement to this effect in the Results section, in addition to the description already mentioned in the Methods section.

      “To examine this, we implemented a double-pass stimulus presentation (~600 stimulus pairs for participants 1-3 and ~2,500 pairs for participants 4-6) and analyzed participant’s response consistency (Xue et al., 2024) to two identical stimuli.”

      (2) Contextualizing the titrated performance level:

      In the Discussion, explicitly note that contrast was titrated to keep d′ between approximately 1.2 and 1.8, which intentionally maintains moderate performance. This contextualization will help readers understand that while the phase-linked changes appear modest in absolute terms, they are mechanistically informative within this design.

      Thank you, we have included a sentence to the Discussions speaking to this point.

      “We also note that the observed modulation of d’ between optimal and suboptimal phases was relatively modest in absolute terms (0.21) in our study and could therefore require many trials per subject to detect. Two reasons for this modest effect size could be related to specific features of our task design. First, we titrated stimulus contrast to maintain consistent task performance. This titration could have reduced the magnitude of the phase effect on d’ that would otherwise be apparent if the stimulus intensity were kept constant. Additionally, the use of (relatively) high-contrast random noise likely means that trial-to-trial variability in perception is largely driven by random fluctuations in the noise properties and, to a lesser extent, internal brain state. Although both of these choices were necessary to perform SDT and reverse correlation analysis, they differ from many previous studies investigating alpha phase using only near-threshold detection in the absence of external noise and may contribute to an underestimation of the true effect size.”

      (3) Methods clarifications:

      (a) Replace placeholder text such as "{plus minus}" and "{degree sign}" with the appropriate symbols, and ensure that any equations implied in the reverse-correlation section are fully present.

      Thank you for bringing this to our attention, these placeholder texts are an artifact of the conversion process and we will correct this.

      (b) State explicitly that the 8 ms stimulus duration corresponds to a single frame on your 120 Hz display, which will clarify the timing in Figure 1A and the pre-stimulus windows in the phase analyses.

      Thank you, we have added language to both the Method and Results sections explicitly indicating that the 8 ms stimulus choice corresponds to a single screen refresh. Additionally, we changed the text in Figure 1A to include inter-trial interval timing (as opposed to merely saying “Start Trial”):

      “(A) Task design. Each trial contained a brief, filtered-noise stimulus (8 ms; one screen refresh) presented to the right or left of fixation with equal probability.”

      “Each participant (n = 6) completed 5-6 EEG sessions of a Yes/No detection paradigm whereby participants reported the presence or absence of a brief (8 ms; one screen refresh) vertical Gabor target (2 cycles per degree) with concurrent confidence judgments (see Figure 1A), along with an additional imagination judgement (reported in the supplemental materials).”

      (c) In the description of the post-stimulus taper, consider phrasing the rationale in terms of minimizing contamination from evoked responses rather than asserting that the taper ends before the earliest evoked response, which keeps the argument correct without committing to a precise latency boundary.

      Thank you for this suggestion. We have changed our rationale for the taper to “minimizing”, rather than avoiding, the evoked response.

      “This resulted in the post-stimulus data being flat after 70 ms, which is intended to minimize the evoked response in our data.”

      (4) Analysis transparency:

      (a) In the description of posterior electrode selection, explicitly note that channels were chosen solely on the basis of alpha power, independent of behavioural performance, and that the same electrodes were used for each participant across sessions.

      We have gladly made this clarification to the methods.

      “This was individually determined by rank-ordering 17 of the posterior channels (Pz, P3, P7, O1, Oz, O2, P4, P8, P1, P5, PO7, PO3, POz, PO4, PO8, P6, and P2) and algorithmically choosing the three with the highest power. This ensured that electrode selection was made independent of performance and instead was based upon maximizing alpha signal strength.”

      (b) Describe the phase-alignment step used to center each participant's optimal bin before group averaging as a device for visualization and summary, and clarify that inferential statistics are based on the underlying, non-aligned data as appropriate. This will reassure readers who are cautious about circularity.

      We agree that this should be made more explicit throughout the manuscript and have added statements clarifying this aspect in the Figure 2B caption, the Results, and Method sections.

      “The data have been aligned across participants so that each individual's highest d’ was assigned to bin 8 (omitted from the plot), with the remaining data circularly shifted, and is averaged across -450 ms to stimulus onset. This graph is for visualization purposes only. Error bars represent ± 1 SEM. The pattern shows a clear phasic modulation of d’ across bins.”

      “... requiring us to phase-align the performance data across participants in order to visualize the underlying phasic effects. To this end, we aligned all metrics (d’, c, HR, and FAR) by circularly shifting the data so that the bin with the highest d’ was assigned to bin 8, which was then omitted from further visualizations.”

      “Bin 8 was then omitted from further visualizations. The shifted data were then averaged across all time points from -450 ms to 0 ms, based on significant effects at the group level, and averaged across participants. No statistics were conducted on these shifted variables and instead are for visualization purposes only.”

      (c) Add a short note on the number of permutations and the cluster-forming threshold in the phase-coupling analyses, if not already stated in the Results or captions, to complete the description of your non-parametric testing procedure.

      Thank you, we agree that reiterating this information in the Results section is helpful for the reader to clarify the analysis procedure.

      “After smoothing the resultant vector length over time with a 50 ms moving average, we compared the observed vector lengths to a permuted threshold (95th percentile of 1,000 permutations) at each time point from –700 to 0 ms and performed cluster correction (95th percentile of the permuted cluster size) to account for multiple comparisons.”

      (5) Discussion framing:

      Make one or two small adjustments to your mechanistic phrasing so that the distinctions between measurement and interpretation are fully explicit:

      (a) State that the combination of phase-d′ coupling, counterphased hit and false-alarm rates, response consistency, and phase-dependent classification images is "consistent with" a reduction in effective internal noise and sharper estimated tuning at optimal alpha phase, within the assumptions of your SDT and reverse-correlation framework.

      Thank you for this suggestion. We have changed the language in the discussions to reflect this framing and interpretation of the results.

      “Moreover, our data are consistent with a model in which the variability of internal responses changes systematically across the alpha cycle, as reflected in the inverse relationship between hit rate and false alarm rate.”

      (b) Emphasize that reduced effective internal noise and sharpened sensory tuning are two complementary descriptions of a better match between sensory evidence and decision template rather than fully separable mechanisms.

      Thank you, we have added this language for clarity of our interpretation.

      “Together with decreases in the variance of sensory tuning during the optimal phase, our results suggest that alpha phase impacts sensitivity by shaping trial-to-trial variation in internal noise during perceptual decision making, leading to better matches between sensory evidence and decision templates as opposed to a change in the gain of internal sensory responses.”

      (c) Note that the frontal-occipital phase relationship is consistent with a coordinated, possibly top-down component to the alpha-phase effect, while remaining agnostic about the precise physiological implementation.

      Thank you for raising this additional interpretation. We have added this as a plausible alternative to the single-source account in the Discussion section.

      “Moreover, our results suggest that prior literature reporting phasic effects in the alpha-band range from both frontal and occipital regions may plausibly be reporting the same effect from a single projected dipole source; however, these results are also consistent with two synchronized alpha sources which are anti-phase.”

      Reviewer #2 (Recommendations for the authors):

      Major issues:

      Given that collecting more data may not be doable, the authors should take some actions to test the reliability of their results. For instance, simulations could be run to test the robustness of the results with such a small sample size (Zoefel, 2019). It would also be of interest to include in the report statistics and plots at the individual level, not only the aggregates. It is also important to report which electrodes were used in the analysis for each of the subjects, in the Methods section, it is clearly stated that these electrodes differed between subjects.

      Thank you for these suggestions. To assess the reliability of our results at the single-subject level, we have included a new binomial probability test which is a framework suitable for small sample size experiments with large trail numbers (Schwarzkopf & Huang, 2024). Binomial testing views each individual as an independent replication and considers the probability of having observed the number of significant participants given the total number tested participants, and outputs the probability of having observed the results by chance. We believe this framework adequately addresses the reviewer’s concern of generalizability in addition to being well-suited to the design of our study.

      To assess individual significance, we averaged the resultant vector length and permutations over the analysis window from -450 to 0 ms. If the resultant vector length exceeded the permutation for that participant, then they were considered to be a significant participant. In total, 3 out of 6 participants (participants 1, 4, and 5) showed significant d’ coupling. The binomial probability (equivalent to a p-value) of having observed this outcome as a result of three false positives at the individual-subject level is very small (p = 0.002), which is sufficiently low for psychological studies.

      Below is the text which we have added to the Results and Methods sections.

      “To interrogate the robustness of our findings at the single-subject level, we adopted a test of binomial probability, which is a statistical framework that treats each individual as an independent replication and is ideal for small sample size studies that utilize a large number of trials per observer (Schwarzkopf & Huang, 2024). For our data, we assessed individual significance by averaging the actual and permuted resultant vector lengths across time (-450 to 0ms) and comparing the real vector length to the 95% percentile of the permuted datasets. With this approach, 3 out of 6 participants showed significant d’-phase coupling which corresponds to a binomial probability of p = 0.002, indicating a very low probability that we observed these results by chance alone.”

      “Additionally, we used a binomial probability testing framework that is designed for small sample sizes and treats each participant as an independent replication. As such, it computes the probability of having observed the number of statistically significant outcomes by chance given our sample size (Schwarzkopf & Huang, 2024). To assess significance at the participant level, we averaged the participant’s resultant vector length and permutations from -450 to 0 ms and obtained the 95th percentile of the time-averaged permutations. We then compared the averaged resultant vector lengths to the permutation thresholds for each subject, which revealed 3 out of 6 significant subjects. We then used the MATLAB function myBinomTest.m (Nelson, 2026) to compute the p-value associated with the probability of having observed 3 out of 6 significant subjects by chance (with a false-positive rate of 0.05).”

      To address the reviewer's second request, we now include a supplemental figure which has each individual’s results for the main analysis (see Supplementary figure 3). These graphs, in addition to the methods, now provides the reader with each participant’s given set of analysis electrodes.

      “Each participant had a different combination of electrodes which were used in the analyses; however, the same three channels were used across sessions within a participant (participant 1: POz, PO3, O1; participant 2: P7, PO7, PO4; participant 3: P2, P1, Pz; participant 4: O1, Oz, O2; participant 5: O2, PO8, PO4; participant 6: Oz, O2, O1).”

      As an alternative approach, linear mixed models (LMM) could be used for statistics, as they are more suitable for small sample sizes (Wiley et al., 2019). LMM improve generalization by modelling subject-specific random effects. Although raw circular data is not suitable for LMM, the sine and cosine of the phases could have been used as predictors, for instance. Given that data were collected for 6 different sessions, sessions could be included as a factor in the model to improve statistical power.

      We appreciate the suggestion but feel that LMMs would be a challenge in this case not only because the main predictor variables are circular, but because the main outcome variables are not defined on the single-trial level and require many trials to be computed (e.g., classification images, SDT measures, response consistency). As such, computing these measures within a session may also lead to noisier estimates than we had designed our experiment for. We therefore prefer the more straightforward approach we have taken in the paper, which has now been supplemented by a binomial test of individual-subject level significance.

      Given that the number of subjects is quite small, I believe that individual data should be presented (either in the main text or supplementary materials) also for figures: 2A, B, C and D.

      Thank you, we have included all of these results to the individual graphs in the Supplemental Materials (see Supplementary figure 3).

      In plot 2B (HR and FAR) a p-value = 0.015 appears. However, in the text you write:

      "Indeed, this showed that the difference between the HR and FAR vector angle was significantly clustered around a mean of 180{degree sign} (v = 3.78, p = 0.01), indicating that the phase angle associated with the greatest hits was counterphase to the phase angle associated with the greatest false alarms."

      Which one is correct? Or do they refer to different tests?

      We appreciate you catching this confusing discrepancy. The two values refer to the same test which has a p-value of 0.0145. In the figure, this value was rounded to the thousandths decimal place (i.e., 0.015), whereas in the text it was rounded to the hundredths value (0.01). We now consistently report p-values out to three decimal places throughout the manuscript.

      Did you perform any statistical test for phasic modulation of dprime and criterion? I say that because in Figure 2B, you state that the data shows a "clear phasic modulation of d' across bins", but no statistic is mentioned. On the other hand, in Figure 2D, you state, "We did not & observe any significant phase-dependent relationship between phase and criterion." Is this sentence referring to both 2C and 2D panels or only to 2C?

      Figure 2B and 2D show the phase-behavior relationship across bins after aligning the phase bins to each participant's “best” d’ bin. This bin is omitted from the plots because it is used for alignment, making the analysis circular. Accordingly, these panels were intended purely for visualization and were not used for statistical inference. Additional language has been added to the figure caption highlighting this aspect.

      “The data have been aligned across participants so that each individual's highest d’ was assigned to bin 8 (omitted from the plot), with the remaining data circularly shifted, and is averaged across -450 ms to stimulus onset. This graph is for visualization purposes only.”

      The primary statistical test for phase-behavior coupling was performed using permutation testing of the resultant vector length, which quantifies the magnitude of phase-dependent modulation. These results are shown in Figures 2A (for d′) and 2C (for criterion). In the original manuscript, we reported only the time points that survived cluster-based correction, but did not explicitly report the cluster p-values. We have now added these cluster p-values to the manuscript for completeness.

      “The data revealed significant cluster-corrected coupling between alpha phase and d’ in the prestimulus window from -220 ms until stimulus onset (cluster p = 0.046),...”

      Additionally, we have changed the caption of Figure 2 to be separate for C) and D).

      “(C) No evidence for the coupling of criterion to pre-stimulus alpha-band phase. Graph C reveals the time course of the resultant vector lengths for alpha phase-criterion coupling, which shows no significant phase-dependent relationship between phase and criterion.

      (D) The underlying shifted c across phase bins (shifted to participants’ optimal phase, as in graph B) did not visually demonstrate a phasic modulation pattern.”

      Minor issues:

      In general, the paper is very clear. I found a statement confusing in the Response consistency section:

      "To quantify response consistency, we computed the proportion of trials in which participants provided the same response across the two identical trials. This procedure was done for each channel at each time point (from -450 to 0 ms) and then averaged."

      Which makes no sense, as response consistency is independent of channel and time point. I believe here you refer to the phase, maybe by just changing the order (start with response consistency and then proceed to phase), the paragraph would be clearer.

      We appreciate you catching this mistake. We have clarified the Methods section in the following way:

      “To quantify response consistency, we computed the proportion of trials in which participants provided the same response across the two identical trials. Since the optimal phase changes over time, the set of trials were classified as either both having occurred during the optimal phase (or otherwise) for each time point (from -450 to 0 ms) and channel. The proportion of consistent responses was then averaged across channels and time.”

      Could you include a plot of the power spectrum used for IAF estimation of all the subjects?

      Thank you for the suggestion. In Supplemental Figure 3 we have included the power spectrum that was used to estimate IAF in addition to a topoplot of alpha power (IAF +/- 2 Hz) that has the analysis electrodes labelled.

      Bibliography:

      Wiley RW, Rapp B. Statistical analysis in Small-N Designs: using linear mixed-effects modeling for evaluating intervention effectiveness. Aphasiology. 2019;33(1):1-30. doi: 10.1080/02687038.2018.1454884.

      Zoefel B, Davis MH, Valente G, Riecke L, How to test for phasic modulation of neural and behavioural responses, NeuroImage, Volume 202, 2019,116175, https://doi.org/10.1016/j.neuroimage.2019.116175.

    1. Author Response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Weaknesses:

      Despite this compelling data regarding the protective role of HSF1 in the febrile response, what remains unexplained and complicates the authors' model is the observation that losing LvHSF1 at 'normal' temperatures of 25 ℃ is not detrimental to survival, even though viral loads increase and nSWD is likely still subject to LvHSF1 regulation. These observations suggest that WSSV infection may have other detrimental effects on the cell not reflected by viral load and that LvHSF1 may play additional roles in protecting the organism from these effects of WSSV infection, such as perhaps, perturbations to protein homeostasis. This is worth discussing, especially in light of the rather complicated roles of hormesis in protection from infection, the role of HSF1 in hormesis responses, and the findings from other groups that the authors discuss.

      We are grateful for your unbiased advice by reviewer. And we have added the description about the role of HSF1 in hormesis responses in discussion in Lines 422-425 in the revised manuscript. Thank you.

      Reviewer #2 (Public review):

      Temperature is a critical factor affecting the progression of viral diseases in vertebrates and invertebrates. In the current study, the authors investigate mechanisms by which high temperatures promote anti-viral resistance in shrimp. They show that high temperatures induce HSF1 expression, which in turn upregulates AMPs. The AMPs target viral envelope proteins and inhibit viral infection/replication. The authors confirm this process in drosophila and suggest that there may be a conserved mechanism of high-temperature mediated anti-viral response in arthropods. These findings will enhance our understanding of how high temperature improves resistance to viral infection in animals.

      The conclusions of this paper are mostly well supported by data, but some aspects of data analysis need to be clarified and extended. Further investigation on how WSSV infection is affected by AMP would have strengthened the study.

      We are grateful for your unbiased advice by reviewer. We have provided additional experimental evidence and supplementary instructions in the revised manuscript. Thank you.

      Reviewer #3 (Public review):

      In the manuscript titled "Heat Shock Factor Regulation of Antimicrobial Peptides Expression Suggests a Conserved Defense Mechanism Induced by Febrile Temperature in Arthropods", the authors investigate the role of heat shock factor 1 (HSF1) in regulating antimicrobial peptides (AMPs) in response to viral infections, particularly focusing on febrile temperatures. Using shrimp (Litopenaeus vannamei) and Drosophila S2 cells as models, this study shows that HSF1 induces the expression of AMPs, which in turn inhibit viral replication, offering insights into how febrile temperatures enhance immune responses. The study demonstrates that HSF1 binds to heat shock elements (HSE) in AMPs, suggesting a conserved antiviral defense mechanism in arthropods. The findings are informative for understanding innate immunity against viral infections, particularly in aquaculture. However, the logical flow of the paper can be improved.

      We are grateful for the positive comments and the unbiased advice by reviewer. We have improved the logical flow of the paper and added corresponding instructions in the revised manuscript. Thank you.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      (1) Figure 1: The analysis compares Group TW to Group W (not the other way around).

      Thank you very much. To uncover the molecular mechanisms by which high temperature restricts WSSV infection, two shrimp groups, Group TW and Group W, were cultured at 25 °C. Group W comprised shrimp injected with WSSV and maintained at 25 °C continuously. In contrast, Group TW was subjected to a temperature increase to 32 °C at 24 hours post-injection (hpi). Gill samples were collected for analysis 12 hours post-temperature rise (hptr) and subjected to Illumina sequencing (Figure 1A). RNA-seq was used to identify genes responsive to high temperature, particularly those encoding potential transcriptional regulators. Thank you.

      (2) The RNA-seq data in Figure 1 focus only on the TFs. The manuscript would benefit from showing all the RNA-seq data and the differentially expressed genes. In particular, are the AMPs upregulated at the same time point? This should not be the case if LvHSF1 were responsible for the transcription of the AMPs, given the time lag between transcription and translation.

      Thank you for your suggestion. In Author response image 1, our previous study has revealed that classical heat shock proteins (such as HSP21, HSP70, HSP60, HSP83, HSP90, HSP27, HSP10, and Bip) were induced by RNA-seq between Group TW and Group W, suggesting heat shock proteins exert a crucial role in enhancing the resistance of shrimp to WSSV at elevated temperatures (32 ℃) and underscoring the reliability of our transcriptomic findings (Xiao et al., 2024).

      Additionally, we also analyzed the AMPs expression between Group TW and Group W, and the results show that some antimicrobial peptides such as Lysozyme and C-type lectin are upregulated between Group TW and Group W. Notably, we did not detect upregulated expression of SWD between Group TW and Group W. We agree with the reviewer's point of view that there is a time lag between transcription and translation. Supplementary experimental evidences show that the expression level of LvHSF1 is strongly induced by WSSV stimulation, and then the expression level of SWD begins to increase. We have added a description in Lines 136-138 in the revised manuscript.

      Author response image 1.

      The Figure of the heat shock proteins in Group TW and Group W

      Author response image 2.

      Transcriptional expression levels of HSF1 and SWD after WSSV stimulation

      Reference:

      Xiao, B., Wang, Y., He, J., Li, C., 2024. Febrile Temperature Acts through HSP70-Toll4 Signaling to Improve Shrimp Resistance to White Spot Syndrome Virus. J Immunol 213, 1187-1201.

      (3) The data showing the tissue distribution of LvHSF1 and nSWD is a rigorous approach and adds to the manuscript. A similar approach to understanding the time course of expression of AMPs in relationship to LvHSF1 expression levels would strengthen the authors' conclusions that LvHSF1 induction in response to high temperatures and viral infection, in turn, upregulates SWD and other antibacterial genes.

      Thank you for your suggestion. As you good suggestion, we detected the transcriptional expression levels of HSF1 and SWD after WSSV stimulation for 0, 2, 4, 6, 8, 12, 16, 20, and 24 hours. The transcriptional expression level of SWD was set to 1.00 at 0 h, in the early stage of WSSV infection (0-12 h, except 6 h), the expression level of LvHSF1 is strongly induced, and then the expression level of SWD begins to increase. Theses results show that LvHSF1 induction in response to viral infection, in turn, upregulates SWD and other antibacterial genes. Thank you.

      (4) The data (Figures 3 and 4) show that LvHSF1 is necessary to survive WSSV infection at high temperatures but does not affect survival at lower temperatures, even though LvHSF1 limits VP28 levels, and viral load at both temperatures is confusing. Does this suggest that LvHSF1 is not primarily important for protection against the virus but instead, for protection from the heat-induced damage caused by high temperatures, which would not be surprising? The manuscript would benefit if the authors could address this point. How do the authors envision the protection conferred by LvHSF1 only at high temperatures?

      Thank you for your comment. Although no significant difference in shrimp survival rates was observed between LvHSF1-silenced shrimp and GFP-silenced shrimp at low temperature (25 °C), shrimp with silenced LvHSF1 exhibited increased viral loads in hemocytes and gills, suggesting that upregulation of HSF1 expression can protect shrimp from WSSV infection.

      Notably, the tolerance temperature for L. vannamei growth ranges from 7.5 to 42 °C. When infected with WSSV, shrimp use behavioral fever to elevate their body temperature (~32 °C), thereby inhibiting WSSV infection (Rakhshaninejad et al., 2023; Xiao et al., 2024). And this temperature (~32 °C) will not cause heat-induced damage to the shrimp. Our results demonstrate that febrile temperatures induce HSF1, which in turn upregulates antimicrobial peptides (AMPs) that target viral envelope proteins and inhibit viral replication.

      Only at high temperatures, we observed that knockdown of HSF1 did not affect shrimp survival rate (Figure 4A). Thank you again for your valuable feedback.

      Reference:

      Rakhshaninejad, M., Zheng, L., Nauwynck, H., 2023. Shrimp (Penaeus vannamei) survive white spot syndrome virus infection by behavioral fever. Sci Rep 13, 18034.

      Xiao, B., Wang, Y., He, J., Li, C., 2024. Febrile Temperature Acts through HSP70-Toll4 Signaling to Improve Shrimp Resistance to White Spot Syndrome Virus. J Immunol 213, 1187-1201.

      (5) Related to the previous comment, the authors do not clearly distinguish between basal effects of LvHSF1 or nSWD induction and heat-induced effects and the differences related to the requirement of LvHSF1 for protection. Simply increasing LvHSF1 levels can result in increased nSWD. SWD levels increase upon WSSV infection even at 25 ℃, and the knockdown experiments suggest that this could also occur through LvHSF1. It would be useful to explicitly differentiate between basal functions of HSF1 and induced functions.

      Thank you for your suggestion. In previous responses, we have distinguished between basal effects of LvHSF1 or nSWD induction and heat-induced effects.

      As your good suggestion, we injected GST or rHSF1 protein into shrimp, the results showed that recombinant protein HSF1 could significantly induced the expression level of SWD (Supplementary Fig. 5C). Further, after knockdown of SWD, shrimp were injection with rLvHSF1 mixed with WSSV. The results showed that the viral load was significantly lower than the control group 48 hours post WSSV infection (Supplementary Fig. 5D). We have added these results to the Supplementary Figure 5C&5D and added a description in Lines 253-255 and Lines 290-293 in the revised manuscript. Thank you for your constructive comments.

      Reviewer #2 (Recommendations for the authors):

      (1) Two temperatures are used in the experiments of shrimp. It seems that HSF1 is also upregulated by WSSV infection at 25 ℃. However, this upregulation seems not to be able to protect the animals. The authors compare the infection at 25 and 32 ℃ but did not discuss the findings.

      Thank you for your comment. Although no significant difference in shrimp survival rates was observed between LvHSF1-silenced shrimp and GFP-silenced shrimp at low temperature (25 °C), shrimp with silenced LvHSF1 exhibited increased viral loads in hemocytes and gills, suggesting that upregulation of HSF1 expression can protect shrimp from WSSV infection. We have added a discussion of this finding in Lines 461-464 in the revised manuscript. Thank you.

      (2) In the abstract the authors say that "These insights provide new avenues for managing viral infections in aquaculture and other settings by leveraging environmental temperature control." However, this point has not been discussed in the main text.

      We appreciated your comments. We have added a discussion about the environmental temperature control in Lines 512-514 in the revised manuscript. Thank you.

      (3) Line 142: "These results suggest that LvHSF1 may play a key role in enhancing shrimp resistance to WSSV at elevated temperatures." Although this type of conclusion has been made in many studies, I think it is impossible to see a "KEY role" based mainly on change in expression.

      Thank you for your suggestion. We have revised this conclusion in the revised manuscript. Thank you.

      (4) Section 2.1 Induction of Heat Shock Factor 1 in Response to WSSV at High Temperature

      Figure 1. Identification of HSF1 as a key factor induced by high temperature.

      The two titles are confusing. Whether the upregulation of HSF1 is a response to high temperature or WSSV infection? I think it is more likely a response to high temperature. Did the authors see the difference in HSF1 expression in shrimp with and without WSSV infection at high temperatures?

      Thank you for your comment. We have modified the title of Section 2.1 in the revised manuscript. As your good suggestion, we have measured the expression of LvHSF1 after WSSV challenge at high temperatures (32 ℃) in revised Figure 2F-2H in Line 122 in the revised manuscript. The results demonstrate that the expression of LvHSF1 is strongly induced by WSSV stimulation at high temperatures (32 ℃) in the revised manuscript. Thank you.

      (5) Figure 2. Upregulation of LvHSF1 in shrimp challenged by WSSV at both low and high temperatures. Results for WSSV challenge at high temperatures are not included in this figure.

      Thank you for your suggestion. As your good suggestion, we have measured the expression of LvHSF1 after Poly (I: C) and WSSV challenge at high temperatures (32 ℃) in revised Figure 2C-2H. The results demonstrate that the expression of LvHSF1 is strongly induced by Poly (I: C) and WSSV stimulation at high temperatures (32 ℃). And we have added a description in Lines 168-179 in revised manuscript. Thank you.

      (6) Section 2.2 Expression Profiles of LvHSF1 in Shrimp Under Varied Temperature Conditions and WSSV Challenge. Did the authors try poly IC and WSSV challenge at 32℃, and compare with the un-challenge group? Why were only low temperature was analyzed?

      Thank you for your suggestion. As your good suggestion, we have measured the expression of LvHSF1 after Poly (I: C) and WSSV challenge at high temperatures (32 ℃) in revised Figure 2C-2H. And we have added a description about the expression of LvHSF1 after Poly (I: C) and WSSV challenge at high temperatures (32 ℃) in Lines 168-179 in revised manuscript. Thank you.

      (7) Figure 2: Please indicate the temperature used in C-E and F-H in the figure legend. Statistical significance: compared with which group? Please provide information in the legend or show it in the bar chart.

      Thank you for your suggestion. We have added the description of temperature used in revised Figures 2C-2E. The expression changes of HSF1 were compared with those of PBS control group at the corresponding time and we modified the comparison method of significance in revised Figures 2C-2E. Thank you.

      (8) Figure 3H: There are two groups (dsGFP+PBS; dsHSF1+PBS) showing with the same symbol (dot line).

      Thank you for your comment. The revised Figure 3H has used different symbols to distinguish the two groups. Thank you.

      (9) Line 205: qPCR

      Thank you for your careful checks. We have corrected this error in the revised manuscript. Thank you.

      (10) Figure 5d and f: Please indicate the sample in each row.

      Thank you for your suggestion. We have marked the samples in each row in the revised Figures 5d&5f.

      (11) Figure 3 and Figure 4: Why different tissues were analyzed in the two experiments? Low temperature: gill and hemocytes. High temperature: gill and muscle? It is better to use the same tissues so that they can be compared. Please indicate the tissue analyzed in D and d.

      Thank you for your suggestion. We have repeated the experiment to detect the copy number of WSSV in hemocyte at high temperature (32 °C) after LvHSF1 knockdown. The results showed that knockdown LvHSF1 showed increased viral loads in shrimp hemocyte (Figure 4C). We have supplemented the tissue information in Figure 4D&4d. Thank you.

      (12) Figure 2A The time for temperature treatment? hours or days?

      Thank you for your comment. Transcriptional expression of LvHSF1 in different tissues of healthy shrimp subjected to low (25 °C) and high (32 °C) temperatures for 12 hours. We have supplemented this information in the legend of Figure 2A in Lines 840-841 in revised manuscript. Thank you.

      (13) Line 249: purified by SDS-PAGE gel?

      Thank you for your comment. We have modified this description in Lines 272-274 in current manuscript. Thank you.

      (14) Line 258 "Next, to verify whether the anti-WSSV function of nSWD was mediated by LvHSF1 at high temperature". I think it is confusing to use "mediated" here. It seems that HSF1 is downstream of nSWD. Actually, HSF1 controls the expression of nSWD and thus regulates the anti-WSSV effect of shrimp at high temperatures.

      We appreciated your comments. We have modified this description in Lines 282-283 in current manuscript. Thank you.

      (15) Line 458 "The most probable anti-WSSV mechanism of nSWD is its direct interaction with WSSV envelope proteins VP24 and VP26, potentially inhibiting viral entry into target cells. I suggest the author analyze the entry of WSSV to see whether nSWD blocks this process.

      Thank you for your comment. In general, the antimicrobial mechanism of action of AMPs is thought to involve direct membrane disruption, especially for enveloped virus (such as WSSV) (Wilson et al., 2013).

      Thanks to the reviewers for their valuable comments. Our manuscript mainly focuses on the febrile temperature-inducible HSF in host antiviral immunity, and the role of HSF1 in regulating antimicrobial effectors (such as SWD). Due to the limitation of the manuscript's length, we will further investigate the functional mechanisms of SWD-specific anti-WSSV in future studies. Thank you.

      Reference:

      Wilson, S.S., Wiens, M.E., Smith, J.G., 2013. Antiviral Mechanisms of Human Defensins. Journal of Molecular Biology 425, 4965-4980.

      (16) Line 435-456 The author discusses the difference between two shrimp species. Did the two studies measure the same immune parameters? I wonder whether the different observation is due to true differences or different methods they used to evaluate the response. If no immune response was promoted in the previous study, what's the possible anti-viral mechanism?

      We appreciated your comments. Firstly, the shrimps in the two experimental groups have different adaptability to temperature. The optimal water temperature for M. japonicus growth ranges from 25 to 32 °C, and the tolerance temperature for L. vannamei growth ranges from 7.5 to 42 °C. Secondly, the experimental environmental factors are different in the two experimental groups. Ammonia is a key stress factor in aquatic environments that usually increases the risk of pathogenic diseases in aquatic animals, however, High temperatures (32°C) have been shown to inhibit the replication of WSSV and reduce mortality in WSSV-infected shrimp. Thirdly, the two studies tested different immune indicators. Ammonia-induced Hsf1 suppressed the production and function of MjVago-L, an arthropod interferon analog. In this study, our findings revealed the molecular mechanism through which the HSF-AMPs axis mediates host resistance to viruses induced by febrile temperature. Taken together, the benefits of HSF1 can be attributed to either the host or the pathogen, depending on the nature and context of the host-virus-environment interaction.

      (17) Line 472 "directly bind to WSSV envelope proteins and inhibit WSSV proliferation"

      I think it is confusing to use "proliferation" here. It seems that the binding of HSF affects the replication process. However, based on the authors' discussion, HSF may likely block viral entry.

      Thank you for your suggestion. We have modified this description in Lines 505-507 in the current manuscript. Thank you.

      Reviewer #3 (Recommendations for the authors):

      In the manuscript titled "Heat Shock Factor Regulation of Antimicrobial Peptides Expression Suggests a Conserved Defense Mechanism Induced by Febrile Temperature in Arthropods", the authors investigate the role of heat shock factor 1 (HSF1) in regulating antimicrobial peptides (AMPs) in response to viral infections, particularly focusing on febrile temperatures. Using shrimp (Litopenaeus vannamei) and Drosophila S2 cells as models, this study shows that HSF1 induces the expression of AMPs, which in turn inhibit viral replication, offering insights into how febrile temperatures enhance immune responses. The study demonstrates that HSF1 binds to heat shock elements (HSE) in AMPs, suggesting a conserved antiviral defense mechanism in arthropods. The findings are informative for understanding innate immunity against viral infections, particularly in aquaculture. However, the logical flow of the paper can be improved. Following are my specific concerns.

      Major comments

      (1) The study design is pretty good, but the logical flow is not. The following should be improved.

      (a) In Figure 1, the reason for selecting HSF1 as the focus of the study is not clearly explained.

      Thank you for your comment. In a previous study, we have revealed that heat shock proteins exerted a significant role in enhancing the resistance of shrimp to WSSV at elevated temperature (32 ℃) (Xiao et al., 2024). GO functional enrichment analysis of DEGs between group TW and group W, indicating that most DEGs were involved in biological processes such as protein refolding, chaperone-mediated protein folding, and heat response. Therefore, special attention has been paid to heat shock factor 1 (HSF1), the master regulator of the heat shock response. We have added the description in Lines 136-138 in the revised manuscript. Thank you.

      Reference:

      Xiao, B., Wang, Y., He, J., Li, C., 2024. Febrile Temperature Acts through HSP70-Toll4 Signaling to Improve Shrimp Resistance to White Spot Syndrome Virus. J Immunol 213, 1187-1201.

      (b) As the authors draw models in Figure 9, the established activation mechanism of HSF1 is via trimerization by the release of HSP90, which binds to misfolded proteins under stress conditions, such as heat shock. Therefore, the increase in the HSF1 mRNA level in Figure 1 is strange. The authors need to clarify this issue by explaining this established activation mechanism of HSF1 and also must provide the basis of upregulation of HSF1 by mRNA increase via citing papers in the Introduction.

      We appreciated your comments. Under non-stress conditions, HSF monomers are retained in the cytoplasm in a complex with HSP90. During the stress response, such as high temperature, HSF dissociates from the complex, trimerizes, and converts into a DNA-binding conformation through regulatory upstream promoter elements known as heat shock elements (HSEs) (Andrasi et al., 2021). Previous studies have demonstrated that the expression of HSF1 was remarkably induced by stress response, such as high temperature (Ren et al., 2025), virus infection (Merkling et al., 2015), and ammonia stress (Wang et al., 2024). Our results also showed that the expression of LvHSF1 was significant induced by WSSV infection and high temperature (Figure 2). Therefore, this is not surprising that the increase in the HSF1 mRNA level in Figure 1.

      In response, we have revised the proposed model to better reflect our experimental findings and the accompanying description. This revision ensures that the schematic is consistent with our data and accurately represents the proposed mechanism. We appreciate your careful review and constructive feedback.

      Reference:

      Andrasi, N., Pettko-Szandtner, A., Szabados, L., 2021. Diversity of plant heat shock factors: regulation, interactions, and functions. J Exp Bot 72, 1558-1575.

      Ren, Q., Li, L., Liu, L., Li, J., Shi, C., Sun, Y., Yao, X., Hou, Z., Xiang, S., 2025. The molecular mechanism of temperature-dependent phase separation of heat shock factor 1. Nature Chemical Biology.

      Merkling, S.H., Overheul, G.J., van Mierlo, J.T., Arends, D., Gilissen, C., van Rij, R.P., 2015. The heat shock response restricts virus infection in Drosophila. Sci Rep 5, 12758.

      Wang, X.X., Zhang, H., Gao, J., Wang, X.W., 2024. Ammonia stress-induced heat shock factor 1 enhances white spot syndrome virus infection by targeting the interferon-like system in shrimp. mBio 15, e0313623.

      (c) For RNA seq analysis in both in Figures 1 and 5, they need to provide changes in conventional HSF1 target chaperones (many HSPs) to validate their RNA seq data.

      Thank you for your suggestion. In Authopr response image 1, our previous study has revealed that classical heat shock proteins (such as HSP21, HSP70, HSP60, HSP83, HSP90, HSP27, HSP10, and Bip) were induced by RNA-seq between Group TW and Group W, suggesting heat shock proteins exert a crucial role in enhancing the resistance of shrimp to WSSV at elevated temperatures (32 ℃) and underscoring the reliability of our transcriptomic findings (Xiao et al., 2024). We have added the description in Lines 136-138 in the revised manuscript.

      In Figure 5, we have supplemented the heat shock proteins downregulated DEGs by transcriptome sequencing of dsGFP +WSSV (32 ℃) vs. dsLvHSF1 +WSSV (32 ℃) in Supplementary table 2. The results showed that the classical heat shock proteins were downregulated by the RNA-seq, underscoring the reliability of our transcriptomic findings. We have added the description in Lines 213-216 in the revised manuscript. Thank you.

      Reference:

      Xiao, B., Wang, Y., He, J., Li, C., 2024. Febrile Temperature Acts through HSP70-Toll4 Signaling to Improve Shrimp Resistance to White Spot Syndrome Virus. J Immunol 213, 1187-1201.

      (d) In Figure 5, they did experiments by focusing on the changes by HSF1 knockdown at 32 ℃. However, the logical flow should be focusing on genes whose expression was increased by 32 ℃ compared with 25 ℃ (in figure 1), among them they need to characterize HSF1 target genes. Here as mentioned above, classical HSP genes must be included in addition to those AMP genes.

      Thank you for your suggestion. As your good suggestion, we have supplemented the heat shock proteins downregulated DEGs by transcriptome sequencing of dsGFP +WSSV (32 ℃) vs. dsLvHSF1 +WSSV (32 ℃) in Supplementary table 2. The results showed that the classical heat shock proteins were downregulated by the RNA-seq, underscoring the reliability of our transcriptomic findings. We have added the description in Lines 213-216 in the revised manuscript. Thank you.

      (e) What is the logical basis of just picking nSWD? It is another example of cherry-picking similar to picking HSF1 in Figure 1.

      We appreciated your comments. To determine how temperature-induced LvHSF1 restricts WSSV infection, RNA-seq was performed to identify target genes regulated by HSF1. By analyzing the differentially expressed genes (DEGs), we screened eight candidate proteins for immunity-effector molecules, including SWD, CrustinⅠ, C-type lectin, Anti-lipopolysaccharide factor (ALF), and Vago. CrustinⅠ has been shown to play an important role in antiviral immunity (Li et al., 2020); C-type lectin (CTL1) can bind to the VP28, VP26, VP24, VP19, and VP14, thereby inhibiting the infection of WSSV (Zhao et al., 2009); Anti-lipopolysaccharide factor (ALF3) performs its anti-WSSV activity by binding to the envelope protein WSSV189 (Methatham et al., 2017); Vago can inhibit WSSV infection by activating the Jak/Stat pathway in shrimp (Gao et al., 2021). However, the detailed regulatory mechanism of SWD against WSSV was unclear, and particular attention was paid to the SWD. We have added the description in Lines 215-220 in the revised manuscript. Thank you for your valuable comments and the logic of the manuscript has been improved.

      Reference:

      Li, S., Lv, X., Yu, Y., Zhang, X., Li, F., 2020. Molecular and Functional Diversity of Crustin-Like Genes in the Shrimp Litopenaeus vannamei, Marine Drugs 18, 361.

      Zhao, Z.Y., Yin, Z.X., Xu, X.P., Weng, S.P., Rao, X.Y., Dai, Z.X., Luo, Y.W., Yang, G., Li, Z.S., Guan, H.J., Li, S.D., Chan, S.M., Yu, X.Q., He, J.G., 2009. A novel C-type lectin from the shrimp Litopenaeus vannamei possesses anti-white spot syndrome virus activity. Journal of Virology 83, 347-356.

      Methatham, T., Boonchuen, P., Jaree, P., Tassanakajon, A., Somboonwiwat, K., 2017. Antiviral action of the antimicrobial peptide ALFPm3 from Penaeus monodon against white spot syndrome virus. Dev Comp Immunol 69, 23-32.

      Gao, J., Zhao, B.R., Zhang, H., You, Y.L., Li, F., Wang, X.W., 2021. Interferon functional analog activates antiviral Jak/Stat signaling through integrin in an arthropod. Cell Rep 36, 109761.

      (f) Likewise, choosing Atta in S2 cells needs logic.

      We appreciated your comments. Our manuscript revealed that febrile temperature inducible HSF1 confers virus resistance by regulating the expression of antimicrobial peptides (AMPs) in L. vannamei. Further, we want to know that whether HSF1 regulation of antimicrobial peptides is a conserved defense mechanism induced by elevated temperature in arthropods, and experiments were performed in an invertebrate model system (Drosophila S2 cells). Previous study showed that DmAMPs (such as Attacin A, Cecropins A, Defensin, Metchnikowin, and Drosomycin) exerted a significant role in the antiviral immunity in Drosophila (Zhu et al., 2013). Our results showed that the expression of Attacin A, Cecropins A and Defensin were remarkably induced by DmHSF, and the expression of Attacin A was the highest induced. Therefore, DmAtta was chosen as a representative to further demonstrate that DmHSF1 exerts its anti-DCV function by regulating DmAMPs. We have added the description in Lines 328-330 and Lines 361-364 in the revised manuscript. Thank you for your valuable comments and the logic of the manuscript has been improved.

      Reference:

      Zhu, F., Ding, H., Zhu, B., 2013. Transcriptional profiling of Drosophila S2 cells in early response to Drosophila C virus. Virol J 10, 210.

      (2) From Figure 6I to 6K, the authors aimed to verify whether the anti-WSSV function of nSWD was mediated by LvHSF1 at high temperatures. However, what they showed was just showing that nSWD plays anti-WSSV function downstream of HSF1. The authors should show additional data for dsControl+rnSWD.

      Thank you for your suggestion. As your suggestion, after knockdown of SWD, shrimp were injection with rLvHSF1 mixed with WSSV. The results showed that the viral load was significantly lower than the control group 48 hours post WSSV infection (Supplementary Fig. 5D). We have added these results to the Supplementary Figure 5C&5D and added a description in Lines 290-293 in the revised manuscript. Thank you for your constructive comments.

      (3) For the physical interaction between nSWD and WSSV, it will be great if the authors perform Alphafold3 prediction analysis (Abramson et al PMID: 38718835).

      Thank you for your suggestion. As you suggestion, we performed Alphafold3 prediction analysis on SWD and WSSV (VP24 and VP26). The predicted template modeling (pTM) score measures the accuracy of the entire structure. A pTM score above 0.5 means the overall predicted fold for the complex might be similar to the true structure. The Alphafold3 prediction results show that there is a possible interaction between SWD and WSSV. Notably, our manuscript demonstrated that rSWD could interact with VP24 and VP26 by pulldown assays and confocal analysis.

      Author response image 3.

      Alphafold3 prediction analysis of SWD&VP24 as follow (pTM = 0.64)

      Author response image 4.

      Alphafold3 prediction analysis of SWD&VP26 as follow (pTM = 0.53)

      Minor comments

      (1) In the Abstract and many other places, the authors need to specifically write "Drosophila S2 cells" instead of "Drosophila" because conventionally Drosophila implies fruit fly as an organism. We don't say cultured human cells as "human" or "Homo sapiens" in papers.

      Thank you for your suggestion. We have modified the description of Drosophila in the revised manuscript. Thank you.

      (2) Figure numbers can be reduced for better readability. I would combine Figures 1 and 2, and Figures 3 and 4. If the combined figures are too crowded, some can go to into supplementary figures.

      Thank you for your suggestion. We have moved the Poly (I: C) data to Supplementary Figure 2 in the revised manuscript. However, we have added some experimental data to Figures 1, 2, 3, and 4. Therefore, we did not combine Figure 1 and Figure 2, and Figures 3 and 4. Thank you.

      (3) One of the best-understood roles of HSF1 in physiology other than heat shock response is longevity, in particular with C. elegans. The authors need to mention this in the Discussion by citing the following recent review paper (Lee PMID: 36380728).

      Thank you for your suggestion. We have supplemented the description of HSF1 regulating longevity and aging of organisms and cited the above reference in the revised manuscript (Lee and Lee, 2022). Thank you.

      Reference:

      Lee, H., Lee, S.V., 2022. Recent Progress in Regulation of Aging by Insulin/IGF-1 Signaling in Caenorhabditis elegans. Mol Cells 45, 763-770.

      (4) Please make your own label for small letter panels or transfer small letter panels to supplementary figures.

      Thank you for your suggestion. We have adjusted the relevant letter labels. The uppercase letters represent the main image of the Figure, and the small letter panels are the corresponding supplementary instructions in the revised manuscript. Thank you.

      (5) In the introduction part, I recommend changing the references for HSFs and HSR with recent ones.

      Thank you for your suggestion. We have added the latest references for HSFs and HSR in the Introduction part of the revised manuscript. Thank you.

      (6) In Figure 1, it is not intuitive to understand the name groups W and TW.

      We appreciated your comments. We have added the description of Group W and Group TW in revised Figure 1. Group W comprised shrimp injected with WSSV and maintained at 25 °C continuously. In contrast, Group TW was subjected to a temperature increase to 32 °C at 24 hours post-injection (hpi). Gill samples were collected for analysis 12 hours post-temperature rise (hptr) and subjected to Illumina sequencing. Thank you.

      (7) Please add some kinds of sequence comparisons of SWD and nSWD for readers to understand the homology.

      We appreciated your comments. We have added the multiple sequence alignment of SWD proteins in shrimp species in revised Supplementary Figure 3. Highly conserved amino acid residues and cysteine and residues are highlighted in red, indicating that LvSWD is a conserved antimicrobial peptide of the Crustin family. Thank you.

      (8) Naming nSWD with "newly identified" is strange as it will not be new anymore as time goes by. Please change the name.

      Thank you for your suggestion. We have modified the name of nSWD to SWD in the revised manuscript. Thank you.

      (9) Please write the full name for Lv (Litopenaeus vannamei), Dm (Drosophila melanogaster), ds (double-stranded) before using LvHSF1, DmHSF1, and dsLvHSF1.

      Thank you for your comments. We have added the full name of LvHSF1, DmHSF1, and dsLvHSF1 in the revised manuscript. Thank you.

      (10) In Figure 2, it will be better to transfer poly I:C data to supplementary figures.

      Thank you for your comments. We have moved the Poly (I: C) data to Supplementary Figure 2 in the revised manuscript. Thank you.

      (11) The label for pGL3-nSWD-M12 is confusing. M1 and M2 are OK. Please change M12 with M1/2 or another one.

      Thank you for your suggestion. We have changed pGL3-nSWD-M12 with pGL3-nSWD-M1/2 in the revised manuscript. Thank you.

    1. Author Response:

      The following is the authors’ response to the previous reviews

      eLife Assessment

      This article presents useful findings on how the timing of cooling affects the timing of autumn bud set in European beech saplings. The study leverages extensive experimental data and provides an interesting conceptual framework for the various ways in which warming can affect but set timing. The statistical analysis is compelling, but indicates some factors that may temper the authors' claims, while the designs of experiments offer incomplete support for the current claims as they rely on one population under extreme conditions for only one year each while a confounding effect (time in a chamber) sometimes lacks a control.

      We thank the editor and reviewers for their consideration of our revised manuscript and for their constructive suggestions. In response to the editor’s guidance, we have ensured that: 1) the experimental design is clearly presented as physiological forcing, 2) the Solstice-as-Phenology-Switch concept is explicitly defined, limited, and framed as inferred, 3) conclusions are strictly aligned with the scope of the evidence, and limitations are acknowledged transparently.

      We hope these revisions fully address the remaining concerns and clarify both the conceptual framework and the appropriate scope of inference.

      Public Review:

      Reviewer #1 (Public review):

      The authors identified the summer solstice (June 21) as a phenological "switch point", but the flexibility of this switch point remains poorly understood. A more precise explanation of what "flexibility" means in this context is needed, along with a description of the specific experimental results that would demonstrate this flexibility.

      We agree that the concept of “flexibility” required clearer definition and a more explicit link to the experimental results. In the Introduction, we now explicitly define flexibility as the capacity for the effective timing of the phenological switch to shift earlier or later depending on developmental progression, rather than occurring at a fixed calendar date. This switch occurs at the compensatory point between the antagonistic influences of early-season development [ESD effect] and late-season temperature [LST effect](L92-98). We have extended and clarified our explanation of the summer solstice’s role in this framework (L69-90). We propose that the solstice acts as an environmental switch that initiates the LST effect, as declining daylengths signal trees to become responsive to late-season cooling (L92-94). The compensatory point then occurs where the advancing ESD effect is balanced by the delaying LST effect. This point should therefore not be fixed to a calendar date but instead vary with developmental progression each year (L75-95).

      In the Discussion, we clarify that flexibility is demonstrated experimentally by the observation that the magnitude of July cooling effects (LST effect) on autumn phenology depend on prior developmental rate (ESD effect) [3.4 times greater delay in late-leafing trees], indicating that the position of the compensatory point is development-dependent rather than fixed to June 21 (L398-410). We have made consistent edits throughout the Discussion, in particular in the ‘Support for the Solstice-as-Phenology-Switch Hypothesis’ subsection (L514-530).

      The experiment did not directly measure the specific date of the phenological switch point. Instead, it was inferred by comparing temperature effects before and after the solstice. The manuscript should clearly state that this switch point remains an inferred conceptual node rather than a directly measured variable.

      We fully agree and have clarified this in the revised manuscript. In the Discussion, we now clearly state that the compensatory point is a conceptual node inferred from responses to cooling before the solstice (June), directly after it (July), or later in the growing season (August) rather than a directly observed phenological event (L352-358 & L405-406).

      In Experiment 1, the effect of bud type (terminal vs. lateral) was inconsistent across the overall model and the different leafing groups. The authors should provide a more thorough discussion of potential reasons for this inconsistency.

      This inconsistency reflects biological complexity. In the Discussion, we now expand our interpretation to note that terminal and lateral buds may differ in developmental status, resource allocation and hormonal context. We emphasize that bud-type effects are therefore expected to be context-dependent and to interact with wholeplant developmental state, which plausibly explains why effects differ across leafing groups and models (L390-396).

      In addition, the statistical model for Experiment 1 indicates that the measured variables (summer cooling and leaf emergence date) explain only 23.4% of the variation in bud formation timing. This leaves over 76% of the variation unexplained, suggesting that other important factors are involved. The discussion should address this limitation in greater depth, moving beyond a focus on the measured variables.

      We now discuss the explained and unexplained variance in more detail. We also make it clear that our experiment was designed to test specific mechanistic pathways rather than to fully explain all phenological variability or maximise predictive power L417-419).

      In the Discussion, we acknowledge that a substantial fraction of variation remains unexplained (L419-421). We discuss the possibility of other physiological mechanisms, such as photosynthetic assimilation, contributing to the unexplained variation (L421-427). However, large inter-individual variability is commonplace in autumn phenology. A low intra-class correlation coefficient (ICC = 0.26; see L276-280 for methods) suggests much of the remaining variation is attributable to individual-level differences rather than missing explanatory variables (L429-431). In line with the literature, we suggest that genetic and epigenetic differences likely contributed significantly to inter-individual variation, even within a single provenance population (L431-434). In this context of high individual variability, leaf-out timing (ESD effect) and summer cooling treatment (LST effect) together explaining 23.4% of variation in bud set timing is biologically meaningful and demonstrates the mechanistic importance of these processes (L438-441). For completeness, we also briefly discuss alternate sources of within-treatment variability (L434-437).

      Reviewer #2 (Public review):

      I think the experiments are interesting, but I found the exact methods of them somewhat extreme compared to how the authors present them.

      We appreciate this concern and have substantially revised the manuscript to clarify the experimental logic. In the Introduction, we now state explicitly that the study uses temperature regimes that were designed as strong physiological forcing treatments, intended to deeply constrain development and isolate mechanisms rather than to simulate natural or future climatic conditions (L113-115).

      In the Methods, we have enhanced our description of the non-linear effects of temperatures below 10°C on physiological processes (L154-158).

      At the start of the Discussion, we have added a dedicated paragraph clarifying the scope of inference: the experiment tests causality and constraint (i.e. whether specific physiological processes can drive phenological shifts), not quantitative responses under realistic climate scenarios (L346-363). Throughout the Discussion, we have revised language that could be read as scenario-based interpretation, replacing it with mechanistic phrasing.

      Further, given that much of the experiment happened outside, I am not sure how much we can generalize from one year for each experiment, especially when conducted on one population of one species.

      Given the large individual variation expected in phenological experiments, we used single experimental populations of single provenance beech saplings to minimise uncontrolled for variation arising from genetic differences (L358-360). This allowed us to elucidate mechanisms despite noisy biological heterogeneity associated with phenology.

      In the last round of revision, we toned down statements of generalisation. In the Discussion, we now go further to clarify what mechanistic understanding can be gleamed directly from our findings and then cautiously make suggestions how these mechanisms may play out in natural systems. We repeatedly state the intention of the study as mechanistic inference rather than predictive power, e.g. “However, extrapolations to more complex natural ecosystems should be made with caution as our experimental design prioritised mechanistic inference over generalisability and predictive power.” (L417-419). Alongside our previous calls for tests on other species, we now additionally call for tests on other provenances of beech (L511-512).

      I was also very concerned by the revisions.

      If this concern stems from the confusion regarding line-numbers and the two submitted versions of the manuscript (with tracked changes and without tracked changes; as required by eLife), then we hope that situation is now clarified. Otherwise, the authors do not understand why our previous revisions would be perceived as being concerning. Regardless, we have made every attempt to address the remaining comments comprehensively.

      Further, I am at a loss about their hypothesis, when they write in their letter: "Importantly, the Solstice-asPhenology-Switch hypothesis does not assume that the reversal is fixed to June 21." Why on earth reference the solstice if the authors do not mean to exactly reference the solstice?

      We appreciate this important conceptual point. The Solstice-as-Phenology-Switch hypothesis is central to our conceptual model and therefore requires clear explanation. In concert with our changes in response to Reviewer 1’s comment regarding flexibility, we have substantially revised and improved our description of this hypothesis (L69-108).

      Whilst the summer solstice is fixed to a calendar date (June 21), the timing of when trees change their autumn phenological responses to temperature is not (L88-90 & L515-517). This occurs when the compensatory point of two antagonistic effects is crossed. Higher early-season development rates (which are driven by temperature) have an advancing (negative) effect on autumn phenology, which we now refer to as the ESD effect (L71-78). Warmer late-season temperatures have a delaying (positive) effect because trees become phenologically susceptible to cooling, i.e. overwintering responses are induced in response to cooling, which we now refer to as the LST effect (L78-82). The point in time when these two effects balance each other out, i.e. the net effect = 0, is the compensatory point (L95-97 & L523-525). The reason this point occurs after the solstice, is because the LST effect only becomes active when days begin to shorten (L92-94 & L522-523). The solstice acts as an environmental switch, initiating trees’ susceptibility to cooling. Therefore, the solstice is referenced in the hypothesis because it forms a daylength barrier. In this framework, the compensatory point cannot occur earlier than the solstice because day lengths are still increasing (L517-519).

      In the Introduction and Discussion, we clarify that the solstice is referenced as a biologically meaningful photoperiodic cue, not as a fixed threshold date. We now emphasise that the hypothesis concerns a seasonal reversal in responses to temperature structured around photoperiod, whose effective timing depends on developmental state, rather than a reversal occurring precisely on June 21. To avoid confusion, we have reworded phrases such as “summer solstice effect reversal” to “reversal of phenological responses to temperature after the summer solstice” (L371). In accordance, we have also changed the title to “Developmental constraints mediate the reversal of temperature effects on the autumn phenology of European beech after the summer solstice”.

      The following comments stem from the first round of review. We have previously revised the manuscript in accordance with these comments. For most of these points we do not see further cause for changes except for any overlap with comments above. We therefore predominantly copy our previous responses in quotes for clarity, the exception being the comment regarding the framing of our results in relation to natural systems.

      The comments below relate to my original review with many of them still applying.

      Methods: As I read the Results I was surprised the authors did not give more info on the methods here. For example, they refer to the 'effect of July cooling' but never say what the cooling was. Once I read the methods I feared they were burying this as the methods feel quite extreme given the framing of the paper.

      “We understand the concern regarding the structure of the manuscript and note that the methods section was moved to the end of the paper in accordance with eLife’s recommended formatting. We have now moved the methods section before the results to ensure that readers are familiar with the treatments before encountering the outcomes.

      Regarding presentation, treatment details are now described in both the Methods and the relevant figure legends. Given this structure, we have chosen not to restate the full treatment conditions in the main Results text to avoid repetition.”

      The paper is framed as explaining observational results of natural systems, but the treatments are not natural for any system in Europe of which I have worked in. For example a low of 2 deg C at night and 7 deg C during the day through end of May and then 7/13 deg C in July is extreme. I think these methods need to be clearly laid out for the reader so they can judge what to make of the experiment before they see the results.

      We appreciate the reviewer’s concern regarding the use of relatively extreme temperature treatments and the need to ensure that our conclusions are consistent with the motivation for using them. The manuscript was also revised in this regard in the previous round, and we copy the relevant responses at the bottom of this response. Despite this, we agree that further explanation of how our experimental treatments suited the aims of our study was still required.

      The aim of these treatments was not to reproduce typical ambient conditions, but to act as a mechanistic probe. Such mechanisms are not readily identifiable from observations or mild manipulations, because the expected effects are small relative to natural variability; stronger perturbations are therefore required to generate a diagnostic contrast. By strongly constraining development in the early-season, and by providing a robust cooling signal in the late-season, we sought to reveal the causal structure underlying the observed solstice-related reversal in temperature effects on autumn phenology.

      Temperatures below 10°C intensively slow down cell division and mitotic rates, these rates then rapidly and non-linearly approach 0 as temperatures drop towards 0°C (Körner, 2021). As reflected in L152-158 of the revised manuscript, we selected a spring cooling regime of 2–7 °C to strongly slow developmental processes while maintaining a clear thermal safety margin that eliminates the risk of frost damage. Although a milder cooling regime (e.g. 5–10 °C) would be less extreme, it would also be expected to produce only a comparatively small reduction in developmental rates, thereby substantially reducing our ability to generate distinct early- and late-developing individuals and to detect carry-over effects on autumn phenology. Applying strong cooling therefore increases signal-to-noise and allows us to detect the underlying mechanism, which would not be possible with temperature treatments that represent average contemporary climatic variation.

      The use of conditions out with the norm is a standard practice to elucidate mechanisms in ecology, where organisms are often pushed to their physiological limits or transplanted into environments fundamentally different to those which they are adapted (Somero, 2010; Berend et al., 2019). Experiments targeting autumn phenology have utilised a broad range of environmental conditions from moderate to extreme manipulations (Tanino et al., 2010). For example, to test the controls of growth cessation and dormancy induction in Prunus species, one study applied a range of treatments including constant 9°C temperature and 24 hour photoperiod between April and July (Heide, 2008).

      Our experimental design aimed to reduce rates of development, cell division and maturation. In the Methods, we describe this aim and clearly state that the experimental design was not intended to mimic natural climatic variation (L154-156 & L181-186). Importantly, our conclusions are framed at the level of direction, timing, and interaction of effects, rather than the magnitude expected under contemporary or future field conditions (L360-363).

      This framing intends to reflect the primary inference of this study, which concerns when and why temperature effects reverse around the solstice, and how this timing depends on developmental state and diel temperature exposure, rather than making quantitative predictions for present-day or future climates. This aligns our conclusions with the experimental design. We have further revised the Discussion to explain these aims and conclusions more clearly, including the addition of a subsection at the beginning titled “Experimental forcing and scope of inference” (L346-363). We have also set up this expectation in the Introduction (L113-115).

      Additionally, we have improved the Discussion in a number of related aspects.

      We explicitly separate mechanistic conclusions and any relation to natural systems, remaining cautious to not overgeneralise or overstate our findings (L417-419).

      We now include a dedicated paragraph explaining that, although these specific conditions are not likely to be found in beech’s range, analogous developmental constraints can arise during cold springs, late cold spells following budburst, or at high-elevation and continental sites where temperatures remain low despite increasing photoperiod (L540-545, L583-588). We further explain that because developmental progression integrates temperature cumulatively over time, even short episodes of strong cooling can exert lasting carry-over effects on seasonal timing, thereby linking the forced experimental responses to processes relevant under natural, fluctuating conditions (L545-550).

      We explicitly state that the decoupling of day and night temperatures was not intended to represent realistic meteorological states (L458-460). We explain that this design was used diagnostically to isolate inherently diel physiological processes (e.g. nocturnal growth, cell division and expansion versus daytime carbon assimilation), and that the observed responses demonstrate the importance of diel timing of temperature exposure rather than the realism of the imposed cycles (L460-468).

      Previous response:

      We recognise that our temperature treatments were severe and do not mimic real world scenarios. They were deliberately designed to create large contrasts in developmental rates, thereby maximising our ability to detect the mechanisms underpinning the solstice switch. For example, the severe cooling between 4 April and 24 May was specifically designed to slow spring development as much as possible without damaging the plants. We have added text in the Methods to clarify this aim.

      I also think the control is confounded with growth chamber experience in Experiment 1. That is, the control plants never experience any time in a chamber, but all the treatments include significant time in a chamber. The authors mention how detrimental chamber time can be to saplings (indeed, they mention an aphid problem in experiment 2) so I think they need to be more upfront about this. The study is still very valuable, but -- again -- we may need to be more cautious in how much we infer from the results.

      We appreciate the reviewer’s concern about the potential confounding effect of chamber exposure in experiment 1. We have now discussed this limitation more explicitly, adding further explanation to the Methods and Discussion.

      Note that chamber-related problems (e.g. aphid infestations) primarily occurred under warm chamber conditions, whereas our experiment 1 cooling treatments maintained low temperatures that suppressed such issues. This means that an equivalent “warm chamber control” could have been associated with its own artefacts, as trees kept under warm chamber conditions would have been exposed to additional stressors that were not present under natural growing conditions. To address this point, we included a chamber control in experiment 2. While aphid abundance was indeed higher in the warm chamber controls, chamber exposure itself had no detectable effect on autumn phenology. This suggests that the main findings of experiment 1 are unlikely to be artefacts of chamber conditions.

      Nevertheless, we agree that chamber exposure remains a potential limitation of experiment 1, which requires clear acknowledgement. We now state this more explicitly in the manuscript while also emphasising that our results are supported by experiment 2 and by converging lines of external evidence.

      Also, I suggest the authors add a figure to explain their experiments as they are very hard to follow. Perhaps this could be added to Figure 1?

      We have now added figures to the methods section to depict the experimental timelines and settings more clearly (Figs. 2 and 3).

      Finally, given how much the authors extrapolate to carbon and forests, I would have liked to see some metrics related to carbon assimilation, versus just information on timing.

      We agree that carbon assimilation is an important component of forest carbon dynamics. However, the primary aim of this study was to identify how developmental state and diel cycles mediate temperature effects on autumn phenology, rather than to quantify carbon assimilation per se. Assessing photosynthetic controls on autumn phenology would require a substantially different experimental design and is therefore beyond the scope of the present study.

      That said, we were able to include measurements of photosynthetic assimilation during pre-solstice cooling (now presented as Fig. S12 for all treatments). These data show that cooling strongly reduced assimilation across all treatments, despite their markedly different phenological outcomes. This supports our interpretation that variation in assimilation alone cannot explain the observed phenological responses, consistent with previous manipulative and observational studies reporting a weak role of late-season assimilation in controlling autumn phenology.

      Fagus sylvatica: Fagus sylvatica is an extremely important tree to European forests, but it also has outlier responses to photoperiod and other cues (and leafs out very late) so using just this species to then state 'our results likely are generalisable across temperate tree species' seems questionable at best.

      We agree that Fagus sylvatica has a stronger photoperiod dependence than many other European tree species. As we note in our response to Reviewer 1, our findings align with previous research across temperate northern forests. Within our framework, interspecific variation in leaf-out timing would not alter the overall response pattern, though it could shift the specific timing of effect reversals. For example, earlier-leafing species may approach completion of development sooner and thus show sensitivity to late-season cooling earlier than F. sylvatica. Nevertheless, we acknowledge the importance of not overstating generality. We have therefore revised the manuscript to phrase conclusions more cautiously and highlight the need for further research across species.

      And the referenced response to Reviewer one:

      We agree that extrapolation from our experiments on Fagus sylvatica to other species and natural forests requires caution. However, it is precisely the controlled nature of our design that allowed us to isolate the precise mechanisms that appear to underpin the solstice switch, highlighting the role of diel and seasonal temperature variation. In natural systems, additional variables such as competition, precipitation, and soil heterogeneity can strongly influence phenology, but they also make it difficult to disentangle causal mechanisms. By minimising these confounding factors, our experiment provided a clear test of how temperature before and after the solstice regulates growth cessation.

      To acknowledge the limitation, we have toned down statements about generalisation (e.g. “likely generalisable” to “other temperate tree species may display similarities”) and explicitly call for follow-up studies across species and forest contexts. At the same time, we highlight that our findings align with independent evidence from manipulative experiments, satellite observations, flux measurements, and groundbased phenology, which suggests the mechanisms we report may extend beyond the specific populations studied here.”

      As described in responses above, we have further clarified what can be directly concluded from our study, avoiding overgeneralisation.

      Measuring end of season (EOS): It's well known that different parts of plants shut down at different times and each metric of end of season -- budset, end of radial expansion, leaf coloring etc. -- relate to different things. Thus I was surprised that the authors ignore all this complexity and seem to equate leaf coloring with budset (which can happen MONTHS before leaf coloring often) and with other metrics. The paper needs a much better connection to the physiology of end of season and a better explanation for the focus on budset. Relatedly, I was surprised the authors cite almost none of the literature on budset, which generally suggests is it is heavily controlled by photoperiod and population-level differences in photoperiod cues, meaning results may different with a different population of plants. 

      We thank the reviewer for pointing out that our discussion of the responses of different EOS metrics needs more clarity. We agree with much of this perspective, and we have added an additional analysis of leaf chlorophyll content data to use leaf discolouration as an alternative EOS marker. On this we would like to make two important points:

      Firstly, we agree that bud set often occurs before leaf discolouration, although this can depend on which definition of leaf discolouration is used. In experiment 1, budset occurred on average on day-of-year (DOY) 262 and leaf senescence (50% loss of leaf chlorophyll) occurred on DOY 320. However, we do not necessarily agree that this excludes the combined discussion of bud set and leaf senescence timing. Whilst environmental drivers can affect parts of plants differently, often responses from different end-of-season indicators (e.g. bud set and loss of leaf chlorophyll) are similar, even if only directionally. Figure S11 shows how, across both experiments, treatment effects were tightly conserved (R<sup>2</sup> = 0.49) amongst the two phenometrics. In accordance with these revisions, we have updated the manuscript title to “Developmental constraints mediate the summer solstice reversal of climate effects on the autumn phenology of European beech”.

      Secondly, shifts in bud set timing remain the primary focus of the manuscript as these shifts are of direct physiological relevance to plant development and dormancy induction, whereas leaf discolouration may simply follow bud set as a symptom of developmental completion. This is supported by our results, which show stronger responses of bud set than leaf senescence (Figs. 4 & 5 vs. Figs. S9 & S10).

      Following the reviewer’s suggestion, we have included more references on the topic of bud set and its environmental controls. The reviewer rightly stresses that photoperiod is considered the most important factor. Photoperiod is therefore key in our conceptual model. However, the responses we observed in F. sylvatica cannot be explained by photoperiod alone. For example, in experiment 1, July cooling delayed the autumn phenology of late-leafing trees but had negligible impact on early-leafing trees, even though both experienced the exact same photoperiod. Moreover, in experiment 2, day, night and full-day cooling showed substantial variations in their effects despite equal photoperiod across the climate regimes. This is why we suggest that the annual progression of photoperiod modulates the responses to temperature variations instead of eliciting complete control.

      Following the addition of an analysis of leaf senescence data, we also revised the terminology in places (including the title) from “primary growth cessation/bud set” to the broader term “autumn phenology.” This term is intended to encompass two distinct but related physiological processes—bud set and leaf senescence—both of which are commonly used as markers of autumn phenology and the end of the growing season.

      Somewhat minor comments:

      (1) How can a bud type -- which is apical or lateral -- be a random effect? The model needs to try to estimate a variance for each random effect so doing this for n=2 is quite odd to me. I think the authors should also report the results with bud type as fixed, or report the bud types separately.

      We have revised the analysis to include bud type as a fixed effect. There are only very minor numerical adjustments (e.g. rounding to 4.8 days instead of 4.9) and inferences are not altered. We also report the bud type effects for experiment 1 and experiment 2.

      (2) I didn't fully see how the authors results support the Solstice as Switch hypothesis, since what timing mattered seemed to depend on the timing of treatment and was not clearly related to solstice. Could it be that these results suggest the Solstice as Switch hypothesis is actually not well supported (e.g., line 135) and instead suggest that the pattern of climate in the summer months affects end of season timing?

      Our responses to the main comments in this new round of revision have comprehensively covered this topic.

      References

      Berend K, Haynes K, MacKenzie CM. 2019. Common garden experiments as a dynamic tool for ecological studies of alpine plants and communities in northeastern North America. Rhodora 121: 174.

      Heide OM. 2008. Interaction of photoperiod and temperature in the control of growth and dormancy of Prunus species. Scientia Horticulturae 115: 309–314.

      Körner C. 2021. Alpine Plant Life: Functional Plant Ecology of High Mountain Ecosystems. Cham: Springer International Publishing.

      Somero GN. 2010. The physiology of climate change: how potentials for acclimatization and genetic adaptation will determine ‘winners’ and ‘losers’. Journal of Experimental Biology 213: 912–920.

      Tanino KK, Kalcsits L, Silim S, Kendall E, Gray GR. 2010. Temperature-driven plasticity in growth cessation and dormancy development in deciduous woody plants: a working hypothesis suggesting how molecular and cellular function is affected by temperature during dormancy induction. Plant Molecular Biology 73: 49–65.

    1. Reviewer #1 (Public review):

      Summary:

      The authors use a gambling task with momentary mood ratings from Rutledge et al. and compare computational models of choice and mood to identify markers of decisional and affective impairments underlying risk-prone behavior in adolescents with suicidal thoughts and behaviors (STB). The results show that adolescents with STB show enhanced gambling behavior (choosing the gamble rather than the sure amount), and this is driven by a bias towards the largest possible win rather than insensitivity to possible losses. Moreover, this group shows a diminished effect of receiving a certain reward (in the non-gambling trials) on mood. The results were replicated in a general online sample where participants were divided into groups with or without STB based on their self-report of suicidal ideation on one question in the Beck Depression Inventory self-report instrument. The authors suggest, therefore, that adolescents diagnosed with depression or anxiety with decreased sensitivity to certain rewards may need to be monitored more closely for STB due to their increased propensity to take risky decisions aimed at (expected) gains (such as relief from an unbearable situation through suicide) regardless of the potential losses. However, such a result was only found in the clinical sample and cannot be generalized more broadly based on the current findings.

      Strengths:

      (1) The study uses a previously validated task design and replicates previously found results through well-explained model-free and model-based analyses.

      (2) Sampling of adolescents at high risk can help target early preventative diagnoses and treatments for suicide.

      (3) Replication of the results in an online cohort increases confidence in the findings.

      (4) The models considered for comparison are thorough and well-motivated. The chosen models allow for teasing apart which decision and mood sensitivity parameters relate to risky decision-making across groups based on their hypotheses.

      (5) Novel finding of mood (in)sensitivity to non-risky rewards and its relationship with risk behavior in STB.

      Weaknesses:

      (1) Sample size of 25 for S- group is low-powered, which is explicitly mentioned as a study limitation.

      (2) Modeling in the mediation analysis focused on predicting risk behavior in this task from the model-derived bias for gains and suicidal symptom scores. Thus, the implications of this work are more relevant to a basic-science understanding of the etiology of suicidal behavior than they are useful as a predictor of suicidal behavior, and it is not clear that a psychiatrist or psychologist could use this task to potentially determine who is at higher risk of attempting suicide and must be more closely monitored. Indeed, relationships between task parameters and behavior and suicidal behavior was limited to the clinical sample with a diagnosis of depression or anxiety disorder, and did not extend to the online sample. Therefore, the claim that these findings provide "computational markers for general suicidal tendency among adolescents" is unwarranted.

    2. Reviewer #2 (Public review):

      Summary:

      This article addresses a very pertinent question - what are the computational mechanisms underlying risky behaviour in patients having attempted suicide. In particular, it is impressive how the authors find a broad behavioral effect whose mechanisms they can then explain and refine through computational modeling. This work is important because currently, beyond previous suicide attempts, there has been a lack of predictive measures. This study is the first step towards that: understanding the cognition on a group level. Before then being able to include it in future predictive studies (based on the cross-sectional data, this study by itself cannot assess the predictive validity of the measure).

      Strengths:

      - Large sample size<br /> - Replication of their own findings<br /> - Well-controlled task with measures of behaviour and mood + precise and well-validated computational modeling

      Questions, based on revised manuscript and replies to other reviewers:

      (1) Replies to reviewers in general: Bayes Factors have been added, it would be good to also use common verbal terms to describe them (e.g. 'anecdotal', 'moderate' etc). For example, my reading of table S8 would be that for gambling rate there is only anecdotal evidence that it does not relate to PSWQ, BDI, and moderate evidence it does not relate to TAI.

      (2) Reply to reviewer 1 Q2 (Predicting STB):<br /> For the regression predicting suicidal ideation, it seems to me that what you did was a regression STB ~ gambling behaviour + approach + mood? Could you clarify? I had expected as a test of whether the task can predict STB risk something slightly different - a cross-validation (LOO or maybe 5-fold in the large sample): STB ~ gambling behaviour + approach [parameter from model] + mood [parameter from model]; and then computing in the left out participants: predicted STB. Then checking correlation between STB and predicted STB. This would allow testing whether the diverse task measures together predict STB (with the caveat, that it's cross-validated, rather than hold-out sample, unless you could train on one sample (in lab) and test on the other (online).

      (3) Reply to reviewer 2 Q1 (parameter recovery): I'm looking at S3, it seems to still show only the scatter plots and not the correlation matrices, which are now added as text notes. Can you actually show these matrices? An off-diagonal correlation of 0.63 appears quite high. I think it needs to be discussed exactly which parameters those are, and whether that impacts the interpretation of the results.

      (4) Reply to reviewer 3 Q3 (mood model): I would have imagined that the response would involve changing the mood equations (equation 8 main text) to include a term for whether the participant gambled or not, independent of the gamble value.

    3. Author response:

      The following is the authors’ response to the original reviews

      eLife Assessment

      This valuable study combined careful computational modeling, a large patient sample, and replication in an independent general population sample to provide a computational account of a difference in risk-taking between people who have attempted suicide and those who have not. It is proposed that this difference reflects a general change in the approach to risky (high-reward) options and a lower emotional response to certain rewards. Evidence for the specificity of the effect to suicide, however, is incomplete, which would require additional analyses.

      We thank the editors and reviewers for this important assessment. Based on clinical interviews, we included patients with and without suicidality (S<sup>+</sup> and S<sup>-</sup> groups). However, in line with suicidal-related literature (e.g., Tsypes et al., 2024), two groups also differed substantially in the severity of symptoms (see Table 1). To address the request for evidence on specificity to suicidality beyond general symptom severity, we performed separate linear regressions to explain in gambling behaviour, value-insensitive approach parameter (β<sub>gain</sub>), and mood sensitivity to certain rewards (β<sub>CR</sub>) with group as a predictor (1 for S<sup>+</sup> group and 0 for S<sup>-</sup> group) and scores for anxiety and depression as covariates. Results remained significant after controlling anxiety and depression (ps < 0.027; Table S8). Given high correlations among anxiety and depression questionnaires (rs > 0.753, ps < 0.001), we performed Principal Components Analysis (PCA) on the clinical questionnaire to extract the orthogonal components, where each component explained 86.95%, 7.09%, 3.27%, and 2.68% variance, respectively. We then performed linear regressions using these components as covariates to control for anxiety and depression. Our main results remained significant (ps < 0.027; Table S9). We believe that these analyses provide evidence that the main effects on gambling and on mood were specific to suicide.

      Moreover, as Reviewer 3 pointed out, these “absence of evidence” cannot provide insights of “evidence of absence”. Although we median-split patients by the scores of general symptoms (e.g., depression and anxiety-related questionnaires) and verified no significant differences in these severities (Figure S11), we additionally conducted Bayesian statistics in gambling behavior, value-insensitive approach parameter, and mood sensitivity to certain rewards. BF<sub>01</sub> is a Bayes factor comparing the null model (M<sub>0</sub>) to the alternative model (M<sub>1</sub>), where M<sub>0</sub> assumes no group difference. BF<sub>01</sub> > 1 indicates that evidence favors M<sub>0</sub>. As can be seen in Table S7, most results supported null hypothesis, suggesting that general symptoms of anxiety and depression overall did not influence our main results. Overall, we believe that these analyses provide compelling evidence for the specificity of the effect to suicide, above and beyond depression and anxiety.

      Beyond these specific findings, this work highlights the broader utility of computational modelling and mood to better understand behavioral effect, showing how to use both mood and choice data to better comprehend a psychiatric issue. 

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      The authors use a gambling task with momentary mood ratings from Rutledge et al. and compare computational models of choice and mood to identify markers of decisional and affective impairments underlying risk-prone behavior in adolescents with suicidal thoughts and behaviors (STB). The results show that adolescents with STB show enhanced gambling behavior (choosing the gamble rather than the sure amount), and this is driven by a bias towards the largest possible win rather than insensitivity to possible losses. Moreover, this group shows a diminished effect of receiving a certain reward (in the non-gambling trials) on mood. The results were replicated in an undifferentiated online sample where participants were divided into groups with or without STB based on their self-report of suicidal ideation on one question in the Beck Depression Inventory self-report instrument. The authors suggest, therefore, that adolescents with decreased sensitivity to certain rewards may need to be monitored more closely for STB due to their increased propensity to take risky decisions aimed at (expected) gains (such as relief from an unbearable situation through suicide), regardless of the potential losses.

      Strengths:

      (1) The study uses a previously validated task design and replicates previously found results through well-explained model-free and model-based analyses.

      (2) Sampling choice is optimal, with adolescents at high risk; an ideal cohort to target early preventative diagnoses and treatments for suicide.

      (3) Replication of the results in an online cohort increases confidence in the findings.

      (4) The models considered for comparison are thorough and well-motivated. The chosen models allow for teasing apart which decision and mood sensitivity parameters relate to risky decision-making across groups based on their hypotheses.

      (5) Novel finding of mood (in)sensitivity to non-risky rewards and its relationship with risk behavior in STB.

      Weaknesses:

      (1) The sample size of 25 for the S- group was justified based on previous studies (lines 181-183); however, all three papers cited mention that their sample was low powered as a study limitation.

      We thank the Reviewer for rising this concern. We agree that the sample size for S<sup>-</sup> group (n=25) is modest, and the prior studies we cited also acknowledged limited power. We wanted to point out that we obtained a comparable sample size to a prior study. In the revision, we therefore updated the section to justify this sample size in which we acknowledge the limited power of our study in the limitation section. Please see our clarification below:

      Page 32:

      “Third, despite replicating our main results in an independent dataset (n=747), the modest S<sup>-</sup> subgroup size (n=25) has a limited statistical power.”

      (2) Modeling in the mediation analysis focused on predicting risk behavior in this task from the model-derived bias for gains and suicidal symptom scores. However, the prediction of clinical interest is of suicidal behaviors from task parameters/behavior - as a psychiatrist or psychologist, I would want to use this task to potentially determine who is at higher risk of attempting suicide and therefore needs to be more closely watched rather than the other way around (predicting behavior in the task from their symptom profile). Unfortunately, the analyses presented do not show that this prediction can be made using the current task. I was left wondering: is there a correlation between beta_gain and STB? It is also important to test for the same relationships between task parameters and behavior in the healthy control group, or to clarify that the recommendations for potential clinical relevance of these findings apply exclusively to people with a diagnosis of depression or anxiety disorder. Indeed, in line 672, the authors claim their results provide "computational markers for general suicidal tendency among adolescents", but this was not shown here, as there were no models predicting STB within patient groups or across patients and healthy controls.

      Thank you for these thoughtful comments. Our study focuses on why adolescent patients with suicidality have increased risk behavior, aiming to provide a mechanism-based target for suicide prevention. Therefore, our dependent variable in the mediation model was gambling behavior. We also agree that the clinically relevant question is whether suicidality can be predicted from task-derived behavior/parameters. We thus used risky behavior and the potential mental parameters to predict STB. Linear regressions showed that gambling behavior, as well as the value-insensitive approach parameter, can predict suicidal symptom scores among patients (former: β = 9.189, t = 2.004, p = 0.048; latter: β = 5.587, t = 2.890, p = 0.005). In healthy controls, these predictions failed (gambling behavior: β = 1.471, t = 0.825, p = 0.411; approach: β = 0.874, t = 1.178, p = 0.241). These results suggest that clinical relevance of these findings apply exclusively to people with a diagnosis of depression or anxiety disorder. We found same patterns for the mood parameter (mood sensitivity to certain rewards: patients: β = -28.706, t = -2.801, p = 0.006; healthy controls: β = -2.204, t = -0.528, p = 0.599). In sum, we believe that our statement of “computational markers for general suicidal tendency among adolescents” is reasonable now. Please see our revisions below:

      Page 17:

      “Furthermore, linear regression showed that gambling rate can predict the current suicidal ideation score (BSI-C, β = 9.189, t = 2.004, p = 0.048) among patients, but not among HC (β = 1.471, t = 0.825, p = 0.411), suggesting that gambling behavior has patient-specific predictive utility for suicidal symptoms.”

      Page 19:

      “Furthermore, linear regression showed that approach parameter can predict the current suicidal ideation score (β = 5.587, t = 2.890, p = 0.005) among patients, but not among HC (β = 0.874, t = 1.178, p = 0.241), suggesting that value-insensitive approach parameter has patient-specific predictive utility for suicidal symptoms.”

      Page 21:

      “Furthermore, linear regression showed that mood sensitivity to CR can predict the current suicidal ideation score (β = -28.706, t = -2.801, p = 0.006) among patients, but not among HC (β = -2.204, t = 0.528, p = 0.599), suggesting that mood sensitivity to CR has patient-specific predictive utility for suicidal symptoms.”

      (3) The FDR correction for multiple comparisons mentioned briefly in lines 536-538 was not clear. Which analyses were included in the FDR correction? In particular, did the correlations between gambling rate and BSI-C/BSI-W survive such correction? Were there other correlations tested here (e.g., with the TAI score or ERQ-R and ERQ-S) that should be corrected for? Did the mediation model survive FDR correction? Was there a correction for other mediation models (e.g., with BSI-W as a predictor), or was this specific model hypothesized and pre-registered, and therefore no other models were considered? Did the differences in beta_gain across groups survive FDR when including comparisons of all other parameters across groups? Because the results were replicated in the online dataset, it is ok if they did not survive FDR in the patient dataset, but it is important to be clear about this in presenting the findings in the patient dataset.

      Thank you for raising the important issue of multiple testing and for asking us to clarify exactly which tests were covered by the FDR procedure. In the clinical dataset we conducted a large number of inferential tests (χ<sup>2</sup>, t-tests, ANOVAs, regressions) spanning: (i) group differences in demographic/clinical characteristics; (ii) sanity checks (e.g., anxiety/depression questionnaires); (iii) primary hypotheses (e.g., group differences in risky behavior); (iv) model-based analyses (parameter checks and between-group contrasts); and (v) control/sensitivity analyses. Post-hoc t-tests were performed only when the three-group ANOVA was significant. This yielded >150 p-values. FDR was applied using all these p-values. Please see our clarification below:

      Supplementary Page 4:

      “Supplementary Note 8: Clarification for FDR correction.

      In the clinical dataset we conducted a large number of inferential tests (χ<sup2\</sup>, t-tests, ANOVAs, regressions) spanning: (i) group differences in demographic/clinical characteristics; (ii) sanity checks (e.g., anxiety/depression questionnaires); (iii) primary hypotheses (e.g., group differences in risky behavior); (iv) model-based analyses (parameter checks and between-group contrasts); and (v) control/sensitivity analyses. Post-hoc t-tests were performed only when the three-group ANOVA was significant. This yielded >150 p-values. FDR was applied using all these p-values.”

      (4) There is a lack of explicit mention when replication analyses differ from the analyses in the patient sample. For instance, the mediation model is different in the two samples: in the patient sample, it is only tested in S+ and S- groups, but not in healthy controls, and the model relates a dimensional measure of suicidal symptoms to gambling in the task, whereas in the online sample, the model includes all participants (including those who are presumably equivalent to healthy controls) and the predictor is a binary measure of S+ versus S- rather than the response to item 9 in the BDI. Indeed, some results did not replicate at all and this needs to be emphasized more as the lack of replication can be interpreted not only as "the link between mood sensitivity to CR and gambling behavior may be specifically observable in suicidal patients" (lines 582-585) - it may also be that this link is not truly there, and without a replication it needs to be interpreted with caution.

      Thank you for these important comments. This study focused on cognitive and affective computational mechanisms underlying increased risky behavior in STB. Accordingly, we compared patients with STB (S<sup>+</sup>) with patients without STB (S<sup>-</sup>) and healthy controls (HC) to examine the effects of STB on risky behavior. Therefore, group comparison, instead of dimensional measure of suicidal symptoms by Beck Scale for Suicidal Ideation, can answer our research questions directly.

      To enhance consistency between the clinical and replication datasets, we included all participants in each dataset when performing the mediation analysis. Given that S<sup>-</sup> and HC did not differ in gambling behavior or the approach parameter in the clinical dataset, we merged these two groups. In the replication dataset, to mirror the S<sup>+</sup> vs. S<sup>-</sup> contrast used clinically, we categorized the general sample into S+ and S<sup>-</sup> based on BDI item 9. The mediation results remained significant in both datasets (the clinical dataset: a×b = 0.321, 95% CI = [0.070, 0.549], p = 0.016; the replication dataset: a×b = 0.143, 95% CI = [0.016, 0.288], p = 0.031), suggesting that STB is associated with increased risk behavior via stronger approach motivation.

      We also acknowledge the non-replication of the correlation between gambling behavior and mood sensitivity to certain rewards in the online sample. While this pattern might indicate that the link is specific to suicidal patients, it may also reflect sample-specific or unstable effects; thus, we now state this explicitly and interpret the finding with caution. Please see our revisions below:

      Page 15:

      “We next verified our results in an independent dataset, including the same task and BDI questionnaire in 747 general participants (500 females; age: 20.90±2.41) (46). One item in BDI involves the measurement of STB. In item 9 of BDI, participants chose one option that describes them best: Option 1, “I don't have any thoughts of killing myself.”; Option 2, “I have thoughts of killing myself, but I would not carry them out.”; Option 3, “I would like to kill myself.”; Option 4, “I would kill myself if I had the chance.”. In line with the current definition of S<sup>+</sup>/S<sup>-</sup> in the clinical dataset, we identified S<sup>+</sup> group as choosing Option 2, 3, or 4, while participants selecting Option 1 were categorized as S<sup>-</sup> group.”

      Page 19:

      “Given significant correlations between group, approach parameter, and gambling rate for gain trials (ps < 0.017), we further conducted a mediation analysis with the assumption of the mediating effect of approach motivation of suicidality on the risk behavior. Given that we aimed to test the effect of STB, with S<sup>-</sup> and HC as controls, and given that S<sup>-</sup> and HC did not differ in gambling behavior or in the approach parameter, we merged these two groups for the mediation analysis. Results supported our hypothesis (a×b = 0.321, 95% CI = [0.070, 0.549], p = 0.016; Figure 2C), confirming that suicidal thoughts and behavior increase risk behavior through stronger approach motivation.”

      Page 26:

      “However, we did not observe any significant correlation between mood sensitivity to CR and gambling behavior (ps > 0.389), which suggests that the link between mood sensitivity to CR and gambling behavior may be specifically observable in suicidal patients. Alternatively, this non-replicated result may also reflect sample-specific or unstable effects, which needs to be interpreted with caution.”

      (5) In interpreting their results, the authors use terms such as "motivation" (line 594) or "risk attitude" (line 606) that are not clear. In particular, how was risk attitude operationalized in this task? Is a bias for risky rewards not indicative of risk attitude? I ask because the claim is that "we did not observe a difference in risk attitude per se between STB and controls". However, it seems that participants with STB chose the risky option more often, so why is there no difference in risk attitude between the groups?

      Thank you for pointing out the ambiguity. In our manuscript, “motivation” and “risk attitude” are defined at the computational level. Following prior work with this task Rutledge et al., (2015, 2016), we decompose observed gambling into (i) value-dependent valuation parameters that capture risk attitude (e.g., risk aversion and loss aversion, which scale the subjective value of outcomes), and (ii) value-insensitive, valence-dependent biases that capture approach/avoidance motivation. Accordingly, a higher gambling rate does not imply a change in risk attitude per se: it can arise from an increased value-insensitive approach bias even when risk-attitude parameters are comparable between groups—which is what we observe for S<sup>+</sup> vs. controls. We have clarified this point in the computational modeling section.

      Pages 12-13:

      “Please note that a higher gambling rate does not imply a change in risk attitude per se: it can arise from an increased value-insensitive approach bias even when risk-attitude parameters are comparable between groups. Risk attitude is indeed conceptualized in economics as the curvature of the utility function (i.e., the subjective value) of the objective outcomes, with concave curves associated with risk aversion, and convex curves associated with risk seeking (54,56). By contrast, the approach or avoidance bias apply to all the value. A possible interpretation of the approach bias is that participant approach the option with the highest possible gain (the lottery) in the gain frame; the avoidance bias would then reflect a tendency to systematically avoid the highest potential losses (the lottery) in the loss frame.”

      Reviewer #2 (Public review):

      Summary:

      This article addresses a very pertinent question: what are the computational mechanisms underlying risky behaviour in patients who have attempted suicide? In particular, it is impressive how the authors find a broad behavioural effect whose mechanisms they can then explain and refine through computational modeling. This work is important because, currently, beyond previous suicide attempts, there has been a lack of predictive measures. This study is the first step towards that: understanding the cognition on a group level. This is before being able to include it in future predictive studies (based on the cross-sectional data, this study by itself cannot assess the predictive validity of the measure).

      Strengths:

      (1) Large sample size.

      (2) Replication of their own findings.

      (3) Well-controlled task with measures of behaviour and mood + precise and well-validated computational modeling.

      Weaknesses:

      I can't really see any major weakness, but I have a few questions:

      (1) I can see from the parameter recovery that the parameters are very well identified. Is it surprising that this is the case, given how many parameters there are for 90 trials? Could the authors show cross-correlations? I.e., make a correlation matrix with all real parameters and all fitted parameters to show that not only the diagonal (i.e., same data is the scatter plots in S3) are high, but that the off-diagonals are low.

      Thank you for raising these thoughtful concerns. The current task consisted of 90 choices and 36 mood ratings. There were 5 choice parameters and 4 mood parameters. The apparently strong identifiability is not unexpected, as 90 choice trials and 36 mood ratings are comparable to those in prior computational modeling literature (Blain & Rutledge, 2022).

      As suggested, we computed cross-correlations between all generating (“true”) and recovered (“fitted”) parameters. The resulting matrix showed high diagonal (choice winning model: rs > 0.91; mood winning model: rs > 0.90) and low off-diagonal (choice winning model: abs(rs) < 0.63; mood winning model: abs(rs) > 0.40) correlations, further supporting parameter recovery. Please see our clarifications below:

      Supplementary Pages 2-3:

      “Parameter recovery: Figure S3 shows good parameter recovery for both choice and mood winning model (choice: rs > 0.91, ps < 0.001; intraclass coefficients > 0.78; mood: rs > 0.90, ps < 0.001; intraclass coefficients > 0.86). Moreover, we computed cross-correlations between all generating (“true”) and recovered (“fitted”) parameters. The resulting matrix showed high diagonal (choice winning model: rs > 0.91; mood winning model: rs > 0.90) and low off-diagonal (choice winning model: abs(rs) < 0.63; mood winning model: abs(rs) > 0.40) correlations, further supporting parameter recovery.”

      Page 10:

      “The numbers of choice trials and mood ratings were comparable to those in prior computational modeling studies (34,35).”

      (2) Could the authors clarify the result in Figure 2B of a correlation between gambling rate and suicidal ideation score, is that a different result than they had before with the group main effect? I.e., is your analysis like this: gambling rate ~ suicide ideation + group assignment? (or a partial correlation)? I'm asking because BSI-C is also different between the groups. [same comment for later analyses, e.g. on approach parameter].

      Thank you for pointing out the lack of clarity. We performed group difference analysis and correlation of suicidal ideation analysis, separately. We first performed group difference analysis to test our hypothesis of STB effects. We then conducted correlational analysis to further specify our findings.

      (3) The authors correlate the impact of certain rewards on mood with the % gambling variable. Could there not be a more direct analysis by including mood directly in the choice model?

      Thank you for this insightful suggestion. As suggested, we tried to integrate mood into choice models by adding mood bias component(s) in line with previous literature (Vinckier et al., 2018). The first model (mcM1) assumes that mood biases choice, building on cM3 (the winning choice model). cmM2 further separated the mood bias parameter into two components according to participants’ choices.

      However, model comparison using BIC supported cM3 (Table S6), that is, without consideration of mood in choice modeling. This can be due to the lack of block design in our experimental design unlike e.g., Vinckier et al., (2018) and Eldar & Niv, (2015). Please see our clarifications below:

      Supplementary Pages 3-4:

      “Supplementary Note 6: integration of mood into choice models

      Although we modeled choice and mood separately to examine cognitive and affective mechanisms underlying increased risk behavior in adolescent suicidal patients, one interesting question was whether mood responses influence subsequent gambling choices and how to model them. First, we median-split mood responses (except the final rating) to compare gambling rate. Results showed a trend for less gambling rate in higher mood (t = -1.971, p = 0.050). However, there was no significant group difference (F = 0.680, p = 0.507). Second, with the assumption that mood biases choice, we constructed mcM1 based on cM3 (the winning choice model).

      Based on our finding of the negative correlation between mood sensitivity to certain rewards and gambling rate in S<sup>+</sup>, we separated β<sub>Mood</sub> parameter into β<sub>Mood-CR</sub> and β<sub>Mood-GR</sub> (cmM2).

      Model comparison using BIC supported cM3 (Table S6), that is, without consideration of mood in choice modeling. The mood bias parameters in neither cM2 nor cM3 reached significance (ps > 0.091), which may be due to the absence of a blocked design in our experiment, unlike in Vinckier et al. (2018) and Eldar and Niv (2015).”

      (4) In the large online sample, you split all participants into S+ and S-. I would have imagined that instead, you would do analyses that control for other clinical traits. Or, for example, you have in the S- group only participants who also have high depression scores, but low suicide items.

      Thank you for this insightful suggestion. Following prior suicide-related literature (Tsypes et al., 2024), we controlled for depression by including them as covariates. Note that depression scores were derived from our established bifactor model (Wang et al., 2025), which decomposed depression from the anxiety. These results remained largely significant (ps ≤ 0.050), except a marginally significant effect of group on gambling behavior (p = 0.059). Despite a trend, this effect with covariates of depression-related questionnaires is strong in our clinical cohort (p = 0.024; Table S8). This suggests that the link between suicidality and risky behavior persists above and beyond general depressive symptoms.

      Please see our clarifications below:

      Page 26:

      “After controlling for depression severity using our established bifactor model (see ref 60 for details), these results remained significant (ps ≤ 0.050), except a marginally significant effect of group on gambling behavior (p = 0.059). Despite a trend, this effect with covariates of depression-related questionnaires is strong in our clinical cohort (p = 0.024; Table S8). This suggests that the link between suicidality and risky behavior persists above and beyond general depressive symptoms.”

      Reviewer #3 (Public review):

      This manuscript investigates computational mechanisms underlying increased risk-taking behavior in adolescent patients with suicidal thoughts and behaviors. Using a well-established gambling task that incorporates momentary mood ratings and previously established computational modeling approaches, the authors identify particular aspects of choice behavior (which they term approach bias) and mood responsivity (to certain rewards) that differ as a function of suicidality. The authors replicate their findings on both clinical and large-scale non-clinical samples.

      (1) The main problem, however, is that the results do not seem to support a specific conclusion with regard to suicidality. The S+ and S- groups differ substantially in the severity of symptoms, as can be seen by all symptom questionnaires and the baseline and mean mood, where S- is closer to HC than it is to S+. The main analyses control for illness duration and medication but not for symptom severity. The supplementary analysis in Figure S11 is insufficient as it mistakes the absence of evidence (i.e., p > 0.05) for evidence of absence. Therefore, the results do not adequately deconfound suicidality from general symptom severity.

      Thank you for this important comment. Based on clinical interviews, we included patients with and without suicidality (S<sup>+</sup> and S<sup>-</sup> groups). However, in line with suicidal-related literature (e.g., Tsypes et al., 2024), two groups also differed substantially in the severity of symptoms (see Table 1). To address the request for evidence on specificity to suicidality beyond general symptom severity, we performed separate linear regressions to explain in gambling behaviour, value-insensitive approach parameter (β<sub>gain</sub>), and mood sensitivity to certain rewards (β<sub>CR</sub>) with group as a predictor (1 for S<sup>+</sup> group and 0 for S<sup>-</sup> group) and scores for anxiety and depression as covariates. Results remained significant after controlling anxiety and depression (ps < 0.027; Table S8). Given high correlations among anxiety and depression questionnaires (rs > 0.753, ps < 0.001), we performed Principal Components Analysis (PCA) on the clinical questionnaire to extract the orthogonal components, where each component explained 86.95%, 7.09%, 3.27%, and 2.68% variance, respectively. We then performed linear regressions using these components as covariates to control for anxiety and depression. Our main results remained significant (ps < 0.027; Table S9). We believe that these analyses provide evidence that the main effects on gambling and on mood were specific to suicide.

      As pointed out, these “absence of evidence” cannot provide insights of “evidence of absence”. Although we median-split patients by the scores of general symptoms (e.g., depression and anxiety-related questionnaires) and verified no significant differences in these severities (Figure S11), we additionally conducted Bayesian statistics in gambling behavior, value-insensitive approach parameter, and mood sensitivity to certain rewards. BF<sub>01</sub> is a Bayes factor comparing the null model (M<sub>0</sub>) to the alternative model (M₁), where M<sub>0</sub> assumes no group difference. BF<sub>01</sub> > 1 indicates that evidence favors M<sub>0</sub>. As can be seen in Table S7, most results supported null hypothesis, suggesting that general symptoms of anxiety and depression overall did not influence our main results. Overall, we believe that these analyses provide compelling evidence for the specificity of the effect to suicide, above and beyond depression and anxiety.

      Please see our revisions below:

      Page 17:

      “Within patients, this group effect on gambling rate remained significant after controlling for sex, illness duration, family history, diagnosis, and various medications use (ps < 0.05), as well as general symptoms (e.g., depression and anxiety; p = 0.024; also see Figure S11, Table S7 and Table S8). Given high correlations among anxiety and depression questionnaires (rs > 0.753, ps < 0.001), we performed Principal Components Analysis (PCA) to extract main components, where each component explained 86.95%, 7.09%, 3.27%, and 2.68% variance, respectively. To further control for anxiety and depression, linear regression using these components as covariates revealed that the group effect on gambling rate remained significant (p = 0.024; Table S9).”

      Pages 18-19:

      “Within patients, this group effect on the approach parameter remained significant after controlling for sex, illness duration, family history, diagnosis, and various medications use (ps < 0.05), as well as general symptoms (e.g., depression and anxiety; p = 0.027; also see Figure S11, Table S7 and Table S8). Linear regression using PCA components as covariates revealed that the group effect on approach parameter remained significant (p = 0.027; Table S9).”

      Page 21:

      “Within patients, this group effect on βCR remained significant after controlling for gambling rate, earnings, mood-related outcome effect, mood drift effect, sex, illness duration, family history, diagnosis, and various medications use (ps < 0.032), as well as general symptoms (e.g., depression and anxiety; p = 0.001; also see Figure S11, Table S7 and Table S8). Linear regression using PCA components as covariates revealed that the group effect on this mood parameter remained significant (p = 0.001; Table S9).”

      (2) The second main issue is that the relationship between an increased approach bias and decreased mood response to CR is conceptually unclear. In this respect, it would be natural to test whether mood responses influence subsequent gambling choices. This could be done either within the model by having mood moderate the approach bias or outside the model using model-agnostic analyses.

      Thank you for this important suggestion. As suggested, one interesting question was whether mood responses influence subsequent gambling choices and how to model them. First, we median-split mood responses (except the final rating) to compare gambling rate. Results showed a trend for less gambling rate in higher mood (t = -1.971, p = 0.050). However, there was no significant group difference (F = 0.680, p = 0.507). Second, with the assumption that mood biases choice, we constructed mcM1 based on cM3 (the winning choice model). Based on our finding of the negative correlation between mood sensitivity to certain rewards and gambling rate in S<sup>+</sup>, we separated β<sub>Mood</sub> parameter into β<sub>Mood-CR</sub> and β<sub>Mood-GR</sub> (cmM2). Model comparison using BIC supported cM3 (Table S6), that is, without consideration of mood in choice modeling. This can be due to the lack of block design in our experimental design unlike e.g., Vinckier et al., (2018) and Eldar & Niv, (2015). Please see Supplementary Pages 3-4:

      (3) Additionally, there is a conceptual inconsistency between the choice and mood findings that partly results from the analytic strategy. The approach bias is implemented in choice as a categorical value-independent effect, whereas the mood responses always scale linearly with the magnitude of outcomes. One way to make the models more conceptually related would be to include a categorical value-independent mood response to choosing to gamble/not to gamble.

      We apologise for the unclear statement. The approach bias is implemented in choice as a continuous value-independent effect, ranging from -1 to 1.

      It was true that the mood responses always scale with the magnitude of outcomes, since mood ratings were request after the outcomes. Therefore, mood parameters and the approach bias were both continuous.

      We also attempted to integrate mood into choice modelling. See Response 2 for Reviewer 3 for details.

      (4) The manuscript requires editing to improve clarity and precision. The use of terms such as "mood" and "approach motivation" is often inaccurate or not sufficiently specific. There are also many grammatical errors throughout the text.

      Thank you for this important suggestion. We have now explained motivation and mood in the Introduction section and the computational modeling section. Please see our clarifications below:

      Pages 3-4:

      “A growing literature indeed shows that risky behavior can be far better explained after adding value-insensitive approach and avoidance components to prospect theory(18,19), that is by including a decision bias in favor of the highest gain (approach) and another decision bias against the lowest loss (avoidance), above and beyond options value difference. This class of models highlights the important role of value-insensitive motivational components in decision making in addition to risk attitude-driven valuation (e.g., loss/risk aversion)(20).”

      Page 5:

      “Although mood is thought to persist for hours, days, or even weeks(30-33), momentary mood, measured over the timescale in the laboratory setting, represents the accumulation of the impact of multiple events at the scale of minutes(30,32,34-38). Momentary mood external validity is demonstrated e.g., through its association with depression symptoms(37). Mood is different from emotions, which reflect immediate affective reactivity and is more transient (e.g., from surprise to fear)(31-33,39).”

      We have corrected grammatical errors throughout the manuscript.

      5) Claims of clinical relevance should be toned down, given that the findings are based on noisy parameter estimates whose clinical utility for the treatment of an individual patient is doubtful at best.

      Thank you for this comment. We agree that we did not evaluate the noise in our estimate e.g., by assessing the test-retest reliability on the task parameters, which is outside the scope of the study, and it is indeed possible that parameter estimate is somehow noisy. Therefore, we tone down the clinical relevance of our results. Please see our revision below:

      Page 32:

      “Next, we did not evaluate the noise in our estimate e.g., by assessing the test-retest reliability on the task parameters and it is indeed possible that parameter estimate is somehow noisy.”

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      (1) Title: I believe "aberrant mood dynamics" is both too general and overstating the results of this study, which did not measure mood dynamics longitudinally. "Aberrant" is also overly pathologizing. I would suggest sticking more directly to the results, for instance, "Insensitivity of momentary mood to non-risky rewards in adolescent suicidal patients".

      Thank you for this suggestion. We have now corrected it.

      (2) Abstract: in line 61, "Our study uncovers the cognitive and affective mechanisms" suggests that these are the only ones, and you uncovered them. Of course, there could be more mechanisms contributing to risk behavior in STB, so I would suggest removing the word "the" or adding "one of the".

      Thank you for this suggestion. We have now corrected it.

      (3) One major weakness of this study is that suicidal thoughts and behaviors were not assessed via a clinical instrument such as the Columbia Suicide Severity Rating Scale - this should be mentioned upfront.

      Thank you for this comment. According to medical records and information from family and friends by the researcher and psychiatrists, patients with suicidal thoughts and behaviors were categorized as suicidal group (S<sup>+</sup>), while patients without suicidal thoughts and behaviors were identified as control group (S<sup>-</sup>). Note that medical records and information were recorded from clinical interviews where the psychiatrists were vigilant for signs of suicidal ideation and inquired about suicidal-related thoughts and behaviors from both the patients and their families. Therefore, the current group operation was possibly comparable to Columbia Suicide Severity Rating Scale.

      (4) Table 1: female/male are sex, not gender (gender is man/woman/transgender/non-binary).

      Thank you for this suggestion. We have now corrected it.

      (5) Equation 1: It would be good to clarify what happens in gain-only or loss-only trials (the other value is then 0, but this can be clarified as it is not technically a loss or a gain).

      Thank you for this suggestion. We have now corrected it. Please see below for our revision:

      Page 12:

      “Please note that V<sub>gain</sub> is 0 in gain trials and V<sub>loss</sub> is 0 in loss trials.”

      (6) Figure 1E: The model prediction is not informative here. Given the linear regression model, there is no other option except that the mean prediction would overlap with the mean empirical measurement (unless the model was specified incorrectly). The same is true in Figure 2A.

      Thank you for this suggestion. We have now removed plots for model prediction.

      (7) Figure 1G: There was no analysis of the differences between groups in terms of earnings, given that the ANOVA was not significant. Still, if the claim is that risky behavior is sometimes suboptimal in this task, it would be good to show that there is a correlation between, say, symptoms of STB across groups and 1) risky behavior and 2) earnings.

      Thank you for this insightful comment. In the patient cohort, risky behavior (gambling rate)—but not earnings—predicted the current suicidal ideation score (BSI-C, β = 9.189, t = 2.004, p = 0.048; earnings, β = 0.001, t = 0.582, p = 0.562). The lack of association for earnings is consistent with the task design, in which there is no stable optimal policy and payouts are only a coarse proxy for decision quality. Future work in learning paradigms, where optimality is well defined, may be better suited to test earnings-based links to STB. We have clarified this point below:

      Page 32:

      “Second, although we assumed that increased risky behavior in STB was suboptimal, the current task was not suited to test this, given the task design of random feedback for gambling option. Future work in learning paradigms, where optimality is well defined, may be better suited to test earnings-based links to STB.”

      (8) Line 290: "beta_gain: -1-1" is unclear. I believe you meant beta_gain \in [-1,1].

      Thank you for this suggestion. We have now corrected it to make it clear.

      (9) The gain and loss biases are modeled as minimum and maximum probabilities for choosing the gamble. This is a legitimate choice for value-agnostic biases, but it is not the traditional choice (as far as I know). I wonder if the same results would hold with the more traditional formulation of the bias as an added constant to the utility of the gamble, i.e., p(gamble) = 1/(1+ exp(-mu(U_gamble + beta_gain - U_certain)). I believe in this case, you would also not have to specify different equations for positive or negative biases, or to limit the bias to the range of [-1,1] (indeed, the bias would be in reward-equivalent units).

      Thank you for this suggestion. The winning choice model we used here was consistent with previous literature (Rutledge et al., 2015 & 2016), which decomposed the decision process into risk-attitude-driven valuation (e.g., loss and risk aversion) and value-insensitive motivational components. These approach/avoidance parameters are a decision bias in favor of the highest gain (approach) and another decision bias against the lowest loss (avoidance), above and beyond options value difference.

      As suggested, we also compared the traditional bias choice model. Model comparison did not support this. Please see our revision below:

      Supplementary Page 4:

      “We also considered the traditional bias parameter (cM4), rather than approach/avoidance parameters. We limited the bias to the range of [-100, 100], which was in reward-equivalent units.

      However, model comparison did not support cM4 (Table S6).”

      (10) Also, for equations 5-8, it seems that 5-6 are identical to 7-8 except for the use of beta_gain versus beta_loss. You might want to consider simplifying by putting beta in the equations and specifying in the text that, depending on the trial type (loss or gain), the relevant beta is used.

      Thank you for this suggestion. We have now simplified it. Please see response to Reviewer 2, point 3.

      (11) It is not clear what equations are applied to mixed trials in cM3.

      Sorry for the confusion. We have now clarified this point.

      Page 12:

      “Approach/avoidance parameters are not applied to in mixed trials.”

      (12) Model comparison: the mood models are nested within each other (e.g., mM3 can be derived from mM1 by setting beta_EV = beta_RPE). In this case, model comparison can use the likelihood ratio test instead of BIC, which can be too conservative (and therefore does not support the extra beta parameter for RPE, different from previous results in the literature). I wonder if a likelihood ratio test would lead to results more in line with previous findings with this task?

      Thanks for this suggestion. We agree that mM1 (CR+EV+RPE) and mM3 (CR+GR) are nested. However, our model space also included unnested models, such as mM5 (CR+GR<sub>better</sub>+GR<sub>worse</sub>). Therefore, it was not reasonable in our model space to use likelihood ratio tests.

      (13) Line 346: The replication sample is described as "healthy participants," however, their health (or mental health) status was not assessed, and they may as well have mental health concerns. I would suggest calling this a general sample or an undifferentiated sample - but not a healthy sample.

      Sorry for the confusion. We have now corrected this phrase.

      (14) Line 363: "in addition to the replication of previous findings in the validation dataset" is unclear. Are those tests not two-tailed?

      Sorry for the unclear statement. In the replication analyses, we used one-tailed t-tests because the direction of the effect was revealed on the clinical dataset. Please see our clarification below:

      Page 15:

      “For the replication of previous findings in the validation dataset, we used one-tailed tests in line with our clinically motivated directional hypothesis.”

      (15) Line 372: "validating our group manipulation" - the presented work does not have a manipulation. Maybe you meant "validating our grouping of participants"?

      Thank you for this suggestion. We have now corrected it to make it clear.

      (16) Figure 2B: It is not clear how the data were binned for illustration purposes only, and why this binning is necessary (I have not seen it in other papers) - presenting the data from each subject and the correlation line with error margins (as is done here) should be sufficient.

      Thank you for flagging this. For illustration only, we binned the data proportional to group sizes: in the patient sample (S<sup>-</sup> n = 25; S<sup>+</sup> n = 58; ≈1:2), we displayed 3 bins for S<sup>-</sup> and 6 bins for S<sup>+</sup>. We agree that binning is not necessary; all statistics were computed on raw, unbinned data. The binned panel was included solely for visualization, consistent with our prior work (Blain et al., 2023).

      (17) Table 2: delta BIC should be presented per subject (that is, divided by the number of subjects in each group), as the groups are of different sizes, so as presented now, the columns are not comparable across groups.

      Thank you for the helpful suggestion. Our goal in Table 2 is not to compare ΔBIC magnitudes across groups, but to identify the winning model within each group. The ΔBICs are aggregated at the group level solely to rank models for that group. Dividing by the number of participants would rescale each group’s column by a constant and would therefore not affect the within-group ranking or the conclusion that cM3 is the best model in all groups. For this reason, we retain the current presentation and interpret each column within group rather than across groups.

      (18) Line 640 - the effect of expectations and prediction errors on mood was not only shown in healthy people, but also in people with depression (Rutledge et al., 2007, https://pubmed.ncbi.nlm.nih.gov/28678984/)

      Thank you for this comment. Indeed, Rutledge et al., (2017) showed evidence for CR+EV+RPE mood model in adult people with depression. However, our study recruited adolescents with depression or anxiety, given that adolescent period might provide a developmental window for opportunities for early intervention of suicidality. Therefore, it is also possible that the current winning model was specific to adolescents. Please see our clarifications below:

      Page 28:

      “It is also possible that the current winning model was specific to adolescents. Given that Rutledge et al., (2017) supported the “CR-EV-RPE model” in adults with depression, our study with adolescent populations may suggest a developmental change for mood sensitivities.”

      (19) Supplemental material: Is the R2 section about R-squared? Perhaps you can use superscript on the 2 to make that clearer? For Figure S2, how was model recovery determined? Should I interpret the confusion matrix as suggesting that the winning model for each and every simulated subject was the generating model, or was the winning model determined for the whole simulated population in each of the 100 simulations? Traditionally, confusion matrices use the former measure, but the results of 100% recoverability make me suspect the latter was used here. In Figure S3, should we not be looking at simulated parameters and recovered parameters? What are "real parameters" here?

      Thank you for these important comments. We now consistently denote the coefficient of determination as R<sup>2</sup> (with a superscript 2) throughout the manuscript and Supplementary Materials.

      For the model recovery analysis in Figure S2, we have clarified that the confusion matrix is computed at the population level. Specifically, for each of the 100 simulations we generated a full dataset under each candidate model, fit all models to that dataset, and selected the winning model based on group-level model evidence (BIC). Each cell in the confusion matrix therefore reflects the proportion of simulations in which model j was selected as the best-fitting model when the data were generated by model i. This operation was reasonable because the decision of the winning model is made on the population-level dataset rather than on individual subjects.

      In Figure S3, the term “real parameters” referred to the parameters used to generate the simulated data. To avoid confusion, we now relabel these as “simulated (generating) parameters” and explicitly describe the figure as showing the relationship between simulated (generating) parameters and recovered parameters. Please see our revisions below:

      Supplementary Pages 2-3:

      “Model recovery: We generated 100 simulated datasets for each model (3 choice models and 8 mood models) using the fitted parameters of each model as the ground truth. Each dataset contained 201 trials and included 3 (or 8) sets of simulated data corresponding to the respective models. For each simulated dataset, we then fit all models and determined the winning model at the population level based on group-level BIC, yielding a confusion matrix in which each entry represents the proportion of simulations in which model j was selected as the best-fitting model when the data were generated by model i. As shown in Figure S2, all models are highly identifiable, indicating excellent recovery performance for both the choice and mood models.”

      “Parameter recovery: Figure S3 shows good parameter recovery for both choice and mood winning model (choice: rs > 0.91, ps < 0.001; intraclass coefficients > 0.78; mood: rs > 0.90, ps < 0.001; intraclass coefficients > 0.86). Moreover, we computed cross-correlations between all generating (“generating”) and recovered (“fitted”) parameters. The resulting matrix showed high diagonal (choice winning model: rs > 0.91; mood winning model: rs > 0.90) and low off-diagonal (choice winning model: abs(rs) < 0.63; mood winning model: abs(rs) > 0.40) correlations, further supporting parameter recovery.”

      Typos:

      (1) Line 90: original → originate

      (2) Line 596-598 - the same phrase is repeated twice.

      (3) Line 616: on the other word → hand.

      Sorry for the mistakes. We have now corrected them throughout the manuscript.

      Reviewer #2 (Recommendations for the authors):

      For people unfamiliar with interpersonal theory or motivational-volitional model, or three-step theory (lines 105-106), could you briefly explain the key idea of mood and suicide before going to the decision-making tasks? And from this, maybe motivate the predictions in your task? In particular, in the abstract and introduction, the phrasing could be a bit more concise and simpler. In the abstract, sentences were sometimes quite long. In the introduction, some paragraphs are somewhat repetitive. In the discussion, there were some typos.

      Thank you for these suggestions. We have now explained the key idea of mood and suicide before going to the decision-making tasks in the introduction, which can be seen below:

      Pages 4-5:

      “Contemporary theories of suicide converge on the idea that STB is initially caused by low mood experience. The interpersonal theory of suicide proposes that suicidal desire arises when people simultaneously feel socially disconnected (“thwarted belongingness”) and like a burden on others (“perceived burdensomeness”), experiences that are tightly linked to chronically low mood(25). The motivational–volitional model(26) and the three-step theory(27,28) similarly emphasize that when negative mood and feelings of defeat or entrapment are experienced as inescapable, they can give rise to suicidal ideation, and that the progression from ideation to suicide attempts depends on additional factors such as reduced fear of death, increased pain tolerance, and a tendency to act impulsively under intense affect. Some official organizations, e.g., National Institute of Mental Health, have also listed mood problems as warning signals(8). Interestingly, within the framework of decision making under uncertainty, gambling on lotteries with a revealed outcome has been found to induce high mood variance(29), providing an opportunity to assess the relationship between deficient mood and increased gambling decisions in STB.”

      We have also refined the wording and corrected typos throughout the manuscript.

      Reviewer #3 (Recommendations for the authors):

      (1) Since many readers might only read the abstract, it is important that it is both informative and accurate. I have two suggestions in this respect. First, for the abstract to be more informative, it may be helpful to indicate already there that these are value-insensitive approach-avoidance parameters, in the sense that they favor/disfavor the gamble regardless of the potential outcomes' magnitude or probability. This issue is also present throughout the text, where the phrases "approach and avoidance motivation" are referred to as if they have established and precise computational definitions. In my view, these terms could just as easily be interpreted as parameters that multiply the value of potential gains or losses, which is not what the authors mean. It would be helpful to clarify this terminology.

      Thank you for these suggestions. In line with previous literature (Rutledge et al., 2015 & 2016), approach and avoidance motivation are indeed defined at the computational level, referring to a decision bias in favor of the highest gain (approach) and another decision bias against the lowest loss (avoidance), above and beyond options value difference. We have cited these papers in the manuscript. We also make it clear to further clarify approach and avoidance parameters in the abstract and introduction. Please see our revisions below:

      Page 2 (Abstract):

      “Using a prospect theory model enhanced with value-insensitive approach-avoidance parameters revealed that this rise in risky behavior resulted only from a heightened approach parameter in S<sup>+</sup>.Altogether, model-based choice data analysis indicated dysfunction in the approach system in S<sup>+</sup>, leading to greater propensity for gambling in the gain domain regardless of the lottery expected value.”

      Page 3 (Introduction):

      “A growing literature indeed shows that risky behavior can be far better explained after adding value-insensitive approach and avoidance components to prospect theory(18,19), that is by including a decision bias in favor of the highest gain (approach) and another decision bias against the lowest loss (avoidance), above and beyond options value difference. This class of models highlights the important role of value-insensitive motivational components in decision making in addition to risk attitude-driven valuation (e.g., loss/risk aversion)(20).”

      (2) The statement "our study uncovers the cognitive and affective mechanisms contributing to increased risk behavior in STB" is overstating the findings, as the study may have uncovered some contributing mechanisms, but likely not all of them. Removing the word "the" would fix this issue.

      Thank you for this suggestion. We have now corrected it.

      (3) Since mood is typically defined as lasting hours, it's inappropriate to refer to ratings that only reflect the last few trials as self-reports of mood. To be sure, I view the distinction between emotions and moods as quantitative, not qualitative, so I do not think there is a problem studying the former to understand the latter, but to avoid confusion, the terminology should follow common usage.

      Thank you for this suggestion. We follow previous work and operational definitions regarding mood (Rutledge et al., 2014, Eldar & Niv, 2015, Vinckier et al., 2018). Emotion is usually a very brief response to a specific stimulus (Emanuel & Eldar, 2023), e.g., leading to rapid changes like surprise then fear. In contrast, mood is defined as a diffuse state that is not specific to one stimulus. Here, we operationally and computationally define mood as an affective state reflecting the recent history of safe and gamble outcomes. We now clarify that point in the main text. Please see our revision below:

      Page 5:

      “Although mood is thought to persist for hours, days, or even weeks(30-33), momentary mood, measured over the timescale in the laboratory setting, represents the accumulation of the impact of multiple events at the scale of minutes(30,32,34-38). Momentary mood external validity is demonstrated e.g., through its association with depression symptoms(37). Mood is different from emotions, which reflect immediate affective reactivity and is more transient (e.g. from surprise to fear)(31-33,39).”

      (4) Line 78: The phrases "increase in risk attitude", "decrease in loss attitude", and "decrease in value-independent choice biases" are unclear to me in terms of their directionality. An attitude might be avoidant or embracing. If it is the former then increasing it would decrease risk-taking.

      Thank you for pointing out the ambiguity. We have now corrected them throughout the manuscript. Please see our revision below:

      Page 4:

      “We therefore hypothesized that heightened approach motivation, or weakened avoidance motivation, would account for increased risk behavior in STB.”

      (5) Line 125: I was not sure why one would expect the mood response to gamble-related quantities (EV and RPE) to be lower in STB and not higher.

      Sorry for the typo. We hypothesized that mood would respond more strongly to gambling-related quantities—expected value (EV) and reward prediction error (RPE)—in adolescents with STB than in controls, given prior evidence that STB is associated with greater risk-taking.

      (6) The text could use proofreading, as there are many typos. These are from the first 100 lines alone:

      a) Abstract: regardless the lotteries -> regardless of the lotteries'.

      b) Line 78: it remains whether.

      c) Line 80: can each -> each can.

      d) Line 90: may original from.

      Sorry for the mistakes. We have now corrected them throughout the manuscript.

      (7) The rationale for focusing on the S+ group for mood model comparison is incorrect. The purpose is to identify parameters that vary as a function of suicidality, and for that, the S- group is just as important.

      Thank you for this comment. We agree that the S<sup>-</sup> group is as important as the S<sup>+</sup> group. A direct comparison was complicated because the winning mood models differed (S<sup>+</sup>: mM3; S<sup>-</sup>: mM5; Table 3). To ensure comparability, we checked results from both model specifications (mM3 and mM5). The conclusions were convergent: mood sensitivity to certain rewards (CR) was lower in S<sup>+</sup> than in S<sup>-</sup> (see Fig. 3 for mM3 and Fig. S8 for mM5).

      (8) There appears to be a contradiction between the inclusion criteria, which include having experienced suicidal thoughts and behaviors, and the definition of the S- group as not having suicidality.

      Thank you for pointing out this mistake. The corrected version of inclusion criteria can be seen on Page 7:

      “Patients were included if they met the following criteria: 1) both the researcher and psychiatrists agreed on their group classification; 2) they had a current diagnosis of major depressive disorder (MDD; unipolar depression), generalized anxiety disorder (GAD), or bipolar disorder with depressive episodes (BD), confirmed by two experienced psychiatrists using the Structured Clinical Interview for DSM-IV-TR-Patient Edition (SCID-P, 2/2001 revision; see Supplementary Note 1 for details); 3) they were between 10 and 19 years of age; 4) they had no organic brain disorders, intellectual disability, or head trauma; 5) they had no history of substance abuse; 6) they had no experience of electroconvulsive therapy.”

      (9) It would be helpful to specify whether mood modeling was based on objective or subjective values, and why.

      Thank you for this helpful suggestion. We have now clarified whether mood modeling was based on objective or subjective values, and why. Specifically, we constructed two model families: one in which mood was driven by objective monetary outcomes (objective values) and one in which mood was driven by subjective values derived from each participant’s fitted choice model (subjective values). We then used the VBA_groupBMC function in the VBA toolbox to perform family-wise model comparison, with 8 candidate mood models within each family. Consistent with previous literature, the objective-value family provided a clearly superior fit to the data (exceedance probability, EP = 1.000). Based on this result and for parsimony, we report and interpret the mood modeling results from the objective-value family in the main text. We have clarified this point below:

      Supplement Pages 4-5:

      “Supplementary Note 9: Mood model comparison using subjective values.

      To identify whether mood modeling was based on objective or subjective values, we constructed two model families: one in which mood was driven by objective monetary outcomes (objective values) and one in which mood was driven by subjective values derived from each participant’s fitted choice model (subjective values). We then used the VBA_groupBMC function in the VBA toolbox (Daunizeau et al., 2014) to perform family-wise model comparison, with 8 candidate mood models within each family. Consistent with previous literature, the objective-value family provided a clearly superior fit to the data (exceedance probability, EP = 1.000).”

    1. Reviewer #1 (Public review):

      Summary

      From transcriptomic comparisons of adult mouse cochlear and vestibular hair cells, Xu et al. provide a broad and well-organized overview of differences across 4 established hair cell types (2 cochlear and 2 vestibular). They go on to demonstrate the power of such analyses to provide functional insights by focusing on the differentiated expression of ciliary genes, building to the hypothesis that kinociliary motility occurs in adult vestibular hair cells.

      Background

      Cilia are prominent in sensory receptors, including vertebrate photoreceptors, olfactory neurons and mechanosensitive hair cells of the inner ear and lateral line. Cilia can be motile or nonmotile depending on their axonemal structure: motile cilia require dynein and the inner 2 singlet microtubules of the 9+2 array. Primary cilia, present early in development, are considered to have sensory functions and to be nonmotile (Mill et al., Nature Rev Gen 2023).

      In hair cells, the kinocilium anchors and polarizes the mechanosensitive hair bundle of specialized microvilli. The kinocilium matures from the primary cilium of a newborn hair cell; behind it the bundle of mechanosensory microvilli rises in a descending staircase of rows. During maturation of the mammalian cochlea, all hair cells lose the kinocilium, though not the associated basal body. The consensus for many years has been that most vertebrate kinocilia, and especially mammalian kinocilia, are nonmotile, based largely on the lack of spontaneous motility in excised mammalian vestibular organs, but also on the impression that the rare examples of spontaneous beating motility even in non-mammalian hair cells are associated with deterioration of the preparation (Rüsch & Thurm 1990).

      Strengths

      In comparing RNA expression across the 4 major types of mouse hair cells - 2 cochlear and 2 vestibular - Xu et al. provide rich data sets for exploration of structure-function differences between these highly specialized cell types. The revised paper significantly improves the organization, interpretation and readability of the presentation of overall findings. smFISH and immuno-staining back up key RNA data, and comparisons are made with published data.

      The ciliary motility focus of the rest of the paper is creative and highly interesting. The authors curated the ciliary genes into types associated with different aspects of beating motility, and also investigated the expression of genes typical of primary cilia, which are considered to have sensory and cell signaling functions and to be nonmotile. Their data justify suggesting a role for kinociliary motility (or force generation) in adult mammalian vestibular hair cells, in opposition to a long-held assumption. The results should stimulate investigation of the implications for mechanosensitivity.

      Weaknesses

      Data

      Functional data on kinocilia motility: The technical difficulty in making such measurements in small mouse hair bundles led the authors to work with bullfrog crista bundles. Though not extensively studied here, the ciliary motility shown is convincing. Mouse hair bundle motions are also shown but the evidence connecting the data to kinociliary motion are more suggestive than convincing. But the authors are not dogmatic about these data, and it is reasonable to show them.

      Interpretation

      The authors take the view that kinociliary motility is likely to be normally present but is rare in their observations because conditions are not right. But while others have described some (rare) kinociliary motility in fish organs (Rusch & Thurm 1990), they interpreted its occurrence as a sign of pathology. Indeed, in this paper, it is not clear what role kinociliary motility would play in mature hair bundles. The authors have added a discussion of this question in the revision.

      An underlying rationale for the hypothesis that ciliary motility manifests in mammalian vestibular hair cells seems to rest on the presence of the necessary mRNA and its contrasting absence in cochlear hair cells. Another way to look at this difference could be that evolution acted on cochlear hair cells to shed kinocilia as one of many changes to improve mechanosensitivity at much higher sound frequencies. In vestibular hair cells, kinociliary motion might be useful to enhance mechanostimulation in the developing vestibule (as suggested in this revision) and not so active in maturity. Nevertheless, with their scholarly analysis of the expression of ciliary genes, the authors make a significant argument for further investigation of when and why hair cell kinocilia show active motility.

    2. Reviewer #2 (Public review):

      Summary:

      In this study the authors compared the transcriptomes of the various different types of hair cells contained in the sensory epithelia of the cochlea and vestibular organs of the mouse inner ear. The analysis of their transcriptomic data lead to novel insights into the potential function of the kinocilium.

      Strengths:

      The novel findings for the kinocilium gene expression along with the demonstration that some kinocilia demonstrate rhythmic beating as would be seen for known motile cilia is fascinating. It is possible that perhaps the kinocilium known to play a very important role in the orientation of the stereocilia, may have a gene expression pattern that is more like a primary cilium early in development and later in mature hair cells more like a motile cilium. Since the kinocilium is retained in vestibular hair cells it makes sense that it is playing a different role in these mature cells than its role in the cochlea.

      Another major strength of this study which cannot be overstated is that for the transcriptome analysis they are using mature mice. To date there is a lot of data from many labs for embryonic and neonatal hair cells but very little transcriptomic data on the mature hair cells. They do a nice job in presenting the differences in marker gene expression between the 4 hair cell types. This information is very useful to those labs studying regeneration or generation of hair cells from ES cell cultures. One of the biggest questions these labs confront is what type of hair cell develop in these systems. The more markers available the better. These data will also allow researchers in the field to compare developing hair cells with mature hair cell to see what genes are only required during development and not in later functioning hair cells.

      Comments on revision:

      I am satisfied with the revision, the authors made an effort to incorporate the changes requested.

    3. Author Response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Weaknesses:

      (1) Data:

      (a) The main weakness in the data is the lack of functional and anatomical data from mouse hair bundles. While the authors compensate in part for this difficulty with bullfrog crista bundles, those data are also fragmentary - one TEM and 2 exemplar videos. Much of the novelty of the EM depends on the different appearance of stretches of a single kinocilium - can we be sure of the absence of the central microtubule singlets at the ends?

      Our single-cell RNA-seq findings show that genes related to motile cilia are specifically expressed in vestibular hair cells. This has not been demonstrated before. We have also provided supporting evidence using electrophysiology and imaging from bullfrogs and mice. Although no ultrastructural images of mouse vestibular kinocilia were provided in our study, transmission electron micrograph of mouse vestibular kinocilia has been published (O’Donnell and Zheng, 2022). The mouse vestibular kinocilia have a “9+2” microtubule configuration with nine doublet microtubules surrounding two central singlet microtubules. This finding contrasts with a previous study, which demonstrated that the vestibular kinocilia from guinea pigs lack central singlet microtubules and inner dynein arms, whereas outer dynein arms and radial spokes are present (Kikuchi et al., 1989). The central pair of microtubules is absent at the end of the bullfrog saccular kinocilium (Fig. 7A). We would like to point out that the dual identity of primary and motile cilia is not just based on the TEM images. The kinocilium has long been considered a specialized cilium, and its role as a primary cilium during development has been demonstrated before (Moon et al., 2020; Shi et al., 2022).

      In most motile cilia, the central pair complex (CPC) does not originate directly from the basal body; instead, it begins a short distance above the transition zone, a feature that already illustrates variation in CPC assembly across systems (Lechtreck et al., 2013). The CPC can also show variation in its spatial extent: for example, in mammalian sperm axonemes, it can terminate before reaching the distal end of the axoneme (Fawcett and Ito, 1965). In addition, CPC orientation differs across organisms: in metazoans and Trypanosoma, the CPC is fixed relative to the outer doublets, whereas in Chlamydomonas and ciliates it twists within the axoneme (Lechtreck et al., 2013). Such variation has been described in multiple motile cilia and flagella and is therefore not unique to vestibular kinocilia. What appears more unusual in our data is the organization at the distal tip, where a distinct distal head is present, similar to cilia tip morphologies recently described in human islet cells (Polino et al., 2023). Although this feature is intriguing, we interpret it primarily as a structural signature rather than as evidence for a specialized motile adaptation, and we have moderated our interpretation accordingly in the revision.

      (b) While it was a good idea to compare ciliary motility expression in published P2 datasets for mouse cochlear and vestibular hair cells for comparison with the authors' adult hair cell data, the presentation is too superficial to assess (Figure 6C-E; text from line 336) - it is hard to see the basis for concluding that motility genes are specifically lower in P2 cochlear hair cells than vestibular hair cells. Visually, it is striking that CHCs have much darker bands for about 10 motility-related genes.

      While these genes (e.g., Dynll1, Dynll2, Dynlrb1, Cetn2, and Mdh1) appear more highly expressed in P2 cochlear hair cells, they are not uniquely associated with the axoneme. For example, Dynll1/2 and Dynlrb1 are components of the cytoplasmic dynein-1 complex (Pfister et al., 2006), Cetn2 has multiple basic cellular functions beyond cilia (e.g., centrosome organization, DNA repair), and Mdh1 encodes a cytosolic malate dehydrogenase involved in central metabolic pathways such as the citric acid cycle and malate–aspartate shuttle. This contrasts with axonemal dyneins, which are uniquely required for cilia motility. To avoid ambiguity, we have marked such cytoplasmic or multifunctional genes with red asterisks in both Fig. 5G and Fig. 6D in the revised manuscript.

      Our comparison showed that key genes for motile machinery are not detected in cochlear hair cells. For example, Dnah6 and Dnah5 are not expressed in the P2 cochlear hair cells. Dnah6 and Dnah5 encode axonemal dynein and are part of inner and outer dynein arms. Importantly, we did not detect the expression of CCDC39 and CCDC40 in kinocilia of P2 cochlear hair cells. Furthermore, axonemal CCDC39 and CCDC40, the molecular rulers that organize the axonemal structure in the 96-nm repeating interactome were not detected in cochlear hair cells. We have revised the text to emphasize key differences.

      (2) Interpretation:

      The authors take the view that kinociliary motility is likely to be normally present but is rare in their observations because the conditions are not right. But while others have described some (rare) kinociliary motility in fish organs (Rusch & Thurm 1990), they interpreted its occurrence as a sign of pathology. Indeed, in this paper, it is not clear, or even discussed, how kinociliary motility would help with mechanosensitivity in mature hair bundles. Rather, the presence of an autonomous rhythm would actively interfere with generating temporally faithful representations of the head motions that drive vestibular hair cells.

      Spontaneous flagella-like rhythmic beating of kinocilia in vestibular HCs in frogs and eels (Flock et al., 1977; Rüsch and Thurm, 1990) and in zebrafish early otic vesicle (Stooke-Vaughan et al., 2012; Wu et al., 2011) has been reported previously. Based on Rüsch and Thurm (1990), spontaneous kinocilia motility occurred under non-physiological conditions and was interpreted as a sign of cellular deterioration rather than a normal feature. We speculate that deterioration under non-physiological conditions may lead to the disruption of lateral links between the kinocilium and the stereociliary bundle, effectively unloading the kinocilium and allowing it to move more freely. Additionally, fluctuations in intracellular ATP levels may contribute, as ciliary motility is highly ATP-dependent; when ATP is depleted, beating ceases. Similar phenomena have been documented in respiratory epithelia, where ciliary activity can temporarily pause. Nevertheless, the fact that kinocilia can exhibit spontaneous motility under these conditions indicates that they possess the motile machinery necessary for such beating. Irrespective of the condition, cilia without the molecular machinery required for motility will not be able to move.

      We agree with the reviewer that, based on the present data, it is difficult to know the functional role of kinocilia and whether the presence of such autonomous rhythm would interfere with temporal fidelity. Spontaneous bundle motion, driven by the active process associated with mechanotransduction, was observed in bullfrog saccular hair cells (Benser et al., 1996; Martin et al., 2003). We have revised the discussion to clarify this important point of the reviewer. Specifically, we will emphasize that our observations of ciliary beating in the ex vivo conditions may not reflect its properties in the mature in vivo context, but rather a byproduct of motile machinery clearly present in the kinocilia. We speculate that this machinery in mature hair cells could operate in a more subtle mode—modulating the rigor state of dynein arms or related axonemal structures to influence kinociliary mechanics and, in turn, bundle stiffness in response to stimuli or signaling cues. Such a mechanism could either enhance sensitivity or introduce filtering properties, thereby contributing to the fine control of mechanosensory function without compromising temporal fidelity. Future studies using loss-of-function approach will be needed to reveal the unexplored role(s) of kinocilia for vestibular hair cells in vertebrates.

      We note that spontaneous activity exits throughout nervous system. It allows the nervous system to maintain baseline activity and interpret signals. Retinal cells are spontaneously active even in the dark and spiral ganglion neurons also fire spontaneously. Spontaneous hair bundle motion driven by mechanotransduction-related mechanism has been observed in bullfrog saccular hair cells. So, it is unlikely that spontaneous kinocilia beating would interfere with generating temporally faithful representations.

      Could kinociliary beating play other roles, possibly during development - for example, by interacting with forming accessory structures (but see Whitfield 2020) or by activating mechanosensitivity cell-autonomously, before mature stimulation mechanisms are in place? Then a latent capacity to beat in mature vestibular hair cells might be activated by stressful conditions, as speculated regarding persistent Piezo channels that are normally silent in mature cochlear hair cells but may reappear when TMC channel gating is broken (Beurg and Fettiplace 2017). While these are highly speculative thoughts, there is a need in the paper for more nuanced consideration of whether the observed motility is normal and what good it would do.

      We thank the reviewer for these excellent suggestions. We agree that kinociliary motility could plausibly serve roles during development, for example by guiding hair bundle formation or by contributing to early mechanosensitivity and spontaneous neural activity before mature stimulation mechanisms are established. It is also possible that the motility machinery represents a latent capacity in mature vestibular hair cells that could be reactivated under stress or pathological conditions. We have revised the Discussion to address these possibilities and to provide a more nuanced consideration of whether the observed motility is normal and what potential functions it might serve.

      Reviewer #2 (Public review):

      Summary:

      In this study, the authors compared the transcriptomes of the various types of hair cells contained in the sensory epithelia of the cochlea and vestibular organs of the mouse inner ear. The analysis of their transcriptomic data led to novel insights into the potential function of the kinocilium.

      Strengths:

      The novel findings for the kinocilium gene expression, along with the demonstration that some kinocilia demonstrate rhythmic beating as would be seen for known motile cilia, are fascinating. It is possible that perhaps the kinocilium, known to play a very important role in the orientation of the stereocilia, may have a gene expression pattern that is more like a primary cilium early in development and later in mature hair cells, more like a motile cilium. Since the kinocilium is retained in vestibular hair cells, it makes sense that it is playing a different role in these mature cells than its role in the cochlea.

      Another major strength of this study, which cannot be overstated, is that for the transcriptome analysis, they are using mature mice. To date, there is a lot of data from many labs for embryonic and neonatal hair cells, but very little transcriptomic data on the mature hair cells. They do a nice job in presenting the differences in marker gene expression between the 4 hair cell types. This information is very useful to those labs studying regeneration or generation of hair cells from ES cell cultures. One of the biggest questions these labs confront is what type of hair cells develop in these systems. The more markers available, the better. These data will also allow researchers in the field to compare developing hair cells with mature hair cells to see what genes are only required during development and not in later functioning hair cells.

      We would like to thank reviewer 2 for his/her comments and hope that the datasets provided in this manuscript will be a useful resource for researchers in the auditory and vestibular neuroscience community.

      Joint Recommendations for the authors:

      (1) Figure 1 - Explain how hair cell types are recognized after dissociation. Figure 1 will not be clear in this regard for non-aficionados. Some of the dissociated cells shown appear quite distorted and even unhealthy - e.g., the bottom right crista type II hair cell; the second from left crista type I hair cell; can you address why this doesn't matter for the purposes of this study?

      HC types in Fig. 1C were identified based on their morphological features: Type I HCs are flask-shaped with a narrow neck while type II HCs are cylindrical and short. We have replaced those cells with new images. In our study, HCs were identified based on their marker genes. Although some HCs such as those shown in Fig. 3C were impossible to avoid during preparation of single cells for library (most people did not examine their morphology), quality of mRNA and sequencing was high, better than those datasets published in previous studies.

      (2) Line 98 - Explain accessory cells (as opposed to supporting cells).

      We changed accessory cells to other cell types.

      (3) Line 246 - The primary cilium is...

      Changed.

      (4) Figure 6D - The scale bar is missing. Please use arrows to point to the genes you call out in the text. Also, the genes called out in the text as differently expressed (line 342) are quite faint bands in both cell types. It would be a service to the reader to point them out in the panel.

      A scale bar has been added. We also marked those genes as suggested and edited the text accordingly.

      (5) Figure 7 - mixes frog crista and mouse middle ear images with waveforms and FFTs from frog crista, mouse middle ear, and mouse crista. Related to these still images are 2 videos of frog kinocilium beating (2 hair cells). The mouse images must be underwhelming, or we would have been shown those, yet they were considered adequate to analyze.

      Yes, the spontaneous kinocilia motion of mouse crista HCs is very small. The peak motion is about 40 nm, which is very close to the resolution of our camera. That is why we used photodiode technique to detect its motion. Photodiode is more sensitive, and this technique allows us to observe dynamic response waveform.

      (6) I recommend labeling each figure panel with the tissue of origin to avoid confusion.

      Labeled as suggested.

      (7) I suggest dropping the mouse middle ear data, as they are not directly adequate as a positive control (or no more so than the more beautiful frog data).

      We keep the waveforms of middle ear cilia movement in Fig. 7. The main reason is that we would like to show the magnitude difference between airway cilia and kinocilia. The kinocilia movement was at least an order of magnitude less than the movement of airway cilia. This has led to our effort to generate a model to predict the 96-nm modular repeat and explain why kinocilia movement in mice is much smaller than airway cilia and bullfrog kinocilia.

      (8) Focus on the hair bundle motions:

      (a) Show the waveforms for the frog crista hair cells and their FFTs.

      These images were captured many years ago using camera. The kinocilia motion is between 5 and 10 Hz. We did not present any waveforms of kinocilia motion since we no longer have access to bullfrogs. However, although we did not present response waveforms, the videos are very powerful for visualization of kinocilia beat of bullfrog saccular HCs.

      (b) Find some way to show us how you measured the mouse hair bundle beating.

      Photodiode technique was used to measure spontaneous kinocilia motion in mice. More details are now included in the text.

      (c) Does EGTA break links between kinocilium and stereocilia? (Could that contribute to the higher beat frequency?) Just applying the same treatment and viewing from above could clarify whether kinocilia dissociate from stereocilia rows. This would likely be more straightforward with an otolith organ.

      All these links (tip links, side links) are vulnerable to Ca concentration and Ca-free medium is often used to break these links as shown in many previous studies. Breaking the kinocilia links leads to reduced load to the kinocilia, which may result in larger motion of the kinocilia. The frequency is inherent to motile machinery and subject to temperature and intracellular ATP concentration. When facing upward, the hair bundles in otolith organ do not have a good contrast against HCs in the background. This makes measurement of their motion difficult, especially when the motion is small and random and can’t be averaged to improve signal to noise ratio. Besides, unlike cochlear HCs whose hair bundles are short and can easily be oriented in parallel with light path, the long hair bundle of vestibular HCs is more difficult to orient and image. For these reasons, we chose to use crista hair bundles for our measurements since they can be oriented in perpendicular to the light path without interference from background HCs. The lateral motion of the entire bundle is also relatively easy to measure in this preparation.

      (6) Is there no reason to cite McInturff et al. (2018), given that they compared type I and II VHC transcriptomes at P12 and P100? This database is also available on gEAR.

      Their studies are now cited. We also compared their datasets with ours.

      (7) Line 374 - Eatock et al., 1998 citation does not work for this purpose. Eatock & Songer (2011) would be better, or Li, Xue, Peterson (2008): mouse utricle anatomy; significant discussion of relative heights of kinocilia and tallest stereocilia.

      Changed and cited.

      (8) In Figure 3, 2 of the 18 panels in B are missing labels.

      The bar, applied to all panels, was there at the bottom of Fig. 3B. The bar is bigger and more visible in the revision.

      (9) Line 187 should "Sppl1" be Spp1?

      Corrected.

      (10) Define BBSome on line 244.

      Added.

      (11) Looking at Figure 5, it seems that all the motile genes are expressed in the vestibular hair cells and not the cochlear hair cells. It is surprising that there are any cilia-related genes expressed in these adult cochlear hair cells, given that they do not retain their cilia into adulthood. Could the authors make a comment on this finding in the discussion? Also, are there any ciliopathies that show a vestibular defect but normal hearing in mice or humans? Have you compared the cilia-related gene expression in neonatal/embryonic vestibular hair cells to your dataset?

      There are many kinocilia related genes still expressing adult cochlear HCs. It is not surprising to see many kinocilia related genes in cochlear HCs. Most of these genes are related to primary cilia structure including the basal body and transporters in cilia. The basal body is still present in cochlear HCs. Many other primary cilia-related proteins are also expressed in soma, especially those related to signal transduction, microtubule cytoskeleton, actin cytoskeleton, vesicle transport, metabolic enzyme, protein folding, translation, nuclear transport, ubiquitination, RNA binding, mitochondrial proteins and transcription factors. Of course, some of them are vestigial. We added discussion of this in the text. Comparison between neonatal cochlear and vestibular was presented in Fig. 6D. We compared those genes related to the axonemal repeat (96 nm repeat complex). Due to quality of mRNA, the total genes and genes related to kinocilia detected in previous developmental studies were much less than our datasets. While we detected 112 out of 128 genes related to axonemal repeat, only 90 genes were detected in previous studies (Burns et al., 2015; McInturff et al., 2018). Therefore, we only compared neonatal cochlear and vestibular HCs using their datasets. As far as we know, no ciliopathies with vestibular defects but normal hearing have been reported in mice or humans. But we plan to use a Ccdc39 mutant mouse model to examine how loss of function of a key motile cilia signature gene would affect kinocilia motility and vestibular function.

      (12) How is "expression level" in the violin plots being calculated? Is this a measure of read count? The normalization is cursorily explained in the methods. Is this value comparable across genes? Did the authors switch to z-score by Figure 6?

      We dissected the auditory and vestibular sensory epithelia from the same groups of mice and prepared libraries and sequenced them at the same time. All parameters are the same. The violin Plots are based on values presented in Supplementary Table 1. Each dot in the plot reflects an aggregated number of reads across all cells for each gene. They are all normalized across different HC types and biological repeats. The details for normalization are now provided.

      (13) The authors comment on the 16/128 motile cilia axonemal repeat genes that are not expressed in the vestibular hair cells. Listing these somewhere may be helpful to the readers.

      We thank the reviewer for this helpful suggestion. Most of the 128 motile cilia axonemal repeat genes were listed in Figs 8C and S5, along with known loss-of-function mutations and ciliopathy associations identified in human diseases or observed in animal models. To improve clarity, we have now included Table S2, which provides the complete list of all 128 motile cilia axonemal repeat genes, including those not expressed in vestibular HCs.

      (14) Figure 5D needs some refinement. While the authors used databases, including CiliaCarta, SYSCILIA gold standard, and CilioGenics, to identify the primary cilia-related genes, they have included many genes that are not highly specific to primary cilia function (e.g., HSP90, HSPA8, DNAJA4, GNAS...). Perhaps the authors would be able to do a better job of specifically querying primary cilia function by using genes that are common to these three databases.

      We presented comparison and analysis based on three major cilia databases, which are generated from proteomics of cilia from different tissues/organisms. In addition, we have provided more comprehensive list of primary cilia-related genes in Fig. S2. While majority of cilia-related genes/proteins are highly conserved, some genes/proteins are tissue-/organism-specific. Majority of the genes presented in Fig. 5D of our manuscript are shared among all three databases. The cilium is a complex structure, composed of proteins for microtubule cytoskeleton, actin cytoskeleton, vesicle transport, metabolic enzyme, signaling, and protein folding. It also contains proteins for translation, nuclear transport, ubiquitination, RNA binding as well as mitochondrial proteins and transcription factors (https://ciliogenics.com/?page=Home). Proteins such as HSP90 and HSPA8 are important for protein folding. HSPA8 also functions as an ATPase in the disassembly of clathrin-coated vesicles during transport of membrane components through the cell. GNAS is part of a G protein complex that transmits signals. DNAJA4 is one of the high-confidence cilia proteins (mean score of 1.26, expression rank is 938). These proteins are detected in cilia according to CilioGenics (https://ciliogenics.com/?page=Home). These proteins are not highly specific to cilia and are expressed in soma as well. Most of these proteins for signaling such as WNT (Supplementary Fig. 2) are detected in both cilia and soma.

      (15) The authors state, "Furthermore, we observed robust spontaneous kinocilia motility in bullfrog crista HCs and small spontaneous bundle motion in mouse crista HCs." This statement should be moderated by acknowledging that this motility was observed in only some cells. The authors favor the hypothesis that the lack of motility in some crista HCs is due to depolarization or damage to the sample. The authors should also acknowledge the possibility that there may be cell-to-cell variability in the motility of the kinocilia.

      We address these issues in public review section. We modified the statement as suggested.

      (16) The first few pages of the Results section include many lists of genes. Readability may be improved if this is curtailed modestly.

      Changed as suggested. We removed comparison among different types of HCs and replotted Fig. 2B. This has reduced the number of genes mentioned in the text.

    1. Author Response:

      The following is the authors’ response to the previous reviews

      Public Reviews:

      Reviewer #1 (Public review):

      In this revised manuscript, Qin and colleagues aim to delineate a neural mechanism that is engaged specifically in the sated flies to suppress the intake of sugar solution (the "brake" mechanism for sugar consumption). They identified a three-step neuropeptidergic system that downregulates the sensitivity of sweet-sensing gustatory sensory neurons in sated flies. First, neurons that release a neuropeptide Hugin (which is an insect homolog of vertebrate Neuromedin U (NMU)) are in active state when the concentration of glucose is high. This activation depends on the cell-autonomous function of Hugin-releasing neurons that sense hemolymph glucose levels directly. Next, the Hugin neuropeptides activate Allatostatin A (AstA)-releasing neurons via one of Hugin receptors, PK2-R1. Finally, the released AstA neuropeptide suppresses sugar response in sugar-sensing Gr5a-expressing gustatory sensory neurons through AstA-R1 receptor. Suppression of sugar response in Gr5a-expressing neurons reduces fly's sugar intake motivation. They also found that NMU-expressing neurons in the ventromedial hypothalamus (VMH) of mice (which project to the rostal nucleus of the solitary tract (rNST)) are also activated by high concentrations of glucose independent of synaptic transmission, and that injection of NMU reduces the glucose-induced activity in the downstream of NMU-expressing neurons in rNST. These data suggest that the function of Hugin neuropeptide in the fly is analogous to the function of NMU in the mouse.

      The shift of the narrative, which focuses specifically on the hugin-AstA axis as the "brake" on the satiety signal and feeding behavior, clarified the central message of the presented work. The authors have provided multiple lines of compelling evidence generated through rigorous experiments. The parallel study in mice adds a unique comparative perspective that makes the paper interesting to a wide range of readers.

      While I deeply appreciate the authors' efforts to substantially restructure the manuscript, I have a few suggestions for further improvements. First, there remains room for discussion whether the "brake" function of the hugin-AstA axis is truly satiety state-dependent. The fact that neural activation (Fig. Supp. 8), peptide injection (Fig. 3A, 4A), receptor knockdown (Fig. 3C,G, 4E), and receptor mutants (Fig. Supp. 10, 12) all robustly modulate PER irrespective of the feeding status suggests that the hugin-AstA axis influences feeding behaviors both in sated and hungry flies. Additionally, their new data (Fig. Supp. 13B, C) now shows that synaptic transmission from hugin-releasing neurons is necessary for completely suppressing feeding even in sated flies. If the hugin-AstA axis engages specifically in sated (high glucose) state, disruption of this neuromodulatory system is expected to have relatively little effect in starved flies (in which the "brake" is already disengaged).

      We thank the reviewer for pointing out this inconsistency. We have corrected this interpretation. Specifically:

      (1) We removed statements suggesting that the circuit is fully disengaged during starvation.

      (2) We now state that endogenous hugin activity is reduced during starvation, but the circuit retains modulatory capacity when experimentally perturbed.

      (3) The Discussion now emphasizes that the system operates as a state-modulated inhibitory tone rather than a strictly fed-state switch.

      We believe this revised framing resolves the discrepancy.

      In this context, it is intriguing that the knockdown of PK2-R2 hugin receptor modestly but consistently decreases proboscis extension reflex specifically in starved flies (Fig. 3D, H). The manuscript does not discuss this interesting phenotype at all. Given the heterogeneity of hugin-releasing neurons (Fig. Supp. 7), there remains a possibility that a subset of hugin-releasing neurons and/or downstream neurons can provide a complementary (or even opposing) effect on the feeding behavior.

      We agree that this is an important observation. Although the effect size is modest, it is reproducible and suggests that hugin signaling may not operate as a strictly linear pathway.

      To address this:

      (1) We added a paragraph in the Results acknowledging the PK2-R2-dependent phenotype.

      (2) We included a discussion noting the potential functional heterogeneity of hugin neurons.

      (3) The schematic model (now Figure Supplementary 17, previously Figure Supplementary 16) includes a dashed line indicating a possible parallel PK2-R2-dependent branch.

      Given these intriguing yet unresolved issues, it is important to acknowledge that whether this system is "selectively engaged in fed states to dampen sweet sensation (in Discussion)" requires further functional investigations. Consistent effects of manipulation of the hugin-AstA system across multiple experimental approaches underscores the importance of this molecular circuitry axis for controlling feeding behaviors. Moderation of conclusions to accommodate alternative interpretation of data will be beneficial for field to determine the precise mechanism that controls feeding behaviors in future studies.

      We fully agree with the reviewer. Our original description of the circuit as a “satiety brake” implied exclusive engagement in fed states, which is not strictly supported by the behavioral data. Although endogenous hugin activity is elevated under fed conditions (as shown by CaMPARI), experimental manipulations demonstrate that the circuit retains functional capacity to modulate feeding behavior across feeding states.

      To address this concern, we have:

      (1) Removed the term “satiety-specific brake” throughout the manuscript.

      (2) Reframed the circuit as a glucose-responsive, state-modulated inhibitory module.

      (3) Revised the Discussion to explicitly state that the hugin–AstA pathway biases sweet sensitivity according to circulating glucose levels rather than functioning as an on/off switch.

      (4) Substantially revised Supplementary Figure 17 to reflect graded modulation across metabolic states rather than binary state engagement.

      These changes better align our conclusions with the experimental observations.

      Reviewer #2 (Public review):

      Summary:

      The question of how caloric and taste information interact and consolidate remains both active and highly relevant to human health and cognition. The authors of this work sought to understand how nutrient sensing of glucose modulates sweet sensation. They found that glucose intake activates hugin signaling to AstA neurons to suppress feeding, which contributes to our mechanistic understanding of nutrient sensation. They did this by leveraging the genetic tools of Drosophila to carry out nuanced experimental manipulations, and confirmed the conservation of their main mechanism in a mammalian model. This work builds on previous studies examining sugar taste and caloric sensing, enhancing the resolution of our understanding.

      Strengths:

      Fully discovering neural circuits that connect body state with perception remains central to understanding homeostasis and behavior. This study expands our understanding of sugar sensing, providing mechanistic evidence for a hugin/AstA circuit that is responsive to sugar intake and suppresses feeding. In addition to effectively leveraging the genetic tools of Drosophila, this study further extends their findings into a mammalian model with the discovery that NMU neural signaling is also responsive to sugar intake.

      Weaknesses:

      The effect of Glut1 knockdown on PER in hugin neurons is modest in both fed and starved flies, suggesting that glucose intake through Glut1 may only be part of the mechanism.

      We agree that the modest PER phenotype suggests that Glut1-mediated glucose uptake represents one component of glucose sensing in hugin neurons. We have clarified this in the Discussion and now explicitly state that additional glucose-sensing mechanisms may contribute to hugin activation.

      Additionally, many of the manipulations testing the "brake" circuitry throughout the study show similar effects in both fed and starved flies. This suggests that the focus of the discussion and Supplemental Figure 16 on a satiety-specific "brake" mechanism may not be fully supported by the data.

      We fully agree that the previous framing overstated state specificity.

      As described above, we have:

      (1) Removed “satiety-specific brake” terminology.

      (2) Reframed the circuit as a glucose-responsive inhibitory module.

      (3) Revised the Discussion to explicitly acknowledge modulation across feeding states.

      (4) Updated the schematic model (Figure Supplementary 17, formerly Figure Supplementary 16) accordingly.

      Recommendations for the authors:

      Reviewing Editor (Recommendations for the authors):

      Both the reviewers and I agree that the conclusion about a "satiety-dependent" brake needs to be modified to discuss the phenotypes that are also observed under starved conditions. Reviewer 1 would further like to emphasize that the authors are not required to follow through with the specific recommendations suggested by them. Modifying the conclusion and Supplementary Figure 16 should suffice.

      We sincerely thank the Reviewing Editor for the clear guidance. We fully agree that our previous framing of the hugin–AstA circuit as a strictly “satiety-dependent” brake may have overstated the state specificity of the system.

      In response to this recommendation, we have:

      (1) Revised the Abstract, Results, and Discussion to moderate the conclusion and explicitly acknowledge the phenotypes observed under starved conditions.

      (2) Reframed the circuit as a glucose-responsive, state-modulated inhibitory module, rather than a satiety-exclusive brake.

      (3) Supplementary Figure 17 (formerly Figure Supplementary 16) has been substantially revised to illustrate graded modulation across metabolic states rather than binary engagement.

      We appreciate the clarification that no additional experiments were required and are grateful for the opportunity to improve the conceptual framing of our work.

      Please include full statistical reporting in the main manuscript (e.g., figure legends or results).

      We have revised all figure legends to include full statistical reporting.

      Reviewer #1 (Recommendations for the authors):

      By re-framing their finding as the "brake" mechanism on satiety-induced suppression of feeding behavior and sensitivity to sweet taste, the authors substantially improved the clarity of their findings and their significance. The additional data (Fig. Supp. 13B, C) allows "apple-to-apple" comparisons of behavioral data. I support the publication of this manuscript with no further experiments, although I have several suggestions for the text.

      As I write in the public review, I have a reservation on the authors' argument that hugin-AstA system is the "'satiety brake' - that is selectively engaged in fed states to dampen sweet sensation (lines 392-394)". Manipulation of both hugin system (Fig. 2C, Fig. 3A, C, D, G, Fig. Supp. 8A, C, Fig. Supp. 10A-C, Fig. Supp. 13B, C) and AstA system (Fig. 4A, E, Fig. Supp., 8C, D, Fig. Supp. 12A-C, Fig. Supp. 13D) all indicate that hugin-AstA system suppresses feeding regardless of the satiety state. Specifically, Fig. Supp. 13B shows that synaptic blockade does further increases PER, causing contradictions to authors' statements ("silencing hugin+ neurons led to enhanced sweet-driven feeding behavior (line 299-300)" and "...further silencing has little additional effect (line 402)"). The CaMPARI data (Fig. 1J) provides the link between the activity levels of hugin-releasing neurons and satiety state. However, the fact that eliminating hugin-AstA signal can promote further PER in starved flies suggests that this brake is not completely satiety-dependent. I ask authors to at least discuss this perceived discrepancy between their data and conclusions.

      Also, the authors' finding that PK2-R2 reduction actually suppresses PER specifically among starved flies (Fig. 3D, H), albeit with relatively small effect size, suggests that hugin-AstA axis is not a singular, linear pathway as authors suggest in Fig. Supp. 16. While delineating the PK2-R2-dependent pathway is beyond the scope of this study, at least a line of discussion would be helpful.

      Minor comments:

      (1) Fig. Supp. 8 (dTRPA1 activation of hugin and AstA neurons), and Fig. Supp. 13B-D (inhibition of hugin and AstA neurons) should be in the main figure given its relevance to the narrative of this manuscript.

      We agree with the reviewer regarding their importance. The key behavioral panels from these figures have now been moved to the main figures to strengthen the narrative flow.

      (2) Fig. Supp. 11 (PER and imaging using decapitated heads only), despite its creativity, leaves me wonder how PER of fly heads looks like. It is a highly artificial and invasive experiment. Supplementary movies would be helpful.

      We apologize for the lack of clarity in our description. In this experiment, flies were not decapitated. Instead, we surgically severed the connection between the brain and the ventral nerve cord (VNC), while keeping the body and proboscis musculature intact. Thus, the flies remained physically intact, and PER was measured using the same behavioral protocol as in intact animals.

      We have revised the figure legend to clarify this point and avoid confusion. Because the behavioral procedure was identical to standard PER assays and the flies retained normal proboscis motor function, we did not include supplementary videos.

      (3) Expression patterns of PK2-R1 and AstA-R2 in proboscis are mentioned in text but with no data (lines 229 and 279). I strongly encourage authors to show images.

      We have now included the relevant expression images in the revised manuscript.

      (4) A citation for the "previous study (line 486)" describing PER method is required.

      The appropriate citation has been added.

    1. Author Response:

      The following is the authors’ response to the previous reviews

      Public Review:

      We thank the editor and reviewers for their thoughtful and constructive feedback, which has enabled us to greatly strengthen the manuscript. We apologize for the delay in resubmitting this as we were dealing with a large turnover in the lab due to trainee graduations which has We have carefully revised the text, figures, and supplementary materials in response to these comments. Below, we summarize the key revisions made followed by a point-by-point response to the reviewers’ critiques.

      (1) Performed CUTS analyses in human neuronal system: In the revised manuscript, we included new data demonstrating that the CUTS system can be applied to additional cellular models, specifically neuronal cells (Figure 5, Figure S4). To address whether CUTS functions effectively in neuronal contexts, we generated stable CUTS-expressing lines in differentiated BE(2)-C and ReN VM–derived differentiated neurons (Figure 5A-D, Figure S4 A-C). To ensure this was neuronal expression, we developed a new Tet-On3G system construct where the Tet-On3G transactivating protein is driven by the SYN1 promoter to ensure neuron-specific inducible expression for these experiments.

      (2) Define the relationship between CUTS and endogenous/physiological cryptic exons inclusion: To evaluate how well the CUTS system reflects physiological cryptic exon regulation, we performed RT-PCR analysis of several cryptic exons previously reported by us and evaluated CUTS activation at the RNA level in parallel (Figure S2E) . CUTS is sensitive to low-mild reductions in TDP-43 levels, whereas the tested endogenous cryptic exons exhibit variable responses to TDP-43 knockdown.

      (3) Defining stress-induced TDP-43 loss of function: We included new data demonstrating that the CUTS system can detect TDP-43 loss of function induced by acute sodium arsenite (NaAsO₂) treatment in HEK cells (Figure 3D–I). We have also tested additional stressor as part of a separate ongoing study where this work will be expanded upon (Xie et al., 2025). We selected this paradigm since TDP-43 loss of function in response to acute NaAsO₂ treatment is also supported by work from other labs(Huang et al., 2024).

      (4) Implications of using a TDP-43 Loss-of-Function sensor for therapeutic applications: In the revised manuscript, we clarify that CUTS-TDP43 is auto-regulated and we highlight two potential therapeutic applications: i) TDP-43 Knockdown-and-replacement: CUTS-TDP43 provides a strategy for simultaneous depletion of pathological TDP-43 species while enabling autoregulated re-expression of wild-type TDP-43. This design mitigates the risk of supraphysiologic overexpression, a known liability in conventional replacement approaches, by restoring TDP-43 within a self-limiting regulatory network that maintains homeostatic control. ii) Aggregation-independent correction: Because CUTS is autoregulatory, it can be repurposed to regulate alternative downstream effectors, including splicing modifiers or TDP-43 functional interactors, without expressing TDP-43 itself. This approach provides a potential aggregation-independent strategy to compensate for TDP-43 loss-of-function (LOF) by restoring downstream splicing. We are evaluating this work in a follow up study (Xie et al., 2025). In these ongoing studies, we show that CUTS-regulated expression of splicing proteins in response to TDP-43 loss restored subsets of cryptic exon events (24/28 events evaluated). These findings suggest CUTS as a versatile tool for both autoregulated TDP-43 replacement and trans-regulatory therapeutic correction. We expanded on this concept in the discussion section of this revised manuscript. We also note that autoregulatory TDP-43 biosensor strategies have been proposed in related systems, including TDP-Reg, underscoring broader interest in self-regulated TDP-43 systems (Wilkins et al., 2024).

      (5) Clarified mechanism of TDP-43 5FL causing strong loss of function: The TDP-43 5FL exhibits reduced RNA binding capacity, and we previously showed that the lack of RNA binding promotes aberrant homotypic phase separation of TDP-43 (Mann et al., 2019). Expression of RNA-deficient TDP-43 variant forms nuclear “anisomes” (Yu et al., 2021), which evidence suggests sequesters endogenous TDP-43 protein into insoluble structures. We expanded on this in our results section in this revised manuscript.

      (6) Improved figure clarity and data presentation: To enhance clarity and organization, we maintained the main structure of the manuscript while reorganizing figures and improved data visualization. Some examples include:

      Figure 1: We revised the schematic layout for greater clarity and simplicity. The figure now focuses more specifically on the CUTS data, with additional data on the UNC13A-TS and CFTR-TS moved to Figure S1. To improve readability, titles were added to all schematic panels. Visual consistency was also improved by refining the color labelling for each sensor in Figures 1C and 1D and adjusting the corresponding bar graphs accordingly.

      Figure 2: We reorganized the figure to clearly distinguish between protein and mRNA analyses for greater clarity. In the revised layout, western blot quantifications of TDP-43 and CUTS (GFP) signals are shown in Figures 2D and 2E, respectively, while the corresponding qPCR analyses are presented in Figures 2H and 2I. Minor edits include removing the percentage knockdown and fold-change annotations from the graphs and incorporating these values into a mini-table in Figure S2E.

      The original Figure 2D and 2G were reincorportated as reference panels in Figure S2A–B, while new graphs showing CUTS protein-level changes as a function of TDP-43 knockdown were added (Figure S2C–D). We also incorporated new data showing the behavior of endogenous cryptic exons under low siTDP-43 treatment (Figure S2E).

      Figure 3: We added new data demonstrating that the application of the CUTS system in detecting TDP-43 loss of function induced by stress conditions. Specifically, we show that sodium arsenite (NaAsO₂) treatment leads to TDP-43 functional impairment detectable by CUTS and supported with endogenous cryptic exon via RT-PCR (Figure 3D-I).

      Figure 5 and Figure S4: We introduced a new figure that demonstrates the effective application of the CUTS system in differentiated neuronal systems, thereby extending its usability to disease-relevant cell types.

      Figures 2SA and 4B were edited to include the corresponding labels on the sides of each image for clarity. Sup Figure 2A was moved to Sup Figure 3A, while Figure 4B remains in its original configuration.

      We thank the reviewers again for their insightful critiques and helpful suggestions, which have enabled us to substantially improve the manuscript. Please find our detailed response to each review below:

      Reviewer #1 (Public review):

      Summary:

      The authors create an elegant sensor for TDP -43 loss of function based on cryptic splicing of CFTR and UNC13A. The usefulness of this sensor primarily lies in its use in eventual high throughput screening and eventual in vivo models. The TDP-43 loss of function sensor was also used to express TDP-43 upon reduction of its levels.

      Strengths:

      The validation is convincing, the sensor was tested in models of TDP-43 loss of function, knockdown and models of TDP-43 mislocalization and aggregation. The sensor is susceptible to a minimal decrease of TDP-43 and can be used at the protein level unlike most of the tests currently employed,

      Weaknesses:

      Although the LOF sensor described in this study may be a primary readout for high-throughput screens, ALS/TDP-43 models typically employ primary readouts such as protein aggregation or mislocalization. The information in the two following points would assist users in making informed choices.

      (1) Testing the sensor in other cell lines

      We thank the reviewer for raising this important point. In agreement with this suggestion, we generated ReN VM cell lines and used a neuroblastoma cell line model (BE(2)-C) expressing the TetOn3G CUTS system under a human synapsin I (hSYN1) promoter. In this construct the transactivator protein is under the control of a neuronal specific hSYN1 promoter whereas the classical TetOn3G system uses a CMV-like promoter. Several studies have reported reduced activity or silencing of CMV and PGK-driven transgenes in neurons. Therefore, we for our neuronal experiments, we removed this promoter to generate a new version of a doxycycline-inducible CUTS system in which Tet-On 3G transactivator is now driven by the hSYN1 promoter which will express CUTS in response to doxycycline treatment. In this improved construct, we also replaced mCherry with mScarlet to enhance the fluorescent signal.

      To test this neuronal-adapted system, we established stable CUTS expression in undifferentiated BE(2)-C cells, a subclone of the SK-N-BE(2) neuroblastoma line that has been used to study TDP-43–dependent splicing function(Brown et al., 2022). This model can be differentiated into neuron-like cells within 10 days, as shown in Supplementary Figure 4A. Using this model, we confirmed that TDP-43 knockdown leads to robust activation of the CUTS system (Figure 5B-E). We additionally tested this in in a stable polyclonal ReN VM cells following differentiation into cortical-like neurons (Figure 5D, Figure S4B-C).

      (2) Establishing a correlation between the sensor's readout and the loss of function (LOF) in the physiological genes would be useful given that the LOF sensor is a hybrid structure and doesn't represent any physiological gene. It would be beneficial to determine if a minor decrease (e.g., 2%) in TDP-43 levels is physiologically significant for a subset of exons whose splicing is controlled by TDP43.

      We agree with the reviewer that correlating the sensor’s readout with physiological TDP-43 splicing targets is essential to validate its biological relevance. To this end, we complemented our sensor expression profile with endogenous cryptic exons (CEs) sensitive to TDP-43 depletion. We tested a panel of five physiological cryptic exons regulated by TDP-43 (LRP8, EPB41L4A, ARHGAP32, HDGFL2, and ACBD3). To address the reviewer’s concerned, we performed RT-PCR on samples from the low-dose siTDP-43 experiment shown in Figure S2E.

      The endogenous CEs used in the panel were selected based on our own and others’ preliminary observations. Among these, HDGFL2 showed a particularly robust increase in cryptic exon inclusion at very low siTDP-43 concentrations (38 pM), while untreated samples showed almost no CE inclusion. This finding strongly supports a direct mechanism linking mild TDP-43 reduction to loss of physiological splicing control.

      (3) Considering that most TDP-LOF pathologically occurs due to aggregation and or mislocalization, and in most cases the endogenous TDP-43 gene is functional but the protein becomes non-functional, the use of the loss of function sensor as a switch to produce TDP-43 and its eventual use as gene therapy would have to contend with the fact that the protein produced may also become nonfunctional. This would eventually be easy to test in one of the aggregation modes that were used to test the sensor.. However, as the authors suggest, this is a very interesting system to deliver other genetic modifiers of TDP-43 proteinopathy in a regulated fashion and timely fashion.

      We thank the reviewer for this thoughtful point and agree that in the disease-relevant context where endogenous TDP-43 is intact but TDP-43 function is lost due to mislocalization and/or aggregation, a re-supply of TDP-43 risks sequestration and loss of activity. In our manuscript, the CUTS-TDP43 module was presented as a control circuit proof-of-concept rather than a stand-alone approach: it demonstrates that CUTS can (i) sense LOF with high dynamic range and proportionality, and (ii) drive a payload under negative feedback such that total TDP-43 remains near baseline while partially rescuing a splicing readout (CFTR minigene) under knockdown conditions.

      Importantly, we evaluated CUTS in aggregation/mislocalization-prone contexts: ΔNLS, 5FL, and ΔNLS+5FL variants trigger CUTS activation (ref), allowing us to quantify LOF arising from these aggregation modes. This confirms that CUTS can operate precisely in the very settings where sequestration is likely to occur.

      To directly address the reviewer’s suggestion, in the revision we (i) clarify in the Discussion that CUTS-TDP43 is a circuit demonstration and not our proposed monotherapy in aggregation-dominant disease; and (ii) expand our therapeutic framing into two approaches:

      Knockdown-and-replacement: concurrently deplete aggregation-prone/endogenous pathologic TDP-43 species (i.e., mutant TDP-43) while using CUTS to re-deliver wild-type TDP-43 under autoregulation. Aggregation-independent correction: use of CUTS to deliver modifiers that bypass TDP-43 sequestration (e.g., downstream effectors or splicing correctors that restore LOF consequences without expressing TDP-43 itself).

      (4) I don't think the quantity of siRNA is directly proportional to the degree of TDP-43 knockdown/extent of TDP-43 loss. Therefore, to enhance the utility of the dose-response curves, I'd suggest using TDP-43 levels as the variable on the x-axis, rather than the amount of siRNA administered or even just adding a plot alongside the current plots would enable readers to quickly evaluate LOF response levels concerning the protein. While I understand that the sensitivity of Western blots for quantification might be why the authors have not created the graphs in this manner, having this information would be useful.

      We appreciate the reviewer’s insightful comment. As noted, in the original version of the graph, we incorporated the percentage of TDP-43 knockdown corresponding to each siTDP-43 concentration (indicated in red text). However, we agree that this format was not easy to interpret, given the amount of information presented. To address this, we generated two new plots in which the x-axis represents TDP-43 levels (percentage of remaining protein or mRNA), and the y-axis shows the fold change in CUTS signal measured by (i) TDP-43 protein pixel intensity and (ii) TDP-43 mRNA levels, respectively. These new plots are now included as Supplementary Figures 2C–D, which allow a clearer visualization of CUTS readout in relation to actual TDP-43 levels rather than siRNA dose. As the reviewer anticipated, the reason we did not originally present the data in this format was that at low siTDP-43 concentrations, the fold change is minimal and more difficult to quantify by Western blot. Nevertheless, we have now incorporated the revised plots to strengthen the interpretation of the dose–response relationship. Additionally, we experience batch effects across siRNA lots. We believe this revised format should enhance the clarity of the result.

      (5) p3 line 74: one of the reasons cited as a pitfall of using the endogenous cryptic exons exhibit variable responses to TDP-43 loss and may be cell type-specific. has the sensor been used in different cell lines?

      We tested the CUTS system in differentiated neuronal models using two differentiated neuronal cell types, BE(2)C and ReN VM cells. The results are presented in Figure 5 and Figure S4 of the revised manuscript.

      (6) The order of the text describing 1A and 1B is confusing. The text starts describing the TS cassettes referring to 1A using the CUTS cassettes which haven't been introduced yet as an example. I'd suggest reorganising this section. The graph, always in 1A showing readout proportional to GFP should be taken out or highlighted in the figure legend that it is theoretical.

      We agree with the reviewer’s point. In the original schematic (Figure 1A), we included the CUTS system as an example to introduce the TS cassette design, since it contains the three possible sensor configurations. However, we recognize that this could be confusing. Therefore, we have removed the CUTS cassette from Figure 1A, along with the theoretical graph showing GFP readout proportional to the degree of TDP-43 LOF. In agreement with this change, we also restructured Figure 1. As the focus is the CUTS system, we have moved the Western blot and quantification of UNC13A-TS and CFTR-TS to Supplementary Figure 1.

      Reviewer #2 (Public review):

      Summary:

      The authors goal is to develop a more accurate system that reports TDP-43 activity as a splicing regulator. Prior to this, most methods employed western blotting or QPCR-based assays to determine whether targets of TDP-43 were up or down-regulated. The problem with that is the sensitivity. This approach uses an ectopic delivered construct containing splicing elements from CFTR and UNC13A (two known splicing targets) fused to a GFP reporter. Not only does it report TDP-43 function well, but it operates at extremely sensitive TDP-43 levels, requiring only picomolar TDP-43 knockdown for detection. This reporter should supersede the use of current TDP-43 activity assays, it's cost-effective, rapid and reliable.

      Strengths:

      In general, the experiments are convincing and well designed. The rigor, number of samples and statistics, and gradient of TDP-43 knockdown were all viewed as strengths. In addition, the use of multiple assays to confirm the splicing changes were viewed as complimentary (ie PCR and GFPfluorescence) adding additional rigor. The final major strength I'll add is the very clever approach to tether TDP-43 to the loss of function cassette such that when TDP-43 is inactive it would autoregulate and induce wild-type TDP-43. This has many implications for the use of other genes, not just TDP-43, but also other protective factors that may need to be re-established upon TDP-43 loss of function.

      Weaknesses:

      (1) Admittedly, one needs to initially characterize the sensor and the use of cell lines is an obvious advantage, but it begs the question of whether this will work in neurons. Additional future experiments in primary neurons will be needed.

      We thank the reviewer for highlighting the importance of validating the sensor in neuronal models, given the central role of TDP-43 dysfunction in ALS/FTD and related neurodegenerative disorders. While initial characterization in established cell lines provides experimental control and scalability, we agree that demonstrating functionality in neuronal systems is essential. To address this, we adapted the CUTS platform for neuronal application by incorporating the human synapsin-1 (hSYN1) promoter into the Tet-On 3G system to enable inducible, neuronal specific expression. We validated this configuration in differentiated BE(2)-C cells (Figures 5A-C, S4A-C), where CUTS retained robust responsiveness to TDP-43 perturbation. In parallel, we generated stable CUTS-expressing ReN VM neural progenitor cells and differentiated them for three weeks prior to functional assessment (Figures 5A-C, S4A-C). In both neuronal models, CUTS was functional and responsive to TDP-43 siRNA. We are currently optimizing promoter selection and expression paradigms for fully differentiated iPSC-derived neuronal models and will be the subject of future studies.

      (2) The bulk analysis of GFP-positive cells is a bit crude. As mentioned in the manuscript, flow sorting would be an easy and obvious approach to get more accurate homogenous data. This is especially relevant since the GFP signal is quite heterogeneous in the image panels, for example, Figure 1C, meaning the siRNA is not fully penetrant. Therefore, stating that 1% TDP-43 knockdown achieves the desired sensor regulation might be misleading. Flow sorting would provide a much more accurate quantification of how subtle changes in TDP-43 protein levels track with GFP fluorescence.

      We thank the reviewer for this thoughtful suggestion. We agree that flow cytometry and sorting of GFP-positive populations would provide a higher-resolution, single-cell–level relationship between TDP-43 abundance and sensor output. Such an approach would reduce heterogeneity arising from incomplete siRNA penetrance and allow more precise quantification of how incremental changes in TDP-43 protein levels track with GFP fluorescence. In the present study, our goal was to establish proof-of-principle functionality of the CUTS circuit and to demonstrate that graded TDP-43 depletion produces a proportional sensor response at the population level. While GFP signal heterogeneity is visible in imaging panels, we hypothesize that this variability likely reflects known differences in siRNA uptake and transfection efficiency rather than instability of the circuit itself. Importantly, bulk measurements consistently demonstrated dose-dependent sensor regulation across independent experiments, supporting the robustness of the system despite cellular heterogeneity. Furthermore, we were able to quantify CUTS activation in HeLa TARDBP<sup>-/-</sup> cells. We also note that CUTS was developed as a practical tool for rapid assessment of TDP-43 LOF in standard laboratory settings. Although flow cytometry increases resolution, the ability to detect functional perturbation using bulk fluorescence measurements supports the utility of the system for routine and high-throughput applications.

      We agree that flow cytometry would provide a more refined analysis of the dynamic range and sensitivity of CUTS, particularly for defining thresholds such as minimal TDP-43 knockdown required for measurable activation. We plan to include this work in future studies. Specifically, we have implemented FACs sorting of CUTS-expressing cells in a parallel study in which we are conducting a CRISPR knockout screen to identify modifiers of TDP-43 splicing function. For this, we incorporate TDP-43 knockdown followed by FACs to stratify cells based on CUTS activation. This strategy enables direct evaluation of the relationship between the extent of TDP-43 LOF and CUTS sensor activation. These analyses are ongoing and provide a more quantitative analyses linking TDP-43 depletion to CUTS activation and address the reviewer’s concern regarding heterogeneity in bulk measurements. We plan to include this in a future study.

      (3) Some panels in the manuscript would benefit from additional clarity to make the data easier to visualize. For example, Figure 2D and 2G could be presented in a more clear manner, possibly split into additional graphs since there are too many outputs.

      We thank the reviewer for this suggestion. In response, we have split the graphs previously shown in Figures 2D and 2G to improve clarity, as we agree that these panels contained an extensive amount of data. We Specifically split Figure 2D into two separate graphs showing TDP-43 and GFP pixel intensity from Western blots on the Y-axis, plotted against low siTDP-43 treatment on the X-axis. Please see this data as Figure 2 D and Figure 2E in the new manuscript.

      Furthermore, for Figure 2G we also split into graphs showing the fold change of mRNA for TDP-43 and the CUTS cryptic exon plotted against low siTDP-43 treatment on the X-axis. Please see this data as Figure 2 H and Figure 2I in the new manuscript. We have maintained the previous graphs in Supplementary Figure 2 to preserve the full dataset for reference.

      (4) Sup Figure 2A image panels would benefit from being labeled, its difficult to tell what antibodies or fluorophores were used. Same with Figure 4B.

      We appreciate the reviewer’s careful observation. In both figures, we are showing mCherry and GFP signals. In the revised version, we have added the corresponding labels to the side of each image for clarity. Therefore, Sup Figure 2A has been moved and is now Sup Figure 3A, while Figure 4B remains in its original configuration.

      (5) Figure 3 is an important addition to this manuscript and in general is convincing showing that TDP43 loss of function mutants can alter the sensor. However, there is still wild-type endogenous TDP-43 in these cells, and it's unclear whether the 5FL mutant is acting as a dominant negative to deplete the total TDP-43 pool, which is what the data would suggest. This could have been clarified.

      The TDP-43 5FL variant exhibits reduced RNA-binding capacity, and we previously demonstrated that impaired RNA binding promotes aberrant homotypic phase separation of TDP-43. Consistent with this mechanism, expression of RNA-binding–deficient TDP-43 variants induces the formation of nuclear “anisomes” which have been shown to sequester endogenous TDP-43 into insoluble fractions via dominant-negative mechanisms (Cohen et al., 2015; Keating et al., 2023; Mann et al., 2019; Yu et al., 2021). These findings support a model in which disruption of RNA engagement alters TDP-43 biophysical behavior and promotes functional depletion through self-association. We have expanded this mechanistic explanation in the Results section of the revised manuscript to better contextualize the behavior of the 5FL construct and its impact on endogenous TDP-43.

      (6) Additional treatment with stressors that inactivate TDP-43 could be tested in future studies.

      We appreciate this suggestion and agree with this important point. Due to the lack of methods to directly induce endogenous TDP-43 aggregation and loss of function, the use of stressors has become a partial solution to address this issue. In line with this, our group has tested several stressors in follow-up research, including sodium arsenite (NaAsO₂), puromycin, KCl, MG132, sorbitol, and tunicamycin, using HEK cells expressing the CUTS system(Xie et al., 2025). We were able to show a dose-response relationship in relative GFP intensity under these conditions, with sodium arsenite showing the strongest effect, consistent with previous reports(Huang et al., 2024). To provide additional relevant findings in the current manuscript, we expanded this analysis by testing sodium arsenite in the CUTS system while also including endogenous cryptic exons. We therefore added a new figure showing the effect of sodium arsenite on the CUTS system, including GFP intensity measurements, qPCR using CUTS cryptic exon primers, and three endogenous cryptic exon reporters (ATG4B, GPSM2, and KCNQ2).

      Overall, the authors definitely achieved their goals by developing a very sensitive readout for TDP-43 function. The results are convincing, rigorous, and support their main conclusions. There are some minor weaknesses listed above, chief of which is the use of flow sorting to improve the data analysis. But regardless, this study will have an immediate impact for those who need a rapid, reliable, and sensitive assessment of TDP-43 activity, and it will be particularly impactful once this reporter can be used in isolated primary cells (ie neurons) and in vivo in animal models. Since TDP-43 loss of function is thought to be a dominant pathological mechanism in ALS/FTD and likely many other disorders, having these types of sensors is a major boost to the field and will change our ability to see sub-threshold changes in TDP-43 function that might otherwise not be possible with current approaches.

      (7) Regarding the methods, they seem a bit sparse and would benefit from additional detail. For example, I do not see a section in the methods where microscopy images were quantified (%GFP positive cells for example). This information is important and is lacking in the current form.

      We thank the reviewers, and we add the following information in the method section: For live imaging quantification, we measured the mean GFP signal intensity for each group. The values were averaged, and the fold change was calculated and plotted. For immunofluorescent imaging, we first created maximum intensity projection images. We then applied masks to the GFP, mCherry, and Hoechst signals. By overlapping the GFP and mCherry signals, we identified the number of GFP-positive cells. Similarly, by overlapping the mCherry signal with the Hoechst mask, we identified the CUTS-expressing cells. We then calculated the ratio of GFPpositive cells to CUTS-expressing cells and plotted it as a percentage of GFP-positive cells. All analyses were performed using the Nikon NIS software. This information is included in the methods of the revised manuscript.

      Reviewer #3 (Public review):

      The DNA and RNA binding protein TDP-43 has been pathologically implicated in a number of neurodegenerative diseases including ALS, FTD, and AD. Normally residing in the nucleus, in TDP-43 proteinopathies, TDP-43 mislocalizes to the cytoplasm where it is found in cytoplasmic aggregates. It is thought that both loss of nuclear function and cytoplasmic gain of toxic function are contributors to disease pathogenesis in TDP-43 proteinopathies. Recent studies have demonstrated that depletion of nuclear TDP-43 leads to loss of its nuclear function characterized by changes in gene expression and splicing of target mRNAs. However, to date, most readouts of TDP-43 loss of function events are dependent upon PCR-based assays for single mRNA targets. Thus, reliable and robust assays for detection of global changes in TDP-43 splicing events are lacking. In this manuscript, Xie, Merjane, Bergmann and colleagues describe a biosensor that reports on TDP-43 splicing function in real time. Overall, this is a well described unique resource that would be of high interest and utility to a number of researchers. Nonetheless, a couple of points should be addressed by the authors to enhance the overall utility and applicability of this biosensor.

      (1) While the rationale for selecting UNC13A CE as the reporting CE species is understood given the relevance to disease, could the authors please comment on whether other CE sequences would behave similarly or as robustly? This is particularly critical given the multitude of different splicing changes that can occur as a result of TDP-43 loss of function (ie cryptic exons of differing sensitivity, skiptic exons, premature polyadenylation).

      We thank the reviewer for this question regarding generalizability beyond the UNC13A CE. While UNC13A was selected due to its strong disease relevance and well-characterized sensitivity to TDP-43 loss-of-function (LOF), our platform is not intrinsically restricted to this sequence. In the manuscript, we directly compared three architectures: UNC13A-TS, CFTR-TS, and the combined CUTS sensor incorporating additional UG motif optimization. Under matched conditions in stable HEK293 lines, CUTS demonstrated superior specificity and sensitivity, exhibiting near-zero baseline activity and a proportional, log-linear response across low-dose siTDP43 (38–1200 pM) (Figures 1–2). Importantly, this head-to-head comparison demonstrates that sensor performance can be engineered and optimized beyond a single CE species.

      TDP-43 LOF is known to induce a spectrum of RNA processing defects, including cryptic exons with differing sensitivities and cell-type dependence, premature polyadenylation events (e.g., STMN2), and, under conditions of excess nuclear TDP-43, exon skipping (“skiptic exons”). This diversity supports the concept in which alternative CE elements, or other TDP-43 regulated RNAs, can be incorporated into the same sensor backbone and tuned for specific biological scenarios (cell type, specific stress responses, etc...). Consistent with this, the recently described TDP-REG system (Wilkins et al., 2024) designed and AI-generated de novo CE sequences to express reporters or gene payloads, and screened multiple candidates to identify the appropriate RNA elements required for this response. These findings demonstrate that CE sequences beyond UNC13A can serve as robust TDP-43 sensing elements when optimized. Our results complement this work by demonstrating that CUTS achieves tight baseline control and a steep dynamic range (>110,000-fold induction over baseline in HEK293 cells), while maintaining compatibility across both non-neuronal and neuronal model systems, as shown in the revised manuscript.

      In the revised manuscript, we show direct comparisons indicating that CUTS outperforms single-CE sensors such as UNC13A-TS and CFTR-TS under identical conditions. This supports independent work from other groups that alternative CE sequences can be engineered into effective sensors, depending on their paradigm and model systems. We have clarified this in the revised Discussion and now note that CUTS is adaptable to alternative CE inserts.

      (3) Could the authors provide evidence of the utility of their biosensor in disease relevant systems that do not rely on TDP-43 KD? For example, does this biosensor report on TDP-43 loss of function in C9orf72 iPSNs in a time-dependent manner? Alternatively, groups have modeled TDP-43 proteinopathy in wildtype iPSNs via MG132 treatment.

      We thank the reviewer for this important suggestion. We agree that demonstrating CUTS responsiveness in disease-relevant models independent of artificial TDP-43 knockdown would further strengthen its translational relevance. In the current study, our primary objective was to establish the sensitivity, dynamic range, and autoregulatory properties of the CUTS circuit under controlled perturbation of TDP-43 levels. siRNA-mediated depletion provides a reliable approach to establish the relationship between graded TDP-43 LOF and the CUTS sensor sensitivity/specificity. That said, CUTS is designed to detect functional TDP-43 loss irrespective of the upstream cause. As the reviewer notes, disease-relevant systems, such as C9orf72 iPSC-derived neurons and proteotoxic stress paradigms (e.g., MG132-induced impairment of TDP-43 nuclear function), are important for future studies. We are currently evaluating CUTS in iPSC-derived neuronal models of TDP-43 proteinopathy, but are optimizing the induction system, promoters, and timing. It should be noted that C9orf72 iPSC neurons do not exhibit TDP-43 LOF using standard differentiation protocols. Regarding pharmacological stress, we have shown that acute sodium arsenite treatment can activate CUTS (Figure 3). In a concurrent study under revision, we show that MG132 similarly causes TDP-43 LOF and CUTS activation (Xie et al., 2025). Notably, none of these induce complete nuclear loss of TDP-43; instead, they show nuclear TDP-43 retention or modest mislocalization. This suggests that TDP-43 LOF may also result from nuclear redistribution and dysfunction under these stress conditions, rather than from complete nuclear loss. We look forward to presenting these ongoing studies in the future.

      References

      Brown A-L, Wilkins OG, Keuss MJ, Kargbo-Hill SE, Zanovello M, Lee WC, Bampton A, Lee FCY, Masino L, Qi YA, Bryce-Smith S, Gatt A, Hallegger M, Fagegaltier D, Phatnani H, NYGC ALS Consortium, Newcombe J, Gustavsson EK, Seddighi S, Reyes JF, Coon SL, Ramos D, Schiavo G, Fisher EMC, Raj T, Secrier M, Lashley T, Ule J, Buratti E, Humphrey J, Ward ME, Fratta P. 2022. TDP-43 loss and ALS-risk SNPs drive mis-splicing and depletion of UNC13A. Nature 603:131–137. doi:10.1038/s41586-022-04436-3

      Cohen TJ, Hwang AW, Restrepo CR, Yuan C-X, Trojanowski JQ, Lee VMY. 2015. An acetylation switch controls TDP-43 function and aggregation propensity. Nat Commun 6:5845. doi:10.1038/ncomms6845

      Huang W-P, Ellis BCS, Hodgson RE, Sanchez Avila A, Kumar V, Rayment J, Moll T, Shelkovnikova TA. 2024. Stress-induced TDP-43 nuclear condensation causes splicing loss of function and STMN2 depletion. Cell Rep 43:114421. doi:10.1016/j.celrep.2024.114421

      Keating SS, Bademosi AT, San Gil R, Walker AK. 2023. Aggregation-prone TDP-43 sequesters and drives pathological transitions of free nuclear TDP-43. Cell Mol Life Sci 80:95. doi:10.1007/s00018-023-04739-2

      Mann JR, Gleixner AM, Mauna JC, Gomes E, DeChellis-Marks MR, Needham PG, Copley KE, Hurtle B, Portz B, Pyles NJ, Guo L, Calder CB, Wills ZP, Pandey UB, Kofler JK, Brodsky JL, Thathiah A, Shorter J, Donnelly CJ. 2019. RNA Binding Antagonizes Neurotoxic Phase Transitions of TDP-43. Neuron 102:321-338.e8. doi:10.1016/j.neuron.2019.01.048

      Wilkins OG, Chien MZYJ, Wlaschin JJ, Barattucci S, Harley P, Mattedi F, Mehta PR, Pisliakova M, Ryadnov E, Keuss MJ, Thompson D, Digby H, Knez L, Simkin RL, Diaz JA, Zanovello M, Brown A-L, Darbey A, Karda R, Fisher EMC, Cunningham TJ, Le Pichon CE, Ule J, Fratta P. 2024. Creation of de novo cryptic splicing for ALS and FTD precision medicine. Science 386:61–69. doi:10.1126/science.adk2539

      Xie L, Zhu Y, Hurtle BT, Wright M, Robinson JL, Mauna JC, Brown EE, Ngo M, Bergmann CA, Xu J, Merjane J, Gleixner AM, Grigorean G, Liu F, Rossoll W, Lee EB, Kiskinis E, Chikina M, Donnelly CJ. 2025. Contextdependent Interactors Regulate TDP-43 Dysfunction in ALS/FTLD. BioRxiv. doi:10.1101/2025.04.07.646890

      Yu H, Lu S, Gasior K, Singh D, Vazquez-Sanchez S, Tapia O, Toprani D, Beccari MS, Yates JR, Da Cruz S, Newby JM, Lafarga M, Gladfelter AS, Villa E, Cleveland DW. 2021. HSP70 chaperones RNA-free TDP-43 into anisotropic intranuclear liquid spherical shells. Science 371. doi:10.1126/science.abb4309.

    1. Author Response:

      The following is the authors’ response to the previous reviews

      Public Review:

      Reviewer #1 (Public review):

      The weaknesses are in the clarity and resolution of the data that forms the basis of the model. In addition to general whole embryo morphology that is used as evidence for CE defects, two forms of data are presented, co-expression and IP, as well as a strong reliance on IF of exogenously expressed proteins. Thus, it is critical that both forms of evidence be very strong and clear, and this is where there are deficiencies; 1) For vast majority of experiments general morphology and LWR was used as evidence of effects on convergent extension movements rather than keller explants or actual cell movements in the embryo. 2) the microscopy would benefit from super resolution microscopy since in many cases the differences in protein localization are not very pronounced. 3) the IP and Western analysis data often shows very subtle differences, and some cases not apparent.

      Major points.

      (1) Assessment of CE movement

      The authors conducted an analysis of the subcellular localization of PCP core proteins, including Vangl2, Pk, Fz, and Dvl, within animal cap explants (ectodermal explants). The authors primarily used the length-to-width ratio (LWR) to evaluate CE movement as a basis for their model. However, LWR can be influenced by multiple factors and is not sufficient to directly and clearly represent CE defects. While the author showed that Prickle knockdown suppresses animal cap elongation mediated by Activin treatment, they did not test their model using standard assays such as animal cap elongation or dorsal marginal zone (DMZ) Keller explants. Furthermore, although various imaging analyses were performed in Wnt11-overexpressing animal caps and DMZ explants, the Wnt11-overexpressing animal caps did not undergo CE movement. Given that this study focuses on the molecular mechanisms of Vangl2 and Ror2 regulation of Dvl2 during CE, the model should be validated in more appropriate tissues, such as DMZ explants.

      (2) Overexpression conditions

      Another concern is that most analyses were performed with overexpression conditions. PCP core proteins (Vangl2, Pk, Dvl, and Fz receptors) are known to display polarized subcellular localization in both the neural epithelium and DMZ explants (Ref: PCP and Septins govern the polarized organization of the actin cytoskeleton during convergent extension, Current Biology, 2024). However, in this study, overexpressed PCP core proteins failed to show polarized localization. Previous studies, such as those from the Wallingford lab, typically used 10-30 pg of RNA for PCP core proteins, whereas this study injected 100-500 pg, which is likely excessive and may have created artificial conditions that confound the imaging results.

      (3) Subtle and insufficient effects

      Several of the reported results show quite modest changes in imaging and immunoprecipitation analyses, which are not sufficient to strongly support the proposed molecular model. For example, most Dvl2 remained localized with Fz7 even under Vangl2 and Pk overexpression (Fig. 4). Similarly, Wnt11 overexpression only slightly reduced the association between Vangl2 and Dvl2 (Sup. Fig. 8), and the Ror2-related experiments also produced only subtle effects (Fig. 8, Sup. Fig. 15).

      We thank reviewer 1 for careful reading of our revised manuscript, and additional constructive criticisms. Since the two reviewers had divergent opinions towards our revised manuscript, we think that it might be more productive to request a Version of Record at this point, and have our proposed model debated/ tested by others in the field. We will keep the reviewer’s suggestions in mind while design ongoing studies. We would like to address the criticisms collectively below:

      (1) The primary goal of our current manuscript is to build a mechanistic model for non-canonical Wnt signaling through elucidating the functional relationships between Dvl, Vangl, PK and Ror during CE. They each have been studied extensively in prior literature using DMZ injected embryos, and DMZ, Keller and animal cap explants, so there is little doubt that the reduced LWR following their over-expression or knockdown in DMZ is due to disruption of CE. In the context of our study in the current manuscript, we primarily performed their co-injections in different combinations to differentiate synergistic vs. antagonistic relationship, and in the majority cases we relied on epistatsis to draw conclusions (e.g. Fig. 1; Fig. 2h, I; Suppl. Fig. 6; Suppl. Fig. 14). Nevertheless, we did follow the reviewer’s suggestion and used animal cap elongation as an additional assay to confirm that Pk and Vangl2 did synergize to disrupt CE, and their synergy could be blocked by Dvl2 co-overexpression; the new data is added to Fig. 1 (Fig. 1h, h’). Therefore, given the prior literature, our new animal cap explant data, and the specific scope of our current study, we feel that the LWR measurement is a reasonable assay to determine CE phenotype in this manuscript. We fully agree with the reviewer that our model will need to be tested at the cellular level through live imaging of DMZ explants; it is indeed the direction of our future study, but is beyond the scope of the current manuscript.

      (2) A salient feature of non-canonical Wnt signaling is that loss or over-expression of any components can often cause identical CE defects at the tissue/ embryo level. We used many co-injection experiments to demonstrate that this is due, at least in part, to a counterbalance between Dvl/Ror and Vangl/PK (e.g. Fig. 1; Fig. 2h, I; Suppl. Fig. 6; Suppl. Fig. 14). It is in this context that we planned the imaging and biochemical experiments to determine the possible molecular mechanisms underlying their functional interaction, and we feel that the moderate over-expression used is reasonable in this case for us to build the first integrated model. We do plan to test our model using lower expression in the future. To acknowledge the limitation of our study, we also added the following sentences in the Discussion:

      “We acknowledge, however, that our model explains primarily the potential molecular actions underlying the regulation of CE at the tissue level. Whether and how our model may explain the cellular behavior during CE, such as polarized remodeling of cell junction or extension of cell protrusions, will require further study.”

      (3) The Wnt11 induced reduction of Dvl2-Vangl2 co-IP (Suppl. Fig. 8, 15) may be moderate, but is statistically significant and reproducible, and we have reported similar findings in two other publications (DOI: 10.1093/hmg/ddx095; DOI: 10.1038/s41467-025-57658-0). Given the limitation of co-IP, we had to rely on high level over-expression to make the experiments feasible. We are building proximity based assays such as NanoBRET, and plan to verify the result with lower level expression in the future.

      Reviewer #2 (Public review):

      We thank the reviewer for the encouraging comments, and the suggestion to clarify the description related to Suppl. Fig. 15. We made revision according to the reviewer’s suggestion, and added Suppl. Fig. 16 to further examine the effect of Ror2 knockdown on the steady state interaction between Dvl2 and Vangl2 using imaging approach.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript presents an end-to-end pipeline, intended to accelerate EM-based connectomics by combining low-resolution imaging for large volumes with synapse-level imaging only in selected regions of interest. In principle, this strategy can substantially reduce imaging time, computational demands, analysis time, and overall cost.

      General note:

      Overall, I found the manuscript interesting and valuable, particularly as a description of how one laboratory has assembled and applied a practical workflow to reconstruct and analyze the central complex across multiple insect species. In that sense, the work is compelling as an account of a real, functioning strategy for comparative connectomics, and I appreciated reading it. My main reservation is not about the relevance of the biological problem or the utility of the pipeline in the authors' own hands, but about whether the manuscript, in its current form, fully meets the expectations of a paper that is focused on tools and resources. The expectation would be that this paper would be a venue for sharing new techniques, software tools, datasets, and other resources intended to be usable by the community. Here, because much of the pipeline appears to build on existing methods and software, the key value added should be a particularly clear demonstration of how these components were adapted, integrated, validated, and documented for this specific use case in a way that others could realistically reproduce and adopt. At present, that translational and reproducibility-oriented component does not yet seem sufficiently developed, despite the clear promise of the overall approach.

      Major comments:

      (1) The work is valuable as a practical integration and application of multiple existing tools into a coherent pipeline, together with a new multi-resolution imaging strategy. However, the manuscript at times reads as though it introduces an entirely novel workflow. I would encourage the authors to clarify the contribution more explicitly: which components are genuinely new (for example, the acquisition strategy and the end-to-end integration/validation), and which are adaptations of already established methods or software. This would make the scope and novelty of the paper easier to assess.

      (2) The most distinctive element is the multi-resolution acquisition strategy. However, as described, the selection of high-resolution regions seems to be decided a priori based on anatomy (guided by xCT localization of the CX), rather than being determined automatically from the data (i.e., ROI placement is anatomy-driven rather than data-driven). A more data-driven or machine learning-guided ROI strategy would strengthen the methodological contribution and the adaptability to new scenarios, along the lines of approaches such as SmartEM [1].

      (3) The manuscript emphasizes open-source availability and reduced barriers to entry, but the current software release, as referenced, does not yet appear to support straightforward external reuse. Since much of the pipeline builds on existing methods, the main added value lies in how these technologies were adapted, combined, and validated for the present problem. A clear and complete explanation of this adaptation is therefore essential, but is currently missing. I would suggest the following concrete improvements:<br /> a) Provide a single landing page or umbrella repository that links each pipeline step in the paper to the corresponding codebase, including version tags/commits and expected inputs/outputs for each step.<br /> b) Include step-by-step tutorials for each component.<br /> c) Provide an example dataset together with a full reproduction walkthrough in a controlled environment.<br /> d) Clearly explain the required parameters and configuration for each step, including how they should be adjusted for other datasets or scenarios.<br /> e) Follow packaging and distribution best practices (for example, PyPI/conda releases, Docker containers, and version pinning).

      (4) In my own attempt to set up and run parts of the released code, I encountered issues that currently limit reproducibility. For example, when creating an environment for EMalign (https://github.com/Heinze-lab/EMalign), the required Python version is not specified, and installation did not succeed under Python 3.12 due to dependency constraints. Additionally, synful_312 (https://github.com/Heinze-lab/synful_312) and SegToPCG (https://github.com/Heinze-lab/SegToPCG) appear to be empty despite being referenced in the manuscript. These are fixable issues, but addressing them is important if the paper is to deliver on its "low entry cost" claim.

      (5) Table 1 reports acquisition times, which is helpful. However, the multi-resolution approach adds essential processing steps that appear due to the strategy followed (e.g., "XY alignment high-res" and "high-res to low-res alignment"). Please include registration/alignment (and other major post-processing) runtimes and resource requirements, such as storage, in a comparable table so readers can assess true end-to-end cost.

      References:

      [1] Meirovitch, Y., et al. "SmartEM: machine learning-guided electron microscopy." Nature Methods (2025).

    2. Reviewer #2 (Public review):

      Summary:

      The paper proposes a workflow to accelerate EM connectomics by combining multi-scale imaging with image processing and analysis (image alignment, registration, neuron tracing, automated segmentation and synapse prediction, proof-reading) to derive a brain region connectome. The paper argues and (partially) demonstrates that this approach facilitates comparative connectomics.

      The data acquisition pipeline uses a well-established sample preparation protocol, uCT guided acquisition, and SBEM imaging at cellular and synaptic resolution.

      Data processing and analysis combine existing state-of-the-art components and focus on the alignment and complementary analysis of the two SBEM resolution levels. The paper applies the workflow to the central complex of six different insects and performs some preliminary analysis based on this (which is acceptable for a resource/tool).

      Disclaimer for the rest of the review: I am an expert in image analysis and segmentation, so I have mainly focused on these aspects as I am not qualified to analyze the details of image acquisition.

      Strengths:

      The paper addresses an important problem and promises an acceleration and democratization of comparable connectomics. The time savings of the imaging approach are well-motivated and derived. The methods used for image alignment, segmentation, synapse detection, and proofreading are state-of-the-art.

      Weaknesses:

      I see two major weaknesses in the paper:

      (1) The paper introduces the (approximate) equivalence of the projectome and connectome in the insect brain very prominently in the introduction and uses this as a central motivation for the multi-resolution image acquisition protocol. But - to me - it is unclear how this principle is really used in the analysis presented in the last results and if this assumption is evaluated at all. Specifically, Figure 4 a shows the anatomical neuron reconstructions (from cellular resolution SBEM), d-g show connectome-level analysis from the synaptic resolution data. The only link I can see between the two is that the neural processes in the synapse-resolution data can be mapped to the neurons from the cellular resolution data, thanks to the image alignment. This is certainly important, BUT it is only tangentially related to the projectome vs. connectome claim from the introduction. This claim implies that a tentative connectome is derived from projectome-level data (e.g. by assuming a uniform probability of synapse-formation given surface or distance between projections) that is then validated by the "true" connectome data from synaptic resolution. Instead, what is actually solved - to my understanding - is mapping the local connectome to the projectome. While related, these are different things and the current framing of the paper and the quite brief description of the section on comparative connectomics (also no corresponding Methods section) make this claim inadequately supported.

      (2) Reporting on segmentation and proofreading is purely qualitative. Given that this is claimed as a core contribution of the paper (e.g. statement in line 497 and following), I would expect substantially more reporting and evaluation of this claim:<br /> a) Report the actual time needed for proofreading the segmentations in CAVE. I could not find any numbers on this.<br /> b) Report the initial segmentation quality of the model: How many errors does it make? Note: There is a brief mention of VoI-based quantification in Methods (around line 1060), but the results are not reported.

      What should be done: Report the error rates (with an accurate measure such as skeleton VoI) independently for all 6 volumes. Given that the authors have the proofread versions, this is feasible. Only then can the claims be made here be evaluated. Note that the F1-score of synapse prediction is quantified. This is a good starting point, but could also be extended to further species in order to assess the actual transferability. Furthermore, none of the data from the study seems to be available. The training data of the network has to be made available. If possible, high-resolution data should be proofread too.

      Further points:

      (1) Why isn't reconstruction at the cellular level addressed with ML? This is surely possible and should be easier than the full connectome analysis. Similar to before, the actual times needed for tracing with CATMAID are not reported; the manuscript only states that this can be done in minutes for a neuron, but it's unclear if this is the best or average case. It would help to have quantitative numbers to assess whether automation would bring any benefits.

      (2) Finally, regarding the underlying software. I did not try this myself due to time constraints, but did check the repositories. They seem to be in an ok state with some documentation in a README. However, given the central role of the software contribution, I would expect a centralized doc page that explains how to use the different parts of the software, including a full example with sample data. Without this, application by other labs - a central claim - will be difficult.

    1. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors test the hypothesis that whole-brain functional magnetic resonance imaging in behaving mice, coupled with reinforcement-learning modeling, can dissociate neural substrates of initial cue-reward acquisition versus contingency reversal, and potentially reveal underappreciated contributors to cognitive flexibility. Using a head-fixed go/no-go odor discrimination task with subsequent rule reversal in a subset of mice, they model trial-by-trial state-action values with a model-free Q-learning algorithm (hierarchical Bayesian fit) and use the model-derived decision variable as a parametric regressor in whole-brain analyses. They report that acquisition-related signals prominently involve ventral and dorsal striatal regions, whereas reversal learning additionally recruits the periaqueductal gray (negative correlation with the decision variable) and shows an apparent double dissociation between nucleus accumbens and periaqueductal gray responses for hit versus correct-rejection outcomes during reversal.

      Strengths:

      (1) The reversal manipulation is implemented without explicit punishment, targeting suppression of previously rewarded actions under reward omission - an underexplored regime for midbrain contributions beyond canonical threat/pain framing.

      (2) The manuscript provides a credible MR-compatible olfactory/licking platform with synchronized sniff/lick/valve/reward timing and high-field imaging, supporting feasibility and broader utility for mesoscale systems neuroscience in rodents.

      (3) Trial-by-trial value estimates from a Q-learning variant are fit via hierarchical Bayesian inference and explicitly integrated into subject-level general linear models with a mouse hemodynamic response function, which is appropriate for leveraging within-subject dynamics in small-N rodent fMRI.

      (4) The decision-variable maps during acquisition recover expected basal ganglia involvement (including nucleus accumbens and dorsal striatum), providing face validity; the reversal-stage map yields an interpretable set of cortical/striatal/pallidal regions plus periaqueductal gray/hippocampus.

      (5) The finite impulse response analysis stratified by behavioral outcomes (hit, false alarm, correct rejection, miss) adds interpretability beyond the model regressor alone, and the reported crossover interaction between nucleus accumbens and periaqueductal gray is potentially impactful if robust.

      Weaknesses:

      (1) The core claim regarding selective periaqueductal gray engagement rests on a subset of n = 6 mice for reversal. With permutation-based whole-brain inference and very small cluster sizes, the robustness of the periaqueductal gray effect to reasonable analytic perturbations is not yet convincing. I would suggest providing leave-one-animal-out analyses for the periaqueductal gray cluster/ROI effects and reporting how often the key findings survive.

      (2) The authors note that due to temporal resolution and hemodynamics, they cannot separate stimulus, choice, and feedback and therefore model "whole trials." This limitation creates ambiguity about whether periaqueductal gray signals reflect value updating, action inhibition (no-lick), reward omission, autonomic arousal, or motor preparation/withholding, especially given the strong hit versus correct-rejection opponency. I would suggest adding targeted analyses that disambiguate "withholding" from "reversal-related updating".

      (3) ROIs are defined from the whole-brain decision-variable maps and then interrogated by outcome types; the manuscript acknowledges non-independence. This can inflate apparent dissociations. It would be better if the authors define ROIs independently (anatomical periaqueductal gray/nucleus accumbens masks, or split-half ROI definition with held-out data) and repeat the key ROI conclusions.

      (4) The reversal group is a subset of the acquisition cohort and also experiences a different task phase structure and additional sessions; the paper attempts to address exposure differences descriptively. I would suggest that the authors formally test whether periaqueductal gray effects are explained by session count, time-in-scanner, or learning rate differences (e.g., include these as covariates, or match sessions more strictly).

      (5) The platform records sniffing and licking, but the imaging models described include motion, global, and ventricle regressors and do not clearly include trialwise lick/sniff covariates. Given the periaqueductal gray's known autonomic and defensive coordination roles, physiological state confounding is a major concern. Could the authors incorporate sniff and lick metrics (and their derivatives) as nuisance regressors and show whether the periaqueductal gray effects persist?

    1. Reviewer #1 (Public review):

      Summary:

      In their manuscript, Zhou and colleagues present a detailed look at how the JSP functions differently in the various cells of a breast tumor. The authors have effectively shown that the JSP acts as a double-edged sword, as it helps T cells fight cancer but also allows tumor cells to grow and avoid ferroptosis. These findings are important because they identify a useful biomarker to predict how TNBC patients might respond to PD-1 inhibitors.

      Strengths:

      This work is important because it provides a clear explanation for the conflicting roles of the JSP in the tumor environment. The evidence is solid, as it combines data from thousands of patients with single-cell analysis and lab experiments to confirm the role of STAT4 in cancer progression and immunity.

      Weaknesses:

      However, there are areas for improvement in the scope of the review, the depth of analysis, and the potential for broader clinical implications. The authors are encouraged to address these issues to enhance the scientific and clinical impact of the study.

      Major Issues:

      (1) The authors demonstrate that STAT4 upregulates SLC47A1, but this is currently supported only by expression correlation and western blot data. To confirm a direct link, the authors are encouraged to perform ChIP-qPCR or luciferase reporter assays to show that STAT4 binds directly to the SLC47A1 promoter.

      (2) The conclusion that the MIF-CD74 axis drives immunosuppression is based on computational inference. To support this, the authors could consider mining publicly available breast cancer spatial transcriptomics data to show the co-localization of MIF and CD74. Alternatively, performing simple dual-color immunofluorescence staining on a few clinical sections would effectively demonstrate the physical proximity of these cells.

      (3) TNBC is highly heterogeneous and includes subtypes like mesenchymal and immunomodulatory groups. The authors should analyze whether the JSP score or STAT4 levels vary significantly between these subtypes, as this could further refine the selection of patients for JAK1 inhibitors.

      (4) While the JSP score works well in the current datasets, the authors should consider validating its predictive accuracy in additional independent immunotherapy cohorts, such as the TONIC trial, to ensure the biomarker is robust across different treatment settings.

      Minor Issue:

      The manuscript mentions a U-shaped trajectory of JSP activity during tumor transition. A more detailed biological explanation of why the pathway activity initially drops and then rises would add depth to the discussion.

    1. There are four strategies that Spanish/English bilinguals have recourse to in order toconvey additional stylistic meaning and achieve personal communicative intentions: (1)emphasis or contrast, (2) mode or topic shift, (3) controlling the addressee, and (4) personalization or objectivization

      .

    1. “You know, like all these authors and illustrators, you could use translanguaging in your memoirs if it will help you tell your story truthfully, more like the way it really happened.” Their faces seemed to open up like a thank you.

      It is so important to find picture books that use translanguaging. These books can serve as models. Students can see how authors use multiple languages and apply that to their own writing. But these books also serve a deeper purpose. They show students that their languages belong in school. Many students are used to thinking they should only write in English. Seeing these books challenges that idea. It almost gives them permission to use all their language resources and write in a way that feels natural to them. It tells them, “these authors are doing it and you can do it too.”

      This is why text choice is so important. The books we use can send powerful messages to students. Books that include translanguaging show students that their language practices are important and something we want to see in their own writing. This is why the students’ “faces seemed to open up like a thank you.”

    2. They approach writing tasks with more flexibility, knowing that they can use a broader language repertoire that is much more congruent with their lives outside school.

      Talia Zoref: I agree, translanguaging is beneficial for students because they are not limited by one language. They may want to say something in a certain way but can only do so in their native language. It also makes them feel more connected as they may talk another language at home. These students are not put into a box when they have this flexibility. Translanguaging promotes student identity and culture through cultural names, places, and phrases.

    3. Translanguaging does not demarcate children’s languages; instead, translanguaging maintains children’s active negotiation of language resources and practices on a continuum in order to express themselves in particular contexts.

      I find this description of translanguaging to be a very good one because translanguaging does not limit or isolate language and rather it sees it as a collection of linguistic abilities that can be used in unison to express meaning and feeling. Being worded as an "active negotiation of language" is such an interesting way to put it because as somebody who translanguages every single day, it sounds like a good way to express the idea that the two languages aren't competing, but are instead flowing in our brains and bargaining depending on the situation or circumstance that we find ourselves in.

    1. On 2026-01-19 13:00:58, user Gene C Koh wrote:

      Gene Ching Chiek Koh, Serena Nik-Zainal

      Department of Genomic Medicine, University of Cambridge, CB2 0QQ, UK.

      We commend Kanwal et al. for their timely evaluation of the in vivo mutagenic potential of CX-5461. This follows our report that CX-5461 induces substantial mutagenesis in cultured mammalian cells1. The authors analysed samples from four patients treated with CX-5461, including marrow aspirates, trephine biopsies, PBMCs, and skin lesions collected at early treatment timepoints (baseline; days 1, 2, or 9; and end-of-treatment of a 21-/28-day cycle), and used error-corrected duplex sequencing to detect low-frequency mutations. They concluded that CX-5461 exposure did not increase single-/ double-base substitution or indel burdens, nor reproduced the mutational signatures reported in our in vitro study. While we welcome their contribution, several methodological and interpretive shortcomings limit the conclusions that can be drawn.

      1. Data presentation<br /> Figures 1–3 present absolute mutation counts instead of frequencies normalized to total informative duplex bases per sample. In duplex sequencing, normalization is a basic requirement to account for variability in sequencing depth and library complexity; without it, true mutation accrual or fold-change differences versus controls (if any) cannot be assessed reliably.

      2. Experimental controls, assay sensitivity, and performance<br /> The study lacks essential positive and negative controls making it impossible to evaluate whether the sequencing and analytical processes used by the authors have worked. Clinical samples with known mutational signatures detectable through this approach should have been included to confirm assay sensitivity and substantiate a true negative finding. This is fundamental. Samples from patients unexposed to CX-5461 were also required as negative controls to establish background variability, affording confidence intervals and statistical robustness.<br /> Moreover, the authors have not shown awareness of the assay’s limit of detection (LOD). What is the smallest measurable fold-change at the reported sequencing depth? Without this, one cannot determine the smallest mutational differences that could have been missed. The authors have not disclosed quality-control metrics required to understand whether sufficient data quality was achieved for detecting differential mutagenesis. P/S: TwinStrand kit has an error rate ~0.5e-7 to 1e-7 depending on the protocols, and this can be considerably higher if DNA quality is low or from fixed biopsies.

      3. Lack of curation, comparisons to literature<br /> The reported mutation counts did not make sense (baseline values exceeding treated samples, patient samples sometimes lower than kit control). The authors should perform some ‘sanity check’ comparisons with published mutation frequencies of respective normal adult tissues from other duplex-sequencing studies2,3. Analytical rigour would include, for example, examining whether detected variants represent driver mutations from clonal haematopoiesis or occurred in genes under post-treatment selection. Such analyses would have demonstrated critical evaluation of data quality and biological relevance.

      4. Cell-type considerations, sampling window<br /> Most analysed compartments—PBMCs, MACS-sorted marrow fractions—are dominated by mature, non-dividing cells that rarely fix new mutations. A more relevant population for assessing mutagenicity is the haematopoietic stem and progenitor cells (HSPCs), typically <0.5% of marrow cells. A null result in the analysed compartments could just mean no widespread mutation fixation in mature immune cells; it does not exclude the possibility of mutagenesis in progenitors below the detection threshold of the current assay.<br /> In addition, samples were taken at very early timepoints (days 1, 2, 9, or EOT) of the first treatment cycle. At such intervals, mutagenic events are unlikely to have become fixed, as mutagen-induced DNA damage will need time to become embedded through DNA repair and replication. Exposure in terminally-differentiated cells might yield no detectable mutations. If exposure occurs on dividing cells, mutational footprints may only become detectable months or years after exposure. The current dataset lacks the temporal window necessary to assess cumulative in vivo mutagenicity.

      5. Expected evidence of prior treatments <br /> All four patients reportedly had “measurable, relapsed, or refractory advanced haematologic malignancies without any standard therapeutic options available”4. Although treatment histories were not provided, these patients likely received multiple prior therapies (e.g., doxorubicin, cyclophosphamide, etc) that could induce characteristic mutational signatures in normal haematopoietic cells5. Were signatures of prior therapy detected by the authors? Their absence raises concerns regarding the overall assay sensitivity and/or suggests that sampling strategy was suboptimal for detecting mutagenic exposures.

      6. Interpretation of model data<br /> While critical of our findings in cultured human cells as “not adequately representative of physiological human tissue” – a limitation we explicitly acknowledged in our manuscript’s title and discussion – the authors cited a C. elegans study6 in support of their argument of “low non-selective mutagenic potential of CX-5461”. This interpretation is incorrect: the worm study reported high copy-number aberrations, high SNV burdens, and a distinct A>T/T>A-rich signature after CX-5461 exposure, with survival requiring multiple repair pathways (homology-directed repair, microhomology-mediated end joining, nucleotide excision repair, and translesion synthesis). If anything, these cross-species findings reinforce rather than contradict our observations that CX-5461 is highly mutagenic. The concentrations used in that study were chosen to promote viability in the worms, not to minimise mutagenicity. Selective viability does not equate to selective mutagenicity.

      7. Clinical mutagenicity testing<br /> We agree that clinical safety assessments must be rigorous and physiologically relevant. The authors dismissed our experiments as not rivalling the “GLP-compliant, non-mutagenic” results of the CX-5461 drug development pathway. However, those mutagenicity data are not available in the public domain and have neither been shared by the authors nor the company that distributes CX-5461.

      We urge the authors to reconsider and not simply dismiss our findings. First, the primary clinical quality mutagenicity assay (required by agencies such as the US Food and Drug Administration (FDA), European Medicines Agency, and UK Medicines and Healthcare Regulatory Agency (MHRA)) referred to by the authors comprises the Ames test – a reverse gene mutation test performed in prokaryotes (e.g., E.coli, Salmonella).

      Second, according to the FDA’s ICH S2(R1) guidance for a standard battery of mutagenicity assays (Safety Implementation Working Group of the International Conference on Harmonisation of Technical Requirements for Registration of Pharmaceuticals for Human Use), additional genotoxic assays should be performed in mammalian cells in vitro (where some of the more common assays include metaphase chromosome aberration assays, the micronucleus assay, and the mouse lymphoma L5178Y cell Tk (thymidine kinase) gene mutation assay (MLA)) or in in vivo studies as necessary.

      Third, the FDA guidance acknowledges that “no single test is capable of detecting all genotoxic mechanisms relevant in tumorigenesis” and that the standard battery serves primarily for hazard identification rather than comprehensive assessment of mutagenic potential. For negative in vivo results, the ICH S2(R1) guidance requires evidence of adequate target-tissue exposure (e.g., toxicity in the tissue, TK/PK data, or direct tissue concentrations) to validate interpretability. Without such data, negative findings have limited meaning, especially where in vitro systems demonstrate strong mutagenicity.

      Fourth, while the Ames test served its purpose for decades, there are well-described problems including false positives, false negatives and critically, a lack of human metabolism that even supplementation with rodent S9 mix cannot always overcome.

      Finally, a point also raised by the accompanying commentary to our publication is that perhaps the time has come to re-evaluate how mutagenicity assays are performed. Current assays cannot capture the genome-wide mutation patterns revealed by whole-genome sequencing in human cells, and as a community we should consider using unbiased, agnostic, modern genomic approaches capable of detecting all classes of mutational changes in human cells. This is not an attack on CX-5461; rather, it is a call to the community to consider re-evaluation of mutagenicity assays in drug development.

      8. Unsubstantiated claims<br /> The claim of potential contaminants accounting for the mutagenic outcomes we and others have observed is speculative and unsupported. The fact that multiple studies1,6 observed the same mutagenic outcomes using CX-5461 from independent sources suggests that this is unlikely. The authors showed no analytic chemistry (LC-MS/MS) and/or spiking experiments to substantiate this claim.

      9. Inadequate supporting material throughout <br /> There were many gaps in the methods/supporting information, including adequate clinical annotation, precise sampling times/total treatment cycle, and basic quality-control metrics. Experimental details (e.g., antibodies used for MACS sorting, essential for interpreting analysed subpopulations) were not provided. These omissions limit transparency, reproducibility, and the interpretability of the findings.

      10. Beneficence, non-maleficence, autonomy, justice<br /> First, in academia and medicine, we are guided by the principle of doing no harm. In identifying mutagenesis in experimental systems (an incidental finding), we acted in the best interest of the community – reporting an observation that could have an impact on patients and acknowledging the limitations of our system. We have no role in the (dis)continuation of clinical trials; we simply presented our data transparently and highlighted potential risk. <br /> Second, while the authors chose to discontinue their trial, several others remained active (e.g., NCT04890613, NCT06606990, NCT07069699, NCT07147231, NCT07137416). Their decision was conservative, and in our view, scientifically prudent. We commend their caution. However, it does not justify criticism of those of us reporting safety concerns in good faith.<br /> Third, as a community, we serve society better by being aware of issues, addressing the problems with robust experiments rather than polarising into groups “for” or “against” a compound, so that truly beneficial compounds can get to patients as quickly as possible. <br /> Finally, safety concerns may extend beyond mutagenesis and include tumour promotion effects. CX-5461’s interaction with TOP2B, for example, has been linked to serious, late-emerging toxicities, including therapy-induced leukaemia and cardiotoxicity7-10.

      Concluding remarks<br /> Given the experimental and analytical shortcomings outlined above, definitive conclusions regarding CX-5461’s in vivo mutagenicity cannot yet be drawn. The absence of evidence should not be taken as evidence of absence. Rigorous, longitudinal studies with appropriate controls and independent oversight are required to assess true medium- to long-term risks.

      We share the authors’ view that thorough, transparent evaluation of anticancer agents is essential. Given the authors’ vested interest in finding a negative result, we suggest independent individuals be involved in performing the analysis/interpretation of their studies to negate potential conflicts of interest. We remain open to collaboration in this effort, in the shared interest of patient safety and scientific integrity.

      1. Koh, G.C.C., Boushaki, S., Zhao, S.J., Pregnall, A.M., Sadiyah, F., Badja, C., Memari, Y., Georgakopoulos-Soares, I., and Nik-Zainal, S. (2024). The chemotherapeutic drug CX-5461 is a potent mutagen in cultured human cells. Nat Genet 56, 23-26. 10.1038/s41588-023-01602-9.
      2. Abascal, F., Harvey, L.M.R., Mitchell, E., Lawson, A.R.J., Lensing, S.V., Ellis, P., Russell, A.J.C., Alcantara, R.E., Baez-Ortega, A., Wang, Y., et al. (2021). Somatic mutation landscapes at single-molecule resolution. Nature 593, 405-410. 10.1038/s41586-021-03477-4.
      3. Machado, H.E., Mitchell, E., Obro, N.F., Kubler, K., Davies, M., Leongamornlert, D., Cull, A., Maura, F., Sanders, M.A., Cagan, A.T.J., et al. (2022). Diverse mutational landscapes in human lymphocytes. Nature 608, 724-732. 10.1038/s41586-022-05072-7.
      4. Khot, A., Brajanovski, N., Cameron, D.P., Hein, N., Maclachlan, K.H., Sanij, E., Lim, J., Soong, J., Link, E., Blombery, P., et al. (2019). First-in-Human RNA Polymerase I Transcription Inhibitor CX-5461 in Patients with Advanced Hematologic Cancers: Results of a Phase I Dose-Escalation Study. Cancer Discov 9, 1036-1049. 10.1158/2159-8290.CD-18-1455.
      5. Mitchell, E., Pham, M.H., Clay, A., Sanghvi, R., Williams, N., Pietsch, S., Hsu, J.I., Obro, N.F., Jung, H., Vedi, A., et al. (2025). The long-term effects of chemotherapy on normal blood cells. Nat Genet 57, 1684-1694. 10.1038/s41588-025-02234-x.
      6. Ye, F.B., Hamza, A., Singh, T., Flibotte, S., Hieter, P., and O'Neil, N.J. (2020). A Multimodal Genotoxic Anticancer Drug Characterized by Pharmacogenetic Analysis in Caenorhabditis elegans. Genetics 215, 609-621. 10.1534/genetics.120.303169.
      7. Pan, M., Wright, W.C., Chapple, R.H., Zubair, A., Sandhu, M., Batchelder, J.E., Huddle, B.C., Low, J., Blankenship, K.B., Wang, Y., et al. (2021). The chemotherapeutic CX-5461 primarily targets TOP2B and exhibits selective activity in high-risk neuroblastoma. Nat Commun 12, 6468. 10.1038/s41467-021-26640-x.
      8. Zhang, W., Gou, P., Dupret, J.M., Chomienne, C., and Rodrigues-Lima, F. (2021). Etoposide, an anticancer drug involved in therapy-related secondary leukemia: Enzymes at play. Transl Oncol 14, 101169. 10.1016/j.tranon.2021.101169.
      9. Cowell, I.G., Sondka, Z., Smith, K., Lee, K.C., Manville, C.M., Sidorczuk-Lesthuruge, M., Rance, H.A., Padget, K., Jackson, G.H., Adachi, N., and Austin, C.A. (2012). Model for MLL translocations in therapy-related leukemia involving topoisomerase IIbeta-mediated DNA strand breaks and gene proximity. Proc Natl Acad Sci U S A 109, 8989-8994. 10.1073/pnas.1204406109.
      10. Zhang, S., Liu, X., Bawa-Khalfe, T., Lu, L.S., Lyu, Y.L., Liu, L.F., and Yeh, E.T. (2012). Identification of the molecular basis of doxorubicin-induced cardiotoxicity. Nat Med 18, 1639-1642. 10.1038/nm.2919.
    1. On 2026-01-13 16:13:38, user Christine Stabell Benn wrote:

      Comment on “Non-specific effects of vaccines on all-cause mortality: a meta-analysis of randomized controlled trials (RCTs) 2012–2025”<br /> Christine Stabell Benn, Frederik Schaltz-Buchholzer, Sebastian Nielsen, Peter Aaby<br /> We commend the authors for addressing the important and contentious question of non-specific effects (NSEs) of vaccines on all-cause mortality. However, we have several major concerns regarding the framing, completeness, methodology, and interpretation of the preprint. Collectively, these issues undermine the conclusions drawn.

      1. Restricted research question and dismissal of large parts of the evidence baseThe authors explicitly restrict their review to randomized controlled trials (RCTs) published after the WHO review of non-specific effects(1). If the stated objective is to assess the evidence for NSEs on all-cause mortality in randomized trials, an updated meta-analysis incorporating all relevant RCTs, rather than an arbitrarily time-limited subset, would be more informative. The decision to exclude pre-2012 RCTs from the main analysis appears methodological rather than substantive and risks answering a narrow procedural question rather than addressing the broader scientific question.

      More importantly, NSEs represent a research area in which randomized trials are inherently difficult or impossible to conduct at scale, because the vaccines in question are already part of routine immunization schedules. As in other areas of public health - such as smoking, breastfeeding, or nutrition - causal inference therefore relies on triangulation across multiple study designs, including observational studies and natural experiments, supported by biological and immunological evidence.<br /> If the intention is to provide a meaningful update on the state of the evidence for NSEs, a comprehensive synthesis that acknowledges the strengths and limitations of all relevant study designs - or at minimum a clear and balanced justification for excluding them - is required.

      2. Incomplete identification of relevant randomized trialsDespite claiming a comprehensive search, the review misses several important randomized controlled trials that are directly relevant to NSEs, including recent RCTs published well within the stated search window (e.g. PubMed IDs: 39357573, 38350670, 33893799, 30256314). The omission of these trials raises concerns about the sensitivity of the search strategy and undermines confidence in the completeness of the evidence base.

      3. Extreme clinical and methodological heterogeneity invalidates the pooled meta-analysis<br /> The meta-analysis combines trials of three different vaccines (BCG, measles vaccine, and OPV) administered at vastly different ages (birth to 59 months), with follow-up periods ranging from days to five years, and using different randomization schemes and outcomes structures. This is not merely “heterogeneity,” but fundamentally different interventions addressing different biological hypotheses.

      Pooling these studies is not equivalent to combining “apples and bananas,” but rather apples and cars. The resulting pooled estimate does not correspond to a coherent causal treatment effect and is therefore not interpretable.

      4. Non-adherence with the WHO meta-analysis methodologyBy pooling all vaccines together, and furthermore by not focusing on the time window where a given vaccine is the most recent, the authors of the new meta-analysis violates the principles set out in the WHO meta-analysis, which emphasized vaccine-specific analyses and the importance of the most recent vaccine exposure.

      5. Overreliance on conservative confidence interval methods without adequate justificationThe authors emphasize the use of the Hartung-Knapp-Sidik-Jonkman (HKSJ) method as providing “more reliable and conservative control of type I error.” While HKSJ can be appropriate when few studies estimate the same underlying effect, its application here - given the very marked heterogeneity and conceptual incoherence of the pooled treatment effect - adds statistical conservatism without resolving the more fundamental problem of model misspecification. The resulting wide confidence intervals should not be interpreted as robust evidence against NSEs.

      6. Misinterpretation of heterogeneity statistics (I²)The statement that an I² of ~44% indicates that “approximately half the differences in the results are due to actual variations between studies” is misleading in this context. I² is meaningful only when studies estimate the same underlying causal association. When fundamentally different interventions are pooled, I² no longer has the interpretation implied by the authors.

      7. Speculation that early BCG effects are due to bias is unsubstantiatedThe manuscript repeatedly suggests that observed mortality reductions within the first 1–3 days after BCG vaccination may reflect bias due to lack of blinding. This speculation appears inconsistent with the design and reporting of the original trials. In Guinea-Bissau randomization occurred at discharge, and post-randomization care was not provided by study staff(2). In the Indian trial, the authors explicitly state that it is unlikely that the lack of blinding influenced the result. In previous open label randomized trials of BCG Russian strain in the same sites, no difference in neonatal mortality was found, which suggests that the lack of blinding did not bias the findings(3).

      Given these safeguards, attributing early effects to bias is unsupported by trial evidence and suggests that the original studies were not carefully read or adequately considered.

      8. Ignoring extensive mechanistic evidence for rapid BCG effectsThe authors further imply that effects within days are biologically implausible. This overlooks a substantial body of experimental and clinical evidence demonstrating that BCG induces trained innate immunity, including rapid functional reprogramming of myeloid cells and emergency granulopoiesis, which can occur within days and protect against severe infections such as sepsis(4, 5). These mechanisms provide a biologically coherent explanation for early effects and should have been discussed as plausible alternatives to bias.

      9. Failure to engage with established explanations for heterogeneous measles vaccine effectsThe manuscript notes heterogeneity across measles vaccine trials but does not engage with recent work offering compelling explanations for these differences, including interactions with OPV campaigns and vaccination sequence effects(6). Ignoring this literature leads to an oversimplified interpretation in which heterogeneity is treated primarily as noise rather than as potentially informative signal.

      10. Introduction of an a posteriori unifying hypothesisLate in the discussion, the authors invoke a new hypothesis that all live-attenuated vaccines should yield similar NSEs on all-cause mortality. This hypothesis appears post hoc and is not clearly justified biologically. It has never been a hypothesis within the NSE field and is biologically implausible, not least because baseline mortality differs substantially by age. Introducing this assumption only after the pooled analysis further weakens the inferential logic of the paper.

      Overall assessmentThe manuscript raises an important question, but its conclusions are undermined by:<br /> • an artificially restricted scope,<br /> • incomplete inclusion of relevant RCTs,<br /> • inappropriate pooling across fundamentally different interventions,<br /> • speculative dismissal of biologically plausible findings,<br /> • and inconsistent use of hypotheses introduced after the analysis.<br /> As currently written, the preprint does not provide a reliable basis for concluding that NSEs of vaccines on all-cause mortality are absent or unimportant. A substantially revised analysis - grounded in a comprehensive evidence base, clearer causal questions, and vaccine-specific syntheses - would be required to support such claims.

      References1. Higgins JP, Soares-Weiser K, Lopez-Lopez JA, Kakourou A, Chaplin K, Christensen H, et al. Association of BCG, DTP, and measles containing vaccines with childhood mortality: systematic review. BMJ. 2016;355:i5170.<br /> 2. Biering-Sorensen S, Aaby P, Lund N, Monteiro I, Jensen KJ, Eriksen HB, et al. Early BCG-Denmark and Neonatal Mortality Among Infants Weighing <2500 g: A Randomized Controlled Trial. Clin Infect Dis. 2017;65(7):1183-90.<br /> 3. Adhisivam B, Kamalarathnam C, Bhat BV, Jayaraman K, Namachivayam SP, Shann F, et al. Effect of BCG Danish and oral polio vaccine on neonatal mortality in newborn babies weighing less than 2000 g in India: multicentre open label randomised controlled trial (BLOW2). BMJ. 2025;390:e084745.<br /> 4. Kleinnijenhuis J, Quintin J, Preijers F, Joosten LA, Ifrim DC, Saeed S, et al. Bacille Calmette-Guerin induces NOD2-dependent nonspecific protection from reinfection via epigenetic reprogramming of monocytes. Proc Natl Acad Sci U S A. 2012;109(43):17537-42.<br /> 5. Brook B, Harbeson DJ, Shannon CP, Cai B, He D, Ben-Othman R, et al. BCG vaccination-induced emergency granulopoiesis provides rapid protection from neonatal sepsis. Sci Transl Med. 2020;12(542):eaax4517.<br /> 6. Nielsen S, Fisker AB, da Silva I, Byberg S, Biering-Sørensen S, Balé C, et al. Effect of early two-dose measles vaccination on childhood mortality and modification by maternal measles antibody in Guinea-Bissau, West Africa: A single-centre open-label randomised controlled trial. EClinicalMedicine. 2022;49:101467.

    1. On 2025-12-19 17:14:16, user Cesar Ugarte wrote:

      The preprint by Fajardo et al. addresses an important evidence gap regarding the utility of combined antigen–antibody tests for detecting acute HIV infection. Although the authors adopt a valuable global perspective, the interpretation and synthesis of the data would benefit from greater nuance to enhance clinical relevance. The authors' QUADAS-2 assessment shows High Risk of Bias regarding patient selection and Unclear Risk regarding the conduct of the index test. In diagnostic epidemiology, such findings are not just descriptive but also signal a huge spectrum effect and possible threshold bias. Therefore, the summary estimates presented in Figures 3 and 4 may reflect a statistical average of disparate clinical realities rather than a reliable indicator of test performance (for example in Figure 3 there are 10 studies with a sensitivity less than 10%, including some with 0%, so the evaluation in detail of these studies should be done to see if these studies can be combined with the other ones). Another issue is the inclusion of "obsolete" diagnostic platforms that have been withdrawn due to suboptimal performance. A sensitivity analysis or subgroup stratification should be restricted to tests currently on the market. This would enable the reader to distinguish between the historical evolution of the technology and the expected performance in contemporary clinical practice.

      The interpretation of diagnostic performance also should be addressed in detail. Whereas sensitivity and specificity have usually been considered "intrinsic" to a test (so doesn´t depends on disease prevalence), evidence suggests significant variation across clinical settings. The underlying epidemiological status and operator expertise can affect the test’s accuracy. Finally, I agree with the authors that real-world evidence on cost-effectiveness and implementation barriers is lacking. However, we should be very careful to avoid having a biased meta-analytic estimate that leads to the premature abandonment of "imperfect" but viable diagnostic solutions. In the case of acute HIV infection, for which early detection is critical to ART initiation and reduction of secondary transmission, interpretation of this evidence needs to balance statistical rigor against the urgent public health need for early diagnosis.

    Annotators

    1. Creativity and Innovation – Students are able to demonstrate creative thinking, constructknowledge, and develop innovative products and processes using technology.2. Communication and Collaboration - Students are able to use digital media and environmentsto communicate and work collaboratively, including at a distance, to support individuallearning and contribute to the learning of others.3. Research and Information Fluency - Students are able to apply digital tools to gather,evaluate, and use information.4. Critical Thinking, Problem Solving, and Decision-Making - Students are able to use criticalthinking skills to plan and conduct research, manage projects, solve problems and makeinformed decisions using appropriate digital tools and resources.5. Digital Citizenship - Students are able to understand human, cultural, and societal issuesrelated to technology and practice legal and ethical behavior.

      I think this article is very helpful if you are learning about digital literacy. The examples used really helps to understand what we are expected to be able to do. This article solidified the fact that I think it is very important to be prepared in digital literacy.

    1. A GIS-based earthquake damage assessment and settlement methodology

      这篇论文的核心内容可以概括为:


      一、研究目的

      提出一种基于GIS的地震损失评估方法,用于在地震发生前或发生后,快速评估城市中:

      • 建筑损坏情况
      • 人员伤亡
      • 道路阻塞情况 并辅助灾害应急与城市规划决策。

      二、主要方法

      论文构建了一个综合评估流程,包括:

      1. 地震强度计算 利用震级、断层距离和土壤条件,计算每个建筑位置的地震烈度(MMI)。

      2. 建筑损伤评估 通过fragility曲线(脆弱性曲线),将地震烈度转换为建筑损坏比例。

      3. 人员伤亡估计 根据建筑损伤程度、人口分布和建筑类型,计算死亡和受伤人数。

      4. 道路阻塞分析 根据倒塌建筑产生的废墟体积,评估道路被阻塞的程度。

      5. 避难可达性分析 判断哪些建筑在震后无法通过道路到达临时避难点。


      三、案例研究

      以Tehran某区域为例进行模拟:

      • 预测结果显示:

      • 建筑损坏最高可达约64%

      • 死亡人数约占33%
      • 严重受伤约27%
      • 约22%的道路完全阻塞

      四、验证与结论

      • 使用2003 Bam earthquake进行验证,模型结果略偏保守(偏高估损失)
      • 结论:

      • GIS方法能够有效整合多源数据进行地震损失评估

      • 该模型可用于:

        • 灾前风险评估
        • 城市规划(加固建筑、拓宽道路)
        • 灾后应急救援规划

      一句话总结

      这篇论文提出了一套基于GIS的地震损失评估模型,通过“地震强度 → 建筑损伤 → 人员伤亡 → 道路阻塞”的链式分析,实现城市级灾害风险预测与决策支持。

  3. inst-fs-iad-prod.inscloudgate.net inst-fs-iad-prod.inscloudgate.net
    1. Such environments may give players the disorientingand somewhat anaesthetizing sense that this could be anywhere or nowhereat all, conveniently overlooking ecological concerns with the finite charac-ter of the natural world and entropic limitations on energy and throughput,or carrying capacity (chapter 4). In an era of widespread anxiety over climatechange, increasingly scarce fuel reserves, and population control, it shouldcome as no surprise that an especially popular recourse is the abstract, ever-receding pastoral ideal that Raymond Williams once derisively called “ababble of green fields” (in a nod to Shakespeare),10 which lurks in all themedieval and pre- or alter-industrial lands of games like Blizzard’s World ofWarcraft or Nintendo’s Legend of Zelda series, and rather explicitly in themultitudes of crop-management games like Harvest Moon (1996) and Farm-Ville. Ecological specificity and accuracy are neither necessary nor sufficientcriteria for successful commercial games, but when we measure games asinstruments of public knowledge, it suddenly becomes worthwhile to makegames that are more meaningfully local, which take seriously the goal of envi-ronmental realism—not solely in terms of visual rendering, but also in sounddesign, weather, species density and distribution, and the arrangement oforganic and inorganic actors in complex interrelation

      Crazy... like free to play white. You can also check soundscape studies from games like Fortnite in Playthrough Poetics, and it's crazy how little nature embeds the area. Compare that to Red Dead 2, Rain World, Flow, Death Stranding 2, Proteus, Caves of Qud, Everything, or Kenshi

    1. Jelle Jolles onderscheidt in Het tienerbrein. Over de adolescent tussen biologie en omgeving de volgende executieve functies: Filteren Organiseren van aandacht Impulsremming Nieuwsgierigheid en initiatief nemen Werkgeheugen Doelgerichtheid Gedrags-, motorische en cognitieve flexibiliteit Planmatig handelen Kiezen en beslissen Zelfinzicht Zelfregulatie Metacognitie Monitoring Empathie en perspectiefname Motivatie

      1 filteren, 2 aandacht, 4 nieuwsgierigheid, 6 doelgerichtheid, 8/9 planmatig handelen, kiezen/beslissen, 12 metacognitie, 15 motivatie

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Figure 1D: It would be useful to indicate the number of embryos analyzed for these experiments (n = ?).

      Number of embryos now included in figure legend

      Figure 3B: The control condition for gcl⁻/⁻; ras-RNAi is labeled as "EV". This terminology (presumably "empty vector") is not defined in either the text or the figure legend. In addition, the magenta channel for the Ras-G37 condition appears to be flipped horizontally.

      We replaced with “-“ in figure and figure legend

      Page 7: The text states that "Ras-C40 activates the PI3K pathway," whereas the figure depicts Ras-C40 as activating the RalA pathway. This discrepancy could be confusing for the reader and should be corrected.

      The diagram has been corrected

      Figures 4 and 5: To facilitate interpretation, it may be helpful to include a schematic of the PI3K complex indicating the different subunits used in the study, along with information (potentially color-coded) about whether each construct primarily acts as an activator or inhibitor of PI3K function.

      Figure 4E and Figure 5E were added

      Figure 4A and 4B: For clarity and consistency with the text, the panels (and corresponding plots) for dp110-WT and dp110-CAAX could be placed before those for dp110-D954A and dp110-ΔRBD.

      Order of constructs was rearranged

      Figure 5C: The term "p60-TCEp3," which appears to correspond to the germ plasm-targeted p60-WT construct, is not defined in either the figure legend or the main text.

      Clarification was added to the text (p.11, line 225)

      Page 12: The reference "(Fig. S1A, Movie 1)" should be corrected to "(Fig. S2A, Movie 1)."

      Corrected

      Page 13: There is a missing word in the sentence "the biosensor appeared to be enrich to...", which should be corrected to "enriched."

      Corrected

      Figure 7A: Although the data presented are interesting and ultimately support the authors' conclusion that Torso regulates PIP3 levels, the results are somewhat counter-intuitive and may be confusing for readers. The authors might consider moving this panel to the Supplementary Figures. In addition, it could be informative to include PIP3 measurements for gcl⁻/⁻ (and possibly gcl⁺/⁻) pole buds in Figure 7B, as PIP3 appears particularly enriched in these conditions compared to wild type.

      We agree that at first the findings in the early embryos were confusing, but we prefer including them in the main figure to demonstrate changes in PIP 3 distributions in torso mutants. We are now providing a possible explanation for these findings (p13 line 270-). The differences are quite clear in the older embryos and measurements shown in 7B-D. Pole bud measurements for gcl-/- and gcl+/- are shown in figure 6 E-G.

      Reviewer #2

      Fig. legends to 1C and 1D are swapped.

      Corrected

      Why is csw not necessary for PGC formation? It acts upstream of Ras. This is not discussed.

      We now highlight this point in the text (and refer to studies on the sevenless kinase, which suggested a similar position of Csw parallel or downstream of Ras (page 6 line 107-).

      Fig 3C. Consider changing the order of the ras-variants used: S35, G37, C40 instead of S35, C40, G37.

      We changed the schematic in Figure 3C that should make the order of Ras variants more intuitive.

      Fig 4A, B: Consider changing the order of the panels. Control, dp110-wt, dp110-CAAX, dp110-D954A, dp110-deltaRBD.

      Order of constructs was rearranged

      Fig S4 is mentioned in the text before S2 and S3. Consider changing the suppl. figure order.

      Order of supplementary figures was rearranged

      Page 12: Fig S1 A does not show PIP2 dynamics. Movie 1 is not available to this reviewer. The authors most likely refer to fig. S2.

      Movie 1 was uploaded and figure calls were corrected

      Page 13, 1st para: Why do the authors use glc heterozygous embryos to look at PIP3 and PIP2? Particularly so when they report later in the MS that glc+/- behave differently to wt controls in terms of PIP3 levels (Fig. 7C). By looking at gcl+/+, they might find that now PIP2 levels are different in gcl mutant embryos or that the differences between PIP3 levels in +/+ and -/- are larger than compared with +/-.

      Since gcl+/- embryos form the same number of PGCs as WT but show a statistically significant increase in PI3K activity when comparing membrane to cytoplasm staining intensity, we favor using gcl+/- embryos, as these embryos may represent a more sensitive test for PIP2 and PIP3 levels.

      Pages 15 and 16: revise figure calls in the text.

      Figure calls were revised

      M+M: How were gcl+/- and gcl-/- embryos identified?

      Since all genetic manipulations in this alter the maternal contribution to the embryo, we us the term ‘mutant’ embryos referring to the maternal genotype (indicated on page 3 line 33 and more clearly stated in material and methods and reagent table). Embryos derived from mother of a specific maternal genotype are all identical, thus we can easily distinguish between embryos derived from homozygous mutant mothers (gcl-/-) or heterozygous mutant mothers (gcl-/+) In the reagents table we include the precise genotype description. “CyO” refers to the balancer chromosome commonly used to identify heterozygotes on the second chromosome. Flies with the CyO balancer have curly wings.

      Reviewer #3

      Figure 1B: The authors describe that embryos with OptoSos still form buds which protruded from the cortex, but PGCs largely fail to cellularize (described in pg. 5). I'm not sure what they meant by "fail to cellularize" as this is not obvious to me when looking at the figure. The authors should describe how they know it's cellularized in the controls and not in the OptoSos or change the wording to "suggesting a failure to cellularize".

      We used the word ‘protruded’ to describe our live observations. PGCs were quantified in fixed embryos, immunostained with anti-Vasa antibody to count Vasa positive cells (Fig 1C and D. We observe a lack of Vasa-positive PGCs, only in the light-activated OptoSos condition.

      Fig. 1B, lines 4-5: at what stage are these embryos? Cycle 9? Cycle 14? Both?

      Nuclear cycles of embryos for each panel are noted on the left side of each panel

      Fig. 4A: add dp110-CAAX results to Results section

      dp110-CAAX results are included in the Results section (p.9. line 177)

      Figure 5C: The hyper-clustered phenotype they describe is hard to visualize in this figure (described in pg. 11). The authors should describe what is meant by "hyper-clustered".

      We agree and re-worded the description of this observation to be clearer, page 11, line 226-.

      Figure 7: When comparing Fig. 7A and 7B torsoHH/WK images, we can see that in Fig. 7A that PIP3 pattern changes such that PIP3 is now at the most posterior end where PGC will eventually form (compared to control that has low PIP3 in this region), but then in Fig. 7B they are looking at the buds and they say PIP3 levels decrease, which does not correspond to Fig. 7A. Are these simply different stages and PIP3 levels change over time (looking at Fig. 7C, PIP3 does not seem to change a lot over time)?

      The figure legend now states more clearly that embryos were of different ages. We also explain in the text the apparent discrepancy in the patterns before and during budding (page13 line 266). The time points in figure 7C span nuclear cycle 10, not earlier (page14 line 274). By measuring membrane to cytoplasmic distribution, a more accurate comparison is possible at this stage.

      p. 5, line 5: "Optosos" is written "OptoSos" elsewhere (suggest using OptoSos throughout)

      Corrected

      Is it possible that inhibition of myosin II recruitment is due to conversion of PIP2 -> PIP3, thus loss of PIP2, or is it that myosin is specifically recruited to regions where PIP2 is high? This seems like a point that should be added to the discussion.

      This point is now discussed on page 20, line 403

      p. 5, line 6: suggest adding a comma after "Ras" for clarity

      Corrected

      p. 5, last line: the genotype is "w^1118" (with ^ indicating a superscript), not "w^-1118", and is italicized (this should be corrected throughout)

      Corrected

      p. 6, line 2: replace "cellularizing" with "cellularization"

      Corrected

      p. 6, lines 11-13: Where is it shown that knockdown of csw, dsor1 and rolled did not restore PGC formation? The data are not present in Fig. 2C (could include in supp fig?)

      We added these data as Supplementary figure 1

      p. 7, line 1: replace "interfere" with "interferes"

      Corrected

      p. 7, last three lines: what is stated here, "Ras-G37 [activates] both the RalA and the PI3K pathways, and Ras-C40 activates the PI3K pathway" is not consistent with what is diagrammed in Fig. 3C, where Ras-C40 is indicated as activating RalA (please correct either the text or the diagram)

      We apologize and corrected the figure

      p. 11, lines 1-2: the Pi3K21B gene and transcript should be italicized (note that Pi3K21B is the official gene name on FlyBase)

      Gene name was italicized

      p. 11, lines 6-10: it might be helpful to explain how the p60 construct was overexpressed (current lines 9-10) before describing the results (current lines 7-8)

      Clarification on p60 construct was added to p.11, line 215-

      p. 12, paragraph 2, line 2: the PIP2 biosensor should be written as "PLCgamma[PH]:mCherry" throughout, not "PLCy[PH]:mCherry"; this should be changed in the figures as well as the text (Symbol font can be used to turn "g" into lower-case "gamma", both in Word and in Illustrator)

      Gamma symbol was added

      It would also be helpful to show the overlap of the PIP2 and PIP3 signals in control vs. gcl mutants at different stages so the relative distribution and intensity of the signals can be better appreciated (consider adding this as a supplementary figure).

      Our data show that PIP2 is not affected by lack of GCL (Fig 6 B-D). We thus do not think that simultaneous imaging of PIP2 and PIP3 in gcl-/- would add to our conclusions. Furthermore, these experiments would require a significant time investment to generate the respective genotypes. Thus, we agree with the reviewer that this is experiment is beyond the scope of the paper.

      p. 12, paragraph 2, line 3: it does not appear that the two PIP markers were used "simultaneously" in Fig. 6A; however, this is evident from Fig. S2 and Movie 1 (consider placing callouts to these earlier in the paragraph or moving the description of simultaneous expression and observation of the two markers later in the paragraph to avoid confusion)

      We did simultaneously image PIP2 and PIP3 sensors and have added this as Movie 1 and also in supplementary Figure S4, which are now clearly referred to in the text.

      p. 12, paragraph 2, line 7: replace "Fig. S1A" with "Fig. S2" (this was confusing)

      Figure call was updated

      p. 16: change "Fig. 7G-I" to "Fig. 8G-I"

      Figure call was updated

      p. 20, Deming reference: there appears to be a stray asterisk in the title

      Asterisk was removed from reference

      Fig. 1D: need to explain that the colors in the graph indicate the numbers of PGCs formed (this could also be added as a label across the top of the graph); in addition, the number of embryos examined for each genotype should be included in the legend

      We added a label at the top of the graph and ‘n’ were added to figure legend

      Fig. 2B: spell out where csw, dsor1 and rolled data are shown; also, "n" is not defined; was this the number of embryos per genotype?

      We added these data as Supplemental Figure 1

      Fig. 3B: "EV" should be defined in the legend; is this "empty vector"?

      We are using a “-“ to mark controls without transgene

      Fig. 3C: see previous comment re: mistake in the diagram; I believe Ras-C40 was described as activating PI3K, not RalA

      We apologize and corrected the figure

      Fig. 4B, line 2: was the graph plotted from the data in panel (C) or panel (A)? panel (A) seems more likely, because the data in C is plotted in D; please correct the panel callout

      Figure legend was updated to refer to the correct panel

      Fig. 5C: describe "p60-TCEp3" in the legend

      We added germplasm-targeting 3’UTR (TCEp3) to legend and the construct and reference are provided in Material and Methods section

      Figure 6: In Fig. 6E-G, the "brightness" of PIP3 at the membrane corresponds to the images even with different views (posterior and orthogonal) and agrees with the graph.

      However, when looking at Fig. 6B, it looks to me that PIP2 is brighter in gcl+/-, but the opposite is true when looking at Fig. 6D (i.e., PIP2 looks brighter in gcl-/-). The authors might want to comment on this.

      We have updated the figure to better reflect our observations.

      Fig. 6A: define "(fire)" here or in the first figure legend where this is used

      We added an inset for the fire lookup table to clearly define the pseudcolor scheme used in the image

      Figure 8 title: "Actin fluorescence is increased in gcl-/- pole buds",But their graph in Fig. 8B comparing actin in gcl+/- to -/- is not significant

      Thanks for catching our mistake, myosin not actin is changed

      Fig. 8I: replace "Scarlett" with "Scarlet"

      Corrected

      Fig. 8D-F: Although the plots in panel E agree with the images in panel D, it is unclear why those in panel F are not more concordant. In F, myosin appears enriched at the cortex relative to the cytoplasm in gcl-/- mutants, which is hard to reconcile with the data in D-E.

      We have updated the figure to better reflect our observations.

      Fig. S2A: define the three time points shown here, and clarify that these are shown left to right (if this is indeed the case)

      We removed S2A and updated the movie to replace it

      Fig. S4: change "P60" to "p60" in the figure title

      Corrected

      Movie: The movies showing PIP2 and PIP3 in whole embryos are nice, but it would also be helpful to also include merged images of the two channels, so the reader can examine the relative accumulation of the two PIPs over time.

      Merged images panel was added to the movie.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Although Torso is known to antagonize primordial germ cell (PGC) formation, the underlying mechanisms remain unclear. Canonical Torso signalling typically results in activation of Ras. However, the authors show that Ras-mediated suppression of PGC formation is independent of the Raf/MEK/ERK pathway. Instead, they uncover an unexpected role for Torso in activating phosphoinositide 3-kinase (PI3K) that promotes formation of PIP3 enriched posterior membrane domains. The resulting increase in PI3K activity disrupts PGC formation. Furthermore, they show that by promoting Torso degradation, the ubiquitin ligase adaptor Germ Cell-Less (GCL) primes the posterior membrane with reduced PIP3 to facilitate PGC formation. Lastly, the authors suggest a model where antagonistic relationship between GCL and Torso influences actomyosin contractility that may allow the bud to constrict for proper PGC formation.

      Major comments:

      Figure 1B: The authors describe that embryos with OptoSos still form buds which protruded from the cortex, but PGCs largely fail to cellularize (described in pg. 5). I'm not sure what they meant by "fail to cellularize" as this is not obvious to me when looking at the figure. The authors should describe how they know it's cellularized in the controls and not in the OptoSos or change the wording to "suggesting a failure to cellularize".

      Figure 5C: The hyper-clustered phenotype they describe is hard to visualize in this figure (described in pg. 11). The authors should describe what is meant by "hyper-clustered".

      Figure 6: In Fig. 6E-G, the "brightness" of PIP3 at the membrane corresponds to the images even with different views (posterior and orthogonal) and agrees with the graph. However, when looking at Fig. 6B, it looks to me that PIP2 is brighter in gcl+/-, but the opposite is true when looking at Fig. 6D (i.e., PIP2 looks brighter in gcl-/-). The authors might want to comment on this.

      It would also be helpful to show the overlap of the PIP2 and PIP3 signals in control vs. gcl mutants at different stages so the relative distribution and intensity of the signals can be better appreciated (consider adding this as a supplementary figure).

      Figure 7: When comparing Fig. 7A and 7B torsoHH/WK images, we can see that in Fig. 7A that PIP3 pattern changes such that PIP3 is now at the most posterior end where PGC will eventually form (compared to control that has low PIP3 in this region), but then in Fig. 7B they are looking at the buds and they say PIP3 levels decrease, which does not correspond to Fig. 7A. Are these simply different stages and PIP3 levels change over time (looking at Fig. 7C, PIP3 does not seem to change a lot over time)?

      Page 15, last paragraph: "If myosin II recruitment is inhibited when PIP3 levels are high" Is it possible that inhibition of myosin II recruitment is due to conversion of PIP2 -> PIP3, thus loss of PIP2, or is it that myosin is specifically recruited to regions where PIP2 is high? This seems like a point that should be added to the discussion.

      Overall, I think their claim that antagonistic activities of GCL and Torso is crucial for PGC formation is well justified. The combination of optogenetic tools with activation and lof mutants is nicely done. Some clarification regarding the PIP3 and PIP2 levels will be helpful to the reader (see my comments above). The myosin claim is less convincing (see my comment on Fig. 8D-F below).

      Minor comments on the text:

      p. 5, line 5: "Optosos" is written "OptoSos" elsewhere (suggest using OptoSos throughout) p. 5, line 6: suggest adding a comma after "Ras" for clarity p. 5, last line: the genotype is "w^1118" (with ^ indicating a superscript), not "w^-1118", and is italicized (this should be corrected throughout) p. 6, line 2: replace "cellularizing" with "cellularization" p. 6, lines 11-13: Where is it shown that knockdown of csw, dsor1 and rolled did not restore PGC formation? The data are not present in Fig. 2C (could include in supp fig?) p. 7, line 1: replace "interfere" with "interferes" p. 7, last three lines: what is stated here, "Ras-G37 [activates] both the RalA and the PI3K pathways, and Ras-C40 activates the PI3K pathway" is not consistent with what is diagrammed in Fig. 3C, where Ras-C40 is indicated as activating RalA (please correct either the text or the diagram) p. 11, lines 1-2: the Pi3K21B gene and transcript should be italicized (note that Pi3K21B is the official gene name on FlyBase) p. 11, lines 6-10: it might be helpful to explain how the p60 construct was overexpressed (current lines 9-10) before describing the results (current lines 7-8) p. 12, paragraph 2, line 2: the PIP2 biosensor should be written as "PLCgamma[PH]:mCherry" throughout, not "PLCy[PH]:mCherry"; this should be changed in the figures as well as the text (Symbol font can be used to turn "g" into lower-case "gamma", both in Word and in Illustrator) p. 12, paragraph 2, line 3: it does not appear that the two PIP markers were used "simultaneously" in Fig. 6A; however, this is evident from Fig. S2 and Movie 1 (consider placing callouts to these earlier in the paragraph or moving the description of simultaneous expression and observation of the two markers later in the paragraph to avoid confusion) p. 12, paragraph 2, line 7: replace "Fig. S1A" with "Fig. S2" (this was confusing) p. 16: change "Fig. 7G-I" to "Fig. 8G-I" p. 20, Deming reference: there appears to be a stray asterisk in the title

      Minor comments on the figures and figure legends:

      Fig. 1B, lines 4-5: at what stage are these embryos? Cycle 9? Cycle 14? Both? Fig. 1C: see previous comment about "w^1118" genotype nomenclature Fig. 1D: need to explain that the colors in the graph indicate the numbers of PGCs formed (this could also be added as a label across the top of the graph); in addition, the number of embryos examined for each genotype should be included in the legend Fig. 2B: spell out where csw, dsor1 and rolled data are shown; also, "n" is not defined; was this the number of embryos per genotype? Fig. 3B: "EV" should be defined in the legend; is this "empty vector"? Fig. 3C: see previous comment re: mistake in the diagram; I believe Ras-C40 was described as activating PI3K, not RalA Fig. 3E: fix "w^1118" as described above Fig. 4A: add dp110-CAAX results to Results section Fig. 4B, line 2: was the graph plotted from the data in panel (C) or panel (A)? panel (A) seems more likely, because the data in C is plotted in D; please correct the panel callout Fig. 5C: describe "p60-TCEp3" in the legend Fig. 6A: define "(fire)" here or in the first figure legend where this is used Figure 8 title: "Actin fluorescence is increased in gcl-/- pole buds",But their graph in Fig. 8B comparing actin in gcl+/- to -/- is not significant Fig. 8D-F: Although the plots in panel E agree with the images in panel D, it is unclear why those in panel F are not more concordant. In F, myosin appears enriched at the cortex relative to the cytoplasm in gcl-/- mutants, which is hard to reconcile with the data in D-E. Fig. 8I: replace "Scarlett" with "Scarlet" Fig. S2A: define the three time points shown here, and clarify that these are shown left to right (if this is indeed the case) Fig. S4: change "P60" to "p60" in the figure title

      Movie: The movies showing PIP2 and PIP3 in whole embryos are nice, but it would also be helpful to also include merged images of the two channels, so the reader can examine the relative accumulation of the two PIPs over time.

      Referees cross-commenting

      I agree enthusiastically with the comments of the other reviewers, who often came to the same conclusion I did about the manuscript and the data, including some of the detailed points about the figures, etc.

      Significance

      General assessment:

      The many strengths of this manuscript include elegant genetic and optogenetic approaches using well-designed transgenes.

      The main weakness is the lack of experiments showing simultaneous live imaging of the PIP2 and PIP3 sensors in gcl-/- and other genetic backgrounds, which would help the reader better envision how regulators of this pathway affect phospholipid distribution at the level of whole embryos and prospective pole cells. Note that because of the time required, I do not insist that they do this.

      Advance:

      Study demonstrates for the first time an unexpected role of Torso in PI3K regulation

      Audience:

      germ cell afficionados, developmental biologists, cell biologists, PI3K researchers

      My field of expertise:

      Drosophila, germ cell development, genetics, cell biology, live imaging, phosphoinositides

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is an outstanding and elegant study that addresses an important question in developmental and germline biology: how the soma-germline boundary is established during embryogenesis. This represents one of the most fundamental cell-fate decisions during organismal development. The authors combine elegant genetics, optogenetics, quantitative live imaging, and lipid biosensors to provide a compelling mechanistic framework linking receptor degradation, lipid signaling, and cytoskeletal dynamics. They show that Torso signaling via PI3K and PIP3 antagonizes primordial germ cell (PGC) formation and promotes somatic cell fate. Furthermore, they demonstrate that GCL-mediated degradation of Torso at the posterior pole creates a PIP3-low membrane domain that permits myosin II recruitment and pole bud constriction, thereby enabling PGC formation. Together, these results clearly demonstrate how the soma-germline boundary is established.

      We have only minor comments on the manuscript, primarily aimed at improving clarity for non-specialist readers:

      1. Figure 1D: It would be useful to indicate the number of embryos analyzed for these experiments (n = ?).
      2. Figure 3B: The control condition for gcl⁻/⁻; ras-RNAi is labeled as "EV". This terminology (presumably "empty vector") is not defined in either the text or the figure legend. In addition, the magenta channel for the Ras-G37 condition appears to be flipped horizontally.
      3. Page 7: The text states that "Ras-C40 activates the PI3K pathway," whereas the figure depicts Ras-C40 as activating the RalA pathway. This discrepancy could be confusing for the reader and should be corrected.
      4. Figures 4 and 5: To facilitate interpretation, it may be helpful to include a schematic of the PI3K complex indicating the different subunits used in the study, along with information (potentially color-coded) about whether each construct primarily acts as an activator or inhibitor of PI3K function.
      5. Figure 4A and 4B: For clarity and consistency with the text, the panels (and corresponding plots) for dp110-WT and dp110-CAAX could be placed before those for dp110-D954A and dp110-ΔRBD.
      6. Figure 5C: The term "p60-TCEp3," which appears to correspond to the germ plasm-targeted p60-WT construct, is not defined in either the figure legend or the main text.
      7. Page 12: The reference "(Fig. S1A, Movie 1)" should be corrected to "(Fig. S2A, Movie 1)."
      8. Page 13: There is a missing word in the sentence "the biosensor appeared to be enrich to...", which should be corrected to "enriched."
      9. Figure 7A: Although the data presented are interesting and ultimately support the authors' conclusion that Torso regulates PIP3 levels, the results are somewhat counter-intuitive and may be confusing for readers. The authors might consider moving this panel to the Supplementary Figures. In addition, it could be informative to include PIP3 measurements for gcl⁻/⁻ (and possibly gcl⁺/⁻) pole buds in Figure 7B, as PIP3 appears particularly enriched in these conditions compared to wild type.

      Significance

      The biological question is highly interesting, the experimental design is very clear, and the data are convincing throughout. The imaging, quantification, and movies are of very high quality and strongly support the authors' conclusions. Overall, this manuscript represents a significant conceptual and technical advance and will be of broad interest to the fields of germline biology, membrane biology, and embryonic morphogenesis.

    1. Multiple antibiotic classes are effective against both streptococci and staphylococci, with the optimal choice depending on whether methicillin-resistant Staphylococcus aureus (MRSA) is suspected and the severity of infection.

      For methicillin-susceptible strains, beta-lactam antibiotics remain the mainstay of treatment. These include penicillinase-resistant penicillins (dicloxacillin, oxacillin, nafcillin), first-generation cephalosporins (cephalexin, cefazolin), and amoxicillin-clavulanate, all of which provide excellent coverage against both streptococci and methicillin-susceptible staphylococci (MSSA). [1-4]

      Clindamycin is particularly valuable as it demonstrates activity against approximately 80% or more of community-associated MRSA strains while maintaining excellent activity against group A streptococcus, making it an attractive single-agent option for skin and soft tissue infections. [5] However, clindamycin resistance appears to be increasing. [5]

      For MRSA coverage with dual activity, several options exist. Linezolid and tedizolid (oxazolidinones) have activity against both MRSA and streptococci. [2][6-7] Vancomycin provides coverage against both organisms and has been the traditional mainstay for MRSA infections. [2][7] When MRSA is suspected but streptococcal coverage is also needed, combination therapy with trimethoprim-sulfamethoxazole (TMP-SMX) plus a beta-lactam (such as cephalexin or penicillin) is recommended, as TMP-SMX alone has poorly defined activity against streptococci. [1][5]

      Doxycycline and minocycycline are effective against most MRSA isolates but have less well-defined activity against group A streptococcus, so they may require combination with a beta-lactam when streptococcal infection cannot be excluded. [1][5]

      Would you like me to provide specific dosing recommendations for any of these antibiotics or discuss treatment duration for particular infections?

      1

    1. Urgent treatment for neoplasm consists of (1) cautious use of intravenous diuretics and (2) mediastinal irradiation, starting within 24 hours, with a treatment plan designed to give a high daily dose of radiation but a short total course of therapy to rapidly shrink the local tumor. Intensive radiation therapy combined with chemotherapy will palliate the process in up to 90% of patients. In patients with a subacute presentation, radiation therapy alone usually suffices. Chemotherapy is added if lymphoma or small-cell carcinoma is diagnosed

      endovascular stenting emerging as first-line therapy for rapid symptom relief, while definitive treatment targets the underlying cause

      Glucocorticoids (dexamethasone 4 mg every 6 hours) are commonly prescribed but lack robust supporting data; they may be more beneficial in lymphoma or thymoma and as prophylaxis against radiation-induced edema. [2-4] Importantly, SVC syndrome is no longer considered a medical emergency except in rare cases with life-threatening cerebral edema, laryngeal edema, or altered mental status. When thrombosis is present, catheter-directed thrombolysis or aspiration thrombectomy should be performed within 2-5 days of symptom onset before thrombus organization occurs. [3] The role of long-term anticoagulation after stenting remains unclear, though it is standard when significant thrombosis is present The American College of Chest Physicians recommends obtaining histologic diagnosis before treatment in suspected lung cancer cases, as stenting does not interfere with tissue diagnosis. [2] For small cell lung cancer (SCLC), chemotherapy alone is recommended as first-line treatment given rapid response rates. [2] For non-small cell lung cancer (NSCLC), radiation therapy and/or stent insertion are recommended, with response rates of 59% for chemotherapy and 63% for radiation therapy. [2] Patients with chemotherapy- or radiation-refractory disease should receive vascular stents For device-related thrombosis (catheters, pacemakers), catheter removal should be considered in conjunction with anticoagulation. [4] Endovascular therapy is first-line for device-related obstruction, while surgical bypass may be preferred for mediastinal fibrosis. [7] Both approaches show good mid-term patency, though secondary interventions are common (approximately 27-28%

    Tags

    Annotators

    URL

    1. Treatment of superficial vein reflux (see Varicose Veins, above) has been shown to decrease the recurrence rate of venous ulcers. Where there is substantial obstruction of the femoral or popliteal deep venous system, superficial varicosities supply the venous return and should not be removed.

      Failure of venous insufficiency ulcerations to heal is most often due to inconsistent use of first-line treatment methods. Ongoing control of edema is essential to prevent recurrent ulceration; the use of compression stockings following ulcer healing is critical, with recurrence rates 2–20 times higher if compression stockings are not used

      Duplex ultrasound evaluation should assess blood flow direction, venous reflux, and venous obstruction, and include examination of the deep venous system, great saphenous vein (GSV), small saphenous vein (SSV) and its thigh extension (Giacomini vein), accessory saphenous veins, and perforating veins. Venography is recommended primarily in patients with post-thrombotic disease, especially when intervention is planned, as it provides greater anatomic detail than duplex ultrasonograph The examination also identifies patterns of disease that have treatment implications. Axial reflux is defined as uninterrupted retrograde flow from groin to calf and can occur in either superficial or deep systems. [4] Junctional reflux is limited to the saphenofemoral or saphenopopliteal junction, while segmental reflux occurs in a portion of a truncal vein. [4] Understanding whether reflux originates from superficial junctions versus deep venous incompetence fundamentally changes treatment planning, as superficial disease is amenable to ablation while deep disease typically requires conservative management Management of secondary varicose veins from post-thrombotic syndrome (PTS) is fundamentally different and more challenging. Compression therapy, lifestyle modifications, and symptom management form the cornerstone of PTS treatment. [4-8] Elastic compression stockings (20-30 mm Hg), leg elevation, weight loss, and exercise constitute the primary therapeutic approach Endovascular interventions for PTS—including percutaneous transluminal venoplasty and stenting—are reserved for select patients with significant iliofemoral obstruction who have failed conservative management. [7] These procedures require careful patient selection and standardized criteria. The role of superficial venous ablation in PTS patients with concomitant superficial reflux remains controversial and should be approached cautiously, as the underlying deep venous pathology may limit benefit
    2. Primary varicose veins result from a complex multifactorial process involving genetic predisposition and environmental factors. The pathophysiology involves initial structural weakness within the vein wall leading to vein dilation, or valve incompetence causing blood pooling and subsequent vein dilation. [1][3] Risk factors include family history (autosomal dominant inheritance with variable penetrance), female sex, multiparity, pregnancy, prolonged standing, obesity, and advanced age. [2-3] If both parents have varicose veins, offspring have a 90% chance of developing them. [2]

      Secondary varicose veins develop through a distinct mechanism involving inflammation, thrombosis, and recanalization that results in venous wall damage, dilation, and valve insufficiency. [3] The clinical picture manifests as post-thrombotic syndrome, which can include pain, edema, skin changes, and venous leg ulcers. [3] The 2020 CEAP classification further subdivides secondary causes into intravenous (Esi) and extravenous (Ese) etiologies. [4] Intravenous causes include any condition causing venous wall or valve damage from within the lumen, such as DVT, traumatic arteriovenous fistulas, or primary intravenous sarcoma. Extravenous causes involve conditions affecting venous hemodynamics without direct wall or valve damage, such as central venous hypertension from obesity or heart failure, extrinsic compression from tumors or retroperitoneal fibrosis, or muscle pump dysfunction from paraplegia or arthritis

    Tags

    Annotators

    URL

    1. How to Kill the Code Review

      1. The Scaling Crisis (Why Reviews are "Dead")

      • Throughput vs. Cognition: The fundamental problem is a mismatch in speed. AI agents (like Cursor, Claude Code, or GitHub Copilot) can generate code at a rate 10x–100x faster than a human. If a developer uses an agent to clear 20 tickets in a morning, the human reviewer becomes a permanent, overwhelmed bottleneck.
      • The "Rubber Stamp" Failure: In high-volume AI environments, human reviewers stop actually reading the code and start "rubber-stamping" (approving without checking). This creates a false sense of security while letting "slop" (low-quality, redundant, or slightly buggy AI code) leak into the codebase.

      2. The Shift to "Spec-Driven Development"

      The author posits that the "checkpoint" for quality is moving "upstream." * Reviewing Intent, Not Implementation: Engineers will spend their time reviewing the Specification or the Prompt rather than the resulting lines of code. If the spec is correct and the constraints are tight, the implementation is treated as a disposable artifact. * Natural Language as the New Source Code: The high-level description of a feature becomes the primary source of truth. If a bug is found, the human doesn't "fix the code"; they "fix the spec" and have the agent regenerate the solution.

      3. Verification over Review

      The traditional manual "looking at code" is being replaced by automated verification loops: * Agentic Testing: Agents are now responsible for writing their own unit tests and integration tests. The human's job is to review the test plan and verify that the tests actually cover the business requirements. * Formal Methods & Type Safety: The article emphasizes using "bounded interfaces" (like Infrastructure-from-Code or strict APIs). When the architecture is strictly typed, the AI is physically unable to make certain classes of errors, reducing the need for human oversight of the "plumbing."

      4. The Emergence of "Cognitive Debt"

      A primary warning in the text is the rise of "Cognitive Debt"—a state where a codebase is so dominated by AI-generated logic that no single human understands how the entire system works. * Tech Debt vs. Cognitive Debt: While tech debt is "bad code that's hard to change," cognitive debt is "working code that is impossible to reason about." * The New Senior Role: Senior engineers must transition into "System Curators" who manage this debt by enforcing architectural simplicity and ensuring that AI-generated modules remain modular and replaceable.

      5. Future Prediction

      • 2025: The year human-written code became the minority.
      • 2026: The year the "Pull Request" as a human-to-human review ritual officially becomes obsolete in high-performing teams, replaced by automated agent-to-agent validation and human-to-spec oversight.
    1. Section 4 — Decommissioning for Non-Viability (a) If the commanding officer fails to remediate after warning, any EC member may move to decommission the vessel. (b) Decommissioning requires a motion, a second, and a majority vote of the EC. (c) Upon decommissioning, the crew is reassigned through the Fleet Placement Officer. The commanding officer’s rank and standing are not affected unless separate disciplinary action is taken. Section 5 — Removal of a Commanding Officer (a) The EC may remove a commanding officer for violations of the Constitution or its Bylaws, independent of ship viability. (b) Upon removal, the First Officer or ranking officer assumes temporary command under Bylaw 5, Section 2. (c) The EC may restrict the removed officer’s access to community channels during the proceedings. (d) Removal requires a motion, a second, and a majority vote of the EC.

      I suggest swapping sections four and five to better reflect a logical progression of actions. Reading the document sequentially, it currently implies that decommissioning is the first step following a good-faith warning.

      Decommissioning is quite a drastic action and has a massive knock-on effect for the whole fleet, not to mention the possible stress it may cause for the crew and the interrupting of an individual's progress towards command or any other goals they may have in the fleet.

      I think here, stability should be the priority, and I think the EC's default should be that the current section five comes first, with decommissioning happening only in the most extreme circumstances.

      I look at the Astraeus's decommissioning and from my perspective as a non-CC simmer at the time, I was left wondering why the first officer couldn't take over, or a new/experienced commanding officer couldn't be air dropped in like had been done with Denali.

    1. Document de Synthèse : Réalités, Mécanismes et Enjeux de la Violence Conjugale

      Résumé Exécutif

      Ce document analyse les témoignages et les interventions documentés au sein de l'association Flora Tristan, mettant en lumière la complexité systémique des violences conjugales.

      Les principaux points à retenir sont :

      • La cyclicité de la violence : Le processus de l'emprise repose sur un cycle répétitif (tension, crise, transfert de responsabilité, lune de miel) qui rend le départ des victimes extrêmement difficile (5 à 7 tentatives en moyenne).

      • L'impact sur les enfants : Les enfants ne sont pas de simples témoins mais des victimes directes, subissant des traumatismes durables qui affectent leur développement et leur sécurité.

      • Les défaillances institutionnelles : Le manque de places d'hébergement d'urgence, la précarité des solutions hôtelières (115) et l'accueil parfois inadéquat des forces de l'ordre constituent des obstacles majeurs à la mise en sécurité.

      • Le rôle vital des structures spécialisées : L'accompagnement pluridisciplinaire (juridique, psychologique, social) est indispensable pour permettre aux femmes de passer du statut de victime à celui de survivante et de reconstruire leur estime de soi.

      --------------------------------------------------------------------------------

      1. Les Mécanismes de l'Emprise et le Cycle de la Violence

      La violence conjugale ne se limite pas aux coups physiques ; elle s'inscrit dans une stratégie de manipulation et d'isolement.

      Le Cycle de la Violence

      Le document identifie quatre phases distinctes qui s'enchaînent de manière circulaire :

      • Phase de tension : Accumulation d'agressivité chez l'agresseur.

      • Phase de crise (Explosion) : Passage à l'acte violent (physique, verbal ou sexuel).

      • Phase de transfert de responsabilité : L'agresseur justifie ses actes en culpabilisant la victime ("C'est toi qui me rends violent").

      • Phase de lune de miel : L'agresseur demande pardon, se montre aimant et promet de changer.

      C'est durant cette phase que les victimes sont le plus enclines à retirer leur plainte, espérant une amélioration.

      Typologie des Violences

      Les interventions révèlent une multiplicité de formes de violence qui s'entrecroisent :

      • Psychologique et verbale : Insultes, dénigrement permanent, menaces de mort prises à la "rigolade".

      • Physique : Coups, strangulation, séquestration.

      • Sexuelle : Rapports imposés par le chantage ou la force physique (viols conjugaux souvent non identifiés comme tels par les victimes elles-mêmes).

      • Économique et Administrative : Contrôle des ressources, confiscation des documents, promesses fallacieuses (notamment dans le cadre de l'immigration).

      • Isolement : Interdiction de voir la famille, les amis, d'utiliser les réseaux sociaux ou même d'ouvrir la porte au gardien.

      --------------------------------------------------------------------------------

      2. L'Impact sur les Victimes : Entre Culpabilité et Honte

      Le traumatisme psychologique crée des barrières internes puissantes qui s'ajoutent aux menaces extérieures.

      | Sentiment | Description et Conséquences | | --- | --- | | Culpabilité | Les victimes se demandent souvent si elles sont responsables de la situation ou si elles auraient pu "sortir par la porte ouverte". | | Déni de statut | Difficulté à se reconnaître comme "victime", surtout pour les femmes se percevant comme ayant un caractère fort. | | Honte | Un obstacle majeur à la parole, particulièrement vis-à-vis de la famille ou de la société, exacerbé par le sentiment d'avoir échoué à "sauver" le couple. | | Banalisation | L'accumulation de petits incidents ("C'est pas grave") qui finissent par former une "montagne" de violences graves. |

      « On doit pas avoir honte, c'est à eux d'avoir honte. » – Citation d'une survivante lors d'une manifestation.

      --------------------------------------------------------------------------------

      3. Les Enfants : Victimes Directes et Enjeux de Protection

      Le document souligne que l'exposition à la violence conjugale est un traumatisme majeur pour les enfants, avec des conséquences à long terme.

      • Traumatismes visibles : Troubles du sommeil (insomnies, cauchemars sur la sécurité de la mère), agitation extrême, retards dans les apprentissages et anxiété profonde.

      • Mécanismes de survie : Certains enfants adoptent le comportement de l'agresseur pour tenter de se protéger eux-mêmes ou pour s'identifier à la figure de pouvoir.

      • Défaillances du système judiciaire :

        • Les juges privilégient souvent le maintien du lien parental (autorité parentale) au détriment de la sécurité réelle de l'enfant.
      • Les procédures sont extrêmement longues (reports de délibérés), prolongeant l'insécurité.

      • L'enfant est parfois utilisé comme une "arme" par l'agresseur pour continuer à détruire la mère après la séparation.

      --------------------------------------------------------------------------------

      4. Les Limites du Soutien Institutionnel et Social

      Malgré l'existence de structures comme Flora Tristan, le système global présente des failles critiques.

      Les Obstacles à la Plainte

      L'accueil au commissariat est décrit comme parfois "odieux".

      Les témoignages rapportent des policiers remettant en cause la parole des victimes, rappelant avec insistance la présomption d'innocence ou menaçant de sanctions en cas de "fausses déclarations", ce qui décourage le dépôt de plainte, notamment pour les violences sexuelles.

      La Crise de l'Hébergement

      • Saturations des structures : En une période donnée, 217 demandes d'hébergement d'urgence ont été refusées par une seule association faute de places.

      • L'échec du "115" (Hébergement hôtelier) :

        • Conditions insalubres (rats, punaises de lit).
      • Éloignement géographique (jusqu'à 2h30 des écoles ou du travail), mettant en péril l'emploi et la scolarité.

      • Insécurité chronique poussant certaines femmes à retourner au domicile conjugal par désespoir.

      --------------------------------------------------------------------------------

      5. Le Chemin vers la Reconstruction

      La sortie de la violence est un processus long qui nécessite une réappropriation de son corps et de son identité.

      • Sécurité et Cohabitation : Les structures comme Flora Tristan offrent un répit immédiat, bien que la cohabitation avec d'autres familles soit un défi supplémentaire.

      • Reconnexion au corps : Des activités comme la boxe professionnelle permettent de reprendre confiance en soi, d'extérioriser la colère et de transformer l'énergie négative en force positive.

      • Autonomie retrouvée : Le succès passe par la reprise d'études, la recherche d'un emploi stable et l'obtention d'un logement social pérenne (processus pouvant durer 18 mois à 2 ans).

      • Évolution de l'identité : Passer du statut exclusif de "mère protectrice" à celui de "femme libre" ayant droit à ses propres besoins et désirs.

      --------------------------------------------------------------------------------

      Conclusion : Un Appel à l'Action Politique

      Le personnel encadrant et les survivantes s'accordent sur le fait que la lutte contre les violences conjugales nécessite une volonté politique forte se traduisant par :

      • Un financement massif et pérenne de places d'hébergement spécialisées.

      • Une réforme de la justice pour mieux prendre en compte la parole des enfants et protéger les victimes des procédures abusives.

      • Une sensibilisation accrue pour supprimer la "violence étatique" qui s'ajoute à la violence conjugale lorsque les institutions échouent à protéger les plus vulnérables.

    1. Briefing : Lutte contre les manipulations de l’information et les ingérences numériques étrangères

      Résumé exécutif

      Ce document synthétise l'audition de Nicolas Roche, Secrétaire général de la défense et de la sécurité nationale (SGDSN), portant sur la stratégie nationale de lutte contre les manipulations de l'information (LMI) pour la période 2026-2030. Face à une menace caractérisée par une « persistance stratégique » et une mutation technologique rapide, la France a consolidé son dispositif opérationnel via le service Viginum.

      Les points clés de l'intervention soulignent :

      • L'évolution de la menace : Une activité permanente, hors périodes électorales, menée par des acteurs étatiques (Russie, Chine, Iran) et non étatiques (sphère MAGA), utilisant l'intelligence artificielle pour massifier la diffusion.- Le rôle de Viginum : Un service technique axé sur la détection des infrastructures d'amplification inauthentique, et non sur le contrôle des contenus (« ministère de la vérité »).- La stratégie 2026-2030 : Fondée sur quatre piliers, elle donne la priorité à la résilience de la nation par l'éducation et la transparence, plutôt qu'à une simple réponse régalienne.- Le cadre européen : La nécessité d'une mise en œuvre plus agressive du Règlement sur les services numériques (DSA) pour imposer une transparence algorithmique aux grandes plateformes.

      --------------------------------------------------------------------------------

      I. Analyse de la menace : Une triple mutation

      Le paysage des ingérences numériques étrangères (INE) a subi des transformations profondes selon trois axes majeurs :

      1. Paramètre stratégique : La persistance

      Les adversaires ne se limitent plus aux coups d'éclat lors des scrutins. Ils s'inscrivent dans une logique de continuité, cherchant à s'implanter durablement dans le débat numérique français en se faisant passer pour des acteurs authentiques. Les principaux acteurs identifiés incluent :

      • États : Russie, Chine, Iran, Azerbaïdjan (ce dernier ciblant spécifiquement les Outre-mer).- Acteurs non étatiques : Notamment la sphère MAGA.

      2. Paramètre technologique : L'impact de l'IA

      L'intelligence artificielle transforme l'économie de l'information :

      • Contenus synthétiques (Deepfakes) : Bien que spectaculaires, ils sont jugés moins préoccupants car plus faciles à détecter.- Massification et réplication : L'IA est surtout utilisée pour contourner la modération des plateformes par la technique du « copy-paste » modifié, permettant une diffusion massive et automatisée.- Pollution de modèles (Data poisoning) : Un sujet de recherche prospectif visant à biaiser les réponses des IA natives dès leur phase d'entraînement.

      3. Paramètre économique : L'économie de l'attention

      L'ingérence peut avoir une finalité strictement lucrative. La polarisation du débat génère des clics et donc des revenus publicitaires. L'émergence des influenceurs comme relais, volontaires ou non, complique davantage cet écosystème.

      --------------------------------------------------------------------------------

      II. Viginum : Un modèle technique et transparent

      Créé en 2021, Viginum est le service de référence pour la détection des INE. Nicolas Roche définit sa mission autour de critères stricts :

      | Critère | Description | | --- | --- | | Extranéité | Seuls les acteurs étrangers (étatiques ou non) sont ciblés. | | Inauthenticité | Focus sur l'amplification massive et délibérée via des infrastructures techniques. | | Gravité | L'attaque doit porter atteinte aux intérêts fondamentaux de la nation. | | Transparence | Service non-renseignement, soumis à la CNIL et à un comité éthique et scientifique. |

      Évolution du cadre juridique (Décret de février 2026)

      Le cadre juridique a été adapté pour répondre à l'évolution de la menace :

      • Abaisser le seuil de collecte : Suppression du seuil de 5 millions d'utilisateurs/jour pour permettre la surveillance de plateformes plus petites où les opérations débutent souvent.- Conservation des données : Allongement des durées pour suivre la construction des infrastructures numériques sur le long terme.

      --------------------------------------------------------------------------------

      III. Retour d'expérience : Élections municipales 2026

      Le dispositif de surveillance spécifique mis en place pour les municipales de 2026 a permis de tester la robustesse du modèle français.

      • Constat : Quatre opérations majeures d'ingérence détectées.- Impact : Jugé « assez faible » en termes de visibilité. Le rapport coût-bénéfice pour les adversaires a été médiocre.- Méthodologie : Publication de 10 bulletins d'information hebdomadaires pour éviter tout emballement médiatique et informer les citoyens en temps réel.- Gouvernance : Création d'un réseau de coordination incluant le SGDSN, le ministère de l'Intérieur, mais aussi des autorités indépendantes (ARCOM, Commission de contrôle du financement de la vie politique).

      --------------------------------------------------------------------------------

      IV. La Stratégie Nationale 2026-2030

      La stratégie repose sur quatre piliers fondamentaux, avec un changement de paradigme vers la société civile.

      Pilier 1 : La Résilience (Priorité absolue)

      L'État reconnaît qu'il ne peut pas lutter seul. La résilience passe par :

      • L'Académie de lutte contre les manipulations de l'information : Un projet pour 2026 visant à transférer l'expertise technique de Viginum vers les journalistes, les enseignants et les élus.- L'Éducation aux médias : Partenariat avec le Clémi et le ministère de l'Éducation nationale pour outiller les lycéens et collégiens.

      Pilier 2 : Régulation des plateformes et IA

      • Application du DSA : Exigence de transparence algorithmique (Article 35) pour vérifier si les plateformes modifient leurs recommandations en période électorale.- Audit algorithmique : Nécessité d'objectiver scientifiquement les biais des algorithmes, souvent considérés comme des « boîtes noires ».

      Pilier 3 : Capacités opérationnelles

      Renforcement des moyens humains et techniques de Viginum pour maintenir l'avance technologique face aux méthodes d'anonymisation sophistiquées.

      Pilier 4 : Coopération internationale

      Il n'est pas envisagé de « Viginum européen ». La compétence opérationnelle reste nationale, mais l'Union européenne est essentielle pour sa puissance normative face aux plateformes américaines.

      --------------------------------------------------------------------------------

      V. Enjeux spécifiques et vulnérabilités

      Les Outre-mer

      Le SGDSN souligne une surexposition des territoires ultramarins. L'Azerbaïdjan a notamment été identifié comme ayant ciblé la Nouvelle-Calédonie et d'autres territoires via des modes opératoires informationnels documentés. Un guide spécifique a été transmis aux préfets et hauts-commissaires pour élever le niveau de vigilance local.

      Les Jeunes

      Nicolas Roche récuse l'idée d'une vulnérabilité intrinsèque des jeunes, notant que ces derniers sont souvent plus « natifs » et sensibilisés aux mécanismes numériques que les générations précédentes. La stratégie doit donc être segmentée de façon plus fine que par simple critère d'âge.

      Le Droit comme arme

      Des évolutions législatives sont envisagées pour :

      • Étendre le « référé fake news » (L163-2 du code électoral) aux élections locales, actuellement limité aux scrutins nationaux.- Clarifier l'article L97 du code électoral pour mieux contrer les nouvelles formes d'ingérence.

      « Notre travail ne consiste pas à dire ce qui est vrai ou faux. Notre travail consiste à préserver la souveraineté française. » — Nicolas Roche

    1. Synthèse de l'Audition du Haut Conseil de la Santé Publique (HCSP) sur la Stratégie Nationale de Santé

      Résumé Exécutif

      L'audition du Haut Conseil de la santé publique (HCSP) devant la commission des affaires sociales de l'Assemblée nationale met en lumière un constat alarmant : le système de santé français fait face à une crise majeure et à un risque systémique.

      Pour y répondre, le HCSP préconise une transformation profonde, passant d'un modèle quasi exclusivement centré sur le soin curatif à une approche systémique de santé publique sur le temps long (10 ans).

      Les priorités identifiées incluent la refonte de l'articulation entre santé publique et soins, la lutte contre les maladies chroniques, la réduction des inégalités sociales et territoriales, et l'intégration de la "santé dans toutes les politiques".

      Malgré l'urgence, les échanges révèlent un blocage opérationnel majeur : l'absence de décret publiant officiellement la Stratégie Nationale de Santé (SNS) 2023-2033, limitant ainsi la visibilité des acteurs et le pilotage territorial.

      --------------------------------------------------------------------------------

      1. Identité et Rôle du Haut Conseil de la Santé Publique (HCSP)

      Le HCSP est une instance d'expertise pluridisciplinaire au service de l'État.

      Ses caractéristiques clés sont les suivantes :

      • Composition : 120 experts indépendants et bénévoles.

      • Organisation : Cinq commissions spécialisées et des groupes de travail permanents transversaux coordonnés par un collège.

      • Missions :

        • Contribuer à l'élaboration, au suivi et à l'évaluation de la SNS.
      • Fournir des réflexions prospectives et des conseils sur les questions de santé publique.

      • Répondre aux saisines ministérielles ou parlementaires et produire des auto-saisines sur des sujets d'intérêt public.

      • Transparence : L'ensemble des avis et rapports est publié en accès libre.

      --------------------------------------------------------------------------------

      2. Diagnostic et Vision Stratégique 2023-2033

      Le HCSP a publié en mars 2023 ses travaux pour la nouvelle SNS.

      Le diagnostic est celui d'une crise majeure nécessitant un changement de paradigme.

      Le choix de la décennie

      Contrairement à la durée légale de 5 ans, le HCSP préconise une stratégie sur 10 ans pour permettre l'atteinte d'objectifs ambitieux à moyen et long terme, s'alignant ainsi sur les dynamiques européennes.

      Les quatre grands objectifs cardinaux

      Le HCSP structure sa vision autour de quatre piliers :

      • Reconstruction du système : Articuler le système de santé publique et le système de soins (objectifs à 5 et 10 ans), renforcer l'attractivité des métiers et valoriser la territorialisation.

      • Maladies chroniques : Diminuer le recours aux soins lié aux maladies chroniques (5 ans), ralentir l'augmentation de leur prévalence par la détection précoce (10 ans) et augmenter l'espérance de vie en bonne santé (15 ans).

      • Réduction des inégalités : Renforcer la politique du "dernier kilomètre" via l'universalisme proportionné et les dispositifs d' "aller-vers" pour réduire la mortalité prématurée.

      • Prévention systémique : Agir sur les déterminants de santé tout au long de la vie (petite enfance, milieu scolaire, travail, seniors) via une approche populationnelle.

      --------------------------------------------------------------------------------

      3. Les 12 Actions Stratégiques Préconisées

      Le HCSP a détaillé 12 leviers pour transformer le système :

      | Action | Description Key Point | | --- | --- | | 1\. Durée de 10 ans | Adopter un temps long pour la SNS. | | 2\. Évolution par étapes | Fixer des jalons à court, moyen et long terme. | | 3\. Santé dans toutes les politiques | Définir la santé comme paramètre intersectoriel et interministériel. | | 4\. Participation citoyenne | Impliquer les populations dans l'élaboration et la mise en œuvre territoriale. | | 5\. Politique de prévention intégrée | Approche socio-écologique par milieu de vie et développement de la littératie en santé. | | 6\. Environnement et Climat | Inscrire l'environnement comme déterminant majeur (approche One Health). | | 7\. Agilité territoriale | Valoriser les acquis numériques de la crise COVID et mobiliser les acteurs locaux. | | 8\. Plan Outre-mer spécifique | Rattraper le retard des indicateurs de santé dans les territoires ultramarins. | | 9\. Gouvernance et Financement | Réformer l'ONDAM et évaluer le retour sur investissement des dépenses de santé. | | 10\. Système d'information | Assurer l'interopérabilité des données et l'accès aux données en vie réelle. | | 11\. Recherche en santé publique | Loi de programmation avec financement pluriannuel sanctuarisé. | | 12\. Prospective | Développer les capacités de compréhension des forces futures pour guider l'action. |

      --------------------------------------------------------------------------------

      4. Thématiques Critiques et Enjeux de Société

      Santé Mentale : Un secteur en "échec de soins"

      Le constat des parlementaires, partagé par le HCSP, souligne une dégradation alarmante :

      • Augmentation de 80 % des épisodes dépressifs chez les 18-24 ans en quatre ans.

      • Chute de 34 % du nombre de pédopsychiatres entre 2010 et 2022.

      • Le HCSP insiste sur le besoin de temps pour rattraper le retard structurel de ce secteur, malgré son statut de "grande cause nationale".

      Territoires et Outre-mer

      La stratégie doit s'adapter aux contextes locaux extrêmes :

      • Mayotte : Crise de l'eau, pression migratoire, épidémies de choléra et manque de spécialistes.

      • Guyane : Problématiques d'éloignement (forêt amazonienne) nécessitant des modes de prévention spécifiques.

      • Principe : Le HCSP intègre systématiquement un volet ultramarin dans ses rapports pour éviter une approche standardisée inopérante.

      Inégalités et Discriminations

      L'audition a soulevé la question des stéréotypes racistes en médecine (ex: "syndrome méditerranéen").

      Bien que non explicitement détaillé dans le rapport SNS du HCSP, l'instance rappelle que l'équité et l'égalité en droit sont des principes fondamentaux, et que ces sujets relèvent de la formation éthique et juridique des professionnels de santé.

      --------------------------------------------------------------------------------

      5. Analyse des Freins Politiques et Budgétaires

      L'absence de décret SNS 2023-2033

      Un point de tension majeur apparaît entre les élus et le HCSP : le gouvernement n'a toujours pas pris le décret actant la nouvelle stratégie.

      • Conséquence : Un manque de visibilité pour le Parlement et les citoyens, ainsi qu'une difficulté pour les Agences Régionales de Santé (ARS) à actualiser leurs projets régionaux de santé (PRS).

      • Position du HCSP : L'instance rappelle son rôle strictement consultatif, déclinant toute responsabilité quant à l'agenda réglementaire de l'exécutif.

      Le défi du financement

      Le passage d'un modèle curatif à un modèle préventif pose la question de l'allocation des ressources :

      • Les dépenses de prévention représentent environ 8 % des 300 milliards d'euros de budget santé.

      • Le HCSP suggère que le financement de la prévention ne doit pas reposer uniquement sur l'Assurance Maladie, mais doit être intersectoriel (éducation, sport, alimentation).

      • Efficience : Des économies significatives (estimées à 300 000 € par an pour un service de 10 urologues à titre d'exemple cité par un député) pourraient être réalisées en améliorant la pertinence des actes et la coopération interprofessionnelle.

      --------------------------------------------------------------------------------

      Citations Clés

      « Le diagnostic [...] est bien celui d'une crise majeure et d'un risque systémique de l'ensemble du système de soins et du système de santé. » — Véronique Ghileron, Présidente du HCSP

      « La santé est bien un sujet politique et elle est largement produite par nos façons de vivre. Elle doit être abordée dans tous les secteurs. » — Véronique Ghileron

      « Nous ne sommes plus face à un secteur en tension mais face à un secteur en échec de soins. » — Sandrine Rousseau, Députée (à propos de la santé mentale)

      « La prévention n'est pas un sujet médical uniquement... c'est une politique de vie. » — HCSP

    1. Harcèlement scolaire : Analyse des impacts durables et des processus de reconstruction

      Synthèse

      Le harcèlement scolaire ne constitue pas une simple phase transitoire de l'enfance, mais agit comme un traumatisme fondateur dont les répercussions s'étendent sur plusieurs décennies.

      Le document met en lumière une « double peine » pour les victimes : au traumatisme initial s'ajoute un retard social et émotionnel persistant à l'âge adulte.

      Les témoignages de Nathalie, Laurine et Samuel révèlent que les stigmates — qu'ils soient psychologiques (manque de confiance, anxiété, colère) ou sociaux (difficultés relationnelles et amoureuses) — forgent une identité marquée par la défiance et le sentiment de temps perdu.

      Si des dispositifs modernes comme le programme « Phare » et le rôle des élèves ambassadeurs marquent un progrès institutionnel, la reconstruction demeure un processus long, laborieux et parfois inabouti.

      --------------------------------------------------------------------------------

      1. Profils des victimes et mécanismes de l'oppression

      Le harcèlement s'appuie systématiquement sur la détection d'une différence ou d'une vulnérabilité, qu'elle soit physique, cognitive ou sociale.

      Facteurs de vulnérabilité identifiés

      • Particularités physiques et santé : L'acné sévère, la corpulence, ou des problèmes dentaires/maxillaires (cas de Laurine) servent de catalyseurs aux moqueries.

      • Troubles neurodéveloppementaux : La dyspraxie (cas de Samuel) entraîne des difficultés motrices et une fatigue rapide, perçues par les harceleurs comme une forme de « saleté » ou de « maladie contagieuse ».

      • Décalage cognitif et émotionnel : Une grande sensibilité, un tempérament « fantasque » ou un décalage entre les capacités intellectuelles précoces et une immaturité sociale (entrer en primaire à 4 ans) créent une cible isolée.

      • Identité culturelle : L'usage d'une langue maternelle différente (l'alsacien pour Nathalie) et un accent prononcé ont été des vecteurs d'exclusion dès l'entrée au collège.

      Typologie des actes de harcèlement

      | Forme de harcèlement | Exemples concrets issus des témoignages | | --- | --- | | Physique | Tapes sur la tête, bousculades, jets de sucreries mâchées (carambars/malabars), vols de sacs, gifles dans les transports scolaires. | | Verbal et Psychologique | Surnoms dénigrants (« la calculatrice »), insultes sexistes ou homophobes, rumeurs de perversion ou d'inceste familial. | | Social et Environnemental | Violation de l'intimité (agressions dans les toilettes), exclusion des groupes de discussion (cyberharcèlement), mépris silencieux et regards condescendants. |

      --------------------------------------------------------------------------------

      2. La « Double Peine » : Conséquences à l'âge adulte

      Les sources soulignent que le harcèlement « paramètre » les individus, altérant leur perception du monde et d'eux-mêmes bien après la fin de la scolarité.

      Le retard social et relationnel

      Les victimes font état d'un « coche raté » durant l'adolescence.

      N'ayant pas appris les codes sociaux et amoureux au moment opportun par peur du rejet, elles développent :

      • Une méfiance généralisée : Une difficulté à nouer des liens, percevant chaque soupir ou rire comme une moquerie potentielle.

      • Un sentiment d'indignité : La conviction profonde d'être « dégoûtant » ou indigne d'être aimé, rendant les relations amoureuses catastrophiques ou inexistantes.

      • L'atrophie de la vie sociale : Une tendance à l'isolement (« devenir une sauvage ») et une incapacité à se projeter dans l'avenir.

      Impacts sur la santé mentale

      • Traumatismes persistants : Présence d'une mémoire traumatique qui se réactive dans des contextes banals (prendre le tramway, passer devant son ancien collège).

      • Pathologies lourdes : États dépressifs sévères, idées noires, recours aux antidépresseurs sur de longues durées et épisodes d'hospitalisation en psychiatrie.

      • Érosion de l'estime de soi : Un manque de confiance en miettes, une haine du propre corps et une exigence envers soi-même parfois écrasante.

      --------------------------------------------------------------------------------

      3. Le rôle des institutions et de l'entourage

      L'analyse montre une évolution historique entre la gestion passée, marquée par l'indifférence, et les tentatives actuelles de prise en charge.

      La défaillance historique des adultes

      Dans les années 80 et 90, le harcèlement était souvent minimisé par le personnel éducatif :

      • Invisibilisation : Les adultes considéraient ces situations comme des « histoires entre gamins ».

      • Impuissance apprise : Des cadres éducatifs (CPE, surveillants) refusaient d'intervenir sous prétexte que les faits se déroulaient « devant la grille » et non à l'intérieur.

      • Injonction à l'affirmation : Les victimes se voyaient reprocher leur manque de caractère (« il fallait s'affirmer »), plaçant la responsabilité du harcèlement sur la proie.

      Les dispositifs contemporains

      • Programme Phare : Plan de lutte officiel de l'Éducation nationale visant à structurer la réponse au harcèlement.

      • Élèves Ambassadeurs : Des élèves formés pour agir comme « veilleurs », repérer les situations de détresse et offrir un soutien entre pairs, inversant ainsi le rapport de force collectif.

      • Limites persistantes : Le cyberharcèlement reste une frontière difficile à réguler, et la visibilité des situations de harcèlement demeure problématique pour les élèves qui ne savent pas toujours distinguer la théorie de la pratique.

      --------------------------------------------------------------------------------

      4. Processus de reconstruction et résilience

      La reconstruction est décrite non pas comme un retour à l'état initial, mais comme une cicatrisation complexe.

      • L'accompagnement thérapeutique : Le travail avec des psychologues ou psychiatres est jugé indispensable pour traiter la culpabilité et la colère.

      Des exercices comme la rédaction de lettres aux harceleurs ou la reconstruction de la chronologie des faits aident à extérioriser le vécu.

      • L'engagement professionnel : Certains anciens harcelés deviennent enseignants, voyant dans ce métier une évidence pour protéger les nouvelles générations, tout en restant conscients de leur propre fragilité émotionnelle face au sujet.

      • Le sport et la réussite académique : Pour certains, le sport permet de reprendre possession de leur corps, tandis que l'excellence professionnelle (études de médecine) sert de moteur de reconstruction, bien que la personnalité reste « écorchée ».

      • L'acceptation de la colère : Contrairement aux injonctions sociales, le pardon n'est pas toujours souhaité. La colère est parfois perçue comme un moteur de survie légitime face à l'injustice subie.

      Conclusion

      Le harcèlement scolaire est un préjudice global qui prive les individus de leur dignité et de leur capacité à choisir leur rythme de vie.

      Si la résilience est possible par le dialogue et le soutien thérapeutique, les sources indiquent que l'on ne « guérit » jamais totalement : on apprend à vivre avec une identité forgée par l'adversité.

      Le temps perdu, particulièrement les « amours de jeunesse » et l'insouciance, demeure une perte inéluctable.

      L'enjeu majeur pour l'institution reste la détection précoce afin d'éviter que le « petit enfant harcelé » ne devienne systématiquement un « adulte abîmé ».

    1. Synthèse de l'Audition du Président du Conseil d’Évaluation de l’École (CEE)

      Résumé Exécutif

      Ce document synthétise les points clés de l'audition du président du Conseil d’évaluation de l’école (CEE) devant la Commission de la culture et de l’éducation de l'Assemblée nationale.

      Le CEE, succédant au Cnesco suite à la loi de 2019, a pour mission de structurer et d'analyser l'évaluation systémique des établissements scolaires français.

      Les points saillants de l'audition incluent :

      • Succès quantitatif : 100 % des établissements du second degré et 66 % du premier degré ont été évalués.

      La France a comblé son retard par rapport aux autres pays de l'OCDE.

      • Changement de paradigme : L'évaluation n'est pas un audit de conformité, mais un outil d'auto-évaluation visant à renforcer l'autonomie et le pouvoir d'agir des équipes locales.

      • Phase de transformation : Le passage de la simple évaluation (Vague 1) à la transformation opérationnelle (Vague 2) est l'enjeu majeur.

      Les projets d'établissement doivent devenir des « plans de réussite collectifs ».

      • Pilotage par la donnée : Une culture de la donnée s'installe, avec des outils innovants comme le projet « Alumni » pour suivre le devenir des élèves 3 à 5 ans après leur sortie.

      • Défis persistants : Les critiques parlementaires soulignent des risques de doublons administratifs, l'urgence face à la dégradation des conditions de travail des enseignants et l'inertie potentielle du système.

      --------------------------------------------------------------------------------

      I. Missions et Cadre Institutionnel du CEE

      Le Conseil d’évaluation de l’école (CEE), créé par la loi de 2019 pour une école de la confiance, remplit quatre missions fondamentales définies par le législateur :

      • Cohérence des évaluations : Veiller à l'unité des travaux portés par le ministère (notamment ceux de la DEPP) concernant les acquis des élèves et l'école inclusive.

      • Expertise méthodologique : Donner un avis sur les outils et les résultats des évaluations du système éducatif.

      • Mesure des inégalités : Proposer des méthodologies pour évaluer les disparités territoriales et formuler des recommandations pour les réduire (équité scolaire).

      • Cadre de l'évaluation des établissements : Définir la méthodologie, les outils d'auto-évaluation et la fréquence des évaluations externes.

      --------------------------------------------------------------------------------

      II. Bilan des Évaluations d'Établissements

      1. État d'avancement quantitatif

      Le président souligne un « succès collectif » concernant le déploiement des évaluations depuis 2020 :

      • Second degré : 100 % des établissements évalués à ce jour.

      • Premier degré : 66 % de couverture atteinte cette année.

      • Comparaison internationale : La France, précédemment dernière de l'OCDE (après la Bulgarie) en matière d'évaluation systémique, a rattrapé son retard.

      2. La Philosophie de l'Évaluation

      L'évaluation se distingue radicalement d'un audit de conformité.

      Son but est d'interroger les décisions propres de l'établissement dans le cadre de ses marges d'autonomie.

      • Auto-évaluation : Un diagnostic partagé par les équipes locales.

      • Évaluation externe : Un regard extérieur pour challenger et objectiver le diagnostic initial.

      • Objectif : Accompagner le « pouvoir d'agir » des acteurs de terrain plutôt que de vérifier l'application de directives verticales.

      --------------------------------------------------------------------------------

      III. Les Domaines Opérationnels de l'Évaluation

      Pour simplifier la réflexion des équipes, le cadre de l'évaluation s'articule autour de 4 domaines principaux déclinés en 12 portes d'entrée :

      | Domaine | Thématiques couvertes | | --- | --- | | Acquis des élèves | Parcours scolaires, orientation, maîtrise des fondamentaux (français/maths). | | Bien-être et Climat | Bien-être des élèves et personnels, lutte contre le harcèlement, climat scolaire. | | Organisation et Pédagogie | Choix pédagogiques, ressources (manuels), formation continue des personnels. | | Partenariats | Liaisons inter-degrés, relations avec les familles et partenaires extérieurs. |

      --------------------------------------------------------------------------------

      IV. Stratégie de Transformation et Perspectives

      Le président du CEE présente une stratégie en trois phases pour assurer l'utilité des évaluations :

      • Vague 1 (Évaluation) : Constat et massification des données.

      • Vague 2 (Transformation) : L'évaluation doit servir de socle aux projets d'établissement.

      Actuellement, moins de 20 % des projets d'établissement sont jugés opérationnels.

      L'objectif est de les transformer en plans de réussite collectifs basés sur les résultats de l'évaluation.

      • Vague 3 (Adaptation systémique) : Si la transformation est réelle, le système devra s'adapter pour répondre aux besoins spécifiques exprimés par les établissements (renversement du plan national de formation).

      Innovations Clés :

      • Projet Alumni : En partenariat avec l'ONICEP, mise à disposition d'indicateurs de suivi des anciens élèves pour mesurer la contribution réelle de l'établissement à l'égalité des chances et à l'insertion (ex: accès des filles aux filières scientifiques 5 ans après le collège).

      • Indicateurs simplifiés : Dotation des chefs d'établissement de trois indicateurs simples de performance scolaire pour piloter sur le temps long.

      • Leadership scolaire : Renforcement des compétences managériales des cadres (via l'IH2EF) pour passer d'une gestion administrative à un pilotage pédagogique et stratégique.

      --------------------------------------------------------------------------------

      V. Synthèse des Interventions Parlementaires

      L'audition a donné lieu à des échanges critiques sur l'efficacité et la structure du CEE :

      1. Critiques sur l'utilité et la structure (RN)

      • Remise en question de la valeur ajoutée du CEE face à des organismes existants (DEPP, Inspection Générale, Cour des Comptes).

      • Dénonciation d'un système qui est « administré » mais ni « dirigé », ni « piloté ».

      2. Conditions de travail et implication des enseignants (Socialistes)

      • Alerte sur l'épuisement professionnel : 70 % des enseignants ne recommanderaient pas leur métier.

      • Nécessité absolue d'associer étroitement les enseignants aux conclusions de l'auto-évaluation pour qu'elles soient suivies d'effets.

      3. Opérationnalité et délais (Droite Républicaine)

      • Inquiétude sur le temps long : s'il a fallu 7 ans pour la phase d'évaluation, la transformation risque d'être trop lente.

      • Besoin de garanties sur la traduction des rapports en décisions tangibles et en moyens.

      4. Enjeux territoriaux et démographiques (Horizons, Députés individuels)

      • Contradiction entre la fermeture de classes due à la baisse démographique et la dégradation du niveau scolaire.

      • Questionnement sur la pertinence de l'évaluation nationale face aux spécificités locales et à la mixité sociale (ségrégation scolaire).

      • Décrochage en mathématiques (classements PISA) et nécessité de réformer la formation continue.

      --------------------------------------------------------------------------------

      VI. Réponses du Président aux Interrogations

      Le président a apporté les précisions suivantes :

      • Sur la démographie : La réduction du nombre d'élèves par classe a un impact marginal sur la performance.

      La solution réside dans l'organisation collective et le pilotage du parcours de l'élève (vision horizontale sur 15 ans).

      • Sur les doublons : Le CEE est complémentaire. Contrairement à la DEPP ou l'Inspection, il est le seul à promouvoir l'auto-évaluation et à interroger le terrain sur ses marges de manœuvre.

      • Sur la transparence : Depuis février 2024, le CEE a voté pour que les rapports d'évaluation soient accessibles à l'ensemble de la communauté éducative, incluant les parents et les élèves.

      • Sur la confiance : La transformation nécessite de restaurer la confiance entre les enseignants et les directions (perte de 14 points de confiance en 10 ans). L'évaluation doit être un levier pour reconnaître la professionnalité des acteurs.

    1. Briefing : Le traitement judiciaire des violences sexuelles incestueuses sur mineurs

      Ce document synthétise les témoignages et analyses fournis par les représentants des syndicats de la magistrature (USM, Syndicat de la Magistrature, Unité Magistrat) lors de leur audition devant la commission d'enquête de l'Assemblée nationale.

      Résumé exécutif

      L'institution judiciaire française traverse une crise structurelle profonde qui entrave le traitement efficace des violences sexuelles incestueuses.

      Malgré une libération de la parole et une augmentation des signalements, la réponse judiciaire reste marquée par un taux de classement sans suite alarmant (environ 70 % à 80 % dans les affaires de viols sur mineurs) et des délais de traitement inacceptables pouvant dépasser un an et demi entre le signalement et les premières mesures de garde à vue.

      Les principaux points de blocage identifiés sont :

      • Un manque criant de moyens humains et matériels : La France dispose de deux à quatre fois moins de magistrats et procureurs que la moyenne européenne, entraînant une gestion des flux au détriment de la qualité.

      • Une faillite de la chaîne de protection : Les enquêtes sont ralenties par le sous-effectif des brigades spécialisées et la pénurie d'experts psychiatres qualifiés.

      • Un besoin de réforme des outils civils : Les syndicats plaident pour un renforcement de l'ordonnance de protection, gérée par le juge aux affaires familiales (JAF), pour protéger l'enfant dès la phase de soupçon.

      • Le poids du passé : Le traumatisme de l'affaire d'Outreau continue d'influencer la perception de la parole de l'enfant, oscillant entre déni et suspicion.

      --------------------------------------------------------------------------------

      1. Une institution sous tension : le constat des moyens

      L'analyse des syndicats souligne un décalage majeur entre les attentes sociétales et la réalité budgétaire de la justice.

      Déficit de personnel et comparaison européenne

      Le système judiciaire français est sous-dimensionné par rapport à ses voisins européens (chiffres de la CEPEJ) :

      | Catégorie | France (pour 100 000 hab.) | Moyenne Européenne | | --- | --- | --- | | Procureurs | 3,2 | 12,2 | | Juges | 11,3 | 21,9 |

      Il manquerait environ 20 000 magistrats pour assurer une justice de qualité, alors que seulement 8 500 sont actuellement en poste.

      Conséquences sur les enquêtes

      Le manque d'enquêteurs spécialisés crée des stocks de dossiers "morts".

      • À Nantes, 500 dossiers étaient en attente en mars 2024 avec seulement 9 enquêteurs.

      • À Paris, le délai entre un signalement et une garde à vue est estimé entre 12 et 18 mois.

      • La priorité est souvent donnée à la délinquance de voie publique (opérations "place nette") au détriment des enquêtes complexes sur l'inceste qui exigent du temps (auditions multiples, perquisitions informatiques).

      Obsolescence des outils informatiques

      Les magistrats travaillent encore en "silos".

      Le logiciel des juges des enfants (non "webisé") ne permet pas de savoir si un parent fait l'objet d'une procédure dans une autre juridiction.

      L'absence d'interopérabilité entre le civil et le pénal freine la circulation des informations cruciales pour la protection des mineurs.

      --------------------------------------------------------------------------------

      2. La parole de l'enfant et l'expertise judiciaire

      Le traitement de la parole de l'enfant reste le point le plus critique de la procédure.

      La formation des acteurs

      Bien que la formation initiale à l'École Nationale de la Magistrature (ENM) intègre des modules sur les violences sexuelles (auditions filmées, protocoles NI-CHD), la formation continue est souvent délaissée par les magistrats en raison d'une charge de travail excessive.

      • Statistique clé : 60 % des auditions de mineurs victimes ne sont toujours pas réalisées par des enquêteurs formés.

      La crise de l'expertise

      Le recours aux experts psychiatres et psychologues est décrit comme une "misère".

      • Pénurie : Certaines cours d'appel n'ont aucun pédopsychiatre inscrit.

      • Qualité variable : Des expertises cruciales tiennent parfois sur une seule page.

      • Délais : Une demande de contre-expertise peut rallonger la procédure de 18 mois à 2 ans.- Coût : La faible rémunération des experts dissuade les professionnels les plus qualifiés de travailler pour la justice.

      Le traumatisme d'Outreau

      L'affaire d'Outreau est citée comme une "régression fatale".

      Elle a instauré un climat de suspicion systématique envers la parole de l'enfant, poussant certains magistrats à exiger des preuves matérielles impossibles à obtenir dans des dossiers d'inceste (souvent "parole contre parole").

      --------------------------------------------------------------------------------

      3. Propositions de réformes et débats juridiques

      Les syndicats proposent plusieurs leviers pour améliorer la protection et le jugement.

      Élargissement du cadre légal

      • Définition de l'inceste : Unanimité sur l'intégration des cousins et cousines dans la définition légale.

      • Ordonnance de protection : Proposition de calquer l'ordonnance de protection des enfants sur celle des violences conjugales.

      Elle permettrait au JAF de statuer en 6 jours sur la suspension de l'autorité parentale sur la base de la "vraisemblance" des faits, sans attendre la preuve pénale.

      Opposition au "Plaider-coupable" criminel

      Le projet de loi visant à instaurer un "plaider-coupable" pour les crimes (dont les viols incestueux) rencontre une vive opposition :

      • Risque d'une justice "expéditive" et dégradée.

      • Nécessité de l'audience publique pour poser l'interdit social.

      • Doute sur la réalité de la reconnaissance des faits par les auteurs (souvent assortie d'une minimisation du type "elle était d'accord").

      La question de l'imprescriptibilité

      La majorité des intervenants est défavorable à l'imprescriptibilité.

      L'argument principal est que cela "vend du rêve" aux victimes : plus le temps passe, plus les preuves se dégradent, rendant la condamnation quasi impossible et le procès déceptif.

      Le Syndrome d'Aliénation Parentale (SAP)

      Unité Magistrat et le Syndicat de la Magistrature alertent sur la persistance de ce concept "pseudo-scientifique".

      Utilisé pour discréditer les mères protectrices en les accusant d'instrumentaliser l'enfant, le SAP contribuerait à masquer les situations réelles d'inceste.

      --------------------------------------------------------------------------------

      4. Dysfonctionnements systémiques et protection de l'enfant

      Le classement "21" (infraction insuffisamment caractérisée)

      Ce motif de classement est majoritaire.

      Les syndicats précisent qu'un classement ne signifie pas que les faits n'ont pas eu lieu, mais que l'enquête n'a pas permis de réunir des preuves suffisantes pour un procès pénal.

      Cela crée un sentiment de déni violent pour l'enfant.

      La non-représentation d'enfant (NRE)

      Le Syndicat de la Magistrature prône la dépénalisation de ce délit, souvent utilisé contre les mères qui refusent de confier leur enfant à un père suspecté de violences.

      Actuellement, 83 % des personnes condamnées pour NRE sont des mères.

      Manque de coordination "Transversalité"

      Le besoin d'un "référent inceste" en juridiction est évoqué, mais la multiplication des casquettes de "référent" (VIF, mineurs, etc.) sans moyens dédiés est jugée inefficace.

      La priorité doit être la circulation fluide des dossiers entre le procureur, le juge des enfants et le juge aux affaires familiales.

      --------------------------------------------------------------------------------

      Citations clés

      « La justice n'est pas réparée [...]. Elle a besoin de cap clair quant aux priorités, car tout ne peut pas être prioritaire. » — Ludovic Fria (USM)

      « Un classement ne signifie pas que des faits n'ont pas existé. C'est une vérité judiciaire, pas une vérité absolue. » — Ségolène Marquet (SM)

      « Nous sommes obligés de prioriser les urgences entre elles. C'est compliqué quand on reçoit les victimes et qu'on leur dit : "Votre dossier n'a pas avancé car les enquêteurs ont 15 commissions rogatoires avant la vôtre". » — Stéphanie Caprin (USM)

      « L'inceste est le tabou par excellence. Ce n'est pas un crime comme un autre. » — Béatrice Brugère (Unité Magistrat)

    1. Briefing : Traitement judiciaire et médical des violences sexuelles incestueuses

      Ce document de synthèse analyse les témoignages et expertises présentés devant la commission d'enquête parlementaire relative au traitement judiciaire des viols et agressions sexuelles incestueuses sur mineurs.

      Il compile les observations de trois experts pédopsychiatres : les docteurs Maurice Berger, Françoise Fericelli et Myriam Pieron-Bertier.

      Résumé exécutif

      L'analyse des sources révèle une crise systémique dans la protection de l'enfance face à l'inceste en France.

      Les points critiques identifiés sont les suivants :

      • Terrorisation du corps médical : Les médecins hésitent à signaler les soupçons d'inceste par crainte de sanctions disciplinaires systématiques de la part du Conseil de l'Ordre des médecins (CNOM), souvent à la suite de plaintes déposées par les agresseurs présumés.

      • Infiltration d'idéologies non scientifiques : Le "Syndrome d'Aliénation Parentale" (SAP), bien qu'exclu des classifications internationales (DSM-5, CIM-11), continue d'influencer massivement les experts et les magistrats, conduisant à discréditer la parole de l'enfant et du parent protecteur.

      • Carence d'expertise spécialisée : Les expertises judiciaires sont trop souvent confiées à des psychiatres d'adultes sans formation en clinique de l'enfant, menant à des évaluations superficielles, voire traumatisantes.

      • Déni spécifique chez le nourrisson : Il existe un déni massif concernant les agressions sexuelles sur les enfants de moins de trois ans, malgré une sémiologie clinique et psychosomatique précise.

      • L'idéologie du "lien à tout prix" : La priorité donnée au maintien des liens familiaux et à la coparentalité, même en présence de violences avérées, met les enfants en danger immédiat de réexposition traumatique.

      --------------------------------------------------------------------------------

      1. Obstacles au signalement et rôle du Conseil de l'Ordre des médecins

      Le système actuel entrave la détection précoce de l'inceste en raison de la vulnérabilité juridique des médecins signalants.

      La "profession terrorisée"

      • Les médecins libéraux (généralistes, pédiatres) sont les plus exposés aux attaques du Conseil de l'Ordre lorsqu'ils rédigent un certificat ou un signalement.

      • Même l'utilisation des modèles officiels fournis par le CNOM ne garantit aucune immunité.

      Un médecin a été condamné à 15 jours d'interdiction d'exercer malgré la validation préalable de son écrit par le président de son conseil départemental.

      • Conséquence : Le taux de signalements provenant des médecins est alarmant (entre 1 % et 2 % selon les chiffres récents de l'UMJ de l'Hôtel-Dieu).

      Nécessité d'une réforme structurelle

      • Limitation du pouvoir disciplinaire : Les experts préconisent que le CNOM perde son pouvoir de sanction dans les affaires de signalements concernant les enfants, s'inspirant des modèles suisse ou anglais où ces questions relèvent d'instances mixtes incluant des magistrats et des représentants de la société civile.

      • Sécurisation juridique : L'obligation de signalement ne peut être envisagée sans une protection préalable et absolue du médecin contre les poursuites ordinales (Préconisation 17 de la Civise).

      --------------------------------------------------------------------------------

      2. Défaillances et normes de l'expertise judiciaire

      L'expertise est le pivot de la décision judiciaire, mais sa qualité est jugée erratique et souvent insuffisante.

      Les lacunes de la pratique actuelle

      • Absence de spécialisation : Des psychiatres d'adultes expertisent des enfants sans maîtriser le développement psycho-affectif spécifique à chaque âge.

      • Conditions inadaptées : Entretiens trop courts (parfois 10 minutes), absence de matériel de jeu ou de dessin, et réception des enfants à des heures inappropriées (sieste, soirée).

      • Manquements méthodologiques : Défaut d'observation des interactions parent-enfant (diades), omission de la lecture du carnet de santé et absence de contact avec les professionnels suivant l'enfant (enseignants, éducateurs).

      Les critères d'une expertise de qualité

      Une expertise rigoureuse devrait répondre aux standards suivants :

      • Durée : Entre 12 et 25 heures de travail effectif pour une évaluation complète de la famille.

      • Méthodologie : Analyse clinique, utilisation de protocoles validés (NICHD/Nich pour l'audition, SVA pour la crédibilité des déclarations).

      • Formation : Exigence d'une pratique clinique de 5 à 10 ans en pédopsychiatrie avant l'inscription sur les listes d'experts.

      --------------------------------------------------------------------------------

      3. L'influence du "Syndrome d'Aliénation Parentale" (SAP)

      Le SAP est décrit comme un "fourre-tout conceptuel" utilisé stratégiquement par les agresseurs pour obtenir l'impunité.

      Un outil de disqualification

      • Lorsqu'un enfant dénonce un inceste dans un contexte de séparation, l'agresseur accuse fréquemment la mère d'être "aliénante" ou "fusionnelle".

      • Cette rhétorique conduit à une inversion accusatoire : la détresse normale d'un parent protecteur est pathologisée et utilisée pour justifier le transfert de la garde à l'agresseur présumé.

      Une infiltration institutionnelle

      • Bien que rejeté par l'OMS et le Parlement européen, le lobby du SAP a infiltré les manuels scolaires et a longtemps été enseigné à l'École Nationale de la Magistrature (ENM).

      • Les experts soulignent que la "mère fusionnelle" est souvent une mère parfaitement adaptée aux besoins de protection d'un enfant traumatisé.

      --------------------------------------------------------------------------------

      4. Clinique de l'inceste chez le nourrisson (0-3 ans)

      Les agressions sur les très jeunes enfants constituent un angle mort majeur du système judiciaire.

      Sémiologie spécifique

      L'inceste chez le bébé se manifeste par des signes cliniques précis que les experts non formés ignorent :

      • Troubles psychosomatiques : Eczéma brutal, infections vulvovaginales ou angines à répétition, béance anale ou vaginale.

      • Troubles du comportement : Masturbation compulsive, phobie des soins d'hygiène (change, douche), agitation extrême (souvent confondue avec un TDH) ou retrait relationnel massif.

      • Données probantes : Le carnet de santé est un document biographique crucial qui révèle souvent des ruptures de courbes de croissance ou des hospitalisations suspectes.

      La fiabilité de la parole précoce

      • Les enfants de 2 ou 3 ans "ne mentent pas" car ils n'ont aucune connaissance préalable de la sexualité adulte.

      • Leur vocabulaire est spécifique (ex: "manger le pipi" pour désigner des actes sexuels oraux).

      Ignorer ces propos au motif de "l'âge de raison" (7 ans) est une erreur scientifique.

      --------------------------------------------------------------------------------

      5. Conséquences judiciaires et "Idéologie du lien"

      La justice française est critiquée pour son attachement dogmatique au maintien du lien biologique, même au détriment de la sécurité de l'enfant.

      La réexposition traumatique

      • Visites médiatisées : Le maintien de contacts en présence de professionnels est souvent inefficace pour protéger l'enfant du retraumatisme.

      Des cas de viols ont été signalés lors de ces rencontres.

      • Syndrome de Stockholm : Le fait qu'un enfant se précipite vers son parent agresseur au tribunal ne prouve pas l'absence de violences, mais témoigne d'un mécanisme de survie et d'emprise.

      Statistiques et faits saillants

      | Catégorie | Donnée statistique | | --- | --- | | Profil des agresseurs | 94-96 % d'hommes (pères, oncles, grands-parents) ; 4-6 % de femmes. | | Risque corrélé | 6,5 fois plus de risques d'inceste dans les foyers marqués par des violences conjugales. | | Crédibilité | Les fausses allégations volontaires représentent seulement 0,5 % à 6 % des cas. | | Rétractations | 22 % des enfants se rétractent par peur, mais 93 % d'entre eux reviennent ensuite sur leur rétractation. | | Démographie médicale | Environ 600 pédopsychiatres en France, moyenne d'âge de 62 ans. |

      --------------------------------------------------------------------------------

      6. Préconisations concrètes

      Pour remédier à ces dysfonctionnements, les experts proposent :

      • Réforme de la déontologie : Sortir les signalements d'enfants de la compétence disciplinaire du Conseil de l'Ordre.

      • Spécialisation des experts : Réserver les expertises de mineurs aux seuls professionnels justifiant d'une pratique clinique longue auprès des enfants (minimum 5 ans).

      • Formation à l'ENM : Remplacer les enseignements basés sur le SAP par des modules sur la victimologie et le psychotraumatisme de l'enfant.

      • Application des protocoles SVA/Nich : Généraliser ces outils de validation scientifique pour limiter l'arbitraire de l'expert.

      • Remise en cause du lien absolu : Reconnaître juridiquement l'existence de "disparentalités définitives" et suspendre tout droit de visite dès lors que la santé psychique de l'enfant est gravement compromise.

    1. Las estructuras secundarias que forman los ARNlnc son complejas e importantes para su función, lo cual les permite su unión con RNA o DNA a través de apareamiento de bases; también interactúan con proteínas a través de reconocimiento de la secuencia del RNA con una fracción de la estructura proteica; también pueden actuar como escalafones para permitir la interacción de múltiples proteínas, lo que resulta en la reunión de factores que de otra manera no podrían interactuar, con lo cual se favorece o inhibe la expresión génica (figura 8-8). A pesar de que sólo una fracción pequeña (cerca de 5%) de ARNlnc se ha caracterizado de forma funcional, el rango de procesos biológicos en los que se conoce que están involucrados está creciendo continuamente; algunos de éstos tienen implicaciones importantes en procesos fisiológicos y patológicos. Dentro de los primeros se encuentra la inactivación del cromosoma X en el sexo femenino, la regulación de la respuesta inmune, la diferenciación celular; respecto a los procesos patológicos, existe evidencia sobre su participación en diferentes tipos de cáncer, diabetes, fibrosis hepática, entre otras enfermedades. ++ Algunos de los mecanismos en los que participa el ARNlnc en la regulación génica son los siguientes: modificación de histonas y del estado de la cromatina: los ARNlnc se unen a complejos proteicos que modifican la cromatina, por ejemplo, complejos que metilan la histona H3. También se unen a proteínas represoras, activadoras, factores transcripcionales y factores de corte y empalme de genes. De igual forma, actúan para guiar proteínas a sus blancos o como peldaños que unen complejos proteicos en loci específicos. Existen datos acerca de su función como esponjas o a través de apareamiento de bases con miARN al inhibir su procesamiento a miARN maduros.

      Este tema de los ARN largos no codificantes (lncRNA) es de lo más avanzado en biología molecular actual, Gaby. Básicamente, son los "directores de orquesta" del genoma: no se convierten en proteínas, pero le dicen a todas las demás moléculas qué hacer.

      Aquí te resumo los puntos clave de tu texto para que los entiendas de forma sencilla:

      1. ¿Qué son? (Sus "credenciales") Tamaño: Tienen más de 200 nucleótidos (son grandes).

      Aspecto: Engañan a la célula porque parecen un ARNm normal (tienen su Cap 5', su cola Poli-A y sufren splicing), pero no codifican proteínas.

      Especificidad: Son muy "especiales"; no están en todas las células igual, sino que aparecen en momentos específicos (control temporal) y en células específicas.

      1. ¿Cómo funcionan? (Sus 4 roles principales) El texto menciona que actúan de varias formas gracias a su estructura compleja:

      Como Guías: Se pegan a una proteína y la llevan de la mano hasta el lugar exacto del ADN donde debe trabajar.

      Como Escalafones (Andamios): Sirven como una plataforma donde varias proteínas se sientan juntas para poder interactuar entre ellas. Sin el lncRNA, estas proteínas estarían perdidas por la célula y no se encontrarían.

      Como "Esponjas": Se pegan a los microARN (miARN) y los atrapan. Al "secuestrarlos", evitan que los miARN destruyan a otros mensajes. Es un guardaespaldas de los ARNm.

      Control de la Cromatina: Se unen a complejos que metilan histonas (como la H3). Esto decide si el ADN se enrolla (se apaga) o se desenrolla (se enciende).

      1. Importancia Médica (¿Por qué te lo preguntan en medicina?) Tu texto destaca procesos vitales donde los lncRNA son los protagonistas:

      Inactivación del cromosoma X: En las mujeres, uno de los dos cromosomas X debe "apagarse" para no tener doble dosis de genes. Un lncRNA llamado Xist es el encargado de cubrir todo ese cromosoma y silenciarlo.

      Diferenciación celular: Ayudan a que una célula madre sepa si debe ser neurona o hepatocito.

      Patologías: Si estos lncRNA fallan o se expresan de más, pueden causar cáncer (ayudando a que los tumores crezcan), diabetes o fibrosis hepática.

    2. Adición de grupos químicos a proteínas ++ Otro mecanismo por el cual se puede regular la expresión de un gen es la adición de diferentes grupos químicos a cualquiera de los aminoácidos que conforman una proteína. Sin la adición de estos compuestos la proteína no será una proteína madura y funcional. Entre las adiciones más comunes se encuentran las acetilaciones, las carboxilaciones, las metilaciones, las hidroxilaciones y las fosforilaciones.

      osforilación: El "Interruptor" MaestroEs la adición de un grupo fosfato ($PO_4^{3-}$) a aminoácidos como la serina, treonina o tirosina.Enzimas: Las encargadas son las quinasas (lo pegan) y las fosfatasas (lo quitan).Función: Activa o desactiva proteínas al instante. Es la base de la comunicación celular (señalización).2. Acetilación y Metilación: El Control del ADNAunque ocurren en muchas proteínas, son famosísimas en las histonas (las proteínas que enrollan el ADN).Acetilación: Generalmente "abre" la cromatina para que los genes se puedan leer.Metilación: Puede activar o silenciar genes dependiendo de dónde se pegue.3. Hidroxilación: La Fuerza del ColágenoConsiste en añadir un grupo hidroxilo ($-OH$).Ejemplo médico: Es vital para el colágeno. Sin vitamina C (cofactor), la enzima no puede hidroxilar la prolina del colágeno, las fibras no se trenzan bien y da escorbuto (encías sangrantes, debilidad).4. Carboxilación: La Coagulación SangrientaSe añade un grupo carboxilo ($-COOH$).Ejemplo médico: Es esencial para los factores de coagulación (como la protrombina). Este proceso necesita vitamina K. Si no hay carboxilación, los factores no pueden unirse al calcio y no puedes dejar de sangrar.

    1. 278Dædalus, the Journal of the American Academy of Arts & SciencesThe Turing Trap: The Promise & Peril of Human-Like Artificial IntelligenceIn 1988, robotics researcher Hans Moravec noted that “it is comparatively easyto make computers exhibit adult level performance on intelligence tests or play-ing checkers, and difficult or impossible to give them the skills of a one-year-oldwhen it comes to perception and mobility.”33 But I would argue that in many do-mains, Moravec was not nearly ambitious enough. It is often comparatively easierfor a machine to achieve superhuman performance in new domains than to matchordinary humans in the tasks they do regularly.Humans have evolved over millions of years to be able to comfort a baby, nav-igate a cluttered forest, or pluck the ripest blueberry from a bush. These tasksare difficult if not impossible for current machines. But machines excel when itcomes to seeing X-rays, etching millions of transistors on a fragment of silicon, orscanning billions of webpages to find the most relevant one. Imagine how feebleand limited our technology would be if past engineers set their sights on merelymatching human-levels of perception, actuation, and cognition.Augmenting humans with technology opens an endless frontier of new abili-ties and opportunities. The set of tasks that humans and machines can do togetheris undoubtedly much larger than those humans can do alone (Figure 1). Machinescan perceive things that are imperceptible to humans, they can act on objects inways that no human can, and, most intriguingly, they can comprehend things thatare incomprehensible to the human brain. As Demis Hassabis, CEO of DeepMind,put it, the AI system “doesn’t play like a human, and it doesn’t play like a program.It plays in a third, almost alien, way . . . it’s like chess from another dimension.”34Computer scientist Jonathan Schaeffer explains the source of its superiority: “I’mabsolutely convinced it’s because it hasn’t learned from humans.”35 More funda-mentally, inventing tools that augment the process of invention itself promises toexpand not only our collective abilities, but to accelerate the rate of expansion ofthose abilities.What about businesspeople? They often find that substituting machinery forhuman labor is the low-hanging fruit of innovation. The simplest approach is toimplement plug-and-play automation: swap in a piece of machinery for each taska human is currently doing. That mindset reduces the need for more radical chang-es to business processes.36 Task-level automation reduces the need to understandsubtle interdependencies and creates easy A-B tests, by focusing on a known taskwith easily measurable performance improvement.Similarly, because labor costs are the biggest line item in almost every company’sbudget, automating jobs is a popular strategy for managers. Cutting costs–whichcan be an internally coordinated effort–is often easier than expanding markets.Moreover, many investors prefer “scalable” business models, which is often a syn-onym for a business that can grow without hiring and the complexities that entails.But here again, when businesspeople focus on automation, they often set outto achieve a task that is both less ambitious and more difficult than it need be.151 (2) Spring 2022279Erik BrynjolfssonTo understand the limits of substitution-oriented automation, consider a thoughtexperiment. Imagine that our old friend Dædalus had at his disposal an extreme-ly talented team of engineers 3,500 years ago and built human-like machines thatfully automated every work-related task that his fellow Greeks were doing.9 Herding sheep? Automated.9 Making clay pottery? Automated.9 Weaving tunics? Automated.9 Repairing horse-drawn carts? Automated.9 Incense and chanting for victims of disease? Automated.The good news is that labor productivity would soar, freeing the ancientGreeks for a life of leisure. The bad news is that their living standards and healthoutcomes would come nowhere near matching ours. After all, there is only somuch value one can get from clay pots and horse-drawn carts, even with unlimit-ed quantities and zero prices.In contrast, most of the value that our economy has created since ancient timescomes from new goods and services that not even the kings of ancient empireshad, not from cheaper versions of existing goods.37 In turn, myriad new tasks areFigure 1Opportunities for Augmenting Humans Are Far Greater thanOpportunities to Automate Existing TasksNew Tasks ThatHumans Can Do withthe Help of MachinesTasks ThatHumans Can DoHuman TasksThat MachinesCould Automate280Dædalus, the Journal of the American Academy of Arts & SciencesThe Turing Trap: The Promise & Peril of Human-Like Artificial Intelligencerequired: fully 60 percent of people are now employed in occupations that did notexist in 1940. 38 In short, automating labor ultimately unlocks less value than aug-menting it to create something new.At the same time, automating a whole job is often brutally difficult. Every jobinvolves multiple different tasks, including some that are extremely challengingto automate, even with the cleverest technologies. For example, AI may be able toread mammograms better than a human radiologist, but it is not very good at theother twenty-six tasks associated with the job, according to O-NET, such as com-forting a concerned patient or coordinating on a care plan with other doctors.39My work with Tom Mitchell and Daniel Rock on the suitability for machine learn-ing analyzed 950 distinct occupations. We found that machines could perform atleast some tasks in most occupations, but zero in which machine learning coulddo 100 percent of the tasks.40The same principle applies to the more complex production systems that in-volve multiple people working together.41 To be successful, firms typically need toadopt a new technology as part of a system of mutually reinforcing organizationalchanges. 42 Consider another thought experiment: Imagine if Jeff Bezos had “au-tomated” existing bookstores by simply replacing all the human cashiers with ro-bot cashiers. That might have cut costs a bit, but the total impact would have beenmuted. Instead, Amazon reinvented the concept of a bookstore by combining hu-mans and machines in a novel way. As a result, they offer vastly greater productselection, ratings, reviews, and advice, and enable 24/7 retail access from the com-fort of customers’ homes. The power of the technology was not in automating thework of humans in the existing retail bookstore concept but in reinventing andaugmenting how customers find, assess, purchase, and receive books and, in turn,other retail goods.Third, policy-makers have also often tilted the playing field toward automat-ing human labor rather than augmenting it. For instance, the U.S. tax code cur-rently encourages capital investment over investment in labor through effectivetax rates that are much higher on labor than on plants and equipment.43Consider a third thought experiment: Two potential ventures each use AI tocreate $1 billion of profits. If one of them achieves this by augmenting and em-ploying a thousand workers, the firm will owe corporate and payroll taxes, whilethe employees will pay income taxes, payroll taxes, and other taxes. If the secondbusiness has no employees, the government may collect the same corporate taxes,but no payroll taxes and no taxes paid by workers. As a result, the second businessmodel pays far less in total taxes.This disparity is amplified because the tax code treats labor income moreharshly than capital income. In 1986, top tax rates on capital income and laborincome were equalized in the United States, but since then, successive changeshave created a large disparity, with the 2021 top marginal federal tax rates on labor151 (2) Spring 2022281Erik Brynjolfssonincome of 37 percent, while long capital gains have a variety of favorable rules, in-cluding a lower statutory tax rate of 20 percent, the deferral of taxes until capitalgains are realized, and the “step-up basis” rule that resets capital gains to zero,wiping out the associated taxes, when assets are inherited.The first rule of tax policy is simple: you tend to get less of whatever you tax.Thus, a tax code that treats income that uses labor less favorably than income de-rived from capital will favor automation over augmentation. Treating both busi-ness models equally would lead to more balanced incentives. In fact, given thepositive externalities of more widely shared prosperity, a case could be made fortreating wage income more favorably than capital income, for instance by expand-ing the earned income tax credit.44 It is unlikely that any government official candefine in advance exactly which technologies and innovations augment humansrather than merely substitute for them; indeed, most technologies have elementsof each and the outcome depends a great deal on how they are deployed. Thus,rather than prescribe or proscribe specific technologies, a broad-based set of in-centives can gently nudge technologists and managers toward augmentation onthe margin, much as carbon taxes encourage myriad types of cleaner energy orresearch and development tax credits encourage greater investments in research.Government policy in other areas could also do more to steer the economy clearof the Turing Trap. The growing use of AI, even if only for complementing work-ers, and the further reinvention of organizations around this new general-purposetechnology imply a great need for worker training or retraining. In fact, for eachdollar spent on machine learning technology, companies may need to spend ninedollars on intangible human capital.45 However, education and training sufferfrom a serious externality issue: companies that incur the costs to train or retrainworkers may reap only a fraction of the benefits of those investments, with therest potentially going to other companies, including competitors, as these work-ers are free to bring their skills to their new employers. At the same time, work-ers are often cash- and credit-constrained, limiting their ability to invest in theirown skills development. 46 This implies that government policy should directlyprovide education and training or provide incentives for corporate training thatoffset the externalities created by labor mobility. 47In sum, the risks of the Turing Trap are increased not by just one group in oursociety, but by the misaligned incentives of technologists, businesspeople, andpolicy-makers.T he future is not preordained. We control the extent to which AI either ex-pands human opportunity through augmentation or replaces humansthrough automation. We can work on challenges that are easy for ma-chines and hard for humans, rather than hard for machines and easy for humans.The first option offers the opportunity of growing and sharing the economic pie282Dædalus, the Journal of the American Academy of Arts & SciencesThe Turing Trap: The Promise & Peril of Human-Like Artificial Intelligenceby augmenting the workforce with tools and platforms. The second option risksdividing the economic pie among an ever-smaller number of people by creatingautomation that displaces ever-more types of workers.While both approaches can and do contribute to productivity and progress,technologists, businesspeople, and policy-makers have each been putting a fingeron the scales in favor of replacement. Moreover, the tendency of a greater concen-tration of technological and economic power to beget a greater concentration ofpolitical power risks trapping a powerless majority into an unhappy equilibrium:the Turing Trap.The backlash against free trade offers a cautionary tale. Economists have longargued that free trade and globalization tend to grow the economic pie through thepower of comparative advantage and specialization. They have also acknowledgedthat market forces alone do not ensure that every person in every country willcome out ahead. So they proposed a grand bargain: maximize free trade to max-imize wealth creation and then distribute the benefits broadly to compensate anyinjured occupations, industries, and regions. It has not worked as they had hoped.As the economic winners gained power, they reneged on the second part of the bar-gain, leaving many workers worse off than before.48 The result helped fuel a popu-list backlash that led to import tariffs and other barriers to free trade. Economistswept.Some of the same dynamics are already underway with AI. More and moreAmericans, and indeed workers around the world, believe that while the technolo-gy may be creating a new billionaire class, it is not working for them. The more tech-nology is used to replace rather than augment labor, the worse the disparity may be-come, and the greater the resentments that feed destructive political instincts andactions. More fundamentally, the moral imperative of treating people as ends, andnot merely as means, calls for everyone to share in the gains of automation.The solution is not to slow down technology, but rather to eliminate or reversethe excess incentives for automation over augmentation. A good start would be toreplace the Turing Test, and the mindset it embodies, with a new set of practicalbenchmarks that steer progress toward AI-powered systems that exceed anythingthat could be done by humans alone. In concert, we must build political and eco-nomic institutions that are robust in the face of the growing power of AI. We canreverse the growing tech backlash by creating the kind of prosperous society thatinspires discovery, boosts living standards, and offers political inclusion for ev-eryone. By redirecting our efforts, we can avoid the Turing Trap and create pros-perity for the many, not just the few.151 (2) Spring 2022283Erik Brynjolfssonauthor’s noteThe core ideas in this essay were inspired by a series of conversations with JamesManyika and Andrew McAfee. I am grateful for valuable comments and sugges-tions on this work from Matt Beane, Seth Benzell, Avi Goldfarb, Katya Klinova, Ale-na Kykalova, Gary Marcus, Andrea Meyer, Dana Meyer, and numerous participantsat seminars at the Stanford Digital Economy Lab and the University of TorontoCreative Destruction Lab, but they should not be held responsible for any errors oropinions in the essay.about the authorErik Brynjolfsson is the Jerry Yang and Akiko Yamazaki Professor and SeniorFellow at the Institute for Human-Centered AI and Director of the Digital Econ-omy Lab at Stanford University. He is also the Ralph Landau Senior Fellow at theInstitute for Economic Policy Research and Professor by Courtesy at the Gradu-ate School of Business and Department of Economics at Stanford University; and aResearch Associate at the National Bureau of Economic Research. He is the authoror coauthor of seven books, including Machine, Platform, Crowd: Harnessing Our Digi-tal Future (2017), The Second Machine Age: Work, Progress, and Prosperity in a Time of Bril-liant Technologies (2014), and Race against the Machine: How the Digital Revolution Is Acceler-ating Innovation, Driving Productivity, and Irreversibly Transforming Employment and the Econ-omy (2011) with Andrew McAfee, and Wired for Innovation: How Information TechnologyIs Reshaping the Economy (2009) with Adam Saunders.endnotes1 Alan Turing, “Computing Machinery and Intelligence,” Mind 59 (236): 433–460, https://doi.org/10.1093/mind/LIX.236.433. An earlier articulation of this test comes from Des-cartes in The Discourse, in which he wrote,If there were machines which bore a resemblance to our bodies and imitated ouractions as closely as possible for all practical purposes, we should still have twovery certain means of recognizing that they were not real men. The first is thatthey could never use words, or put together signs, as we do in order to declare ourthoughts to others. . . . Secondly, even though some machines might do some thingsas well as we do them, or perhaps even better, they would inevitably fail in others,which would reveal that they are acting not from understanding.2 Carolyn Price, “Plato, Opinions and the Statues of Daedalus,” OpenLearn, updatedJune 19, 2019, https://www.open.edu/openlearn/history-the-arts/philosophy/plato-opinions-and-the-statues-daedalus; and Andrew Stewart, “The Archaic Period,” PerseusDigital Library, http://www.perseus.tufts.edu/hopper/text?doc=Perseus:text:1999.04.0008:part=2:chapter=1&highlight=daedalus.3 “The Origin of the Word ‘Robot,’” Science Friday, April 22, 2011, https://www.sciencefriday.com/segments/the-origin-of-the-word-robot/.4 Millions of people are now working alongside robots. For a recent survey on the diffusionof robots, AI, and other advanced technologies in the United States, see Nikolas Zolas,284Dædalus, the Journal of the American Academy of Arts & SciencesThe Turing Trap: The Promise & Peril of Human-Like Artificial IntelligenceZachary Kroff, Erik Brynjolfsson, et al., “Advanced Technologies Adoption and Useby U.S. Firms: Evidence from the Annual Business Survey,” NBER Working Paper No.28290 (Cambridge, Mass.: National Bureau of Economic Research, 2020).5 Apologies to Arthur C. Clarke.6 See, for example, Daniel Zhang, Saurabh Mishra, Erik Brynjolfsson, et al., “The AI Index2021 Annual Report,” arXiv (2021), esp. chap. 2, https://arxiv.org/abs/2103.06312. Inregard to image recognition, see, for instance, the success of image recognition systemsin Olga Russakovsky, Jia Deng, Hao Su, et al., “Imagenet Large Scale Visual Recogni-tion Challenge,” International Journal of Computer Vision 115 (3) (2015): 211–252. A broadarray of business application is discussed in Erik Brynjolfsson and Andrew McAfee,“The Business of Artificial Intelligence,” Harvard Business Review (2017): 3–11.7 See, for example, Hubert Dreyfus, What Computers Can’t Do (Cambridge, Mass.: MIT Press,1972); Nils J. Nilsson, “Human-Level Artificial Intelligence? Be Serious!” AI Magazine26 (4) (2005): 68; and Gary Marcus, Francesca Rossi, and Manuela Veloso, “Beyondthe Turing Test,” AI Magazine 37 (1) (2016): 3–4.8 Nilsson, “Human-Level Artificial Intelligence?” 68.9 John Searle was the first to use the terms strong AI and weak AI, writing that with weak AI,“the principal value of the computer . . . is that it gives us a very powerful tool,” whilestrong AI “really is a mind.” Ed Feigenbaum has argued that creating such intelligenceis the “manifest destiny” of computer science. John R. Searle, “Minds, Brains, and Pro-grams,” Behavioral and Brain Sciences 3 (3) (1980): 417–457.10 However, this does not necessarily mean living standards would rise without bound.In fact, if working hours fall faster than productivity rises, it is theoretically possible,though empirically unlikely, that output and consumption (other than leisure time)would fall.11 See, for example, Robert M. Solow, “A Contribution to the Theory of Economic Growth,”The Quarterly Journal of Economics 70 (1) (1956): 65–94.12 See, for example, Daron Acemoglu, “Directed Technical Change,” Review of EconomicStudies 69 (4) (2002): 781–809.13 See, for instance, Erik Brynjolfsson and Andrew McAfee, Race Against the Machine: Howthe Digital Revolution Is Accelerating Innovation, Driving Productivity, and Irreversibly TransformingEmployment and the Economy (Lexington, Mass.: Digital Frontier Press, 2011); and DaronAcemoglu and Pascual Restrepo, “The Race Between Machine and Man: Implicationsof Technology for Growth, Factor Shares, and Employment,” American Economic Review108 (6) (2018): 1488–1542.14 For instance, the real wage of a building laborer in Great Britain is estimated to havegrown from sixteen times the amount needed for subsistence in 1820 to 167 times thatlevel by the year 2000, according to Jan Luiten Van Zanden, Joerg Baten, Marco Mirad’Ercole, et al., eds., How Was Life? Global Well-Being since 1820 (Paris: OECD Publishing,2014).15 For instance, a majority of aircraft on U.S. Navy aircraft carriers are likely to be un-manned. See Oriana Pawlyk, “Future Navy Carriers Could Have More Drones ThanManned Aircraft, Admiral Says,” Military.com, March 30, 2021. Similarly, companieslike Kittyhawk have developed pilotless aircraft (“flying cars”) for civilian passengers.151 (2) Spring 2022285Erik Brynjolfsson16 Loukas Karabarbounis and Brent Neiman, “The Global Decline of the Labor Share,” TheQuarterly Journal of Economics 129 (1) (2014): 61–103; and David Autor, “Work of the Past,Work of the Future,” NBER Working Paper No. 25588 (Cambridge, Mass.: National Bu-reau of Economic Research, 2019). For a broader survey, see Morgan R. Frank, DavidAutor, James E. Bessen, et al., “Toward Understanding the Impact of Artificial Intelli-gence on Labor,” Proceedings of the National Academy of Sciences 116 (14) (2019): 6531–6539.17 Daron Acemoglu and David Autor, “Skills, Tasks and Technologies: Implications forEmployment and Earnings,” Handbook of Labor Economics 4 (2011): 1043–1171.18 Seth G. Benzell and Erik Brynjolfsson, “Digital Abundance and Scarce Architects:Implications for Wages, Interest Rates, and Growth,” NBER Working Paper No. 25585(Cambridge, Mass.: National Bureau of Economic Research, 2021).19 Prasanna Tambe, Lorin Hitt, Daniel Rock, and Erik Brynjolfsson, “Digital Capital andSuperstar Firms,” Hutchins Center Working Paper #73 (Washington, D.C.: HutchinsCenter at Brookings, 2021), https://www.brookings.edu/research/digital-capital-and-superstar-firms.20 There is some evidence that capital is already becoming an increasingly good substitutefor labor. See, for instance, the discussion in Michael Knoblach and Fabian Stöckl,“What Determines the Elasticity of Substitution between Capital and Labor? A Litera-ture Review,” Journal of Economic Surveys 34 (4) (2020): 852.21 See, for example, Tyler Cowen, Average Is Over: Powering America beyond the Age of the GreatStagnation (New York: Penguin, 2013). Or more provocatively, Yuval Noah Harari,“The Rise of the Useless Class,” Ted Talk, February 24, 2017, https://ideas.ted.com/the-rise-of-the-useless-class/.22 Anton Korinek and Joseph E. Stiglitz, “Artificial Intelligence and Its Implications for In-come Distribution and Unemployment,” in The Economics of Artificial Intelligence, ed. AjayAgrawal, Joshua Gans, and Avi Goldfarb (Chicago: University of Chicago Press, 2019),349–390.23 Erik Brynjolfsson and Andrew McAfee, “Artificial Intelligence, for Real,” Harvard BusinessReview, August 7, 2017.24 Robert D. Putnam, Our Kids: The American Dream in Crisis (New York: Simon and Schuster,2016) describes the negative effects of joblessness, while Anne Case and Angus Deaton,Deaths of Despair and the Future of Capitalism (Princeton, N.J.: Princeton University Press,2021) documents the sharp decline in life expectancy among many of the same people.25 Simon Smith Kuznets, Economic Growth and Structure: Selected Essays (New York: W. W.Norton & Co., 1965).26 Friedrich August Hayek, “The Use of Knowledge in Society,” The American Economic Review35 (4) (1945): 519–530.27 Erik Brynjolfsson, “Information Assets, Technology and Organization,” ManagementScience 40 (12) (1994): 1645–1662, https://doi.org/10.1287/mnsc.40.12.1645.28 For instance, in the year 2000, an estimated 85 billion (mostly analog) photos were tak-en, but by 2020, that had grown nearly twenty-fold to 1.4 trillion (almost all digital)photos.286Dædalus, the Journal of the American Academy of Arts & SciencesThe Turing Trap: The Promise & Peril of Human-Like Artificial Intelligence29 Andrew Ng, “What Data Scientists Should Know about Deep Learning,” speech pre-sented at Extract Data Conference, November 24, 2015, https://www.slideshare.net/ExtractConf/andrew-ng-chief-scientist-at-baidu (accessed September 9, 2021).30 Sanford J. Grossman and Oliver D. Hart, “The Costs and Benefits of Ownership: A The-ory of Vertical and Lateral Integration,” Journal of Political Economy 94 (4) (1986): 691–719; and Oliver D. Hart and John Moore, “Property Rights and the Nature of the Firm,”Journal of Political Economy 98 (6) (1990): 1119–1158.31 Erik Brynjolfsson and Andrew Ng, “Big AI Can Centralize Decisionmaking and Power.And That’s a Problem,” MILA-UNESCO Working Paper (Montreal: MILA-UNESCO,2021).32 “Simon Electronic Brain–Complete History of the Simon Computer,” History Com-puter, January 4, 2021, https://history-computer.com/simon-electronic-brain-complete-history-of-the-simon-computer/.33 Hans Moravec, Mind Children: The Future of Robot and Human Intelligence (Cambridge,Mass.: Harvard University Press, 1988).34 Will Knight, “Alpha Zero’s ‘Alien’ Chess Shows the Power, and the Peculiarity, of AI,”Technology Review, December 2017.35 Richard Waters, “Techmate: How AI Rewrote the Rules of Chess,” Financial Times, Janu-ary 12, 2018.36 Matt Beane and Erik Brynjolfsson, “Working with Robots in a Post-Pandemic World,”MIT Sloan Management Review 62 (1) (2020): 1–5.37 Timothy Bresnahan and Robert J. Gordon, “Introduction,” The Economics of New Goods(Chicago: University of Chicago Press, 1996).38 David Autor, Anna Salomons, and Bryan Seegmiller, “New Frontiers: The Origins andContent of New Work, 1940–2018,” NBER Preprint, July 26, 2021.39 David Killock, “AI Outperforms Radiologists in Mammographic Screening,” NatureReviews Clinical Oncology 17 (134) (2020), https://doi.org/10.1038/s41571-020-0329-7.40 Erik Brynjolfsson, Tom Mitchell, and Daniel Rock, “What Can Machines Learn, andWhat Does It Mean for Occupations and the Economy?” AEA Papers and Proceedings(2018): 43–47.41 Erik Brynjolfsson, Daniel Rock, and Prasanna Tambe, “How Will Machine LearningTransform the Labor Market?” Governance in an Emerging New World (619) (2019), https://www.hoover.org/research/how-will-machine-learning-transform-labor-market.42 Paul Milgrom and John Roberts, “The Economics of Modern Manufacturing: Technol-ogy, Strategy, and Organization,” American Economic Review 80 (3) (1990): 511–528.43 See Daron Acemoglu, Andrea Manera, and Pascual Restrepo, “Does the U.S. Tax CodeFavor Automation?” Brookings Papers on Economic Activity (Spring 2020); and Daron Ace-moglu, ed., Redesigning AI (Cambridge, Mass.: MIT Press, 2021).44 This reverses the classic result suggesting that taxes on capital should be lower than taxeson labor. Christophe Chamley, “Optimal Taxation of Capital Income in General Equi-librium with Infinite Lives,” Econometrica 54 (3) (1986): 607–622; and Kenneth L. Judd,“Redistributive Taxation in a Simple Perfect Foresight Model,” Journal of Public Econom-ics 28 (1) (1985): 59–83.151 (2) Spring 2022287Erik Brynjolfsson45 Tambe et al., “Digital Capital and Superstar Firms.”46 Katherine S. Newman, Chutes and Ladders: Navigating the Low-Wage Labor Market (Cam-bridge, Mass.: Harvard University Press, 2006).47 While the distinction between complements and substitutes is clear in economic theory,it can be trickier in practice. Part of the appeal of broad training and/or tax incentives,rather than specific technology mandates or prohibitions, is that they allow technol-ogies, entrepreneurs, and, ultimately, the market to reward approaches that augmentlabor rather than replace it.48 See David H. Autor, David Dorn, and Gordon H. Hanson, “The China Shock: Learningfrom Labor-Market Adjustment to Large Changes in Trade,” Annual Review of Economics8 (2016): 205–240.
    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      In this manuscript, the authors investigate the role of the tubulin polyglutamylase TTLL6 in maintaining colonic epithelial homeostasis and its potential role in colorectal cancer (CRC). Using transcriptomic analyses, mouse genetics, histology, and proteomics, the authors report that TTLL6 is highly expressed in colonic epithelial cells and decreases during CRC progression. Constitutive and epithelial-specific deletion of Ttll6 in mice leads to elongated colonic crypts, expansion of proliferative and stem cell compartments, and increased susceptibility to chemically induced colitis-associated carcinogenesis. Mechanistically, the authors identify the nucleic acid-binding protein PurA as a potential non-tubulin substrate of TTLL6. They propose that TTLL6-mediated polyglutamylation of PurA regulates its nuclear localization, thereby contributing to epithelial homeostasis in the colon. Together, the study suggests a TTLL6-PurA axis that may restrain early colorectal tumorigenesis.

      Major comments

      1. Evidence that PurA is a physiologically relevant TTLL6 substrate remains incomplete. A central conclusion of the manuscript is that PurA is a substrate of TTLL6 whose polyglutamylation regulates nuclear localization. While the authors present several lines of evidence (PolyE immunoprecipitation, co-transfection experiments, and mutagenesis of the PurA C-terminal glutamate residues), the physiological relevance of this modification remains somewhat indirect. For example, polyglutamylation of endogenous PurA in colonic epithelial cells is inferred but not directly demonstrated. The PolyE antibody detects glutamate chains but does not identify the specific modified protein in tissue. Direct evidence that PurA is polyglutamylated in vivo (e.g., MS identification of the modification site on PurA or PurA immunoprecipitation followed by PolyE detection) would strengthen the mechanistic claim. At present, the data convincingly show that TTLL6 can glutamylate PurA in an overexpression system, but the endogenous modification remains less clearly demonstrated.
      2. Mechanistic link between PurA localization and the epithelial phenotype is not established. The authors propose that loss of TTLL6 disrupts PurA nuclear localization and thereby alters epithelial homeostasis. However, the manuscript does not establish a causal relationship between PurA localization and the observed crypt phenotypes. Specifically, it is not shown whether PurA loss phenocopies Ttll6 deficiency in the colon. No experiments test whether restoring nuclear PurA rescues the Ttll6 phenotype. Downstream transcriptional or signaling pathways regulated by PurA are not explored. Thus, while the TTLL6-PurA relationship is intriguing, the study remains largely correlative with respect to functional consequences.
      3. Interpretation of the tumorigenesis data should be tempered. The authors conclude that Ttll6 deficiency promotes colon carcinogenesis. However, the tumor data appear somewhat limited. Increased tumor numbers are reported only at an early time point (day 40) and are described as a trend toward significance. By day 70, tumor numbers and sizes appear comparable between groups. The increased incidence of vimentin-positive crypts is interesting but does not clearly establish increased tumor burden. Given these results, the conclusion that TTLL6 restrains tumorigenesis may be stronger than supported by the data. The authors may wish to frame this as enhanced early tumor development or altered tumor progression rather than increased tumorigenesis per se.
      4. Expansion of multiple epithelial cell populations requires clarification. The authors report that Ttll6-deficient colons exhibit expansion of stem/progenitor compartments as well as increased numbers of differentiated cells (e.g., goblet cells and enterocytes). While these findings are interesting, the biological interpretation is somewhat unclear. For example, expansion of stem/progenitor compartments typically accompanies reduced differentiation rather than increased differentiation. It is not clear whether the increased numbers of differentiated cells reflect overall crypt enlargement or altered lineage allocation. Quantification of cell-type proportions rather than absolute cell numbers would help clarify whether differentiation programs are altered.
      5. Nuclear polyglutamylation requires further clarification The authors report nuclear PolyE staining in colonic epithelial cells and propose that this reflects polyglutamylation of non-tubulin substrates such as PurA. However, it is not clear whether other nuclear proteins could account for this signal. The specificity of the nuclear PolyE signal should be better validated. Additional controls (e.g., peptide competition or validation with alternative approaches) would strengthen the interpretation.

      Minor comments

      1. The manuscript would benefit from clearer distinction between tubulin vs non-tubulin glutamylation throughout the text.
      2. Some conclusions in the Discussion appear slightly overstated relative to the data (e.g., the role of the TTLL6-PurA axis in tumor suppression).
      3. The description of the Ttll6 mouse models (constitutive vs conditional deletion) could be clarified earlier in the Results section.
      4. Quantification methods for histological analyses (crypt length, cell counts, marker-positive cells) should be described in greater detail in the Methods.
      5. It would be useful to include representative images of PurA localization in control vs Ttll6-deficient colon tissue in the main figures.
      6. Several minor typographical issues appear throughout the manuscript and should be corrected during revision.

      Significance

      General assessment

      This study investigates the role of the polyglutamylase TTLL6 in intestinal epithelial biology and colorectal cancer. The identification of a potential non-tubulin substrate (PurA) is conceptually interesting and expands the emerging view that tubulin-modifying enzymes can regulate additional cellular proteins. The study combines mouse genetics, histological analysis, transcriptomic datasets, and proteomics, which together provide a substantial dataset supporting a role for TTLL6 in regulating crypt architecture and epithelial proliferation. However, the mechanistic link between TTLL6 activity, PurA modification, and epithelial homeostasis remains incompletely resolved. The tumorigenesis data also suggest only modest effects on carcinogenesis.

      Advance relative to previous literature

      Previous studies have linked members of the TTLL family primarily to microtubule regulation and ciliary biology. This work extends these findings by suggesting a tissue-specific function of TTLL6 in the colon, and the existence of non-tubulin substrates regulating epithelial biology. If further validated, the identification of PurA polyglutamylation could represent an interesting conceptual advance.

      The manuscript will likely be of interest to researchers working in cytoskeletal post-translational modifications, intestinal epithelial biology, colorectal cancer biology

      My expertise lies in cytoskeletal regulation, epithelial biology, and intestinal tissue organization, which are directly relevant to the central themes of this manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: In this study, the authors investigate novel functions of the tubulin typrosine ligase-like protein 6 (TTLL6), which covalently adds glutamate residues to the C-terminus of a given protein. The authors have already published previously on this topic. In the current study, the role of TTLL6 in colon function and pathologies was investigated. The study consists of two major parts. In the first part, a mouse model is used to show that TTLL6 is expressed at elevated levels and activity in epithelial cells of the colon. A database search indicated that TTL6 expression positively correlates with prognosis of patients with colorectal cancer. The authors generated a TTLL6 KO mouse and showed that induced tumor growth at 40 days was more positive for vimentin in the crypts of these mice, which should correlate with tumor aggressiveness. This difference was not observed anymore after 70 days. Morphological analyses of the crypts showed that in TTLL6 KO mice the crypts increased in length, a difference in proliferation markers, and a change of cell types in the crypts was observed.<br /> In the second part, the authors used a modification-specific antibody to immunoprecipitate (IP) proteins modified by TTLLs. To identify TTLL6-dependently modified target proteins, they compared these results with IPs from TTLL6 KO mice. A total of 43 proteins were identified this way. Because of their similarity to the tubuline tail sequence, two of the enriched proteins, PurA and PurB, were further analyzed. The authors provide evidence that PurA but not PurB is modified by TTLL6, which as a result changes its subcellular localization. While the first part of this work provides convincing novel insights into TTLL6's function with potential pathological relevance, the second part raises some concerns. I would therefore tend to rate the quality of the first part significantly higher than the second part.

      Major comments:

      1) When considering the results of the induced colorectal cancer test, the only significant difference between WT and KO was the moderately higher expression of Vimentin (figure 5E-F). Since this is the main evidence for a pathological relevance of TTLL6 in cancer, it is important to understand how the quantification of Vimentin in the complex tissue shown in figure 5E was done. A detailed description of how these images were analysed and perhaps a table with raw data would be essential to convince the reader of the conclusions. In the currently presented form, I find the analyses not too convincing.

      2) Figure 7A: It was somewhat surprising that two of the least significant (PurA is just below the cutoff) were used for further analyses. Although the authors explain that both proteins have strong sequence similarity to the know TTLL6 target, tubuline, the C-terminal, genomically encoded protein sequence of PurA and PurB already contain several glutamates. This raises the concern that the polyE antibody in the IP possibly detected the non-modified C-terminal tail of PurA and PurB and that both proteins may not be modified by TTLL6. Because of this and the lower significance than other candidates, the authors should consider focussing on other hits (OPTIONAL). Besides being much more significant, they lack an accumulation of glutamates in their C-terminus (at least the ones I looked at). Alternatively, the concern of having potentially IP-ed unmodified proteins should be addressed.

      3) Figure 8A: this figure compares PurA with a modified PurA that lacks the C-terminal EEE stretch. The authors conclude that the subcellular localization is different between both and that the nuclear localization of WT PURA must be due to modification by the co-expressed TTLL6. There are two major concerns with this conclusion: Firstly, the expression of PurA without TTLL6 co-expression is a missing essential control. This would show if PurA itself is already predominantly located in the nucleus regardless of potential modifications (PurA seems to have different nucleocytoplasmic localization in different cell types). Secondly, both depicted cells look very different. In PurA the nuclei are much smaller and the cytoplasm seems also much smaller than in the PurA DDD-expressing cells. Furthermore, IF staining without proper quantification of several cells seem less than ideal for such conclusions. In case, the authors want to convincingly validate this conclusion such a quantification with several cells would be required. OPTIONAL: an alternative approach would be a nucleo-cytoplasmic fractionation experiment followed by a western blot.

      Figure 8B: it seems that the contrast between the images of the upper and lower panel is very different. For this reason, I find it difficult to follow the conclusions. However, even when ignoring this aspect, I have great problems coming to the same conclusions as the authors.

      Minor comments:

      1) In figure 3A it would help if the legend describes what exactly "Control (+ or -)" means.

      2) In figure 3E-F, a label inside of the figure (what is the red bar, what the blue) would help the reader to faster grasp the subfigures.

      3) Figure 7C-D: these experiments are based on strong overexpression of TTLLs, which might result in unphysiological modifications of PurA. I would suggest to include a note of caution in the discussion that this is a possibility.

      4) In the discussion (page 9, last paragraph), it is stated: "Our findings suggest that the polyglutamylation of PurA is essential for maintaining colonic homeostasis". I do not understand this statement, as this study does not provide any evidence that modification of PurA does play a functional role in the colon (expression itself is not an evidence for function importance or even being "essential"). I recommend to remove this statement.

      5) Not all abbreviations are introduced properly (like CRC).

      Significance

      In general, this study addresses a very interesting aspect - i.e. the covalent addition of multiple glutamate residues to the C-terminus of a target gene by the enzyme TTLL6. The authors convincingly show that this protein regulates the morphology and composition of crypts in subregions of the colon. This is certainly a new and important finding that expands our knowledge about the functional breadth of this class of enzymes. If convincingly validated (see major concerns), also the pathological relevance of this enzyme for cancer progression would be of general interest. However, this statement has to be considered with a note of caution as this is not my area of expertise.

      The validation of novel targets of TTLL6 after IP is - at this stage of the manuscript - not very convincing to me. In particular the claim that PurA does play a functional role in the TTLL6-dependent regulation (of crypts) is not justified by the data. However, given that the list of other candidates contains several important gene regulators, this work might have the potential to open up to open up the field for new research directions.

      The reviewer's areas of expertise: cell biology, biochemistry, histology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This study shows that loss of TTLL6 affects colonic epithelial homeostasis (crypt architecture and proliferation/differentiation markers) and proposes that TTLL6 contributes to a nuclear glutamylation signal, with PurA presented as a candidate non-tubulin substrate. The authors also connect TTLL6 mRNA levels to human CRC progression and outcome. Overall, the observations are potentially interesting, but the manuscript currently does not establish a clear mechanistic link between TTLL6 activity, PurA, and the in vivo phenotype.

      Major comments:

      1. The "TTLL6-PurA link" framing is currently too strong. The pull-down data indicate multiple candidate substrates, and the study does not show that PurA is the key functional mediator of the epithelial phenotype. As written, the manuscript reads as though PurA is the central downstream effector, which is not yet supported. Requested change: Either add substrate-specific functional evidence (additional KO/rescue-type experiments) or soften the language throughout (title/abstract/discussion) to reflect that PurA is one candidate among several.
      2. PurA glutamylation should be demonstrated directly by MS. PolyE/GT335 immunoblotting and enrichment in PolyE pull-downs are suggestive, but they do not conclusively establish glutamylation of PurA at the C-terminal end (antibody specificity and/or co-precipitating glutamylated proteins remain possible explanations). Essential experiment: MS/MS identification of glutamylated residue(s) on PurA, ideally with evidence that the modification is TTLL6-dependent (WT vs KO or epithelial-inducible KO).
      3. Regional TTLL6 expression vs phenotype needs to be reconciled. TTLL6 expression is reported to be highest in distal colon, yet the strongest crypt-length phenotype appears in transverse colon (as presented). Proximal colon data are not shown in the main text. Requested revision: Provide complete regional analyses (proximal/transverse/distal) with consistent quantification and statistics, and discuss explicitly why TTLL6 expression levels and phenotype do (or do not) align.
      4. Several internal inconsistencies and missing statistics.
        • Fig. 1A vs 1B: CEC enrichment appears ~80-fold in panel A and ~4-fold in the panel B; if these reflect the same enrichment workflow, this discrepancy needs a clear explanation (normalization, starting material, ....).
        • Fig. 2A: statistics are missing.
        • Fig. 5D: the effects appear borderline; the conclusions should match the statistical support/significance. Requested revision: Ensure complete statistical reporting in the manuscript (n, definition of replicates, test used, p-values/thresholds) and avoid interpretive language where differences are not significant.
      5. PurA Localization claims would benefit from stronger imaging and quantification. For nuclear localization/redistribution conclusions (main Fig. 8 and related supplement), confocal imaging with Z-stacks (and orthogonal views) would be more convincing than representative single-plane images. In addition, conditions with PurA-only expression need a clear baseline description and quantification. Requested additions: confocal Z-stacks + blinded quantification of nuclear/cytosolic localization across replicates; ideally support with subcellular fractionation and quantitative immunoblotting.
      6. Overexpression artifacts should be considered more carefully. If TTLL6 has been described as an elongase in prior work (Mahalingan, NSMB, 2020, DOI: 10.1038/s41594-020-0462-0) high-level overexpression may generate non-physiological modifications or localization patterns. Requested revision: Soften conclusions drawn from overexpression experiments and provide appropriate expression controls and/or supportive evidence in more physiological settings.
      7. Mouse tumor data should be interpreted more cautiously relative to the human correlations. The human datasets suggest a correlation between TTLL6 mRNA levels and clinical features/outcome (including recurrence-free survival), which is potentially interesting. In contrast, the mouse CAC results appear modest/borderline and, in places, are interpreted as stronger evidence than the data support. Requested change: Avoid strong claims about TTLL6 promoting or suppressing tumor growth unless supported by robust, clearly significant differences and comprehensive burden metrics.

      Minor comments:

      • Every figure should clearly state n (biological vs technical), statistical test, and multiple-comparison correction where applicable.
      • Where effects are segment-specific, the text should reflect that specificity and avoid global statements.
      • The Discussion would benefit from a clearer separation of what is directly shown versus what is proposed (especially near the end).
      • TTLL6 expression is largely presented at the transcript level; it would help to make this explicit throughout and avoid wording that implies protein-level validation where it is not shown.

      Significance

      The manuscript has the potential to be of interest because it points to a possible role for TTLL6 in non-tubulin, nuclear glutamylation in the intestinal epithelium, and it links TTLL6 expression to human CRC datasets. At present, however, the broader impact is limited by (i) insufficient direct evidence that PurA is glutamylated in vivo and (ii) the lack of a causal connection between PurA and the epithelial phenotype. In addition, while the human data show correlations between TTLL6 expression and clinical parameters/outcome, the mouse CAC phenotype appears comparatively modest/borderline and should be interpreted with appropriate caution. With stronger biochemical validation (MS), improved localization quantification, and more restrained framing (or additional functional data), the work could appeal to readers in intestinal epithelial biology, post-translational modification biology, and CRC research.

      Expertise: enzymology; post-translational modifications; microscopy; cancer mechanisms.

    1. Rapport de Synthèse : Le Traitement Judiciaire de l’Inceste Parental sur Mineurs

      Résumé Exécutif

      Ce document synthétise les témoignages et analyses de Mme Hélène Romano (psychologue et docteure en droit) et de Mme Eugénie Izard (pédopsychiatre) lors de leur audition par la commission d'enquête sur l'inceste.

      Le constat est celui d'un échec structurel du système français : seuls 8 % des enfants ayant révélé des faits d'inceste sont effectivement protégés, tandis que 95 % des procédures sont classées sans suite.

      Les points critiques identifiés sont :

      • L'institutionnalisation du doute : Un basculement idéologique post-affaire d'Outreau qui érige l'enfant en menteur de principe et le parent protecteur en manipulateur.

      • L'absence de méthodologie scientifique : L'inexistence d'outils standardisés (comme le protocole SVA) pour évaluer la crédibilité de la parole de l'enfant.

      • Le rôle délétère de l'Ordre des Médecins : Une institution accusée de réduire les médecins au silence et de sanctionner ceux qui signalent des maltraitances.

      • La primauté de la famille sur l'enfant : Une sacralisation de la "coparentalité" et du lien biologique qui contraint les enfants à maintenir des liens avec leurs agresseurs présumés.

      --------------------------------------------------------------------------------

      1. Un Système Structurellement Défaillant

      L'inceste n'est pas une accumulation de faits divers conjoncturels, mais un phénomène systémique et organisé.

      Malgré une libération de la parole médiatique, les statistiques révèlent une impunité persistante.

      Chiffres clés du traitement de l'inceste en France

      | Indicateur | Donnée | | --- | --- | | Enfants victimes de violences sexuelles par an | 160 000 | | Part des violences étant d'ordre incestueux | 80 % | | Taux de protection des enfants après révélation | 8 % | | Taux de procédures classées sans suite | 95 % | | Condamnations annuelles | Environ 1 000 |

      Le système actuel est décrit comme une "inversion perverse" où l'enfant qui parle n'est pas entendu et se voit souvent remis à la garde du parent désigné comme agresseur.

      --------------------------------------------------------------------------------

      2. L'Idéologie du Doute et l'Inversion de la Culpabilité

      Depuis l'affaire d'Outreau, une "idéologie du mensonge" imprègne les institutions judiciaires et médico-sociales.

      La disqualification du parent protecteur

      Majoritairement des mères (95 % des auteurs d'inceste étant des hommes), les parents protecteurs subissent un processus de disqualification en plusieurs étapes :

      • Disqualification psychologique : Mère jugée "trop fusionnelle", fatiguée ou déprimée.

      • Biais de confirmation : L'épuisement de la mère face au système est utilisé pour confirmer son instabilité psychique.

      • Accusations de pathologie : Recours au Syndrome d'Aliénation Parentale (SAP) ou au Syndrome de Münchhausen par procuration pour accuser la mère de manipuler l'enfant.

      Note : Le SAP n'est pas reconnu par le DSM et son usage est proscrit par plusieurs instances internationales, mais il reste largement utilisé pour discréditer les signalements.

      Les conséquences pour l'enfant

      L'enfant est souvent réduit à l'état d'objet.

      S'il ne parle pas (contexte de terreur, handicap, bas âge), son silence est utilisé contre lui.

      S'il parle, il subit une "sur-violence" institutionnelle : répétition épuisante des récits (parfois plus de 20 fois), confrontations traumatisantes et absence de protection.

      --------------------------------------------------------------------------------

      3. Défaillances de l'Expertise et Absence de Méthodologie

      La France souffre d'un manque criant de professionnels formés spécifiquement au psychotrauma de l'enfant et à l'évaluation de sa parole.

      • Le "Chifoumi" judiciaire : En l'absence de méthodes validées, les décisions sont souvent le fruit de biais cognitifs et de stéréotypes (ex: "les enfants mentent", "l'inceste n'existe pas").

      • L'outil SVA (State Validity Assessment) : Cette méthode, utilisée à l'étranger mais proscrite en France après Outreau, repose sur 19 critères permettant de distinguer un récit vécu d'un récit fabriqué (cohérence, détails atypiques, dialogues rapportés).

      • Incompétence des experts : Des experts psychiatres d'adultes sont parfois mandatés pour des enfants sans avoir de pratique clinique pédiatrique, rendant des rapports basés sur des entretiens de quelques minutes.

      --------------------------------------------------------------------------------

      4. L'Ordre des Médecins : Un Obstacle à la Protection

      Le témoignage du Dr Izard met en lumière une "silenciation" des médecins par leur propre institution.

      • Pressions et sanctions : Des médecins sont interdits d'exercer pour avoir signalé des enfants en danger, l'Ordre invoquant souvent "l'immixtion dans les affaires de famille".

      • Entrave à la loi : Bien que la loi de 2015 dégage les médecins de leur responsabilité en cas de signalement de bonne foi, l'Ordre est accusé d'ignorer la loi pénale pour sanctionner disciplinairement les praticiens.

      • Résultat : Seul 1 % des signalements de violences sexuelles incestueuses proviennent de médecins, par crainte de représailles ordinales.

      --------------------------------------------------------------------------------

      5. La Sacralisation du Lien Familial et de la Coparentalité

      Le système français privilégie le maintien du lien familial au détriment de la sécurité de l'enfant.

      • Le paradigme de la réparation : L'enfant est souvent utilisé comme un "outil de réparation" pour le parent agresseur via des visites médiatisées forcées, au nom de la coparentalité.

      • Le conflit vs la violence : Le terme "conflit parental" est abusivement utilisé pour invisibiliser les violences sexuelles.

      Or, il ne peut y avoir de coparentalité dans une situation d'inceste.

      • La violence judiciaire : Le cas de Mme Romano illustre l'extrême violence subie par les parents protecteurs : gardes à vue humiliantes, perquisitions, ruine financière (frais d'avocats s'élevant à plusieurs centaines de milliers d'euros) et condamnations pour "non-représentation d'enfant" même face à des preuves médicales de violences.

      --------------------------------------------------------------------------------

      6. Pistes de Réformes et Recommandations

      Les intervenantes proposent plusieurs leviers pour transformer radicalement la prise en charge :

      Réformes Procédurales et Judiciaires

      • Création d'un crime d'inceste spécifique dans le code pénal, intégrant la notion de domination et de crime contre l'humanité de l'enfant.

      • Reconnaissance de l'imprescriptibilité pour les crimes sexuels sur mineurs.

      • Priorisation de l'ordonnance de protection immédiate dès la révélation.

      • Saisine directe du Juge aux Affaires Familiales (JAF) par le procureur pour suspendre les droits du parent mis en cause pendant l'enquête.

      • Suppression du délit de non-représentation d'enfant en cas de suspicion de violences.

      Évaluation et Santé

      • Usage de méthodologies standardisées (SVA, protocole NICHD) pour l'audition et l'expertise.

      • Enregistrement audiovisuel systématique de toutes les expertises.

      • Création de collèges d'experts spécialisés en psychotraumatologie infantile.

      • Immunité disciplinaire effective pour les médecins signalant des maltraitances de bonne foi.

      • Anonymat possible pour les professionnels signalant des faits graves.

      Transformation Sociétale

      • Sortir du déni de la domination masculine et patriarcale au sein de la famille.

      • Former massivement tous les professionnels au contact des enfants (école, santé, social) au repérage des troubles spécifiques.

      • Protéger l'enfant avant de protéger la famille, en cessant de considérer le lien biologique comme sacré au-delà de la sécurité physique et psychique.

    1. 此外,通过三种典型的等温模型(Langmuir、Freundlich 和 Temkin 模型)分析了吸附剂(CUH 和 CUF)与吸附质(磺胺甲恶唑)之间的吸附相互作用,结果如图 4b-d 和表 S4 所示。CUH 和 CUF 的实验数据与 Freundlich 模型拟合度最高(R 2 最接近),表明多分子层吸附占主导作用。Freundlich 指数(n = 异质性因子)的值可用于解释三种吸附行为:当 1/n < 1、=1 和>1 时,分别对应物理过程、线性过程和化学过程[34]。CUH 和 CUF 的 n 值分别为 0.894 和 0.868,表明催化剂表面存在物理吸附。

      freundlish吸附线和其他等温吸附线判断吸附类型

    1. “You are my intellectual sparring partner. Your job is to disagree with me constructively, not to agree. For every idea I present: 1) identify and challenge hidden assumptions; 2) build a strong counter-argument; 3) stress-test my logic for flaws, logical gaps, or weaknesses; 4) offer alternative perspectives to mine; and 5) prioritize truth over consensus.”

      Annotating this to keep the prompt handy. I am going to test this to see how the opinion is swayed.

    1. Claude: Resources page shortening suggestions:

      1. Remove duplicate tool cards — the Dashboard + Learn page cards are identical to what's on the About page. Replace with a compact one-liner: 'Interactive tools: Dashboard · How CM is Made · TEA Comparison · Technical Docs'

      2. Add TEA Comparison link — the new compare.html page systematically compares Pasitka, Humbird, CE Delft, GFI, and Goodwin. This is directly relevant to the reading list context.

      3. Keep the Shared Reading Collection — it serves a unique purpose (downloadable PDFs) not available elsewhere.

      4. Keep the PQ Database section — it's the canonical link set.

    1. Claude: Content trimming plan for this page (1,147 lines → ~700 target)

      Proposed changes in priority order: 1. Fold FBS/serum-free section — historical context, no longer reflects current practice. One visible sentence + collapsed detail. 2. Move jurisdiction & 'Which market?' discussions to workshop — these are workshop topics, not dashboard content. Replace with one-line links. 3. Fold GF signaling diagram — educational but not cost-relevant. Keep the price/solutions tables visible. 4. Bold audit — reduce bold to headings + key numbers only (addresses multiple reviewer comments). 5. Trim Further Resources — link to workshop resources page for full list. 6. CSS fixes — larger diagrams, less whitespace around SVGs.

      Optional: fold 2-3 SVG diagrams (cell banking, seed train) to reduce scroll length — the text already explains these steps.

      Full plan at .private/content_trimming_plan.md. Feedback welcome here or in chat.

    1. Author Response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      (1) The sample size for the ex vivo electrophysiology is small. Given the difficulty and complexity of the preparation, this is understandable. However, a larger sample size would have strengthened the authors' conclusions.

      We appreciate that the sample size is small, but this was limited by the technical difficulty and relatively low yield with this preparation. From a total of 16 experiments, we were able to obtain successful recordings in 6 cases, and these provided the characterisation of the 11 cells reported in Figure 4. We believe that this is sufficient to “strongly suggest” that the cells with dense Trpm8 input correspond to cold-selective cells. We have toned down the statements in the abstract (line 23) and the Results section (line 246).

      (2) The authors used tdTomato expression to identify brain targets innervated by these coldselective lamina I projection neurons. Since tdTomato is a soluble fluorescent protein that fills the entire cell, using synaptophysin reporters (e.g., synaptophysin-GFP) would have been more convincing in revealing the synaptic targets of these projection neurons.

      As the Reviewer says, tdTomato labelling fills the entire cell. However, examination at high magnification reveals numerous varicosities along the labelled axons, presumably corresponding to synaptic boutons. We now illustrate this in Figure 6–figure supplement 2F.

      In addition, we have provided further evidence that these varicosities correspond to (glutamatergic) synaptic boutons by immunostaining sections through the LPB for the postsynaptic density protein Homer1, and showing Homer1 puncta apposed to varicosities (Figure 6–figure supplement 2 G,H). This new information now appears in the Results section (lines 374-380).

      (3) The summary cartoon shown in Figure 7 can be misleading because this study did not determine whether these cold - selective lamina I projection neurons have collateral branches to multiple brain targets or if there are anatomical subtypes that may project exclusively to specific targets. For example, a recent study (Ding et al., Neuron, 2025) demonstrated that there are PBN-projecting spinal neurons that do not project to other rostral brain areas. Furthermore, based on the authors' bulk labeling experiments, the three main brain targets are NTS, PBNrel, and cPAG. The VPL projection is very sparse and almost negligible.

      We agree that branches to different brain nuclei may originate from specific subsets of ALS3 neurons and this is now stated in the figure legend. It is true that there are projections to other brain regions (including NTS). These are not included in the diagram, because their circuitry in relation to cold-sensing is less well understood. Although the projection to VPL from lumbar cord is sparse, this is likely to be explained by the very low proportion of lamina I projection neurons with axons that reach the thalamus. Our retrograde tracing data (e.g. Figure 6-figure supplement 4) had already revealed many cells in the C7 segment that were densely coated with Trpm8 afferents and retrogradely labelled from the lateral thalamus. We have carried out additional experiments in which AAV1.Cre<sup>ON</sup>.td Tomato was injected into the cervical enlargement of Calb1<sup>Cre</sup> mice.This resulted in much denser labelling in the VPL and PoT thalamic nuclei, supporting the suggestion that cold-selective lamina I neurons in the cervical enlargement project to these nuclei. This is now described in lines 381-387 and illustrated in Figure 6–figure supplement 3.

      Reviewer #2 (Public review):

      (1) In the characterization of recorded neurons in close contact or in the absence of this contact with TRPM8 afferents, the number of recorded neurons is relatively low. In addition, the strength of thermal stimuli is not very well controlled, preventing a more precise characterization of the connectivity.

      We fully accept that the sample size is small (please see response to Reviewer 1 above). We also accept that the thermal stimulation was not that well controlled. Unfortunately, commercially available probes for controlling skin temperature are too large to apply to the skin in this preparation. For this reason, we have used application of hot and cold saline, as in our previous studies with this preparation.

      (2) The authors could provide some sense of the effort needed to record from the 6 coldactivated neurons described. How many preparations were needed, etc?

      We now state that 6 out of 16 experiments resulted in successful recordings for this part of the study (lines 858-861).

      Reviewer #3 (Public review):

      (1) While anatomical evidence for direct synaptic connectivity between Trpm8+ afferents and lamina I projection neurons is compelling, a physiological demonstration of strict monosynaptic transmission is not shown. The conclusion that these inputs are exclusively monosynaptic should be toned down. Similarly, the statement that "Lamina I ALS neurons that are surrounded by Trpm8 afferents are cold-selective" should also be toned down as only a few neurons have been tested and it cannot be excluded that other neurons with similar characteristics may be polymodal.

      We have now carried out optogenetic experiments by expressing channelrhodopsin in Trpm8 afferents and retrogradely labelling ALS neurons with tdTomato. This has allowed us to directly demonstrate monosynaptic input. This is described in the Results section (lines 180-202) and the Methods section has been updated. As noted above, we have toned down the statement about lamina I neurons surrounded by Trpm8 afferents being coldselective (line 246).

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      (1) The patch innervation of Trpm8+ sensory neurons in lamina I of the spinal cord dorsal horn is interesting. Do they occupy specific areas within lamina I along the mediolateral axis, or are their placements random? Quantifying the distribution of these terminals in lamina I might be worthwhile.

      Although we have not studied the mediolateral distribution systematically, it appears that the locations of the patches in the mediolateral axis is random, and they could be seen in medial, central and lateral parts of lamina I (as shown in Figure 2). We have added a comment to this effect in the Results section (lines 114-116). Quantifying Trpm8 terminals would be very labour-intensive, and we do not feel that this would be of great benefit.

      (2) Quantification for the percentage of Trpm8+ boutons contacting Phox2a+ neurons that are vGlut3+

      The main purpose of this part of the study was to provide a possible explanation for the finding by Li et al (2015) that some lamina I cells were associated with Vglut3-

      immunoreactive boutons. We found that the percentages of Trpm8+ boutons that contained Vglut3 varied considerably from cell to cell, and this is now stated in the text (lines 133134). However, knowing exact proportions was not an important aspect of the study, we have therefore not carried out a detailed analysis.

      (3) Quantification for the percentage of PBN projections neurons densely innervated by Trpm8+ axons that are calb1+.

      As requested, we have carried out immunohistochemistry to determine the proportion of lamina I ALS neurons with dense Trpm8 input that are calbindin-immunoreactive. We examined 31 neurons from 3 different mice and found that all but 4 (i.e. 87%) were immunoreactive. This is now described (lines 287-293) and illustrated (Figure 5–figure supplement 1). We have now put the electrophysiological characterisation that was in this figure into a separate supplement (Figure 5–figure supplement 2).

      (4) It might be helpful to confirm the brain projection targets of Cal1b+ lamina 1 projection neurons using AAV1-CreON-Synaptophysin-GFP (or other fluorescent proteins) injections

      Please see our response to Public review Reviewer 1 comment 2 above. We have provided further evidence that the brain regions that received input from the Calb1+ cells contain axonal boutons (lines 374-380 and Figure 6–figure supplement 2F-H).

      (5) Figure 6 - Figure Supplements 3 and 4 are duplicated

      We apologise for this duplication, which was made in error in the version originally submitted to eLife. This has now been corrected.

      Reviewer #2 (Recommendations for the authors):

      (1) As mentioned, in the characterization of recorded neurons in close contact or in the absence of this contact with TRPM8 afferents, the number of recorded neurons is relatively low, some recorded in current clamp, a few in voltage clamp. This prevents any solid statistical evaluation of the findings

      Please see response to response to the first point made by Reviewer 1 in the Public reviews. As stated above, we have toned down the statement about the relationship between cells with dense Trpm8 input and cold-selective cells (line 246).

      (2) In addition, the strength of thermal stimuli is not very well controlled, preventing a more precise characterization of the synaptic connection between afferents and ALS projection neurons.

      Please see our response to the Public review comment made by this Reviewer.

      (3) Line 35. In the description of the anterolateral system and the effects of lesions, the species(s) should be specified since rodents and humans have a different anatomical distribution of spinal tracts.

      We now state that while ALS axons ascend in the anterolateral quadrant in humans, they are located in the dorsolateral white matter in rodents (lines 40-42)

      (4) To describe the semi-intact preparation used for recording and stimulation from the periphery, the authors cite a study by Julien Allard (reference 25). However, that study describes an in vivo preparation. I believe there is an error in the citation.

      We thank the Reviewer for pointing this out – it has now been corrected.

      (5) Line 726. Dorsal horn recordings were performed at 25 ºC. What is the temperature of the skin? How would this low temperature affect the excitability of cold afferents and their axons? Perhaps a comment about this issue would be appropriate.

      The skin temperature in this preparation is the same as that of the spinal cord (25 °C). At this temperature, Trpm8 afferents would be active, but are likely to have adapted during the course of the experiment. Since this temperature is below 37 °C, it is likely that the conduction velocity of these afferents will be slower than in the in vivo situation. We have added a comment to this effect (lines 818-821).

      (6) Line 401. The authors could not detect Trpv1-immunoreactivity in the central terminals of Trpm8Flp;RCE:FRT mice. Could they detect Trpv1 immunoreactivity in any central terminal? Do they have positive evidence that their immunostaining worked?

      Trpv1 was readily detected in central terminals with the Trpv1 antibody. An example showing lack of detectable Trpv1-immunoreactivity in GFP-labelled (Trpm8-expressing) afferents is now shown in Figure 2–figure supplement 1K-M.

      (7) Line 437. What is the expected anterograde transport time for YFP from the lumbar cord to the brainstem? Are 2-3 weeks not sufficient based on the literature? I noticed the authors are using longer survival times after intraspinal injections

      In preliminary experiments for a previous study Substance P-expressing excitatory interneurons in the mouse superficial dorsal horn provide a propriospinal input to the lateral spinal nucleus | Brain Structure and Function we had found that a 2 week survival time after injection of AAV1.Cre<sup>ON</sup>.GFP into the lumbar spinal cord of Tac1<sup>Cre</sup> mice was not sufficient to label axons in the brain, although at 4 weeks we saw brain labelling. We have also found that extending survival times from 4 to 6 weeks gives greatly improved labelling, especially in the thalamus.

      (8) Figure 5A. Many of the labelled cells appear to have the somas in the white matter, which makes little sense. It seems the reference section to plot the cells is not optimal

      The placement of cells is accurate. Many spinal projection neurons are present outside the main region of grey matter (i.e. laminae I-X). These cells are found in 2 main regions – the lateral spinal nucleus (LSN) and the lateral reticulated part of lamina V. These two regions are intermediate between grey and white matter – i.e. they contain scattered cell bodies amongst a dense collection of axons. For this reason they appear outside the grey/white border as it is conventionally shown on diagrams of this type. This has been reported in numerous studies, e.g. see Figure 2 in The cells of origin of the spinothalamic tract of the rat: a quantitative reexamination - PubMed.

      (9) Recent transcriptomic studies suggest the presence of more than one subpopulation of Trpm8-expressing DRG or trigeminal neurons. It is unclear to what extent the Trpm8-Flp line is capturing this diversity.

      We are aware that there are at least 3 transcriptomic subsets of Trpm8-expressing primary sensory neurons. However, we are not aware of any suitable molecular markers that would allow us to discriminate between them, and therefore address this point.

      (10) Could the patchy distribution of Trpm8 afferents in lamina I reflect incomplete recombination; the empty spaces could be occupied by unmarked afferents?

      In theory it could, but this seems unlikely. The Trpm8<sup>Flp</sup> line (crossed with RCE:FRT) captures ~83% of Trpm8-positive cell bodies, and it seems very unlikely that the remaining 17% of Trpm8-expressing afferents would fill the spaces between GFP bundles that we see in lamina I. This is now stated in the Results section (lines 116-120).

      Reviewer #3 (Recommendations for the authors):

      (1) It would be a nice addition to the validation of the Trpm8-Flp line to specify what ages (if multiple) have been analysed and whether there are any differences. In addition, is labelling different at different levels of the spinal cord, and is there any labeling in supraspinal regions?

      The tissue used for this part of the study was obtained from mice aged 5-9 weeks and this is now stated (lines 78-79). We did not observe any differences with age, but we did not look at this in detail. Labelling was similar at different levels of the spinal cord, and this is stated (lines 108-109). We have added a brief account of the distribution of GFP labelling in the brain (lines 140-144).

      (2) Line 169. It is not clear how ALS neurons are labeled. It is explained in the material and methods (I believe it is AAV9.mCherry into the LPB or CVLM). Although I could not find a mention of a tdTomato AAV, maybe I missed it. In any case, it would be great to have the experimental strategy briefly explained in the text. For the same reason, I would recommend moving Figure 4 Supplement 1A and 1B schematics to the main figure, very helpful for understanding the experiment.

      We thank the Reviewer for this suggestion. We now explain in the Results section how the ALS neurons were labelled (lines 209-212), and as the Reviewer recommends we have put the schematic diagrams from Figure 4–figure supplement 1 into the main Figure. As noted in the text, the tdTomato labelling resulted from injection of an AAV coding for Cre into mice that contained the Ai9 allele. We have also updated the descriptions of brain injections in the Methods section to cover the new experiments (optogenetics, and calbindin immunohistochemistry).

      (3) Line 184. "Figure 4" would be good to specify the panels; I believe it should be 4A-C. Same for line 194, 4D-F?

      We apologise that this was omitted from the original version – we have now specified the panels.

      (4) Line 179. It would be great to specifiy in the text and figures the temperature used for hot and warm water. In addition, would the responses be different using different temperatures? Can you test ramps? These would go a great way to compare with responses shown in vivo by Ran and colleagues.

      We now specify the hot and cold saline temperatures used to stimulate the skin in the semiintact preparation in the legend for Figure 4 and in the Results section (lines 222-223). As noted above, it is difficult to use more accurate thermal stimuli in this preparation. Please see response to Reviewer 2 public comment 1.

      (5) Figure 4-Figure supplement 1F. It looks like these are very slow responses (1 sec?) for monosynaptic connectivity.

      In this figure (now part 1D) the action potential frequency was determined from counts of APs in 1 sec bins, and this is now stated in the legend. This might have given the impression of slow responses.

      (6) Line 203. I would tone down the statement, as only 6 cells "that were clearly associated with numerous GFP-labelled afferents" have been tested. Thus, it cannot be excluded that other cells with similar anatomical characteristics may also respond to other stimuli

      As requested, we have toned down this statement (line 246).

      (7) Line 230. Here AAV11.CreON.td Tomato is used, in previous retrograde experiments, AAV9 has been used (Figure 4), why the switch to 11? Is the tropism the same? Is it possible that because you are using a different serotype, you are targeting different neurons?

      We have found that although AAV9 coding for fluorescent proteins is very good for retrograde labelling, AAV9 coding for Cre-dependent constructs (e.g. AAV.Cre<sup>ON</sup>.tdTomato) gives very poor recombination in spinal projection neurons, for reasons that we do not understand. We recently became aware of the AAV11 serotype, which was recommended as being suitable for retrograde transport AAV11 enables efficient retrograde targeting of projection neurons and enhances astrocyte-directed transduction | Nature Communications. We have found that this works very well for labelling ALS cells throughout the spinal cord when using Cre-dependent constructs. We have added a reference to this paper at this point in the text. We are not able to say whether tropism is the same or different, but in each case many ALS neurons (including many of those in lamina I) are captured.

      (8) Line 234. Is there any positional organization for the "tdTomato-labelled cells densely innervated byTrpm8 afferents", do they preferentially cluster in some position of lamina I?

      These cells are found throughout the mediolateral extent of the dorsal horn, and this is now stated (lines 279-280).

      (9) Line 237. The actual number of cells/mm would be informative.

      This would be difficult to estimate, as the sections were cut in the horizontal plane, which means that lamina I can appear on a variable number of sections.

      (10) Line 249. From the figures, the action potentials of the Calb+ neurons seem to have a delayed onset (at the end of cold saline treatment, Figure 5, Supplement 1l) compared to lamina I ALS neurons recorded in Figure 4, Supplement 1f. If real, it is an interesting difference in the time-course of response that could indicate different coding properties e.g., response to cooling (general neurons) vs. response to absolute temperature (calb + neurons).

      As for Fig 4-figure supplement 4 (see response to point #5 above), action potential frequency was determined from APs counted in 1 sec bins, and this is now stated in the legend.

      (11) Figure 7. In the model, the disynaptic pathway should also be shown

      We have added a comment to the legend stating that there may also be indirect (“polysynaptic”) input from Trpm8 afferents to ALS3 neurons.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      CCK is the most abundant neuropeptide in the brain, and many studies have investigated the role of CCK and inhibitory CCK interneurons in modulating neural circuits, especially in the hippocampus. The manuscript presents interesting questions regarding the role of excitatory CCK+ neurons in the hippocampus, which has been much less studied compared to the well-known roles of inhibitory CCK neurons in regulating network function. The authors adopt several methods, including transgenic mice and viruses, optogenetics, chemogenetics, RNAi, and behavioral tasks to explore these less-studied roles of excitatory CCK neurons in CA3. They find that the excitatory CCK neurons are involved in hippocampal-dependent tasks such as spatial learning and memory formation, and that CCK-knockdown impairs these tasks.

      However, these questions are very dependent on ensuring that the study is properly targeting excitatory CCK neurons (and thus their specific contributions to behavior). There needs to be much more characterization of the CCK transgenic mice and viruses to confirm the targeting. Without this, it is unclear whether the study is looking at excitatory CCK neurons or a more general heterogeneous CCK neuron population.

      Strengths:

      This field has focused mainly on inhibitory CCK+ interneurons and their role in network function and activity, and thus, this manuscript raises interesting questions regarding the role of excitatory CCK+ neurons, which have been much less studied.

      Weaknesses:

      (1a) This manuscript is dependent on ensuring that the study is indeed investigating the role of excitatory CCK-expressing neurons themselves and their specific contribution to behavior. There needs to be much more characterization of the CCK-expressing mice (crossed with Ai14 or transduced with various viruses) to confirm the excitatory-cell targeting. Without this, it is unclear whether the study is looking at excitatory CCK neurons or a more general heterogeneous CCK neuron population.

      Thank you for this constructive comment. Indeed, the current study lacks comprehensive strategies to unequivocally distinguish excitatory CCK neurons from heterogeneous CCK neuronal populations. Nevertheless, we provide multiple lines of evidence supporting the distribution of CaMKIIα/Vglut1-expressing CCK<sup>+</sup> neurons in the hippocampus (Figure 1F), using complementary approaches including transgenic mouse models as well as viral and antibody-based labeling (Figure 1A, Figure 1H-I). In addition, we demonstrate that 635 nm light reliably evokes field excitatory postsynaptic potentials (fEPSPs) at CA3-Schaffer collateral synapses expressing DIO-CaMKIIα-ChrimsonR in vitro (Figure 2A-F). Importantly, these light-evoked excitatory synaptic responses are abolished by AMPA and NMDA receptor antagonists (CNQX and APV), confirming the excitatory nature of the DIO-CaMKIIα-ChrimsonR-expressing synapses. To demonstrate the future works that can further support our findings and conclusions, we have added the strategies that can be conducted in the Discussion section in the revision:

      “Due to technical limitations at the current stage, we were unable to perform whole-cell recordings or pharmacological manipulations using CCK receptor antagonists. In future studies, the application of these approaches to directly record and selectively block EPSPs from excitatory CCK neurons in the hippocampus will further strengthen and validate our conclusions.” (Line 265 - line 269 in the revision).

      (1b) For the experiments that use a virus with the CCK-IRES-Cre mouse, there is no information or characterization on how well the virus targets excitatory CCK-expressing neurons. (Additionally, it has been reported that with CaMKIIa-driven protein expression, using viruses, can be seen in both pyramidal and inhibitory cells.

      We thank the reviewer for this insightful comment regarding the specificity of viral targeting in CCK-IRES-Cre mice.

      To address this concern, we performed additional characterization of viral expression in CA3. We found that DIO-CaMKIIα-mCherry expression showed a high degree of colocalization with CaMKIIα immunoreactivity, indicating preferential targeting of excitatory neurons (sFigure 1A-B; sFigure 2A-B; sFigure 3A-B). We showed an example to confirmed the high specificity of the AAV for infecting the excitatory CCK neurons in CA3 area.

      Besides, we acknowledge prior reports showing that CaMKIIα-driven viral expression can, in some cases, be detected in a small subset of inhibitory neurons. However, because CA3-Schaffer collateral projections to CA1 arise exclusively from excitatory CA3 pyramidal neurons, any potential expression in inhibitory CCK<sup>+</sup> interneurons are unlikely to directly contribute to the recorded CA1 synaptic responses in our electrophysiological experiments. That said, we cannot fully exclude the possibility that a minor population of inhibitory CCK⁺ neurons could indirectly modulate CA3 pyramidal neuron activity via local circuit mechanisms, particularly in experiments involving optogenetic manipulation or shRNA expression. We now explicitly acknowledge this limitation in the revised manuscript:

      “Importantly, to further improve cell-type specificity, we propose an intersectional genetic strategy using CCK-IRES-Cre × VGlut1-Flp mice combined with a Cre-On/Flp-On (Con/Fon) AAV, which would restrict expression exclusively to excitatory CCK-expressing neurons and eliminate potential contributions from inhibitory CCK<sup>+</sup> cells. This approach will be implemented in future studies to refine circuit specificity.” (Line 269 - line 273 in the revision).

      (2) The methods and figure legends are extremely sparse, leading to many questions regarding methodology and accuracy. More details would be useful in evaluating the tools and data. More details would be useful in evaluating the tools and data. Additionally, further quantification would be useful-e.g. in some places, only % values are noted, or only images are presented.

      Thank you for these constructive comments. We have expanded the methodological descriptions in both the Methods section and the figure legends to provide sufficient detail for evaluating the experimental tools and data accuracy. In addition, we have added quantitative analyses where previously only representative images or percentage values were shown. Specifically, quantification has now been included for each AAV condition in the corresponding figures in the revised manuscript.

      (3) It is unclear whether the reduced CCK expression is correlated, or directly causing the impairments in hippocampal function. Does the CCK-shRNA have any additional detrimental effects besides affecting CCK-expression (e.g., is the CCK-shRNA also affecting some other essential (but not CCK-related) aspect of the neuron itself?)? Is there any histology comparison between the shRNA and the scrambled shRNA?

      Recent studies from our lab demonstrated that knockout the CCK gene expression significantly attenuates the hippocampal-dependent spatial learning and CA3-CA1 LTP, indicating CCK plays a critical role in modulating the hippocampal functions[1,2]. Additionally, CCK-shRNA or CCK-scramble did not significantly affect the excitatory synaptic transmission in the CA3-CA1 projections, hinting that CCK-shRNA may exhibits no obvious adverse effect on other neural components.

      Finally, we have provided the histology comparison between the shRNA and the scrambled shRNA regrading the expression level of the CCK protein (Pro-CCK) in the revision. Our result shows that CCK-shRNA (left panel) significantly reduced CCK expression in CA3<sup>CCK</sup>-positive neurons compared with the CCK-Scramble group (right panel).

      Citation:

      (1) Wang, J. L., Sha, X. Y., Shao, Y., Zhang, Z. H., Huang, S. M., Lin, H., ... & Sun, J. P. (2025). Elucidating pathway-selective biased CCKBR agonism for Alzheimer’s disease treatment. Cell.

      (2) Zhang, N., Sui, Y., Jendrichovsky, P., Feng, H., Shi, H., Zhang, X., ... & He, J. (2024). Cholecystokinin B receptor agonists alleviates anterograde amnesia in cholecystokinin-deficient and aged Alzheimer's disease mice. Alzheimer's research & therapy, 16(1), 109.

      https://doi.org/10.7554/eLife.109001.1.sa2

      Reviewer #2 (Public review):

      Summary:

      In this study, the authors have demonstrated, through a comprehensive approach combining electrophysiology, chemogenetics, fiber photometry, RNA interference, and multiple behavioral tasks, the necessity of projections from CCK+ CAMKIIergic neurons in the hippocampal CA3 region to the CA1 region for regulating spatial memory in mice. Specifically, authors have shown that CA3-CCK CAMKIIergic neurons are selectively activated by novel locations during a spatial memory task. Furthermore, authors have identified the CA3-CA1 pathway as crucial for this spatial working memory function, thereby suggesting a pivotal role for CA3 excitatory CCK neurons in influencing CA1 LTP. The data presented appear to be well-organized and comprehensive.

      Strengths:

      (1) This work combined various methods to validate the excitatory CCK neurons in the CA3 area; these data are convincing and solid.

      (2) This study demonstrated that the CA3-CCK CAMKIIergic neurons are involved in the spatial memory tasks; these are interesting findings, which suggest that these neurons are important targets for manipulating the memory-related diseases.

      (3) This manuscript also measured the endogenous CCK from the CA3-CCK CAMKIIergic neurons; this means that CCK can be released under certain conditions.

      Weaknesses:

      (1) The authors do not mention which receptors of the CCK modulate these processes.

      We appreciate the reviewer for raising this important question. Based on our recent work, CCK-B receptors are the primary neural components mediating CCK functions in the hippocampus at both the synaptic plasticity and behavioral levels (Su et al., 2023; Zhang et al., 2024; Wang et al., 2025). To clarify this mechanism, we have added the following content to the revised manuscript:

      “Based on our recent work, CCK signaling in the hippocampus is predominantly mediated by CCK-B receptors, which play a critical role in regulating synaptic plasticity and spatial memory-related behaviors.” (Line 105 - line 106 in the revision).

      (2) This author does not test the CCK gene knockout mice or the CCK receptor knockout mice in these neural processes.

      Thank you for this insightful comment. We previously tested these experiments in an earlier study. Our results showed that high-frequency electrical stimulation failed to induce significant LTP in the CA3-CA1 pathway in both CCK gene knockout (CCK-KO) mice and CCK-B receptor knockout (CCK-BR-KO) mice in vitro (Su et al., 2023; Zhang et al., 2024; Wang et al., 2025). These findings indicate that CCK mediates its synaptic effects predominantly through CCK-B receptors in the CA3-CA1 pathway. Accordingly, we have added this description to the revised manuscript.

      “Additionally, high-frequency electrical stimulation fails to induce LTP in the CA3-CA1 pathway in both CCK-KO and CCK-BR-KO mice, indicating that CCK-dependent synaptic plasticity in this circuit is primarily mediated by CCK-B receptors.” (Line 170 - line 173 in the revision).

      (3) The author does not test the source of CCK release during the behavioral tasks.

      We thank the reviewer for raising this important point. In our previous work, we directly monitored CCK release in the hippocampus during an object-exploration task using a GPCR-based CCK-BR sensor combined with fiber photometry (Su et al., 2023). During object exploration, we observed a rapid and robust increase in CCK-BR sensor fluorescence, indicating activity-dependent CCK release in the hippocampus. Based on these findings, we deduced that hippocampal CCK release plays a critical role in hippocampus-dependent behavioral tasks.

      We acknowledge that hippocampal neurons receive CCK-positive projections from multiple brain regions, making it technically challenging to isolate and monitor the precise source of CCK release in the CA1 area during behavioral tasks in vivo. One potential strategy to address this limitation is selective overexpression of CCK in CA3 neurons (e.g., AAV-CCK delivery), followed by assessment of CCK-BR sensor responses during hippocampal-dependent behaviors. We have added this discussion to the revised manuscript to clarify the source and functional relevance of CCK release during behavioral tasks.

      “Besides, using a GPCR-based CCK-BR sensor combined with fiber photometry, our previous work demonstrated rapid, activity-dependent CCK release in the hippocampus during object-exploratory behavior, supporting a functional role for hippocampal CCK signaling in cognitive tasks (Su et al., 2023). Given that hippocampal neurons receive CCK-positive projections from multiple brain regions, it remains technically challenging to precisely identify the cellular source of CCK release in CA1 during behavior. Future studies employing selective CCK overexpression in CA3 neurons, together with CCK-BR sensor recordings, may help further delineate the contribution of CA3-derived CCK to hippocampal-dependent behaviors.” (Line 313 - line 321 in the revision).

      Citation:

      (1) Wang, J. L., Sha, X. Y., Shao, Y., Zhang, Z. H., Huang, S. M., Lin, H., ... & Sun, J. P. (2025). Elucidating pathway-selective biased CCKBR agonism for Alzheimer’s disease treatment. Cell.

      (2) Zhang, N., Sui, Y., Jendrichovsky, P., Feng, H., Shi, H., Zhang, X., ... & He, J. (2024). Cholecystokinin B receptor agonists alleviates anterograde amnesia in cholecystokinin-deficient and aged Alzheimer's disease mice. Alzheimer's research & therapy, 16(1), 109.

      (3) Su, J., Huang, F., Tian, Y., Tian, R., Qianqian, G., Bello, S. T., ... & He, J. (2023). Entorhinohippocampal cholecystokinin modulates spatial learning by facilitating neuroplasticity of hippocampal CA3-CA1 synapses. Cell Reports, 42(12).

      https://doi.org/10.7554/eLife.109001.1.sa1

      Reviewer #3 (Public review):

      Summary:

      Fengwen Huang et al. used multiple neuroscience techniques (transgenetic mouse, immunochemistry, bulk calcium recording, neural sensor, hippocampal-dependent task, optogenetics, chemogenetics, and interfer RNA technique) to elucidate the role of the excitatory cholecystokinin-positive pyramidal neurons in the hippocampus in regulating the hippocampal functions, including navigation and neuroplasticity.

      Strengths:

      (1) The authors provided the distribution profiles of excitatory cholecystokinin in the dorsal hippocampus via the transgenetic mice (Ai14::CCK Cre mice), immunochemistry, and retrograde AAV.

      (2) The authors used the neural sensor and light stimulation to monitor the CCK release from the CA3 area, indicating that CCK can be secreted by activation of the excitatory CCK neurons.

      (3) The authors showed that the activity of the excitatory CCK neurons in CA3 is necessary for navigation learning.

      (4) The authors demonstrated that inhibition of the excitatory CCK neurons and knockdown of the CCK gene expression in CA3 impaired the navigation learning and the neuroplasticity of CA3-CA1 projections.

      Weaknesses:

      (1) The causal relationship between navigation learning and CCK secretion?

      Thank you for pointing out this important issue. Previous studies have shown that CCK can be rapidly secreted during exploratory behaviors, as detected by the CCK-BR sensor. In parallel, CCK-positive neurons have been demonstrated to play a critical role in the precise execution of hippocampus-dependent spatial learning. Together, these findings suggest that exploratory behavior induces CCK secretion, which in turn contributes to the accuracy of hippocampal-dependent learning and memory processes. Based on this evidence, we propose that CCK secretion serves as a functional link between behavioral exploration and spatial learning. We have added these explanations in the revised manuscript to better clarify the causal relationship between behavioral exploration and CCK secretion:

      “Besides, using a GPCR-based CCK-BR sensor combined with fiber photometry, our previous work demonstrated rapid, activity-dependent CCK release in the hippocampus during object-exploratory behavior, supporting a functional role for hippocampal CCK signaling in cognitive tasks (Su et al., 2023). Given that hippocampal neurons receive CCK-positive projections from multiple brain regions, it remains technically challenging to precisely identify the cellular source of CCK release in CA1 during behavior. Future studies employing selective CCK overexpression in CA3 neurons, together with CCK-BR sensor recordings, may help further delineate the contribution of CA3-derived CCK to hippocampal-dependent behaviors.” (Line 313 - line 321 in the revision)

      (2) The effect of overexpression of the CCK gene on hippocampal functions?

      We thank the reviewer for this comment. In fact, an earlier study from our laboratory demonstrated that intraperitoneal injection of exogenous CCK-4 significantly improved performance in hippocampus-dependent spatial learning tasks in both CCK gene knockout (CCK-KO) mice and Alzheimer’s disease (AD) mouse models. These findings suggest that enhancing CCK signaling can ameliorate hippocampal dysfunction at both the behavioral and synaptic plasticity levels (Zhang et al., 2024; Wang et al., 2025). Accordingly, although direct genetic overexpression of CCK in the hippocampus has not yet been extensively characterized, the observed benefits of exogenous CCK delivery support the notion that increased CCK availability positively modulates hippocampal function and spatial learning. We have cited this study in the revised manuscript to support this interpretation.

      “Interestingly, an earlier study demonstrated that intraperitoneal injection of exogenous CCK-4 significantly improved performance in hippocampus-dependent spatial learning tasks in both CCK gene knockout (CCK-KO) mice and Alzheimer’s disease (AD) mouse models (Zhang et al., 2024). These findings suggest that enhancing CCK signaling can ameliorate hippocampal dysfunction at both the behavioral and synaptic plasticity levels.” (Line 291 - line 297 in the revision)

      (3) What are the functional differences between the excitatory and inhibitory CCK neurons in the hippocampus?

      In the hippocampus, CCK-expressing neurons consist of two major populations with distinct functions: excitatory (glutamatergic) and inhibitory (GABAergic) neurons. Excitatory CCK neurons are relatively sparse and intermingled with pyramidal cells. By releasing glutamate, they directly contribute to excitatory transmission and are thought to participate in synaptic plasticity and information processing related to learning and memory. In contrast, inhibitory CCK neurons are more abundant and include well-characterized interneuron subtypes such as CCK-positive basket cells. These neurons release GABA and primarily target the perisomatic region of pyramidal neurons, providing strong control over neuronal firing. Notably, inhibitory CCK interneurons are highly sensitive to neuromodulatory signals, particularly endocannabinoids via CB1 receptors, enabling dynamic regulation of inhibitory tone and network activity. Together, excitatory CCK neurons mainly support hippocampal excitation and plasticity, whereas inhibitory CCK neurons regulate network dynamics and spike timing. As the focus of the present study is on excitatory CCK neurons, a detailed comparison between these two populations was not included in the original manuscript.

      (4) Do CCK sources come from the local CA3 or entorhinal cortex (EC) during the high-frequency electrical stimulation?

      Thank you for this insightful comment. Our data indicate that the CCK detected during high-frequency stimulation originates from CA3 neurons rather than the entorhinal cortex (EC). As shown in Figure 2, we used an optogenetic approach combined with a GPCR-based CCK sensor to selectively examine CCK release from the CA3-CA1 pathway. ChrimsonR was specifically expressed in CA3 neurons projecting to CA1, restricting light stimulation to CA3 axon terminals. In parallel, the CCK sensor was locally expressed in CA1, allowing real-time detection of CCK release at CA3 presynaptic sites. High-frequency light stimulation robustly evoked CCK signals in CA1, demonstrating activity-dependent CCK release from CA3 terminals. Importantly, EC inputs were neither genetically targeted nor optically stimulated in this experiment, excluding the EC as a source of the detected CCK. Together, these results support the conclusion that CCK released during high-frequency stimulation is derived from local CA3 projections to CA1. Similarly, as the focus of the present study is on excitatory CCK neurons in the CA3 area, a detailed comparison between these two CCK sources was not included in the original manuscript.

      Citation:

      (4) Wang, J. L., Sha, X. Y., Shao, Y., Zhang, Z. H., Huang, S. M., Lin, H., ... & Sun, J. P. (2025). Elucidating pathway-selective biased CCKBR agonism for Alzheimer’s disease treatment. Cell.

      (5) Zhang, N., Sui, Y., Jendrichovsky, P., Feng, H., Shi, H., Zhang, X., ... & He, J. (2024). Cholecystokinin B receptor agonists alleviates anterograde amnesia in cholecystokinin-deficient and aged Alzheimer's disease mice. Alzheimer's research & therapy, 16(1), 109.

      (6) Su, J., Huang, F., Tian, Y., Tian, R., Qianqian, G., Bello, S. T., ... & He, J. (2023). Entorhinohippocampal cholecystokinin modulates spatial learning by facilitating neuroplasticity of hippocampal CA3-CA1 synapses. Cell Reports, 42(12).

    1. Reviewer #1 (Public review):

      Summary:

      The authors test whether the frog buccal ventilatory rhythm generator behaves as a discrete, anatomically localized oscillator or as a distributed, state-dependent network. They combine reduced preparations (segment/subsegment work), systematic extracellular unit surveys over a defined grid, and local AMPA/GABA microinjections in a hemisected brainstem preparation. Based on these approaches, the authors conclude that mild global excitation (bath AMPA) broadens the distribution of rhythmically active units and renders a previously defined "buccal area" functionally non-identifiable as a unique necessary/sufficient locus.

      The central idea is plausible, and the overall experimental strategy is appropriate for the question being asked. However, in its current form, the manuscript overstates the strength of inference supporting the "expansion" and "loss of necessity/sufficiency" conclusions. This is primarily due to (a) statistical treatment of unit-mapping data that does not respect clustering by preparation/animal, (b) inconsistent statistical reporting across sections, and (c) limited interpretability of focal inhibitory perturbations under a globally excited state.

      Strengths:

      (1) The manuscript addresses a clear mechanistic question with broader relevance: whether rhythm generation is best conceptualized as a localized kernel or as an emergent distributed property that changes with excitatory state.

      (2) The authors use convergent approaches (reduced preparations, mapping, and necessity/sufficiency-style pharmacological perturbations), which is appropriate for circuit-level inference.

      (3) A strong element is the within-unit analysis supporting state-dependent changes in phase coupling for a subset of units ("lung" units adopting a buccal-like pattern). The authors' offline PCA-based spike sorting (with cluster-quality selection via silhouette score) provides some reassurance that the reported pre/post injection changes are not simply driven by unit misidentification.

      Weaknesses:

      (1) Pseudoreplication in unit-survey statistics undermines the main mapping inference. The Methods state that "Units were pooled from multiple preparations" and that chi-squared tests were used to compare proportions across conditions (baseline vs 60 nM AMPA). The Results similarly report proportion changes (e.g., 110 units pooled from three preparations vs 137 units pooled from three additional animals) analyzed with chi-squared tests. Because many units come from the same preparation/animal, independence is unlikely to hold; therefore, inference about state-dependent reorganization at the systems level should be made at the preparation/animal level or via hierarchical models that explicitly account for clustering.

      (2) Statistical methods are inconsistently described and need harmonization. In the segment dose-response "Analysis," values are described as compared to zero using a "One-sample t-test." Yet Table 1 is titled as using a "Wilcoxon One-sample Test." These discrepancies must be resolved throughout (Methods, Results, figure legends, and tables), including clear reporting of the unit of n and exact test statistics.

      (3) Unit classification and operational definitions raise interpretational concerns. The unit classification scheme defines "buccal units" as those firing during buccal bursts as well as lung bursts, and explicitly notes that "no units were found which fired only during buccal bursts." This is a consequential result, and it currently reads more like a limitation of detection/classification (or state-space sampled) than a robust biological conclusion. Without additional evidence, it weakens claims about a distinct buccal rhythmogenic module and complicates the interpretation of "buccal identity" changes under excitation.

      (4) Microinjection mapping: high exclusion rate and alternative explanations for 'loss of necessity' under excitation. The manuscript reports that 15 experiments were conducted, but 9 were excluded because the buccal area was not found or the preparation was "overdriven." This exclusion rate is too high to leave implicit; it raises concerns about selection bias and demands transparent accounting. Moreover, under baseline conditions, GABA (or AMPA-GABA) microinjections reliably reduce/abolish buccal bursts, but under bath 60 nM AMPA, the same injections produce no significant change in instantaneous frequency. This pattern can be interpreted as network redistribution, but it can also reflect state-dependent changes in gain, dynamic range, or local pharmacological impact (e.g., inhibition being comparatively underpowered in the globally excited state). Additional controls/analyses are required to distinguish these explanations.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors investigate the response of the amphibian respiratory rhythm generator under varying excitability conditions. They use pharmacological agents to increase and/ or decrease synaptic excitability and demonstrate the resilience of buccal rhythms under different conditions. They employ these results to formulate their primary thesis, that there is no obligatory locus of the buccal respiratory rhythm in the frog, and that their respiratory rhythmogenic mechanisms should be considered diffuse and anatomically distributed across a larger brainstem region.

      Strengths:

      This manuscript is well written, with a sufficiently large number of experiments, for which the authors should be congratulated.

      Weaknesses:

      The presented results don't support the authors' main conclusions, and the interpretation of the data is heavily biased toward their hypothesis. This impregnates an unsubstantiated narrative in the Abstract, Introduction, and Discussion of this manuscript, which must be reexamined with the following points in consideration:

      (1) The authors seem to confuse degeneracy with redundancy. For instance, at line 54, they state, "These findings support the broader hypothesis that respiratory rhythm-generating circuits can switch to being diffuse and redundant, with discrete oscillators quickly drowning in a sea of excitations."

      Redundancy means having the same component repeated multiple times to buffer the failure of any single component, whereas degeneracy means different functional components that compensate for one another under perturbations (Goaillard and Marder, ARN 2021)

      Since the premotor-lung units get converted to buccal units under high excitability, this suggests a degenerate mechanism for respiratory rhythm generation- rather than a redundant mechanism, where there should be multiple buccal units that get recruited under different excitability conditions.

      (2) Line 83, "but the essential requirement for a discrete, rudimentary buccal oscillator is also lost".

      This statement is not supported by the data presented in this study. How does the expansion of the buccal unit imply that the essential requirement for discreteness is lost? Under increased excitability, does the burst/rhythm initiation zone also expand? Or does it still remain centered around the location of buccal units under physiological conditions? Increased excitability can lead to recruitment of a larger area, without a change in the location of the rhythmogenic kernel.

      (3) Line 86, "... oscillators should be viewed as promiscuous flexible functional entities that expand or contract...".

      Oscillators can be regarded as promiscuous only if, under physiological conditions, they switch positions. Under high excitability, only the flexibility argument holds, which has been established in mammals before (e.g., CA Del Negro, K Kam, JA Hayes, JL Feldman, The Journal of physiology 587 (6), 1217-1231; CA Del Negro, C Morgado-Valle, JL Feldman,Neuron 34 (5), 821-830; NA Baertsch, LJ Severs, TM Anderson, JM Ramirez, Proceedings of the National Academy of Sciences 116 (15), 7493-7502; NA Baertsch, HC Baertsch, JM Ramirez Nature communications 9 (1), 843).

      Results:

      (4) Interpretation of data in Figure 6.

      How does the Buccal activity and L2 Power stroke change with 60nm AMPA (in CN5)? Does the increase in the Buccal neurons and decrease in power stroke neurons also reflect in the CN5 activity? Also see comments on Figure 9 data below.

      (5) Interpretation of data in Figure 7.

      Here, classifying buccal neurons solely by spiking may obscure the fact that the 'silent' neurons under baseline conditions were part of the rhythmic network but could not spike due to subthreshold inputs. 60 nM AMPA increased their firing in response to previously subthreshold synchronous inputs during the buccal burst. Intracellular recordings are required to negate this possibility and establish that the neuronal classification is robust.

      (6) Interpretation of data in Figure 8.

      "Lung units can transform into buccal units under excitation".<br /> CN5 buccal and lung bursts need to be compared before and after AMPA injection. From Figure 8 A-D, it is apparent that the example Unit2's activity increases during the buccal bursts, after AMPA injection. However, they are also present in buccal burst pre-AMPA, albeit with less frequency.

      It is striking that the pre-AMPA epoch (panel A) is less than half of the post-AMPA epoch. This would, in itself, lead to a biased estimate of lung units that are active under the baseline condition during the buccal bursts.

      Figure 8G, meta-analysis of lung units spiking during the baseline buccal bursts is warranted to interpret the main claim of this figure. Similarly, analysis of spiking per lung burst for the post-AMPA condition is essential for comparing the lung unit's contribution under high excitability.

      (7) Interpretation of data in Figure 9

      "Buccal area loses importance under increased excitation."

      This interpretation is not fully supported by the data presented in this manuscript. Under 60 nm AMPA, does the ratio of lung burst to buccal burst change in CN5? This analysis is crucial for determining whether the lung units are indeed converted into buccal bursts at the expense of lung activity or whether their appearance during buccal bursts is incidental due to increased excitability. In the baseline, there are 4-5 buccal bursts per lung burst, whereas under high excitability, there are 2-3 buccal bursts per lung burst (Figure 9 A-B). This seems inconsistent with the conclusion that increased excitability converts lung units into buccal units (Figures 6 &7).

      Could the authors comment on the connectivity between the lung and the buccal units? Results in Figure 9A-B indicate that lung units may receive an efference copy of buccal units, and under high excitability, their spikes may generate negative feedback onto the buccal units, terminating their bursts. This could explain the decrease in the buccal-to-lung burst in high-AMPA conditions. This type of circuit interaction resembles the mammalian breathing CPG, in which the parafacial/RTN (which controls the abdominal muscles) and preBötC (which controls the diaphragm) interact and cross-inhibit each other.

      (8) Line 382.

      "Buccal-like bursting produced from two independent slices".

      The two "independent" slices have portions of the same anatomical kernel, the buccal rhythm generator. This experiment is like the sandwich slice preparation of preBötC (Del Negro Lab), in which two thinner slices exhibit rhythmic activity. Thus, the two slices are not independent; they are anatomically adjacent and functionally overlapping.

    3. Author response:

      Reviewer #1 (Public review):

      Hierarchical Inference (Unit Survey)

      We agree that pooling units across preparations can overstate the strength of inference if preparation-level clustering is ignored. We will therefore reanalyze the unit-survey dataset using a hierarchical approach in which the preparation/animal is treated as the unit of inference. Our pooled dataset was derived from three chunk preparations exposed to AMPA and three baseline preparations, allowing us to report per-preparation proportions and variability as requested.

      A preliminary reanalysis of the buccal segment preparations is summarized below. In this analysis, the unit of inference is shifted from individual recorded units to the preparation level (n = 3 baseline; n = 3 at 60 nM AMPA), thereby accounting for potential within-preparation dependence.

      Author response table 1.

      The distribution of units for each of the three preparations per condition is as follows:

      Using the proportion of buccal units per preparation as the dependent variable:

      Baseline (n = 3): mean proportion of buccal units = 6.5% (SD 5.7%).

      60 nM AMPA (n = 3): mean proportion of buccal units = 53.2% (SD 6.0%).

      Absolute difference in proportions = 46.7% (95% CI 33.4% to 59.8%).

      Independent-samples t-test on per-preparation proportions: t(4) = 9.77, p = 0.0006.

      Thus, this preliminary hierarchical reanalysis indicates that the observed recruitment is consistent across preparations and is not driven by outlier data from a single animal. These results support substantial expansion of the buccal oscillator with excitation.

      Statistical Standardization: In the revision, we will better justify our use of parametric and non-parametric versions of the one-sample tests and review usage in the Methods, Table 1, and figure legends for consistency.

      Exclusion criteria for microinjection experiments: We will extend the description of these experiments by including a flow diagram summarizing the 15 attempted microinjection experiments and documenting the technical reasons for the 9 exclusions. These exclusions reflected the technical requirements of the preparation: (a) the buccal area had to be localized before AMPA excitation so that the effects of buccal-area manipulation during excitation could be interpreted reliably, which was not always possible; and (b) preparations had to exhibit sufficiently sustained periods of consecutive buccal bursting to permit quantification of buccal burst frequency, whereas some preparations expressed motor patterns dominated by lung bursts.

      Pharmacological Potency and Necessity: We will revise the wording of this section to make the causal interpretation more precise. Our data already show that local GABA microinjections can reverse the excitatory effects of local AMPA microinjections, providing an internal control for local pharmacological efficacy of GABA when the local network is excited. Notably, the local AMPA concentration used in these experiments (5 µM) is nearly two orders of magnitude greater than the 60 nM concentration used in bath application. We therefore interpret the failure of focal GABA inhibition to abolish rhythm during global excitation as being consistent with expansion of rhythmogenic capacity beyond the spatial reach of the local injection, rather than with failure of the GABA manipulation itself.

      Finding an inhibitory site that remains sensitive in bath applied AMPA is an interesting experiment but this would require identifying the anatomical substrate of a brainstem circuit for a non-ventilatory circuit in Rana that is guaranteed not to undergo reconfiguration with AMPA. This is beyond the scope of the current manuscript; based on our work to identify the neuronal substrate for ventilation in Rana, this would take at least five years to complete. In addition, having identified such a circuit there would be no guarantee that AMPA would not cause reconfiguration in this case too. With regards to transection boundaries and location of injections, we agree these would be useful refinements. We used the location of nerves as reliable landmarks to guide transections and located the buccal area using stereotactic coordinates to guide micropipette insertion and functional criteria (AMPA and GABA sufficiency and necessity tests) to locate the exact position based on our previous work.

      Unit Classification: We will review the nomenclature we use to define units to ensure it does not cause confusion and provide more explicit criteria for unit classes. This will include clarification of the absence of “buccal-only” units as currently defined. Specifically, when both buccal and lung rhythms are present, units active during buccal bursts are also active during lung bursts in our preparation. This does not conflict with the multiple interacting oscillator model we have proposed previously. Rather, recruitment of buccal-area neurons during lung bursts is consistent with a model in which the lung oscillator excites the buccal oscillator. It is also consistent with prior evidence that lung bursts persist after buccal-area ablation. In addition, burst frequency during lung episodes exceeds buccal burst frequency during intervening buccal periods. We will revise the text to make this logic clearer.

      Reviewer #2 (Public review):

      (1) Degeneracy vs. Redundancy

      We agree that degeneracy is the more precise term for the phenomenon our data demonstrate, in which structurally and functionally distinct neurons (lung units) acquire the capacity to participate in buccal rhythm generation under excitation. The Discussion already uses this language (e.g., "necessity and sufficiency may not work in a large degenerate network where rhythm generation is distributed across many elements"), but we used the word "redundant" in the Key Points Summary and Abstract in the broader sense of distributed robustness that a wider readership could grasp. Nonetheless, we recognize the distinction drawn by Goaillard and Marder (2021) and, considering the reviewers concerns, we will revise the Abstract and Key Points to adopt the degeneracy framework consistently.

      (2) Loss of Essential Requirement for a Discrete Oscillator

      The reviewer asks whether expansion of the rhythmically active region necessarily implies loss of the rhythmogenic kernel. We believe our necessity and sufficiency experiments (Figure 9) directly address this. Under baseline conditions, GABA microinjection into the buccal area reliably abolishes buccal bursting; under 60 nM bath AMPA, the same injection at the same location and volume has no significant effect on buccal frequency. If the kernel remained essential and the surrounding recruitment were merely supplementary, local inhibition of the kernel should still slow or abolish the rhythm. It does not. We interpret this as evidence that the essential requirement for the discrete buccal area is lost under excitation, not merely that a larger area has been recruited around a still-critical core. We acknowledge, however, that the word "lost" could be read as implying permanent elimination rather than state-dependent suspension, and we will temper this language in the revision.

      (3) Novelty Relative to Mammalian Studies

      We appreciate the reviewer drawing attention to the cited mammalian literature (Del Negro et al., 2002, 2009; Baertsch et al., 2018, 2019), which we discuss in detail in the manuscript. However, we respectfully note that our findings extend this literature in several ways that the public review does not acknowledge. First, Baertsch et al. demonstrated recruitment of tonic or silent neurons to become phasically active during inspiration; we show that neurons already assigned to one oscillator phase (lung) can be dynamically reassigned to another (buccal), which represents a qualitatively different form of reconfiguration. Second, we developed a novel approach to functionally ablate motor neuron pools using high-frequency nerve stimulation, enabling the unit survey to be interpreted at the premotor level which was not achieved in the mammalian studies cited. Third, our data provide the first demonstration of state-dependent oscillator expansion in a non-mammalian tetrapod, offering evolutionary context that strengthens the generality of the principle. We will revise the term "promiscuous" if it overstates the claim, but we maintain that our data support the conclusion that oscillator boundaries are flexible, which goes beyond what has been shown in mammals.

      (4) Figure 6, CN5 Output Under AMPA

      The reviewer asks whether the shift in premotor unit composition is reflected in CN5 motor output. This is a reasonable question. As noted in the manuscript, 60 nM AMPA produces only minor changes in the overt motor pattern as recorded from CN5, which is precisely why we interpret the premotor changes as a reorganization of the network's internal architecture that is not readily apparent from motor output alone. This is in sharp contrast to observations of substantive network reconfiguration in mammals in which eupnea is replaced by the pathological condition of gasping. We will add quantification of CN5 burst parameters (amplitude, duration, frequency) under baseline and 60 nM AMPA to make this point explicit.

      (5) Subthreshold Recruitment vs. Network Expansion

      The reviewer suggests that neurons classified as newly rhythmic under AMPA may have been part of the rhythmic network all along, receiving subthreshold inputs at baseline. We are grateful to the reviewer for highlighting this and hope they would agree that the literature clearly demonstrates that all respiratory neurons receive subthreshold phasic inputs of one kind or another, perhaps providing a clue that reconfiguration is a common feature of respiratory networks generally. Regardless of the implications for other animals, we agree this is likely the mechanism at work in the frog, and indeed our manuscript states that "this increase in the number and proportion of premotor buccal units is due in part to recruitment of sub-threshold buccal neurons that, under low excitability, only fire during lung bursts," citing intracellular evidence from Kogo and Remmers (1994) that lung neurons in this region receive subthreshold buccal-timed input. We note that this observation does not diminish our conclusion and likely explains the mechanism by which network expansion occurs. Whether one calls these neurons "newly recruited" or "pushed above threshold," the functional consequence is the same: a larger population of neurons is now rhythmically active during buccal bursts, and the necessity of the original buccal area is lost. We will clarify this reasoning in the revision and acknowledge the limitation that additional intracellular recordings from our preparation would be needed to fully characterize the subthreshold dynamics.

      (6) Figure 8, Epoch Length and Meta-analysis

      The reviewer notes that the pre-AMPA epoch appears shorter than the post-AMPA epoch in Figure 8A, which could bias unit classification. We will address this in the revision by reporting epoch durations explicitly and addressing its implication on spike counts where appropriate. Regarding the request for meta-analysis of lung unit spiking during baseline buccal bursts: this analysis is part of the rationale for the phase-recruitment panels, and we will expand Figure 8 to include the requested cross-condition comparisons (lung unit activity during baseline buccal bursts, and during post-AMPA lung bursts) as also suggested by Reviewer 3.

      (7) Figure 9, Buccal-to-Lung Burst Ratio

      The reviewer observes that the ratio of buccal to lung bursts decreases from approximately 4-5:1 under baseline to 2-3:1 under 60 nM AMPA, and suggests this is inconsistent with conversion of lung units into buccal units. We do not believe this is inconsistent. The buccal-to-lung burst ratio reflects the overt motor pattern, which is determined by the interaction of multiple oscillators and is influenced by AMPA at both buccal and lung levels. A change in this ratio does not speak to whether individual premotor units have acquired buccal-timed activity; the unit survey and the single-unit transformation data (Figure 8) address that question directly. Regarding the alternative model involving efference copy and cross-inhibition: this is an interesting hypothesis, but it is speculative and not tested by the current dataset. We are happy to discuss lung-buccal interactions more fully in the revision, including the parallels to parafacial/preBötC interactions in mammals, but we note that our data on unit transformation are better explained by network reconfiguration than by a feedback model that remains to be tested.

      (8) "Independent" Slices

      The reviewer compares our Level 2 transection to the preBötC sandwich slice preparation and argues the two resulting slices are not independent. We take the reviewer's point that "independent" may be read as implying no shared developmental or functional origin, which is not our intent. By "independent" we mean that the two physically separated slices can each generate rhythmic output without being synaptically connected to each other. This is, in fact, our central point: rhythmogenic capacity is distributed across a region broad enough to endow two separated slices with independent rhythm-generating capability when excited. We note that the analogy to the sandwich slice is imperfect because in our Level 1 cuts, only the rostral slice containing the buccal area generates rhythm -- the caudal slice does not -- whereas Level 2 cuts that bisect the buccal area produce rhythmicity in both halves, consistent with distributed capacity specifically within the buccal region. We will revise the wording to clarify what we mean by "independent" in this context.

      Reviewer #3 (Public review):

      Physiological Parallels: We will expand the Discussion to place these findings in a broader comparative context, including the eupnea-to-gasping transition in mammals as an example of state-dependent reconfiguration of respiratory networks. This will also allow us to clarify two advances that may otherwise be missed when comparing our work to that in mammals: (a) we developed a novel approach to functionally eliminate motor neurons, allowing mapped units to be interpreted as premotor; and (b) the state-dependent reconfiguration of the buccal oscillator occurred without qualitative changes in the overt lung-buccal motor pattern.

      Unit Transformation Analysis: We will revise Figure 8 to improve clarity around the observed lung-to-buccal transformation by expanding the phase-recruitment panels as suggested and will revisit the operational definitions of lung and buccal unit identity to reduce ambiguity. The central observation is that some units active only during lung bursts under one condition become active during buccal bursts when network excitation is increased.

      Saturation vs. Network Expansion: We will directly address the possibility that 60 nM bath-applied AMPA simply pushes the network toward a frequency ceiling. Two observations strongly argue against this interpretation: (a) 60 nM global AMPA produced only mild changes in buccal frequency, whereas local AMPA injection at much higher concentrations produced larger effects; and (b) local GABA was sufficient to reverse the effects of high-concentration local AMPA microinjections but insufficient to abolish rhythm during low-concentration global AMPA application. Together, these findings are more consistent with global AMPA endowing the network with distributed rhythm-generating capacity than with simple saturation of a discrete local oscillator. Notwithstanding these arguments, we will attempt to extend AMPA/GABA dose response experiment as suggested or add the lack of such experiments as a caveat to our interpretation.

      Figure 9C Correction: We will correct the statistical markings in Figure 9C to align with the text in the Results regarding the significance of frequency changes under 60 nM AMPA.

      In total, we believe these revisions will improve the rigor and clarity of the manuscript while preserving the central conclusion supported by the data: that the organization of the frog respiratory rhythmogenic network is state dependent and becomes more distributed under excitation.

    1. Reviewer #1 (Public review):

      This manuscript investigates how dentate gyrus (DG) granule cell subregions, specifically suprapyramidal (SB) and infrapyramidal (IB) blades, are differentially recruited during a high cognitive demand pattern separation task. The authors combine TRAP2 activity labeling, touchscreen-based TUNL behavior, and chemogenetic inhibition of adult-born dentate granule cells (abDGCs) or mature granule cells (mGCs) to dissect circuit contributions.

      This manuscript presents an interesting and well-designed investigation into DG activity patterns under varying cognitive demands and the role of abDGCs in shaping mGC activity. The integration of TRAP2-based activity labeling, chemogenetic manipulation, and behavioral assays provides valuable insight into DG subregional organization and functional recruitment. However, several methodological and quantitative issues limit the interpretability of the findings. Addressing the concerns below will greatly strengthen the rigor and clarity of the study.

      Major points:

      (1) Quantification methods for TRAP+ cells are not applied consistently across panels in Figure 1, making interpretation difficult. Specifically, Figure 1F reports TRAP+ mGCs as density, whereas Figure 1G reports TRAP+ abDGCs as a percentage, hindering direct comparison. Additionally, Figure 1H presents reactivation analysis only for mGCs; a parallel analysis for abDGCs is needed for comparison across cell types.

      (2) The anatomical distribution of TRAP+ cells is different between low- and high-cognitive demand conditions (Figure 2). Are these sections from dorsal or ventral DG? Is this specific to dorsal DG, as itis preferentially involved in cognitive function? What happens in ventral DG?

      (3) The activity manipulation using chemogenetic inhibition of abDGCs in AsclCreER; hM4 mice was performed; however, because tamoxifen chow was administered for 4 or 7 weeks, the labeled abDGC population was not properly birth-dated. Instead, it consisted of a heterogeneous cohort of cells ranging from 0 to 5-7 weeks old. Thus, caution should be taken when interpreting these results, and the limitations of this approach should be acknowledged.

      (4) There is a major issue related to the quantification of the DREADD experiments in Figure 4, Figure 5, Figure 6, and Figure 7. The hM4 mouse line used in this study should be quantified using HA, rather than mCitrine, to reliably identify cells derived from the Ascl lineage. mCitrine expression in this mouse line is not specific to adult-born neurons (off-targets), and its expression does not accurately reflect hM4 expression.

      (5) Key markers needed to assess the maturation state of abDGCs are missing from the quantification. Incorporating DCX and NeuN into the analysis would provide essential information about the developmental stage of these cells.

      Minor points:

      (1) The labeling (Distance from the hilus) in Figure 2B is misleading. Is that the same location as the subgranular zone (SGZ)? If so, it's better to use the term SGZ to avoid confusion.

      (2) Cell number information is missing from Figures 2B and 2C; please include this data.

      (3) Sample DG images should clearly delineate the borders between the dentate gyrus and the hilus. In several images, this boundary is difficult to discern.

      (4) In Figure 6, it is not clear how tamoxifen was administered to selectively inhibit the more mature 6-7-week-old abDGC population, nor how this paradigm differs from the chow-based approach. Please clarify the tamoxifen administration protocol and the rationale for its specificity.

      Comments on revisions:

      I appreciate the authors' careful and thorough revisions. They have addressed all of my previous concerns satisfactorily, and the manuscript is now significantly strengthened. I have no further concerns.

    2. Reviewer #3 (Public review):

      This study examines the role of dentate gyrus neuronal populations, reflecting neurogenesis and anatomical location (suprapyramidal vs infrapyramidal blade), in a mnemonic discrimination task that taxes the pattern separation functions of the dentate. The authors measure dentate gyrus activity resulting from cognitive training and test whether adult neurogenesis is required for both the anatomical patterns of activity and performance in the cognitive task. The authors find that more cognitively challenging variants of the task evoked more dentate activity, but also distinct patterns of activity (more activity in the suprapyramidal blade, less in the infdrapyramidal blade). Using chemogenetic approaches they silence mature vs immature dentate gyrus neurons and find that only mature neurons (either the general population or specifically mature adult-born neurons), and not immature adult-born neurons, are required for the difficult version of the task. Inhibition of mature adult-born neurons furthermore increased overall activity in the dentate and reduced the biased pattern of activity across the blades, consistent with evidence that adult-born neurons broadly regulate dentate gyrus activity.

      Comments on revisions:

      I appreciate the efforts the authors have taken to revise this manuscript. I have only minor concerns with this revised version of the manuscript:

      Methods state that significance is defined as P<0.05 but some results are interpreted as significant when P=0.05. Either the alpha value needs to change or the interpretation needs to change.

      I believe the statistical results for group and blade effects for the ANOVAs, in Figs 2,3 & 4, appear to be switched (blade should be significant, not group).

      I appreciate that sometimes there is not a perfect overlap between immunohistochemical signals, but I continue to believe that the spatially-non-overlapping TRAP and EDU signals in Fig 3 is caused by these 2 markers being in different cells. A Z-stack or orthogonal projection could verify/disprove this concern.

    3. Author Response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      This manuscript investigates how dentate gyrus (DG) granule cell subregions, specifically suprapyramidal (SB) and infrapyramidal (IB) blades, are differentially recruited during a high cognitive demand pattern separation task. The authors combine TRAP2 activity labeling, touchscreen-based TUNL behavior, and chemogenetic inhibition of adult-born dentate granule cells (abDGCs) or mature granule cells (mGCs) to dissect circuit contributions.

      This manuscript presents an interesting and well-designed investigation into DG activity patterns under varying cognitive demands and the role of abDGCs in shaping mGC activity. The integration of TRAP2-based activity labeling, chemogenetic manipulation, and behavioral assays provides valuable insight into DG subregional organization and functional recruitment. However, several methodological and quantitative issues limit the interpretability of the findings. Addressing the concerns below will greatly strengthen the rigor and clarity of the study.

      Major points:

      (1) Quantification methods for TRAP+ cells are not applied consistently across panels in Figure 1, making interpretation difficult. Specifically, Figure 1F reports TRAP+ mGCs as density, whereas Figure 1G reports TRAP+ abDGCs as a percentage, hindering direct comparison. Additionally, Figure 1H presents reactivation analysis only for mGCs; a parallel analysis for abDGCs is needed for comparison across cell types.

      In Figure 1G and 1H we report TRAP+ abDGCs as a percentage rather than density because we are analyzing colocalization of the two markers, which are very sparse in this population. Given the very low number of double-labeled abDGCs, calculating density would not be practical. In the revised manuscript we have clarified the rationale for using these measures. As noted in the current text, we did not observe abDGCs co-expressing TRAP and c-Fos; we have made this point more explicit to guide interpretation of these data.

      (2) The anatomical distribution of TRAP+ cells is different between low- and high-cognitive demand conditions (Figure 2). Are these sections from dorsal or ventral DG? Is this specific to dorsal DG, as it is preferentially involved in cognitive function? What happens in ventral DG?

      The sections shown in Figure 2 were obtained from the dorsal dentate gyrus (see Methods, “Histology and imaging”: stereotaxic coordinates −1.20 to −2.30 mm relative to bregma, Paxinos atlas). From a feasibility standpoint, it is not possible to analyze the entire longitudinal extent of the hippocampus with these low-throughput histological approaches. We therefore focused on the dorsal DG, for which there is a strong functional rationale. A large body of work indicates that the dorsal hippocampus, and specifically the dorsal DG, is preferentially involved in spatial memory and in the fine contextual discrimination that underlies pattern separation. The dorsal hippocampus is critical for encoding and distinguishing similar spatial representations, a core component of the high-cognitive demand task used here. In contrast, the ventral DG is more strongly associated with emotional regulation and affective memory processing and is less implicated in high-resolution spatial encoding. For these reasons, the present study was designed to assess TRAP+ cell distributions specifically in the dorsal DG.

      (3) The activity manipulation using chemogenetic inhibition of abDGCs in AsclCreER; hM4 mice was performed; however, because tamoxifen chow was administered for 4 or 7 weeks, the labeled abDGC population was not properly birth-dated. Instead, it consisted of a heterogeneous cohort of cells ranging from 0 to 5-7 weeks old. Thus, caution should be taken when interpreting these results, and the limitations of this approach should be acknowledged.

      We agree that prolonged tamoxifen administration results in labeling a heterogeneous population of abDGCs spanning approximately 0 to 5–7 weeks of age, rather than a precisely birth-dated cohort. This is a limitation of this approach and we have included discussion of this in more detail in the revised manuscript.

      (4) There is a major issue related to the quantification of the DREADD experiments in Figure 4, Figure 5, Figure 6, and Figure 7. The hM4 mouse line used in this study should be quantified using HA, rather than mCitrine, to reliably identify cells derived from the Ascl lineage. mCitrine expression in this mouse line is not specific to adult-born neurons (off-targets), and its expression does not accurately reflect hM4 expression.

      We agree that mCitrine is not a marker that allows localization of hM4Di as it is well known that the mCitrine can be independently expressed in a Cre independent manner in this mouse. As suggested, we have removed the figure that showed the mCitrine and have performed immunohistochemical localization of the DREADD with an antibody against the HA tag. This is now shown in Figure 5.

      (5) Key markers needed to assess the maturation state of abDGCs are missing from the quantification. Incorporating DCX and NeuN into the analysis would provide essential information about the developmental stage of these cells.

      The goal of this study was to examine activity patterns of adult-born versus mature granule cells, rather than to assess maturation state. The adult-born neurons analyzed were 25–39 days old, an age at which point most cells have progressed beyond the DCX⁺ stage and are expected to express NeuN based on prior work. We therefore do not think that including DCX or NeuN quantification would provide additional information relevant to the aims or interpretation of this study.

      Minor points:

      (1) The labeling (Distance from the hilus) in Figure 2B is misleading. Is that the same location as the subgranular zone (SGZ)? If so, it's better to use the term SGZ to avoid confusion.

      We have updated Figure 2B, the Methods, and the main text to more explicitly localize this which it the boundary between the subgranular zone (SGZ) and the hilus.

      (2) Cell number information is missing from Figures 2B and 2C; please include this data.

      We have now added the cell number information to the figure legends. In Figures 2B and 2C, each point corresponds to a single cell, with an equal number of mice per group. The total number of TRAP⁺ cells per mouse is shown in Figure 1F, which reports TRAP⁺ cell densities by group.

      (3) Sample DG images should clearly delineate the borders between the dentate gyrus and the hilus. In several images, this boundary is difficult to discern.

      We made the DG-hilus boundaries clearer in the sample images to improve visualization and interpretation.

      (4) In Figure 6, it is not clear how tamoxifen was administered to selectively inhibit the more mature 6-7-week-old abDGC population, nor how this paradigm differs from the chow-based approach. Please clarify the tamoxifen administration protocol and the rationale for its specificity.

      We apologize for the confusion here. The protocol used in Figure 6 is the same tamoxifen chow–based approach as in Figure 5, differing only in the duration of tamoxifen exposure. Mice in Figure 5 received tamoxifen chow for 7 weeks, whereas mice in Figure 6 received it for 4 weeks, restricting labeling to a younger and narrower cohort of adult-born DGCs. Thus, the population targeted in Figure 6 is younger than that in Figure 5 and does not correspond to mature 6–7-week-old neurons. By contrast, the experiment in Figure 4 targets a more mature population, consisting predominantly of ~5-week-old adult-born neurons as well as mature granule cells, which are Dock10-positive and express Cre endogenously, allowing selective manipulation of this later-stage population.

      We have corrected the paragraph accordingly and clarified the age range of the labeled populations in the revised manuscript.

      Reviewer #2 (Public review):

      Summary

      In this manuscript, the authors combine an automated touchscreen-based trial-unique nonmatching-to-location (TUNL) task with activity-dependent labeling (TRAP/c-Fos) and birth-dating of adult-born dentate granule cells (abDGCs) to examine how cognitive demand modulates dentate gyrus (DG) activity patterns. By varying spatial separation between sample and choice locations, the authors operationally increase task difficulty and show that higher demand is associated with increased mature granule cell (mGC) activity and an amplified suprapyramidal (SB) versus infrapyramidal (IB) blade bias. Using chemogenetic inhibition, they further demonstrate dissociable contributions of abDGCs and mGCs to task performance and DG activation patterns.

      The combination of behavioral manipulation, spatially resolved activity tagging, and temporally defined abDGC perturbations is a strength of the study and provides a novel circuit-level perspective on how adult neurogenesis modulates DG function. In particular, the comparison across different abDGC maturation windows is well designed and narrows the functionally relevant population to neurons within the critical period (~4-7 weeks). The finding that overall mGC activity levels, in addition to spatially biased activation patterns, are required for successful performance under high cognitive demand is intriguing.

      Major Comments

      (1) Individual variability and the relationship between performance and DG activation.

      The manuscript reports substantial inter-animal variability in the number of days required to reach the criterion, particularly during large-separation training. Given this variability, it would be informative to examine whether individual differences in performance correlate with TRAP+ or c-Fos+ density and/or spatial bias metrics. While the authors report no correlation between success and TRAP+ density in some analyses, a more systematic correlation across learning rate, final performance, and DG activation patterns (mGC vs abDGC, SB vs IB) could strengthen the interpretation that DG activity reflects task engagement rather than performance only.

      As mentioned, we previously reported no correlation between task success and TRAP+ density. We have now performed additional analyses examining correlations with learning rate, final performance, and DG activation patterns (mGC vs abDGC, SB vs IB), and found no significant relationships. Therefore, as we did not find any positive correlations the original interpretation that DG activity primarily reflects task engagement rather than performance level seems the most parsimonious.

      (2) Operational definition of "cognitive demand".

      The distinction between low (large separation) and high (small separation) cognitive demand is central to the manuscript, yet the definition remains somewhat broad. Reduced spatial separation likely alters multiple behavioral variables beyond cognitive load, including reward expectation, attentional demands, confidence, engagement, and potentially motivation. The authors should more explicitly acknowledge these alternative interpretations and clarify whether "cognitive demand" is intended as a composite construct rather than a strictly defined cognitive operation.

      We agree that reducing spatial separation between stimuli likely engages multiple behavioral and cognitive processes beyond a single, strictly defined operation. We have now clarified this point in the manuscript and explicitly state that our use of the term “cognitive demand” reflects a multidimensional behavioral challenge rather than a singular cognitive process (see Discussion).

      (3) Potential effects of task engagement on neurogenesis.

      Given the extensive behavioral training and known effects of experience on adult neurogenesis, it remains unclear whether the task itself alters the size or maturation state of the abDGC population. Although the focus is on activity and function rather than cell number, it would be useful to clarify whether neurogenesis rates were assessed or controlled for, or to explicitly state this as a limitation.

      While the primary goal of this study was to examine activity and functional recruitment of adult-born granule cells, we also quantified the survival of birth-dated neurons at the end of behavioral training. Density measurements of BrdU⁺ and EdU⁺ cells revealed no differences across experimental groups, indicating that engagement in the pattern separation task, across low to high cognitive demand conditions, did not significantly alter survival of adult-born neurons. In addition, we examined the spatial distribution of BrdU⁺ and EdU⁺ neurons between the suprapyramidal and infrapyramidal blades of the dentate gyrus. The proportion of newborn neurons was consistent across all groups, with approximately 60% located in the suprapyramidal blade and 40% in the infrapyramidal blade. These findings indicate that behavioral training did not alter the baseline distribution of adult-born neurons. We have now clarified these points in the manuscript (See Results).

      (4) Temporal resolution of activity tagging.

      TRAP and c-Fos labeling provide a snapshot of neural activity integrated over a temporal window, making it difficult to determine which task epochs or trial types drive the observed activation patterns. This limitation is partially acknowledged, but the conclusions occasionally imply trial-specific or demand-specific encoding. The authors should more clearly distinguish between sustained task engagement and moment-to-moment trial processing, and temper interpretations accordingly. While beyond the scope of the current study, this also motivates future experiments using in vivo recording approaches.

      We agree and have made changes to the manuscript to discuss these points (see Discussion and Limitations).

      (5) Interpretation of altered spatial patterns following abDGC inhibition.

      In the abDGC inhibition experiments, Cre+ DCZ animals show delayed learning relative to controls. As a result, when animals are sacrificed, they may be at an intermediate learning stage rather than at an equivalent behavioral endpoint. This raises the possibility that altered DG activation patterns reflect the learning stage rather than a direct circuit effect of abDGC inhibition. Additional clarification or analysis controlling for the learning stage would strengthen the causal interpretation.

      We agree that differences in learning stage could in principle confound the interpretation of DG activation patterns. However, although Cre+ DCZ-treated mice exhibited delayed learning, they ultimately reached the same performance criterion as control animals. Thus, adult-born DGC inhibition did not prevent learning but increased the time required to reach criterion, indicating that these neurons are beneficial for learning efficiency rather than strictly necessary for task acquisition. Importantly, all animals were sacrificed only after reaching the predefined success criterion. Therefore, the immunohistochemical analyses were performed at the same behavioral endpoint for Cre+ DCZ and control groups, even though the number of training days differed. Consequently, the observed differences in DG activation reflect circuit recruitment at equivalent task mastery rather than differences in learning stage.

      (6) Relationship between c-Fos density and behavioral performance.

      The study reports that abDGC inhibition increases c-Fos density while impairing performance, whereas mGC inhibition decreases c-Fos density and also impairs performance. This raises an important conceptual question regarding the relationship between overall activity levels and task success. The authors suggest that both sufficient activity and appropriate spatial patterning are required, but the manuscript would benefit from a more explicit discussion of how different perturbations may shift the identity, composition, or coordination of the active neuronal ensemble rather than simply altering total activity levels.

      We agree that our findings highlight that successful performance is not determined solely by the overall level of dentate gyrus activity, but rather by the composition and spatial organization of the active neuronal ensemble. In our study, inhibition of abDGCs increased overall mGC activity while disrupting the spatially organized, blade-biased activation pattern and impaired performance. In contrast, direct inhibition of mGCs reduced global excitability but preserved the relative spatial organization of active neurons in animals that continued to perform the task. These findings suggest that different perturbations alter task performance by shifting the identity and coordination of the active neuronal ensemble, rather than simply increasing or decreasing total activity levels. We have now expanded the Discussion to more explicitly address how dentate gyrus computations may depend on the structured recruitment of granule cell ensembles and how distinct manipulations differentially disrupt this organization.

      Reviewer #3 (Public review):

      Summary:

      The authors used genetic models and immunohistochemistry to identify how training in a spatial discrimination working memory task influences activity in the dentate gyrus subregion of the hippocampus. Finding that more cognitively challenging variants of the task evoked more and distinct patterns of activity, they then investigated whether newborn neurons in particular were important for learning this task and regulating the spatial activity patterns.

      Strengths:

      The focus on precise anatomical locations of activity is relatively novel and potentially important, given that little is known about how DG subregions contribute to behavior. The authors also use a task that is known to depend on this memory-related part of the brain.

      Weaknesses:

      Statistical rigor is insufficient. Many statistical results are not stated, inappropriate tests are used, and sample sizes differ across experiments (which appear to potentially underlie null results). The chemogenetic approach to inhibit adult-born neurons also does not appear to be targeting these neurons, as judged by their location in the DG.

      Please refer to the updated statistical analyses in response to the recommendations below.

      Recommendations for the authors:

      Reviewing Editor Comments

      Please note that reviewers agreed that appropriate revisions are needed to increase the strength of evidence for the paper's claims. Concerns were raised about a lack of statistical rigor in the statistical analyses used. Results of statistical tests were not consistently provided (i.e., statistic applied, value of statistic, degrees of freedom, p-value), and seemingly inappropriate statistical tests were used in some instances. Also, some comparisons had lower statistical power than others. When clarifying the statistical approaches used in the manuscript, we also encourage you to consider reading this article that outlines common statistical mistakes (Makin TR, Orban de Xivry JJ. Ten common statistical mistakes to watch out for when writing or reviewing a manuscript. Elife. 2019 Oct 9;8:e48175. doi: 10.7554/eLife.48175.), such as the importance of not basing conclusions on a significant p-value for one pair-wise comparison vs a non-significant p-value for another pairwise comparison (i.e., groups that are being compared should be included in the same statistical analysis, and interaction effects should be reported when appropriate). We hope that you find this information to be helpful should you decide to submit a revised manuscript to eLife.

      Reviewer #1 (Recommendations for the authors):

      (1) Standardize TRAP+ quantification across Figure 1.

      Please report TRAP+ cell numbers using consistent metrics (e.g., density or percentage) to enable comparison across cell types. In addition, extend the TRAP+ reactivation analysis in Figure 1H to include abDGCs so that reactivation dynamics can be compared directly between mGCs and abDGCs.

      Reply in Public Review

      (2) Clarify whether dorsal or ventral DG was analyzed in Figure 2.

      The differing anatomical distributions of TRAP+ cells under low- and high-demand conditions raise important questions about DG axis specificity. Please indicate whether analyses were performed in dorsal DG, ventral DG, or both, and provide data or justification accordingly.

      Reply in Public Review

      (3) Acknowledge limitations of the tamoxifen-chow labeling strategy in AsclCreER; hM4 experiments.

      Since tamoxifen chow administered over 4-7 weeks labels a heterogeneous abDGC population spanning a broad age range, this approach does not generate birth-dated cohorts. This limitation should be clearly addressed in the text and interpretations, particularly related to cell age-dependent effects, should be tempered.

      Reply in Public Review

      (4) Revise DREADD quantification using HA rather than mCitrine.

      The hM4 mouse line requires HA immunostaining to accurately identify Ascl-lineage cells expressing the DREADD receptor. Because mCitrine is not specific to adult-born neurons and does not reliably reflect hM4 expression, quantification based on mCitrine should be revised.

      Reply in Public Review

      (5) Include markers to assess abDGC maturation state.

      Adding quantification of DCX and NeuN would help define the developmental stage of abDGCs in key experiments and improve the interpretation of cell-age-dependent effects.

      Reply in Public Review

      (6) Clarify DG layer boundaries and terminology in Figure 2.

      If the metric labeled "Distance from the hilus" corresponds to the subgranular zone (SGZ), using SGZ terminology would prevent confusion. Additionally, please provide clearer delineation of DG and hilus borders in sample images.

      Reply in Public Review

      (7) Provide missing cell number data for Figures 2B and 2C.

      Reply in Public Review

      (8) Clarify the tamoxifen administration protocol in Figure 6.

      Please describe how the protocol selectively targets 6-7-week-old abDGCs and how it differs from the chow-based approach. This will help readers understand the intended specificity of the manipulation.

      Reply in Public Review

      Reviewer #2 (Recommendations for the authors):

      (1) EdU birth-dating timeline

      The manuscript would benefit from a clearer description of the EdU birth-dating timeline, ideally with a schematic similar to that provided for BrdU in Supplementary Figure 1.

      We appreciate the suggestion. However, we did not include a separate schematic for EdU because its use and birth-dating logic are identical to BrdU (both are thymidine analogs administered systemically and incorporated during S-phase). Therefore, the timeline shown in Supplementary Figure 1 applies equally to both markers. We have clarified this point in the Methods section to avoid confusion.

      (2) Clarity of TUNL task description.

      The description of the TUNL task, particularly for readers unfamiliar with touchscreen-based paradigms, is difficult to follow without consulting prior literature. A simplified schematic or a clearer step-by-step explanation in the main text or supplementary material would improve accessibility.

      We note that the main steps of the TUNL protocol are illustrated in Figure 1A, Supplementary Figure 2A and 2B. Nevertheless, we agree that the description in the text can be made clearer for readers less familiar with touchscreen-based tasks. Thus , we have now revised the Methods section to provide a clearer step-by-step description of the TUNL.

      (3) Influence of outliers in Figure 1G.

      In Figure 1G, the reported trend that ~1% of 25-39-day-old abDGCs are TRAP+ during LS trials appears to be driven by a small number of outliers. This should be acknowledged, and the wording of the conclusion moderated to reflect the variability in the data.

      We agree with the reviewer that the apparent outliers reflect the inherent sparsity of TRAP labeling in this population. In absolute terms, this corresponds to between 0 and 2 TRAP⁺ 25–39-day-old abDGCs per mouse, such that the presence or absence of a small number of labeled cells can appear as outliers when expressed as a percentage. We have revised the text to acknowledge this (see Results).

      (4) Presentation of learning curves.

      Rather than focusing primarily on "days before criterion" (DBC), it would be helpful to show full learning curves across the entire training period. This would provide a clearer picture of acquisition dynamics and inter-animal variability.

      We agree that learning curves can be informative in many behavioral paradigms. However, in our protocol, mice do not undergo the same number of training days because training stops individually once each animal reaches criterion. As a result, plotting full learning curves would produce trajectories of different lengths, making group comparisons difficult and visually cluttered. For this reason, we aligned animals based on days before criterion (DBC), which allows direct comparison of learning dynamics relative to task acquisition. We also consider the cumulative probability representation to be the most appropriate way to summarize learning progression across animals in this context which are also included in the figures.

      (5) Clarification of Figure 3B labeling

      In Figure 3B, the identity of the orange-labeled group above the LS condition is unclear. Clarification in the figure legend would improve interoperability.

      Figure 3B includes two experimental groups. One group performed both the large- and small-separation conditions; this group is shown in orange and labeled LS. Within this group, the upper orange trace corresponds to performance in the large-separation condition, while the lower orange trace corresponds to performance in the small-separation condition. The second group is a control group that performed only the large-separation configuration, and therefore only a single green trace is shown. We agree that this distinction was not sufficiently clear and have revised the figure legend and text to clarify the identity of each trace.

      Reviewer #3 (Recommendations for the authors):

      (1) Please label figures and, even better, put the legends on the same page.

      (2) Just to confirm, in establishing the task, mice performed above 70% for the small separation trials in one of the sessions on 2 consecutive days, for each criterion? Performance seems to be below 70%.

      Yes. To meet the criterion, each mouse had to reach ≥70% correct performance in at least one of the two daily sessions on two consecutive days. We then averaged the performance across both sessions for each of those days. As a result, if one session was ≥70% but the other was lower, the daily average could fall below 70%. The values shown in the figure correspond to these daily averages, further averaged across mice.

      (3) mGC needs to be explicitly defined. Am I assuming any non-birthdated GC is an mGC according to the authors? (which means it is unknown whether they are in fact mature, though likely most of them are).

      In this study, “mature granule cells” (mGCs) refer operationally to granule cells that are not birth-dated with BrdU or EdU and therefore are not classified as adult-born neurons within the defined labeling window. We agree that this population is not directly age-defined, and that while the majority are expected to be mature based on their birth timing relative to the labeling period, we cannot exclude the possibility that a small fraction may include younger, unlabeled neurons. We have now explicitly defined this usage of mGCs in the Methods and clarified this point in the text to avoid ambiguity.

      (4) Methods state that Kruskal-Wallis tests were used when more than 3 groups were compared, but I don't see these stats presented (e.g., for trap data in Figure 1, blade x task TRAP expt in Figure 3 (should be 2-way RM anova here and elsewhere), etc) or any corrections for multiple comparisons. I appreciate that the mean rates of TRAPed abGCs are higher in the S and LS groups than in the shaping group, but most mice do not have any BrdU+ cells that are also TRAPed, and there are no statistics here to support the claim. I don't think there is enough sampling to accurately quantify activation of abGCs. Also, no stats to support the claim that TRAPing increases at the "tip of the SB after the more demanding LS task".

      We agree with this comment. We have now systematically tested all datasets for normality (by group) and applied parametric tests when the data met normality assumptions, and non-parametric tests otherwise. The statistical analyses have been revised accordingly. We added the appropriate tests (including two-way ANOVA where relevant, such as for blade × group comparisons) and now report full statistics in the figure legends and results sections. For the TRAP analyses in adult-born DGCs, we explicitly acknowledge the very low number of BrdU⁺/TRAP⁺ cells, which limits statistical power and, in some cases, precludes robust statistical testing. These limitations are now clearly stated in the Results and Discussion, and the corresponding interpretations have been tempered. For all Kruskal–Wallis tests, post hoc pairwise comparisons were performed using Dunn’s test, with Bonferroni correction for multiple comparisons, as now specified in the Methods section. We also expanded the Methods to describe the statistical workflow in detail. In addition, we have added the previously missing statistical analysis for Figure 2C. Comparisons were performed between the 0–50% and 50–100% portions of the blade, where 0% corresponds to the apex and 100% corresponds to the distal tip of the blade.

      (5) Figure 3I: I can't figure out which effect is statistically significant here (what does the asterisk signify?). Why no individual data points in this graph?

      We agree that the absence of individual data points reduced interpretability, and we have now updated the figure to include individual data points to better illustrate data distribution and variability.

      (6) The gradient of activity (shap < S < LS) could be due to how long they've been trained on a given stage (e.g. less activity during shaping because they have habituated, and neurons encoding that task phase have already been selected)

      We agree that task duration and habituation could, in principle, influence activity levels. Under this interpretation, higher activity would primarily reflect task novelty rather than cognitive demand. However, our data do not support this explanation. Specifically, we found no correlation between the number of training days required to reach criterion and c-Fos–positive or TRAP-positive cell density within a given stage. Thus, animals that reached criterion rapidly did not show higher activity levels than animals that required more days of training and were presumably more habituated to the task demands. This suggests that the observed activity gradient (shaping < S < LS) is not driven by exposure duration or habituation, but rather reflects differences in cognitive demand across task stages.

      (7) The TRAP+ EDU+ cell in Figure 3 looks odd because the BrdU signal is (a lot) larger than the TRAP signal, but BrdU is in the nucleus and should be smaller.

      We agree that the example in Figure 3 is not optimal. In dividing cells, BrdU/EdU signals can sometimes appear broader or closely apposed, which may affect their apparent size.

      (8) For the Ascl-HM4Di experiment, HM4Di appears to be expressed in all of the areas of the granule cell layer where abGCs are NOT located (i.e. no expression in the deep cell layer, near the sgz). This is problematic because it suggests perhaps abGCs are not inhibited as expected.

      As noted in our response to Reviewer #1, we did not use the mCitrine to localize the DREADD receptor as it has been demonstrated that mCitrine expression is expressed in a Cre-independent manner and not correlated with hM4Di expression. In the revised manuscript we include a representative image were we performed immunostaining using an HA antibody to directly visualize hM4Di and confirm its expression in adult-born granule cells (Figure 5).

      (9) Line 267: "6-7 week old neurons by themselves do not influence either the performance of mice in the task". I don't think this is fair because this experiment wasn't designed with as much power to detect an effect. The group trends are in the same direction, but there are many fewer mice in this experiment (n=6/group) than in the =<7w experiment (n=11/group), where the effect just reached statistical significance.

      We are sorry for this confusion which came from an incorrect version. The experiment shown in Figure 6 does not target 6–7-week-old neurons specifically. It uses the same tamoxifen chow–based protocol as Figure 5, but with a shorter exposure (4 weeks vs. 7 weeks), thereby labeling a younger and more restricted cohort of adult-born DGCs. By contrast, Figure 4 targets a more mature population, consisting predominantly of ~5-week-old adult-born neurons as well as mature granule cells (Dock10+).

      We have corrected the paragraph accordingly and clarified the age range of the labeled populations in the revised manuscript.

    1. Author Response:

      The following is the authors’ response to the original reviews.

      We thank the reviewers for their constructive comments. A central concern raised is the comparison of performance with existing motion-correction methods. In response, we performed motion correction using several widely used approaches and compared results using the number of particles detected by 2DTM and their associated SNR. To minimize potential bias, we selected parameters to give each method a comparable level of model flexibility so that the results are as directly comparable as possible. Overall, Unbend performs the best. We note that extensive, method-specific parameter optimization could further affect absolute performance, and a comprehensive benchmarking study is therefore beyond the scope of this work

      Public Reviews:

      Reviewer #1 (Public review):

      Kong et al.'s work describes a new approach that does exactly what the title states: "Correction of local beam-induced sample motion in cryo-EM images using a 3D spline model." I find the method appropriate, logical, and well-explained. Additionally, the work suggests using 2DTM-related measurements to quantify the improvement of the new method compared to the old one in cisTEM, Unblur. I find this part engaging; it is straightforward, accurate, and, of course, the group has a strong command of 2DTM, presenting a thorough study.

      However, everything in the paper (except some correct general references) refers to comparisons with the full-frame approach, Unblur. Still, we have known for more than a decade that local correction approaches perform better than global ones, so I do not find anything truly novel in their proposal of using local methods (the method itself- Unbend- is new, but many others have been described previously). In fact, the use of 2DTM is perhaps a more interesting novelty of the work, and here, a more systematic study comparing different methods with these proposed well-defined metrics would be very valuable. As currently presented, there is no doubt that it is better than an older, well-established approach, and the way to measure "better" is very interesting, but there is no indication of how the situation stands regarding newer methods.

      Regarding practical aspects, it seems that the current implementation of the method is significantly slower than other patch-based approaches. If its results are shown to exceed those of existing local methods, then exploring the use of Unbend, possibly optimizing its code first, could be a valuable task. However, without more recent comparisons, the impact of Unbend remains unclear.

      We thank the reviewer for this important point. We agree that comparing against modern local motion-correction approaches is a valuable task. To address this, we added a new benchmarking section (pp. 17–18, lines 444–492, Fig. 8, Fig. 8—figure supplement 1) that compares Unbend against widely used patch-based local correction methods, including MotionCor2, MotionCor3, Warp, and CryoSPARC. Using the same 2DTM-based metrics described in the manuscript (detections per micrograph and SNR distributions for commonly detected particles), we find that Unbend provides the most stable performance across the tested datasets and, in most cases, yields higher detection counts and higher SNR than the alternative methods.

      Regarding runtime, the current implementation is CPU-based and is therefore slower than some optimized GPU-accelerated packages. We now clarify this limitation in the manuscript (line 498–499). Our primary goal in this study is to improve motion-correction accuracy and quantify its impact using 2DTM-based measures. Importantly, higher-quality motion-corrected micrographs can reduce downstream processing cost (e.g., by increasing particle detection efficiency and reducing ambiguous candidates), so modest additional compute times at the motion-correction stage can be offset later in the workflow. We also note that GPU acceleration and additional code-level optimizations are planned for future releases (line 501-503); however, they are not required to evaluate the methodological contribution and the benchmarking results presented here.

      Reviewer #2 (Public review):

      Summary:

      The authors present a new method, Unbend, for measuring motion in cryo-EM images, with a particular emphasis on more challenging in situ samples such as lamella and whole cells (that can be more prone to overall motion and/or variability in motion across a field of view). Building on their previous approach of full-frame alignment (Unblur), they now perform full-frame alignment followed by patch alignment, and then use these outputs to generate a 3D cubic spline model of the motion. This model allows them to estimate a continuous, per-pixel shift field for each movie frame that aims to better describe complex motions and so ultimately generate improved motion-corrected micrographs. Performance of Unbend is evaluated using the 2D template matching (2DTM) method developed previously by the lab, and results are compared to using full-frame correction alone. Several different in situ samples are used for evaluation, covering a broad range that will be of interest to the rapidly growing in situ cryo-EM community.

      Strengths:

      The method appears to be an elegant way of describing complex motions in cryo-EM samples, and the authors present convincing data that Unbend generally improves SNR of aligned micrographs as well as increases detection of particles matching the 60S ribosome template when compared to using full-frame correction alone. The authors also give interesting insights into how different areas of a lamella behave with respect to motion by using Unbend on a montage dataset collected previously by the group. There is growing interest in imaging larger areas of in situ samples at high resolution, and these insights contribute valuable knowledge. Additionally, the availability of data collected in this study through the EMPIAR repository will be much appreciated by the field.

      Thank you for this positive assessment.

      Weaknesses:

      While the improvements with Unbend vs. Unblur appear clear, it is less obvious whether Unbend provides substantial gains over patch motion correction alone (the current norm in the field). It might be helpful for readers if this comparison were investigated for the in situ datasets. Additionally, the authors are open that in cases where full motion correction already does a good job, the extra degrees of freedom in Unbend can perhaps overfit the motions, making the corrections ultimately worse. I wonder if an adaptive approach could be explored, for example, using the readout from full-frame or patch correction to decide whether a movie should proceed to the full Unbend pipeline, or whether correction should stop at the patch estimation stage.

      We thank the reviewer for suggesting an adaptive criterion to decide whether to proceed patch alignment or not. We agree that such an approach could be valuable for efficiency and for avoiding unnecessary model flexibility. However, our results indicate that a simple criterion based on the magnitude of estimated local patch motion is unlikely to be sufficient. For example, in the BS-C-1 cell lysate dataset, (see line 412-417 on page 16), we observe minimal local motion (Figure 4b) with mean patch shifts of only 0.7Å and full-frame alignment already yields comparable detection counts, yet local correction still produces a measurable SNR gain (13.84 ± 0.04 to 14.25 ± 0.04, 3%) and improves SNR for ~70% of the commonly detected targets (Figure 6c). This suggests that residual local distortion can remain even when overall local motion appears small. Establishing a robust, dataset-agnostic stopping rule would therefore require a dedicated, systematic benchmarking study across many samples and acquisition conditions.

      Reviewer #3 (Public review):

      Summary

      Kong and coauthors describe and implement a method to correct local deformations due to beam-induced motion in cryo-EM movie frames. This is done by fitting a 3D spline model to a stack of micrograph frames using cross-correlation-based local patch alignment to describe the deformations across the micrograph in each frame, and then computing the value of the deformed micrograph at each pixel by interpolating the undeformed micrograph at the displacement positions given by the spline model. A graphical interface in cisTEM allows the user to visualise the deformations in the sample, and the method has been proven to be successful by showing improvements in 2D template matching (2DTM) results on the corrected micrographs using five in situ samples.

      Impact

      This method has great potential to further streamline the cryo-EM single particle analysis pipeline by shortening the required processing time as a result of obtaining higher quality particles early in the pipeline, and is applicable to both old and new datasets, therefore being relevant to all cryo-EM users.

      Strengths

      (1) One key idea of the paper is that local beam induced motion affects frames continuously in space (in the image plane) as well as in time (along the frame stack), so one can obtain improvements in the image quality by correcting such deformations in a continuous way (deformations vary continuously from pixel to pixel and from frame to frame) rather than based on local discrete patches only. 3D splines are used to model the deformations: they are initialised using local patch alignments and further refined using cross-correlation between individual patch frames and the average of the other frames in the same patch stack.

      (2) Another strength of the paper is using 2DTM to show that correcting such deformations continuously using the proposed method does indeed lead to improvements. This is shown using five in situ datasets, where local motion is quantified using statistics based on the estimated motions of ribosomes.

      Thank you for this positive assessment.

      Weaknesses

      (1) While very interesting, it is not clear how the proposed method using 3D splines for estimating local deformations compares with other existing methods that also aim to correct local beam-induced motion by approximating the deformations throughout the frames using other types of approximation, such as polynomials, as done, for example MotionCor2.

      We thank the reviewer for this suggestion. We agree that positioning Unbend relative to existing local motion-correction methods is important. In the revised manuscript, we added a dedicated benchmarking section comparing Unbend with widely used local correction approaches, including MotionCor2, MotionCor3, Warp, and CryoSPARC, using the same 2DTM-based metrics (Fig. 8, Fig. 8—figure supplement 1). This section is included on pp. 17–18, lines 444–492. To make the comparison as fair as possible, we matched nominal model flexibility across methods and otherwise used default parameters to reduce method-specific tuning. This expanded comparison provides a direct baseline against current patch-/spline-based approaches and shows that Unbend performs consistently across the in situ datasets evaluated here, with improvements in detection counts and/or SNR in multiple cases.

      (2) The use of 2DTM is appropriate, and the results of the analysis are enlightening, but one shortcoming is that some relevant technical details are missing. For example, the 2DTM SNR is not defined in the article, and it is not clear how the authors ensured that no false positives were included in the particles counted before and after deformation correction. The Jupyter notebooks where this analysis was performed have not been made publicly available.

      We agree that these technical details improve clarity and reproducibility. We have therefore made three changes.

      (1) Definition of 2DTM SNR. We added an explicit definition of the 2DTM SNR in Section “2DTM provides a one-step verification for motion correction”, pp. 11, lines 277–287). Briefly, at each image location we compute cross-correlation values over the searched orientation space and define the 2DTM SNR as the maximum per location z-score across orientations.

      (2) False-positive control / detection threshold. We clarified how detection thresholds were set to control false positives (pp. 11, lines 285–287). Specifically, we used the standard 2DTM statistical framework in which the threshold  is chosen using the one-false-positive (1-FP) criterion (or equivalently, a specified expected false-positive rate). We applied the same thresholding procedure consistently across all motion-corrected micrographs. This ensures that particle counts before/after correction reflect changes in signal recovery.

      (3) Reproducibility of the analysis. We have made the script used for the benchmarking and figure generation publicly available (pp. 24 line 622-623), and we provide a link in the Data Availability statement (pp. 25 line 650). The repository includes sample .star files and a python package that computes detections per micrograph, commonly detected particles, and SNR comparisons.

      (3) It is also not clear how the proposed deformation correction method is affected by CTF defocus in the different samples (are the defocus values used in the different datasets similar or significantly different?) or if there is any effect at all.

      We thank the reviewer for raising this point. In the revised manuscript, we now report the defocus ranges used for each dataset (Table 1) and clarify that all motion-correction comparisons were performed within each dataset using the same CTF estimation and 2DTM settings (pp. 23 line 615-618). Across the five datasets, four were collected at similar defocus ranges (1.0 µm to 1.5µm), whereas one dataset includes near-focus (0.4 µm) micrographs (Table 1). Because Unbend operates on frame alignment/warping rather than CTF modeling, we do not expect a defocus specific effect beyond indirect influences through image SNR and reliability of cross-correlation-based alignment.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      The obvious recommendation would be to use their 2DTM approach for a comparison of their new method with other currently used ones

      We agree and added a new comparison section (pp. 17–18, lines 444–492). Addressed above in Response to Reviewer #1 Public Review.

      Reviewer #2 (Recommendations for the authors):

      (1) Line 29, typo. 3 ~ 8% > 3 - 8%.

      Corrected.

      (2) Lines 220 and 226. Should this be e-/Angstrom squared for the exposure?

      Corrected to e<sup>-</sup>/Å<sup>2</sup> (Now pp. 9 lines 230, 236).

      (3) Figure 2 c-d. These are good for instinctively seeing the movement, but I found the legend confusing, as a 10 x 10 pixel array is mentioned, yet the schematics show a higher sampling (30 x 30 pixels? in c-e).

      Thank you for pointing this out. The “10×10” annotation refers to the physical scale, whereas the grid represents pixel sampling. We removed the “10×10” label and now show only the pixel grid to avoid confusion. The caption has been updated to state that the grid corresponds to a 30×30 pixel sampling. (Fig. 2c, d; pp. 31, line 766)

      (4) Figure 4. It would be good if the n of movies analyzed was given in the figure legend.

      Thank you for noticing this. We report the number of movies per dataset in the corresponding summary table (Table 1).

      (5) Figure 5. X/Y axes labels missing (assume pixels). Also, suggest changing the strain scale to % to match the main text description of this figure.

      We added X/Y axis labels, changed the strain scale to % (Figure 5), and specified that the strains are per pixel on pp. 14 line 367. Correspondingly, the X/Y labels and strain scale in strain plots in Figure 4—figure supplementary 1 to 5 are also changed.

      (6) Unify labelling of Figure 4 and 6 (i.e., Bacteria vs. M. pneumoniae, etc.).

      Corrected. Sample labels are now consistent across figures. (Figures 4 and 6)

      Reviewer #3 (Recommendations for the authors):

      Some recommendations related to the points mentioned in the 'Weaknesses' section in the public review:

      (1) If feasible, it would be useful to see a comparison with other existing methods that estimate local deformations (e.g., MotionCor2), at least on some of the datasets. For example, does the proposed method lead to better 2DTM SNR in the detected particles compared to other methods, or higher detection numbers? Alternatively, if such a comparison would require too much additional work and the authors have good reasons to believe that the results are evident, it would be helpful to include a discussion about why the proposed method is expected to perform better, both in terms of the general approach and specific implementation details.

      We agree that this comparison is important. (pp. 17–18, lines 444–492). Addressed above in Response to Reviewer #3 Public Review (1).

      (2) It would be useful to define the 2DTM SNR in the main text of the paper, as well as to address the point about false positives in the picked particles.

      We added an explicit definition of 2DTM SNR and clarified the detection thresholding/false-positive control used in our analysis (pp. 11, lines 277–287). Addressed above in Response to Reviewer #3 Public Review (2.1 and 2.2).

      (3) Regarding the results shown in Figures 4 and 6: do the authors have any insight about how the CTF defocus affects the deformation estimation and correction across the different sample types?

      We now report the defocus ranges used for each dataset (Table 1). We have addressed this problem in Response to Reviewer #3 Public Review (3).

      (4) Will the Jupyter notebooks used for the 2DTM analysis be made publicly available?

      Yes. We have deposited a python script used for the 2DTM benchmarking and figure generation in a public repository and added the link in Data Availability statement. (pp. 23 line 622, pp. 25 line 650). Addressed above in Response to Reviewer #3 Public Review (2.3).

      (5) I would also appreciate a few words about the implementation details of the 3D spline model (e.g., what libraries have been used, if any, or if the authors have implemented their own code for this).

      The 3D spline model and warping code were implemented by us (no external spline library was used) and the relevant implementation details are described in the “Sample distortion modeling and correction” section (pp. 7–10, lines 174–246). For optimization, we used the L-BFGS implementation provided by the dlib library, which is now explicitly cited (pp. 10, line 264).

      Some comments regarding the presentation of the work:

      (1) I found the mathematical background on splines on pages 7-9 a little distracting from the main ideas of the paper, and I believe it could be moved to the methods section. A short description of this in the main text of the paper would suffice, and it would be useful to state clearly when this is background material and when it is the authors' contribution.

      We appreciate the suggestion. Because Unbend includes an in-house spline implementation (no external spline library) and it is the central part of this work, we retained the spline description to support reproducibility. (pp. 7–10, lines 174–246).

      (2) More generally, I found the whole method very interesting, but understanding exactly what all the steps involved were was a bit cumbersome, as they are spread across different sections of the main text. I think it would be useful to have a dedicated section giving the exact steps taken in the algorithm, possibly pointing to the relevant section in the text for more details about each step. This could be, for example, in the form of an 'Algorithm' box or a flowchart.

      We added an Algorithm box as Figure 2 supplement summarizing the end-to-end workflow and pointing to the relevant sections for details (Figure 2—figure supplement 1 Algorithm, pp. 4, line 96–103, pp. 32 line 799). This is intended to make the sequence of steps easier to follow.

      (3) In Figure 3, panels (b) and (c), the difference between the two micrographs, before and after correction, is not very noticeable, particularly the Thon rings in the spectra. I don't know if this is due to the image quality in the paper or if a better example could be shown. For example, the differences are clear in some of the supplementary figures.

      Thank you for the suggestion. We revised the figure by adding annotations to show the recovered Thon rings. This figure shows a vertex motion and is intended not only to show improvement but also to illustrate complex, spatially varying deformation patterns that motivate the 3D spline model (pp. 12, lines 304–308). The supplementary figures display those with highest motions in each sample type, thus the Thon rings for the motion corrected micrograph in higher frequency space look more obvious. We also refer readers to the supplementary examples where the differences are more pronounced (pp. 12, lines 310–312).

    1. Author Response:

      The following is the authors’ response to the previous reviews

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      Here Bansal et al., present a study on the fundamental blood and nectar feeding behaviors of the critical disease vector, Anopheles stephensi. The study encompasses not just the fundamental changes in blood feeding behaviors of the crucially understudied vector, but then use a transcriptomic approach to identify candidate neuromodulation path ways which influence blood feeding behavior in this mosquito species. The authors then provide evidence through RNAi knockdown of candidate pathways that the neuromodulators sNPF and Rya modulate feeding either via their physiological activity in the brain alone or through joint physiological activity along the brain-gut axis (but critically not the gut alone). Overall, I found this study to be built on tractable, well-designed behavioral experiments.

      Their study begins with a well-structured experiment to assess how the feeding behaviors of A. stephensi changes over the course of its life history and in response to its age, mating and oviposition status. The authors are careful and validate their experimental paradigm in the more well-studied Ae. aegypti, and are able to recapitulate the results of prior studies which show that mating is pre-requisite for blood feeding behaviors in Ae. aegypt. Here they find A. stephensi like another Anopheline mosquitoes has a more nuanced regulation of its blood and nectar feeding behaviors.

      The authors then go on to show in a Y- maze olfactometer that to some degree, changes in blood feeding status depend on behavioral modulation to host-cues, and this is not likely to be a simple change to the biting behaviors alone. I was especially struck by the swap in valence of the host-cues for the blood-fed and mated individuals which had not yet oviposited. This indicates that there is a change in behavior that is not simply desensitization to host-cues while navigating in flight, but something much more exciting happening.

      The authors then use a transcriptomic approach to identify candidate genes in the blood feeding stages of the mosquito's life cycle to identify a list of 9 candidates which have a role in regulating the host-seeking status of A. stephensi. Then through investigations of gene knockdown of candidates they identify the dual action of RYa and sNPF and candidate neuromodulators of host-seeking in this species. Overrall, I found the experiments to be welldesigned. I found the molecular approach to be sound. While I do not think the molecular approach is necessarily an all-encompassing mechanism identification (owing mostly to the fact that genetic resources are not yet available in A. stephensi as they are in other dipteran models), I think it sets up a rich lines of research questions for the neurobiology of mosquito behavioral plasticity and comparative evolution of neuromodulator action.

      Strengths:

      I am especially impressed by the authors' attention to small details in the course of this article. As I read and evaluated this article I continued to think how many crucial details I may have missed if I were the scientist conducting these experiments. That attention to detail paid off in spades and allowed the authors to carefully tease apart molecular candidates of blood-seeking stages. The authors top down approach to identifying RYamide and sNPF starting from first principles behavioral experiments is especially comprehensive. The results from both the behavioral and molecular target studies will have broad implications for the vectorial capacity of this species and comparative evolution of neural circuit modulation.

      I believe the authors have adequately addressed all of my concerns; however, I think an accompanying figure to match the explained methods of the tissue-specific knockdown would help readers. The methods are now explicitly written for the timing and concentrations required to achieve tissue-specific knockdown, but seeing the data as a supplement would be especially reassuring given the critical nature of tissue-specific knockdown to the final interpretations of this paper.

      We thank the reviewer for the suggestion and have now incorporated a schematic in the supplementary figure S9B, explaining our methodology for achieving tissue-specific knockdowns.

      Reviewer #2 (Public review):

      Summary:

      In this manuscript, Bansal et al examine and characterize feeding behaviour in Anopheles stephensi mosquitoes. While sharing some similarities to the well-studied Aedes aegypti mosquito, the authors demonstrate that mated-females, but not unmated (virgin) females, exhibit suppression in their blood-feeding behaviour. Using brain transcriptomic analysis comparing sugar fed, blood fed and starved mosquitoes, several candidate genes potentially responsible for influencing blood-feeding behaviour were identified, including two neuropeptides (short NPF and RYamide) that are known to modulate feeding behaviour in other mosquito species. Using molecular tools including in situ hybridization, the authors map the distribution of cells producing these neuropeptides in the nervous system and in the gut. Further, by implementing systemic RNA interference (RNAi), the study suggests that both neuropeptides appear to promote blood-feeding (but do not impact sugar feeding) although the impact was observed only after both neuropeptide genes underwent knockdown.

      While the authors have addressed most of the concerns of the original manuscript, a few issues remain. Particularly, the following two points:

      (5) Figure 4

      The authors state that there is more efficient knockdown in the head of unfed females; however, this is not accurate since they only get knockdown in unfed animals, and no evidence of any knockdown in fed animals (panel D). This point should be revised in the results test as well.

      Perhaps we do not understand the reviewer's point or there has been a misunderstanding. In Figure 4D, we show that while there is more robust gene knockdown in unfed females, bloodfed females also showed modest but measurable knockdowns ranging from 5-40% for RYamide and 2-21% for sNPF.

      NEW-

      In both the dsRNA treatments where animals were fed, neither was significantly different from control. Therefore, there is no change, and indeed this is confirmed by the author's labelling of the figure stats in panel 4D.

      We agree with the reviewer and thank them for pointing it out. We have now revised the figure legend and the text to reflect these results (see lines 351-354).

      In addition, do the uninjected and dsGFP-injected relative mRNA expression data reflect combined RYa and sNPF levels? Why is there no variation in these data,...

      In these qPCRs, we calculated relative mRNA expression using the delta-delta Ct method (see line 975). For each neuropeptide its respective control was used. For simplicity, we combined the RYa and sNPF control data into a single representation. The value of this control is invariant because this method sets the control baseline to a value of 1.

      NEW-

      The authors are claiming that there is no variation between individual qPCR experiments (particularly in their controls)? Normally, one uses a known standard value (or calibrator) across multiple experiments/plates so that variation across biological replicates can be assessed. This has an impact on statistical analyses since there is no variation in the control data. Indeed, this impacts all figures/datasets in the manuscript where qPCR data is presented. All the controls have zero variation!

      We are truly thankful to this reviewer for insisting on this point. It has made us revisit what we thought we understood and now realise were doing wrong (though many in literature do it this way!). We were – incorrectly – setting each control to 1 and calculating relative fold changes for each replicate independently. While this is often seen in literature, we now realise that it is incorrect. We have revisited all our analyses and normalized all samples to the mean ΔCt of the control group, which captures biological variation in both control and experimental groups. All data are now re-plotted to show individual data points for both control and experimental groups, and the error bars on controls represent the biological variation across replicates (Figure 4D, 4F, 4G, S8, S9). Statistical analyses were also revised accordingly, and, importantly, they do not change any conclusions. Please note that the abdominal expression of sNPF and RYa are so low that the controls show very variable baseline expression values.

      Reviewer #3 (Public review):

      Summary:

      This manuscript investigates the regulation of host-seeking behavior in Anopheles stephensi females across different life stages and mating states. Through transcriptomic profiling, the authors identify differential gene expression between "blood-hungry" and "blood-sated" states. Two neuropeptides, sNPF and RYamide, are highlighted as potential mediators of host-seeking behavior. RNAi knockdown of these peptides alters host-seeking activity, and their expression is anatomically mapped in the mosquito brain (sNPF and RYamide) and midgut (sNPF only).

      Strengths:

      (1) The study addresses an important question in mosquito biology, with relevance to vector control and disease transmission.

      Transcriptomic profiling is used to uncover gene expression changes linked to behavioral states.

      (2) The identification of sNPF and RYamide as candidate regulators provides a clear focus for downstream mechanistic work.

      (3) RNAi experiments demonstrate that these neuropeptides are necessary for normal hostseeking behavior.

      (4) Anatomical localization of neuropeptide expression adds depth to the functional findings.

      Weaknesses:

      (1) The title implies that the neuropeptides promote host-seeking, but sufficiency is not demonstrated and some conclusions appear premature based on the current data. The support for this conclusion would be strengthened with functional validation using peptide injection or genetic manipulation.

      (2) The identification of candidate receptors is promising, but the manuscript would be significantly strengthened by testing whether receptor knockdowns phenocopy peptide knockdowns. Without this, it is difficult to conclude that the identified receptors mediate the behavioral effects.

      (3) Some important caveats, such as variation in knockdown efficiency and the possibility of offtarget effects, are not adequately discussed.

      These comments were addressed in the previous round.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Awesome paper everyone. A delight to read and review.

      Thank you very much! We appreciated your comments too!

    1. Reviewer #1 (Public review):

      Summary:

      The authors aimed to investigate how short-term visual deprivation influences tactile processing in the primary somatosensory cortex (S1) of sighted rats. They justify the study based on previous studies that have shown that long-term blindness can enhance tactile perception, and aim to investigate the change in neural representations underlying rapid, short-term cross-modal effects. The authors recorded local field potentials from S1 as rats encountered different tactile textures (smooth and rough sandpaper) under light and dark conditions. They used deep learning techniques to decode the neural signals and assess how tactile representations changed across the four different conditions. Their goal was to uncover whether the absence of visual cues leads to a rapid reorganization of tactile encoding in the brain.

      Strengths:

      The study effectively integrates high-density local field potential (LFP) recordings with convolutional neural network (CNN) analysis. This combination allows for decoding high-dimensional population-level signals, revealing changes in neural representations that traditional analyses (e.g., amplitude measures) failed to detect. The custom treadmill paradigm permits independent manipulation of visual and tactile inputs under stable locomotion conditions. Gait analysis confirms that motor behavior was consistent across conditions, strengthening the conclusion that neural changes are due to sensory input rather than movement artifacts.

      Weaknesses:

      (1) While the study interprets the emergence of more distinct texture representations in the dark as evidence of rapid cross-modal plasticity, the claim rests on correlational data from a short-term manipulation and decoding analysis. The authors show that CNN-derived feature embeddings cluster more clearly by texture in the dark, but this does not directly demonstrate plasticity in the classical sense (e.g., synaptic or circuit-level reorganization). The authors have noted this as a limitation and have clarified that the observed changes reflect functional reorganization rather than structural plasticity.

      (2) Although gait was controlled, changes in arousal or exploratory behavior in light versus dark conditions might play a role in the observed neural differences. The authors have controlled for various factors in relation to locomotion, but future studies would benefit from more direct behavioural readouts of arousal states (e.g., via pupillometry or cortical state indicators).

      (3) It should be noted that the time course of the observed changes (within 10 minutes) is quite rapid, and while intriguing, the study does not include direct evidence that the underlying circuits were reorganized-only that population-level signals become more discriminable. The authors have adequately discussed this as an avenue for more mechanistic future research.

      (4) The authors have adequately discussed that, while these findings are consistent with somatotopy and context-dependent dynamics, they do not provide strong independent evidence for novel spatial or temporal organization.

      (5) The authors have also discussed that, while the neural data suggest enhanced tactile representations, the study does not assess whether rats' actual tactile perception improved. Future studies including an assessment of a behavioral readout (e.g., discrimination accuracy), would be insightful.

      (6) The authors' discussion about the implications for sensory rehabilitation, including Braille training and haptic feedback enhancement was a bit premature, but they have amended this, and it remains an interesting translational potential to be explored in future studies.

      (7) While the CNN showed good performance, more transparent models (e.g., linear classifiers or dimensionality reduction) appear to not exceed chance level. The implications of this are that there is an underlying complex structure in the LFPs that has yet to be fully uncovered, on the mechanistic level. This would be important to push the findings forward in future studies.

      Therefore, while the authors raise interesting hypotheses around rapid plasticity, somatotopic dynamics, and rehabilitation, the evidence for each is indirect. Stronger claims will require future causal experiments, behavioral readouts, and mechanistic specificity beyond what the current data provides. However, the work represents an interesting starting point to a more mechanistic understanding in the future.

    2. Author Response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      (1) While the study interprets the emergence of more distinct texture representations in the dark as evidence of rapid cross-modal plasticity, the claim rests on correlational data from a short-term manipulation and decoding analysis. The authors show that CNN-derived feature embeddings cluster more clearly by texture in the dark, but this does not directly demonstrate plasticity in the classical sense (e.g., synaptic or circuit-level reorganization).

      Thank you for this insightful comment. We acknowledge that our claim of “rapid cross-modal plasticity” is based on correlational evidence and does not directly address synaptic or circuit-level reorganization, which would require more invasive methods. Our study instead focuses on changes in the representational structure of tactile stimuli when visual input is temporarily removed, highlighting the adaptability of sensory coding to environmental context. We agree that this distinction is important and have revised the manuscript to clarify that the observed changes reflect functional reorganization rather than structural plasticity, as indicated by the enhanced separability of texture representations in S1 during darkness.

      (2) Although gait was controlled, changes in arousal or exploratory behavior in light versus dark conditions might contribute to the observed neural differences. These factors are acknowledged but not directly measured (e.g., via pupillometry or cortical state indicators).

      Thank you for your insightful comment. We agree that arousal and exploratory behavior could influence neural differences and have considered these factors in our study. While gait was controlled, we did not directly measure arousal (e.g., via pupillometry or cortical indicators).

      To partially address this, we reviewed locomotor-speed traces (Supplementary Figure 1), which showed no significant differences between light and dark conditions, suggesting movement speed did not drive the neural differences. We also reversed the order of light and dark conditions, and although the separability of textures was not significantly different, it further supports that motivation did not confound our results.

      However, we acknowledge that arousal may still affect cortical dynamics, especially in the dark condition, where the lack of visual input might alter exploratory behavior. Due to technical limitations, we could not directly measure arousal states, and this is now discussed in the revised manuscript. While we cannot rule out the influence of arousal, the enhanced separability of texture representations suggests that sensory reorganization due to visual deprivation likely played a substantial role.

      (3) Moreover, the time course of the observed changes (within 10 minutes) is quite rapid, and while intriguing, the study does not include direct evidence that the underlying circuits were reorganized - only that population-level signals become more discriminable. As such, the term "plasticity" may overstate the conclusions and should be interpreted with caution unless validated by additional causal or longitudinal data.

      Thank you for your important comment. We agree that the term "plasticity" may overstate our conclusions, as our study focuses on population-level signal changes rather than direct evidence of circuit-level reorganization.

      To address this, we have revised the manuscript to clarify that while the observed changes in neural separability suggest functional reorganization of sensory representations, they do not confirm structural plasticity. We have updated the wording throughout the manuscript to emphasize that these findings reflect functional reorganization in response to short-term visual input loss, rather than structural or long-term plasticity.

      We also updated the discussion to highlight the need for future research with more invasive approaches to validate the causal mechanisms behind these rapid changes in neural dynamics.

      (4) The study highlights the forelimb region of S1 and a post-contact temporal window as particularly important for decoding texture, based on occlusion and integrated gradient analyses. However, this finding may be somewhat circular: The LFPs were aligned to forelimb contact, and the floor textures were sensed primarily via the forelimbs, making it unsurprising that forelimb electrodes were most informative. The observed temporal window corresponds directly to the event-aligned epoch, and while it may shift slightly in duration in the dark, this could reflect general differences in sensory gain or arousal, rather than changes in stimulus-specific encoding. Thus, while these findings are consistent with somatotopy and context-dependent dynamics, they do not provide strong independent evidence for novel spatial or temporal organization.

      Thank you for your insightful comment. We understand your concern that the finding of forelimb electrodes being most informative might seem circular, given that the LFPs were aligned to forelimb contact, and the floor textures were primarily sensed by the forelimbs. This design choice was intentional, as the task focused on texture perception through the forelimb, and the forelimb subregion of S1 is naturally expected to play a dominant role in this process. While this somatotopic specificity may make the results predictable, our aim was to emphasize the changes in temporal dynamics of neural processing under visual deprivation.

      We observed a shift in the temporal window's duration in the dark condition, which we interpret as a change in how texture information is processed without visual input. While this could reflect sensory gain or arousal differences, the lack of significant differences in locomotor speed or other behavioral measures (Supplementary Figure 1) suggests that these changes are more likely due to functional reorganization of sensory processing.

      We have clarified in the discussion that the shift in the temporal window is consistent with previous research on sensory reorganization involving both spatial and temporal cortical adjustments. While we do not claim novel spatial or temporal organization, we emphasize that the shift in temporal dynamics suggests adaptation in encoding strategy for texture perception in the absence of visual input. Future studies measuring arousal states (e.g., pupil diameter or cortical state markers) would help distinguish the contributions of arousal versus sensory reorganization to these dynamics.

      (5) While the neural data suggest enhanced tactile representations, the study does not assess whether rats' actual tactile perception improved. Without a behavioral readout (e.g., discrimination accuracy), claims about perceptual enhancement remain speculative.

      Thank you for raising this important point. We agree that while the neural data suggest enhanced separability of tactile representations in the dark condition, we do not directly assess whether these changes translate into improved tactile perception behaviorally.

      However, the primary aim of our study is not to claim perceptual enhancement, but to demonstrate that neural representations in the somatosensory cortex can rapidly reorganize in response to visual deprivation. To clarify this distinction, we have revised the manuscript to emphasize that the observed neural changes in S1 are consistent with functional reorganization of tactile representations, rather than a direct indication of perceptual improvement.

      Future studies will be crucial to directly test whether the enhanced separability of tactile representations in S1 correlates with improved tactile perception in a behavioral task. We have highlighted this as an avenue for future research to better understand the link between neural changes and perceptual outcomes.

      (6) In addition to point 4, the authors discuss implications for sensory rehabilitation, including Braille training and haptic feedback enhancement. However, the lack of actual chronic or even more acute pathological sensory deprivation, behavioral data, or subsequent intervention in this study limits the ability to draw translational conclusions. It remains unknown whether the more distinct neural representations observed actually translate into better tactile performance, discriminability, or perception. Additionally, extrapolating from rats walking on sandpaper in the dark to human rehabilitative contexts is speculative without a clearer behavioral or mechanistic bridge. The potential is certainly there, but the claim is currently aspirational rather than empirically grounded.

      Thank you for raising this important point. Upon careful consideration, we have decided to remove the discussion of sensory rehabilitation implications from the revised manuscript. We have refocused the manuscript to concentrate solely on the neural findings related to tactile encoding reorganization in response to short-term sensory deprivation, avoiding speculative extrapolation to human rehabilitative contexts. This revised approach ensures that the manuscript emphasizes the empirical findings without overstating the translational potential.

      (7) While the CNN showed good performance, details on generalization robustness and validation (e.g., cross-validation folds, variance across animals) are not deeply discussed. Also, while explainability tools were used, interpretability of CNNs remains limited, and more transparent models (e.g., linear classifiers or dimensionality reduction) could offer complementary insights.

      We appreciate the reviewer’s valuable feedback. In response to the concern about generalization robustness and validation, we have now conducted 5-fold cross-validation to assess the model's performance within animals (Figure 6C). We also have added supplementary information on the average silhouette scores across the different folds and animals (Supplementary Table 1, 2). These details are provided in the methods section and discussed in the results to offer a clearer picture of the model's robustness and consistency across rats.

      Regarding the interpretability of CNNs, we acknowledge that deep learning models can lack transparency. We also attempted classification using more transparent models such as PCA and SVM, but their performance did not exceed chance level (Supplementary Figure 2). This indicates that while these simpler models are more interpretable, they cannot capture the complex representations in the LFPs, making deep learning models like CNNs necessary for extracting these insights.

      Reviewer #2 (Public review):

      (1) Despite applying explainability techniques to the CNN-based decoder, the study does not clearly demonstrate the precise "subtle, high-dimensional patterns" exploited by the CNN for surface roughness decoding, limiting the physiological interpretability of the results. Additional analyses (e.g., detailed waveform morphology analysis on grand averages, time-frequency decompositions, or further use of explainability methods) are necessary to clarify the exact nature of the discriminative activity features enabling the CNN to decode surface roughness and how these change with the sensory context (i.e., in light or darkness).

      Thank you for your insightful comment. We recognize the importance of clarifying the exact nature of the high-dimensional neural patterns that the CNN exploits for surface roughness decoding. In response, we have performed additional analyses to provide a more detailed explanation of the CNN's decision-making process and the discriminative features it learned:

      Grand-Average LFP Waveforms Analysis: We calculated the grand-average LFP waveforms for each texture × lighting condition (Figure 4A). While visual inspection did not reveal distinct features in the averaged waveforms, we explored the channel-wise correlations between textures under both light and dark conditions (Figure 4B). We found that the correlation between textures was lower in the dark condition, suggesting that LFPs become more distinct between textures when visual input is absent, which aligns with the CNN’s output.

      Time-Frequency Decomposition (Wavelet Analysis): We also performed time-frequency decomposition of the LFPs using wavelet transforms (Figure 4D). No prominent differences emerged across texture × lighting conditions in the spectral domain. However, upon computing differences in wavelet features between light and dark conditions and analyzing the relationship with the CNN's attribution scores (Supplementary Figures 5A-C), we observed a negative correlation in the 50-60 Hz range and a positive correlation in the 80-90 Hz range. This suggests frequency-specific modulation in LFP activity that may contribute to texture representations, providing further support for the CNN’s learned features.

      (2) The claim regarding cross-modal representation reorganization heavily relies on a silhouette analysis (Figure 5C), which shows a modest effect size and borderline statistical significance (p≈0.05 with n=9+2). More rigorous statistical quantification, such as permutation tests and reporting underlying cluster distances for all animals, would strengthen confidence in this finding.

      Thank you for your thoughtful comment. We appreciate your suggestion to strengthen the statistical rigor of our analysis regarding the cross-modal representation reorganization. In response, we have implemented several additional analyses to more rigorously quantify the separability of neural representations between light and dark conditions:

      (1) Permutation Test for Cluster Separability: We performed a permutation test to assess whether the observed differences in cluster separability between light and dark conditions were statistically significant or could have arisen by chance. The results showed that the silhouette scores for the dark condition consistently exceeded the 95th percentile of the null distribution (Supplementary Figure 4). This permutation test strengthens the validity of our findings, indicating that the enhanced separability in darkness is a systematic reorganization of neural representations, not due to random fluctuations.

      (2) Reporting Cluster Distances: To address concerns about the modest effect size and borderline significance, we have explicitly reported the underlying cluster distances in the form of silhouette scores for each individual animal (Supplementary Table 1, 2). These values reflect the Euclidean distance between clusters within each rat, providing a clearer understanding of the separability observed.

      (3) Additional Statistical Analysis on Silhouette Scores: To further enhance the rigor of our statistical analysis, we recalculated the silhouette scores using 5-fold cross-validation within each animal, ensuring that our results are robust across multiple data splits (Figure 6C).

      By incorporating these additional analyses and reporting detailed cluster distances, we believe we have significantly strengthened the confidence in our claim of cross-modal reorganization.

      (3) While the authors recorded in the somatosensory cortex, primarily known for its tactile responsivity, I would be cautious not to rule out a priori the presence of crossmodal (visual) responses in the area. In this case, the stronger texture separation in darkness might be explained by the absence of some visually-evoked potentials (VEPs) rather than genuine cross-modal reorganization. Clarification is needed to rule out visual interference and this would strengthen the claim.

      Thank you for raising this important point. In response to your concern, we carefully examined whether visually-evoked potentials (VEPs) could be present in the S1 recordings, particularly under the light condition. However, we observed that this experiment did not involve any cue-guided visual stimulation, such as flashing lights or visual cues aligned with the LFP recordings. Without such external visual stimuli, it is unlikely that VEPs would be reliably evoked in the S1. Therefore, we believe the stronger texture separation observed in the dark condition is not due to visual interference, but rather reflects a genuine sensory reorganization in response to the absence of visual input.

      (4) Behavioural controls are limited to gross gait parameters; more detailed analyses of locomotor behavior and additional metrics (e.g., pupil size or locomotor variance) would robustly rule out potential arousal or motor confounds.

      Thank you for your insightful comment regarding behavioral controls. In response, we have added locomotor speed traces aligned with corresponding LFPs (Supplementary Figure 1) to demonstrate that locomotion remained consistent across trials, irrespective of environmental condition (light vs. dark). Additionally, we report locomotor speed variance over 10-minute blocks to confirm no significant motor changes affecting neural recordings. These analyses indicate that LFP differences are unlikely due to locomotor confounds.

      While measuring pupil size could be useful for assessing arousal, the camera resolution in our study was insufficient for reliable measurements. We have noted this limitation in the Discussion and recommend that future studies with high-resolution eye-tracking explore arousal's role in sensory processing in S1.

      (5) The consistent ordering of trials (10 minutes of light then 10 minutes of dark) could introduce confounds such as fatigue or satiation (and also related arousal state), which should be controlled by analyzing sessions with reversed condition ordering.

      Thank you for highlighting the potential confounds due to trial ordering. To address this, we reversed the condition order (dark before light) in a subset of sessions from six rats and reanalyzed the data (Supplementary Figure 3). The results showed not significant, but increase separability in the dark condition, suggesting that the enhanced separability in the dark condition is not due to trial order effects like fatigue or satiation. While order effects may contribute to trial-to-trial variability, the consistent pattern of enhanced separability in the dark further supports the interpretation that visual deprivation directly influences the reorganization of tactile representations in S1.

      (6) The focus on forelimb-aligned LFP analyses raises the possibility that hindlimb-aligned data might yield different conclusions, suggesting alignment effects might bias the results.

      Thank you for your insightful comment on the potential bias of forelimb-aligned LFP analyses. We acknowledge that the choice of alignment event can influence the results and appreciate the suggestion to consider hindlimb-aligned data. However, our experimental design specifically focused on forelimb S1. The forelimb region of S1 was oversampled in our array, and as expected, we observed larger responses there, consistent with the known somatotopic organization of S1.

      While hindlimb-aligned data could provide additional insights, it is not directly relevant to the primary question of how forelimb S1 codes tactile information under visual deprivation. We do not believe the forelimb alignment introduces a bias, as it aligns with the sensory task being investigated. However, we recognize the value of exploring alternative alignments and have now included a discussion in the Methods section regarding the rationale for our design choices.

      (7) The authors' dismissal of amplitude-based metrics as ineffective is inadequately substantiated. A clearer demonstration (e.g., event-related waveforms averaged by conditions, presented both spatially and temporally) would support this claim.

      Thank you for your constructive comment. In response, we have added a more detailed analysis of event-related waveforms, averaged across conditions (light vs. dark, smooth vs. rough textures), and presented them spatially and temporally aligned to forelimb contact (Figure 4A). These waveforms did not show clear, distinct features that could differentiate conditions, which highlights the limitations of traditional amplitude-based metrics in detecting subtle neural activity changes related to visual deprivation.

      We further performed channel-wise correlation analyses (Figure 4B), revealing stronger texture correlations in the light condition, indicating that averaged waveforms do not capture the nuanced differences in neural dynamics. Additionally, time-frequency spectrograms and channel–channel correlation matrices (Figures 4C and 4D) did not show distinct condition differences, reinforcing the limitations of amplitude-based metrics.

      These findings, along with the superior performance of machine learning-based decoding methods (e.g., CNN), support our claim that amplitude-based approaches are insufficient for fully capturing the complexity of the neural data.

      (8) Wording ambiguity regarding "attribution score" versus "activation amplitude" (Figure 5) complicates the interpretation of key findings. This distinction must be clarified for proper assessment of the results.

      Thank you for pointing out the ambiguity between "attribution score" and "activation amplitude." To address this, we have revised the manuscript to use "attribution score" only.

      (9) Generalization across animals remains unaddressed. The current within-subject decoding setup limits conclusions regarding shared neural representations across individuals. Adopting cross-validation strategies and exploring between-animal analyses would add significant value to the manuscript.

      Thank you for highlighting the importance of generalization across animals. While our study focused on within-subject decoding, we acknowledge that this limits conclusions about shared neural representations across individuals. We expect that inter-animal generalization would be challenging, as models trained on data from a single rat may not perform well on data from others due to differences in electrode placement, brain anatomy, and neural representations. We recognize the value of cross-validation strategies and between-animal analyses and will consider them in future work to address this limitation.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      (1) I would strongly recommend that the authors refine their introduction to be more concise. Many concepts and study aims are repeated many times and, therefore, present as highly redundant text. The introduction may be half the length and still contain the important concepts to set up the justification for the study. I would also suggest refining to be less about sensory deprivation (e.g., with blindness) and more in relation to context, as the acute nature of the study allows one to conclude more about the latter than the former.

      Thank you for your feedback on the introduction. We have revised the section to reduce redundancy and present the key concepts more concisely. We also streamlined the study aims and focused more on the context of the acute nature of the study, as you suggested, rather than emphasizing sensory deprivation. This revision better aligns with the main focus of the research and improves clarity. We believe the updated introduction provides a more direct justification for the study.

      (2) I am not sure if Figures 1-3 are meant to be in grey-scale for some reason (perhaps to represent light and dark), but I would encourage the authors to examine if this is necessary, as the use of color generally helps one more easily follow Figures.

      Thank you for this suggestion. Upon review, we agree that the use of color would enhance the clarity and readability of our figures. We have revised the figures including the newly added supplementary figures to incorporate color.

      (3) Figure 5, Figure legend title - check wording.

      Thank you for pointing this out. The title has been adjusted for consistency with the other figure legends.

      Reviewer #2 (Recommendations for the authors):

      (1) Analyses that would strengthen the main claims (major):

      (a) Identify the features exploited by the CNN.

      (i) Provide grand-average LFP waveforms for each texture × lighting condition (fore- and hind-limb channels shown separately, spatially arranged as in Figure 3C) and try to relate them to the decoding strategy learned by the CNN.

      Thank you for your helpful suggestion. We have calculated the grand-average LFP waveforms for each texture × lighting condition and included them in Figure 4A, with fore- and hind-limb channels shown separately and spatially arranged as in Figure 3C. Upon visual inspection, the mean waveforms did not reveal clear, distinct features. To further investigate, we computed the channel-wise correlation between different textures under both dark and light conditions. By subtracting the correlation coefficients for the dark environment from those in the light, we observed that the correlation between textures was lower in the dark environment (Figure 4B). This suggests that LFPs are more distinct between textures in the dark, supporting the CNN model's output. However, this also indicates that the CNN has captured more complex, nuanced information, as it is able to discriminate between LFPs on a single-trial basis, rather than relying on mean traces.

      To assess how the correlation between average LFP waveforms varied across channels, we also calculated the channel-channel correlation matrix for all 32 channels in each condition. While we found stronger correlations within each S1 subregion, we did not observe clear differences of correlation matrix between light and dark conditions, nor between different textures (Figure 4C).

      (ii) Add channel-wise and time-frequency maps (e.g., wavelet or spectrograms) for each texture × lighting condition and try to relate them to the decoding strategy learned by the CNN.

      Thank you for the valuable suggestion. We calculated wavelet features for each LFP segment and averaged them across trials to assess differences in LFP between light and dark conditions, as well as across textures (Figure 4D). However, no distinct differences were observed in the spectral map. To investigate further, we computed the differences in spectral maps for LFPs in light and dark trials. We then calculated the difference in attribution scores derived from the integrated gradient map (Supplementary Figure 4A). Subsequently, we calculated the correlation coefficients between the differences in integrated gradients and the differences in power across each frequency band in the spectral map (Supplementary Figures 4B and 4C). A negative correlation was found in the 50-60 Hz range, while a positive correlation was observed in the 80-90 Hz range. These findings suggest that frequency-specific patterns of LFP activity in different conditions may be linked to the texture representations captured by the CNN model. We have included a discussion of these findings in [lines 463-468].

      (b) Quantify the "enhanced separability in darkness" more rigorously.

      (i) Report cluster-distances (e.g. Euclidean) for each individual animal.

      We thank the reviewer for this helpful comment. When calculating the silhouette score, we used Euclidean distance as the distance metric. The silhouette score is defined for each data point as the difference between the average distance to points within its assigned cluster and the average distance to points in the nearest other cluster, normalized by the larger of the two values. Thus, the silhouette score inherently reflects the relative cluster distances both within and across conditions for each individual animal. Because we report and statistically analyze silhouette scores (Figure 6C), these values already quantify and compare the Euclidean cluster distances across conditions at the animal level. For clarity, we have now added a definition of the silhouette score in the Methods section of the main text [lines 269-278]. We also included the calculated silhouette scores in Supplementary Table 1.

      (ii) Run a permutation or bootstrap test (shuffling darkness/light labels within animals) to obtain an empirical null distribution for cluster separability in the network embedding space.

      We thank the reviewer for this important suggestion. In response, we implemented a permutation test to assess the robustness of our cluster separability results. Specifically, we shuffled the darkness/light labels within each animal and recalculated silhouette scores across 1000 resamples to generate an empirical null distribution. The observed separability between light and dark conditions consistently exceeded the 95th percentile of the null distribution (Supplementary Figure 3). This confirms that the enhanced cluster separability in darkness was not attributable to random fluctuations in labeling but instead reflected a systematic reorganization of neural representations.

      (c) Control for possible visually-evoked potentials (VEPs).

      (i) Search the LFPs recorded in light for stereotyped VEP components and/or comment on this possible confound (i.e., VEPs in S1?).

      Thank you for raising this point. Although it would be interesting to observe if a VEP is present in the S1 of rats, this experiment did not involve cue-guided visual stimulation. Additionally, there was no environmental visual cue that could serve as an external trigger to align the LFPs for VEP analysis in S1. Furthermore, since even the somatosensory evoked potential was not clearly visible in the S1 LFP without averaging the aligned LFPs, it is unlikely that we would be able to observe VEPs in single trials.

      (d) Address behavioral and arousal confounds.

      (i) Provide example locomotor-speed traces (aligned with corresponding LFPs) and report locomotor-speed variance across the 10-min blocks.

      Thank you for your comment. We had speedometer installed for the recording of the last two rats. We have now provided example speed traces and the speed variance across blocks in Supplementary Figure 1. The traces show that the locomotor-speed was stable in each trial.

      (ii) If available from the camera recordings, include pupil diameter as a proxy for arousal; otherwise, discuss explicitly how arousal changes might affect S1 LFPs.

      Thank you for this suggestion. We strongly agree that measuring pupil diameters should be incorporated into future studies. However, because our camera did not have sufficient resolution to capture pupil diameters, we have addressed this limitation in the discussion section [lines 525-537].

      (e) Address order effects (and motivation/satiety confounds)

      (i) Present at least a subset of sessions in which the dark block precedes the light block; re-analyze the silhouette score/discriminability with block order as a factor.

      Thank you for this helpful suggestion. We conducted additional analyses using sessions from 6 rats in which the dark block preceded the light block (Supplementary Figure 5A). Using the same model architecture, we calculated the silhouette score for each rat (Supplementary Figure 5B). However, when the order was reversed (dark preceding light), this discriminability effect disappeared. Thus, while we observed a trend toward higher scores in the dark condition, no statistically significant differences in texture discriminability were observed.

      If trial order alone accounted for the increase in discriminability, reversing the order would be expected to yield higher silhouette scores in the light condition. Our findings suggest that factors related to order (e.g., thirst or motivation, as you proposed) are not the sole contributors. Furthermore, previous studies in human participants have shown that brief blindfolding can produce lingering increases in tactile sensitivity, indicating a lasting effect of visual deprivation. Thus, the absence of significant differences in texture representation when the dark condition preceded the light condition may reflect such lasting effects. We have included a discussion in [lines 441-452].

      (ii) Discuss explicitly the potential confounding effect of motivational state/thirst.

      We appreciate the reviewer’s insightful comment. In the revised manuscript, we now explicitly address the potential confounding role of motivational state and thirst in shaping our results. Because animals were water-restricted to maintain task engagement, it is possible that increasing thirst or fluctuating motivation over the course of a session could alter arousal or attentional state, thereby influencing neural separability. However, when the trial order was reversed (dark condition preceding light), silhouette scores did not show a significant increase in the second (light) trial. Thus, while we acknowledge that motivational state may contribute to trial-to-trial variability, the systematic increase in separability during darkness cannot be fully explained by thirst or motivational confounds. This addition has been incorporated into the discussion section [lines 441-452].

      (f) Alignment control and the role of forelimb S1.

      (i) Repeat the decoding analysis with LFPs aligned to hind-limb strike; report whether the fore-limb dominance persists.

      Thank you for your thoughtful suggestion. We appreciate the opportunity to clarify. Our study was designed to ask a different question: how the absence of visual input reorganizes tactile encoding for the body part that actually initiates texture contact in our paradigm (the forepaw). Accordingly, all analyses were aligned to forelimb strike and our array intentionally oversampled S1-forelimb relative to S1-hindlimb (18 vs. 14 electrodes; Fig. 1F–G), yielding clear topographic forelimb-locked event-related responses (Fig. 3B–D) and forelimb-channel dominance in the decoding explainability analyses (Fig. 5D–E). Repeating the full decoding locked to hind-limb strike would test a different hypothesis and would be difficult to interpret for three reasons:

      Design/measurement alignment. Our kinematic detection was built to identify forelimb foot strikes. Extending the detector to hindlimb would require new model training/validation and introduces uncertainty in the exact contact timing relative to the LFP segments we analyze.

      Sampling asymmetry. The array and cortical magnification are not balanced across subregions (18 forelimb vs. 14 hindlimb electrodes; Fig. 1G), so a hind-limb–aligned comparison would be confounded by unequal coverage and signal-to-noise across S1 subdivisions rather than reflecting true “dominance.”

      Scope of the claim. We do not claim that the forelimb is globally more informative about texture; we show the intuitive and topographically specific result that “forelimb S1 codes textures touching the forelimb,” and that these representations become more separable in darkness (silhouette increase; Fig. 5C). A hind-limb–locked re-analysis would likely reveal hindlimb contributions when the hindpaw is the alignment event — but that would not change the central conclusion about darkness enhancing tactile representational separability.

      To address the underlying concern about generality without introducing the above confounds, we have clarified these design choices and limitations in the revised Methods [lines 194-197].

      (g) Amplitude-based baseline.

      (i) Show that a simple linear discriminant or logistic-regression model on peak amplitudes (and/or other simple features like trough width/slope) cannot reach the CNN's accuracy. This kind of "baseline" analysis could also be useful to pinpoint the discriminative features learned by the CNN.

      Thank you for your insightful suggestion. We agree that performing a baseline comparison with a simpler model could help highlight the advantage of using a CNN. However, in our dataset, individual LFP traces do not exhibit clear peaks or well-defined features such as peak amplitude, width, or energy, which makes feature extraction using traditional methods like linear discriminants or logistic regression challenging.

      To address this, we performed principal component analysis (PCA) on the raw LFP traces to reduce the dimensionality and applied a support vector machine (SVM) classifier on the reduced features, in line with the approach used for the CNN models (Supplementary Figure 2A). The results of this analysis, demonstrate that the SVM model struggles to effectively discriminate between conditions, further reinforcing the necessity of the CNN model. The CNN’s ability to automatically learn complex features from the raw LFP data appears to be a crucial factor in achieving superior classification performance (Supplementary Figure 2B).

      (h) Cross-validation and inter-animal generalization.

      (i) Consider replacing the single 80/20 split with k-fold cross-validation within animals.

      Thank you for this suggestion. Instead of using an 80/20 split, we performed 5-fold cross-validation on all rats. The silhouette scores were averaged within each animal across the five folds, and Figure 6C was updated accordingly. After performing a paired t-test, we still observed a significant difference in silhouette scores between the light and dark conditions.

      (ii) Comment on inter-animal generalization.

      Thank you for this valuable feedback. Although we did not explicitly test inter-animal generalization, it is unlikely that a model trained on data from one rat would perform equally well when classifying data recorded from another animal. This limitation arises from two main factors. First, despite careful efforts to implant electrodes in the same brain region and cortical layer across experiments, it is impossible to align all 32 electrodes to identical coordinates. Consequently, the recorded LFPs are obtained from slightly different locations, which may reflect distinct neural processing. Second, even within the same species, individual animals differ in brain size and neural circuit organization. Thus, even if electrodes could be placed at identical anatomical locations, inter-individual variability in brain structure would still lead to differences in the recorded signals. Because deep learning models are often sensitive to small perturbations in their input data, we believe that robust inter-animal generalization is unlikely without fine-tuning the model using data from the target animal. This comment has been inserted in the Discussion [lines 494-507].

      (2) Writing, figure and terminology improvements (minor):

      (a) Figure 5F-G axis label. Decide on either "attribution score" or "activation amplitude" and use that term consistently in panels, legend, and text (currently, I believe it could be confused with raw signal amplitude).

      We have unified the terminology to "attribution score" and applied this consistently across the panels, legend, and text.

      (b) Throughout the manuscript, use "population-level activity" or "average population dynamics" when discussing LFPs (I believe it is more correct to reserve "population code" for multiple single-unit datasets).

      We agree with the reviewer’s point and have adapted the term "population dynamics" to describe LFP information consistently throughout the manuscript.

      (c) Lines 219-221, state down-sampling to 2 kHz, whereas line 289 mentions 10 kHz. Reconcile these numbers.

      We apologize for the confusion and thank the reviewer for thoroughly reading the manuscript. Our original sampling rate was 30 kHz, and all analyses were performed on data resampled to 10 kHz. The reference to 2 kHz was an error, and we have corrected it.

      (d) Specify the tail of each statistical test mentioned in the manuscript and any multiple-comparison correction used.

      We have specified the tail of each statistical test and any multiple-comparison corrections used in the "Data Analysis" section of the Methods.

      (e) Line 244: "variables (He et al., 2015)" → "variables (He et al., 2015)".

      We have corrected this formatting issue and revised it to "variables (He et al., 2015)".

      (f) Line 253: "one-dimentional" → "one-dimensional".

      We have corrected the spelling error and revised it to "one-dimensional".

      (3) Data and code sharing:

      (a) Consider depositing data and code for the analysis in public open repositories.

      Thank you for your suggestion. We have set up a public GitHub repository to share the code. Since the full dataset is quite large (~400GB), we have uploaded a smaller example dataset for the analysis.

    1. Reviewer #2 (Public review):

      Summary:

      An abundant literature documents molecular changes in the rodent hypothalamus that occur during the transition from prepubertal to mature reproductive physiology. Equally well documented is the role of sex steroids and their receptors during this important period of reproductive development, as well as the importance of GABAergic and glutamatergic neurons. The medial preoptic area (MPOA) is known to play a central role in expression of sexually dimorphic reproductive function and previously reported sexually dimorphic patterns of gene expression are consistent with this role. The present manuscript extends this knowledge base and reports the results of a detailed evaluation of transcriptional dynamics in the MPOA during the adolescent transition to maturity with a particular focus on the role of the estrogen receptor gene (Esr1). Both single cell RNA sequencing (scRNseq) and multiplex in situ hybridization methods were employed and the results subjected to detailed computational analyses to demonstrate that the transcriptomic structure of MPOA neurons displays both sex and cell type specific expression profiles. In addition, both hormonal and genetic manipulations of Esr1 signaling during puberty altered the transcriptional profiles of MPOA neurons, and these changes aligned with maturation of hormone-dependent reproductive function. The authors provide this evidence to illustrate Esr1-dependent control of gene regulatory networks required for normal expression of reproductive behaviors expressed during the transition from adolescence to adulthood. The results presented in this manuscript are extensive and represent the most comprehensive evaluation of transcriptomic changes during reproductive maturation to date. The methods appear strong and the results provide a rich data set that will support a good deal of future analysis.

      Strengths:

      (1) The major strength of this manuscript is the extensive set of images and graphs that illustrate molecular changes that occur in MPOA neurons during adolescence, although additional spatial detail as to locations of the source neurons would be welcome in order to place the changes in the proper circuitry context.

      (2) Targeting Esr1 deletion to MPOA GABA neurons is a good choice, given how these cells have been implicated in sexual differentiation of reproductive behavior previously, and the lack of comparable responses in glutamatergic neurons is convincing. The AAV-frtFlex-Cre virus created by the investigators is a most useful tool for such studies. Profiling distinct transcriptomic trajectories in GABA and glutamatergic neurons during reproductive maturation is impressive and leads to some of the best supported conclusions in this paper.

      (3) Cellular and molecular resolution of the transcriptomics data appears excellent, however, because the source tissue for the scRNAseq analysis was obtained by bulk dissection of the MPOA anatomical resolution is limited. This problem is addressed to some extent by careful comparison of scRNAseq results with previously published spatial transcriptomics data. The HM-HCR-FISH analysis clearly documents spatially restricted changes in gene expression, but it is hard to discern where these changes occur based on the images presented or the descriptions included in the Results. The anatomical schematic included in Figure 4 suggests that investigators are not familiar with components of the MPOA (see Allen Mouse Brain Atlas).

      Weaknesses:

      (1) A major conceptual flaw is that the authors do not distinguish between genetically determined sex differences in patterns of gene expression and differences caused by the fact that MPOA neurons are exposed to different endocrine environments in adolescent males and females, which can cause different transcriptional trajectories independent of genetic sex. This issue does not render their results invalid, but their terminology should address the issue in the discussion and "limitations" section. At the very least the endocrine status of "intact females" should be included.

      (2) A major technical flaw is that the MPOA is treated as a functionally distinct brain region (block dissections) with uniform distribution of cell types (FISH data are not illustrated or reported with sufficient spatial detail). Thus, an enormous amount of molecular data is provided that cannot be mapped to distinct neural circuits, thereby limiting the neurobiological impact. This is also a weakness of the FISH data, which is presented with only small regions illustrated without anatomical detail. In fact, some images are compared that appear to illustrate different MPOA structures, although it is impossible to be certain of this due to the lack of morphological landmarks. The analysis of how Esr1 orchestrates regulatory gene networks is impressive and interesting, but the fact that many of the observed transcriptional events occur in neural circuits that do not overlap confounds interpretation.

      (3) The locations of the AAV injections should be characterized because deleting Esr1 in multiple distinct parts of the MPOA will likely confound interpretation. This is especially problematic given the limited number of mice used for parts of the RNAscope analysis.

      (4) Although the focus of these experiments on adolescence is welcome, neither the Introduction nor the Discussion do a good job of placing these studies in the context of what is already known about brain maturation during puberty. It is true that this is very much a results-focused manuscript, but the scholarship can be improved. Simply stating that your results are consistent with previous reports places an undue burden on the reader to go figure out what is new.

      (5) Throughout the manuscript, the authors utilize obscure abbreviations, which often makes reading their text overly cumbersome. This is certainly justified in certain instances where complex names of analytical methods are used repeatedly, but the authors are encouraged to try and simply their use of non-standard abbreviations.

      Comments on revisions:

      The authors have considered issues raised during the initial review. Although there do not appear to be significant changes to analyses, figures or conclusions, the authors have added important revisions listing limitations in study design and methodology that impact interpretation.

    2. Author Response:

      The following is the authors’ response to the original reviews.

      Public review:

      Reviewer #1 (Public review):

      Weaknesses:

      Two minor comments

      (1) Fig 4 (hormone treatment): In this experiment, testosterone is given to males, yet in Sup Fig 6 it is argued that Esr1 is more influential in driving transcriptional changes compared to AR. Does DHT treatment have the same outcome as testosterone? Or, does estrogen treatment in males have the same outcome as testosterone?

      We agree that to distinguish AR and Esr1 activation by testosterone and converted estrogen respectively is a limitation in our study. We added discussion in the “limitation of the study” section.

      Although HM-HCR experiments showed the bidirectional control of transcriptional progression during adolescence, it is unclear if the facilitation in male by testosterone supplement is via activation of AR or Esr1 or both because testosterone will likely be converted to estrogen in the brain. Future studies using dihydrotestosterone (DHT) and estrogen to males may address this issue.

      (2) Fig 3i: There appears to be an age-dependent transcriptional change in male Vgat HR-low cells. Can the authors comment on age-dependent (hormone-independent) transcriptional changes in males versus females.

      We agree that it is important to clarify hormone dependent changes and age dependent changes. We added pair-wise DE results in Vgat HR low population in the main text. As consistent with trajectory analysis, the number of age-dependent genes were fewer than hormonally associated genes.

      “Pair-wise DEG analysis consistently showed that larger number of DEGs between P35 and P23 in Vgat+Esr1+ (male: 146 genes; female: 162 genes) than Vgat+ hormone R<sup>Low</sup> (male: 26 genes; female: 1 gene).”

      Reviewer #2 (Public review):

      Weaknesses:

      (1) A major conceptual flaw is that the authors do not distinguish between genetically determined sex differences in patterns of gene expression and differences caused by the fact that MPOA neurons are exposed to different endocrine environments in adolescent males and females, which can cause different transcriptional trajectories independent of genetic sex. This issue does not render their results invalid, but their terminology should address the issue in the discussion and "limitations" section. At the very least the endocrine status of "intact females" should be included.

      We agree that this was ideal if perinatal and pubertal dynamics are analyzed within the same study to distinguish these two processes. We added discussion in the “limitation section”.

      “2. Although we have identified hormone/Esr1 dependent transcriptional trajectories during adolescence, the relations and interplay with genetically determined perinatal event, which is earlier and robust, are unclear. Some sex differences during adolescence might be an extension of perinatally established sex differences while others might be unique adolescent changes.”

      (2) A major technical flaw is that the MPOA is treated as a functionally distinct brain region (block dissections) with uniform distribution of cell types (FISH data are not illustrated or reported with sufficient spatial detail). Thus, an enormous amount of molecular data is provided that cannot be mapped to distinct neural circuits, thereby limiting the neurobiological impact. This is also a weakness of the FISH data, which is presented with only small regions illustrated without anatomical detail. In fact, some images are compared that appear to illustrate different MPOA structures, although it is impossible to be certain of this due to the lack of morphological landmarks. The analysis of how Esr1 orchestrates regulatory gene networks is impressive and interesting, but the fact that many of the observed transcriptional events occur in neural circuits that do not overlap confounds interpretation.

      We agree that while MPOA is defined based on brain atlas consistently across samples, the boundary is somewhat less obvious compared to other nuclei (e.g. hippocampus, VHM etc). To minimize the contaminations from adjacent areas, we have restricted quantitative analysis to mostly Vgat+ Esr1+ population which are densely located within the MPOA but not in immediately adjacent areas, except posterior BNST which is readily distinguishable. We added clarification in the method as well as added technical limitation in the discussion below.

      Method

      “To disambiguate the MPOA and adjacent brain regions, quantitative analysis is restricted to Vgat+ Esr1+ neurons and is devoid of posterior BNST.”

      Discussion

      “3. While we have observed robust effect of Esr1-KO in scRNAseq experiment which was further validated with FISH experiment, it is possible that there are further heterogeneous Vgat-Esr1 populations in the MPOA which might be differentially targeted in each virally injected sample. To mitigate this, 3-4 mice were pooled for each sample in scRNAseq experiment and in HCR-FISH experiment, in addition to confirming recombinase RNA expression within the MPOA, we included samples with robust Esr1 deletion in the MPOA. Interestingly, due to the technical challenge, Esr1 deletion tends to be more robust than weakly detected recombinase RNA expression (data not shown).”

      (3) The locations of the AAV injections should be characterized because deleting Esr1 in multiple distinct parts of the MPOA will likely confound interpretation. This is especially problematic given the limited number of mice used for parts of the RNAscope analysis.

      We agree that similar to #2, this is an important matter. For HCR experiment, we only included animal with recombinase RNA (Cre or Flp) expression within MPOA. Although the recombinase expression was sufficient enough to qualitatively determine the hit or miss, the detection was weak and it was challenging to determine the extent of viral spread. Thus, we also used successful Esr1 deletion as an additional inclusion criteria for AAV-Cre-YFP group. We have added inclusion criteria in the method and technical consideration in discussion.

      Method

      “For HCR2, AAV was injected unilaterally so that successful targeting of the MPOA with AAVCre-YFP (detection of recombinase RNA within the MPOA) and the deletion of Esr1 were confirmed for inclusion of samples.”

      Discussion

      “3. While we have observed robust effect of Esr1-KO in scRNAseq experiment which was further validated with FISH experiment, it is possible that there are further heterogeneous Vgat-Esr1 populations in the MPOA which might be differentially targeted in each virally injected sample. To mitigate this, 3-4 mice were pooled for each sample in scRNAseq experiment and in HCR-FISH experiment, in addition to confirming recombinase RNA expression within the MPOA, we included samples with robust Esr1 deletion in the MPOA. Interestingly, due to the technical challenge, Esr1 deletion tends to be more robust than weakly detected recombinase RNA expression (data not shown).”

      (4) Although the focus of these experiments on adolescence is welcome, neither the Introduction nor the Discussion do a good job of placing these studies in the context of what is already known about brain maturation during puberty. It is true that this is very much a results focused manuscript, but the scholarship can be improved. Simply stating that your results are consistent with previous reports places an undue burden on the reader to go figure out what is new.

      We agree that contextualizing our study in the scholarship will clarify the novelty and impacts that this study provides to the community. We have updated the introduction adding a review highlighting puberty associated genomic studies in the brain, which are all bulk (brain region level) as well as the very first puberty scRNAseq study in Human testis.

      “Despite the well-established role of these hormones in shaping behavior, the molecular mechanisms underlying their influence on brain development during adolescence are still limited to brain-region level (bulk)[8]in humans and model organisms and adolescent transcriptional dynamics at single cell resolution in the brain remain poorly understood (but see a pioneering study in the human testis[9]).”

      (5) Throughout the manuscript the authors utilize obscure abbreviations, which often makes reading their text overly cumbersome. This is certainly justified in certain instances where complex names of analytical methods are used repeatedly, but the authors are encouraged to try and simplify their use of non-standard abbreviations.

      We agree that this is helpful for readers to have the reference of abbreviations in handy at single location. We added an “abbreviation” section as a reference for readers.

      Medial preoptic area (MPOA)

      Single-cell RNA sequencing (scRNAseq)

      Estrogen receptor 1 (Esr1)

      GABAergic neurons (Vgat+)

      Glutamatergic neurons (Vglut2+)

      Hybridized chain reaction fluorescent in situ hybridization (HCR-FISH)

      Gonadectomized (GDX)

      Partition-based graph abstraction (PAGA)

      Hormone-associated differentially expressed genes (HA-DEGs)

      Multiplexed error-robust fluorescence in situ hybridization (MERFISH) differential gene expression (DE)

      Differentially expressed genes (DEGs)

      Support vector machine (SVM)

      Manifold Enhancement Latent Dimension (MELD)

      Potential of Heat-diffusion for Affinity-based Trajectory Embedding (PHATE)

      Androgen receptor (AR)

      single-cell regulatory network inference (SCENIC)

      Reviewer #3 (Public review):

      We appreciate reviewer for the constructive comments to improve our manuscript.

      Weaknesses:

      We already know that Esr1 is important within GABAergic but not glutamatergic neurons for mating behavior. However, there is not enough data to support the claim that disrupting Esr1 in glutamatergic MPOA neurons "had no observable effect." The MPOA is involved in many behaviors and physiologies that were not investigated. More assays would be required to report "no observable effect."

      The small number of cells included in the transcriptional studies is a general concern, as noted by the authors. This is a particular concern for conclusions related to the role of adolescence in glutamatergic MPOA neurons. The paper reports 24,627 neurons across all treatment groups, which include 3 time points, 2 sexes, and GDX conditions. It seems likely that not much was detected in the glutamatergic neurons because of insufficient power.

      Esr1 knockout is initiated in adolescence, not restricted to adolescence. Do we know that the effects on mating behavior are due to what is happening in adolescence vs. the function of Esr1 in adults? Are the effects different if Esr1 is knocked out in mature adults? This comparison would be important to demonstrate that adolescence is a critical time window for Esr1 function.

      We agree that 1. the relatively mild effects observed in Glutamatergic neurons may be partially due to the scale of the study, and 2. Esr1 deletion is permanent once induced and it is challenging to distinguish adolescent and adult transcriptional dynamics using existing viral strategies.

      We added discussion in the “limitation” section.

      “4. While we have observed robust transcriptional progression in Vgat<sup>+</sup> Esr1<sup>+</sup> neurons during adolescence, we observed more mild alternations in VgluT2<sup>+</sup> neurons. Although the scale of our study is comparable or exceeds prior scRNAseq studies in MPOA[22,29], future larger studies may have more sensitivity to detect adolescent transcriptional dynamics in VgluT2<sup>+</sup> neurons.”

      “5. Although we demonstrated adolescent transcriptional changes were observed as early as P35, and either hormonal deprivation or Esr1 KO in prior to adolescence prevented the transcriptional progression (arrested transcriptional state even at adult), given the viral incubation time and permanent deletion of Esr1 after viral injection, it is challenging to disambiguate the role of Esr1 during adolescence and adult. Future studies injecting the virus at adult may provide additional insights on the similarity and difference between transcriptional changes during puberty and maintained transcriptional states at adult.”

    1. Reviewer #1 (Public review):

      Summary:

      Overall, this is an interesting and well-written manuscript on a fascinating question in a "charismatic" model system.

      Strengths:

      1) The Introduction is concise, though it might be helpful to the non-specialist reader to learn a bit more about what is known about the social control of somatic growth across diverse species (including humans), which would help to make this work more generally interesting.

      (2) The experiment is well-designed.

      (3) The data collected are comprehensive.

      (4) The complementary analysis of both feeding and aggression/submission data with and without known social roles is a neat idea and compelling!

      Weaknesses:

      (1) I was surprised that the HPA/stress axis was not considered here at all. Wouldn't we expect that subordinates have increased stress axis activation, which in turn could inhibit their growth and aggressive behavior?

      (2) To what extent are growth, food intake, agonistic behavior, and/or gene expression patterns coordinated across P1 vs P2 pairs? The lack of such an analysis seems like a missed opportunity.

      (3) What was the rationale for using whole bodies for the transcriptome analysis? Given the hypotheses, the forebrain or hypothalamus and certain other organ systems (e.g., liver, gonads, skin, etc.) would have been obvious candidate tissues here. I realize that cost is always a consideration, but maybe a focus on the fore-/midbrain could have been prioritized.

      (4) Given the preceding point, why was a fold-change threshold used for assessing DEGs (supplementary Figure 3)? There is no biological justification to ever use a fold-change threshold, especially in bulk RNA-seq analysis. This is particularly true here, where whole bodies were used for RNA-seq analysis, which is a bit unusual. Relatively small cell populations (such as hypothalamic neurons that regulate growth or food intake) may show substantial gene expression variation across social types, yet will be masked by the masses of other cells in the whole body sample. However, gene expression may still vary significantly, albeit the fold-difference may be small. I therefore suggest a reanalysis that omits any fold-change threshold.

      (5) Why is the analysis of color (hue, saturation) buried in the supplementary materials? Based on the hypotheses that motivated the study, color seems just as relevant as food intake, growth, and agonistic behavior, so even if the results are negative, they should be presented in the main paper.

      (6) The Discussion is sometimes difficult to follow. The authors may want to consider including a conceptual graphic that integrates the different aspects of growth and satiety regulation, etc., into a work-in-progress model of sorts, which would also facilitate clearer hypotheses for future research.

    2. Reviewer #2 (Public review):

      In this manuscript, the authors test growth, behavior, and gene expression in pairs of clownfish as they establish social dominance hierarchies, examining patterns of gene expression in these pairs after dominance has been established. The authors show solid evidence that emerging dominant clownfish show increased growth, aggression, and food consumption compared to their submissive or solitary counterparts, eventually adopting distinct gene expression profiles.

      Major Comments:

      (1) The Introduction is comprehensive, but it could be condensed. Likewise, the discussion could be condensed. There is considerable redundancy between the methods, the results, and the legend in Figure 1. The authors should consolidate and remove the redundancy.

      (2) For Figure 3, the authors are showing PC2 and PC3; why is PC1 not shown? There is so much overlap between the three groups in PC2 vs PC3; it seems unlikely that researchers could conclusively identify any individual as belonging to a group based on the expression profile. The ovals shown do not capture all the points within each of the groups, and particularly the grey S oval seems misaligned with the datapoints shown.

      (3) The authors indicate that the 15 replicates exhibiting the greatest size difference between P1 and P2 were selected for gene profiling. Does this mean that each of the P1 and P2 were pairs with each other? Have the authors tried examining the gene expression patterns in a paired manner? E.g., for the pairs that showed the greatest size differences, do they also show the greatest differences in gene expression? Do the P1s show the most extreme differences from P2s that also show the most extreme P2 differences? Perhaps lines on Figure 3A connecting datapoints from the P1 and P2 pairs would be informative.

      (4) For the specific target pathways that are up- and downregulated in the different backgrounds, I recommend that the authors include boxplots (or heatmaps) showing the actual expression values for these targets. Figure 6 shows a heatmap for appetite-related genes, and it would be great to see a similar graph for the metabolism and glycolysis genes; it would also be informative to see similar graphs for hormonal and sexual maturation pathways as well.

      (5) Particularly given that there is a relatively small number of genes enriched in the different rank conditions, I did not understand the need to do the WGCNA module analysis. I thought that an analysis of GO terms across the dataset would have been more meaningful than the GO term analysis shown in Figure 4, which considers only genes assigned to the "brown WGCNA module". This should be simplified or clarified.

      (6) The authors say that they have identified coordinated changes in behaviors and the "underlying gene expression, leading to the emergence" of social roles. This is a little bit misleading, since the gene expression analysis occurred well after the behavioral and phenotypic differences emerged. Presumably, the hormonal and genetic shifts that actually caused the behavioral and phenotypic difference occurred during the weeks during which the experiment was underway, and earlier capture of the transcriptome would presumably reveal different patterns, and ones that would be considered more causative. The authors acknowledge this in 434-435, but it could be emphasized further.

      (7) The authors have measured a number of differences between the different dominance classes of fish. All these differences were measured relative to the other classes, but in my view, the Solitary group was the closest to a baseline control. So I'm not sure that it is fair to say that "P2 and S individuals showed consistent downregulation of these genes and pathways" (line 401). I encourage the authors to emphasize the differences in gene expression from the "perspective" of the P1 individuals compared to the baseline of P2 and S individuals. Line 474 says that "P2 fish showed significant upregulation" of a number of pathways. It should be very clear what that is compared to (compared to P1, presumably?)

      (8) Along the same lines, the authors say in line 514 that subordinates and solitaries strategically downregulate their growth. I'm not convinced that this is the case: I would consider this growth trajectory to be the default and the baseline. I would interpret that under certain social conditions, a P1 dominant pattern of growth, behavior, and gene expression is allowed to emerge.

    3. Author Response:

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      Overall, this is an interesting and well-written manuscript on a fascinating question in a"charismatic" model system.

      Strengths:

      (1) The Introduction is concise, though it might be helpful to the non-specialist reader to learn a bit more about what is known about the social control of somatic growth across diverse species (including humans), which would help to make this work more generally interesting.

      (2) The experiment is well-designed.

      (3) The data collected are comprehensive.

      (4) The complementary analysis of both feeding and aggression/submission data with and without known social roles is a neat idea and compelling!

      Thank you for the positive feedback!

      Here, we investigate phenotypic plasticity associated with the adoption of social roles in the clown anemonefish, with strategic growth being just one aspect of that plasticity. Strategic growth, also known as social control of growth, is a fascinating form of adaptive phenotypic plasticity, whereby individuals modify their growth and size in response to fine-scale changes in social conditions (Buston & Clutton-Brock, 2022). In cooperative breeding systems with high reproductive skew, particularly fishes and mammals (possibly including humans), individuals have been shown to i) increase growth/size on the acquisition of dominant status (Dengler-Crish & Catania, 2007; Johnston et al., 2021; Thorley et al., 2018; Van Schaik & Van Hooff, 1996; Walker & McCormick, 2009), ii) increase growth/size when paired with size matched reproductive rivals (Huchard et al., 2016; Reed et al., 2019; this study), and iii) decrease growth/size to avoid conflict (Buston, 2003; Heg et al., 2004; Wong et al., 2007). While strategic growth is fascinating and clearly occurring in this study, we show coordinated changes of multiple aspects of the phenotype as fish adopt social roles. Therefore, we deliberately framed the Introduction broadly to avoid biasing the reader toward viewing growth as the sole or main driver.

      Weaknesses:

      (1) I was surprised that the HPA/stress axis was not considered here at all. Wouldn't we expect that subordinates have increased stress axis activation, which in turn could inhibit their growth and aggressive behavior?

      We also expected to see the HPA/stress axis activated in subordinates, which is why we carried out a targeted exploration of genes known to play a role in this axis. We did not find any genes that were significantly differentially expressed. We believe that there could be two explanations for this. First, from a methodological perspective, it could be due to our use of a whole-body RNA-seq, which may have masked this signal. Alternatively, the stress axis might play a more complex role than just acting as a simple on/off switch for reduced growth. Its activation may peak when competition over size is at its highest (during week one) or, conversely, it may peak later and help maintain reduced growth once hierarchies are firmly established (particularly after the dominant individual reaches its maximum size). To understand the role of the stress axis, future studies should observe how its activation varies over time. We acknowledge that the absence of a stress‑axis signal and its potential explanations were not clearly discussed in the original manuscript, in the revised version, we will address this issue.

      (2) To what extent are growth, food intake, agonistic behavior, and/or gene expression patterns coordinated across P1 vs P2 pairs? The lack of such an analysis seems like a missed opportunity.

      We had a similar thought. Specifically, we were interested in testing the hypothesis that the final size ratio of pairs, which is indicative of the amount of conflict remaining, would predict gene expression. We examined gene expression within pairs to test for coordinated changes and repeated the analysis, accounting for the pair size ratio. In both cases, we found no clear or consistent pattern within pairs. We will consider including these figures in the Supplementary Materials document.

      (3) What was the rationale for using whole bodies for the transcriptome analysis? Given the hypotheses, the forebrain or hypothalamus and certain other organ systems (e.g.,liver, gonads, skin, etc.) would have been obvious candidate tissues here. I realize that cost is always a consideration, but maybe a focus on the fore-/midbrain could have been prioritized.

      We decided to use whole-body samples for this initial transcriptomic analysis to capture a broad view of gene-expression differences while keeping sequencing costs and sample requirements manageable. We agree with the reviewer that future work should explore specific tissues sampled from individuals at multiple time points to disentangle transcriptomic differences across tissue types.

      (4) Given the preceding point, why was a fold-change threshold used for assessing DEGs (supplementary Figure 3)? There is no biological justification to ever use a fold-change threshold, especially in bulk RNA-seq analysis. This is particularly true here, where wholebodies were used for RNA-seq analysis, which is a bit unusual. Relatively small cell populations (such as hypothalamic neurons that regulate growth or food intake) may show substantial gene expression variation across social types, yet will be masked by the masses of other cells in the whole body sample. However, gene expression may still vary significantly, albeit the fold-difference may be small. I therefore suggest a reanalysis that omits any fold-change threshold.

      We thank the reviewer for this important point, and agree that an arbitrary fold‑change cutoff is inappropriate/unnecessary. It should be noted that this fold-change cut-off was only used in this single figure, and all other analyses used p-values from the entire dataset. We will remove the fold‑change threshold cutoff and correct Supplementary Figure 3, and any corresponding text.

      (5) Why is the analysis of color (hue, saturation) buried in the supplementary materials?Based on the hypotheses that motivated the study, color seems just as relevant as food intake, growth, and agonistic behavior, so even if the results are negative, they should be presented in the main paper.

      We agree that color can be an important social signal, so we included color measurements in our experimental design. However, after careful consideration of the color results, we decided that our experimental timing and husbandry changes introduced multiple confounding factors, preventing us from drawing confident conclusions. Specifically, our fish were ≈1 month old at the transfer from larval to experimental tanks and had already begun to deepen their orange hue, before our experiment. (In the wild, they would settle at two weeks of age, prior to the deepening of the orange hue). Once individuals attain a certain hue, it seems that color development can be halted, but not reversed. The transfer also involved changes in lighting, tank background, and diet, factors known to strongly affect coloration. Our results show a uniform shift in orange hue and saturation across social groups, suggesting that these confounding factors might have dominated changes in hue.

      For transparency, we report the color data in the Supplementary Materials, but we caution against drawing any strong conclusions. In the revised manuscript, we will recommend that future work include a targeted experiment to robustly test for the effect of the adoption of social roles on coloration or the effect of coloration on the adoption of social roles.

      (6) The Discussion is sometimes difficult to follow. The authors may want to consider including a conceptual graphic that integrates the different aspects of growth and satiety regulation, etc., into a work-in-progress model of sorts, which would also facilitate clearer hypotheses for future research.

      Thank you for flagging that parts of the Discussion are a bit difficult to follow. In the revised manuscript, we will work to improve readability of the Discussion. We also appreciate the suggestion of including a conceptual schematic. We will consider whether adding such a graphic will add value to this manuscript or future manuscripts.

      Reviewer #2 (Public review):

      In this manuscript, the authors test growth, behavior, and gene expression in pairs of clownfish as they establish social dominance hierarchies, examining patterns of gene expression in these pairs after dominance has been established. The authors show solid evidence that emerging dominant clownfish show increased growth, aggression, and food consumption compared to their submissive or solitary counterparts, eventually adopting distinct gene expression profiles.

      Major Comments:

      (1) The Introduction is comprehensive, but it could be condensed. Likewise, the discussion could be condensed. There is considerable redundancy between the methods, the results,and the legend in Figure 1. The authors should consolidate and remove the redundancy.

      Thank you for flagging that parts of the manuscript could be condensed, we will work on this as we revise the manuscript.

      (2) For Figure 3, the authors are showing PC2 and PC3; why is PC1 not shown? There is so much overlap between the three groups in PC2 vs PC3; it seems unlikely that researchers could conclusively identify any individual as belonging to a group based on the expression profile. The ovals shown do not capture all the points within each of the groups, and particularly the grey S oval seems misaligned with the datapoints shown.

      We understand the concern raised by the reviewer about the overlap among points in the PCA. We have explored PC1-PC3 and found that PC2 and PC3 showed the clearest, statistically significant clustering by social position, while PC1 did not capture any variation due to social position. We have explored whether other factors might be masking differences, such as genetic relatedness, tank effects, total read count per sample, and found that none of these factors explained sample clustering. Regarding the ellipses shown around the points, they were not intended to capture all points, but rather they show the estimated 95% multivariate t-distribution for that given social group. We will make sure this is clearly explained in the figure legend, and Methods section. In addition, in the revised version, we will show PC1 and PC2, and PC1 and PC3, in the Supplements for transparency.

      (3) The authors indicate that the 15 replicates exhibiting the greatest size difference between P1 and P2 were selected for gene profiling. Does this mean that each of the P1and P2 were pairs with each other? Have the authors tried examining the gene expression patterns in a paired manner? E.g., for the pairs that showed the greatest size differences,do they also show the greatest differences in gene expression? Do the P1s show the most extreme differences from P2s that also show the most extreme P2 differences? Perhaps lines on Figure 3A connecting datapoints from the P1 and P2 pairs would be informative.

      Yes, “15 replicates exhibiting the greatest size difference between P1 and P2 were selected for gene profiling” refers to pairs of P1 and P2, we will make sure this is clearly stated in the revised Methods. Yes, we have explored gene expression data considering the size difference between pairs, and found that it showed no clear differences in gene expression patterns (see earlier response to Reviewer #1). We will consider including these figures in the Supplementary Materials document, as well as adding a version of Figure 3A that clearly shows information on pairs, as suggested by the reviewer.

      (4) For the specific target pathways that are up- and downregulated in the different backgrounds, I recommend that the authors include boxplots (or heatmaps) showing the actual expression values for these targets. Figure 6 shows a heatmap for appetite-related genes, and it would be great to see a similar graph for the metabolism and glycolytic genes; it would also be informative to see similar graphs for hormonal and sexual maturation pathways as well.

      We have explored genes across a broad set of metabolic pathways (glycolysis, TCA cycle, lactic fermentation, PDH complex, cholesterol biosynthesis, fatty-acid synthesis, and beta-oxidation) and show all metabolic genes that showed significant differential expression between P1, P2, and S in Figure 6. Overall, very few metabolism-associated genes were significantly differentially expressed, which is why we decided to combine appetite-regulation and metabolism-associated genes into a single figure (Figure 6). In the revised version, we will ensure that Figure 6 clearly shows the gene sets associated with appetite and metabolism.

      We also examined hormonal pathways (glucocorticoid and thyroid signaling), but did not find genes in these pathways that were significantly differentially expressed. Finally, we would like to clarify that our samples consist of two-month-old juvenile individuals that are sexually immature —under ideal conditions, clown anemonefish can mature in one to two years, but they can also remain sexually immature for a decade or more (Buston & García, 2007) — which is why we did not observe distinct molecular signatures of sexual maturation. We recognize that the sentence at line 520 may be misleading, as we did not identify any gene expression signature that we could confidently associate with signs of sexual maturation. We will make sure that these are clearly stated in the revised version of the manuscript.

      (5) Particularly given that there is a relatively small number of genes enriched in the different rank conditions, I did not understand the need to do the WGCNA module analysis. I thought that an analysis of GO terms across the dataset would have been more meaningful than the GO term analysis shown in Figure 4, which considers only genes assigned to the "brown WGCNA module". This should be simplified or clarified.

      To clarify, GO enrichment analysis does not establish correlations with traits, it only describes which functions or pathways are over-represented in a given gene set. That is why we began by using WGCNA to define gene sets (modules) that are correlated to phenotypes. Our primary rationale for WGCNA was to identify modules of co-expressed genes that show significant statistical correlation with the phenotypes of interest (social role: P1, P2, S; growth; and food intake). Pairwise differential expression analysis (Figure 3B) identified a few hundred significantly differentially expressed genes, but those tests treat genes independently and are not able to help us link coordinated changes of co-expressed genes to phenotypes of interest. Because WGCNA is blind to traits, it first identifies groups of co-expressed genes, which can help resolve gene expression patterns.

      We therefore ran WGCNA on the rlog-transformed dataset to identify modules of co-expressed genes that show significant correlation with phenotypes of interests. For every module that showed such a correlation, we performed GO enrichment and carefully evaluated the resulting GO enrichment trees (see Supplementary Figs. 4–5). The brown module was highlighted in the main text because it was one of the modules with a significant correlation to growth, and its associated GO enrichment showed clear growth-related signals that were not identified in the pairwise differential expression analysis results.

      (6) The authors say that they have identified coordinated changes in behaviors and the"underlying gene expression, leading to the emergence" of social roles. This is a little bit misleading, since the gene expression analysis occurred well after the behavioral and phenotypic differences emerged. Presumably, the hormonal and genetic shifts that actually caused the behavioral and phenotypic difference occurred during the weeks during which the experiment was underway, and earlier capture of the transcriptome would presumably reveal different patterns, and ones that would be considered more causative.The authors acknowledge this in 434-435, but it could be emphasized further.

      We appreciate the reviewer raising this point. In the updated version of the manuscript, we will revise wording to convey that food intake, agonistic behavior, size and growth, and gene expression are all changing continuously, in response to each other and in response to social feedback. An underappreciated aspect of this system (and likely many other systems) is that phenotype (including transcriptome) influences the outcome of social interactions, and the outcome of social interactions influences the phenotype (including the transcriptome). Earlier capture of the transcriptome would reveal different levels of gene expression, reflecting the state of the system at that moment in time.

      (7) The authors have measured a number of differences between the different dominance classes of fish. All these differences were measured relative to the other classes, but in my view, the Solitary group was the closest to a baseline control. So I'm not sure that it is fair to say that "P2 and S individuals showed consistent downregulation of these genes and pathways" (line 401). I encourage the authors to emphasize the differences in gene expression from the "perspective" of the P1 individuals compared to the baseline of P2and S individuals. Line 474 says that "P2 fish showed significant upregulation" of a number of pathways. It should be very clear what that is compared to (compared to P1, presumably?)

      We agree with the reviewer that solitary individuals are the most intuitive baseline. Indeed, the experimental design included solitary fish because we expected they would serve as a useful control. Without social restraint, we anticipated they would show unrestricted growth, feeding, behavior, and associated gene‑expression patterns, similar to dominants.

      We initially ran analyses using solitaries as the baseline, but after examining the results, which showed subordinate‑like characteristics for the solitary individuals, we concluded that solitary individuals are not an ecologically appropriate control for this context. Removing juveniles from a social context and housing them in isolation may be stressful and can affect physiology and behavior in ways that do not reflect a natural baseline. From a life‑history standpoint, solitary living is not the typical state for A. percula.

      For these reasons, we reanalysed the dataset using the dominant (P1) as the reference to enable more ecologically meaningful comparisons (this choice was somewhat arbitrary, subordinates could also have been used as the reference). Given that gene expression is relative, we interpret results from both the dominant (P1) and subordinate (P2) perspectives in the Discussion to provide a complete view. We will clarify wording throughout the manuscript to make it clear that everything is relative (e.g., revising Line 474).

      (8) Along the same lines, the authors say in line 514 that subordinates and solitaries strategically downregulate their growth. I'm not convinced that this is the case: I would consider this growth trajectory to be the default and the baseline. I would interpret that under certain social conditions, a P1 dominant pattern of growth, behavior, and gene expression is allowed to emerge.

      We respectfully disagree with the idea that a single baseline/reference growth trajectory exists for any individual of this species. Growth of individuals is entirely social context-dependent: neither fast nor slow growth represents an inherent baseline. When two size‑matched juveniles meet and compete to establish dominance, accelerated growth is the expected trajectory. By contrast, juveniles joining an existing hierarchy are expected to exhibit reduced growth, which minimizes conflict and facilitates their social integration. Unlike species that show non socially mediated growth trajectories, clown anemonefish do not have a context‑independent growth rate, rather, individuals constantly readjust their growth according to their immediate social environment.

      Therefore, growth trajectories must be considered from the perspective of all group members, because they emerge from interactions among individuals rather than reflecting an intrinsic baseline. In this study, we were interested in the establishment of dominance hierarchy and how individuals adjust their phenotypes during this process. By experimentally pairing size‑matched rivals, both individuals are initially expected to pursue the dominant trajectory, and thus neither individual represents a default state. Instead, the outcome reflects a social decision, after which both individuals reinforce their emerging social roles through coordinated changes.

      Reviewer #3 (Public review):

      Summary:

      The authors tested the hypothesis that interactions among size- and age-matched rivals will lead to the emergence of social roles, accompanied by divergence in four aspects of individual phenotypes: growth, feeding behavior, fighting behaviors, and gene expression in clownfish.

      Strengths:

      The data on growth, feeding rate, and fighting behaviors support the authors' claims.

      Thank you for the positive feedback!

      Weaknesses:

      Gene analysis conducted in this study is not sufficient to clarify how the relevant genes actually regulate growth and behavior.

      The information obtained from whole-body gene expression analysis is very limited.Various gene expression is associated with the regulation of fighting behaviors, food intake, growth, and metabolism, and these genes are regulated differently across tissues,even within a single individual. Gene expression analysis should be performed separately for each tissue.

      We understand the reviewer’s concern about whole‑body transcriptomes and agree that tissue‑specific sampling would provide greater resolution of the mechanisms linking gene expression to growth, agonistic behaviors, and food intake. For this initial study, however, we deliberately chose whole‑body samples to capture a broad, unbiased view of gene expression differences while keeping sequencing costs and sample requirements manageable. We explicitly acknowledge the resulting interpretational limits in the Discussion (lines 464; 529–533), and suggest in the last paragraph that the patterns reported here should be used to build on in future studies exploring targeted, tissue‑specific hypotheses.

      Clownfish undergo sex change depending on social status and body size, as the authors mention in the manuscript. Numerous gene expressions are affected by sex change. It is unclear how this issue was addressed.

      We thank the reviewer for raising this point. Sex change and sexual maturation can indeed drive major transcriptional shifts in clown anemonefish, but our experiment did not encompass such a life‑history transition. All individuals in this experiment were juveniles (≈1 month old at the start, ≈2 months old at the end) and were sexually immature at these ages. Clown anemonefish reach sexual maturation around one to two years under ideal conditions, can delay sexual maturation for years under normal conditions (Buston & García, 2007), and sex change in the genus Amphiprion is known to take over ~5 months (Moyer & Nakazono, 1978). Accordingly, individuals in this study were not sexually mature, and sex change was not biologically plausible over the five-week experimental period of our study. We recognize that the sentence at line 520 may be misleading, as we did not identify any gene expression signature that we could confidently associate with signs of sexual maturation. We will make sure that it is clearly stated that the fish in this study were sexually immature in the revised version.

      References:

      Buston, P. (2003). Forcible eviction and prevention of recruitment in the clown anemonefish. Behavioral Ecology, 14(4), 576–582. https://doi.org/10.1093/beheco/arg036

      Buston, P. M., & García, M. B. (2007). An extraordinary life span estimate for the clown anemonefish Amphiprion percula. Journal of Fish Biology, 70(6), 1710–1719. https://doi.org/10.1111/j.1095-8649.2007.01445.x

      Buston, P., & Clutton-Brock, Tim. (2022). Strategic growth in social vertebrates (WITH REVIEWER COMMENTS). Trends in Ecology & Evolution, 37(8), 694–705. https://doi.org/10.1016/j.tree.2022.03.010

      Dengler-Crish, C. M., & Catania, K. C. (2007). Phenotypic plasticity in female naked mole-rats after removal from reproductive suppression. THE JOURNAL OF EXPERIMENTAL BIOLOGY.

      Heg, D, Bender, N, & Hamilton, I. (2004). Strategic growth decisions in helper cichlids. Proceedings of the Royal Society of London. Series B: Biological Sciences, 271(suppl_6). https://doi.org/10.1098/rsbl.2004.0232

      Huchard, E, English, S, Bell, M B. V., Thavarajah, N, & Clutton-Brock, T. (2016). Competitive growth in a cooperative mammal. Nature, 533(7604), 532–534. https://doi.org/10.1038/nature17986

      Johnston, R A., Vullioud, P, Thorley, J, Kirveslahti, H., Shen, L., Mukherjee, S., Karner, C. M., Clutton-Brock, T, & Tung, J (2021). Morphological and genomic shifts in mole-rat ‘queens’ increase fecundity but reduce skeletal integrity. eLife, 10, e65760. https://doi.org/10.7554/eLife.65760

      Moyer, J. T., & Nakazono, A. (1978). Protandrous Hermaphroditism in Six Species of the Anemonefish Genus Amphiprion in Japan (No. 2). The Ichthyological Society of Japan. https://doi.org/10.11369/jji1950.25.101

      Reed, C., Branconi, R., Majoris, J., Johnson, C., & Buston, P. (2019). Competitive growth in a social fish. Biology Letters, 15(2), 20180737. https://doi.org/10.1098/rsbl.2018.0737

      Thorley, J, Katlein, N, Goddard, K, Zöttl, M, & Clutton-Brock, T. (2018). Reproduction triggers adaptive increases in body size in female mole-rats. Proceedings of the Royal Society B: Biological Sciences, 285(1880), 20180897. https://doi.org/10.1098/rspb.2018.0897

      Van Schaik, C P., & Van Hooff, J A. R. A. M. (1996). Toward an understanding of the orangutan’s social system. In Linda F. Marchant, Toshisada Nishida, & William C. McGrew (Eds.), Great Ape Societies (pp. 3–15). Cambridge University Press. https://doi.org/10.1017/CBO9780511752414.003

      Walker, S P. W., & McCormick, M I. (2009). Sexual selection explains sex-specific growth plasticity and positive allometry for sexual size dimorphism in a reef fish. Proceedings of the Royal Society B: Biological Sciences, 276(1671), 3335–3343. https://doi.org/10.1098/rspb.2009.0767

      Wong, M. Y. L., Buston, P. M., Munday, Philip L., & Jones, Geoffrey P. (2007). The threat of punishment enforces peaceful cooperation and stabilizes queues in a coral-reef fish. Proceedings of the Royal Society B: Biological Sciences, 274(1613), 1093–1099. https://doi.org/10.1098/rspb.2006.0284

    1. Author response:

      The following is the authors’ response to the previous reviews

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      In this paper, the authors investigate the effects of Miro1 on VSMC biology after injury. Using conditional knockout animals, they provide the important observation that Miro1 is required for neointima formation. They also confirm that Miro1 is expressed in human coronary arteries. Specifically, in conditions of coronary diseases, it is localized in both media and neointima and, in atherosclerotic plaque, Miro1 is expressed in proliferating cells.

      However, the role of Miro1 in VSMC in CV diseases is poorly studied and the data available are limited; therefore, the authors decided to deepen this aspect. The evidence that Miro-/- VSMCs show impaired proliferation and an arrest in S phase is solid and further sustained by restoring Miro1 to control levels, normalizing proliferation. Miro1 also affects mitochondrial distribution, which is strikingly changed after Miro1 deletion. Both effects are associated with impaired energy metabolism due to the ability of Miro1 to participate in MICOS/MIB complex assembly, influencing mitochondrial cristae folding. Interestingly, the authors also show the interaction of Miro1 with NDUFA9, globally affecting super complex 2 assembly and complex I activity.<br /> Finally, these important findings also apply to human cells and can be partially replicated using a pharmacological approach, proposing Miro1 as a target for vasoproliferative diseases.

      Strengths:

      The discovery of Miro1 relevance in neointima information is compelling, as well as the evidence in VSMC that MIRO1 loss impairs mitochondrial cristae formation, expanding observations previously obtained in embryonic fibroblasts.

      The identification of MIRO1 interaction with NDUFA9 is novel and adds value to this paper. Similarly, the findings that VSMC proliferation requires mitochondrial ATP support the new idea that these cells do not rely mostly on glycolysis.

      The revised manuscript includes additional data supporting mitochondrial bioenergetic impairment in MIRO1 knockout VSMCs. Measurements of oxygen consumption rate (OCR), along with Complex I (ETC-CI) and Complex V activity, have been added and analyzed across multiple experimental conditions. Collectively, these findings provide a more comprehensive characterization of the mitochondrial functional state. Following revision, the association between MIRO1 deficiency and impaired Complex I activity is more robust.

      Although the precise molecular mechanism of action remains to be fully elucidated, in this updated version, experiments using a MIRO1 reducing agent are presented with improved clarity

      Although some limitations remain, the authors have addressed nearly all the concerns raised, and the manuscript has substantially improved

      Weaknesses:

      Figure 6: The authors do not address the concern regarding the cristae shape; however, characterization of the cristae phenotype with MIRO1 ΔTM would have strengthened the mechanistic link between MIRO1 and the MIB/MICOS complex

      Although the authors clarified their reasoning, they did not explore in vivo validation of key biochemical findings, which represents a limitation of the current study. While their justification is acknowledged, at least a preliminary exploratory effort could have been evaluated to reinforce the translational relevance of the study.

      Finally, in line with the explanations outlined in the rebuttal, the Discussion section should mention the limits of MIRO1 reducer treatment.

      Reviewer #2 (Public review):

      Summary:

      This study identifies the outer‑mitochondrial GTPase MIRO1 as a central regulator of vascular smooth muscle cell (VSMC) proliferation and neointima formation after carotid injury in vivo and PDGF-stimulation ex vivo. Using smooth muscle-specific knockout male mice, complementary in vitro murine and human VSMC cell models, and analyses of mitochondrial positioning, cristae architecture and respirometry, the authors provide solid evidence that MIRO1 couples mitochondrial motility with ATP production to meet the energetic demands of the G1/S cell cycle transition. However, a component of the metabolic analyses are suboptimal and would benefit from more robust methodologies. The work is valuable because it links mitochondrial dynamics to vascular remodelling and suggests MIRO1 as a therapeutic target for vasoproliferative diseases, although whether pharmacological targeting of MIRO1 in vivo can effectively reduce neointima after carotid injury has not been explored. This paper will be of interest to those working on VSMCs and mitochondrial biology.

      Strengths:

      The strength of the study lies in its comprehensive approach assessing the role of MIRO1 in VSMC proliferation in vivo, ex vivo and importantly in human cells. The subject provides mechanistic links between MIRO1-mediated regulation of mitochondrial mobility and optimal respiratory chain function to cell cycle progression and proliferation. Finally, the findings are potentially clinically relevant given the presence of MIRO1 in human atherosclerotic plaques and the available small molecule MIRO1.

      Weaknesses:

      (1) High-resolution respirometry (Oroboros) to determine mitochondrial ETC activity in permeabilized VSMCs would be informative.

      (2) Therapeutic targeting of MIRO1 failed to prevent neointima formation, however, the technical difficulties of such an experiment is appreciated.

      Reviewer #3 (Public review):

      Summary:

      This study addresses the role of MIRO1 in vascular smooth muscle cell proliferation, proposing a link between MIRO1 loss and altered growth due to disrupted mitochondrial dynamics and function. While the findings are useful for understanding the importance of mitochondrial positioning and function in this specific cell type, the main bioenergetic and mechanistic claims are not strongly supported.

      Strengths:

      This study focuses on an important regulatory protein, MIRO1, and its role in vascular smooth muscle cell (VSMC) proliferation, a relatively underexplored context.

      This study explores the link between smooth muscle cell growth, mitochondrial dynamics, and bioenergetics, which is a significant area for both basic and translational biology.

      The use of both in vivo and in vitro systems provides a useful experimental framework to interrogate MIRO1 function in this context.

      Weaknesses:

      The proposed link between MIRO1 and respiratory supercomplex biogenesis or function is not clearly defined.

      Completeness and integration of mitochondrial assays is marginal, undermining the strength of the conclusions regarding oxidative phosphorylation.

      We thank the reviewers for their thoughtful and constructive feedback. We appreciate their recognition of our work’s value and the improvements made in this revised version.

      We are particularly grateful to Reviewer 3 for their detailed and insightful comments, which identified errors we (and other reviewers) had unfortunately overlooked. To address these concerns and ensure the manuscript meets the high standards of clarity and rigor we aim for, we have made additional corrections and refinements.

      As part of this process, we conducted a thorough review of the original source files. This was especially important given that the project spanned from 2018 to 2025, and many co-authors have since left their previous positions.

      We appreciate the opportunity to resubmit this manuscript and are confident that these updates fully address the concerns raised by the reviewer and the editorial team.

      Reviewer #3 (Recommendations for the authors):

      (1) I still do not see the data in WB 2G reflecting the quantification in 2H and 2I. Moreover, the authors state they performed 1 additional experiment, but it appears not to have been included in the analysis of 2H and 2I since the graphs remained the same from the last version of the manuscript.

      We apologize for this oversight. The additional experiment has now been incorporated into the analysis for Figures 2H and 2I, and the graphs have been updated accordingly. While we had uploaded the new blot, we inadvertently forgot to update the analysis graphs. Thank you for bringing this to our attention.

      (2) The authors talk several times about "supercomplexes 1 and 2" without testing their precise composition (there is a ton of literature about SC species in several mouse cell types, and separate BN-PAGE immunoblotting of individual MRC complexes would precisely define them in this context)

      We agree with the reviewer that this is an important point. However, structural differences between supercomplexes were outside the scope of this paper, and we did not perform such analyses. That said, examining the precise composition of supercomplexes could be a valuable direction for future work.

      (3) Steady-state levels of MRC subunits do not match the observations from BN-PAGE results. That might be potentially interpreted and explained by the possible accumulation of intermediates but this is not explored.

      We appreciate the reviewer’s observation. There is indeed a strong possibility that differences in the expression of structural components of mitochondrial complexes exist between WT and Miro1 -/- cells. However, in this study, we chose to focus on assessing potential differences in the enzymatic activities of the complexes rather than examining their structural composition. Exploring the accumulation of intermediates and structural differences could be an interesting avenue for future investigations.

      (4) Citrate synthase normalization of kinetic enzyme activities is claimed, yet it is not shown in any graph and no description of the method is provided.

      We sincerely thank the reviewer for pointing out this discrepancy. Upon careful review, we realized that our statement regarding citrate synthase normalization of kinetic enzyme activities in the last revised version was made in error. This was a miscommunication between co-authors, and we did not perform citrate synthase normalization. Instead, the normalization was performed against protein concentration, determined by the BCA assay as described in the manuscript. We regret this oversight and appreciate the opportunity to clarify this.

      (5) Complex I activity is still wrongfully described as NADPH oxidation in the methods

      We corrected this error.

      (6) The authors state 'Thank you for this comment. We believe this is due to a technical issue. Complex IV can be challenging to detect consistently, as its visibility is highly dependent on sample preparation conditions. In this specific case, we suspect that the buffer used during the isolation process may have influenced the detection of Complex IV'. I do not understand this, I find this justification insufficient and not substantiated by any experimental evidence. What buffer has been used for isolation? There are hundreds of protocols for isolation of intact mitochondria and MRC complexes. Also, DDM and digitonin are the gold-standard detergents for MRC complexes isolation and separation via BN-PAGE.

      We thank the reviewer for raising this important point. We have revised the response to clarify the exact experimental conditions and to provide supporting data.

      For BN-PAGE, mitochondrial fractions purified from cultured VSMCs or aortic tissue were prepared using a standard protocol (now explicitly detailed in the Methods). Briefly, mitochondria were resuspended in 6-aminocaproic acid (ACA) buffer containing 750 mM ACA, 50 mM Bis-Tris (pH 7.0), and protease inhibitors. Forty micrograms of mitochondrial protein were solubilized with 1.5% digitonin, using a final detergent-to-protein ratio of 8:1, and incubated on ice for 20 minutes prior to clarification by centrifugation at 16,000 g for 30 minutes at 4°C. Thus, consistent with established standards, digitonin—one of the gold-standard detergents for MRC complex solubilization and BN-PAGE—was used throughout.

      Despite using these widely accepted conditions, we found that detection of fully assembled Complex IV by BN-PAGE was inconsistent, a limitation that has been reported by others and is known to be sensitive to mitochondrial source, tissue type, and solubilization efficiency. To address this directly and avoid over-interpretation, we assessed Complex IV integrity by examining core subunits. As shown in Figure 6—figure supplement 1 (panels B and C), expression levels of MTCO1 and MTCO2, both essential core components of Complex IV, do not differ significantly between WT and Miro1-/- cells, supporting the conclusion that Complex IV abundance is not altered.

      We have revised the manuscript to clarify these methodological details and to explicitly state that conclusions regarding Complex IV are based on subunit analysis rather than BN-PAGE visualization alone.

      (7) Complex V IGA also does not seem to reflect its quantification.

      Thank you for highlighting this concern. To address it, we will include the numerical data alongside the figures to ensure clarity and alignment with our findings. We hope this will provide a more comprehensive understanding and resolve any ambiguity.

      (8) Figure 6 supplement 1, the authors state 'we concentrated on ETC1 and 5 and performed experiments in cells after expression of MIRO1 WT and MIRO1 mutants'. I do not understand, what background is being used? what mutants are being expressed? all the figures refer to Miro1 -/- which is, according to standard genetic nomenclature, a loss-of-function allele (KO).

      Thank you for your comment. To clarify, we first infected MIRO1fl/fl VSMCs with an adenovirus expressing the DNA recombinase Cre or a control adenovirus. Cells infected with the adenovirus expressing Cre are labeled as MIRO1-/- cells. In these MIRO1-/- cells, we then introduced MIRO1 wild type (WT) and MIRO1 mutants via adenoviral expression.

      The mutants include one lacking the transmembrane domain (MIRO1-ΔTM), and another in which the two EF hands of MIRO1 were point-mutated (MIRO1-KK). MIRO1-WT is denoted as Ad WT, the mutant MIRO1-KK as Ad KK, and MIRO1-ΔTM as Ad ΔTM in the figures. We hope this explanation clarifies the experimental background and nomenclature used.

      (9) Figure 6 supplement 1B, no normalization is provided (e.g. VDAC, TOM20 etc.). Interestingly, VDAC is then used to normalize the data in C-D-E-F-G. Also, why is MIRO1 detected in lane 4? Is the mutant stable or not? There is zero signal in A.

      Thank you very much for pointing out that the immunoblot for VDAC1 was missing in Figure 6—Supplement 1B. This figure has been reviewed several times, and unfortunately, this error was not detected. We sincerely apologize for this oversight. We have now revised the figure to include the immunoblot for VDAC1 to address this issue.

      Regarding the detection of MIRO1 in lane 4, we confirm that the "mutant" is not stable. To generate MIRO1 knockout cells, aortic smooth muscle cells from MIRO1fl/fl mice were isolated and cultured, followed by infection with an adenovirus expressing Cre. As these are primary cells and the deletion was induced by Cre expression, the recombination efficiency can vary, which is reflected in the variability observed in lanes 2 and 4 of the immunoblot.

      (10) Why are COX4 levels so low in the 2nd replicate in 7A? the authors 'We also performed anti-VDAC immunoblots on the same membranes as alternative loading control (see image below)'. I could not find the image.

      Thank you for your comment. The second pair of samples in Figure 7A is from a different preparation of mitochondria. In our experimental design, a control sample and a MIRO1 knockdown sample were processed side by side and run next to each other on the immunoblot.

      Regarding the anti-VDAC immunoblot, the image was included in our response to reviewers during the previous revision, as we did not believe it altered the message conveyed by the COX4 blot. However, to ensure clarity and address your concern, we have now included the anti-VDAC immunoblot directly in the figure. We hope this addition resolves any ambiguity and provides further confidence in the data presented.

      (11) The proposed interaction between MIRO1 and NDUFA9 is very difficult to reconcile, as the two proteins reside in distinct mitochondrial compartments. MIRO1 is anchored to the outer mitochondrial membrane (OMM), with its functional domains facing the cytosol, whereas NDUFA9 is a matrix-facing accessory subunit of mitochondrial Complex I, positioned at the interface between the N- and Q-modules.

      We appreciate the reviewer’s comment and agree that MIRO1 and NDUFA9 occupy distinct mitochondrial compartments. MIRO1 is anchored to the outer mitochondrial membrane with cytosol-facing domains, whereas NDUFA9 is a matrix-facing accessory subunit of Complex I at the N/Q-module interface.

      Our data do not suggest a stable, constitutive interaction within intact mitochondria. Rather, the observed association likely reflects an indirect, transient, or context-dependent interaction, potentially occurring during mitochondrial stress, remodeling, or turnover. Such associations may be mediated by multi-protein complexes spanning mitochondrial membranes, dynamic contact sites, or post-lysis interactions detected under experimental conditions. Increasing evidence supports functional coupling between outer mitochondrial membrane proteins and inner membrane or matrix pathways without direct physical binding.

      Additional comments:

      (12) All the raw data should be provided to the readers (uncropped and annotated WB, IHC images, numerical data with statistics applied).

      We agree with the reviewer and appreciate the emphasis on transparency. In accordance with eLife submission requirements, we have provided all raw data. The Source Data files associated with each figure now include uncropped and annotated immunoblots, as well as the numerical source data for all quantified analyses.

      During the compilation of these materials, we were unable to locate the original source files for Figure 2A. The control experiment depicted in the previous version, which demonstrates in vitro recombination, was performed in 2018. However, this experiment was repeated several times throughout the project. Therefore, to ensure the manuscript remains complete, we have replaced this panel with a representative immunoblot from a similar experiment. Additionally, during our review, we discovered a labeling error in Figure 3D and G. We have corrected these figures to ensure accuracy.

      All source files have been provided and carefully labeled to facilitate independent evaluation.

    1. Reviewer #1 (Public review):

      Summary:

      In this work, the authors investigate the mechanisms of low-frequency synaptic depression at cerebellar parallel fiber to interneuron synapses using unitary recordings that allow direct quantification of synaptic vesicle release. They show that sparse stimulation can induce robust synaptic depression even in the absence of substantial vesicle consumption, and that this depressed state is rapidly reversed when stimulation frequency is increased. To account for these observations, the authors propose a model in which low-frequency depression reflects a redistribution of vesicles within the readily releasable pool, in particular, a reduction in docking site occupancy due to vesicle undocking.

      Strengths:

      I found the experimental work to be of high quality throughout. The use of simple synapse recordings to count individual vesicle release events is particularly powerful in this context and allows questions to be addressed that are difficult to approach with more conventional approaches. The demonstration that low-frequency depression can occur independently of prior vesicle release, together with the rapid recovery observed during high-frequency stimulation, places strong constraints on possible underlying mechanisms and represents a clear strength of the study.

      The modeling framework is clearly laid out and helps organize a broad set of observations across stimulation frequencies. Several of the experimental tests appear well-motivated by the model, including the recovery train experiments, the analysis of failures, and the use of doublet stimulation. Taken together, the data provide a coherent phenomenological description of low-frequency depression and its relationship to vesicle availability within the readily releasable pool.

      Weaknesses:

      While the experimental results are strong, the manuscript would benefit from rebalancing the strength of the mechanistic conclusions drawn from the modeling in light of its limitations. The framework is clearly useful and provides a coherent interpretation of the data, but it is not uniquely constrained by the experimental observations, and alternative models or interpretations could plausibly account for the findings. The use of different model regimes concatenated across time, with substantially different parameter values, highlights the abstract nature of the approach. For these reasons, the model seems best presented as one plausible explanatory framework rather than a definitive biological mechanism. Clarifying the distinction between data-driven observations and model-based inferences would help readers assess which conclusions are strongly supported and which remain more speculative.

      The interpretation of the Ca2+-related experiments would benefit from more cautious wording. The absence of detectable changes in presynaptic Ca2+ signals does not exclude more localized or subtle Ca2+-dependent mechanisms, and conclusions regarding Ca2+ independence should therefore be framed accordingly. In addition, while low-frequency depression is still observed at reduced extracellular Ca2+, these experiments appear less diagnostic of the specific model-derived mechanism emphasized elsewhere in the manuscript - namely, a selective reduction in docking-site occupancy - and should be discussed with appropriate qualification in the text.

      Major points:

      (1) Clarify and qualify mechanistic claims derived from the model.

      Throughout the manuscript, changes in model parameters are at times described as if they directly reflected underlying physiological mechanisms. As a result, the conceptual distinction between experimentally observed phenomena, model-derived variables, and biological interpretation is not always clear. Several conclusions in the Results and Discussion are phrased as mechanistic statements, although they rest on assumptions intrinsic to the modeling framework. The authors should systematically review the text and explicitly distinguish between (i) experimentally observed changes in synaptic responses and (ii) inferences about vesicle docking states or transitions within the model.

      In particular, statements implying that vesicle undocking is the mechanism underlying low-frequency depression should be rephrased to reflect that this is an interpretation within the proposed framework rather than a uniquely demonstrated biological process. For example, statements such as "Low-frequency depression is caused by synaptic vesicle undocking" should be replaced with formulations such as "Within the framework of our model, low-frequency depression is accounted for by a redistribution of synaptic vesicles away from docking sites" or "Our results are consistent with a model in which changes in vesicle docking-state occupancy contribute to low-frequency depression."

      A particularly problematic example is the statement that "these experiments further confirm that LFD only involves a decrease in δ, without accompanying changes in ρ or IP size." Here, an experimentally defined phenomenon (LFD) is directly equated with changes in model-derived variables. Such statements should be revised to make clear that δ, ρ, and IP size are inferred quantities within the model, and that the experimental data are interpreted through this framework rather than directly confirming changes in these parameters. Similarly, over-generalizing statements such as "Undocking therefore represents the key mechanism controlling short-term depression across stimulation frequencies" should be softened to reflect that this conclusion emerges from the model rather than from direct experimental evidence.

      (2) Address the biological interpretation of time-dependent model regimes.

      The model relies on distinct parameter regimes applied at different time points, with some transitions effectively suppressed in certain regimes. While this approach captures the data well, its biological interpretation remains unclear. The authors should either (i) expand the discussion to outline plausible biological processes that could give rise to such regime changes (for example, calcium-dependent modulation of transition rates or activity-dependent changes in vesicle state stability), or (ii) more explicitly frame this aspect of the model as a descriptive abstraction rather than a mechanistic proposal. This further underscores the need to clearly separate the descriptive role of the model from claims about underlying biological mechanisms.

      (3) Reframe conclusions drawn from calcium-related experiments.

      The calcium imaging data demonstrate no detectable changes in the measured presynaptic calcium signals under the tested conditions, but they do not rule out that calcium signals contribute in ways undetectable by the assay. Conclusions should therefore be revised to reflect this limitation, avoiding statements that exclude a role for calcium-dependent mechanisms. Wording such as "we did not detect evidence for..." would be more appropriate than conclusions implying the absence of an effect.

      Similarly, while low-frequency depression is still observed at reduced extracellular calcium (1.5 mM Ca²⁺), the specific mechanistic signature emphasized elsewhere in the manuscript - namely a selectively reduced first response during a high-frequency recovery train - is no longer apparent. These experiments should therefore be discussed as consistent with the proposed framework, but not as providing independent support for a selective reduction in docking-site occupancy. Explicitly acknowledging this limitation would improve clarity and avoid over-interpreting these data.

      (4) Soften interpretations based on non-significant comparisons.

      In several places, comparisons that do not reach statistical significance are used to argue for equivalence between conditions (for example, comparisons involving failure versus non-failure trials or different LFD conditions). These conclusions should be revised to emphasize the limits of statistical power and framed as a lack of evidence for a difference rather than evidence of independence.

    2. Reviewer #3 (Public review):

      Summary:

      The manuscript builds on the observation that, at some synapses, low-frequency stimulation causes synaptic depression, which can be reversed by subsequent high-frequency stimulation. Such low-frequency depression (LFD) cannot be easily explained by the depletion of a single vesicle pool. Here, Silva and colleagues propose a model of activity-dependent vesicle trafficking to explain LFD at synapses between cerebellar granule cells and molecular layer interneurons.

      Strengths:

      Overall, LFD is interesting and worthy of examination, and the authors provide new experimental results that are of the high quality expected from this group.

      Weaknesses:

      The study proposes a novel model of vesicle trafficking that is not explained by known biological mechanisms, and the manuscript does not adequately compare or discuss alternative models.

      I have several concerns about how the authors interpret the data. First, the manuscript's primary conceptual advance is the idea that LFD involves vesicle undocking, rather than depletion. However, most experiments were performed under conditions that promote vesicle depletion (3 mM extracellular Ca2+). When experiments were repeated in physiological Ca2+, there appeared to be little or no LFD (stats are not provided). Second, the RS/DS/DU/undocking model, though not outside the realm of possibility, is not readily explained by known mechanisms and is only loosely supported by experimental findings. Third, when simulating LFD, the authors do not compare alternative models and use inappropriate language to imply that a model fit represents the truth (e.g., "the finding of identical experimental and simulated values confirms that the undocking mechanism accounts for LFD"). Finally, the model is presented in an overly complicated manner. The sheer amount of terms and nomenclature makes the manuscript confusing and difficult to read. Overall, the manuscript would benefit from added experiments and more statistics, a better justification and evaluation of the model, and more nuanced language.

      Major concerns:

      (1) Most experiments were performed under conditions that exacerbate depletion

      In order to attribute LFD to vesicle undocking rather than depletion, it is important to show LFD under conditions where depletion is minimal. As mentioned above, the authors only report significant LFD in elevated extracellular Ca2+. In a small number of experiments performed in more physiological Ca2+ (1.5 mM), there is no depression after a single stimulus, and it is not clear that there was statistically significant depression during a low-frequency train. Several studies cited in support of LFD share this problem:

      • Abrahamsson et al., (2007) recorded from Schaffer collaterals in 4 mM Ca, 3-4X physiological Ca2+.

      • Doussau et al., (2010) recorded from aplysia synapses in 3X Ca compared to seawater.

      • Rudolph et al., (2011) is cited as an example of LFD. However, this study performed experiments at high release probability cerebellar climbing fibers, and reported depression that increased monotonically with

      stimulation frequency, so it does not resemble the phenomenon studied in this paper. Lin et al., (2022) also largely describe monotonic depression at the calyx.

      The authors note that their results differ from those of Atluri and Regehr, but do not mention that a possible reason for the difference is the increased release probability in their experiments.

      The authors should provide statistics for the data obtained in 1.5 mM Ca, and discuss why LFD is increased in conditions that also elevate vesicle release probability.

      (2) Lack of biological mechanisms supporting the model

      The model is presented without compelling biological support. The evidence in support of vesicle undocking comes from experiments by the Watanabe lab, which showed fewer-than-expected docked vesicles under EM when cultured synapses were stimulated immediately prior to high-pressure freezing. Kusick et al were careful to note that these vesicles may have been lost to fusion.

      The putative undocking Kusick describes is immediate (< 5 ms after stimulation), and was not shown to be Ca2+ sensitive. This manuscript describes "calcium-dependent undocking" that proceeds from 10 ms - 200 ms. Multiple studies from the Watanabe lab show that a single stimulus lowers the number of docked vesicles, and subsequently, there is a transient redocking of vesicles that can be blocked by EGTA or Syt7 knockout.

      I also question the rationale for the authors' model that 2 vesicles are coupled in series to a single release site. Previous papers from this lab cited EM studies from frog and neuromuscular that showed filamentous connections between vesicles (do these synapses show LFD?). Here, the authors primarily cite their previous models to support their arguments. I encourage them to continue searching for ultrastructural evidence for 2-vesicle-docking-units and to cite such studies.

      (3) Comparison to other vesicle models

      The authors use overly assertive language to suggest that the model proves a mechanism. "Altogether, these results indicate that the slow phase of LFD ... reflects a δ decrease without significant changes in pr, in ρ or in IP size". Simulating data does not conclusively "indicate" the underlying mechanism, but the authors could state their data can be "explained by a model where..".

      However, LFD does not require activity-dependent undocking. Instead, the phenomenon has been explained by high-release probability, paired with an activity-dependent increase in either docking or release probability (Chiu and Carter, 2024; Doussau et al., 2017). Does the new model do a better job of replicating some facet of the data? If multiple models can explain the same data, how can we determine which model is correct? The "Alternative Presynaptic Depression Mechanisms" should be expanded to discuss these issues.

    3. Author response:

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      In this work, the authors investigate the mechanisms of low-frequency synaptic depression at cerebellar parallel fiber to interneuron synapses using unitary recordings that allow direct quantification of synaptic vesicle release. They show that sparse stimulation can induce robust synaptic depression even in the absence of substantial vesicle consumption, and that this depressed state is rapidly reversed when stimulation frequency is increased. To account for these observations, the authors propose a model in which low-frequency depression reflects a redistribution of vesicles within the readily releasable pool, in particular, a reduction in docking site occupancy due to vesicle undocking.

      Strengths:

      I found the experimental work to be of high quality throughout. The use of simple synapse recordings to count individual vesicle release events is particularly powerful in this context and allows questions to be addressed that are difficult to approach with more conventional approaches. The demonstration that low-frequency depression can occur independently of prior vesicle release, together with the rapid recovery observed during high-frequency stimulation, places strong constraints on possible underlying mechanisms and represents a clear strength of the study.

      The modelling framework is clearly laid out and helps organize a broad set of observations across stimulation frequencies. Several of the experimental tests appear well-motivated by the model, including the recovery train experiments, the analysis of failures, and the use of doublet stimulation. Taken together, the data provide a coherent phenomenological description of low-frequency depression and its relationship to vesicle availability within the readily releasable pool.

      We thank the Reviewer for his positive assessment of our work.

      Weaknesses:

      While the experimental results are strong, the manuscript would benefit from rebalancing the strength of the mechanistic conclusions drawn from the modelling in light of its limitations. The framework is clearly useful and provides a coherent interpretation of the data, but it is not uniquely constrained by the experimental observations, and alternative models or interpretations could plausibly account for the findings. The use of different model regimes concatenated across time, with substantially different parameter values, highlights the abstract nature of the approach. For these reasons, the model seems best presented as one plausible explanatory framework rather than a definitive biological mechanism. Clarifying the distinction between data-driven observations and model-based inferences would help readers assess which conclusions are strongly supported and which remain more speculative.

      The interpretation of the Ca<sup>2+</sup>-related experiments would benefit from more cautious wording. The absence of detectable changes in presynaptic Ca<sup>2+</sup> signals does not exclude more localized or subtle Ca<sup>2+</sup>-dependent mechanisms, and conclusions regarding Ca<sup>2+</sup> independence should therefore be framed accordingly. In addition, while low-frequency depression is still observed at reduced extracellular Ca<sup>2+</sup>, these experiments appear less diagnostic of the specific model-derived mechanism emphasized elsewhere in the manuscript - namely, a selective reduction in docking-site occupancy - and should be discussed with appropriate qualification in the text.

      Concerning Ca<sup>2+</sup> signals, the Reviewer is right. While we found no change in Ca<sup>2+</sup> signalling apart from a slow Ca<sup>2+</sup> accumulation during long trains at 1 Hz, the possibility of an undetected change cannot be excluded. We have added a word of caution in this direction on p. 11. Concerning the 1.5 mM Ca<sup>2+</sup> experiments, the Reviewer presumably alludes to the first recovery train (yellow) point in Supplementary Fig. 2C. This is also the last point (s11) of the slow train at 0.5 Hz because no delay at all was interposed between the slow train and the recovery train. We have now included one more experiment (with a present total number n = 6), and we have corrected Fig. S2C accordingly. In the new version the depression measured for s4-s10 vs s1 during the 0.5 Hz trains is 0.69 +/- 0.05 (p = 0.00058, paired one-tail t-test). The ratio of the s1 value of the recovery train compared to control s1 is 0.83 +/- 0.08 (p = 0.028, paired one-tail t-test).

      Major points:

      (1) Clarify and qualify mechanistic claims derived from the model.

      Throughout the manuscript, changes in model parameters are at times described as if they directly reflected underlying physiological mechanisms. As a result, the conceptual distinction between experimentally observed phenomena, model-derived variables, and biological interpretation is not always clear. Several conclusions in the Results and Discussion are phrased as mechanistic statements, although they rest on assumptions intrinsic to the modelling framework. The authors should systematically review the text and explicitly distinguish between (i) experimentally observed changes in synaptic responses and (ii) inferences about vesicle docking states or transitions within the model.

      In particular, statements implying that vesicle undocking is the mechanism underlying low-frequency depression should be rephrased to reflect that this is an interpretation within the proposed framework rather than a uniquely demonstrated biological process. For example, statements such as "Low-frequency depression is caused by synaptic vesicle undocking" should be replaced with formulations such as "Within the framework of our model, low-frequency depression is accounted for by a redistribution of synaptic vesicles away from docking sites" or "Our results are consistent with a model in which changes in vesicle docking-state occupancy contribute to low-frequency depression."

      A particularly problematic example is the statement that "these experiments further confirm that LFD only involves a decrease in δ, without accompanying changes in ρ or IP size." Here, an experimentally defined phenomenon (LFD) is directly equated with changes in model-derived variables. Such statements should be revised to make clear that δ, ρ, and IP size are inferred quantities within the model, and that the experimental data are interpreted through this framework rather than directly confirming changes in these parameters. Similarly, overgeneralizing statements such as "Undocking therefore represents the key mechanism controlling short-term depression across stimulation frequencies" should be softened to reflect that this conclusion emerges from the model rather than from direct experimental evidence.

      As suggested, we clarify the distinction in the revised version between experimental data and modelling, and we refrain from making definitive statements on underlying cellular mechanisms.

      (2) Address the biological interpretation of time-dependent model regimes.

      The model relies on distinct parameter regimes applied at different time points, with some transitions effectively suppressed in certain regimes. While this approach captures the data well, its biological interpretation remains unclear. The authors should either (i) expand the discussion to outline plausible biological processes that could give rise to such regime changes (for example, calcium-dependent modulation of transition rates or activity-dependent changes in vesicle state stability), or (ii) more explicitly frame this aspect of the model as a descriptive abstraction rather than a mechanistic proposal. This further underscores the need to clearly separate the descriptive role of the model from claims about underlying biological mechanisms.

      We thank the Reviewer for drawing our attention to this important point. Below 10 ms, rate constants are largely determined by the large amplitude, fast decaying Ca<sup>2+</sup> signal occurring near voltage-dependent Ca<sup>2+</sup> channels (‘Ca<sup>2+</sup> nanodomain’). After 10 ms, the rate constants depend on the low amplitude, slowly decaying Ca<sup>2+</sup> signals averaged over the entire varicosity (‘volume-averaged Ca<sup>2+</sup>’). We explain this better in the revised version (Materials and Methods, p. 21).

      (3) Reframe conclusions drawn from calcium-related experiments.

      The calcium imaging data demonstrate no detectable changes in the measured presynaptic calcium signals under the tested conditions, but they do not rule out that calcium signals contribute in ways undetectable by the assay. Conclusions should therefore be revised to reflect this limitation, avoiding statements that exclude a role for calcium-dependent mechanisms. Wording such as "we did not detect evidence for..." would be more appropriate than conclusions implying the absence of an effect.

      Similarly, while low-frequency depression is still observed at reduced extracellular calcium (1.5 mM Ca<sup>2+</sup>), the specific mechanistic signature emphasized elsewhere in the manuscript - namely a selectively reduced first response during a high-frequency recovery train - is no longer apparent. These experiments should therefore be discussed as consistent with the proposed framework, but not as providing independent support for a selective reduction in docking-site occupancy. Explicitly acknowledging this limitation would improve clarity and avoid overinterpreting these data.

      This has been discussed above (‘weaknesses’).

      (4) Soften interpretations based on non-significant comparisons.

      In several places, comparisons that do not reach statistical significance are used to argue for equivalence between conditions (for example, comparisons involving failure versus non-failure trials or different LFD conditions). These conclusions should be revised to emphasize the limits of statistical power and framed as a lack of evidence for a difference rather than evidence of independence.

      We have attended this point in the revised version.

      Reviewer #2 (Public review):

      Summary:

      Silva and co-workers exploit their previously established methods of analyzing release events at single parallel fiber to molecular layer interneuron synapses. They observed synaptic depression at low transmission frequencies (< 5 Hz), which rapidly recovers during high-frequency transmission. Analysis of the time course of low-frequency depression revealed an initial rapid and a slow linearly increasing time course. Strikingly, the initial depression occurred even in the absence of preceding release, arguing against vesicle depletion as the underlying mechanism.

      Strengths:

      The main strength of the study is the careful demonstration of an interesting synaptic phenomenon challenging the classical vesicle-centered interpretation of synaptic depression.

      We thank the Reviewer for his positive assessment of our work.

      Weaknesses:

      No major weaknesses were identified by this reviewer.

      The finding of release-independent synaptic depression is important and would have widespread implications. Therefore, some more analyses to increase the confidence in these findings could be performed.

      My concern is whether rundown could explain the findings. If the rate of failures in s1 increases and at the same time the amplitude decreases during the experiments, an apparent depression in s2 could arise. The Supplementary Figure 5A addresses run-down, but the figure is not easy to understand, and, as far as I understood, it does not address the question of whether the release-independent depression could be caused by a rundown. To address this, the analysis of Figure 5 could be repeated by investigating the failure rate and amplitude separately or by analyzing the 1st and 2nd half of the recordings separately.

      The Reviewer makes a very important point that had escaped our attention. If the responses were declining over the course of an experiment, near the end of the recordings, a high proportion of failures would be associated with a weak response to the second AP. This could distort the relation between initial failures and amount of LFD, perhaps to the point of indicating LFD after failures when there were none. As suggested by the Reviewer, we tested this possibility by examining the stability of the synaptic responses during experiments. We found a mean s<sub>1</sub> value of 0.87 ± 0.13 for the first half of the experiments used in Fig. 5, and of 1.10 ± 0.17 for the second half (p > 0.05, n = 10). This analysis shows that there was no rundown during these experiments. We show in Author response image 1 a plot of s1 as a function of the number of experiments. These plots do not suggest any artefactual correlation between failures, mean s1, and rundown.

      Author response image 1.

      Plot of s1 as a function of train number for the experiments of Fig. 5. In response to a request of Reviewer 2, this figure illustrates the evolution of s1 values as a function of train number for the experiments used to produce Figure 5. In each experiment, about 20 s1 values were obtained at two ISIs (either 10 ms and 500 ms, or 800 ms and 1600 ms). The figure shows two examples of s1 values as a function of train number (these values fluctuate widely between 0 and 3), and the average across cells and ISI values. There is no indication of a rundown of S1 values as a function of train number

      Reviewer #3 (Public review):

      Summary:

      The manuscript builds on the observation that, at some synapses, low-frequency stimulation causes synaptic depression, which can be reversed by subsequent high-frequency stimulation. Such low-frequency depression (LFD) cannot be easily explained by the depletion of a single vesicle pool. Here, Silva and colleagues propose a model of activity-dependent vesicle trafficking to explain LFD at synapses between cerebellar granule cells and molecular layer interneurons.

      Strengths:

      Overall, LFD is interesting and worthy of examination, and the authors provide new experimental results that are of the high quality expected from this group.

      Weaknesses:

      The study proposes a novel model of vesicle trafficking that is not explained by known biological mechanisms, and the manuscript does not adequately compare or discuss alternative models.

      I have several concerns about how the authors interpret the data. First, the manuscript's primary conceptual advance is the idea that LFD involves vesicle undocking, rather than depletion. However, most experiments were performed under conditions that promote vesicle depletion (3 mM extracellular Ca<sup>2+</sup>). When experiments were repeated in physiological Ca<sup>2+</sup>, there appeared to be little or no LFD (stats are not provided). Second, the RS/DS/DU/undocking model, though not outside the realm of possibility, is not readily explained by known mechanisms and is only loosely supported by experimental findings. Third, when simulating LFD, the authors do not compare alternative models and use inappropriate language to imply that a model fit represents the truth (e.g., "the finding of identical experimental and simulated values confirms that the undocking mechanism accounts for LFD"). Finally, the model is presented in an overly complicated manner. The sheer amount of terms and nomenclature makes the manuscript confusing and difficult to read. Overall, the manuscript would benefit from added experiments and more statistics, a better justification and evaluation of the model, and more nuanced language.

      We respectfully disagree with these sweeping criticisms, as described in more detail below.

      Major concerns:

      (1) Most experiments were performed under conditions that exacerbate depletion

      In order to attribute LFD to vesicle undocking rather than depletion, it is important to show LFD under conditions where depletion is minimal. As mentioned above, the authors only report significant LFD in elevated extracellular Ca<sup>2+</sup>. In a small number of experiments performed in more physiological Ca<sup>2+</sup> (1.5 mM), there is no depression after a single stimulus, and it is not clear that there was statistically significant depression during a low-frequency train. Several studies cited in support of LFD share this problem:

      - Abrahamsson et al., (2007) recorded from Schaffer collaterals in 4 mM Ca, 3-4X physiological Ca<sup>2+</sup>.

      - Doussau et al., (2010) recorded from Aplysia synapses in 3X Ca compared to seawater.

      - Rudolph et al., (2011) is cited as an example of LFD. However, this study performed experiments at high release probability cerebellar climbing fibers, and reported depression that increased monotonically with stimulation frequency, so it does not resemble the phenomenon studied in this paper. Lin et al., (2022) also largely describe monotonic depression at the calyx.

      The Reviewer suggests that LFD may only occur under non-physiological conditions, if the release probability has been increased by artificially elevating the extracellular Ca<sup>2+</sup>. The implication is that LFD is at best a curiosity with little or no significance for brain signalling. We disagree with this point of view for several reasons.

      Concerning the statement ‘In order to attribute LFD to vesicle undocking rather than depletion, it is important to show LFD under conditions where depletion is minimal’: This is the purpose of the analysis shown in Fig. 5.

      The statement ‘the authors only report significant LFD in elevated extracellular Ca<sup>2+</sup>’ is inaccurate. Fig. S2C shows a clear LFD in 1.5 mM Ca<sup>2+</sup>, as acknowledged by Reviewer 1 (‘low-frequency depression is still observed at reduced extracellular Ca<sup>2+</sup>’). However, we failed to provide a p-value for the depression in the initial version of the paper (p = 0.004, n = 5, with this data set; paired t-test, one-tailed). In the revised version, we document the 1.5 mM results more extensively, including the incorporation of the results of an additional experiment, and an explicit statistical analysis of the data (p = 0.00058, n = 6; paired t-test, one-tailed).

      Concerning the statement ‘there is no depression after a single stimulus’: We find that the onset kinetics of LFD is slower in 1.5 Ca<sup>2+</sup> than in 3 Ca<sup>2+</sup> (respectively 1.8 ISI and 0.51 ISI, Fig. 2C and Fig. S2C). This explains that the PPR is not significantly <1 in 1.5 Ca<sup>2+</sup> without implying any weakening of extent of LFD at steady state.

      As explained in the manuscript (p. 5), in a previous work, we developed a method to ascribe changes in SV pools, within the RS/DS model, with specific modifications of s1, s2 and s5-s8 during test 100 Hz trains (Tran et al., 2022). This method was developed in 3 mM Ca<sup>2+</sup> conditions, and for this reason, we performed most experiments for the present work in 3 mM Ca<sup>2+</sup>.

      Chiu and Carter (2024) demonstrated LFD in neocortical synapses; they performed their study in 1.2 mM Ca<sup>2+</sup>, not in elevated Ca<sup>2+</sup>.

      Rudolph et al. (2011) showed low frequency depression not only in elevated external Ca<sup>2+</sup>, but also in 0.5 mM Ca<sup>2+</sup>. While Rudolph et al. (2011) did not make an explicit link between their observations and LFD, there is no reason to doubt that these observations are an example of LFD. They showed a biphasic depression when switching the stimulation frequency from 0.05 Hz to 2 Hz. In one of the founding papers of LFD, Doussau et al. (2010) describe a biphasic depression when switching the stimulation frequency from 0.025 Hz to 1 Hz; the Fig. 1 of the two papers (Rudolph 2011 and Doussau 2010) are strikingly similar.

      Lin et al. (2022) would probably not agree with the statement that the depression at the calyx is ‘largely monotonic’, as they stress the finding of quasi-constant depression between 5 and 50 Hz.

      The authors note that their results differ from those of Atluri and Regehr, but do not mention that a possible reason for the difference is the increased release probability in their experiments.

      In fact, we clearly listed the difference in external Ca<sup>2+</sup> as a likely source of the discrepancy by saying ‘This discrepancy presumably stems from differences in experimental conditions (room temperature, stimulation of multiple presynaptic PFs and 2 mM external Ca<sup>2+</sup> concentration in the previous work, vs. near-physiological temperature, single presynaptic stimulation and 3 mM external Ca<sup>2+</sup> here)’.

      The authors should provide statistics for the data obtained in 1.5 mM Ca, and discuss why LFD is increased in conditions that also elevate vesicle release probability.

      See our comments above: the revised version includes the requested statistics. On p. 6 of the manuscript, we do provide an explanation for the apparent lack of LFD at 1.5 Ca<sup>2+</sup> and 2 Hz, namely a superimposition of LFD with facilitation. At 1.5 Ca<sup>2+</sup> and 0.5 Hz, our LFD numbers are not weaker than at 3 mM Ca<sup>2+</sup> and 0.5 Hz of 1 Hz.

      Altogether, it is correct that many LFD experiments have been carried out in high release probability synapses, and/or under conditions of elevated Ca<sup>2+</sup>. However, the reasons underlying these choices are diverse (in our case, to build on the previous SV pool analysis developed in Tran et al. 2022 in 3 Ca<sup>2+</sup> conditions) and do not imply a limitation to the phenomenon. LFD is present in physiological conditions for low-to-moderate release probability synapses (as shown in our work), and altogether, there is no reason to dismiss LFD as nonphysiological.

      (2) Lack of biological mechanisms supporting the model

      The model is presented without compelling biological support. The evidence in support of vesicle undocking comes from experiments by the Watanabe lab, which showed fewerthanexpected docked vesicles under EM when cultured synapses were stimulated immediately prior to high-pressure freezing. Kusick et al were careful to note that these vesicles may have been lost to fusion.

      The Watanabe lab showed an SV deficit at docking sites at times ranging from about 100 ms to several seconds (Kusick et al., 2020, their Fig. 5E). This corresponds to the ISI values where we see paired-pulse depression. In their Summary, Kusick et al. raise the possibility of SV fusion as an alternative to undocking at the 100 ms time point. But, the same issue had previously been considered in Miki et al., 2018 with other techniques (their Fig. 2d), where it was shown that the SV deficit seen in paired-pulse experiments could not be explained by fusion. This leaves undocking as the most likely explanation, at least in our preparation. We have added a new paragraph on p. 14 to clarify this point.

      The putative undocking Kusick describes is immediate (< 5 ms after stimulation), and it was not shown to be Ca<sup>2+</sup> sensitive. This manuscript describes "calcium-dependent undocking" that proceeds from 10 ms - 200 ms. Multiple studies from the Watanabe lab show that a single stimulus lowers the number of docked vesicles, and subsequently, there is a transient redocking of vesicles that can be blocked by EGTA or Syt7 knockout.

      This is not an accurate description of the Kusick results or of our results. In the Kusick paper, the SV deficit seen at <5 ms after stimulation is attributed to exocytosis, not to undocking. Clearly, it is Ca<sup>2+</sup> dependent. Our manuscript describes potential calcium-dependent undocking not during the time 10 ms- 150 ms, during which our undocking rate is assumed to be calcium-independent, but starting at 150 ms, and lasting a few hundred ms thereafter.

      I also question the rationale for the authors' model that 2 vesicles are coupled in series to a single release site. Previous papers from this lab cited EM studies from frog and neuromuscular that showed filamentous connections between vesicles (do these synapses show LFD?). Here, the authors primarily cite their previous models to support their arguments. I encourage them to continue searching for ultrastructural evidence for 2-vesicle-docking-units and to cite such studies.

      It is important to remember that our sequential two-step model was not based on EM data, but on a series of functional data including variance-mean analysis of summed SV release numbers; covariance analysis among subsequent SV release numbers; analysis of release latencies as a function of stimulus number during an AP train; analysis of SV release numbers under conditions of very high release probability. We note that the phenomenon of Ca<sup>2+</sup>-dependent docking that we proposed based on these observations has been consistent with flash-and-freeze or zap-and-freeze results from several laboratories. Concerning potential filamentous connections between SVs and the AZ plasma membrane at a distance of several 10s of nm, this has been seen not only in frog or mice neuromuscular junctions, but also at brain synapses (ex: Siksou et al., Journal of Neuroscience 2007; Cole et al., Journal of Neuroscience 2016; Fernandez-Busnadiego, Journal of Cell Biology 2010; 2013).

      (3) Comparison to other vesicle models

      The authors use overly assertive language to suggest that the model proves a mechanism. "Altogether, these results indicate that the slow phase of LFD ... reflects a δ decrease without significant changes in pr, in ρ or in IP size". Simulating data does not conclusively "indicate" the underlying mechanism, but the authors could state their data can be "explained by a model where..".

      Please see our response above to a similar point by Reviewer 1.

      However, LFD does not require activity-dependent undocking. Instead, the phenomenon has been explained by high-release probability, paired with an activity-dependent increase in either docking or release probability (Chiu and Carter, 2024; Doussau et al., 2017). Does the new model do a better job of replicating some facet of the data? If multiple models can explain the same data, how can we determine which model is correct? The "Alternative Presynaptic Depression Mechanisms" should be expanded to discuss these issues.

      We could not find statements in the Chiu and Carter paper or in the Doussau et al. paper explaining LFD ‘by high-release probability, paired with an activity-dependent increase in either docking or release probability’. As far as we can see, Chiu and Carter do not propose any specific mechanism for LFD, beyond saying that depression and facilitation must be separate. Doussau et al. (their Fig. 6) clearly frame their interpretation in a sequential two-step model. As in the preceding Miki et al. paper (which they cite extensively), they assume a rapid (a few ms), Ca-dependent transition between their ‘reluctant pool’ and their ‘fully-releasable pool’, respectively homologous to RS and DS. Thus, the Doussau et al. interpretation is close to that presented in our present work, even though significant differences exist. An important difference is that Doussau et al. did not use simple synapses, so that they did not have access to key synaptic parameters such as the number of docking sites or the release probability per docking site. Consequently, the model in Doussau et al. does not have the same level of detail as ours. The revised version explains better the differences and similarity between the models of Doussau et al. and that exposed in our work (new paragraph on p. 14).

    1. Reviewer #1 (Public review):

      Sensory hair cells of the inner ear convert mechanical sound vibrations into electrical signals through mechano-electrical transduction (MET), a process critically dependent on the specialized organization and lipid composition of their plasma membrane. Although the protein components of the MET complex are relatively well characterized, the role of the lipid environment remains poorly understood and often overlooked. Recent discoveries that core MET proteins TMC1 and TMC2 function as lipid scramblases, disrupting membrane lipid asymmetry, expose a significant gap in our understanding of how lipid homeostasis is regulated in hair cells and how membrane dynamics influence MET function.

      In this study, the authors address this gap by identifying the P4-ATPase ATP8B1 and its chaperone TMEM30B as essential regulators of membrane lipid asymmetry in outer hair cells. They also generated HA-tagged knock-in mice to precisely localize the P4-ATPase ATP8B1 and its chaperone TMEM30B within outer hair cells, demonstrating their enrichment in stereocilia, and convincingly demonstrate that loss of these proteins causes phosphatidylserine externalization, hair cell degeneration, and hearing loss in mouse models, phenocopying defects observed in TMC1 mutant mice with constitutive scrambling activity. While these findings establish lipid flippase pathways as critical for hair cell survival and auditory function, they also raise important questions about the precise mechanisms linking lipid asymmetry disruption to MET dysfunction and hair cell pathology.

      Overall, the data convincingly support the conclusion that ATP8B1-TMEM30B flippase activity is required to maintain stereocilia lipid asymmetry and auditory function. The study substantially advances understanding of how lipid homeostasis intersects with MET. However, several points require clarification to ensure that localization claims and mechanistic interpretations are fully supported by the presented data.

      Revisions considered essential by this reviewer are:

      (1) Figure 1D.<br /> The authors should clarify how the qPCR data were normalized and specify the reference (housekeeping) genes used. This information is necessary to evaluate the robustness and comparability of the gene expression data.

      (2) Figure 1F.<br /> The lack of F-actin staining at the hair cell base raises the possibility that the permeabilization conditions may have limited antibody access to certain membrane regions. This is especially important given that the authors used a gentle permeabilization agent such as saponin to preserve membrane integrity. Because the authors conclude that ATP8B1 and TMEM30B are localized "almost exclusively to OHC bundles and the apical membrane, with minimal staining in the remaining plasma membrane," (line 128). Including co-labeling with a plasma membrane marker or more comprehensive F-actin visualization of lateral and basal regions would help ensure that the restricted localization is biological rather than technical. In the absence of such controls, the localization claim may be somewhat overstated and should be tempered accordingly.

      (3) Figure 7B.<br /> Although quantification of ATP8B1-HA intensity at the bundle appears similar between WT and Cib2 KO samples, the representative image suggests that some bundles lack detectable labeling. To better capture phenotype variability, it would be helpful to include an additional quantification showing the fraction or number of bundles with detectable ATP8B1-HA signal in Cib2 KO mice.

      (4) Lines 346-349.<br /> The manuscript suggests that IHCs lack stereocilia-enriched P4-ATPases. However, this conclusion is not directly supported by the presented data. The authors should either provide supporting localization or expression data for other P4-ATPases or soften the statement to indicate that no stereocilia-enriched P4-ATPases were detected under the conditions examined.

      Recommendations:

      (5) The authors convincingly demonstrate that TMEM30B loss results in ATP8B1 mislocalization. While not essential to the central conclusions, examining TMEM30B localization in ATP8B1 KO hair cells would clarify whether this interdependence is reciprocal, as described for other P4-ATPase-CDC50 complexes.

      (6) Lines 359-374.<br /> The discussion of Annexin V labeling is careful and balanced. This paragraph would benefit from referencing other studies that showed minimal Annexin V labeling in healthy P6 organ of Corti, reinforcing that robust PS externalization in the present study is pathological rather than developmental.

      (7) Lines 392-399.<br /> The proposed feedback model linking MET activity and ATP8B1-TMEM30B localization is compelling. The discussion could be strengthened by noting that in TMC1/2 double knockout hair cells, PS externalization is not observed, consistent with the idea that flippase activity becomes critical specifically when scrambling occurs. The mislocalization observed in Cib2 KO hair cells further supports the coupling between TMC-mediated scrambling and flippase-mediated membrane restoration.

    2. Reviewer #2 (Public review):

      Summary:

      Prior work identified TMEM30B (knockout mice) as well as ATP8B1 (human genetics and mouse model), ATP8A2 (knockout mice), and ATP811A (human genetics) as relevant for hearing. The authors also reasoned that, given the recent discovery of TMC1 and TMC2's dual function as mechanotransduction channels of the inner ear and as lipid scramblases, a counterpart flippase should be in the sensory hair-cell stereocilia bundle where mechanotransduction happens. They use CRISPR/CAS to modify the endogenous mouse genes and add an HA tag at the N-terminus of the ATP8B1, ATP8A1, ATP8A2, and ATP11A proteins. Their experiments with these mice unambiguously localized ATP8B1 at the base of outer hair cell stereocilia bundles. Knockout of ATP8B1 results in loss of outer hair cells, deficient auditory function (ABR), and degeneration of outer hair cell stereocilia bundles. Similarly, hair cells from genetically modified mice with endogenous HA-tagged TMEM30B proteins show localization of this protein to outer hair cell stereocilia bundles. TMEM30B knock-out mice phenocopy the ATP8B1 knock-out model. Interestingly, the authors show that annexing V staining precedes hair cell loss in ATP8B1 and TMEM30B knockout mice and that proper localization of these proteins is lost in mice that lack CIB2, a protein essential for hair cell mechanotransduction.

      Strengths:

      (1) Use of knock-in HA-tagged proteins, rather than antibody staining, to unambiguously localize ATP8B1 and TMEM30B.

      (2) Systematic characterization of auditory function (ABR), hair cell loss, and hair-cell stereocilia bundle morphology.

      (3) Advances our understanding of the role played by lipid homeostasis in auditory function.

      (4) Reports on mouse models that will be helpful to further understand the mechanistic role played by ATP8B1 and TMEM30B in normal hearing and hereditary deafness.

      Weaknesses:

      (1) Are the HA tags causing any functional issues? Function and localization of tagged proteins can sometimes be compromised. It would be good to know, for each knock-in model (TMEM30B, ATP8B1, ATP8A1, ATP8A2, and ATP11A ), whether the HA-tagged protein is causing any issues with the mice and particularly with hearing (ABRs). Are these mice normal? Can they hear? These data are missing.

      (2) Following on the point above, is it possible that ATP8B1-HA is well localized, but localization for the other three flippases (ATP8A1-HA, ATP8A2-HA, and ATP11A-HA) is compromised by the tag? Is this potential mislocalization causing any functional phenotypes? (ABRs of point 1). I find it surprising that there are flippases only in outer hair cells, and only formed by ATP8B1. A possible explanation is that the tag is interfering with trafficking. If so, there should be a phenotype (ABRs), although this might be masked by redundancy among these flippases or caused by systemic issues (admittedly difficult to sort out). Given that this manuscript will likely become foundational, and that there is evidence that at least two of the other flippases are involved in hearing loss, it would be good to provide more information about the mice and HA-tagged proteins in the other knock-ins (ATP8A1-HA, ATP8A2-HA, and ATP11A-HA). Depending on the data available for the knock-ins, the authors may want to discuss these scenarios and soften the statement indicating that inner-hair cells may lack flippase activity altogether.

      (3) Expression of ATP8B1 at P0 (Figure 1D), when there should not be protein in outer hair cells yet, seems high. Does this mean that other cells in the cochlea also express ATP8B1? Is this a concern?

      (4) Fluorescence scales in Figure 6 B and D and Figure 7 B and D are very different. So are the values for WT. One would expect that the WT would be similar in all cases (at least within the same compartments), given that the methods section indicates that "All images were collected using identical acquisition parameters, including zoom and laser power, across genotypes". If WT shows such variability, how can we compare?

    3. Author Response:

      Summary of Planned Revisions:

      We will clarify the qPCR methodology and interpretation to address potential misunderstandings.

      We will assess hearing in the generated HA-tagged mouse lines and, where appropriate, include a properly powered ABR analysis in the revised manuscript.

      We will address concerns regarding the z-stack in Figure 1f.

      We will include additional quantification for Figure 7B to strengthen the analysis.

      We will revise the relevant statement to read: “No IHC stereocilia-enriched P4-ATPases were detected under the conditions examined.”

      While we appreciate the suggestion to examine TMEM30B localization on the ATP8B1 KO background, this is not feasible within a reasonable timeframe; we will clarify this limitation in the manuscript.

      We will incorporate relevant prior work (e.g., George and Ricci, 2026) demonstrating minimal Annexin V labeling prior to P6 and lack of PS externalization in TMC1/2 double knockout models.

      We will clarify that hearing thresholds for TMEM30B-HA and ATP8B1-HA lines will be addressed in this study, while additional HA-tagged flippase lines (ATP8A1, ATP8A2, ATP11A) are part of ongoing work to be reported separately.

      We will soften statements regarding HA-tag insertion and clarify that, to our knowledge, localization and function are not disrupted, while acknowledging this as a potential limitation.

      We will revise the Methods section to clarify differences in fluorescence measurements across experiments.

      In addition to the experiments in response to reviewer’s suggestions, we will add the following data that we have generated while the paper was in review:

      Distortion product otoacoustic emission (DPOAEs) of the Atp8b1 KO and Tmem30b KO mice. Consistent with OHC function, their DPOAEs thresholds were elevated.

      Public Reviews:

      Reviewer #1 (Public review):

      (1) Figure1D.

      The authors should clarify how the qPCR data were normalized and specify the reference (housekeeping) genes used. This information is necessary to evaluate the robustness and comparability of the gene expression data.

      We thank the reviewer for this comment. qPCR data were normalized to GAPDH as the reference (housekeeping) gene. We will clarify this in the Methods section to ensure transparency and reproducibility.

      (2) Figure 1F.

      The lack of F-actin staining at the hair cell base raises the possibility that the permeabilization conditions may have limited antibody access to certain membrane regions. This is especially important given that the authors used a gentle permeabilization agent such as saponin to preserve membrane integrity. Because the authors conclude that ATP8B1 and TMEM30B are localized "almost exclusively to OHC bundles and the apical membrane, with minimal staining in the remaining plasma membrane," (line 128). Including co-labeling with a plasma membrane marker or more comprehensive F-actin visualization of lateral and basal regions would help ensure that the restricted localization is biological rather than technical. In the absence of such controls, the localization claim may be somewhat overstated and should be tempered accordingly.

      We appreciate this important point. The image shown represents a single z-slice from a larger stack, and the hair cell body lies outside the plane of this section. To clarify this, we will revise the figure presentation. Specifically, we can provide the full z-stack (already available via OSF) and/or replace the image with a resliced whole-mount view to better visualize the full cellular context.

      In terms of the possibility that the lack of staining in the hair cell’s plasma membrane might be due to insufficient antibody penetrance, we routinely perform Prestin (located in OHC plasma membrane) staining after saponin-mediated permeabilization and have never experienced antibody accessibility issues. Nevertheless, we will perform co-labeling for Prestin and include in the new submission.

      (3) Figure 7B.

      Although quantification of ATP8B1-HA intensity at the bundle appears similar between WT and Cib2 KO samples, the representative image suggests that some bundles lack detectable labeling. To better capture phenotype variability, it would be helpful to include an additional quantification showing the fraction or number of bundles with detectable ATP8B1-HA signal in Cib2 KO mice.

      We thank the reviewer for this suggestion. To better capture variability, we will include an additional quantification measuring the fraction of hair cell bundles with detectable ATP8B1-HA and TMEM30B-HA signal per field of view. This analysis will complement the existing intensity-based quantification.

      (4) Lines 346-349

      The manuscript suggests that IHCs lack stereocilia-enriched P4-ATPases. However, this conclusion is not directly supported by the presented data. The authors should either provide supporting localization or expression data for other P4-ATPases or soften the statement to indicate that no stereocilia-enriched P4-ATPases were detected under the conditions examined.

      We agree with the reviewer and will revise this statement to read: “No IHC stereocilia-enriched P4-ATPases were detected under the conditions examined.”

      Recommendations:

      (5) The authors convincingly demonstrate that TMEM30B loss results in ATP8B1 mislocalization. While not essential to the central conclusions, examining TMEM30B localization in ATP8B1 KO hair cells would clarify whether this interdependence is reciprocal, as described for other P4-ATPase-CDC50 complexes.

      We appreciate this insightful suggestion. However, performing this experiment would require generating a compound mouse line (crossing TMEM30B-HA into the ATP8B1 knockout background), which is not feasible within the revision timeframe. Additionally, the lack of a robust commercial antibody for TMEM30B further complicates this approach. We will note this as a future direction in the revised manuscript.

      (6) Lines 359-374.

      The discussion of Annexin V labeling is careful and balanced. This paragraph would benefit from referencing other studies that showed minimal Annexin V labeling in healthy P6 organ of Corti, reinforcing that robust PS externalization in the present study is pathological rather than developmental.

      We thank the reviewer for this suggestion and will incorporate relevant prior work, including George and Ricci (2026), which demonstrates minimal Annexin V labeling prior to P6, and further supports our interpretation.

      (7) Lines 392-399.

      The proposed feedback model linking MET activity and ATP8B1-TMEM30B localization is compelling. The discussion could be strengthened by noting that in TMC1/2 double knockout hair cells, PS externalization is not observed, consistent with the idea that flippase activity becomes critical specifically when scrambling occurs. The mislocalization observed in Cib2 KO hair cells further supports the coupling between TMC-mediated scrambling and flippase-mediated membrane restoration.

      We agree and will expand the discussion to include that TMC1/2 double knockout hair cells do not exhibit phosphatidylserine externalization, supporting the idea that flippase activity becomes critical in the context of scrambling.

      Reviewer #2 (Public review):

      Weaknesses:

      (1) Are the HA tags causing any functional issues? Function and localization of tagged proteins can sometimes be compromised. It would be good to know, for each knock-in model (TMEM30B, ATP8B1, ATP8A1, ATP8A2, and ATP11A), whether the HA-tagged protein is causing any issues with the mice and particularly with hearing (ABRs). Are these mice normal? Can they hear? These data are missing.

      We thank the reviewer for raising this important point. In this study, we will focus on TMEM30B-HA and ATP8B1-HA mouse lines, while additional HA-tagged flippase lines (ATP8A1, ATP8A2, ATP11A) are part of ongoing work to be reported separately.

      Both TMEM30B-HA and ATP8B1-HA mice are viable and exhibit normal breeding and aging. Preliminary (pilot) ABR measurements indicate wild-type–like hearing thresholds. We agree that this is important and will attempt to raise sufficient mouse numbers (in the time given) for a properly powered ABR analysis in the revised manuscript.

      (2) Following on the point above, is it possible that ATP8B1-HA is well localized, but localization for the other three flippases (ATP8A1-HA, ATP8A2-HA, and ATP11A-HA) is compromised by the tag? Is this potential mislocalization causing any functional phenotypes? (ABRs of point 1). I find it surprising that there are flippases only in outer hair cells and only formed by ATP8B1. A possible explanation is that the tag is interfering with trafficking. If so, there should be a phenotype (ABRs), although this might be masked by redundancy among these flippases or caused by systemic issues (admittedly difficult to sort out). Given that this manuscript will likely become foundational, and that there is evidence that at least two of the other flippases are involved in hearing loss, it would be good to provide more information about the mice and HA-tagged proteins in the other knock-ins (ATP8A1-HA, ATP8A2-HA, and ATP11A-HA). Depending on the data available for the knock-ins, the authors may want to discuss these scenarios and soften the statement indicating that inner-hair cells may lack flippase activity altogether.

      We appreciate this concern. To our knowledge, the HA tag does not appear to disrupt localization or function of the tagged proteins. However, we agree that this cannot be fully excluded. We will therefore soften our conclusions about IHC flippases and clarify that additional flippases (ATP8A1, ATP8A2, ATP11A) are under investigation and will be described in a separate study.

      (3) Expression of ATP8B1 at P0 (Figure 1D), when there should not be protein in outer hair cells yet seems high. Does this mean that other cells in the cochlea also express ATP8B1? Is this a concern?

      We thank the reviewer for this observation. We interpret the elevated signal at P0 as reflecting transcription preceding detectable protein expression. While expression in other cochlear cell types is possible, we have not observed detectable ATP8B1 localization outside hair cells using the HA-tagged model. We will clarify this point in the manuscript.

      (4) Fluorescence scales in Figure 6 B and D and Figure 7 B and D are very different. So are the values for WT. One would expect that the WT would be similar in all cases (at least within the same compartments), given that the methods section indicates that "All images were collected using identical acquisition parameters, including zoom and laser power, across genotypes". If WT shows such variability, how can we compare?

      We appreciate the need for clarification. Identical acquisition parameters were maintained within each experiment used for direct comparison (e.g., within a given panel). However, different panels (e.g., Figures 6B vs. 6D) were acquired on different days using different imaging settings.

      We will revise the Methods section to explicitly state this and clarify that comparisons are intended only within panels, not across experiments.

      1. The WHO MDA program administered in 2023 should have been described in the background, including its extent (e.g., how many persons received MDA; what was coverage).
      2. Looking at table 1, it seems that the surge occurred only from 2022-2023, and thus the costing for those two years can be averaged and compared to the pre-surge (2021) and post-surge/post-WHO MDA (2024)
      3. For Table 2-Pls give how many grams of the permethrin 5% ointment per tube and what dosage frequency they were administered (single dose or 2 weekly doses?). also how many ml was flucloxacillin syrup, how many mg was cetirizine tab and syrup; Pls indicate currency of unit cost per drug; Pls provide costs for IPC supplies
      4. What software was used to compute for costing?

      Sentences in lines 138 and 139 are duplicates Table 2 is not cited Table 1 - pls provide total consultations for each year, which served as denominator to get the proportional morbidity Vancouver style is not followed in some references Ref 2 - only 1st letter of 1st word should be capitalized) Ref#6 has unnecessary symbols (stars) after title Ref #14-16 have no date of citation Ref 4 and 8 should not have editor's names

    1. R0:

      Reviewer #1:

      Thank you for the opportunity to review this manuscript examining determinants of measles vaccine (MR1–MR2) dropout in an urban slum setting. The topic is highly relevant to measles elimination efforts, particularly in vulnerable urban populations where service continuity remains challenging. The study addresses an important operational question and employs an appropriate case–control design. However, several aspects of the manuscript require clarification and strengthening before it can be considered for publication.

      Introduction 1. The Introduction does not crisply say what is unknown about measles dropout in the Dhaka slum context. You need a single paragraph that: (a) identifies the knowledge gap, (b) explains why Korail/slum populations require focused study, and (c) states the study’s unique contribution. 2. Terminology and consistency. Use consistent vaccine labels (Penta1, MCV1, MR1, MR2). Early in the Introduction define what you mean by “drop-out” (the operational definition appears later in Methods — bring a short definition into the Introduction). 3. Some background facts belong in Methods/Results. E.g., the specific urban slum population size and annual vaccination target are Methods details and should not be in the Introduction. 4. At present the Introduction jumps between global stats, EPI history, WHO recommendations, and local data without clear transitions. Reorder so each paragraph follows logically (global → regional → national → evidence on determinants → gap in urban slums → study aim). 5. Missing justification of novelty. Say explicitly why this study adds new evidence (e.g., limited studies in Dhaka slums, few case–control analyses of MR1–MR2 drop-out in highly mobile urban slum populations, service-delivery factors not well quantified in Dhaka). This is essential for reviewers. 6. The intro cites Ethiopia, Somalia, Pakistan; add (or at least state you reviewed) South-Asia or Bangladesh-specific evidence on urban slums and drop-out. If those studies are sparse, say that explicitly — that is the gap.

      Methods Section 1. Please provide a precise definition of dropout, including the age/time cutoff for MR2 completion and whether delayed vaccination was considered acceptable. 2. It should be explicitly stated that both groups were drawn from the same source population using identical eligibility criteria to minimize selection bias. 3. Provide more detail on the sampling frame and recruitment process. Specify whether participants were identified through EPI registers, household listings, or community census data, and clarify whether random or convenience sampling was used. 4. Expand the description of the sample size calculation. Include the assumed effect size (odds ratio), exposure prevalence among controls, alpha level, statistical power, and the formula or software used. 5. Clearly define how key variables (e.g., maternal education, household income, waiting time, ANC/PNC visits) were categorized and justify the chosen cutoffs. 6. Clarify the source and validation of vaccination data. Indicate whether vaccination status was verified through vaccination cards, caregiver recall, or both, and explain how discrepancies were handled. 7. Provide more detail on bias control measures. Describe steps taken to minimize selection bias, recall bias, and information bias, including interviewer training and standardization procedures. 8. Strengthen the description of data management and quality control. Indicate whether data were double-entered, validated, and which statistical software (including version) was used. 9. Clarify ethical procedures. Provide the name of the approving ethics committee, approval number, and details on how informed consent was obtained and documented.

      Results Section 1. Strengthen the presentation of multivariable findings. Adjusted odds ratios (AORs), 95% confidence intervals, and p-values should be consistently reported in both tables and narrative text. Emphasize adjusted results over crude associations to avoid overinterpretation of unadjusted findings. 2. Clarify reference categories in tables. Tables presenting logistic regression results should clearly indicate the reference group for each categorical variable. This is essential for accurate interpretation of odds ratios. 3. Improve consistency between tables and text. Ensure that all key numerical results mentioned in the text match those in the tables (including decimal places). Avoid repeating entire tables in narrative form; instead, summarize the most important findings.

      Discussion and Conclusion 1. Strengthen the synthesis between findings and existing literature. While prior studies are cited, the discussion would benefit from more critical comparison. Clearly indicate whether your findings confirm, contradict, or extend previous evidence, particularly in South Asian or urban slum contexts. 2. Deepen interpretation beyond statistical significance. Move beyond reporting that associations were significant and elaborate on potential mechanisms (e.g., health system barriers, caregiver perceptions, structural inequities) that may explain the observed relationships. 3. Expand the discussion of public health and programmatic implications. The manuscript would benefit from clearer operational recommendations. For example, explain how EPI managers or urban health planners could translate these findings into targeted interventions. 4. Explicitly discuss potential biases (selection bias, recall bias, residual confounding), the inherent limitations of the case–control design, and issues of generalizability beyond Korail slum. 5. The discussion should more explicitly state what new knowledge this study adds compared to prior research—particularly regarding urban informal settlements and MR1–MR2 dropout.

      Reviewer #2:

      Beyond the reduced confirmed incidence of measles from 2019 (31.4) to 2022 (1.34 per million), at national level needs to include the measles case- based surveillance sensitivity indicators if they are met (Non measles febrile rash illness rate and % specimen collected from suspected cases ) Measles case-based surveillance indicators if achieved or not need to be described for the period of the study 15/10/2023 to 30/04/2024 for Dahka

      Reviewer #3:

      In this manuscript, Alam et al. report a study investigating factors associated with measles vaccination drop-out in children aged 12-23 months in urban areas in Bangladesh. It is quite interesting that the author found that maternal occupation, birth order, and waiting time for vaccination were among factors that were significantly associated with vaccination drop-out. Overall, the manuscript is modestly presented. My comments are summarized below. 1. What are MCV1, MCV2? 2. What is included in a pentavalent dose? 3. The statement in lines 53-54 needs to be supported by a reference. Particularly, to better protect children against measles, a herd immunity level of 95% is needed. And if the statement remains correct, what is the problem when the drop-out rate in Bangladesh is 3.4-5.5%? 4. The differences in vaccine coverage between urban and rural areas (lines 61-62) are significant? And is it due to vaccination drop-out? 5. Paragraphs in lines 53-60 and 63-70 are duplicated. 6. I think the introduction needs to be revised. It remains unclear why this study is conducted. 7. If possible, please make the questionnaire available for review and describe how the questionnaire is evaluated. 8. Some language errors in Table 4. 9. The conclusion of the manuscript introduces new information (measles elimination in Bangladesh), but it seems likely irrelevant.

    1. R0:

      Reviewer #1: Manuscript Number: PGPH-D-25-02685 Review Report Male Allyship Overall summary of the review Strength This manuscript addresses the important topic of male allyship in advancing women’s leadership in global health academia, using a well-structured qualitative approach and presenting findings across individual, institutional, and societal levels. Strengths include clear objectives, rigorous methods, and practical insights for policy and leadership. Areas that need improvement include: • Clarify the study type in the title and make it action-oriented. • Expand geographic representation beyond the U.S. and Canada. • Provide context for participant quotes. • Report data saturation and response rates. • Improve readability by breaking up long sentences and structuring results around clear subthemes. • acknowledge limitation for geographic representation beyond the U.S. and Canada Point by point feedback Title: - Male Allyship to Advance Women's Global Health Leadership in the Academy Strength • It addresses a timely and high-Impact Topic: The study area is forgotten by global community especially in academia on the importance of gender equity and leadership. • The title accurately reflects the manuscript’s core theme male allyship in academic global health leadership. • Timely and High-Impact Topic: The study addresses a significant gap in global health leadership literature—moving beyond simply identifying barriers to focusing on actionable solutions (allyship and sponsorship). This shift in focus is highly valuable for policymakers and institutional leaders Weakness • The short and long title of the manuscript is the same; no difference • The title talks about two things one “Global Health Leadership” and second “the academic institution in North America”. the focus of study population is not clear • The kind of the study type is no clearly indicated in the title. • The phrase “in the Academy” could be slightly ambiguous to international audiences—consider specifying “in academic global health institutions.” Suggested Revision • It is better if the title is an action oriented Like “Exploring Male Allyship to Advance Women’s Leadership in Global Health Academia: A Qualitative Study” or Male Allyship to Advance Women’s Leadership in Global Health Academia: A Qualitative Study Abstract Strengths: Introduction • Well-structured abstract with clear objectives, methods, and findings. • Strong justification for the study, citing global gender disproportionate disparities and responsibilities gaps in global health leadership. • Appropriate Methodology: It used a qualitative, semi structured interview approach to explore the perception, experiences and perceptions of high level leaders. Clear thematic analysis • It use clear research question on the experience of global leaders • Findings: Clearly structured around three levels—individual male ally, institutional, and societal level which offers a useful conceptual framework to shift cultural norms on gender roles for practical implications Area of improvement: • Introduction looks like an advocacy; no clearer separation between background and rationale. • • Participants were drawn only from the U.S. and Canada, which limits the geographic coverage of the study. In addition, selecting participants exclusively from the WomenLift Health network may reduce the diversity of perspectives and potentially affect the credibility of the findings. • Conceptual clarity: The term male allyship could be briefly defined in one sentence for clarity. • The conclusion partially repeats ideas from the introduction it looks like the summary of the introduction and better if based on the findings and highlighting the practical or policy relevance to enhance the impact Main Manuscript • The study team approached participants via email and invited them to participate and conduct the interview via zoom; so why the study participants restricted to two high-income countries? Result • Line 178-183: The results section reports the number of participants but does not indicate whether data saturation was achieved. In qualitative research, explaining when and how saturation was reached strengthens the credibility and adequacy of the sample size. • Reporting the response rate as a percentage would help readers interpret the level of participation more easily. • Line 185-191: The paragraph communicates the general findings, better if key themes are mentioned to give readers a concrete examples • Line 202: The quote is powerful, but providing brief context about the respondent role, experience, or institution better if included. • Line Motivation and incentives it is more plausible to read if concrete example of participant quotes are included • Line 252 to 278: the section contains valuable insights, but some sentences are long and dense. Breaking them into shorter sentences will improve readability. Consider structuring the section around clear subthemes: like awareness, role modeling etc., • Line 332-341 quote -P7, F, 40-44 is clear and conventional; however it’s a bit long and better if lightly edit for readability while retaining authenticity Discussion Strength • It is strong in structure, logic, and scholarly tone. It clearly links findings to prior research and offers practical and theoretical insights • Line 633 Toking et al reference number (54). Line 638 Sinha et al (59) need consistency Limitation Strength: Clearly states the population studied & acknowledges selection of diverse perspectives Area of Improvement: Findings may not generalize to early-career leaders, non-academic settings, or global contexts.

      Reviewer #2: This paper examines an important topic: male allyship in global health. A key strength of the paper is its focus on solutions as reported by participants. This helps move the focus away from challenges women in global health face, to exploring actual solutions to the problem. The paper presents many examples of what male allyship looks like, many of these being very actionable. I commend the authors for this. That said, there are several areas that need attention.

      1. The findings could be synthesized and made more concise. The authors present A LOT of information, which is overwhelming. There are many ways to address this. First, the authors can use much shorter direct excerpts from participants. Second, the authors could move participant excerpts into a table. Third, the authors could synthesize their findings much more which would reduce the number of themes/issues and tell the story a bit differently. The paper would read better if they structured the results based on the subheadings used in Fig 1. It would help tell a more succinct story.

      2. It is not clear what Fig 2 is showing.

      3. I wonder if to would help to present the findings by gender, that is, by showing similarities and differences between female and male participants.

      4. Could the authors present more details on the study participants. Sone of the content in the excerpts suggests that they were from academic institutions. A demographics table would help, in terms of academic vs non-academic institutions etc. The authors should also provide more details on the sampling and recruitment method: how exactly did they find these 21 participants?

      5. The Discussion could be strengthened by a more critical analysis of the concept of male allyship. The authors should consider what their findings/study implications are for the potential and pitfalls of the concept.

    1. Reviewer #1 (Public review):

      Summary:

      This paper examines plasticity in early cortical (V1-V3) areas in an impressively large number of rod monochromats (individuals with achromatopia). The paper examines three things:

      (1) Cortical thickness. It is now well established that early complete blindness leads to increases in cortical thickness. This paper shows increased thickness confined to the foveal projection zone within achromats. This paper replicates work by Molz (2022) and Lowndes (2021), but the detailed mapping of cortical thickness as a function of eccentricity and the inclusion of higher retinotopic areas is particularly elegant.

      (2) Failure to show largescale reorganization of early visual areas using retinotopic mapping. This is a replication of a very recent study of Molz et al. but I believe, given anatomical variability, the larger n in this study, and how susceptible pRF findings are to small changes in procedure, this replication is also of interest.

      (3) Connective field modelling, examining the connections between V3-V1. The paper finds changes in the pattern of connections, and smaller connective fields in individuals with achromatopsia than normally sighted controls, and suggests that these reflect compensatory plasticity, with V3 compensating for the lower resolution V1 signal in individuals with achromatopsia.

      This is a carefully done study (both in terms of data collection and analysis) that is an impressive amount of work.

      *Effects of eye-movements

      The authors have carried out the eye-movement analyses I asked of them. Unfortunately, in 4 individuals they couldn't calibrate the eyetracker (it's impressive they managed in 10). I think this means that 4 of 13 (since a different participant was excluded from head motion) individuals weren't included in correlation analyses. Limiting the correlation analysis to individuals with better fixation has obvious issues. I'd recommend redoing (or additionally including) stats using non-parametric measures while classifying these 4 as having fixation instability of 3 (i.e. greater instability than the participant with the worst fixation who was successfully calibrated).

      *Interpreting pRFs

      The paper would be strengthened by a little more explicit clarity about what pRFs represent and how that affects their interpretation of their findings as plasticity vs. non-plasticity (I know the authors are aware of this, but I think it would be helpful for readers who are less experienced in pRFs). In the introduction it would be helpful to point out that pRFs represent the collective response of a large population of neurons, and as a result pRF estimates can vary depending on which population of neurons that stimulus drives.

      For example, imagine for the sake of argument that rods only project to V1 neurons with larger receptive fields. If one measured pRFs in a control observer under phototopic vs. scotopic conditions one would see smaller pRFs in the photopic conditions. This wouldn't represent 'plasticity' - it would represent the fact that the firing neurons contributing to the pRF signal are a slightly different population because of a change in the stimulus content. This is of course exactly what you see in 2C. And indeed, the authors make this identical point ". In the non-selective condition, the smaller pRFs in controls are in line with the higher spatial resolution of the<br /> cone system, which is not active in the achromat group." But this point would be clearer if more of the conceptual underpinnings were made explicit in the introduction (or at this point in the paper).

      Shifts in which population of neurons drive your pRFs can explain main of the more puzzling results in the paper without detracting from your main conclusions. For example, in 2D, I don't think it's differences in S/N driving your results (pRFs are at least theoretically meant to be robust to S/N). If smaller RFs 'drop out' under low luminance and these smaller RFs also tend to be more central, then one would expect the control results of 1D. And I think a similar argument might even be made for the smaller difference in the rod monochromats.

      It would be possible to make the point of Figure 4B more simply if Figure 4B was replaced by additional Panels in Figure 2 simply showing V3 pRF sizes/eccentricity distributions. That would make the point that you don't see the same expansion in pRF sizes in V3 in a way that is just as clear, and is closer to the data.

      *Interpreting cRFs

      Similarly, I think the paper would be improved with more clarity about the underlying signal in CF modeling. Once again, I appreciate that the authors are familiar with this, but it will help the reader in interpretation. (And I do believe thinking carefully about this may alter your interpretations). CF receptive fields 'find' the region in V1 that best predict the V3 signal in a given voxel. In resting state this likely represents a combination of:

      (1) visually driven signal - correlations that may or may not reflect connectivity but represent the fact that regions that represent the same region of visual space will be active at the same time.

      (2) global bilaterally symmetrical signal consisting of enhanced correlations between iso-eccentric regions (Raemaekers et al., 2014), which may arise from vasculature that symmetrically stems from the posterior cerebral artery (Tong et al., 2013; Tong and Frederick, 2014).

      (3) intrinsic neural fluctuations that are more strongly correlated between connected neurons. These are likely quite weak compared to the other contributions.

      I think if you ignore 2, (which is not likely to differ between rod mono and controls) and model 1 and 3, you might well see shifts in CFs towards the boundary of the scotoma - essentially the CF's location will be biased towards the region of V1 that has stronger correlations - which = the region which has a visual signal.

      I do find convincing the argument that you don't see the same shift in controls in the rod-selective condition. So I think the results of 4A are fine. But a little more clarity about 'what's under the hood' in CF modeling would be nice.

      *Interpreting the relationship between pRFs and cRFs

      So there's something here that confuses me. We are all agreed that V3 pRF sizes are similar across RM and control. V1 pRFs are larger in RM. It feels intuitive that smaller CFs would compensate but I can't make it make sense to myself when I think it through. Each pRF represents a combination of receptive field location scatter and bandwidth. You want to argue that eccentricity mapping looks pretty normal, so there's no reason to think increased rf scatter, and I can believe that (though I do think this assumption should be discussed explictly).

      So far I think we agree.

      But let's think about what drives a CF during visual stimulation ... Specifically lets think about 'the pRF of the CF' (the region of visual space represented by the cluster of voxels in the CF). If pRFs for individual voxels in V1 are big, then the pRF for the CF is also going to be large. But we know that pRFs for V3 are normal size. So, the V3 CF will 'find' a smaller number of voxels in V1, in order to try to find the 'correct sized' CF pRF. Note that this explanation is very similar to yours. But doesn't require ANY 'intrinsic' connectivity. It's really just assuming the whole thing is driven by the visual signal and the CF size is determined by the ratio of the pRF sizes in V3 vs. V1.

      One possible solution would be to regress out the visual stimulus and redo this analysis based on the residuals.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      This paper examines plasticity in early cortical (V1-V3) areas in an impressively large number of rod monochromats (individuals with achromatopia). The paper examines three things:

      (1) Cortical thickness. It is now well established that early complete blindness leads to increases in cortical thickness. This paper shows increased thickness confined to the foveal projection zone within achromats. This paper replicates the work by Molz (2022) and Lowndes (2021), but the detailed mapping of cortical thickness as a function of eccentricity and the inclusion of higher visual areas is particularly elegant.

      (2) Failure to show largescale reorganization of early visual areas using retinotopic mapping. This is a replication of a very recent study by Molz et al. but I believe, given anatomical variability (and the very large n in this study) and how susceptible pRF findings are to small changes in procedure, this replication is also of interest.

      (3) Connective field modelling, examining the connections between V3-V1. The paper finds changes in the pattern of connections, and smaller connective fields in individuals with achromatopsia than normally sighted controls, and suggests that these reflect compensatory plasticity, with V3 compensating for the lower resolution V1 signal in individuals with achromatopsia.

      Strengths:

      This is a carefully done study (both in terms of data collection and analysis) that is an impressive amount of work. I have a number of methodological comments but I hope they will be considered as constructive engagement - this work is highly technical with a large number of factors to consider.

      Weaknesses:

      (1) Effects of eye-movements

      I have some concerns with how the effects of eye-movements are being examined. There are two main reasons the authors give for excluding eye-movements as a factor in their results. Both explanations have limitations.

      (a) The first is that R2 values are similar across groups in the foveal confluence. This is fine as far as it goes, but R2 values are going to be low in that region. So this shows that eyemovements don't affect coverage (the number of voxels that generate a reliable pRF), but doesn't show that eye-movements aren't impacting their other measures.

      We agree with the reviewer that eye movements could affect pRF measures. We have now also included data for all participants where we were able to obtain eye tracking measures and directly tested this relationship. Relevant results are copied below.

      Recap of results: 1) as expected gaze was less stable in achromats than controls, 2) achromats with more stable gaze did not show more activation in the scotoma projections zone, which we might have observed if fixation instability masks signals in this region 3) Gaze instability was not correlated with pRF size and eccentricity across V1 in achromats. We note that the relationship between nystagmus and visual sampling is complex - patients experience a stable image and may sample only during a specific phase of the eye movement. It is therefore not inherently clear if and how nystagmus affects pRF size.

      Relevant Manuscript text incorporating these analyses is copied below.

      To quantify eye movement, we used the following methods added to the manuscript:

      “Fixation stability

      Participants’ gaze was tracked throughout all pRF mapping runs. Collecting reliable gaze data from individuals with nystagmus is a challenge because out of the box calibration procedures mostly fail without stable fixation. To account for this, we implemented a post-hoc custom calibration procedure (Tailor et al., 2021). The eye-tracker was first precalibrated on a typically sighted individual. Then, before every other run, we collected gaze data from a 5-point fixation task (at fixation and above, below, left, and right of fixation at 5 eccentricity). This data allowed us to subsequently map the patient's recorded gaze coordinates to their precise locations on the screen. In 10 out of the 14 achromats we acquired reliable enough data to assess fixation stability.

      Calibration data processing: We first removed the first 0.5 seconds for each fixation location to allow for fixation to arrive on the target. We then performed (a) blink removal, (b) filtered out time points with eye movement velocity outliers (±2SD), and (c) filtered out any positions >3SDs to the left or right of the mean fixation location, and >1SD above or below. We took the median of the remaining gaze measurements as an approximate fixation estimate. The resulting 5 median fixation locations were used to fit an affine transformation that remapped the recorded gaze positions into screen space. 

      Quantifying fixation stability: after applying the transformation of the post-hoc calibration, data was filtered for blinks and extreme velocities (<2SD). For each functional run, fixation instability was measured as the standard deviation of gaze x-positions across 1second windows. Measures were then averaged across the two run repeats.”

      We report the resulting new fixation data results as follows:

      Results (coverage section):

      “Another potential confound in our findings is fixation instability. In pRF mapping, which is usually conducted under photopic (cone-dominant) conditions, unstable fixation can cause a signal drop in the foveal projection zone. As expected due to nystagmus, the achromatopsia group showed higher fixation instability compared to controls (rodselective: t<sub>(9.08)</sub>=-3.19, p=0.01; non-selective: t<sub<(9.41)</sub>=-4.88, p<0.001 degrees-offreedom corrected for unequal-variance; see Supplement Figure S2a). However, several lines of evidence suggest this instability cannot fully account for the lack of "filling in" in achromats. First, within the achromat group, we found no correlation between fixation stability and coverage (rod-selective: spearman-r<sub>(8)</sub> = -0.36, p=0.31; non-selective spearman-r<sub>(8)</sub>=0.07,p=0.85); Individuals with more stable, control-like fixation did not show more signal inside the scotoma (see Supplement 2). Second, in adults with achromatopsia, typically with less severe nystagmus (Kohl et al., 1993), two recent studies also found absence of filling in (Anderson et al., 2024; Molz et al., 2023).

      So, while we cannot fully exclude nystagmus masking foveal signals in the cortex of some patients, this converging evidence from structural and functional MRI measures across different studies and groups, strongly suggests that the deprived cortex does not substantially ‘fill in’ with peripheral rod inputs in achromatopsia.”

      Results (pRF size + eccentricity):

      “Larger pRFs indicate that neuronal populations in achromats’ V1 cortex, combine information across larger areas in visual space than in typically sighted controls. This could reflect true neural tuning differences as well as be driven by larger eye movement. However, fixation instability in achromats do not significantly correlate with pRF size in our sample (rod-selective: spearman-r<sub>(8)</sub> = -0.41, p=0.24; non-selective spearman-r<sub>(8)</sub>=0.37,p=0.29)

      It has been shown that fitting artefacts around scotoma edges, can give rise to similar outward eccentricity shifts (Binda et al., 2013). However, when accounting for fitting artefacts around the foveal scotoma edge by modelling the rod-free zone during pRF fitting, pRF size and eccentricity differences remain unchanged (see Supplement 3). Finally, we found no significant correlations between gaze stability and the eccentricity shift (rod-selective: spearman-r<sub>(8)</sub> = 0.58, p=0.08; non-selective spearman-r<sub>(8)</sub>=0.09,p=0.8, Supplement 4D)

      Together, these analyses reveal subtle differences in how V1 of achromats responds to rod signals outside the foveal zone, which are consistent with results from other studies (Molz et al. 2023, Anderson et al. 2024). While we found no direct evidence that these are being driven by confounding factors such as eye-movements or fitting artefacts, more work is needed to understand the underlying processes that give rise to these shifts.”

      The following text has been added to Supplement 2

      “As expected, achromats showed significant higher fixation instability compared to controls (as reported in the main text). We found no significant correlation between fixation instability and either coverage, pRF size, eccentricity in achromats. Results of Spearman R correlations in both rod- and non-selective conditions are reported in the figure. We note that the relationship between nystagmus and visual sampling is complex- patients experience a stable image and may sample only during specific eyemovement phases. It is therefore not fully clear if and how nystagmus should give rise to altered pRFs.”

      (b) The authors don't see a clear relationship between coverage and fixation stability. This seems to rest on a few ad hoc examples. (What happens if one plots mean fixation deviation vs. coverage (and sets the individuals who could not be calibrated as the highest value of calibrated fixation deviation. Does a relationship then emerge?).

      In any case, I wouldn't expect coverage to be particularly susceptible to eye-movements. If a voxel in the cortex entirely projects to the scotoma then it should be robustly silent. The effects of eye-movements will be to distort the size and eccentricity estimates of voxels that are not entirely silent.

      There are many places in the paper where eye-movements might be playing an important role. 

      Examples include the larger pRF sizes observed in achromats. Are those related to fixation instability?

      We thank the reviewer for their comment. As detailed in our previous response, we have now extracted fixation instability data from additional patients and have expanded our discussion of its potential effects throughout the manuscript.

      Given that fixation instability is expected to increase pRF size by a fixed amount, that would explain why ratios are close to 1 in V3 (Figure 4).

      We agree with the reviewer’s point, that the ratio change on its own is not strong evidence of compensation, this analysis was meant to complement the CF result. The plot in Figure 4 is intended to reconcile the connective field (CF) and pRF results. Its purpose is to illustrate that even though larger pRFs in achromats might seem counterintuitive alongside their smaller V3 CF sizes, the pRF data do not contradict the CF findings but they are in fact consistent with one another. We also agree that there are alternative explanations for the differences in pRF size, such as fixation stability, and we have now added this point to the text.

      Results (CF size):

      “To understand how this finer cortical sampling in V3 (smaller connective fields) impacts visual processing, we consider its effect on population receptive fields (pRFs). In V1, pRF sizes in achromats were significantly larger than in controls for both stimulus conditions, indicating coarser spatial tuning at the cortical input stage (Figure 4C, left). By selectively sampling from a smaller area of the V1 surface (smaller CFs), V3 can effectively compensate for this coarser input. If so, this process should result in a relative normalisation of pRF size in V3 compared to V1 (Figure 4C, right).

      To test this prediction, we plotted the ratio of pRF sizes between achromats and controls, where a value of 1 indicates parity between the groups (Figure 4B). As our compensatory connective field hypothesis predicts, the ratio was closer to 1 in V3 than in V1 across both stimulus conditions, confirming the pRF size difference was significantly reduced at the higher cortical stage. Together this shows converging evidence across the two models (pRF and CF) of hierarchical refinement as a possible compensatory mechanism, where V3's altered connectivity helps to normalize the processing of degraded sensory input from V1.”

      Discussion:

      “The hierarchical reorganisation observed in V3 is unlikely to be driven by fixation instability. Connective field (CF) estimates are robust to eye movements (Tangtartharakul et al., 2023), because they are anchored to V1 inputs rather than absolute screen position. Considered alone, the pRF results could alternatively be explained by eye movements introducing a fixed size offset that affects smaller V1 pRFs more strongly than those in V3. While we found no evidence for this relationship between pRF size and gaze measures in our patients, we cannot fully rule out the possibility. Nevertheless, the internal consistency between the CF and pRF measures provides a more parsimonious account; that sampling across the hierarchy accounts for coarser tuning at the input stage.”

      (2) Topography

      The claim of no change in topography is a little confusing given that you do see a change in eccentricity mapping in achromats. 

      Either this result is real, in which case there *is* a change in topography, albeit subtle, or it's an artifact. 

      Perhaps these results need a little bit of additional scrutiny. 

      One reason for concern is that you see different functions relating eccentricity to V1 segments depending on the stimulus. That almost certainly reflects biases in the modelling, not reorganization - the curves of Figure 2D are exactly what Binda et al. predict. 

      Another reason for concern is that I'm very surprised that you see so little effect of including/not including the scotoma - the differences seem more like what I'd expect from simply repeating the same code twice. (The quickest sanity check is just to increase the size of the estimated scotoma to be even bigger?).

      We thank the reviewer for their comment. We have double-checked our scotoma modelling, confirming its correct implementation. The results of the scotoma modelling are not identical to the full one, just similar (see below).

      Previous studies on “artificial scotomas” (such as the one reported by Binda et al.) have shown mixed results. While Binda and colleagues found that modelling artificial scotomas normalised pRF shifts, others found no effect (Haak et al. 2012, Prabhakaran et al. 2020). Notably, the rodfree zone in achromatopsia is considerably smaller (~0.5° radius) than most tested artificial scotomas. Moreover, it is unclear whether scotoma modelling is beneficial in clinical populations as artificial scotomas (screen-based masking) are not equivalent to retinal scotomas from inactive photoreceptors. A recent achromatopsia study (Anderson et al. 2024) also found no change in pRF estimates with scotoma modelling.

      In our scotoma analyses, we found meaningful differences only in the non-selective condition in controls where cones in the rod-free zone are stimulated - which would be the main expected effect of this modelling exercise (see below). In all other conditions (rod-selective in controls, both conditions in achromats), only rods are stimulated, we found no difference in coverage, eccentricity or pRF size when modelling the scotoma likely because the foveal signal is weak/absent, and did not contribute much to pRF estimates in the unmasked analyses.

      This means we cannot account for the eccentricity shift as an edge effect with this scotoma model – but we remain cautious about interpreting it as real. This is because first, as we mention in the paper, in the non-selective condition, which has a higher signal-to-noise ratio, the eccentricity estimates in achromats match those of the control group's rod system. Second, it is still possible that the observed shift is an artefact of modelling that was not accounted for by the approach of scotoma modelling.

      Our claim of "no change in topography" specifically referred to the absence of "filling-in" as measured by cortical coverage - the percentage of activated tissue regardless of fitted parameters. However, to avoid confusing given the eccentricity and pRF size results we now rephrased our claim.

      Abstract:

      “Cortical input stages (V1) exhibited high stability, with input-deprived cortex showing no retinotopic remapping and exhibiting structural hallmarks of deprivation.”

      Results (pRF eccentricity):

      “It has been shown that fitting artefacts around scotoma edges, can give rise to similar outward eccentricity shifts (Binda et al., 2013). However, when accounting for fitting artefacts around the foveal scotoma edge by modelling the rod-free zone during pRF fitting, pRF size and eccentricity differences remain unchanged (see Supplement 3). Finally, we found no significant correlations between gaze stability and the eccentricity shift (rod-selective: spearman-r<sub>(8)</sub> = 0.58, p=0.08; non-selective spearman-r<sub>(8)</sub>=0.09,p=0.8, Supplement 4D)

      Together, these analyses reveal subtle differences in how V1 of achromats responds to rod signals outside the foveal zone, which are consistent with results from other studies (Molz et al. 2023, Anderson et al. 2024). While we found no direct evidence that these are being driven by confounding factors such as eye movements or fitting artefacts, more work is needed to understand the underlying processes that give rise to these shifts.”

      To better illustrate the effect of scotoma modelling text has been added to Supplement 3:

      “Studies on artificial scotomas, where part of the visual field is masked, suggest that pRF estimates of eccentricity and size can be biased by fitting scotoma-edge artefacts, and that these can be mitigated by modelling the scotoma in the pRF fitting procedure (e.g., Binda et al. 2013).

      We therefore repeated the pRF modelling procedure with the rod-scotoma being modelled as a black oval mask (1.25°x0.9°) over the stimulus aperture model. As expected, a visible difference between the two models is only apparent in the nonselective condition in controls where the cones in the rod-free zone are being stimulated. In all the other conditions (rod-selective in controls, and both stimulation conditions in achromats) only the rods are stimulated, therefore the masked stimulus still matches the retinal activation, and no major differences can be observed. Performing the same statistical tests applied to the full model in the main text yields equivalent results of equivalent coverage in the rod-selective condition, with equivalent coverage across groups(t(47) = 0.78, p=0.43, BF10=0.31) and controls show a higher coverage in the non-selective stimulation condition compared to achromats (Mann U(52)=141, p<0.01; unequal variance, reverted to non-parametric).

      This consistency in pRF properties when modelling the rod scotoma, is in line with previous results from scotoma modelling; While Binda and colleagues found that this normalised pRF shifts, others found no effect (Haak et al. 2012, Prabhakaran et al. 2020). Notably, the rod-free zone in achromatopsia is considerably smaller (~0.5° radius) than most tested artificial scotomas, and as artificial scotomas (screen-based masking) are not equivalent to retinal scotomas from inactive photoreceptors, it is unclear how artificial scotoma findings generalise to clinical populations. Our results are in line with a recent achromatopsia study (Anderson et al. 2024) which also found no change in pRF estimates with scotoma modelling.”

      I'd also look at voxels that pass an R2>0.2 threshold for both the non-selective and selective stimulus. Are the pRF sizes the same for both stimuli? Are the eccentricity estimates? If not, that's another clear warning sign.

      Comparable results were obtained when using higher R2 thresholds. These results are now included in Supplement 6.

      (3) Connective field modelling

      Let's imagine a voxel on the edge of the scotoma. It will tend to have a connective field that borders the scotoma, and will be reduced in size (since it will likely exclude the cortical region of V1 that is solely driven by resting state activity). This predicts your rod monochromat data. The interesting question is why this doesn't happen for controls. One possibility is that there is topdown 'predictive' activity that smooths out the border of the scotoma (there's some hint of that in the data), e.g., Masuda and Wandell.

      One thing that concerns me is that the smaller connective fields don't make sense intuitively. When there is a visual stimulus, connective fields are predominantly driven by the visual signal. In achromats, there is a large swath of cortex (between 1-2.5 degrees) which shows relatively flat tuning as regards eccentricity. The curves for controls are much steeper, See Figure 2b. This predicts that visually driven connective fields should be larger for achromats. So, what's going on?

      The reviewer raises interesting points about the interpretation of our connective field results. The possibility of differential top-down modulation between controls and achromats is intriguing, however it is not supported by the data, if top-down modulation is activating foveal V1 in controls then we shouldn’t see a drop in the amount of significant vertices sampling from the fovea in the rod-selective condition compared to the non-selective, but in fact we do see quite a large drop in the amount of significant vertices in that area in the rod-selective condition. Therefore, at the moment we do not think there is strong basis to assume our data could be explained by achromats lacking top-down predictive activity in the scotoma area that is present in controls.

      Regarding the concern about smaller CFs seeming counterintuitive given the flat eccentricity tuning in achromats' V1: we believe there is not a straightforward prediction from pRF properties to CF sizes. The relationship between V1 pRF characteristics and V3 CF sampling is complex and not well-established in the literature, and the two can be decoupled to some degree. For instance, in our data, controls show flat V1 pRF sizes in the rod-selective condition (similar to achromats), yet their V3 CF sizes maintain the typical eccentricity-dependent increase seen in the non-selective condition. This suggests that CF size patterns don't simply mirror V1 pRF properties or visual stimuli responses.

      Importantly, CF modelling fundamentally differs from pRF analysis in how it might be affected by scotomas. Unlike pRF analysis where a scotoma creates a "silent" region in visual space, in CF modelling the deprived cortex remains physically present and continues generating neural signals (albeit not visually-driven ones). If V3-V1 connectivity were anatomically fixed, V3 would continue sampling from deprived V1 regions even if they do not produce visual-driven signals. A change in this sampling pattern, as we see in our data, is therefore evidence for plasticity.

      Our data support this interpretation. First, in achromats, the CF size pattern observed cannot be easily explained by scotoma-edge artefacts. V3 vertices sampling from the immediate vicinity of the scotoma (1°-3°) show CF sizes comparable to controls. The effect is only significant further away from the scotoma (4°-6°).

      Second, to assess how the presence of a scotoma affects CF measure we can compare the two conditions in the controls, since the rod-selective condition has a scotoma present and the nonselective condition does not. For this purpose, we performed an additional analysis, quantifying on a vertex-by-vertex level the differences in CF fitted parameters between the two stimulation conditions across V1. See results below. In achromats there are no systematic shifts between the stimulation conditions, as expected as both are rod-driven. In controls, this analysis reveals only subtle shifts (~0.45° in the rod-selective condition). CF size has also changed slightly although not significantly different from that observed in achromats. These shifts are much smaller than the CF size and eccentricity differences between controls and achromats, so we consider it unlikely that our findings are driven by scotoma artefacts.

      Author response image 1.

      Results (CF size):

      “The significant CF size differences are unlikely to be a model-fitting bias around a scotoma edge, as V3 vertices sampling from the immediate vicinity of the scotoma (1°3°) show CF sizes comparable to controls. The significant reduction in CF size occurs only further in the periphery (4°-6°), in regions that are primarily stimulus-driven.

      To understand how this finer cortical sampling in V3 (smaller connective fields) impacts visual processing, we consider its effect on population receptive fields (pRFs). In V1, pRF sizes in achromats were significantly larger than in controls for both stimulus conditions, indicating coarser spatial tuning at the cortical input stage (Figure 4C, left). By selectively sampling from a smaller area of the V1 surface (smaller CFs), V3 can effectively compensate for this coarser input. If so, this process should result in a relative normalisation of pRF size in V3 compared to V1 (Figure 4C, right).

      To test this prediction, we plotted the ratio of pRF sizes between achromats and controls, where a value of 1 indicates parity between the groups (Figure 4B). As our compensatory connective field hypothesis predicts, the ratio was closer to 1 in V3 than in V1 across both stimulus conditions, confirming the pRF size difference was significantly reduced at the higher cortical stage. Together this shows converging evidence across the two models (pRF and CF) of hierarchical refinement as a possible compensatory mechanism, where V3's altered connectivity helps to normalize the processing of degraded sensory input from V1.”

      Discussion (added paragraph):

      “The hierarchical reorganisation observed in V3 is unlikely to be driven by fixation instability. Connective field (CF) estimates are robust to eye movements (Tangtartharakul et al., 2023), because they are anchored to V1 inputs rather than absolute screen position. Considered alone, the pRF results could alternatively be explained by eye movements introducing a fixed size offset that affects smaller V1 pRFs more strongly than those in V3. While we found no evidence for this relationship between pRF size and gaze measures in our patients, we cannot fully rule out the possibility. Nevertheless, the internal consistency between the CF and pRF measures provides a more parsimonious account; that sampling across the hierarchy accounts for coarser tuning at the input stage.”

      The beta parameter is not described (and I believe it can alter connective field sizes).

      In Author response image 2, we plot the beta parameter of the pRF modelling in V1 with no R<sup>2</sup> filtering, error bars are 95% CIs:

      Author response image 2.

      The reviewer did not specify how beta might alter connective field sizes. We assume he meant that as in pRF mapping, the slope of activity from deprived to non-deprived cortex will artefactually create a CF model fit with smaller CF sizes. To test this, we calculated the slope of beta values between 0° and 3° in each participant in the rod-selective condition, as this range includes the scotoma and the area at the edge of the scotoma. We then used the slope as a covariate in an ANCOVA when comparing the CF sizes across groups in each sampled V1 segment. Accounting for the beta slope of V1 did not change the reported results. This analysis still shows smaller CF sizes in V3 in the rod-selective conditions between 4°-6° eccentricity – these differences remain significant (p<0.001 for 4°-5° and p<0.05 for 5°-6° when comparing achromats vs controls).

      Similarly, it's possible to get very small connective fields, but there wasn't a minimum size described in the thresholding.

      CF sizes were fit with a grid fit. Possible values were [0.5,1,2,3,4,5,7,10]. Therefore, the minimum size is 0.5. Filtering out the smallest connective field sizes does not change the results:

      Author response image 3.

      I might be missing something obvious, but I'm just deeply confused as to how the visual maps and the connectome maps can provide contradictory results given that the connectome maps are predominantly determined by the visual signal. Some intuition would be helpful.

      We agree that this appears counterintuitive, and now added further clarification. The two models (pRF and CF) fundamentally differ in what they measure and how they relate to visual processing. V1 pRF sizes reflect the relationship between neural activity and visual stimuli - essentially how much of a visual stimulus drives a voxel's response - while V3 CF sizes reflect how V3 samples from the V1 cortical surface, indicating how many V1 voxels contribute to a V3 voxel's activity.

      The measures constrain each other, as a V3 voxel's pRF size is expected to match the pooling of its connected V1 inputs. But they can be decoupled: A V3 voxel could sample from a small area of V1 cortex (a small CF in mm) that happens to represent a large area of visual space if those V1 voxels have large pRFs. The aim of Figure 4B is to clarify that the measures are consistent with one another even though they diverge in direction. In achromats, where V1 voxels have larger pRFs (coarser spatial resolution), V3 appears to compensate by sampling more selectively from V1 via smaller CF sizes. Theoretically, this should reduce the pRF size difference between controls and patients in V3, a prediction that our data supports.

      Results (CF size):

      “To understand how this finer cortical sampling in V3 (smaller connective fields) impacts visual processing, we consider its effect on population receptive fields (pRFs). In V1, pRF sizes in achromats were significantly larger than in controls for both stimulus conditions, indicating coarser spatial tuning at the cortical input stage (Figure 4C, left). By selectively sampling from a smaller area of the V1 surface (smaller CFs), V3 can effectively compensate for this coarser input. If so, this process should result in a relative normalisation of pRF size in V3 compared to V1 (Figure 4C, right).

      To test this prediction, we plotted the ratio of pRF sizes between achromats and controls, where a value of 1 indicates parity between the groups (Figure 4B). As our compensatory connective field hypothesis predicts, the ratio was closer to 1 in V3 than in V1 across both stimulus conditions, confirming the pRF size difference was significantly reduced at the higher cortical stage. Together this shows converging evidence across the two models (pRF and CF) of hierarchical refinement as a possible compensatory mechanism, where V3's altered connectivity helps to normalize the processing of degraded sensory input from V1.”

      Discussion (added paragraph):

      “The hierarchical reorganisation observed in V3 is unlikely to be driven by fixation instability. Connective field (CF) estimates are robust to eye movements (Tangtartharakul et al., 2023), because they are anchored to V1 inputs rather than absolute screen position. Considered alone, the pRF results could alternatively be explained by eye movements introducing a fixed size offset that affects smaller V1 pRFs more strongly than those in V3. While we found no evidence for this relationship between pRF size and gaze measures in our patients, we cannot fully rule out the possibility. Nevertheless, the internal consistency between the CF and pRF measures provides a more parsimonious account; that sampling across the hierarchy accounts for coarser tuning at the input stage.”

      Some analyses might also help provide the reader with insight. For example, doing analyses separately on V3 voxels that project entirely to scotoma regions, project entirely to stimulusdriven regions, and V3 voxels that project to 'mixed' regions.

      We agree that it is important to plot the connective field dynamics across the scotoma region.

      In Figure 4A we split the V3 vertices based on the V1 area they sample from. Therefore the 0°-1° would be considered as mainly sampling from the “scotoma” region and the higher the eccentricity is, the less “scotoma” it includes. The V3 vertices that have a significantly smaller CF size compared to controls are those sampling from mostly if not entirely stimulusdriven regions 4°-5° and 5°-6°. We are not sure how further binning the data by within, across and outside scotoma would be more informative.

      However, in Author response image 4, we plot in more details the distribution of CF sizes sampling from a V1 segment clearly inside and clearly outside the scotoma. The top figure shows the CF size distribution of V3 vertices that sample from a V1 0°-1° segment, where V1 is deprived of input due to the rod scotoma. In achromats, there is a clear drop in vertices with a very small (0.5) CF size. The bottom figure shows the distribution of V3 vertices that sample from the V1 4°-5° segment which falls outside the scotoma and shows a significant difference in CF size across the groups. Here in achromats you can see a drop in larger V3 CF sizes sampling from the V1 region, and an increase in smaller ones (note that this further addresses a previous concern that connective field differences across groups are solely driven by very small CFs).

      Author response image 4.

      Following the reviewer’s comment we have added the following statement in the results section discussing CF size:

      “The significant CF size differences are unlikely to be a model-fitting bias around a scotoma edge, as V3 vertices sampling from the immediate vicinity of the scotoma (1°3°) show CF sizes comparable to controls. The significant reduction in CF size occurs only further in the periphery (4°-6°), in regions that are primarily stimulus-driven.”

      The finding that pRF sizes are larger in achromats by a constant factor as a function of eccentricity is what differences in eye-movements would predict. It would be worth examining the relationship between pRF sizes and fixation stability.

      We found no relationship between fixation stability and pRF size in V1, although as we explain in response to an earlier point, this does not fully exclude the reviewers alterative explanation, which we now add to the discussion.

      Discussion:

      “The hierarchical reorganisation observed in V3 is unlikely to be driven by fixation instability. Connective field (CF) estimates are robust to eye movements (Tangtartharakul et al., 2023), because they are anchored to V1 inputs rather than absolute screen position. Considered alone, the pRF results could alternatively be explained by eye movements introducing a fixed size offset that affects smaller V1 pRFs more strongly than those in V3. While we found no evidence for this relationship between pRF size and gaze measures in our patients, we cannot fully rule out the possibility. Nevertheless, the internal consistency between the CF and pRF measures provides a more parsimonious account; that sampling across the hierarchy accounts for coarser tuning at the input stage.”

      Reviewer #2 (Public review):

      Summary:

      The authors inspect the stability and compensatory plasticity in the retinotopic mapping in patients with congenital achromatopsia. They report an increased cortical thickness in central (eccentricities 0-2 deg) in V1 and the expansion of this effect to V2 (trend) and V3 in a cohort with an average age of adolescents.

      In analyzing the receptive fields, they show that V1 had increased receptive field sizes in achromats, but there were no clear signs of reorganization filling in the rod-free area. In contrast, V3 showed an altered readout of V1 receptive fields. V3 of achromats oversampled the receptive fields bordering the rod-free zone, presumably to compensate and arrive at similar receptive fields as in the controls.

      These findings support a retention of peripheral-V1 connectivity, but a reorganization of later hierarchical stages of the visual system to compensate for the loss, highlighting a balance between stability and compensation in different stages of the visual hierarchy.

      Strengths:

      The experiment is carefully analyzed, and the data convey a clear and interesting message about the capacities of plasticity. 

      Weaknesses:

      The existence of unstable fixation and nystagmus in the patient group is alluded to, but not quantified or modeled out in the analyses. The authors may want to address this possible confound with a quantitative approach.

      We have responded to this in the “Recommendations for the authors” section of this reviewer, as they included a more detailed description of these points there.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      (1) I think the term rod monochromats should be included early in the paper since it's a more intuitive term to describe this population.

      We agree with the reviewer that the term “rod monochromats” is more intuitive as it clarifies the retinal source of the disease but have chosen the term achromats for consistency with a wide literature of published work in this group, including our own and our close collaborators’. To clarify, in the first mention of the group as achromats in the introduction we have now added this term:

      “Achromatopsia (also known as rod monochromacy) causes cone photoreceptors in the retina to be inactive from birth (Aboshiha et al., 2014).”

      (2) The paper essentially contains two definitions of 'eccentricity'. One (atlas/segments) comes from the Benson atlas and the other (functional) comes from pRF mapping. It would be good to make this distinction terminology clearer earlier in the paper. It would also be good to use more consistent terminology. I assume 'sampled atlas V1 eccentricity' in 3A is the same as 'V1 segment' in 1A?

      For consistency we have now referred to these as V1 segment and sampled V1 segment in the figures when describing the atlas-based definition, and eccentricity for the measured pRF-based eccentricity.

      (3) The 'stability vs. plasticity' framing in the introduction could be tightened slightly.

      We have made the following changes following the reviewer’s comment:

      “In the visual domain, the focal point of the debate on plasticity and stability has hinged on the extent to which retinal input deprivation can drive local reorganisation in early visual cortex, for example, for deprived tissue to take on inputs from spared retinal locations (Adams et al., 2007; Baker et al., 2005, 2008; Baseler et al., 2002, 2011; Calford et al., 2005; Dilks et al., 2009; Dumoulin & Knapen, 2018; Ferreira et al., 2016; Goesaert et al., 2014; Haak et al., 2015; Molz et al., 2023; Ritter et al., 2019; Schumacher et al., 2008). In reality visual impairment is a more global phenomenon, affecting all levels of visual processing, with complex dynamics beyond constricted local retinocortical projection zones(Carvalho et al., 2019).”

      (4) Figure 1A, define the x axis as degrees.

      We have now added the ° sign to all the tick labels indicating Benson map eccentricity.

      (5) Figure 2B, is there room for pictures of the silent substitution/standard stimulus

      We have now added images in a Supplement 5 to avoid cluttering the main Figure 2B

      (6) Figure 2

      Panel A has a slightly weird organization. The reader is supposed to compare the square symbols to each other, and the circles to each other, why not organize the figure so they are adjacent in the graph (i.e. non selective control, non-selective achromat, selective control, selective achromat)? That also helps the reader orient that in the non-selective conditions you have almost complete pRF coverage. 

      We have taken on the reviewer’s suggestion and changed the order.

      In the inset, maybe use empty symbols? That's the traditional way to say that the square/circle applies to both red and black.

      We prefer the current format.

      Figure 2C - the symbols change to circles? Why not keep the symbols of A?

      We have now changed the symbols of 2C&D.

      I'd put the non-selective maps above the selective maps?

      We appreciate the feedback but prefer to keep it as it is, as we feel the critical point is conveyed by the rod maps.

      (7) 'We propose a new hierarchical model of neural adaptation'. These ideas are hardly new. There are also other models, that would explain your data (cumulative plasticity) https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5953572/

      We thank the reviewer for the reference. We have now cited it in our discussion and removed the word “new” form the mentioned sentence.

      “Therefore, there is theoretically broader scope for experience-dependent reweighting of inputs (Beyeler et al., 2017; Makin & Krakauer, 2023) and to optimise use of inputs that are still available, more reliable, or more relevant in the impaired system. Conversely, higher-order visual areas may appear more plastic simply because they integrate the cumulative effects of learning from multiple lower stages (Beyeler et al., 2017).”

      We propose a hierarchical model of neural adaptation…” [deleted the word new]

      (8) Line 508. No image of the stimulus is contained in the paper

      Corrected

      (9) Line 620. I believe the Figure is 1B, not 1C.

      Corrected

      (10) Figure 4A. CF Size - add mm2 to the axes.

      Corrected

      Reviewer #2 (Recommendations for the authors):

      I am not an expert on pRF mapping, and as such, I am unsure how to relate to pRF mapping performed in patients with unstable fixation (not quantified, but referred to) and nystagmus, such as the achromatic population here. Since the majority of the results hinge on this analysis, I would appreciate more data about the differences between the groups. Supplement 2, which is meant to speak to this, shows only the data from 3 typical participants, and in itself is not evidence for "no correlation between stable fixation and enhanced foveal". Additionally, I'd appreciate a clear methods explanation of how the authors address these confounds; this is too important a concern to be left for the discussion section.

      We agree with the reviewer that eye movements could affect pRF measures. We have now also included data for all participants where we were able to obtain eye tracking measures and directly tested this relationship. Relevant results are copied below.

      Recap of results: 1) as expected gaze was less stable in achromats than controls, 2) achromats with more stable gaze did not show more activation in the scotoma projections zone, which we might have observed if fixation instability masks signals in this region 3) Gaze instability was not correlated with pRF size and eccentricity across V1 in achromats. We note that the relationship between nystagmus and visual sampling is complex - patients experience a stable image and may sample only during a specific phase of the eye movement. It is therefore not inherently clear if and how nystagmus affects pRF size.

      Relevant Manuscript text incorporating these analyses is copied below.

      To quantify eye movement, we used the following methods added to the manuscript:

      “Fixation stability

      Participants’ gaze was tracked throughout all pRF mapping runs. Collecting reliable gaze data from individuals with nystagmus is a challenge because out of the box calibration procedures mostly fail without stable fixation. To account for this, we implemented a post-hoc custom calibration procedure (Tailor et al., 2021). The eye-tracker was first precalibrated on a typically sighted individual. Then, before every other run, we collected gaze data from a 5-point fixation task (at fixation and above, below, left, and right of fixation at 5 eccentricity). This data allowed us to subsequently map the patient's recorded gaze coordinates to their precise locations on the screen. In 10 out of the 14 achromats we acquired reliable enough data to assess fixation stability.

      Calibration data processing: We first removed the first 0.5 seconds for each fixation location to allow for fixation to arrive on the target. We then performed (a) blink removal, (b) filtered out time points with eye movement velocity outliers (±2SD), and (c) filtered out any positions >3SDs to the left or right of the mean fixation location, and >1SD above or below. We took the median of the remaining gaze measurements as an approximate fixation estimate. The resulting 5 median fixation locations were used to fit an affine transformation that remapped the recorded gaze positions into screen space.

      Quantifying fixation stability: after applying the transformation of the post-hoc calibration, data was filtered for blinks and extreme velocities (<2SD). For each functional run, fixation instability was measured as the standard deviation of gaze x-positions across 1second windows. Measures when then averaged across the two run repeats.”

      Results (coverage section):

      “Another potential confound in our findings is fixation instability. In pRF mapping, which is usually conducted under photopic (cone-dominant) conditions, unstable fixation can cause a signal drop in the foveal projection zone. As expected due to nystagmus, the achromatopsia group showed higher fixation instability compared to controls (rodselective: t<sub>(9.08)</sub>=-3.19, p=0.01; non-selective: t<sub<(9.41)</sub>=-4.88, p<0.001 degrees-offreedom corrected for unequal-variance; see Supplement Figure S2a). However, several lines of evidence suggest this instability cannot fully account for the lack of "filling in" in achromats. First, within the achromat group, we found no correlation between fixation stability and coverage (rod-selective: spearman-r<sub>(8)</sub> = -0.36, p=0.31; non-selective spearman-r<sub>(8)</sub>=0.07,p=0.85); Individuals with more stable, control-like fixation did not show more signal inside the scotoma (see Supplement 2). Second, in adults with achromatopsia, typically with less severe nystagmus (Kohl et al., 1993), two recent studies also found absence of filling in (Anderson et al., 2024; Molz et al., 2023).

      So, while we cannot fully exclude nystagmus masking foveal signals in the cortex of some patients, this converging evidence from structural and functional MRI measures across different studies and groups, strongly suggests that the deprived cortex does not substantially ‘fill in’ with peripheral rod inputs in achromatopsia.”

      Results (pRF size + eccentricity):

      “Larger pRFs indicate that neuronal populations in achromats’ V1 cortex, combine information across larger areas in visual space than in typically sighted controls. This could reflect true neural tuning differences as well as be driven by larger eye movement. However, fixation instability in achromats do not significantly correlate with pRF size in our sample (rod-selective: spearman-r<sub>(8)</sub> = -0.41, p=0.24; non-selective spearman-r<sub>(8)</sub>=0.37,p=0.29)

      It has been shown that fitting artefacts around scotoma edges, can give rise to similar outward eccentricity shifts (Binda et al., 2013). However, when accounting for fitting artefacts around the foveal scotoma edge by modelling the rod-free zone during pRF fitting, pRF size and eccentricity differences remain unchanged (see Supplement 3). Finally, we found no significant correlations between gaze stability and the eccentricity shift (rod-selective: spearman-r<sub>(8)</sub> = 0.58, p=0.08; non-selective spearman-r<sub>(8)</sub>=0.09,p=0.8, Supplement 4D)

      Together, these analyses reveal subtle differences in how V1 of achromats responds to rod signals outside the foveal zone, which are consistent with results from other studies (Molz et al. 2023, Anderson et al. 2024). While we found no direct evidence that these are being driven by confounding factors such as eye-movements or fitting artefacts, more work is needed to understand the underlying processes that give rise to these shifts.”

      The following text has been added to Supplement 2

      “As expected, achromats showed significant higher fixation instability compared to controls (as reported in the main text). We found no significant correlation between fixation instability and either coverage, pRF size, eccentricity in achromats. Results of Spearman R correlations in both rod- and non-selective conditions are reported in the figure. We note that the relationship between nystagmus and visual sampling is complex- patients experience a stable image and may sample only during specific eyemovement phases. It is therefore not fully clear if and how nystagmus should give rise to altered pRFs.”

      The field connectivity analysis similarly seems to be used only on task data from the same design; if it was replicated from resting-state data, that would be a good way to show consistency which is independent of measures requiring fixation. 

      We agree that resting-state data would be valuable; however, we did not collect such data in these individuals due to time limitations. Instead, we demonstrate the consistency and reliability of our results by replicating our findings across two different stimulation conditions (rod-selective and non-selective), which differ in luminance, contrast and signal amplitude in both groups and for controls also in the photoreceptors involved. The convergence of results across these distinct visual conditions strengthens our confidence in the reliability of the observed effects. Also, notably, CF estimates have been shown to be robust to large eye movements, and therefore also to differences in fixation stability across groups (Tangtartharakul et al., 2023).

      The authors may want to contextualize their findings in relation to what reorganization exists in cases of late-onset loss of part of the visual field on one hand (stroke recovery), and in the case of complete blindness from early life on the other, as both speak to different levels of plasticity the visual system is capable of.

      We thank the reviewer for their comment and have added a new paragraph discussing this topic.

      Discussion:

      “Our findings on hierarchical adaptation have broader implications for other visual disorders, depending on their timing and nature. For instance, a central scotoma acquired in adulthood, as in macular degeneration, may not trigger the same V3 sampling shifts (Haak et al., 2016), suggesting a sensitive window for this form of plasticity, after which connective fields remain more stable. This also raises questions about congenital blindness, where the absence of any driving input could lead to weakening or repurposing of hierarchical connections (Saccone et al., 2024). Moreover, principles may differ between a deprived but structurally intact cortex, as in retinal dystrophies, and a physically damaged cortex, as in stroke. In the latter, more extensive reorganisation may be required to sample effectively from surviving, and potentially disparate, regions of V1. Perceptual training effects in stroke rehabilitation may reflect such dynamics (Cavanaugh et al., 2025; Elshout et al., 2021).”

      A more minor point: Can the authors clarify what the dark adaptation is used for, and provide the supplementary analysis showing that the duration difference for some of the participants didn't impact the results (stated but not shown).

      The dark adaptation period before the rod-selective condition allowed rod photoreceptors to recover from bleaching caused by prior mesopic light exposure, ensuring optimal rod sensitivity under scotopic conditions. To verify that our 15-minute adaptation period was sufficient, we tested 10 control participants with an extended 45-minute adaptation period. As we found no differences in the resulting rod maps between standard and extended adaptation protocols, these participants were combined with the main control group for all analyses. Author response image 5 are the plots for the two dark adaptation periods.

      Author response image 5.

    1. On 2020-04-16 12:20:10, user Marlowe Fox wrote:

      The tests on the efficacy of HCQ are confounded by multiple variables, including comorbidities, symptom onset, prescription drugs (RAAS inhibitors appear to play a key role in viral intensity), and testosterone/estrogen level, to name only a few.

      Geneticists, epidemiologists, and other scientists have long used casual diagrams to clearly show variables that may potentially confound their results (1). The Wuhan study at the very least would need to account for the following:

      HCQ <— comorbidities —> recovery<br /> HCQ <— symptom onset —> recovery<br /> HCQ <— drug prescriptions —> recovery

      Adjusting for the confounding variable would essentially smooth out the flow of information between the treatment (HCQ) and the outcome (recovery), allowing for the inference of causal effects.

      Assuming observable data is not available to adjust for confounding variables, a casual mechanism (mediator) could smooth out the flow of information from the treatment to the outcome (so long as the mediator is not influenced by confounder).

      Luckily, multiple in vitro studies have been performed. One study posits that HCQ lowers endosomal pH which ultimately inhibits COVID from binding to ACE 2 and decreasing viral intensity (3).

      HCQ —> endosomal pH —>glycosylation of COVID cellular receptor —> ACE 2 binding —> viral intensity —> acute lung injury

      Another in-silico study posits that HCQ blocks specific protein sites on the host ACE2 cell, thereby thwarting its attempt to infect it and preventing the cytokine storm (over-reaction of the lymphatic system) that some posit is responsible for Acute Lung Injury (3). So here we have an entirely different causal mechanism:

      HCQ —> BRD-2 receptor sites —> cytokine storm —> acute lung injury

      Despite these problems, some believe that the p-values obviate the need to control for potentially lurking variables. However, they are subject to myriad influences, known as p-hacking. Whether it is the number of tests performed or the number of comparisons made, it increases the chance of finding a statistically significant p-value (4). Three professional statisticians co-authored a paper reviewing the validity of the Wuhan study (5). There were several issues with the data upon which the two significant p-values were based.

      I suppose there is also a pragmatic argument: The p-values, along with existing studies and reports, are sufficient enough evidence to offset any concern for lurking variables in these urgent times. In other words, how much evidence is sufficient to warrant large scale roll-out of a low-cost treatment that may have a beneficial effect, from saving individuals who would have otherwise died to curbing its spread?

      The consequences of large roll-out: manufacturing, scaling, distribution chains, and so forth could result in a tremendous diversion of resources. How many pharmaceutical manufacturers even have the capacity to roll out production of this magnitude? What if they all start scaling their labor to produce this particular drug. You can’t just put this genie back into the bottle. Not to mention the scientific energy/intellectual capital that would go to proving or disproving this proposed treatment. And why? Because scientific evidence demanded it? No because a tortured p-value and unpublished/unsubstantiated anecdotal evidence caught the attention of some in the media, and it has been over-popularized as a panacea. What about the risk that HCQ is not an effective treatment despite large investments in cash and resources that have been invested? Do you think the wheels of capitalism turn so easily? Investors will want a return and if that means continually touting an ineffective drug through spurious science, they will continue to do so. What about individuals taking HCQ as a prophylactic, believing themselves to be protected against COVID? Or COVID+ individuals taking HCQ and believing themselves to be cured? Or individuals who think: Well, if I get it—I’ll just take HCQ and be fine. This would increase the spread of COVID. From my perspective, the ignorance to viral transmission and the required precautions is widespread. This is just one more reason not to acquiesce to the new social norms of wearing face masks, social distancing, and abiding by shelter-in-place rules. Here, I think an understanding of cognitive psychology is important to anticipate the future behavior of a society in which a cheap and easy-to-manufacture cure is published in the media.

      To sum up, HCQ's efficacy is not sufficiently proven to warrant a widespread roll-out, because it could result in several downstream consequences, from the diversion of resources (both manufacturing capabilities and intellectual capital) to increasing the risk threshold of individuals--who spurious believe in an easy and cheap treatment--thereby increasing the infection rate. One of two things needs to happen. Clinical trials that properly adjust for all potential comorbidities. Or the discovery of a causal mechanism (in vivo), which would obviate the need to control/adjust for confounders. For me, this would tip the utilitarian scales in regard to the potential benefits versus the risks.

      References

      1. Judea Pearl and Dana Mackenzie. 2018. The Book of Why: The New Science of Cause and Effect (1st. ed.). Basic Books, Inc., USA.
      2. https://www.ncbi.nlm.nih.go...
      3. https://papers.ssrn.com/sol...
      4. https://www.scientificameri....
      5. https://zenodo.org/record/3....
    2. On 2020-03-30 22:23:50, user Sinai Immunol Review Project wrote:

      Study Description

      This is a randomized clinical trial of hydroxychloroquine (HCQ) efficacy in the treatment of COVID-19. From February 4 – February 28, 2020 142 COVID-19 positive patients were admitted to Renmin Hospital of Wuhan University. 62 patients met inclusion criteria and were enrolled in a double blind, randomized control trial, with 31 patients in each arm.

      Inclusion criteria:<br /> 1. Age >= 18 years<br /> 2. Positive diagnosis COVID-19 by detection of SARS-CoV-2 by RT-PCR<br /> 3. Diagnosis of pneumonia on chest CT <br /> 4. Mild respiratory illness, defined by SaO2/SPO2 ratio > 93% or PaO2/FIO2 ratio > 300 mmHg in hospital room conditions (Note: relevant clinical references described below.)

      a. Hypoxia is defined as an SpO2 of 85-94%; severe hypoxia < 85%. <br /> b. The PaO2/FIO2 (ratio of arterial oxygen tension to fraction of inspired oxygen) is used to classify the severity of acute respiratory distress syndrome (ARDS). Mild ARDS has a PaO2/FIO2 of 200-300 mmHg, moderate is 100-200, and severe < 100.

      1. Willing to receive a random assignment to any designated treatment group; not participating in another study at the same time

      Exclusion criteria: <br /> 1. Severe or critical respiratory illness (not explicitly defined, presumed to be respiratory function worse than outlined in inclusion criteria); or participation in trial does not meet patient’s maximum benefit or safe follow up criteria<br /> 2. Retinopathy or other retinal diseases<br /> 3. Conduction block or other arrhythmias<br /> 4. Severe liver disease, defined by Child-Pugh score >= C or AST > twice the upper limit<br /> 5. Pregnant or breastfeeding<br /> 6. Severe renal failure, defined by eGFR <= 30 mL/min/1.73m2, or on dialysis<br /> 7. Potential transfer to another hospital within 72h of enrollment<br /> 8. Received any trial treatment for COVID-19 within 30 days before the current study

      All patients received the standard of care: oxygen therapy, antiviral agents, antibacterial agents, and immunoglobulin, with or without corticosteroids. Patients in the HCQ treatment group received additional oral HCQ 400 mg/day, given as 200 mg 2x/day. HCQ was administered from days 1-5 of the trial. The primary endpoint was 5 days post enrollment or a severe adverse reaction to HCQ. The primary outcome evaluated was time to clinical recovery (TTCR), defined as return to normal body temperature and cough cessation for > 72h. Chest CT were imaged on days 0 and 6 of the trial for both groups; body temperature and patient reports of cough were collected 3x/day from day 0 – 6. The mean age and sex distribution between the HCQ and control arms were comparable.

      Findings

      There were 2 patients showing mild secondary effects of HCQ treatment. More importantly, while 4 patients in the control group progressed to severe disease, none progressed in the treatment group.<br /> TTCR was significantly decreased in the HCQ treatment arm; recovery from fever was shortened by one day (3.2 days control vs. 2.2 days HCQ, p = 0.0008); time to cessation of cough was similarly reduced (3.1 days control vs. 2.0 days HCQ, p = 0.0016).<br /> Overall, it appears that HCQ treatment of patients with mild COVID-19 has a modest effect on clinical recovery (symptom relief on average 1 day earlier) but may be more potent in reducing the progression from mild to severe disease.

      Study Limitations

      This study is limited in its inclusion of only patients with mild disease, and exclusion of those on any treatment other than the standard of care. It would also have been important to include the laboratory values of positive RT-PCR detection of SARS-CoV-2 to compare the baseline and evolution of the patients’ viral load.

      Significance

      Despite its limitations, the study design has good rigor as a double blind RCT and consistent symptom checks on each day of the trail. Now that the FDA has approved HCQ for treatment of COVID-19 in the USA, this study supports the efficacy of HCQ use early in treatment of patients showing mild symptoms, to improve time to clinical recovery, and possibly reduce disease progression. However, most of the current applications of HCQ have been in patients with severe disease and for compassionate use, which are out of the scope of the findings presented in this trial. Several additional clinical trials to examine hydroxychloroquine are now undergoing; their results will be critical to further validate these findings.

      Reviewed by Rachel Levantovsky as a part of a project by students, postdocs and faculty in the Immunology Institute at the Icahn school of Medicine at Mount Sinai.

    3. On 2020-04-01 16:34:11, user Sinai Immunol Review Project wrote:

      Study Description <br /> This is a randomized clinical trial of hydroxychloroquine (HCQ) efficacy in <br /> the treatment of COVID-19. From February 4 – February 28, 2020 142 <br /> COVID-19 positive patients were admitted to Renmin Hospital of Wuhan <br /> University. 62 patients met inclusion criteria and were enrolled in a <br /> double blind, randomized control trial, with 31 patients in each arm.

      Inclusion criteria:<br /> 1. Age >= 18 years<br /> 2. Positive diagnosis COVID-19 by detection of SARS-CoV-2 by RT-PCR<br /> 3. Diagnosis of pneumonia on chest CT <br /> 4. Mild respiratory illness, defined by SaO2/SPO2 ratio > 93% or <br /> PaO2/FIO2 ratio > 300 mmHg in hospital room conditions (Note: <br /> relevant clinical references described below.)<br /> a. Hypoxia is defined as an SpO2 of 85-94%; severe hypoxia < 85%. <br /> b. The PaO2/FIO2 (ratio of arterial oxygen tension to fraction of inspired<br /> oxygen) is used to classify the severity of acute respiratory distress <br /> syndrome (ARDS). Mild ARDS has a PaO2/FIO2 of 200-300 mmHg, moderate is <br /> 100-200, and severe < 100.<br /> 5. Willing to receive a random assignment to any designated treatment group; not participating in another study at the same time

      Exclusion criteria: <br /> 1. Severe or critical respiratory illness (not explicitly defined, <br /> presumed to be respiratory function worse than outlined in inclusion <br /> criteria); or participation in trial does not meet patient’s maximum <br /> benefit or safe follow up criteria<br /> 2. Retinopathy or other retinal diseases<br /> 3. Conduction block or other arrhythmias<br /> 4. Severe liver disease, defined by Child-Pugh score >= C or AST > twice the upper limit<br /> 5. Pregnant or breastfeeding<br /> 6. Severe renal failure, defined by eGFR <= 30 mL/min/1.73m2, or on dialysis<br /> 7. Potential transfer to another hospital within 72h of enrollment<br /> 8. Received any trial treatment for COVID-19 within 30 days before the current study

      All patients received the standard of care: oxygen therapy, antiviral <br /> agents, antibacterial agents, and immunoglobulin, with or without <br /> corticosteroids. Patients in the HCQ treatment group received additional<br /> oral HCQ 400 mg/day, given as 200 mg 2x/day. HCQ was administered from <br /> days 1-5 of the trial. The primary endpoint was 5 days post enrollment <br /> or a severe adverse reaction to HCQ. The primary outcome evaluated was <br /> time to clinical recovery (TTCR), defined as return to normal body <br /> temperature and cough cessation for > 72h. Chest CT were imaged on <br /> days 0 and 6 of the trial for both groups; body temperature and patient <br /> reports of cough were collected 3x/day from day 0 – 6. The mean age and <br /> sex distribution between the HCQ and control arms were comparable.

      Findings<br /> There were 2 patients showing mild secondary effects of HCQ treatment. More <br /> importantly, while 4 patients in the control group progressed to severe <br /> disease, none progressed in the treatment group.<br /> TTCR was significantly decreased in the HCQ treatment arm; recovery from fever <br /> was shortened by one day (3.2 days control vs. 2.2 days HCQ, p = <br /> 0.0008); time to cessation of cough was similarly reduced (3.1 days <br /> control vs. 2.0 days HCQ, p = 0.0016).<br /> Overall, it appears that HCQ treatment of patients with mild COVID-19 has a modest effect on clinical recovery (symptom relief on average 1 day earlier) but may be more <br /> potent in reducing the progression from mild to severe disease.

      Study Limitations <br /> This study is limited in its inclusion of only patients with mild disease, <br /> and exclusion of those on any treatment other than the standard of care.<br /> It would also have been important to include the laboratory values of <br /> positive RT-PCR detection of SARS-CoV-2 to compare the baseline and <br /> evolution of the patients’ viral load.

      Significance<br /> Despite its limitations, the study design has good rigor as a double blind RCT <br /> and consistent symptom checks on each day of the trail. Now that the FDA<br /> has approved HCQ for treatment of COVID-19 in the USA, this study <br /> supports the efficacy of HCQ use early in treatment of patients showing <br /> mild symptoms, to improve time to clinical recovery, and possibly reduce<br /> disease progression. However, most of the current applications of HCQ <br /> have been in patients with severe disease and for compassionate use, <br /> which are out of the scope of the findings presented in this trial. <br /> Several additional clinical trials to examine hydroxychloroquine are now<br /> undergoing; their results will be critical to further validate these <br /> findings.

      Reviewed by Rachel Levantovsky as a part of a project<br /> by students, postdocs and faculty in the Immunology Institute at the <br /> Icahn school of Medicine at Mount Sinai.

    1. On 2020-03-26 15:11:11, user Sinai Immunol Review Project wrote:

      Study description: Plasma cytokine analysis (48 cytokines) was performed on COVID-19 patient plasma samples, who were sub-stratified as severe (N=34), moderate (N=19), and compared to healthy controls (N=8). Patients were monitored for up to 24 days after illness onset: viral load (qRT-PCR), cytokine (multiplex on subset of patients), lab tests, and epidemiological/clinical characteristics of patients were reported.

      Key Findings:<br /> • Many elevated cytokines with COVID-19 onset compared to healthy controls <br /> (IFNy, IL-1Ra, IL-2Ra, IL-6, IL-10, IL-18, HGF, MCP-3, MIG, M-CSF, G-CSF, MIG-1a, and IP-10).<br /> • IP-10, IL-1Ra, and MCP-3 (esp. together) were associated with disease severity and fatal outcome. <br /> • IP-10 was correlated to patient viral load (r=0.3006, p=0.0075).<br /> • IP-10, IL-1Ra, and MCP-3 were correlated to loss of lung function (PaO2/FaO2 (arterial/atmospheric O2) and Murray Score (lung injury) with MCP-3 being the most correlated (r=0.4104 p<0.0001 and r=0.5107 p<0.0001 respectively).<br /> • Viral load (Lower Ct Value from qRT-PCR) was associated with upregulated IP-10 only (not IL-1Ra or MCP-3) and was mildly correlated with decreased lung function: PaO2/FaO2 (arterial/atmospheric O2) and Murray Score (lung injury).<br /> • Lymphopenia (decreased CD4 and CD8 T cells) and increased neutrophil correlated w/ severe patients.<br /> • Complications were associated with COVID severity (ARDS, hepatic insufficiency, renal insufficiency).

      Importance: Outline of pathological time course (implicating innate immunity esp.) and identification key cytokines associated with disease severity and prognosis (+ comorbidities). Anti-IP-10 as a possible therapeutic intervention (ex: Eldelumab).

      Critical Analysis: Collection time of clinical data and lab results not reported directly (likely 4 days (2,6) after illness onset), making it very difficult to determine if cytokines were predictive of patient outcome or reflective of patient compensatory immune response (likely the latter). Small N for cytokine analysis (N=2 fatal and N=5 severe/critical, and N=7 moderate or discharged). Viral treatment strategy not clearly outlined.

    1. On 2024-11-08 19:59:59, user Andre Boca Ribas Freitas wrote:

      Important Observations on Underreported Chikungunya Mortality in Light of Global Burden Analysis

      Dear Authors,

      I thoroughly appreciated your recent preprint on the global burden of chikungunya and the potential benefits of vaccination. Your work provides critical insights into the widespread impact of this disease and emphasizes the significant potential of vaccine interventions.

      However, I wanted to highlight a critical issue that our research and that of others in the field have identified: the substantial underreporting of chikungunya-related mortality across many regions. While chikungunya is often categorized as a non-fatal disease, a growing body of evidence reveals severe and sometimes fatal cases that frequently go unrecorded by epidemiological systems. Our recent studies in Brazil documented excess mortality rates from chikungunya far surpassing those officially reported, with mortality rates up to 60 times higher than recorded by standard surveillance systems?Freitas et al., 2024?. Additionally, studies like those by Mavalankar et al. (2008) in India and Beesoon et al. (2008) in Mauritius underscore the elevated mortality associated with chikungunya during epidemic outbreaks, further reinforcing this critical gap in mortality surveillance.<br /> This growing evidence highlights the critical need for increased investment in molecular diagnostics, integrated surveillance, and more comprehensive mortality tracking for chikungunya. These measures are essential for aligning public health responses with the true impact of the disease and ensuring the full scope of chikungunya’s burden is addressed.

      Thank you for advancing this essential conversation. Through improved surveillance and research collaboration, we can work toward effective strategies to mitigate the severe impact of chikungunya globally.

      Best regards,

      Dr. André Ricardo Ribas Freitas<br /> Faculty of Medicine, São Leopoldo Mandic, Campinas-SP, Brasil

      Freitas ARR, et al. Excess Mortality Associated with the 2023 Chikungunya Epidemic in Minas Gerais, Brazil. Front Trop Dis. 2024. doi: 10.3389/fitd.2024.1466207.

      Mavalankar D, Shastri P, Bandyopadhyay T, Parmar J, Ramani KV. Increased mortality rate associated with chikungunya epidemic, Ahmedabad, India. Emerg Infect Dis. 2008 Mar;14(3):412-5. doi: 10.3201/eid1403.070720. PMID: 18325255; PMCID: PMC2570824.

      Beesoon S, Funkhouser E, Kotea N, Spielman A, Robich RM. Chikungunya fever, Mauritius, 2006. Emerg Infect Dis. 2008 Feb;14(2):337-8. doi: 10.3201/eid1402.071024. PMID: 18258136; PMCID: PMC2630048.

      Manimunda SP, Mavalankar D, Bandyopadhyay T, Sugunan AP. Chikungunya epidemic-related mortality. Epidemiol Infect. 2011 Sep;139(9):1410-2. doi: 10.1017/S0950268810002542. Epub 2010 Nov 15. PMID: 21073766.

      Freitas ARR, Donalisio MR, Alarcón-Elbal PM. Excess Mortality and Causes Associated with Chikungunya, Puerto Rico, 2014-2015. Emerg Infect Dis. 2018 Dec;24(12):2352-2355. doi: 10.3201/eid2412.170639. Epub 2018 Dec 17. PMID: 30277456; PMCID: PMC6256393.

      Freitas ARR, Gérardin P, Kassar L, Donalisio MR. Excess deaths associated with the 2014 chikungunya epidemic in Jamaica. Pathog Glob Health. 2019 Feb;113(1):27-31. doi: 10.1080/20477724.2019.1574111. Epub 2019 Feb 4. PMID: 30714498; PMCID: PMC6427614.

    1. On 2024-12-03 21:03:36, user xPeer wrote:

      Courtesy review from xPeerd.com

      This manuscript introduces DeepEnsembleEncodeNet (DEEN), an innovative polygenic risk score (PRS) model integrating autoencoders and fully connected neural networks (FCNNs) to address limitations of existing PRS methods. By disentangling dimensionality reduction and predictive modeling, DEEN enables the capture of both linear and non-linear SNP effects, improving prediction accuracy and risk stratification for binary (e.g., hypertension, type 2 diabetes) and continuous traits (e.g., BMI, cholesterol). Evaluation using UK Biobank and All of Us datasets highlights superior performance over established methods. While conceptually and methodologically compelling, areas such as interpretability, generalizability across diverse populations, and computational efficiency warrant further refinement.

      Major Revisions<br /> 1. Interpretability and Practicality<br /> Black-Box Concerns: The complexity of the DEEN model limits its interpretability compared to simpler PRS methods like Lasso or PRSice. While the manuscript acknowledges this limitation, incorporating efforts to visualize model predictions (e.g., feature importance maps or SNP clustering analysis) would enhance its usability (Section: Discussion, p.16).<br /> Clinical Translation: The manuscript emphasizes the potential of DEEN for clinical utility but lacks discussion on the challenges of implementing deep learning models in healthcare. Addressing regulatory barriers and clinician engagement would add value (Section: Discussion, p.17).<br /> 2. Population Generalizability<br /> Demographic Bias: Both datasets used (UK Biobank, All of Us) consist predominantly of European-ancestry individuals. This limits the model's applicability to global populations. Expanding the discussion on efforts to improve DEEN’s cross-ancestry generalizability is essential (Section: Results, p.11).<br /> Validation Across Diverse Cohorts: While DEEN is validated on two datasets, additional external validations across non-European populations would strengthen claims of generalizability and reliability.<br /> 3. Comparative Analyses<br /> Missing Baseline Methods: Although DEEN is compared with multiple PRS methods, inclusion of additional machine learning benchmarks (e.g., gradient boosting models, convolutional neural networks for SNP effects) would better contextualize DEEN’s advantages (Section: Results, p.8).<br /> Risk Stratification Assessment: The risk stratification results are promising but need more rigorous evaluation metrics beyond odds ratios, such as net reclassification improvement (NRI) or integrated discrimination improvement (IDI).<br /> 4. Computational Efficiency<br /> Resource Requirements: DEEN’s reliance on high-performance computing resources (e.g., GPU usage) is noted but not sufficiently quantified. Providing benchmarks of computational costs and runtime against alternative methods is crucial for practical implementation (Section: Methods, p.19).<br /> Optimization: While grid search was used for hyperparameter tuning, exploring automated optimization frameworks (e.g., Bayesian optimization) could reduce computational overhead.<br /> 5. Data Filtering and Variant Selection<br /> Potential Bias from Variant Filtering: The preselection of SNPs based on p-values may exclude rare variants or those with small effects. A sensitivity analysis on SNP filtering thresholds would clarify the robustness of DEEN’s predictive power (Section: Methods, p.20).<br /> Minor Revisions<br /> 1. Typos and Formatting<br /> Figure Legends: Some figures (e.g., Figure 5) lack clear explanations of axes and statistical methods.<br /> Grammar: Line 124: Replace "similarly drive CRC progression" with "similarly drive progression."<br /> 2. AI Content Analysis<br /> Estimated AI-Generated Content: ~20-25%.<br /> Implications: Repetitive phrasing in methodological descriptions and literature summaries suggests potential AI assistance. While the technical content appears valid, manual rephrasing can enhance originality and scientific depth.<br /> 3. Statistical Reporting<br /> Insufficient Confidence Intervals: Odds ratio enrichment results lack 95% confidence intervals in several places, undermining statistical rigor (Section: Results, p.9).<br /> Inconsistent Metric Definitions: Terms like “improved R²” and “higher AUC” are used loosely. Precise numerical values and effect size comparisons would improve clarity.<br /> 4. Terminology Consistency<br /> Key terms like "dimensionality reduction" and "risk stratification" should be consistently defined and applied across sections to avoid ambiguity.<br /> Recommendations<br /> Enhance Model Interpretability:

      Integrate explainability tools (e.g., SHAP values, visualization of autoencoder layers) to clarify how SNPs influence predictions.<br /> Discuss the potential for hybrid models balancing interpretability and performance.<br /> Address Demographic Bias:

      Validate DEEN using datasets from underrepresented populations (e.g., African, Asian ancestries).<br /> Incorporate transfer learning techniques to enhance generalizability.<br /> Benchmarking and Evaluation:

      Compare DEEN against additional advanced machine learning models for PRS.<br /> Introduce advanced evaluation metrics like NRI and IDI to strengthen claims.<br /> Refine Computational Analysis:

      Provide detailed resource utilization benchmarks.<br /> Explore alternative hyperparameter optimization methods to improve training efficiency.<br /> Expand Data Analysis:

      Perform a sensitivity analysis on variant filtering thresholds.<br /> Investigate the inclusion of rare variants to improve model robustness.

    1. On 2022-07-10 23:39:20, user Charles Warden wrote:

      Hi,

      Thank you very much for posting this preprint.

      I consider the topics raised by this study to be important and interesting.

      However, I have some comments and questions:

      1) I agree that confirmation bias can be a contributing factor. However, I think true limitations in utility are also important. So, I am not sure if I completely agree with the statement "When results were not consistent with participant’s personal or family history, many participants found reasons to dismiss or discredit these results. This indicates a role for confirmation bias in responses to [self-initiated] PRS." For example, I might really want to understand the genetic basis for a disease, but the percent heritability explained by the PRS may be low and I could therefore be disappointed with the usefulness of a PRS due to a discordant result.

      I have a blog post where I share my impute.me scores (along with others):

      https://cdwscience.blogspot.com/2019/12/prs-results-from-my-genomics-data.html

      I don't know if I would exactly say my response was "negative," but I certainly got the impression the PRS that I saw may have limited utility. In that sense, my view of the method was not positive, even if it did not evoke a strong emotional "negative" response.

      Within that blog post, “ulcerative colitis” would be an example where there were different PRS for the same disease but very different percentiles (for the same SNP chip). So, I would consider that an example of the reaction that is described being due to something other than confirmation bias.

      2) Did the interviewers respond when there were possible points of misunderstanding during the interview process?

      It was acknowledged as a limitation in the discussion: "the researchers did not have access to participant’s PRS results and were unable to evaluate people’s understanding of their results".

      However, it seems like that could be important. For example, there is a quote "Unfortunately, I do regret getting a PRS… I would have rather not known. I like uncertainty". Assuming that there were appropriate limitations to communicate, I believe a response from the interviewer might cause that quote to no longer reflect the subject’s opinion.

      In general, there appears to be a noticeable emphasis on mental health in the article. My opinion is that this is an area where limitations are particularly important. If it helps, I think there are some additional details in this blog post for the book Blueprint.

      In terms of my own impute.me results, I thought the "anxiety" PRS seemed reasonable (to the best of my ability to assess that). However, I also thought changes in conditions over time were important, and I thought there was potential for misuse.

      3a) I think it is a minor point, but I don't remember receiving an invite to join a Zoom meeting for a discussion about my impute.me results.

      I hope that I was one of the 209 candidates, but I was not sure if I could confirm that. I also noticed mention of categories like “medium” or “low” for one quote referencing a z-score of 2.5, but I only saw the continuous score distribution in the screenshots from my blog post.

      3b) Perhaps more importantly, I tried to go back to sign in to check if I missed something.

      In the Folkersen et al. 2020 paper, the link provided is for https://www.impute.me/. However, that link currently re-directs to a Nucleus website (https://mynucleus.com/).

      Can you please provide some more information about the re-direction of the impute.me link?

      For example, I submitted an e-mail to register on the new website, but I don't think I can see my earlier results anymore?

      Additionally, I was confused when I couldn’t find the GitHub code provided with that paper: https://github.com/lassefolkersen/impute-me

      4) Finally, but I don't think either of the 2 models that I see ("dismissed medical concerns" and "medical distrust") are a great description for myself. I think something like "curiosity" and "critical assessment" would be more appropriate for myself.

      For example, I wouldn't say I distrust the healthcare system or medical research broadly, but I do think feedback and engagement is important. Thus, when I encounter problems, I submit reports to FDA MedWatch. Likewise, I contribute data/experience to projects like PatientsLikeMe.

      Thanks Again,<br /> Charles

    1. On 2022-08-09 12:40:13, user PhillyPharmaBoy wrote:

      The authors conducted a thorough evaluation of the impact of ivermectin on SARS-CoV-2 clearance. On the surface their results differ from those of Krolewiecki, et al. (below). In a post hoc analysis these investigators found that ivermectin accelerated viral decay when drug concentration (4 hr) exceeded 160 ng/ml. It would be useful for the PLATCOV Group to mention this study and discuss potential reason(s) for the discrepancy.

      https://www.sciencedirect.c...

    1. On 2020-04-30 02:42:13, user Tyler Chen wrote:

      I appreciate the authors’ urgency in addressing SARS-CoV-2 decontamination for reuse of N95 filtering facepiece respirators (FFRs). In the spirit of that urgency and health impacts, I note two concerns with the current preprint that could accidentally cause confusion: (1) The paper claims N95 filtration is preserved after microwave-generated steam, whereas the test listed in the methods was a TSI quantitative fit test, which is primarily designed to test fit, not necessarily filtration. (2) The paper’s claim of a universally accessible N95 decontamination protocol may accidentally overstate the N95 models for which this protocol is verified. N95 models vary widely in their construction and resistance to steam heat, so any models other than the one used in this experiment will likely require thorough testing before this method is applied.

      I would suggest that the authors make the following changes:<br /> (1) Clarify whether or not filtration is verified at larger particle sizes and charge (e.g. 0.26 microns, uncharged). If filtration is not yet verified at larger particle sizes, this test may be important to verify N95 performance following microwave steam treatment.<br /> To provide some background: The TSI 8026 Particle Generator generates 0.04 micron particles [1] that are intended to be readily filtered by the N95, and are assumed to only enter the N95 through gaps in the face seal and not through the mask material itself [2]. It is possible for N95 FFRs to pass quantitative fit tests while still failing filtration tests at different particle sizes--one study “observed [protection factors] <100 even for subjects who passed fit testing (fit factor > 100)” [3]. Therefore, fit testing using the 8026 particle generator does not imply that N95 filtration is necessarily preserved at larger particle sizes which are most relevant for filtration effectiveness in the SARS-CoV-2 pandemic, especially given the fact that the decontamination treatment may shift what particle size is most penetrating for the N95. Recent non-peer-reviewed research shows N95 material suffering a decrease in filtration of 0.26 micron particles after 4-5 cycles of 10min stovetop steam treatment [4], though it is unclear from this manuscript if the MGS treatment did not reach this limit, or that the limit was not observed due to the different particle size used. Therefore, testing for quantitative fit should perhaps be supplemented by filtration tests at larger particle diameters (whether from this study or by citing others), especially when a decontamination process is involved such as steam heat that has an unknown effect on the most penetrating particle size for the N95. Given the potential for widespread implementation of this protocol it seems important that this point be clarified.

      (2) Secondly, it may be important to notify readers that the performance of the N95 model in this paper likely cannot be generalized to all N95 models without further testing. N95 models vary widely in construction and resistance to steam, and each model should be individually verified to maintain both fit and filtration by this protocol before use. This is supported by the fact that N95 models can vary widely with respect to the most penetrating particle size, and each country may only have access to certain N95 models [3,4,5]. Furthermore, there is literature evidence that the mask performance in response to steam and heat also varies across N95 models (see the table of results in Appendix B https://www.n95decon.org/fi... "https://www.n95decon.org/files/heat-humidity-technical-report)"). Therefore, it is important to clarify in the text that this method is not yet universally-validated -- N95 fit has only been verified for the 3M 1860 molded N95 in particular, and other models are likely to have significantly different behavior. Independent verification of both fit and filtration may be needed for other N95 models.

    1. On 2020-04-30 21:06:27, user Tim Lawes wrote:

      A great paper by very respected researchers and clinicians, ruined by a bizarre press-release saying COVID-19 'as deadly as Ebola'. Let's take a fact-check on that one:<br /> 1. Case fatality rates not equivalent: Ebola CFR 50-75% in recent outbreaks, not 33% as in COVID-19. If referring to high-income country (HIC) stats only, talking handful cases treated in HIC with Ebola, all working age occupationally fit health workers, CFR ~18% (n=5/27 quoted).<br /> 2. Totally different age of death: Ebola median 30-35 yrs, COVID 80yrs. Ebola 95% deaths <60 yrs, Covid ~10% < 60yrs.<br /> 3. Translate age at death to Years of Life Lost (YLL): comparing age at death profile provided by Doherty et al to Ebola papers, each Ebola death 'costs' 45 YLL, vs. each COVID death costs 1.4 YLL.<br /> 4. Translated to population impact. To exceed an equivalent burden of YLL per capita in West Africa in 2013-16 would require >1 million deaths in UK from COVID-19.

      To compare the mortality profiles of Ebola and COVID-19 is at best bad science, at worst an example of misinformation that perpetuates global health inequities. I don't imagine authors intended this, but I'm afraid its this sort of comment that creates hysteria, rather than appropriate responses. As a paediatrician we are seeing children coming to harm from avoiding hospital and not being seen in community due to misjudged risk assessments. A genuine thank you for your contribution to science, but please ensure reporting is responsible.

    1. On 2020-05-11 19:29:55, user Charles Warden wrote:

      Thank you for posting this pre-print.

      I have a some questions:

      1) Are the p-values significant after a Bonferroni correction?

      2) Are you focusing on APOE because it is the most significant result for a relatively common SNP?

      3) How are you defining the COVID-19 severity? Table 1 makes it look like you are comparing the proportion of positive cases for the 3 APOE genotype combinations (E3/E3, E3/E4, and E4/E4). However, that would be different than filtering for positive cases, and then looking for an association with a variable that describes the severity of the case.

      4) I thought it was good and interesting that you excluded subsets of individuals to try to check for confounders. However, it looks like the number of APOE E4/E4 goes from 37 total (with none removed due to dementia) to 22 and then back up to 32 and 35. If you want to adjust for all individuals with chronic diseases, then I would have expected that to be cumulative. What happens if you remove all of the patients with chronic disease and then test within the highest age range?

      5) I would expect most normalization to reduce but not completely remove the effect being adjusted for. Is is possible to look at older individuals as a separate bin (perhaps in a "Table 2") as evidence that the age-adjustment was effective? I could imagine this (along with what I suggested in 3)) might cause some issues with sample size, but there are usually some limitations for every study mentioned in the discussion.

      6) Is there any sort of independent validation that you can do in another cohort? As more cases are known, do you plan to check the subset of samples that currently test negative but later test positive as a type of test dataset?

      7) I usually think of the "Data Availability" as being for new data (rather than public data), but I am glad that you mentioned you UK Biobank application. However, since this wasn't quite what I expected in the Data/Code link, can you share the code that you used for analysis (assuming it can be reproduced by anyone else with similar access)?

      Thank you again for sharing your research.

    1. On 2020-04-12 13:04:20, user japhetk wrote:

      I think BCG studies' conclusions came from spurious correlations regardless of BCG has an effect or not.<br /> Anyway, now data from South America and Africa keeps coming and although, it may depend on the methods of analyses, my analyses show already the number death 13 days after the 100th case, and whether BCG is currently done is no longer significantly associated without correcting anything (p = 0.291, ANOVA).And after the number of tourists, population,total GDP, temperature of March, ratio of 65 years or older are corrected the associations show get even weaker (P = 0.621, ANCOVA).Among these covariates, the number of tourists has a robust significant effect on the number of deaths 13 days after the 100th case (0.00016), and the ratio of 65 years or older and population have significant effects, too (P= 0.024, 0.05, respectively). Total GDP (not GDP per capita) and the number of tourists have a close relationship (r = 0.82). <br /> The date when the 100th case was detected show more robust relationship with the BCG policy (currently performed or not), but after the correction of abovementioned covariates, this association also became insignificant )(p= 0.167). But this kind of relationship with the date of 100th case is seen in the case of variables that are specifically associated with Western countries, such as the consumption of wine)(the consumption of wine per capita shows robust association with the date of the 100th case after correction of population (p = 0.0002, more wine, the faster the detection of 100th case). <br /> So, my guess is that this spurious correlation mainly came from the fact the countries which abandaned BCG policies are more developed and more popular from tourists (which increased the faster and more and multiple spread of the virus) and also show greater aging (which increased the risk) and also they locate in western countries which were confident of their medical system and which were away from Asia and which were less alert to this infectious disease from China. The habit of wearing mask, hug, handshake or religious ceremonies might affect, too. <br /> In the cruise ship Diamond Princess, Japanese who were put in the same ship with Westerners show greater mortality rate than Westerners. And in a lot of Western European countries, the risk population (elderly) has experiences of BCG (they are classified as "past BCG", but in fact most of risk populations are experienced with BCG). So, the BCG hypothesis is not consistent with these facts, either. <br /> I am not saying BCG doesn't work, I am saying you cannot conclude anything from these uncontrolled studies which lacks in numerous potential confounding variables. Just let's wait for results of RCTs.

      Here's my data if I haven't made any mistakes.You can see the apparent little association with BCG policy and number of the death (13 days after the 100th case) as of 11th April.

      O Iran 291<br /> X Spain 288<br /> O China 259<br /> X Italy 233<br /> O Turkey 214<br /> O Algeria 130<br /> X United Kingdom 103<br /> O Indonesia 102<br /> O Brazil 92<br /> X France 91<br /> X Netherlands 76<br /> X United States 69<br /> O Dominican Republic 68<br /> X Ecuador 62<br /> O Portugal 60<br /> O Morocco 59<br /> O Philippines 54<br /> O Ukraine 45<br /> O Iraq 42<br /> O South Korea 35<br /> X Switzerland 33<br /> O Argentina 31<br /> O Egypt 30<br /> O Panama 30<br /> O India 29<br /> O Mexico 29<br /> X Canada 27<br /> O Hungary 26<br /> O Honduras 24<br /> O Peru 24<br /> O Romania 24<br /> O Albania 22<br /> O Greece 22<br /> O Ireland 22<br /> O Tunisia 22<br /> X Luxembourg 22<br /> O Bosnia and Herzegovina 21<br /> X Belgium 21<br /> O Burkina Faso 19<br /> O Macedonia 17<br /> X Andorra 17<br /> O Colombia 16<br /> O Poland 16<br /> O Afghanistan 15<br /> O Cuba 15<br /> O Moldova 15<br /> O Pakistan 13<br /> X Denmark 13<br /> O Bulgaria 10<br /> O Malaysia 10<br /> O Russia 10<br /> X Lebanon 10<br /> X Sweden 10<br /> O Lithuania 9<br /> O Mauritius 9<br /> O Azerbaijan 8<br /> X Austria 8<br /> X Israel 8<br /> O Chile 7<br /> O Kazakhstan 7<br /> O Venezuela 7<br /> X Australia 7<br /> O Croatia 6<br /> O Ghana 6<br /> O Japan 6<br /> O Thailand 6<br /> X Czech Republic 6<br /> X Norway 6<br /> O Jordan 5<br /> O South Africa 5<br /> O Sri Lanka 5<br /> O Taiwan 5<br /> O United Arab Emirates 5<br /> X Germany 5<br /> X Slovenia 5<br /> O Saudi Arabia 4<br /> O Uruguay 4<br /> O Armenia 3<br /> O Cote d'Ivoire 3<br /> O Uzbekistan 3<br /> X Finland 3<br /> O Costa Rica 2<br /> O Oman 2<br /> O Senegal 2<br /> O Estonia 1<br /> X New Zealand 1<br /> O Cambodia 0<br /> O Kuwait 0<br /> O Latvia 0<br /> O Malta 0<br /> O Qatar 0<br /> O Singapore 0<br /> O Vietnam 0<br /> X Slovakia 0

    1. On 2019-07-04 23:42:29, user Guyguy wrote:

      EVOLUTION OF THE EBOLA EPIDEMIC IN THE PROVINCES OF NORTH KIVU AND ITURI

      Thursday, July 4th, 2019

      The epidemiological situation of the Ebola Virus Disease dated 3 July 2019:<br /> Since the beginning of the epidemic, the cumulative number of cases is 2,382, of which 2,288 are confirmed and 94 are probable. In total, there were 1,606 deaths (1,512 confirmed and 94 probable) and 666 people healed.<br /> 420 suspected cases under investigation;<br /> 13 new confirmed cases, including 4 in Beni, 2 in Butembo, 2 in Katwa, 2 in Kalunguta, 1 in Mandima, 1 in Biena and 1 in Mabalako;<br /> 8 new confirmed cases deaths:<br /> 2 community deaths, including 1 in Butembo and 1 in Mandima;<br /> 6 deaths in Ebola Treatment Centers including 3 in Beni, 2 in Mabalako and 1 in Katwa;<br /> 11 people cured out of Ebola Treatment Center including 7 in Mabalako, 3 in Katwa and 1 in Beni. <br /> 128 Contaminated health workers: One health worker, vaccinated, is one of the new confirmed cases in Beni. The cumulative number of confirmed / probable cases among health workers is 128 (5% of all confirmed / probable cases) including 40 deaths.<br /> Source: Ministry of Health press team on the state of the response to the Ebola epidemic in the Democratic Republic of Congo.

    2. On 2019-07-21 03:12:01, user Guyguy wrote:

      EVOLUTION OF THE EBOLA EPIDEMIC IN THE PROVINCES OF NORTH KIVU AND ITURI

      Saturday, July 20th, 2019

      The epidemiological situation of the Ebola Virus Disease dated 19 July 2019:<br /> Since the beginning of the epidemic, the cumulative number of cases is 2,564, 2,470 confirmed and 94 probable. In total, there were 1,728 deaths (1,634 confirmed and 94 probable) and 726 people healed.<br /> 392 suspected cases under investigation;<br /> 18 new confirmed cases, including 7 in Beni, 3 in Mandima, 3 in Mabalako, 1 in Vuhovi, 1 in Butembo, 1 in Mambasa, 1 in Lubero and 1 in Masereka;<br /> 13 new confirmed cases deaths:<br /> 8 community deaths, including 4 in Beni, 2 in Mandima, 1 in Mabalako and 1 in Masereka;<br /> 5 Ebola Treatment Center (ETC) deaths, 2 in Mabalako, 2 in Beni and 1 in Katwa;<br /> 5 people recovered from ETCs, including 3 in Beni and 2 in Katwa.

      NEWS

      Minister of Health visits Beni<br /> The Minister of Health, Dr. Oly Ilunga Kalenga spent the day of Friday, July 19, 2019 in Beni where he visited the various field teams and the transit center whose capacity will be increased in the coming days.<br /> Following the resurgence of patients in Beni, Dr. Oly Ilunga said that one of the key lessons learned in this tenth epidemic is to rely on the health system. "If we really want to solve this epidemic and have a lasting impact, we need to strengthen the health system by working with the actors in this system and with the community," he said adding that this is how we can quickly stop this new outbreak in the city of Beni.<br /> He recalled that the declaration of this epidemic as an international public health emergency requires other countries to strengthen border surveillance, while for the response, the declaration recognizes the work that is being done and the performance of the response. managed to contain the epidemic in an extremely complex context.<br /> This statement also stresses the need for a response with greater coordination and consultation. Another point that Minister Oly Ilunga always insists on is the accountability of all actors on the ground, the sharing of information, the measurement of performance, and the use of data to guide and improve actions on ground.

      168,746 Vaccinated persons.

      76,632,731 Controlled people.

      138 Contaminated health workers<br /> The cumulative number of confirmed / probable cases among health workers is 138 (5% of all confirmed / probable cases) including 41 deaths.

      Source: Ministry of Health press team on the state of the response to the Ebola epidemic in the Democratic Republic of the Congo

    1. On 2019-08-03 19:56:40, user GuyguyKabundi Tshima wrote:

      EVOLUTION OF THE EBOLA EPIDEMIC IN THE PROVINCES OF NORTH KIVU AND ITURI

      Wednesday, July 31, 2019

      The Epidemiological Situation of Ebola Virus Disease, July 30, 2019

      Since the beginning of the epidemic, the cumulative number of cases is 2 701, of which 2 607 are confirmed and 94 are probable. In total, there were 1,813 deaths (1,719 confirmed and 94 probable) and 776 people healed.<br /> 293 suspected cases under investigation;<br /> 11 new confirmed cases, including 3 in Vuhovi, 1 in Mandima, 1 in Mambasa, 1 in Kalunguta and 1 in Nyiragongo (Goma);<br /> Continued search for the confirmed case in the health zone of Lubero dated 25/07/2019;<br /> 10 new confirmed cases deaths:<br /> 2 community deaths, including 1 in Beni and 1 in Mandima;<br /> 6 deaths at ETC, including 3 in Beni, 2 in Mabalako and 1 in Butembo;<br /> 2 deaths at the ETC of Beni;<br /> 6 people recovered from ETC, including 4 Mabalako, 1 in Katwa and 1 in Butembo;<br /> Two live health workers are among the new confirmed cases of Mambasa (non-vaccinated) and Vuhovi (vaccinated). The cumulative number of confirmed / probable cases among health workers is 148 (5% of all confirmed / probable cases), including 41 deaths.

      Organization of the Coordination Workshop for the Ebola Response to the Ebola Epidemic

      The Technical Secretariat of the Multi-sectoral Epidemic Response Committee of the EVD is organizing a coordination workshop from 31 July to 02 August 2019 to coordinate the response to the EVD epidemic at the Karibu Hotel in Goma in the province of North Kivu.<br /> This workshop aims to brief the Technical Secretariat of the Multisectoral Committee by coordinating the response on the organization of the current response in order to enable it to make informed decisions thus avoiding a major disruption of the response.<br /> It will enable the Technical Secretariat to inquire about the current epidemiological situation of EVD and the main challenges to be addressed, to learn about the current response structure (organization of the different levels of coordination) and the new strategic plan for the response (PSR4) and synergy with the security, humanitarian and financial sectors, as well as the operational readiness of DRC neighboring countries to create a favorable environment for the response.<br /> It will also allow to discuss challenges and perspectives related to priority themes (pillars). This workshop will result in the priority actions to be carried out over the next 90 days and the overall orientations on the response, as well as the new organizational structure of the response.<br /> It should be noted that under SRP-4, effective and coherent change in strategies, effective coordination, consistent standards and support for the most vulnerable communities are envisaged at risk in the provinces of North Kivu and Ituri while preventing the spread of the epidemic in other provinces and countries bordering the DRC

      Death of the second confirmed case of Ebola in Goma

      The second confirmed Ebola case from Goma died on Wednesday 31 July 2019 at the ETC Nyiragongo of Goma located in the General Reference Hospital of this city.<br /> This last case of Goma is a patient, who began to present the symptoms of EVD on July 22, 2019. On July 30, 2019 he went to the Goma General Referral Hospital (HGR) located in the Nyiragongo Health Zone, where he was directly transferred to the ETC for appropriate care. The ETC, being installed within this HGR.<br /> Previously, he was treated as an outpatient by a nurse in a private community health center in the Nyiragongo Health Zone.

      180,558<br /> Vaccinated persons<br /> The only vaccine to be used in this outbreak is the rVSV-ZEBOV vaccine, manufactured by the pharmaceutical group Merck, following approval by the Ethics Committee in its decision of 19 May 2018.

      80,118,963<br /> Controlled people<br /> 80 entry points (PoE) and operational health checkpoints (PoC)

      148<br /> Contaminated health workers<br /> Two live health workers are among the new confirmed cases of Mambasa (non-vaccinated) and Vuhovi (vaccinated).<br /> The cumulative number of confirmed / probable cases among health workers is 148 (5% of all confirmed / probable cases), including 41 deaths.

      Source: The press team of the Ministry of Health.

    2. On 2019-10-16 12:50:12, user GuyguyKabundi Tshima wrote:

      EVOLUTION OF THE EPIDEMIC IN THE PROVINCES OF NORTH KIVU AND ITURI AS AT OCTOBER 12, 2019<br /> Sunday, October 13, 2019<br /> Since the beginning of the epidemic, the cumulative number of cases is 3,218, of which 3,104 confirmed and 114 probable. In total, there were 2,150 deaths (2036 confirmed and 114 probable) and 1032 people healed.<br /> 429 suspected cases under investigation;<br /> 6 new confirmed cases to CTEs, including;<br /> 4 in North Kivu, including 2 in Beni and 2 in Kalunguta<br /> 2 in Ituri, including 1 in Mandima and 1 in Nyakunde;<br /> 2 new confirmed deaths, including:<br /> 1 community death in North Kivu in Kalunguta;<br /> 1 new confirmed death in CTE in North Kivu in Beni;<br /> 1 person healed out of CTE in Ituri in Mambasa;<br /> No health workers are among the newly confirmed cases. The cumulative number of confirmed / probable cases among health workers is 161 (5% of all confirmed / probable cases), including 41 deaths.

      NEWS

      New health area infected with Ebola virus in Ituri<br /> - A new Health Area has been affected by Ebola Virus Disease in Ituri. This is the Maroro Health Area in the Nyakunde Health Zone;<br /> - Indeed, Nyakunde was already at 294 days without notifying a new confirmed case of the EVD and returned to zero following this new affection;<br /> - Of all the 6 cases reported this Sunday, October 13, 2019, none of them were listed as contact, nor monitored regularly or vaccinated;<br /> - It is also reported that the alerts of all these cases are coming back from the community and their contacts are being listed, the investigations are continuing, the decontamination of the patients' households is being carried out and the ring of vaccination has been opened around all these cases.

      VACCINATION

      • Since the beginning of vaccination on August 8, 2018, 237,632 people have been vaccinated;
      • The only vaccine to be used in this outbreak is the rVSV-ZEBOV vaccine, manufactured by the pharmaceutical group Merck, following approval by the Ethics Committee in its decision of 20 May 2018.

      MONITORING AT ENTRY POINTS

      • Since the beginning of the epidemic, the total number of travelers checked (temperature rise) at the sanitary control points is 105,518,454 ;
      • To date, a total of 111 entry points (PoE) and sanitary control points (PoCs) have been set up in the provinces of North Kivu and Ituri to protect the country's major cities and prevent the spread of the epidemic in neighboring countries.

      As a reminder, the recommendations of the MULTISECTORAL COMMITTEE OF THE RESPONSE TO EBOLA VIRUS DISEASE are as follows:

      1. Follow basic hygiene practices, including regular hand washing with soap and water or ashes;
      2. If an acquaintance from an epidemic area comes to visit you and is ill, do not touch her and call the North Kivu Civil Protection toll-free number;
      3. If you are identified as a contact of an Ebola patient, agree to be vaccinated and followed for 21 days;
      4. If a person dies because of Ebola, follow the instructions for safe and dignified burials. It is simply a funeral method that respects funerary customs and traditions while protecting the family and community from Ebola contamination.
      5. For all health professionals, observe the hygiene measures in the health centers and declare any person with symptoms of # Ebola (fever, diarrhea, vomiting, fatigue, anorexia, bleeding).<br /> If all citizens respect these health measures recommended by the Secretariat, it is possible to quickly end this 10th epidemic.
    3. On 2019-11-14 14:53:08, user Guyguy wrote:

      EVOLUTION OF THE EPIDEMIC IN THE PROVINCES OF NORTH KIVU AND ITURI ON NOVEMBER 12, 2019

      Wednesday, November 13, 2019

      • Since the beginning of the epidemic, the cumulative number of cases is 3,291, of which 3,173 are confirmed and 118 are probable. In total, there were 2,193 deaths (2075 confirmed and 118 probable) and 1067 people cured.<br /> • 508 suspected cases under investigation;<br /> • 4 new confirmed cases in North Kivu, including 2 in Beni and 2 in Mabalako;<br /> • No new deaths of confirmed cases have been recorded;<br /> • No cured person has emerged from ETCs;<br /> • No health worker is among the new confirmed cases. The cumulative number of confirmed / probable cases among health workers is 163 (5% of all confirmed / probable cases), including 41 deaths;

      NEWS

      Ebola Virus Disease Response Coordinator Meeting with North Kivu National Assembly Vice President on J & J Vaccine

      • The General Coordinator for the Ebola Response to the Ebola Virus Disease, Prof. Steve Ahuka Mundeke, accompanied by a joint team of some members of the response and the consortium (National Institute of Biomedical Research-INRB, MSF / France and the London School), met this Wednesday, November 13, 2019 the Vice President of the North Kivu Provincial Assembly, the Honorable Jean-Paul Lumbulumbu, with whom they discussed the second Ebola vaccine called Johnson & Johnson.

      • The Professor Steve Ahuka Mundeke, who requested the involvement of elected representatives in the community mobilization for this vaccination, welcomed the availability of the Provincial Assembly of North Kivu to support the activities that will begin on Thursday, November 14, 2019 in two health areas of Karisimbi, namely Kahembe and Majengo in North Kivu Province;

      • In addition, the Honorable Jean-Paul Lumbulumbu promised to be among the first people to be vaccinated with the Johnson & Johnson vaccine, including members of the North Kivu Provincial Assembly, to serve as an example for their bases. To this end, he invited the people of North Kivu, particularly the sites concerned, to be vaccinated in order to protect themselves against any possible epidemic of the Ebola Virus Disease;

      • Also in the context of the introduction of this second vaccine, a briefing session was organized on the same Wednesday in the meeting room of the general coordination of the response in Goma, for members of the Risk Communication. and community engagement (CREC) with some partners from the Ministry of Health.<br /> Training of Beni journalists on their role and responsibility in public health emergencies.

      • The role and responsibilities of the journalist in the treatment of news in a public health crisis is at the center of this workshop held from 12 to 14 November 2019 in Beni, North Kivu Province;

      • This workshop aims to equip about twenty media professionals with essential notions related to the treatment of information during a health crisis;

      • At the opening of this meeting, the feather knights were trained on the risk communication related to Ebola virus disease and on the usual concepts in the response to this disease;

      • The two speakers of the day, Dr. Bibiche Matadi, who is responsible for the surveillance pillar at the sub-coordination of the Beni response and Mr. Rodrigue BARRY of the WHO, emphasized the quality of the message to be given to because, according to them, the eradication of this epidemic is based on mastery of all contacts and on community involvement;

      • The second day focused on journalist ethics and deontology in times of health crisis and on health - communication - media interaction;

      • For this second topic, Ms. Miphy Buata, a journalist with the Congolese News Agency and communications officer of the Multisectoral Committee for the Response to the Ebola Virus Epidemic, recalled that the media remains the only channel of choice to restore and build trust between the (recipient) community and the health sector (Issuer), particularly in the context of Ebola Virus Disease;

      • This workshop was organized by the Ministry of Health with WHO and was facilitated by UNICEF.

      VACCINATION

      • Since the start of vaccination on August 8, 2018, 251,079 people have been vaccinated;

      • The only vaccine to be used in this outbreak was the rVSV-ZEBOV vaccine, manufactured by the pharmaceutical group Merck, following approval by the Ethics Committee in its decision of 20 May 2018.

      MONITORING AT ENTRY POINTS

      • Since the beginning of the epidemic, the total number of travelers checked (temperature measurement ) at the sanitary control points is 116,596,285 ;

      • To date, a total of 112 entry points (PoE) and sanitary control points (PoCs) have been set up in the provinces of North Kivu and Ituri to protect the country's major cities and prevent the spread of the epidemic in neighboring countries.

      As a reminder, the recommendations of the MULTISECTORAL COMMITTEE OF THE RESPONSE TO EBOLA VIRUS DISEASE are as follows:

      1. Follow basic hygiene practices, including regular hand washing with soap and water or ashes;
      2. If an acquaintance from an epidemic area comes to visit you and is ill, do not touch her and call the North Kivu Civil Protection toll-free number;
      3. If you are identified as a contact of an Ebola patient, agree to be vaccinated and followed for 21 days;
      4. If a person dies because of Ebola, follow the instructions for safe and dignified burials. It is simply a funeral method that respects funerary customs and traditions while protecting the family and community from Ebola contamination.
      5. For all health professionals, observe the hygiene measures in the health centers and declare any person with symptoms of # Ebola (fever, diarrhea, vomiting, fatigue, anorexia, bleeding).<br /> If all citizens respect these health measures recommended by the Secretariat, it is possible to quickly end this 10th epidemic.
    4. On 2019-10-10 12:11:25, user GuyguyKabundi Tshima wrote:

      EPIDEMIOLOGICAL SITUATION

      EVOLUTION OF THE EPIDEMIC IN THE PROVINCES OF NORTH KIVU AND ITURI AT OCTOBER 06, 2019<br /> Monday, October 07, 2019<br /> Since the beginning of the epidemic, the cumulative number of cases is 3,205, of which 3,091 are confirmed and 114 are probable. In total, there were 2,142 deaths (2028 confirmed and 114 probable) and 1006 people healed.<br /> 363 suspected cases under investigation;<br /> 1 new confirmed case at CTE in North Kivu at Oicha;<br /> No new confirmed deaths<br /> 2 people healed from Butembo CTE;<br /> No health workers are among the newly confirmed cases. The cumulative number of confirmed / probable cases among health workers is 161 (5% of all confirmed / probable cases), including 41 deaths.

      NEWS

      7 people healed from Ebola Virus Disease released Monday at Komanda CTE<br /> - A total of 7 people cured of Ebola Virus Disease were released on Monday October 7th at the Ebola Treatment Center (ETC) in Komanda. ;<br /> - This is 4 people from Mambasa and 3 cases from Komanda Health Zone to whom discharge certificates were given by the director of this Ebola Treatment Center<br /> - This certificate of discharge bears as inscription: "On the date of issue of this document the bearer of this certificate does not present any risk of contaminating other people, because his test was negative for the Ebola virus disease. He / she is thus DECLARE GUERI (E) . His current state of health is not a danger to the community. That is why he / she can return to his household and his professional environment to continue the daily activities. The family, the community and the authorities are asked to welcome him to promote his social integration ".

      VACCINATION

      • Continuation of vaccination around the confirmed case of 04 October 2019 in the Tenambo Health Area in Oicha, North Kivu;
      • Continuation of the vaccination of newly recruited front-line staff at the General Reference Hospitals of Katwa and Kyondo in North Kivu;
      • Since vaccination began on 8 August 2018, 234,693 people have been vaccinated;
      • The only vaccine to be used in this outbreak is the rVSV-ZEBOV vaccine, manufactured by the pharmaceutical group Merck, following approval by the Ethics Committee in its decision of 20 May 2018.

      MONITORING AT ENTRY POINTS

      • Since the beginning of the epidemic, the total number of travelers checked (temperature increase) at the sanitary control points is 103,167,809 ;
      • To date, a total of 111 entry points (PoE) and sanitary control points (PoCs) have been set up in the provinces of North Kivu and Ituri to protect the country's major cities and prevent the spread of the epidemic in neighboring countries.

      As a reminder, the recommendations of the MULTISECTORAL COMMITTEE OF THE RESPONSE TO EBOLA VIRUS DISEASE are as follows:

      1. Follow basic hygiene practices, including regular hand washing with soap and water or ashes;
      2. If an acquaintance from an epidemic area comes to visit you and is ill, do not touch her and call the North Kivu Civil Protection toll-free number;
      3. If you are identified as a contact of an Ebola patient, agree to be vaccinated and followed for 21 days;
      4. If a person dies because of Ebola, follow the instructions for safe and dignified burials. It is simply a funeral method that respects funerary customs and traditions while protecting the family and community from Ebola contamination.
      5. For all health professionals, observe the hygiene measures in the health centers and declare any person with symptoms of # Ebola (fever, diarrhea, vomiting, fatigue, anorexia, bleeding).<br /> If all citizens respect these health measures recommended by the Secretariat, it is possible to quickly end this 10th epidemic.
    5. On 2019-10-17 18:36:39, user GuyguyKabundi Tshima wrote:

      EVOLUTION OF THE EPIDEMIC IN THE PROVINCES OF NORTH KIVU AND ITURI AS OF OCTOBER 15, 2019

      Wednesday, October 16, 2019<br /> Since the beginning of the epidemic, the cumulative number of cases is 3,227, of which 3,113 are confirmed and 114 are probable. In total, there were 2,154 deaths (2040 confirmed and 114 probable) and 1038 people healed.<br /> 530 suspected cases under investigation;<br /> 3 new confirmed cases, including:<br /> No cases in North Kivu;<br /> 3 in Ituri in Mandima;<br /> 1 new confirmed death, of which:<br /> 1 community death in Ituri in Mandima;<br /> No confirmed deaths;<br /> 2 people healed from the CTE in Ituri in Mambasa;<br /> No health workers are among the newly confirmed cases. The cumulative number of confirmed / probable cases among health workers is 161 (5% of all confirmed / probable cases), including 41 deaths.

      NEWS

      The state of play of the response at the center of an interview in Goma between the Technical Secretary of the CMRE and the United Nations Emergency Coordinator for Ebola<br /> - The Technical Secretary of the Multisectoral Committee on Epidemic Response to Ebola Virus Disease (ST / CMRE), Prof. Jean Jacques Muyembe Tamfum, granted a hearing on Wednesday, October 16, 2019 in Goma to the United Nations Emergency Coordinator for Ebola;<br /> - During their meeting, the two personalities discussed the state of play of the response to the 10th Ebola Virus Disease outbreak and the security situation in the areas affected by this epidemic;<br /> - It should be noted that the 10th Ebola epidemic has been taking place in the Democratic Republic of the Congo in areas of armed conflict, particularly in the provinces of North Kivu and Ituri, for more than a year;<br /> - Some time before this meeting, the technical secretary of the Multisectoral Committee for the Response to the Ebola Virus Disease Epidemic (ST / CMRE), Prof. Muyembe Tamfum, who is currently staying in Goma, North Kivu to inquire about the evolution of the response, chaired the morning meeting of the general coordination of the Ebola response to the epidemic.

      Pygmies at Mahombo camp in Mambasa territory in Ituri pledge to fight Ebola Virus Disease

      • The pygmies residing in Mahombo camp located more than 30 minutes walk from the main road from the village Nyangwe in the territory of Mambasa in ITURI, pledged Tuesday, October 15, 2019 to fight against Ebola by raising alerts with teams of the response;<br /> This commitment is the result of awareness raising by the Community Risk and Commitment (CREC) teams for 79 pygmies about the generalities of the Ebola virus disease, its methods of prevention and contamination;<br /> Pygmies have, for this purpose, asked for hand washing kits to break the chain of transmission of the Ebola virus in their respective communities.

      VACCINATION

      • Since vaccination began on 8 August 2018, 238,700 people have been vaccinated;
      • The only vaccine to be used in this outbreak is the rVSV-ZEBOV vaccine, manufactured by the pharmaceutical group Merck, following approval by the Ethics Committee in its decision of 20 May 2018.

      MONITORING AT ENTRY POINTS

      • The Governor of North Kivu, Carly Nzanzu Kasivita accompanied by a strong delegation, visited Maboya Control Points (PoCs) in Kalunguta and Kangote in Butembo in North Kivu Province;
      • The providers of the Mususa Point of Control (PoC) in Butembo, North Kivu, in collaboration with the Ndondo Primary School and the Kyambogho School Complex, participated in a mass sensitization session (travelers and riverside population) under the theme " All Eliminate Ebola Virus Disease "on International Handwashing Day;
      • Since the beginning of the epidemic, the total number of travelers checked (temperature rise) at the sanitary control points is 106.625.956 ;
      • To date, a total of 111 entry points (PoE) and sanitary control points (PoCs) have been set up in the provinces of North Kivu and Ituri to protect the country's major cities and prevent the spread of the epidemic in neighboring countries.

      As a reminder, the recommendations of the MULTISECTORAL COMMITTEE OF THE RESPONSE TO EBOLA VIRUS DISEASE are as follows:

      1. Follow basic hygiene practices, including regular hand washing with soap and water or ashes;
      2. If an acquaintance from an epidemic area comes to visit you and is ill, do not touch her and call the North Kivu Civil Protection toll-free number;
      3. If you are identified as a contact of an Ebola patient, agree to be vaccinated and followed for 21 days;
      4. If a person dies because of Ebola, follow the instructions for safe and dignified burials. It is simply a funeral method that respects funerary customs and traditions while protecting the family and community from Ebola contamination.
      5. For all health professionals, observe the hygiene measures in the health centers and declare any person with symptoms of # Ebola (fever, diarrhea, vomiting, fatigue, anorexia, bleeding).<br /> If all citizens respect these health measures recommended by the Secretariat, it is possible to quickly end this 10th epidemic.
    6. On 2019-11-17 04:20:48, user GuyguyKabundi Tshima wrote:

      EVOLUTION OF THE EPIDEMIC IN THE PROVINCES OF NORTH KIVU AND ITURI AS AT NOVEMBER 15, 2019<br /> Saturday, November 16, 2019<br /> • Since the beginning of the epidemic, the cumulative number of cases is 3,292, of which 3,174 are confirmed and 118 are probable. In total, there were 2,195 deaths (2077 confirmed and 118 probable) and 1070 people healed.<br /> • 517 suspected cases under investigation;<br /> • No new confirmed cases;<br /> • No new deaths of confirmed cases have been recorded;<br /> • No cured person has emerged from CTEs;<br /> • No health worker is among the new confirmed cases. The cumulative number of confirmed / probable cases among health workers is 163 (5% of all confirmed / probable cases), including 41 deaths.

      NEWS

      Goma opens leadership capacity building workshop for Ebola epidemic response to Ebola Virus Disease.

      • The coordinator of the epidemic response to Ebola Virus Disease in North and South Kivu Province and Ituri, Prof. Steve Ahuka Mundeke, opened this Saturday, November 16, 2019 in Goma North Kivu a workshop on building the capacity of actors involved in the response against Ebola;<br /> • For four days, participants, coordinating and sub-coordinating officers from the response, the Ministry of Health, the World Health Organization (WHO), national security, CDC and DFID will be equipped with management skills epidemics before, during and after the tenth epidemic of Ebola Virus Disease, especially in the event of any outbreak;<br /> • According to Prof. Ahuka, this workshop will not only benefit this epidemic, but will help, through acquired skills, to cope with other epidemics or other crises in a collective and individual way. " Each participant will be able to use these skills in his daily life ," he concluded;<br /> • This training for the response officers, from 16 to 20 November 2019, is organized by the Ministry of Health in collaboration with WHO with funding from UKaid from the British people.

      VACCINATION

      • 93 people were vaccinated with the 2nd Ad26.ZEBOV / MVA-BN-Filo vaccine (Johnson & Johnson) in the two Health Zones of Karisimbi in Goma;<br /> • Since the start of vaccination on August 8, 2018 with the rVSV-ZEBOV vaccine, 252,835 people have been vaccinated;<br /> • Approved October 22, 2019 by the Ethics Committee of the School of Public Health of the University of Kinshasa and October 23, 2019 by the National Ethics Committee, the second vaccine, called Ad26.ZEBOV / MVA-BN -Filo, is produced by Janssen Pharmaceuticals for Johnson & Johnson;<br /> • This new vaccine complements the first, the rVSV-ZEBOV, vaccine used until then (since August 08, 2018) in this outbreak, manufactured by the pharmaceutical group Merck, after approval of the Ethics Committee on May 20, 2018. It has recently been approved.

      MONITORING AT ENTRY POINTS

      • Since the beginning of the epidemic, the total number of travelers checked (temperature rise) at the sanitary control points is 117,333,420 ;<br /> • To date, a total of 112 entry points (PoE) and sanitary control points (PoCs) have been set up in the provinces of North Kivu and Ituri to protect the country's major cities and prevent the spread of the epidemic in neighboring countries.

      As a reminder, the recommendations of the MULTISECTORAL COMMITTEE OF THE RESPONSE TO EBOLA VIRUS DISEASE are as follows:

      1. Follow basic hygiene practices, including regular hand washing with soap and water or ashes;
      2. If an acquaintance from an epidemic area comes to visit you and is ill, do not touch her and call the North Kivu Civil Protection toll-free number;
      3. If you are identified as a contact of an Ebola patient, agree to be vaccinated and followed for 21 days;
      4. If a person dies because of Ebola, follow the instructions for safe and dignified burials. It is simply a funeral method that respects funerary customs and traditions while protecting the family and community from Ebola contamination.
      5. For all health professionals, observe the hygiene measures in the health centers and declare any person with symptoms of # Ebola (fever, diarrhea, vomiting, fatigue, anorexia, bleeding).<br /> If all citizens respect these health measures recommended by the Secretariat, it is possible to quickly end this 10th epidemic.
    7. On 2019-11-30 16:39:58, user Guyguy wrote:

      EVOLUTION OF THE EPIDEMIC IN THE PROVINCES OF NORTH KIVU AND ITURI AT NOVEMBER 26, 2019<br /> Wednesday, November 27, 2019<br /> • Since the beginning of the epidemic, the cumulative number of cases is 3,304, of which 3,186 are confirmed and 118 are probable. In total, there were 2,199 deaths (2081 confirmed and 118 probable) and 1077 people cured.<br /> • 366 suspected cases under investigation;<br /> • No new confirmed cases;<br /> • No new deaths among confirmed cases;<br /> • No cured person has emerged from CTEs;<br /> • No health worker is among the new confirmed cases. The cumulative number of confirmed / probable cases among health workers is 163 (5% of all confirmed / probable cases), including 41 deaths.

      NEWS

      Closure of training of Ebola Rapid Response Teams in Goma

      • The Ebola response coordinator for the Ebola response to operations, Dr. Luigino Mikulu, closed on Wednesday 27 November 2019 the training of Rapid Response Teams (RRTs), composed of units of the Armed Forces. (FARDC) and the Congolese National Police (PNC), on the Ebola virus disease that took place in Goma, capital of North Kivu Province from 22 to 26 November 2019;<br /> • For Dr. Luigino, this team is the first in the Rapid Response Teams to be composed of elements from other sectors, such as those of the Ministries of Defense and Security and the Ministry of the Interior;<br /> • This training aligns with the vision of the Technical Secretariat of the Multisectoral Ebola Virus Disease Response Committee (ST / CMRE), through the overall coordination of the response, to expand its mixed and multidisciplinary teams available and able to intervene 24 hours a day, 7 days a week and everywhere, where they will be deployed, not only for the response to this epidemic to Ebola Virus Disease, but also for other epidemics;<br /> • This training was a pride for WHO to accompany the Ministry of Health in order to capitalize the capacity building of FARDC and PNC units in public health;<br /> • The participants, in turn, reassured the overall coordination of the response, the Ministry of Health and all those who contributed to the delivery of this training, particularly to WHO and all facilitators, to be faithful disciples in the field by putting into practice all the notions learned during these sessions;<br /> • At the end of this training, the thirty participants, including the facilitators, received a participation certificate.

      VACCINATION

      • Despite the tense situation of the city of Beni, a vaccination ring was opened around the confirmed case of 24 October 2019 in the Kanzulinzuli Health Area of the General Reference Hospital;<br /> • 724 people were vaccinated, until Tuesday, November 26, 2019, with the 2nd Ad26.ZEBOV / MVA-BN-Filo vaccine (Johnson & Johnson) in the two health zones of Karisimbi in Goma;<br /> • Since the start of vaccination on August 8, 2018 with the rVSV-ZEBOV vaccine, 255,247 people have been vaccinated;<br /> • Approved October 22, 2019 by the Ethics Committee of the School of Public Health of the University of Kinshasa and October 23, 2019 by the National Ethics Committee, the second vaccine, called Ad26.ZEBOV / MVA-BN -Filo, is produced by Janssen Pharmaceuticals for Johnson & Johnson;<br /> • This new vaccine is in addition to the first, the rVSV-ZEBOV, vaccine used until then (since August 08, 2018) in this epidemic manufactured by the pharmaceutical group Merck, after approval of the Ethics Committee on May 20, 2018. has recently been pre-qualified for registration.

      MONITORING AT ENTRY POINTS

      • Sanitary control activities are disrupted in the towns of Beni and Butembo in North Kivu province following demonstrations by the population which decries killings of civilians;<br /> • Since the beginning of the epidemic, the total number of travelers checked (temperature measurement ) at the sanitary control points is 121,159,810 ;<br /> • To date, a total of 109 entry points (PoE) and sanitary control points (PoCs) have been set up in the provinces of North Kivu and Ituri to protect the country's major cities and prevent the spread of the epidemic in neighboring countries.

      As a reminder, the recommendations of the MULTISECTORAL COMMITTEE OF THE RESPONSE TO EBOLA VIRUS DISEASE are as follows:

      1. Follow basic hygiene practices, including regular hand washing with soap and water or ashes;
      2. If an acquaintance from an epidemic area comes to visit you and is ill, do not touch her and call the North Kivu Civil Protection toll-free number;
      3. If you are identified as a contact of an Ebola patient, agree to be vaccinated and followed for 21 days;
      4. If a person dies because of Ebola, follow the instructions for safe and dignified burials. It is simply a funeral method that respects funerary customs and traditions while protecting the family and community from Ebola contamination.
      5. For all health professionals, observe the hygiene measures in the health centers and declare any person with symptoms of # Ebola (fever, diarrhea, vomiting, fatigue, anorexia, bleeding).<br /> If all citizens respect these health measures recommended by the Secretariat, it is possible to quickly end this 10th epidemic.
    8. On 2020-01-07 12:53:20, user Guyguy wrote:

      EVOLUTION OF THE EPIDEMIC IN THE PROVINCES OF NORTH KIVU AND ITURI ON 05 JANUARY 2020

      Monday, January 06, 2020

      • Since the start of the epidemic, the cumulative number of cases has been 3,390, including 3,272 confirmed and 118 probable. In total, there were 2,233 deaths (2,115 confirmed and 118 probable) and 1,114 people healed;<br /> • 373 suspected cases under investigation;<br /> • 2 new confirmed cases in Ituri in Mambasa;<br /> • No new deaths among the confirmed cases, including:<br /> o No community deaths have been recorded;<br /> o No death among the confirmed cases;<br /> • No healed person has left the CTE;<br /> • No health worker is among the new confirmed cases. The cumulative number of confirmed / probable cases among health workers is 164 (approximately 5% of all confirmed / probable cases), including 41 deaths;<br /> • Mambasa again reported a confirmed case after 66 days of silence.

      NEWS<br /> Organization of an evaluation session of awareness-raising activities in the Malepe health area in Beni<br /> • The sub-coordination of the response to the Ebola virus disease epidemic organized this Monday 06 December 2020 an evaluation session of awareness-raising activities in the Malepe health area in Beni;<br /> • According to the Coordinator of this Sub-coordination, Dr. Pierre-Céleste Adikey, this evaluation aims to intensify surveillance around visitors and raise alerts. These strategies, he said, will strengthen measures to protect the City against any possible reinfection of the City;<br /> • On this occasion, it was announced the resumption of free healthcare within the Malepe health center with the support of the NGO ALIMA which, from now on, provides drugs for the free care of the sick;<br /> • In addition, the Ebola Treatment Center (CTE) in Mangina unloaded the first eight Ebola winners in 2020 on Monday. These survivors, who were reintegrated into their respective communities, notably in Aloya / Canteen, testified to good care within this CTE.

      VACCINATION<br /> • 4,802 people were vaccinated, until January 2, 2020, with the 2nd vaccine Ad26.ZEBOV / MVA-BN-Filo (Johnson & Johnson) in the two health areas from Karisimbi to Goma;<br /> • Since the start of vaccination on August 8, 2018 with the rVSV-ZEBOV vaccine, 261,596 people have been vaccinated;<br /> • Approved on October 22, 2019 by the Ethics Committee of the School of Public Health at the University of Kinshasa and on October 23, 2019 by the National Ethics Committee, the second vaccine, called Ad26.ZEBOV / MVA-BN -Filo, is produced by Janssen Pharmaceuticals for Johnson & Johnson;<br /> • This new vaccine complements the first, rVSV-ZEBOV, a vaccine used until then (since August 08, 2018) in this epidemic manufactured by the pharmaceutical group Merck, after approval by the Ethics Committee on May 20, 2018. It was recently pre-qualified for certification.

      ENTRY POINT SURVEILLANCE<br /> • Since the start of the epidemic, the total number of travelers checked (temperature measurement ) at health checkpoints has been 135,503,900 ;<br /> • To date, a total of 109 entry points (PoE) and health control points (PoC) have been established in the provinces of North Kivu and Ituri in order to protect the country's major cities and avoid the spread of the epidemic in neighboring countries.

      As a reminder, the recommendations of the MULTISECTORAL COMMITTEE OF THE EBOLA VIRUS DISEASE RESPONSE are as follows:

      1. Respect basic hygiene measures, in particular regular hand washing with water and soap or ash;
      2. If an acquaintance from an epidemic area comes to visit you and that he is sick, do not touch him and call the toll-free number for civil protection in North Kivu;
      3. If you are identified as a contact with an Ebola patient, agree to be vaccinated and followed for 21 days;
      4. If someone dies due to Ebola, follow the guidelines for dignified and secure burials. It is simply a mode of burial that respects funeral customs and traditions while protecting the family and the community from Ebola contamination.
      5. For all health professionals, observe hygiene measures in health centers and report any sick person showing symptoms of Ebola (fever, diarrhea, vomiting, fatigue, anorexia, bleeding).<br /> If all citizens respect these health measures recommended by the Secretariat, it is possible to quickly end this 10th epidemic.
    1. On 2020-05-07 00:09:49, user Charles Warden wrote:

      I think I saw something roughly similar in this Tweet:

      https://twitter.com/manuelr...

      However, I have the following questions:

      1) How are you taking into consideration lack of exposure? If you looked for a difference in prognosis among infected individuals, then that would provide a control that you know all individuals have been exposed to the virus. I realize this may not be exactly what you are looking for, but I would expect a small proportion of individuals having been exposed to the virus will make achieving significance for infected versus uninfected individuals more difficult.

      2) If you had antibody results, maybe this would help (even if that is also not perfect), but my understanding is that you are also not using that as a filter (which I am guessing is not available)?

      3) It looks like you considered Exome data. I think that this may be good because I would have guessed you might miss a signal with SNP chip data, if the relevant variants are not common (or at least not well characterized as part of larger haplotypes). However, is it possible that variant calling for most genes is less optimal with these genes? Is there any way to go back to the raw data and see if the variant calling strategy can change anything among infected individuals?

      4) If all of the above criteria are meet, do you need to consider non-genetic risk factors (such as age) into your model?

      5) A lack of a significant result is not the same as saying with high confidence that a hypothesis cannot be true. I think that you should communicate what you have observed in some way, but I think some caution might be needed to avoid confusion. For example, a reader from the general public might think you are confident that you have found results that contradict reports that ACE2 (and/or TMPRSS2) may be important for COVID-19 infections. My guess is this is not what you meant, but I wonder if the limitations to these results need to be emphasized more.

      If this provides me a way to ask these questions in a way that gets less attention from the general public, then I think it is good that you posted these results. Discussion about possible implications could be important, but my understanding is that this does not mean that this is strong evidence that the current public health recommendations should be changed (and I don’t want to cause any unnecessary confusion).

    1. On 2020-02-02 15:58:08, user Martin Modrák wrote:

      Summary: The provided analysis can IMHO be a helpful complement to other efforts to estimate incubation rate of 2019-nCoV. The uncertainty of estimates of incubation rate and other intervals provided in the abstract is likely greater then what is reported, the numbers thus should be treated with caution. Only cases outside of Wuhan up to January 24th 2020 are included (31 - 43 cases are available for the individual subanalyses).

      This review has been crossposted on pubPeer, medRxiv, prereview.org.

      Disclaimer: I lack background in epidemiology to let me evaluate whether the proposed modelling approach is a standard one, if much better tools are available or if there are possible issues with the underlying data. In the following, I therefore focus primarily on the statistical aspects of the method employed, without considering alternative approaches.

      The big picture:<br /> The main idea of the preprint is to use cases of 2019-nCoV reported in patients that spent only a short time in Wuhan to estimate incubation rate. The underlying assumption is that those patients could have been exposed to the virus only during their stay in Wuhan.

      Strengths:<br /> The approach is interesting in that it removes the need to directly guess when/how the patients got into contact with the virus. It is also conceptually simple and requires few additional assumptions.

      I find it great that multiple models for the time intervals are tested and reported. The fact that the models mostly agree increases my confidence in the results.

      I further congratulate the authors on being able to put the analysis together very quickly and provide a clear and concise manuscript. I am thankful they posted their results publicly as soon as possible.

      Limitations:<br /> The main disadvantage of the chosen approach is that it let's the authors to only use a fraction of the reported cases and that the approach is only valid on data from the early phase of the epidemic. Once more cases happen outside Wuhan, the number patients who have become infected elsewhere will increase and the approach of this preprint will no longer be applicable. This is however not strongly detrimental to the manuscript and it could hopefully serve as one of many approaches to estimate the characteristics of 2019-nCoV, each with its own strenghts and limitations.

      There are however some specific points I find problematic in the manuscript.

      1) Using AIC for model selection might be brittle, especially since the differences in AIC are very small and the AIC itself is a noisy measure. Using some sort of model averaging and/or stacking would likely be beneficial.

      2) Also, no explicit effort to verify that the models used are appropriate has been reported. A simple model check would be to overlay the actual data over Figure 1 (e.g. the empirical CDF produced assuming both exposure and onset happend in the middle of the interval). Similar effort could be useful for Figure 2.

      3) Taking 1 and 2 together implies there is substantial uncertainty about which model is the best. Further, no strong guarantees that at least one of the proposed models is appropriate were given. The uncertainty bounds computed using only the "best fit" model are therefore certainly overly optimistic as they ignore this uncertainty. While this is challenging to account for mathematically, I believe it should be reported prominently in the manuscript to avoid confusion.

      4) While using only visitors to Wuhan makes sense to estimate the incubation period, the estimates of time from illness onset to hospitalization and/or death would likely benefit from including all cases. I don't see why only using cases outside of Wuhan for these other estimates is beneficial. I can however see why incubation period might be the primary focus of the paper and therefore a dataset with cases in Wuhan was not constructed.

      5) For some reason the link to supplementary data is broken (probably not author's fault), so I cannot investigate the dataset. Code is also not available so it is hard to judge the modelling approach in detail.<br /> I have contacted the authors and will update this review if I receive that data and/or code.

      The only issue I feel strongly about in this manuscript is with the abstract, which should IMHO clearly state that only a small number of cases has been used and that the uncertainty is likely larger than what was computed. Otherwise the paper seems to be a good contribution to the global effort to understand 2019-nCoV.

    1. On 2020-03-22 20:13:37, user Sinai Immunol Review Project wrote:

      This study retrospectively evaluated clinical, laboratory, hematological, biochemical and immunologic data from 21 subjects admitted to the hospital in Wuhan, China (late December/January) with confirmed SARS-CoV-2 infection. The aim of the study was to compare ‘severe’ (n=11, ~64 years old) and ‘moderate’ (n=10, ~51 years old) COVID-19 cases. Disease severity was defined by patients’ blood oxygen level and respiratory output. They were classified as ‘severe’ if SpO2 93% or respiratory rates 30 per min.

      In terms of the clinical laboratory measures, ‘severe’ patients had higher CRP and ferritin, alanine and aspartate aminotransferases, and lactate dehydrogenase but lower albumin concentrations.

      The authors then compared plasma cytokine levels (ELISA) and immune cell populations (PBMCs, Flow Cytometry). ‘Severe’ cases had higher levels of IL-2R, IL-10, TNFa, and IL-6 (marginally significant). For the immune cell counts, ‘severe’ group had higher neutrophils, HLA-DR+ CD8 T cells and total B cells; and lower total lymphocytes, CD4 and CD8 T cells (except for HLA-DR+), CD45RA Tregs, and IFNy-expressing CD4 T cells. No significant differences were observed for IL-8, counts of NK cells, CD45+RO Tregs, IFNy-expressing CD8 T and NK cells.

      Several potential limitations should be noted: 1) Blood samples were collected 2 days post hospital admission and no data on viral loads were available; 2) Most patients were administered medications (e.g. corticosteroids), which could have affected lymphocyte counts. Medications are briefly mentioned in the text of the manuscript; authors should include medications as part of Table 1. 3) ‘Severe’ cases were significantly older and 4/11 ‘severe’ patients died within 20 days. Authors should consider a sensitivity analysis of biomarkers with the adjustment for patients’ age.

      Although the sample size was small, this paper presented a broad range of clinical, biochemical, and immunologic data on patients with COVID-19. One of the main findings is that SARS-CoV-2 may affect T lymphocytes, primarily CD4+ T cells, resulting in decreased IFNy production. Potentially, diminished T lymphocytes and elevated cytokines can serve as biomarkers of severity of COVID-19.

      This review was undertaken as part of a project by students, postdocs and faculty at the Immunology Institute of the Icahn school of medicine, Mount Sinai.

    1. On 2020-04-22 09:23:55, user Dr Mubarak Muhamed khan wrote:

      I keenly read this manuscript. My views. Following are the limitations before coming to conclusions <br /> 1. This is still not a published study in any top journals and must be taken back based on following points<br /> 2. Although a good write up, it’s retrospective study with hurried conclusions<br /> 3. Selection criteria is just based on hospitalised COVID19 patients. And at which stage drugs administered is not clear in all three groups<br /> 4. Outcome criteria is only either death or discharge. What happened to those who got discharged ? Whether there was any hastening in improvement due to these drugs? Whether there is shortening of duration due to drugs from COVID positive to negative?<br /> 5. Whether these drugs have been tried as prophylaxis? Or only used in hospitalised patients? <br /> 6. All patients included are with mean age at 70 and with many comorbidities <br /> 7. What dosages used for hcq and azythromicin ? How many days treatment given?<br /> 8. What type of pharmacovigilance noted for all groups?<br /> 9. Whether at anytime drugs discontinued due to side effects?<br /> 10. What side effects were obvious during the treatment period?<br /> 11. When patients succumbed to mechanical ventilation, how and what type of dosages of these drugs given?

      *Although it is good retrospective study to know the effects of HCQ and HCQ+AZT in treatment of COVID 19 infected hospitalised patients.... it will be very much premature to conclude these drug’s role based on short experience and points raised above*

    2. On 2020-04-24 09:57:00, user Philip Davies wrote:

      Well, well well,

      This pre-print would make a good script for an episode of Columbo.

      The retrospective analysis, as presented, leads the reader to just one conclusion in a bazaar of many possible conclusions.

      I am even starting to have sympathy with D. Raoult and his team. I note his hot tempered response to this paper, where he lists two enormous factors that should be considered when wrestling with the data: the fact that the HCQ and HCQ & AZ cohorts were a sicker crowd (he lists lymphopenia) and that the sickest of the non-HCQ ventilated patients were then given HCQ (plus AZ in most cases) in a desperate last bid only for most to die.

      Raoult's point is certainly valid.

      We must remember that for most of the study period the use of HCQ was "ex-license" on a compassionate basis only. This means only the sickest patients got it. Remember also that this is a retrospective analysis, therefore observational. It was not run as a therapeutic trial. On the other hand, the use of AZ was already accepted (hence 30% of the non-HCQ cohort got it anyway).... although do be aware that by this time there had been quite a lot of focus on potentially dangerous QT lengthening when HCQ and AZ were used together in very sick patients.

      The HCQ cohort was, across all key determinants, the weakest and sickest group (it had the poorest prospects looking at age, ethnicity, smoking status, congestive heart failure, peripheral vascular disease, cerebrovascular disease (strokes),dementia, COPD, Diabetes (with and without complications)! ... and indeed, the HCQ and HCQ & AZ cohorts did have 100% more lymphopenia than the non-HCQ group.

      BUT, the big asymmetric issues become obvious when we look at the pre- and post- ventilator numbers.

      In terms of patients discharged without needing ventilation, the "victorious" non-HCQ group performs poorer than the 2 treated groups. This despite having a better prognostic baseline. But the results for this group change dramatically (for the better) when we look at the outcomes of ventilation. 25 ventilated patients came from this group.... but 19 of these 25 patients were then started on HCQ or HCQ & AZ after ventilation was started. It is screamingly obvious that these would be the sickest patients in that group: they were given such compassionate drugs in extremis. So having ejected 19 of 25 ventilated patients into the other cohorts, the non-HCQ group only had 3 deaths from its remaining 6 ventilated patients.

      The numbers of ventilated patients in the other cohorts (HCQ and HCQ & AZ) were thus substantially inflated with these new super-sick patients, who mostly died.

      There really can be no conclusion at all when looking at a study of this nature without knowing much more about individual clinical conditions and guiding principles behind clinician's decision making. It's still possible to make some reasonable assumptions:

      If I were Columbo?... I would say the non-HCQ cohort contained patients of extremes, with the best and worst potential. The worst would have been the very frail (malignancy and or congestive heart failure maybe ... see the stats), who probably were earmarked for 'supplemental oxygen' only from the very start. Such patients would not have been suitable for compassionate use of non proven drugs (remember, most of this came before the "emergency use" edict by FDA). This would explain the number of non-ventilated patients who died in this group (they may have been given AZ only, not being a controversial drug, but otherwise they did not get any significant interventional therapy). These patients would have had significant chronic disease and very poor obs/indices (including lymphopenia). But given that this cohort had, overall, a better starting prognosis than the other two groups, it means that the remaining patients in the group were promising candidates for survival (with better obs/indices). Such patients, not being part of a clinical trial, would not have been offered HCQ on a compassionate basis unless they got dramatically worse .... and of course, the ones who did get worse on the ventilator were started on HCQ (& often AZ as well) and thus swapped into the HCQ / HCQ & AZ cohorts.

      If we can understand that, then we might start to think that in fact HCQ & AZ is the best performing cohort with the other 2 vaguely distant. But this is being unfair to the HCQ cohort:

      The reason that a sick patient would be given one experimental drug on a compassionate basis (HCQ) but not have a rather less experimental drug further added (AZ), can really only be explained by considering risk versus benefit. A clinician would choose to use HCQ because the patient was particularly sick. The clinician would only add AZ if it was felt that this was worth the risk.... but a particularly sick patient with significant cardiovascular disease (the HCQ contained the most CVD risk) might then die of a more abrupt arrhythmia through adding yet another QT lengthening drug. I dare say the clinicians were tempted to make some "Hail Mary" plays, but we must remember, these patients were not part of an ongoing trial, these drugs were "ex-license" for compassionate use only and clinicians were still accountable for responsible actions. So for those particularly sick frail patients, it wasn't worth the risk.

      I am pretty sure that the HCQ cohort (which had pretty good pre-ventilator stats) crashed badly because it was loaded with the sickest patients .... patients that were too sick to risk adding AZ.

      So, the findings of this retrospective analysis are, in my opinion, likely to be incorrect.

      I believe I can confidently state that:

      1. The HCQ cohort started with the sickest patients and had even more of the sickest added during ventilation. Some were too sick to risk the addition of AZ to existing HCQ.
      2. The HCQ/AZ cohort also had some very sick patients (again with more additions during ventilation).
      3. The Non-HCQ cohort had the best prognosis overall from the very start (although likely a polarized mixture of the most frail and the most promising)... and then its stats got even better when it jettisoned its sickest ventilated patients into the other 2 cohorts.

      It is almost impossible to reach a conclusion from all this. BUT, the most likely finding is NOT that adding HCQ delivers a worse outcome than standard treatment. In fact, if we look at the pre-ventilator stats, the addition of HCQ might actually have provided considerable benefit to a particularly sick group of patients. Whether or not the addition of AZ to HCQ adds benefit is also unclear ... although my 'swingometer' is pointing slightly more to benefit than harm.

      Once again. I suggest that a robust study into prophylaxis and early treatment (using sensible safer doses adjusted for pulmonary sequestration) will deliver the most interesting results for CQ/HCQ.

      Dr Phil Davies<br /> Aldershot Centre For Health<br /> http://thevirus.uk

      EditView in discussion<br /> Discussion on medrxiv 3 comments<br /> medrxiv viewer<br /> Philip Davies<br /> Philip Davies 4 days ago<br /> The low dose arm of this study is worth following.

      The big problem for this study is comparison. It really has not defined the control population at all. The Italian and Chinese references are entirely different. Even the 2 Chinese populations referenced had massively different outcomes because the populations examined were different.

      The Italian mortality rate was actually similar to the overall study average here (but much higher than the low dose arm). The Chinese study involved all patients admitted to the two hospitals ... that included a majority of patients with moderate ("ordinary" as the Chinese class it) disease severity. The patients in this Brazilian study were regarded as severe or critical ... such patients (looking at worldwide stats) would attract a mortality of 30-40% plus.

      This is the most important factor. Do not compare apples with pears. So far this study points the "swingometer" in favor of benefit versus harm for the use of HQN in patients with advanced disease.

      Once again however, we are looking at the potential impact of an orally administered drug to patients with advanced disease. That's a big ask.

      For CQ and HCQ the most interesting results will likely come from studies looking at prophylaxis and early treatment (using safe doses, not silly high doses with added drugs that also lengthen QT). We can't yet guess how they will pan out.

      Dr Philip Davies<br /> GP<br /> Aldershot Centre For Health, UK<br /> http://thevirus.uk

    1. On 2020-10-07 06:13:12, user Markku Peltonen wrote:

      There were a number of comments on this manuscript on twitter early August, with concerns on errors in the calculations among others. Might be useful for others, so here is what I tweeted on August 5th 2020 (https://twitter.com/MarkkuP...: "https://twitter.com/MarkkuPeltonen/status/1290754970292281349):")

      Recently there was a meta-analysis on the effects of masks conducted in Finland. A number of comments has been made about the quality of the piece, so I had a quick look at it. As the analysis was also mentioned at least in Sweden, few quick comments in English. 1/10

      Background: the Finnish Ministry of Social Affairs and Health did a systematic review in May 2020 on the use of community face coverings to prevent the spread of Covid-19. There was no meta-analysis in the review, which focused on effectiveness. 2/10

      The conclusion on that report was “very little research data available on the effectiveness of community face coverings in preventing the spread of COVID-19 in society.” and evidence “minor” or “non-existent”. 3/10

      So, now then a formal meta-analysis, identifying the same 5 randomised controlled trials, showing an effect with relative risk estimate 0.61 (95% CI 0.39-0.96).<br /> Few points: 4/10

      The meta-analysis focuses on efficacy; what is achievable potentially when perfect conditions. They do something which they call “account of bias caused by non-compliance”; ie. if persons in the mask-group did not were masks they “adjust” for this. 5/10

      To me, this sounds quite controversial: In my world we look at intention-to-treat first, and then perhaps maybe on the “per-protocol”/“as treated”. <br /> Efficacy important, but this is now something different than what the original systematic review aimed at. 6/10

      The problems of this accentuate in the Discussion, where the authors do not seem to understand the difference in efficacy and effectiveness, nor the fact that they are actually analysing something else than the original review, and making way too far-fetched conclusions. 7/10

      There are other peculiarities, for example “Four of the analyzed studies evaluated the use of masks on respiratory infections directly, and in one the primary outcome was compliance with mask use.”. Hopefully an error, I don’t believe they actually mix the outcomes like this. 8/

      . @jejkarppinen added the following comments after my initial post, which I agree with:<br /> - The potential biases in the original papers were not covered.<br /> - Quality of evidence was not evaluated at all.<br /> - Dissemination of the results did not consider the potential problems. 9/10

      Finally:<br /> - I've not read the original 5 studies. <br /> - I’m not an expert on systematic reviews/meta-analyses. <br /> - I do think recommendation for masks is motivated, and the evidence is there (but not here..).<br /> - I do think we should be objective when evaluating evidence. 10/10

      The original systematic review the Finnish Ministry of Social Affairs and Health in Finnish is here (english abstract only):<br /> http://julkaisut.valtioneuv...

      Ps. Somebody noted the lack of preregistered protocol, which reminded me that the PRISMA-guidelines helpful when reporting systematic reviews and meta-analyses. <br /> Their checklist should be followed in reporting:<br /> http://prisma-statement.org

      In addition, it was noted by Jesper Kivelä that there are errors in the calculations, these should be corrected (in Finnish):<br /> https://twitter.com/JesperK...

    1. On 2020-11-02 02:33:32, user Atomsk's Sanakan wrote:

      The paper calculates IFR using COVID-19 deaths 4 weeks after the median time at which antibody levels were measured. That's consistent with other papers that use at deaths 3 weeks or more after the median time. For example:

      https://www.thelancet.com/p... (with: https://www.thelancet.com/j... )<br /> https://www.medrxiv.org/con...

      The paper also notes that IFR for SARS-CoV-2 is substantially more than that of seasonal influenza:

      "These results also confirm that COVID-19 is far more deadly than seasonal flu; indeed, the World Health Organization indicates that seasonal influenza mortality is usually well below 0·1% unless access to health care is constrained."<br /> https://www.medrxiv.org/con...

      "Using the midpoint of that interval, we estimate that the total U.S. incidence of seasonal influenza during winter 2018-19 was in the range of 45 million to 93 million infections and hence that the population IFR for seasonal influenza was in the range of 0.04% to 0.08%--an order of magnitude smaller than the population IFR for COVID-19."<br /> https://www.medrxiv.org/con...

      That is consistent with sources such as:

      "The current data in Europe are consistent with an IFR of 0.5–1.0%, which is many times higher than seasonal influenza (<0.1%)."<br /> https://www.ncbi.nlm.nih.go...

      "The calculated COVID-19 infection fatality rate is 1.63%, which is 10 to 40 times more deadly than the seasonal flu (fatality rate 0.04%-0.16%)."<br /> https://news.ochsner.org/ne...<br /> [for: https://wwwnc.cdc.gov/eid/a... ]

      "In summary, we estimate that the overall COVID-19 IFR ranges from 0.14 - 0.42% in low income countries to 0.78 - 1.79% in high income countries, with the differences in those ranges reflecting the older demography of high income settings.<br /> [...]<br /> Our estimates of the IFR of COVID-19 are consistent with early estimates and remain substantially higher than IFR estimates for seasonal influenza (<0.1% in the USA) [...]."<br /> https://www.imperial.ac.uk/...

      World Health Organization, in October:<br /> "Several of these analyses have used published or pre-print seroepidemiologic results and they all converge around a point estimate of around 0.6%.<br /> That may not sound like a lot but that is a lot higher than influenza [...]."<br /> https://www.who.int/publica...

    1. On 2021-04-28 13:32:34, user Huijghebaert Suzanne wrote:

      This study poses a number of relevant questions to resolve, before concluding on efficacy. To start: calculating backwards, the number of PCR negative symptomatic subjects (31 in total) were comparable in both treatment groups, suggesting that all differences over the 21 days of treatment were accounted for solely by COVID-19, so strangely, the spray – in contrast to what is claimed – would not prevent other common colds....! So, how much analytical flaw, how much efficacy?<br /> 1) Calculating the % back in carrageenan+saline versus saline alone 7.6% corresponds to 15 and 8.6%<br /> to 17 subjects each respectively, totaling 32 instead of 31: where is the incorrect overlap, 1 person too<br /> much calculated as +?<br /> 2) Most importantly, which PCR test did you use and how did you assure that carrageenan did not interfere with the PCR assay? Cfr Ribeiro 18th Apr, 2013, asking for a safe way to remove carrageenan from RNA samples and reporting that after RNA extraction (by Trizol), reverse transcription and real time PCR, only the control group (saline injected, without carrageenan) had positive amplification, while carrageenan interfered<br /> with the reaction. <br /> 3) The impact of carrageenan may additionally also already interfere at the sampling stage by affecting/reducing the RNA loading, leading to less<br /> positive results (cfr Laurie et al.). Also that step should be validated.<br /> 4) As the difference moreover is very small, and the outbreaks in health care personal often concerns clusters within a same division, how do the individual data relate to different clusters within given departments?

      Without proper profound validation the PCR test(s) in presence of various carrageenan concentrations, the findings may not translate in a true benefit, but possibly mask (so missed) viral positivity and so falsely hide transmission events. Unless validated in detail, the combination of already reported reduced PCR-response (due to carrageenan) in the nasal samples and case clustering may possibly annihilate all differences of significance.

      Suzy Huijghebaert, Belgium

    1. On 2021-09-12 21:59:19, user Swapnil Hiremath wrote:

      <reposting with="" minor="" edits="" as="" disqus="" thought="" last="" one="" was="" spam="" for="" some="" reason=""><br /> The authors have undertaken an ambitious project: briefly, taking numerators from the VAERS database, denominators from vaccine numbers from elsewhere. They then perform a ‘harm-benefit’ analysis looking at COVID hospitalization as the only harm. The whole analysis is restricted to the 12-17 age group in whom the concern of myocarditis is admittedly higher.<br /> They report a risk which was anywhere from 1.5 to 6.1 times higher for vaccine associated myocarditis vs COVID causing hospitalization. Vaccines must be bad, surely.

      However, several problems are quickly apparent.<br /> 1. The rate of myocarditis is much higher than the ones reported in Ontario: 160/million for 12-15 males compared to 72.5/million from Ontario (which includes Moderna as well - which has higher rates of myocarditis than the Pfizer/BioNTech). Why would this be so? There are many possible reasons, including the overestimation from VAERS being probable cause. On a perusal of the supplement, there are many which are other viral diseases which could be the reason; additionally many descriptions are quite vague (‘the doctor told us troponin was elevated’). It is very easy to submit cases to VAERS, so the numbers reported could be an overestimate - proper case ascertainment with source documents is necessary to be sure of the cases. Needless to say, simple arithmetic to derive 'rates' is also problematic. The VAERS website specifically suggests the numbers should *not* be used for estimating rates.

      1. It was not clear why the authors chose Jan 1, when vaccines EUA for 16-17 started in March, and for 12-15 in May. In their database, there seems to be one case in March and most of the VAERS reports from May or later.

      3.Next, the authors make many assumptions when it comes to who had comorbidities and who did not among the children, and multiply numbers to come up with some crude estimates. It would be useful for a pediatric diseases researcher to assess these assumptions. The 40% assumption of children hospitalized 'with COVID' and not due to COVID is a very crude untruth that the authors and others have needlessly perpetuated on social media with little foundation.

      1. Most importantly, the authors assume that hospitalization is the only bad thing for children who develop COVID. Some 12-17 years olds have died due to COVID, and some may have had a 1 day hospital stay - their analysis treats these equally and incorrectly. Some teens developed MIS-C. Some developed longer term sequelae. To group them under ‘hospitalization’ seems overly simplistic. Similarly, from perusing some of the vaccine-myocarditis, many seem to have recovered with symptomatic care alone. The authors seem to be minimizing COVID and maximizing vaccine associated adverse events.

      2. It should be noted that the involvement of children in the first two waves seems to be different than the one we have seen in the last 2 months with delta (for whatever reason - perhaps with lower immunization numbers in these).

      3. Lastly, the pandemic is not yet done. Many more children are going to get COVID in the next few months and years. We are going to have many more hospitalization, morbidity and sadly many more deaths. There will be long term morbidity and sequalae. We do need better data to assess the risks and benefits. This study is not it.

      Unfortunately this study has been picked up uncritically by media and will worsen vaccine hesitancy. This seems unwise in the face of an ongoing pandemic.

    2. On 2022-01-14 15:44:35, user Jordan Taylor wrote:

      There are at least three major issues in this paper.

      The first is a technical issue and probably the most obviously fatal flaw. It was first pointed out (that I could see) by Shih-Hao Yeh, and I think deserves re-emphasising. The authors have badly miscalculated the COVID19 hospitalisation risks for children conditional on comorbid status. They cite a 120 day hospitalisation rate (during moderate viral prevalence) of 255/million children. They note that the hospitalisation risk is 4.7-fold higher for children with comorbidities than for those without. How do we calculate the risk for each subgroup then? In this case, we are told 70% of those hospitalised have commorbidities and 30% do not, so for each 255 hospitalised, 0.3*255 = 76.5 will be healthy and 178.5 will have commorbidities. We can't stop there though as we need to adjust for the size of the background healthy and comorbid populations, which the authors tell us is 67% and 33% respectively. To get rates per million we have 76.5/0.67 = 114.2 among the healthy and 178.5/0.33 = 540.9 among those with comorbidities. They seem to have come up with the 44.4/million and 210.5/million figures based on the assumption that the two have to sum to 255/million, which is just not how it works at all. A basic sanity check should have been "should the risk in the high risk group really be lower than the overall risk?" If the rate of gun deaths is 100/million in the military and 1/million in civilians, you don't just add them together to get an average population gun death rate of 101/million!

      The second is more conceptual. The comparison is between a risk from vaccination conditional on vaccination with a risk from infection which is not conditional on infection. In other words, the paper does not answer the question: what is the myocarditis risk of a vaccinated child versus the hospitalisation risk of an infected child? Instead it rather tries to answer the question: what is myocarditis risk of a vaccinated child versus the COVID hospitalisation risk for an average child over a 4 month period at X prevalence rate. Given that the pandemic has already been going on for 2 years now and shows no signs that it will simply disappear, this strikes me as an inappropriate choice.

      Another issue is how the claims in the conclusions compare to the data. The authors state that for "boys 12-17 without medical comorbidities, the likelihood of post vaccination dose two CAE is 162.2 and 94.0/million respectively..." (emphasis added) and that this "exceeds their expected 120-day COVID-19 hospitalization rate at both moderate (August 21, 2021 rates) and high COVID-19 hospitalization incidence." However, at no point in the text, data, methods, or the Study Profile supplement (S2) of the paper is a stratification of CAE rates by comorbid status provided. In fact, although they did not provide any code, I was broadly able to reproduce their VAERS myocarditis counts in 12-17 year olds from 01/01/2021-06/18/2021 using the criteria they provide*, and that was without adjusting for comorbidities. So it seems likely to me that they have not separately analyzed "healthy" and comorbid patients and thus have no basis for making claims about effects of vaccination on "boys without medical comorbidities".

      If as seems likely they have compared CAE rates among both healthy and unhealthy children with COVID19 hospitalisation rates among only healthy children, this is a big problem. The authors should be challenged to stratify their analysis equally for both COVID19 hospitalisation and vaccination before this paper is actually published.

      *I found 277 cases matched the original criteria, however, this was with a number of likely duplications and possibly other data entry erros; as mentioned, the authors' methods were pretty inadequate so we have no way to see whether or how they have cleaned the data.

    1. On 2022-01-04 18:56:26, user Thomas Barlow wrote:

      Did you account for pre-existing T-Cell immunity (pre-2020) which was known in 30 to 50% of the population in year 2020?<br /> Did you account for the fact that most people have had covid (most without even noticing) and will have developed T-Cell immunity naturally? How did you control for that? It's not a static measure and is more expressed and less expressed at different times of month, year.

      Studies on INFECTION FATALITY RATE (IFR) - peer-reviewed studies [Studies conducted long before any vaccine] :

      (notice the one confirmed for publication by the W.H.O. in September 2020 [published Oct. 2020] - A 0.23% IFR...about the same as flu).

      MARCH 2021<br /> “the available evidence suggests average global IFR of ~0.15%”<br /> https://onlinelibrary.wiley...

      FEB. 2021<br /> “The infection fatality rate for both the Bureau of Prisons and U.S. was 0.7%. Among institutions that tested >=85% of inmates, the combined infection fatality rate was 0.8%”<br /> https://www.ncbi.nlm.nih.go...

      JAN 2021<br /> “The overall non-institutionalized IFR was 0.26%.”<br /> - https://www.acpjournals.org...

      DEC 2020<br /> “This rate varied from place to place, with a lower range of 0.17% and a highest estimate of 1.7%.”<br /> https://www.sciencedirect.c...

      DEC. 2020<br /> “Results show a fatality ratio of about 0.9%, which is lower than previous findings.”<br /> https://www.mdpi.com/1660-4...

      NOV. 2020<br /> “The overall infection fatality risk was 0.8%”<br /> https://www.bmj.com/content...

      NOV 2020<br /> “In the United States, COVID-19 now kills about 0.6% of people infected with the virus, compared with around 0.9% early in the pandemic, IHME Director Dr. Christopher Murray told Reuters.”<br /> https://www.reuters.com/art...

      NOV. 2020<br /> “The estimated IFR was 0.36% (95% CI:[0.29%; 0.45%]) for the community and 0.35% [0.28%; 0.45%] when age-standardized to the population of the community.”<br /> https://www.nature.com/arti...

      OCT 2020<br /> “We know that antibody tests are not perfect, and there may be a considerable number of people who do not mount a detectable antibody response to SARS-CoV-2. However, even when this uncertainty is taken into account, we still find that COVID-19 has a high fatality rate - on the order of 1% for a typical high-income country.”<br /> https://www.imperial.ac.uk/...

      SEPT 2020<br /> The W.H.O. posted a heavily peer-reviewed & critiqued study from May 2020, showing the deaths per cases are 0.23% overall, and going up to 0.5% in the worst hit cities. 0.05% for under 70s - The W.H.O. reviewed it again, then published it in September:<br /> - https://www.who.int/bulleti...<br /> - https://apps.who.int/iris/h...

      AUG 2020<br /> The medical journal 'Nature' had an analysis and stated that:<br /> "This result was used to calculate an overall IFR for England of 0.9%”<br /> https://www.nature.com/arti...<br /> ________________

    1. On 2021-11-28 19:40:54, user Robert van Loo wrote:

      44 relevant new variants up till now and on average some 2 per 10 million cases. Did we only see 220 million cases globally? I would think more with over 5 million deaths and an IFR of 0.6 %. I have papers and also WHO stating the reported 260 million cases is factors lower than the real number of infections. With over 5 million reported deaths and an IFR of 0.6 % the real number of infections would be over 800 million cases. The number of relevant new variants per 10 million cases would then be about 4 times lower. Of course if reported cases always underreport to the same extent the extrapolation of reported cases to new variants would not change. Still important to make the distinction as the underreporting factor is hugely variable.

    1. On 2020-06-29 08:48:39, user Dr Mubarak Muhamed khan wrote:

      RE: can creating new vaccine everytime is solution for new mutating viruses?

      We published our view as e letter regarding old vaccine and it’s Possible use In present menace in science and C&E News<br /> Link:

      https://science.sciencemag....

      The e letter

      (2 June 2020)<br /> Thank you very much for excellent update in new vaccines. We appreciate every efforts towards betterment of human life and fighting with new menace. Still Certain questions need to be asked while trying new vaccines everytime for Every new virus or any microbe mutation?<br /> Although we are Not immunologists, still certain questions haunts our mind. We hope that these queries and questions will ignite the minds of researchers and immunologists. With open minds we must ask these questions to ourselves in today’s tough time instead of getting rattled by situation<br /> 1. Does every new virus create specific antibodies? And for how long it works?<br /> 2. Is there any limit of immune response for any healthy Homo Sapien?<br /> 3. Whether body immune response of Homo Sapien get fatigued with every new challenges by new viruses?<br /> 4. Whether after multiple challenges by new viruses , body try saving Homo Sapien by cross immunity?<br /> 5. Although with new challenges by new virus, body may try responding by creating initial IgM .... And then IgG for certain time period, but whether memory is created for long time for such mutating viruses?<br /> 6. Why not to boost immunity with booster doses of existing vaccines and check cross immunity for fighting with new mutating viruses ?<br /> 7. When new vaccines are in development, why not to give a chance of revaccinations with existing proved vaccines (BCG, MMR, and many more) to masses??<br /> 8. Is there any harm in starting booster doses to children’s and adults of existing vaccines?<br /> 9. Till the new vaccines are developed for SARS Cov 2, good ample amount of time one will get to test boosting immunity with current vaccines and checking cross immunity for fighting corona?<br /> 10. We must continue searching new vaccines for every new virus. But what’s harm repurposing existing proved vaccine for strong cross immunity to neutralise many new menace?

      Still Many more new mutated viruses will arrive and try to attack us in different ways in future. Why not to boost sustainable existing immunity with booster doses of existing well tested vaccines in vaccination programmes?

      Sincere Regards<br /> Dr Mubarak khan<br /> Dr Sapna Parab<br /> Director & Consultant<br /> Sushrut ENT Hospital & Dr Khan’s Research Centre, Talegaon Danhade, Pune, India

    1. On 2020-04-12 12:44:15, user Clive Bates wrote:

      Thank you for an extremely interesting and informative paper. I have a few suggestions about one aspect of the paper - tobacco use.

      1. The paper alternates between use of 'smoking status' (the body) and 'tobacco use' (tables). It would be helpful to know which is appropriate and how the data on tobacco status was gathered. Tobacco use could include smokeless tobacco and, if FDA definitions are applied, it could include vaping.

      2. The proportion of tobacco users assessed, hospitalised and developing critical conditions is substantially below the tobacco use prevalence for New York, even when age is considered. Is this worth mentioning?

      3. The multivariate analysis shows an apparent protective effect against hospitalisation for current and former tobacco use as reported (OR = 0.71, 95% CI 0.57-0.87 p=0.001). This is a striking finding, but consistent with findings from CDC's summary of US data (MMWR (April 3, 2020 / 69(13);382–386)) and China (Farsalinos et al - pre-print) in which smoking appears to be underrepresented in the population with progression to more severe symptoms.

      4. A weaker (non-significant) apparent protective effect of current or former tobacco use (OR = 0.89 95% CI 0.65-1.21, P=0.452) of was found in the progression from hospitalisation to critical condition. Hospitals generally impose smoking cessation and nicotine withdrawal at the point of hospitalisation.

      5. Would it be possible for the authors to rerun the multivariate analysis with current tobacco use and former tobacco use as separate variables? It is possible that former use is masking a stronger effect from current use. Current and former tobacco use may have quite different effects on progression of the disease and former use can include people who quit smoking decades ago. The merging of current and former tobacco use may be obscuring valuable information in the data.

      6. There are many possible explanations for an apparent protective effect. It is possible the tobacco use status has been underreported, or current and former users are overrepresented in the 'unknown' status. It is possible that patients fear disclosure of tobacco use will lead to discrimination in treatment or they may feel guilty about their 'contributory negligence'. However, it is also possible that there is a real protective effect from either smoking or nicotine use. This is not implausible: nicotine interacts with the same receptor that is responsible for development of the disease following exposure to the virus. This paper could yield useful supportive or falsifying insights into that hypothesis.

      7. Even if such findings are disconcerting, we should be led by the data. It is not possible to rule out a protective effect at this point and this paper adds to the reasons to take the idea seriously. There could be significant implications for the population impact of COVID-19, implications for advice to tobacco users, and implications for practice in hospitals.

      8. I have no conflicts of interest with respect to tobacco, nicotine or pharmaceutical industries.

    1. On 2022-02-17 20:47:23, user RT1C wrote:

      Another point of confusion: "An individual was considered protected by natural immunity 14 days after testing positive for COVID-19 by a nucleic acid amplification test (NAAT). If not previously infected, a person was considered protected by vaccine induced immunity 14 days after receipt of the second vaccine dose of an mRNA vaccine. " and "A vaccine booster was defined as at least 1 dose of any COVID-19 vaccine at least 90 days following COVID-19 infection for those with natural immunity (i.e. those previously infected), or a third dose of a COVID-19 vaccine at least 90 days following the second dose of an mRNA COVID-19 vaccine for those with vaccine-induced immunity (i.e. those not previously infected)."

      This is all very confusing, stemming from your broad use of "natural immunity" to include those who were vaccinated before or after infection. Figure 4 is entitled with "natural immunity" but includes people with 0, 1, 2 or 3 doses. Based on the definitions in the text quoted above, that doesn't seem possible. Did they get infected and then receive 0-3 doses AFTER infection and still called "natural immunity" subjects? What about people who received 1 or more doses before infection? Are they counted among the vaccine-induced immunity subjects? In my opinion, your definitions and uses don't seem consistent or understandable.

      Furthermore, because other research has shown a difference in immune response when people are vaccinated then infected vs. infected then vaccinated, you should not combine these as one group. Did you make any attempt to compare these situations? Shouldn't you?

      Look at Fig. 2, for example. I assume that many of the subjects included in the curves on the left ("Natural Immunity") actually were vaccinated at some time, since Fig. 4 shows that many with "natural immunity" were vaccinated by 0-3 doses. How, then, are we to interpret Fig. 3? Is the weaker immunity with longer durations since POIC to be interpreted as time since infection, or time since vaccination (which would count for resetting POIC)? Is the weaker immunity with longer durations due to decay of natural immunity as the text seems to imply, or is it due to confounding with vaccination? (After all, doubly vaccinated have higher susceptibility in Fig. 1). These issues make it difficult to understand your study.

      I think you probably have the data for an informative analysis, and your method of analysis looks promising (I prefer it to the "person-days" approach used in some other work). Please consider reexamining the dataset with a clarified definition of "natural immunity" that accounts for all combinations of vaccination and infection including sequence.

    1. On 2020-03-30 14:10:51, user Sinai Immunol Review Project wrote:

      Study description: Data analyzed from 52 COVID-19 patients admitted and then discharged with COVID-19. Clinical, laboratory, and radiological data were longitudinally recorded with illness time course (PCR + to PCR-) and 7 patients (13.5%) were readmitted with a follow up positive test (PCR+) within two weeks of discharge.

      Key Findings:

      At admission:<br /> o The majority of patients had increased CRP at admission (63.5%).<br /> o LDH, and HSST TNT were significantly increased at admission. <br /> o Radiographic signs via chest CT showed increased involvement in lower lobes: right lower lobe (47 cases, 90.4%), left lower lobe (37 cases, 71.2%).<br /> o GGO (90.4%), interlobular septal thickening (42.3%), vascular enlargement (42.3%), and reticulation (11.5%) were most commonly observed.

      After negative PCR test (discharge):<br /> o CRP levels decreased lymphocyte counts (#/L) increased significantly (CD3+, CD3+/8+ and CD3+/4+) after negative PCR.<br /> o Consolidation and mixed GGO observed in longitudinal CT imaging w different extents of inflammatory exudation in lungs, with overall tendency for improvement (except 2/7 patients that were readmitted after discharge with re-positive test) after negative PCR.

      Seven patients repeated positive RT-PCR test and were readmitted to the hospital (9 to 17 day after initial discharge):<br /> o Follow up CT necessary to monitor improvement during recovery and patients with lesion progression should be given more attention.<br /> o Dynamic CT in addition to negative test essential in clinical diagnosis due to nasal swab PCR sampling bias (false-negatives).<br /> o Increase in CRP occurred in 2 readmitted patients (and decr. in lymphocytes in one patient), but was not correlated with new lesions or disease progression vs. improvement (very low N).<br /> o Patients readmitted attributed to false-negative PCR vs. re-exposure.

      Importance: Study tracked key clinical features associated with disease progression, recovery, and determinants of clinical diagnosis/management of COVID-19 patients.

      Critical Analysis: Patients sampled in this study were generally younger (65.4% < 50 yrs) and less critically ill/all discharged. Small number of recovered patients (N=18). Time of follow up was relatively short. Limited clinical information available about patients with re-positive test (except CRP and lymph tracking).

    1. On 2020-03-30 15:38:19, user Sinai Immunol Review Project wrote:

      Main findings<br /> This is the first report to date of convalescent plasma therapy as a therapeutic against COVID-19 disease. This is a feasibility pilot study. The authors report the administration and clinical benefit of 200 mL of convalescent plasma (CP) (1:640 titer) derived from recently cured donors (CP selected among 40 donors based on high neutralizing titer and ABO compatibility) to 10 severe COVID-19 patients with confirmed viremia. The primary endpoint was the safety of CP transfusion. The secondary endpoint were clinical signs of improvement based on symptoms and laboratory parameters.

      The authors reported use of methylene blue photochemistry to inactivate any potential residual virus in the plasma samples, without compromising neutralizing antibodies, and no virus was detected before transfusion.

      The authors report the following:<br /> ? No adverse events were observed in all patients, except 1 patient who exhibited transient facial red spotting.<br /> ? All patients showed significant improvement in or complete disappearance of clinical symptoms, including fever, cough, shortness of breath, and chest pain after 3 days of CP therapy. <br /> ? Reduction of pulmonary lesions revealed by chest CT.<br /> ? Elevation of lymphocyte counts in patients with lymphocytopenia. <br /> ? Increase in SaO2 in all patients, indicative of recuperating lung function. <br /> ? Resolution of SARS-CoV-2 viremia in 7 patients and increase in neutralizing antibody titers in 5 patients. Persistence of neutralizing antibody levels in 4 patients.

      Limitations<br /> It is important to note that most recipients had high neutralization titers of antibodies before plasma transfusion and even without transfusion it would be expected to see an increase in neutralizing antibodies over time. In addition to the small sample set number (n=10), there are additional limitations to this pilot study:<br /> 1. All patients received concurrent therapy, in addition to the CP transfusion. Therefore, it is unclear whether a combinatorial or synergistic effect between these standards of care and CP transfusion contributed to the clearance of viremia and improvement of symptoms in these COVID-19 patients. <br /> 2. The kinetics of viral clearance was not investigated, with respect to the administration of CP transfusion. So, the definitive impact of CP transfusion on immune dynamics and subsequent viral load is not well defined.<br /> 3. Comparison with a small historical control group is not ideal.

      Relevance<br /> For the first time, a pilot study provides promising results involving the use of convalescent plasma from cured COVID-19 patients to treat others with more severe disease. The authors report that the administration of a single, high-dose of neutralizing antibodies is safe. In addition, there were encouraging results with regards to the reduction of viral load and improvement of clinical outcomes. It is, therefore, necessary to expand this type of study with more participants, in order to determine optimal dose and treatment kinetics. It is important to note that CP has been studied to treat H1N1 influenza, SARS-CoV-1, and MERS-CoV, although it has not been proven to be effective in treating these infections.

    1. On 2022-12-19 02:33:05, user Charles Warden wrote:

      Hi,

      Thank you very much for posting this preprint. I think this is an interesting topic to learn more about.

      I was surprised at what seemed to me like a relatively small amount of difference for the EA-PGS versus the rare alterations (such as in Figure 2), but I am not sure if that is because of certain associations that I might have with “rare” and “monogenic”. I apologize if some of these questions are naïve and certain assumptions may be more or less relevant in certain situations.

      For example, I wonder if the phenotypic “deviators” might relate to part what I am trying to describe, and I think it would be good for me to take some time to better understand resources like Developmental Disorder gene panel from Gene2Phenotype.

      Nevertheless, as a starting point, I hope that you can help with at least some of the following questions:

      1) I could find Supplemental Figure S1. However, I see references to Supplemental Tables, when I can’t find additional uploaded “Supplemental” files.

      Am I overlooking something, or do the Supplemental Tables need to be added in a “v2” version?

      2) I thought Figure 1 was very helpful in gauging the relative influence on the different characteristics.

      However, the abstract mentions a “threshold for clinical disease”. I apologize that I don’t know the best full list of known genetic/genomic alterations that would provide a good reference, but I would be interested to know how the predictive power (which I assume is correlated with the beta values) might compare to something like trisomy 21 for Down Syndrome (although that affects many genes on chromosome 21).

      Reading this preprint helped me in terms of gaining familiarity with earlier publications, such as the AJHG Kingdom et al. 2022 publication. In that paper, I was interested in the set of 25 “known highly penetrant genes.” However, my understanding is that those were not used in a main figure because of sample size for sufficiently rare variants/alterations. For example, I recognized “Williams syndrome” and “Prader-Willi/Angelman” in Table S2 of the other publication, but I believe there were less than 20 UKB subjects for each of those diseases.

      I also apologize that I think that I am confusing the results a bit, since the other publication includes copy number alterations and Figure 1 in this preprint mostly describes “variants”. However, I hope that provides some sense of what I am asking about.

      For example, I don’t know if sample size is an issue, but I would be interested in seeing an additional category to compare the current results to something of comparable clinical utility can be defined (if possible, from individual variants in certain genes). I apologize if I am overlooking something.

      3) In this preprint, I also see “monoallelic (i.e., autosomal dominant)” in the methods. So, I think this already answers one of my question/questions, but I wanted to still say something to try and find the better way to describe what I would call “monogenic” diseases. It is possible that there might be some other explanation like needing additional characterization of individual variants (and I believe the earlier paper mentioned “very few” pathogenic variants were in ClinVar), but I would like to start with the possibility that perhaps there is a more precise way to communicate my thoughts.

      In other words, I would expect the clinical disease for monogenic diseases to be recessive in certain situations, and the questions that I had earlier related to whether there could be a noticeable number of developmental disorders that follow a recessive inheritance pattern (such as whether a homozygous variant counted as “2 variants”).

      Likewise, in terms of downstream effects, I thought Phenylketonuria/PKU affected some of the traits mentioned in this study (if dietary changes were not made in time). I don’t know if that was precisely considered a “developmental disorder,” but that is an example of what I think of when I see “rare” and “monogenic”.

      Please let me know if I have misunderstood anything, but my understanding is that this may go against the assumed dominant inheritance pattern for the genes. However, the question that I am trying to ask is how to categorize what I would call “monogenic” disease like PKU (as I understand it) and then compare that to the effects described for the associations in this preprint? I think one other way to describe this might be something like “positive predictive power” and yet another way might be something like “what are the proportion of individuals meeting the threshold for disease among carriers (if significantly higher than controls)”? However, I don’t know if there are issues with my use of terminology (and/or sample size, criteria to volunteer and consent, or something else).

      4) In terms of the clinical importance, I am not sure if there might be caveats/limitations to suggesting “PGS may provide some clinical utility by improving diagnostic interpretation of rare, likely pathogenic variants that cause monogenic disease”? For example, if a large enough number of individuals can be functional adults (or even adults with excellent health/success), then I might worry about the stigma of test results provide early in life (if predictive power was over-estimated). Based upon Figure 3, my understanding is that there is a reasonable chance that could happen with the EA-PGS? If that was true, then I hope that there is or can be a different term/classification to separate such a risk assessment from the genetic tests for diseases similar in severity to what I tried to list above (if I understand correctly).

      I hope that I can learn more, and I hope that these questions might also be helpful to some other readers.

      Thank you very much!

      Sincerely,

      Charles

    1. On 2024-05-02 17:06:42, user Oliver wrote:

      I am 44 months cold turkey. History includes class II ointment and cream use and one round of prednisone (25mg a day), and maybe even a steroid shot...but I'm not sure (I fell on my back once and went to the ER, they gave me a shot of something...this was a few months before I started my TSW). I am trying to get my records to confirm. When I used TCS, I never finished a prescription. I used small amounts. I even applied using a finger cot. I used very consistently for only about 2-3 years. In total, in my life, probably 4 years (on and off). I was pretty careful but very naive. I blame myself and my dermatologist. She always just prescribed me another TCS to use. She never mentioned avoiding long term use. So maybe I shouldn't blame myself. What I want to comment on here is that I truly believe the current situation with TSW is a systemic medical catastrophe. Just think about it, practicing physicians are not looking under the microscope in their clinics to distinguish TSW between other eczematous disorders. They are going by the eye, and that's if they are even cognizant of TSW. If you had to place a bet on the amount of people's skin on this earth that are addicted to TCS and actually have TSW symptoms rather other eczematous disorders, what is your over/under? Serious question. It's easily over tens of millions if you count every country. In North America? Millions. That might be underestimating it. You could get a clearer picture just by seeing how many TCS products are sold to end consumer. It's tens of millions. The situation is beyond complex. As a nonprofessional, I'd even go as far as saying most people that have a history of TCS that are walking into their doctors clinic, they have TSW and that person and their doctor don't even know it. How many of these people are there every day? Sure, use the TCS for long term. But if you use for 20+ years and TSW is still not recognized by most institutions...I won't say much more but my mind will always remember the late Eric 'Nim' Bjorklund. May his memory be a blessing. TSW is more than just a serious condition. It's a crisis. It's irreparable. It could be one of the greatest institutional failures of the century in medicine.

    1. On 2020-04-18 17:11:55, user Clint Cooper wrote:

      Please allow me five responses to this study:<br /> 1. Yes, it's true that there are large numbers of people who have probably been exposed to Covid-19 and experienced no symptoms and have antibodies. The best country to look at with regard to testing mass numbers of people randomly is Iceland. Icelad has now tested over 11% of its entire population and a lot of testing was truly done randomly. The fatality rate there is around .6%, or one in about 167 people. In New Zealand, which has also engaged in massive testing, the fatality rate is about .8% or 1 in 125 people. HOWEVER (and this is a big however). the fatality rate in both countries has gone up steadily and is continuing to go up. Also, in those countries the average age of people infected is much lower than in other countries, like Italy, who have been hit much harder by this disease. This actually jives with the original fatality rate given by the World Health Organization, which was around .3 to .5% for younger people, and markedly higher for older people. <br /> Also, the average fatality rate for the common flu is about 1 in 2,000. So even if we can agree that the fatality rate of Covid-19 might be lower than originally thought, it's way more dangerous than any common flu. It is also exponentially more contagious, which is a huge part of the problem. <br /> 2. As testing becomes more and more widely available across the planet, the fatality rate has gone markedly up across the board, not down. For closed cases, the fatality rate has gone from 3-4% initially to now 21% and rising.<br /> 3. The fatality rate in this study assumes that no one ever died at home from Covid-19. As a matter of fact, NYC is now including people who died at home from Covid-19 but a lot of US states and other countries are not, which would artificially lower the fatality rate, when it could actually be much higher.<br /> 4. To say that somehow the fatality rate of this disease is no worse than just a really bad seasonal flu is looney tunes and doesn't pass the eyeball test since many, many perfectly healthy people with no pre-existing medical conditions are dying as a result of this disease left and right, including young and middle aged people. That simply doesn't happen with seasonal flu. The self-reported symptoms of Covid-19 are also far worse than most self-reported symptoms of the common flu. Recovery time also appears to be longer, and many doctors are reporting lung and heart tissue damage.<br /> 5. There are about 18,000 people in Santa Clara County who have tested for Covid-19 and about 10% of those tested positive. Is it possible that the same people who tested negative could have already had Covid-19 and it was already defeated by their immune system and they now have antibodies and are testing negative? Could it be that these very same people who already tested negative for the virus are now volunteering to test for antibodies? In this case, there is a massive level of redundancy and the study is useless and can't be extrapolated to the general public. This study is far too small and not random enough to provide any usefull information whatsoever. We would need truly random testing of a much bigger group of people to provide any insight at all into what the actual fatality rate is. We would also need to include a much wider age range.

    2. On 2020-04-21 09:14:54, user Michael Rosenberg wrote:

      Some additional information about performance of the Premier Biotech test is available at the distributors's web site. See links at the bottom of this page.

      Package Insert shows false positive rate of 2/371 for IgG and 3/371 for IgM. It is not clear if same samples that tested positive for IgG also tested positive for IgM or these were different samples. Assuming best case scenario, cumulative FP rate for IgG and IgM is 3/371 (0.8%).

      One could add to FP calculation 0/30 FPs reported in the preprint and 0/88 FPs mentioned by Dr. Sood in LA briefing yesterday. Adding these samples brings naive FP rate to 3/489 or 0.6%.

      Distributor also provides what appears a submission for approval of the test by China FDA (link). This document shows 3/150 false positives for IgG and 1 - for both IgG and IgM. Therefore cumulative FP rate for IgG and IgM in this study is 4/150 or 2.7%. Note however that this number likely overestimates FP rate - all negative subjects in this study were COVID contacts excluded by PCR.

    1. On 2025-11-23 17:32:31, user Charlotte Strøm wrote:

      In the following “text in italics – inside quotations marks are copy-pasted from the reference in question." Underlining and/or bolded text are done by me.

      1. SPIN AND FRAMING<br /> The title of the preprint is: “Randomised trial of not providing booster diphtheria-tetanus-pertussis (DTP) vaccination after measles vaccination and child survival: A failed trial”<br /> 1.1 Framing neutral findings as abnormal or disappointing<br /> The authors consistently imply the results are “unexpected” or “contradictory”, rather than acknowledging that the RCT failed to support earlier observational findings.

      Examples<br /> “A failed trial” says the title ->The trial did not “fail”: it ran, randomised 6500+ participants, and produced valid estimates showing no harm of the DTP vaccine. Calling it “failed” is a framing tactic that positions the result as an error rather than what the data showed.

      Page 8, lines 11-12: The was no difference in non-accidental mortality … the HR being 0.84 (0.52–1.37).” ->This is an appropriate stating of results, but the subsequent framing undercuts it.

      Page 8, lines 22-24: “Since no beneficial effect of not giving DTP4 was found, contradicting many observational studies… possible interactions were explored…” -> This subtly frames the RCT as problematic because it contradicts earlier observational research, rather than recognizing that RCTs supersede observational evidence.

      Page 10, line 2:“The present RCT is therefore an outlier which needs an explanation.” -> This is spin: the RCT is not an "outlier" needing explanation; observational studies - upon which the research hypothesis are based - are know to have confounding and are biased. CONSORT encourages presenting results without exaggeration or defensive justification.

      1.2 Causal interpretations of non-significant results<br /> The authors imply meaningful patterns where no statistically reliable findings exist.

      Examples Page 8–9 (exploring interactions despite explicitly stating low power): “There was one significant interaction … DTP strain … observed only for females.” (p=0.05)

      No correction for multiple testing; >20 interactions tested -> This is classic exploratory-analysis spin.

      1.3. Hypothesis-confirming language<br /> The manuscript repeatedly positions NSE hypotheses as foundational truths rather than unproven claims.

      Example Page 3, lines 5-7: “Several studies inidcated… beneficial non-specific effects… more pronounced in females.” -> These were observational or post-hoc analyses, being framed as established background biases the narrative.

      1.4 Framing underpowering as the main explanation<br /> Repeated emphasis that the trial was “strongly underpowered” serves to discount the main finding.

      Examples Page 8, lines 13-15: “...the trial was planned with 3% annual expected mortality rate… observed rate was 81% lower… we had 65% fewer deaths…” -> This is accurate but placed repeatedly tthroughout the text to frame the null result as flawed.

      Page 9, lines 18-21: “The RCT was strongly underpowered… mortality declined …” -> The authors do not consider that a null finding study is plausible.

      2. CONSORT NON-COMPLIANCE <br /> 2.1. Missing or unclear prespecified primary outcome<br /> CONSORT requires explicitly stating primary and secondary outcomes and linking to a prespecified Statistical Analysis Plan (SAP).

      Issues: The manuscript says: Page 5, lines 25-27: “The outcomes were all-cause non-accidental mortality and hospitalisation, as well as sex-difference…”<br /> -> It is unclear which of these is the primary outcome. Mortality? Hospitalisation? Sex-differential mortality? AND - there is ...

      -> No link to protocol-defined hierarchy.

      2.2. Discrepancies between protocolled numbers and intervention<br /> There are discrepancies between numbers stated in the publicly available protocol and study record at http://clinicaltrials.gov and the numbers appearing in the preprint. The intervention described in the preprint is not aligned with the protocol.

      These discrepancies are unexplained in the preprint. The preprint states that DTP3 has been reported elsewhere, but the reference that is included in the preprint (2) does not report on mortality data, moreover it includes both DTP3 and DTP4. And these protocol deviations are inadequately accounted for in the preprint.

      It remains therefore unexplained what the actual flow of study subjects were, and it remains unclear what the results are from the DTP3+OPV+MV versus OPV + MV only – as stated in the protocol.

      2.3. Randomisation procedure is not sufficiently described<br /> CONSORT requires allocation concealment method and sequence generation details

      Example Page 5, lines 12-14: “...randomisation lots were prepared by the trial supervisor… kept in envelopes… mother asked to draw envelope…” -> No description of safeguards (opaque, sealed, sequentially numbered). -> Allocation was not blinded, but CONSORT requires explicit reporting of potential bias. DTP3 is not mentioned in the trial flowchart - figure 1.<br /> 2.4. Lack of intention-to-treat analysis <br /> CONSORT requires ITT or explanation for deviations.

      Example Page 7, lines 1–3: “All children with follow-up and who received the per-protocol intervention were included in the analyses.”<br /> -> This is per-protocol only, inappropriate for a superiority RCT intended to detect harm.

      -> No ITT analysis is presented.

      2.5. No reporting of missing data handling<br /> CONSORT requires transparent handling of missing outcome data.

      Example Page 7, line 14: “No imputation for missing data was done.” -> But the extent of missing data is not reported for mortality or hospitalization outcomes.

      2.6. Discussion includes non-evidence-based explanations, violates CONSORT as Discussion should reflect results, not speculation<br /> The discussion drifts into immunological theory and historical interpretations unsupported by trial data.

      Examples: Page 10, lines 4-6:“...likely that immune mediated NSEs are more pronounced when mortality is high…”<br /> Page 9-10 (multiple paragraphs): Repeatedly argues unexpected null results require explanation. -> This is speculative; not based on data reported from this RCT.

      2.7. Lack of balanced discussion<br /> CONSORT item 22: discuss both limitations and strengths. -> The manuscript heavily emphasises limitations (underpowering, interventions, etc.), but does not discuss the strength of randomisation or lack of harmful signal which is odd considering the research hypothesis of the trial.

      3. OVERALL REFLECTIONS ON THE IMPACT OF SPIN, FRAMING, AND CONSORT DEVIATIONS<br /> Altogether, it seems to be rather unusual for researchers to put the trial down already in the headline, downright devaluating the trial. The authors are known to advocate detrimental effects of the DTP vaccine, a hypothesis that is based on purely observational studies (3, 4), and very small numbers that have not managed to replicate even by the same group of researchers (5).

      This preprint reports results from a large-scale randomized trial, outranking observational studies in the hierarchy of evidence. Hence – making use of “A failed trial” appears to be an attempt to frame the results as invalid, which is ethically disturbing and highly inappropriate towards trial participants and readers.

      p. 10:“The present RCT is therefore an outlier which needs an explanation. The drop in power due to the declining mortality rate may not only have lowered the possibility of finding significant tendencies; it is also likely that the immune mediated NSEs are more pronounced when mortality is high, so when mortality declines by >80%, the residual deaths may be less likely to be affected by immunological changes.”<br /> There seems to be a deliberate misinterpretation, unsubstantiated, and highly speculative. It is difficult not to read this in any other way than as a deliberate attempt to spin the results, frame them to be perceived according to the authors’ hypothesis about DTP having detrimental effects and increasing child mortality. Spinning results is defined as questionable research practice (6). The study was a null finding study, not an outlier.

      There were no signs of more pronounced negative NSE, i.e., higher mortality in the child participants, who got DTP with, or after the measles vaccine. However, the primary outcome analysis demonstrated that this trial is a null finding study and thus the hypothesis was rejected.

      3.1.Spinning the facts around other interventions.<br /> Several times in the preprint, the authors argue that other health interventions affected the trial conduct and the results.

      Examples<br /> Page 1;During the trial period many new interventions, including many national health campaigns, were carried out.”<br /> and <br /> “due to the large number of health interventions, not envisioned at the initiation of the trial, a limited part of the follow-up was a comparison between DTP4+OPV4 vs OPV4 as the most recent vaccinations”<br /> Page 6:“Other interventions and interactions. As the number of routine vaccinations and national health campaigns vaccinations increased through the 1990s and the 2000s, it has become increasingly clear that there are numerous interactions between different health interventions, such as vaccines and micronutrient supplementation, which are usually not taken into consideration in planning a vaccination programme. For example, the sequence of vaccinations, the time difference between non-live and live vaccines, and booster exposure to the same vaccines all had impact on the mortality levels. In addition, most vaccines have sex-differential NSEs (16). Since children were enrolled at 18 months of age, there were numerous possibilities for interactions with (a) national health intervention campaigns before enrolment; (b) participation in previous RCTs; and (c) national health campaigns after enrolment in the trial.”

      Page 10:trials of NSEs were planned more or less as vaccine efficacy studies. However, it has become increasingly clear that there are interactions with other routine vaccinations, vaccination campaigns, and other interventions affecting the immune system like vitamin A (16,19,20). Hence, in the present RCT we examined possible interactions with campaigns before enrolment, previous RCTs, and campaigns given after enrolment.”

      -> The reader is left with the impression that a series of other factors influenced the trial and possibly invalidated the results. However, this was a randomized trial set-up which to a great extent compensates for any potential confounding effects, ie. other interventions that may have affected the outcome; but they will do so in both the intervention and comparator group.

      Moreover, from table 1 of the preprint – Baseline characteristics – it would seem that the authors tend to put too much weight on multiple other factors as the trial appears to be well randomized.

      Finally, if it in fact was true that this trial was influenced by other RCTs, health interventions, or campaigns, then this argument applies to all trial data originated from this research group in Guinea Bissau and consequently invalidates all of them.<br /> Again it is remarkable that the authors put down their own trial, spin the data and frame them into letting the reader believe that the trial is worth nothing at all. This is not in accordance with appropriate reporting standards as per CONSORT (7).

      3.2. Spinning the facts around the succession of vaccines<br /> p.3 “high-titre-measles-vaccine (HTMV) was protective against measles infection, but surprisingly, it was associated with higher female mortality, when tested against STMV (5,6). Hence, NSEs could be beneficial or deleterious and they were often sex-differential.

      References 5 and 6 are self-citations and based on post hoc re-analyses. The hypothesis that the DTP – following HTMV induced higher mortality remain highly speculative and never replicated. A more likely explanation would be that the HTMV was dosed too high resulting in measles infections, attenuated but still, which unfortunately in some cases increased the subsequent risk of mortality. This is notably a specific effect of the vaccine. However, as the authors advocate that the live (attenuated) vaccines are inferring beneficial effects and the non-live vaccines infer detrimental effects, a post-hoc narrative was constructed on the succession of vaccines having relevance. Importantly, this current preprint where the DTP vaccine is given alongside or not a live attenuated vaccine does not support this highly speculative hypothesis. On the contrary: if anything the results pointed towards DTP increasing child survival.

      1. OVERALL REFLECTIONS ON ETHICS

      4.1. Troubling lack of ethical standards and compliance

      p. 7 it is stated that the study was explained to mothers in the following way:

      “...though DTP is highly protective against whooping cough, it can occasionally give adverse reactions or limit the effect of measles vaccine….”

      This speculative hypothesis seems to be introduced in the study participant / guardian information material, although this was never defined as a research question in the protocol.

      Moreover, the protocol states:

      “Hypothesis: Not providing DTP together with or after MV is associated with a 35 % reduction in overall mortality and 23% reduction in hospitalizations.

      Taking one step back – and reflecting just for a minute – it appears to be the wildest research question ever. How did the Ethics Committee and the relevant authorities allow for this largescale trial to be conducted in the first place? What could possibly justify a RCT of this magnitude based on an outrageous research question like the one that was raised in the protocol: A 35% reduction in mortality is expected from omission of a single shot of vaccine?

      4.2. Underpowered or not<br /> The preprint states that the trial was “highly underpowered,” although 109% of the planned study population was enrolled. There seems to be a large contrast between how this trial and a recently reported trial (8) are interpreted based on whether there was a significant finding or not. These discrepancies indeed appear as tendentious framing.

      A direct comparison of these two large RCTs conducted by the same research group – with vast discrepancies in the results (enrolment and conduct) as well as interpretation is available at this link: https://www.linkedin.com/pulse/review-preprint-reports-dtp-trial-nct00244673-charlotte-str%25C3%25B8m-awgtf/ <br /> 4.3. Self-citation rate of 95%<br /> Nineteen of 20 references include members of the same author group – and are thus self-citations. This may reflect a general lack outside this group of scientific support to the NSE hypothesis and / or selective citation which is considered to be questionable research practice (6). A rule of thumb is that a self-citation rate above 15% raises suspicion of selective citation.

      4.4. Reflections on the “Postscript” of the preprint<br /> It is truly a good thing that these results have finally come to light. The study subjects, their families, and the scientific community have been waiting for these data to be published.

      The preprint is concluded by a lengthy postscript explaining the unusual long delay (14 years) in publishing the results from this trial.

      "Postscript. We apologise for the late reporting. The implementation of the trial went quite different from the scheduled plans. In this older age group, more children than expected were registered by an ID and address that could not be followed. Funding was lacking for the PhD student to complete the data cleaning and analysis. Before funding could be obtained, the Guinean field supervisor had died which made it difficult to resolve some inconsistencies in data. The senior authors had too many other commitments. Finally, from 2020, the COVID-19 pandemic changed all priorities"<br /> These explanations may very well be seen as a result of hypocrisy, as members of this group of authors have published numerous papers – including reporting of several clinical trials during the past 14 years. Moreover during this delay it has been argued by members of the author group that an RCT with the exact same research hypothesis should be conducted (10):

      “Almost 4 years after WHO reviewed the evidence for NSEs and recommended further research, IVIR-AC has now submitted for public comments two protocols of RCTs to measure the NSE impact of BCG and MV on child mortality:<br /> a. A BCG trial will compare mortality between 0 and 14 weeks of age for children randomized to BCG-at birth plus routine vaccines at 6–14 weeks of age vs. placebo at birth and routine vaccines at 6–14 weeks, with BCG at 14 weeks of age.<br /> b. An MV trial will compare mortality between 14 weeks and 2 years of age for children randomized to an additional dose of MV co-administered with DTP3 vs. placebo co-administered with DTP3.”<br /> According to http://clinicaltrial.gov the study hypothesis of NCT00244673.<br /> “DTP3/4+OPV+MV versus OPV+MV or DTP4+OPV4 versus OPV4”<br /> And even worse – it was claimed in the same publication Expert Review of Vaccines, Vol 17, 2018 – Issue 5 (10) that: "Science is also about accounting for all data. ... it has not been possible to conduct RCTs of DTP in high-mortality areas."<br /> There has evidently been a complete lack of willingness from the research group behind this trial to report on this null finding study that rejected the research hypothesis and rejected the hypothesis that the DTP vaccine has detrimental NSE. Such selection bias in reporting trial results on mortality is scientifically troubling and ethically both irresponsible and unacceptable.

      References:

      1. Agergaard JN, S.; Benn, C.S.; Aaby, P. Randomised trial of not providing booster diphtheria-tetanus-pertussis (DTP) vaccination after measles vaccination and child survival: A failed trial. In: Bandim Health Project IN, Apartado 861, Bissau, Guinea-Bissau; Department of Infectious Diseases, Aarhus University Hospital, Denmark; Bandim Health Project, OPEN, Department of Clinical Research, University of Southern Denmark/Odense University Hospital, Denmark; Danish Institute for Advanced Study (DIAS), University of Southern Denmark, Denmark, editor. 2025.

      2. Agergaard J, Nante E, Poulstrup G, Nielsen J, Flanagan KL, Ostergaard L, et al. Diphtheria-tetanus-pertussis vaccine administered simultaneously with measles vaccine is associated with increased morbidity and poor growth in girls. A randomised trial from Guinea-Bissau. Vaccine. 2011;29(3):487-500.

      3. Mogensen SW, Andersen A, Rodrigues A, Benn CS, Aaby P. The Introduction of Diphtheria-Tetanus-Pertussis and Oral Polio Vaccine Among Young Infants in an Urban African Community: A Natural Experiment. EBioMedicine. 2017;17:192-8.

      4. Aaby P, Mogensen SW, Rodrigues A, Benn CS. Evidence of Increase in Mortality After the Introduction of Diphtheria-Tetanus-Pertussis Vaccine to Children Aged 6-35 Months in Guinea-Bissau: A Time for Reflection? Front Public Health. 2018;6:79.

      5. Sørensen MK, Schaltz-Buchholzer F, Jensen AM, Nielsen S, Monteiro I, Aaby P, et al. Retesting the hypothesis that early Diphtheria-Tetanus-Pertussis vaccination increases female mortality: An observational study within a randomised trial. Vaccine. 2022;40(11):1606-16.

      6. Bouter LM, Tijdink J, Axelsen N, Martinson BC, Ter Riet G. Ranking major and minor research misbehaviors: results from a survey among participants of four World Conferences on Research Integrity. Res Integr Peer Rev. 2016;1:17.

      7. Hopewell S, Chan AW, Collins GS, Hrobjartsson A, Moher D, Schulz KF, et al. CONSORT 2025 explanation and elaboration: updated guideline for reporting randomised trials. BMJ. 2025;389:e081124.

      8. Thysen SM, da Silva Borges I, Martins J, Stjernholm AD, Hansen JS, da Silva LMV, et al. Can earlier BCG-Japan and OPV vaccination reduce early infant mortality? A cluster-randomised trial in Guinea-Bissau. BMJ Glob Health. 2024;9(2).

      9. Benn CS. Non-specific effects of vaccines: The status and the future. Vaccine. 2025;51:126884.

      10. Benn CS, Fisker AB, Rieckmann A, Jensen AKG, Aaby P. How to evaluate potential non-specific effects of vaccines: the quest for randomized trials or time for triangulation? Expert Rev Vaccines. 2018;17(5):411-20.

    1. On 2025-12-01 00:37:35, user Cyril Burke wrote:

      [Note: This is the fourth of several rounds of review of an earlier version of our combined manuscript, aiming to reduce ‘racial’ disparity in kidney disease. The comments were kindly offered by nephrologists, through a medical journal, and we remain grateful to them for the time and care they gave to improve our manuscript.

      We removed identifying features and included our response, at the end. The changing title and line numbers refer to earlier versions.]

      January 4, 2023

      Dear Dr. Burke III,

      REDACTED.

      Editor: Unfortunately, the re-re-revised manuscript is not improved. The authors have declined to shorten the paper, despite repeated requests by both reviewers and the editor. As such, the paper is not fit for publication in its current format. This is pity as there are some valid points within the manuscript, although some others are debatable and not backed up by good scientific evidence.

      REDACTED.

      Reviewer #1: I greatly regret that the next round of revision (R3!) does not take into account the key suggestion of the previous round, to concentrate on part 1 and drastically shorten the paper

      Reviewer #2: Thank-you for the opportunity to review this manuscript.

      The manuscript raises some important points with regards to the use of serum creatinine in the diagnosis and monitoring of kidney disease, as well as important considerations about race.<br /> As the authors acknowledge the manuscript remains too lengthy for consideration as a research article. Unfortunately, the authors have declined to shorten the manuscript as recommended by the reviewers and editor.

      RESPONSE TO EDITOR AND REVIEWERS

      January 16, 2023

      Early detection of kidney injury by longitudinal creatinine to end racial disparity in chronic kidney disease: The impact of race corrections for individuals, clinical care, medical research, and social justice

      We write to appeal the rejection of our manuscript. We were grateful for constructive comments from the Academic Editor and Reviewers and incorporated almost all of their suggestions, some itemized below. We were pleased that Reviewer #2 and Reviewer #1 recommended publication in the second and third [journal] decisions, respectively. But we were surprised by this rejection.

      ‘Race’ has been central to our manuscript from the original submission because discussing ‘race’ is essential to reduce ‘racial’ disparity in kidney care. Kidney failure is three times more common in Black than White Americans. As anthropologists have known and shown for more than a century, but biologists and physicians have been slow to acknowledge, biological ‘race’ is scientifically invalid and should be irrelevant. However, in the United States, ‘race’ is uniquely defined, ubiquitously applied, and often presumed to have a biological basis in medical research. A key point in our paper, based on our clinical observations and data reanalysis, is that race corrections add further harm to medical care by obscuring the causes of disparities and delaying or derailing the search for real underlying cofactors, especially in nephrology [1,2].

      For this reason, we disagree with suggestions to slice the article into two or more separate publications (the long-known practice of “salami science” or publishing of the “smallest publishable unit”). Separating the data from the take-home message would undermine the overview we are trying to provide for [journal] readers.

      Below, we highlight some excerpts from the [journal] decisions, adding our commentary.

      1. Eneanya ND, Boulware LE, Tsai J, Bruce MA, Ford CL, Harris C, et al. Health inequities and the inappropriate use of race in nephrology. Nat Rev Nephrol. 2022 Feb;18(2):84-94. doi: 10.1038/s41581-021-00501-8. Epub 2021 Nov 8. PMID: 34750551; PMCID: PMC8574929.

      2. Norris KC, Williams SF, Rhee CM, Nicholas SB, Kovesdy CP, Kalantar-Zadeh K, et al. Hemodialysis Disparities in African Americans: The Deeply Integrated Concept of Race in the Social Fabric of Our Society. Semin Dial. 2017 May;30(3):213-223. doi: 10.1111/sdi.12589. Epub 2017 Mar 9. PMID: 28281281; PMCID: PMC5418094.

      1. (4/1/2022): Revision required

      Reviewer #1 wrote: <br /> …a somewhat unusual paper, devoted to a topic of potential major clinical relevance, and as yet understudied….

      Reviewer #2 wrote: <br /> “Choi- rates of ESRD in Black and White Veterans” doesn’t fit with the rest of the paper including the title; the introduction and conclusion also don’t adequately address this portion of the paper. It feels disjointed from the main point of discussion which is the use of sCr in screening “pre-CKD”. This section and discussion should be removed and possibly considered for another type of publication….

      We understood Reviewer #2 as indicating the reanalysis of Choi needed better integration with other Parts of the manuscript or had to be cut. This interpretation was validated by Reviewer #2’s response to our major revision (see below).

      2. (8/3/2022): Revision required

      The Academic Editor wrote:<br /> The revised manuscript only partially addresses the critiques raised by the Reviewers. ….the authors need to address all the minor points highlighted by Reviewer 2.

      Reviewer #1 wrote: <br /> …the main key message (which is right in the opinion of this reviewer (see first round of review) and warrants more attention and studies.

      Reviewer #1 wrote: <br /> The race part is irrelevant for the key point (race does not change over time, and thus is not relevant when looking at longitudinal serum creatinine or eGFR) and should be deleted in the opinion of this reviewer.

      On ‘race’, we strongly disagree. ‘Racial’ disparity will continue until we talk about ‘race’. For the major revision, we made clearer the connection between our two data reanalyses (of Shemesh et al and Choi et al). Social ‘race’ in the US differs from social ‘race’ anywhere else, yet these are rarely compared, so an international audience objecting to discussion of ‘race’ often has no idea what ‘race’ means in the US. ‘Race’ is fraught, and to advocate change requires more words than to acquiesce to current practices (i.e., banning discussion of ‘race’ favors the status quo).

      Reviewer #2 wrote: <br /> Thank-you, once again, for the opportunity to review this lengthy “thesis-style” manuscript which discusses some important often over-looked topics. The under-use of serial creatinine measurements and over-reliance on often erroneous eGFR measurements is an important point which is easily missed by healthcare workers with potentially serious consequences. Likewise, the misuse of racial constructs in medicine (and elsewhere) is an important point. I am satisfied with this re-submission and the changes which have been made to the original manuscript.

      Reviewer #2 acknowledged the changes and recommended publication.

      3. (10/10/2022): Revision required

      The Academic Editor wrote: <br /> The re-revised manuscript is further improved.... I could offer you the possibility to shorten the manuscript just focusing on what you define “Part One” plus “section A of Part Two”. You can briefly address the “race” issue in the discussion…

      The Academic Editor seems not to appreciate that ‘race’ is a central topic in our manuscript, as evidenced by our secondary data reanalysis of Choi. As we noted earlier, publishing our manuscript in two or more separated Parts would make the reader work to reassemble them. Cutting Choi and briefly addressing ‘race’ would not allow the quality of argument needed to address ‘racial’ disparity in kidney failure, and would fundamentally shift our paper to focus purely on nephrology. For the topic to be complete, our data must be assessed in terms of its meaning for ‘racial’ disparities that are currently widespread in medical practice.

      Reviewer #1 wrote: <br /> As part one is important and should trigger further studies, after reading the comments of reviewer 2 , I am ready to recommend acceptance.

      Reviewer #1 recommended the second revision for publication.

      Reviewer #2 wrote: <br /> Once again, this reviewer in no way questions the often-overlooked inaccuracies in mGFR methods. However, the authors cannot quote a well conducted review which shed light on the methodological bias and imprecision which exists between mGFR methods and claim that this methodological bias is “physiologic variability”. The authors should review: Rowe, Ceri, et al. "Biological variation of measured and estimated glomerular filtration rate in patients with chronic kidney disease." Kidney international 96.2 (2019): 429-435. Intra-individual variation (CVI) for serum creatinine ranges from around 2.8 – 8.5% while cystatin C ranges from around 3.9 – 8.6%, inter-individual variation (CVG) of serum creatinine: 7.0 – 17.4% and cystatin C: 12 – 15.1%. Biological variation (CVI and CV¬G) are not the same as analytical variation, which also exists for serum creatinine and cystatin C. The author’s statement is not backed up by scientific evidence.

      Reviewer #2 provided a key reference, leading to our addition to the next revision of an important section on “gold standards” and Bland-Altman plots.

      Reviewer #2 wrote:<br /> Instead of drastically shortening the manuscript the authors have added to the length thereof.... This reviewer has chosen not to provide further comment on the new additions to the manuscript”....

      …the main point of the article, although difficult to decipher, is highly relevant.

      We wonder if the paragraphs were somehow mixed up, because the tone of this comment is different and Reviewer #2 had recommended publication in the earlier Decision and had just recommended a key reference, above.

      4. (1/4/2023): Rejection

      The Academic Editor wrote: <br /> Unfortunately, the re-re-revised manuscript is not improved…<br /> The Academic Editor’s idea of improvement appears limited to breaking the manuscript into several parts. We had hoped that clear improvements might be persuasive, including a major section on “gold standards” (inspired by Reviewer #2’s reference), reorganization for readability, revision of the Table of Contents, and others, but as noted above, we could not accept the offer to publish a radically altered message.

      The Academic Editor wrote: <br /> …despite repeated requests by both reviewers…

      Reviewer #2 then Reviewer #1 had approved the manuscript for publication.

      The Academic Editor wrote: <br /> …there are some valid points within the manuscript, although some others are debatable and not backed up by good scientific evidence.

      We worked to not overstate our evidence. Regarding the data from over 2 million veterans of Choi et al (in Part 3) our reanalysis stated: “The sample size was very small—only 15 data points—because Choi broke (dichotomized) the continuous raw data into five data segments… therefore, the precision of this result may not hold up with replication. However…”. We also wrote addressing this concern (in Revision 2, Part 3) and updated the sentence (in Revision 3, Part 4): “…we discuss… some novel or speculative GFR cofactors…. These require further study, and some may prove insignificant.”

      Moreover, “good scientific evidence” is hard to define and extensively debated by methodologists, but the Academic Editor isn’t entirely wrong. The evidence we provided is more of a demonstration than new scientific evidence, which is both a strength and a limitation. The “gold standard scientific approach” would be to test all our claims analytically in new samples of data, which is far beyond the scope of this project, so the Academic Editor isn’t wrong about that—some claims are debatable and are not backed up by good scientific evidence. The analytic methodologies we used were far from conventional, but that was the point—to identify areas of misconception open to debate, and to shed new light on them in an innovative way. Were these not debatable points, there would be no need for an alternative approach.

      REDACTED.

      We could argue that our paper effectively employs science, but on this issue, it seems more relevant to note that ours is clearly about ways to improve the base of academic knowledge—refining scientific process through better understanding of science, so this criticism seems inconsistent with [REDACTED] and detracts from the nuance that is a strength of our manuscript.

      Nevertheless, we remain interested in incorporating feedback and ask whether the Reviewers could briefly list the points they believe are debatable and not backed up by good scientific evidence, which would allow us to address those points and either provide better evidence or state why the current evidence is weak.

      Reviewer #1 wrote: <br /> I greatly regret that the next round of revision (R3!) does not take into account the key suggestion of the previous round, to concentrate on part 1 and drastically shorten the paper.

      As we noted, the research part of the manuscript comes first, in Parts 1 to 3. Busy readers can stop before Parts 4 and 5, but we believe these data and discussions need to be kept together.

      Reviewer #2 wrote: <br /> The manuscript raises some important points with regards to the use of serum creatinine in the diagnosis and monitoring of kidney disease, as well as important considerations about race.

      As the authors acknowledge the manuscript remains too lengthy for consideration as a research article. Unfortunately, the authors have declined to shorten the manuscript as recommended by the reviewers and editor.….

      It is unclear what changed the mind of Reviewer #2, who recommended publication after the major revision and inspired the important section on “gold standards”—a clear improvement that we found satisfying.

      Reviewer #2 references our comment sent in our last Response to the Reviewers: “…our manuscript may no longer be a good fit for [the journal]”, which was our most polite way of declining the Academic Editor’s offer to publish only part of our manuscript, narrowly focused on Nephrology or Laboratory Medicine. Our goal is to keep the manuscript intact.

      In summary, the Academic Editor and Reviewers have not offered good scientific evidence for cutting a manuscript that lengthened to address their many thoughtful suggestions, nor against discussing ‘race’ as central to American ‘racial’ disparities in kidney failure. REDACTED.

      THEREFORE: For all the above reasons, we request reconsideration of the decision against publication.

      Thank you for considering this appeal.

      Sincerely,

      Cyril O. Burke III

    1. On 2022-04-18 19:58:31, user M. Akers wrote:

      I am looking for some clarification on the connection between race and lower life expectancy:

      1. In the discussion, citations 7 through 9 are highlighted to support a long history of systemic racism, yet there appears to be little supporting data in those articles that connects your specific findings in order to make that assumption. Maybe I’m missing something?
      2. On the other hand, with the continued drop in life expectancy for white, NH Americans in 2021, you offer no meaningful explanation. How can systemic racism be the most apparent cause for a drop in life expectancy in Hispanic and Black populations, yet no cause (or even an attempted explanation) can be surmised for white, NH Americans?
      3. It seems pretty clear that obesity and associated metabolic syndrome have been major drivers for mortality and morbidity during the pandemic in the United States. Do we know how the United States compares to the peer nations cited in your article in terms of obesity and other metabolic syndrome incidence that could help explain your findings? Furthermore, are Black and Hispanic American populations in the United States disproportionally obese compared to white, NH Americans that could also support a greater drop in life expectancy?
      4. Could an increase in suicide rates and/or drug overdoses in younger Americans contributed to your findings? Could lockdown policies and lack of socialization have contributed?

      It seems that for life expectancy to have dropped that significantly, a large proportion of young people would have needed to pass away in order for that drop to occur, yet we know that statistically, young people (e.g. <30 years old) did not die as a result of COVID.

      Any clarification would be helpful as I am having a difficult time making the attempted connection suggesting that race is the only viable variable that explains a drop in life expectancy. With the pandemic, it seems there could be, and likely are, numerous factors contributing to your findings, yet nothing other than race is focused on.

      Thank you.

    1. On 2025-09-29 15:17:43, user Bryan Wilent wrote:

      I think this is great and as I read through it hit me how challenging this is to do. Kudos to the team.

      1. I would consider adding therapeutic impact to title and checklist. The distinction between diagnostic accuracy and therapeutic impact should be addressed head on and then can distinguish between the metrics in the checklist. I feel like this is often conflated in the literature at times and people using this checklist would benefit with a clear guideline

      2, I gather that this will be part of the checklist, but readers would benefit with clear list of all measures available and delineation of which metrics apply to diagnostic versus interventional domains (relative risk, odds ratios, probabilities) If getting into the latter, should also expound on nature of the intervention, like in Holdefer/Skinner structural causal model paper.

      3 How to handle suboptimal IONM planning (modalities/nerves/muscles used), e.g., a study found that IONM had low sensitivity for quadriceps pain/weakness after lateral fusions but the study used posterior tibial nerve SSEP for the LE and EMG only.

      4. How to handle dynamic and variable alert criteria (and not a hard threshold), which is consistent with current ISIN guidelines for SSEPs with variable reproducibility and guidelines for MEPs in diagnosing evolving nerve root v cord v brain dysfunction? I don’t have a great suggestion, but I think it needs to be addressed. Example. A case had a 60% change in SSEPs from a limb in context of stable of MEPs, so an alert was not called.

      5. Alert to what? Is it appropriate to analyze alerts specific to a pattern or injury? Example, how to report if some "alerts" had low specificity, but this alert pattern had both high sensitivity and specificity. Lieberman et al from 2019 on MEP patterns and foot drop is a an example thinking of that uses ROC curves for different muscle MEP change patterns.

    2. On 2025-08-26 18:45:14, user Laura Hemmer wrote:

      Thank you to this expert group on undertaking this needed and carefully-executed initiative to help improve diagnostic accuracy of IONM studies! I have a few minor comments as follow below for your consideration.

      -In the discussion in the Introduction that lists applicable guidelines, the updated ASNM SSEP position statement published in 2024 could also be a helpful reference here for completeness and particularly for its discussion regarding anesthetic and physiologic factors that can impact SSEPs as well the section on interpretation and outcomes, which has some discussion on the interpretation of reversible evoked potential changes. (J Clin Monit Comput. 2024 Oct;38(5):1003-1042.)

      -In the methods section, “STARD dementia” should likely have a reference noted.

      -Please pay attention to the tense used for the portion regarding community engagement and feedback (e.g. abstract methods notes Phase 3 will include broader community…” as is starting to occur now, but then the results portion in the abstract somehow notes what was already emphasized by community feedback. Similarly, the Results Overview in the manuscript seems to indicate the results of community engagement and dissemination, even though it appears community engagement is just now occurring.) This may be confusing for readers.

      -Phase 3 in the Abstract Methods portion notes that this will include broader community feedback, but in the manuscript, it appears community feedback is actually Part 4 of Phase 2 (“Community Engagement and Consensus Building” and Phase 3 is actually the dissemination of the final checklist. Please clarify.

      -In the Results section, part #2, please consider if additional details of your assessment of adherence to the STARD checklist across 12 peer-reviewed publications should be made more fully available, such as adding these 12 references to Supplementary Content.

      -Is the Results section, part #4 accurate yet (i.e. already officially endorsed by 3 international societies) or just anticipated still? These societies will need to be stated before publication.

      -For anesthesia reporting in IONM studies, consider if more details regarding anesthetic technique could be useful. For example, what if additional anesthetic adjunctive/multimodal agents are also incorporated into the anesthetic regimen beyond just TIVA, inhalational, or mixed? We know from the literature, for example, that ketamine in different doses can impact MEP amplitude differently. Also, inhalational amount (e.g. MAC) should be noted when a “mixed” inhalational and intravenous hypnotic anesthetic regimen in being administered, as further evoked potential signal degradation would generally be expected with higher MAC levels.

      -Some of the anesthetic reporting details discussed in the results section are really more physiological details, so should the heading be something like “Anesthesia and Physiologic Reporting in IONM Studies” instead of just “Anesthesia Reporting in IONM studies” perhaps?

      -For patient demographics, in addition to the examples given in the document, including height, weight, etc., please consider noting that studies should also include other pertinent medical comorbidities for IONM purposes, such as the presence of diabetes mellitus and associated neuropathy which may make it harder to obtain robust baseline evoked potentials. Table 2 notes “clinical characteristics”, but I wonder if medical comorbidities that would be particularly pertinent to IONM and that may make even obtaining adequate, robust baseline signals difficult should be more clearly stated in the document and/or Table 2? It is helpful that Table 2, in the clinical characteristics of participants section (#20), does state that baseline IONM data should be reported.

      -Reversibility of IONM changes is well covered by the authors in its own dedicated section within the Results section of the manuscript. Recommendations by the authors on how to best handle all evoked potential deteriorations is also clearly given in the same area of the results section. This important discussion and recommendation by the group, gets a little diluted and confusing when it is re-addressed shortly afterwards still in the results section under “Alternate evaluation framework in IONM”. Please consider if the repetition here is fully needed, or perhaps this area could refer back to the very well-stated section previous in “Reversibility of IONM changes”? Also the section “Alternate evaluation framework in IONM” might benefit from more clear recommendations from the expert working group.

      -Consider if, from the 3rd sentence to the end of the 1st paragraph in the Discussion section, is actually needed. It is pretty redundant from earlier coverage in the document. For conciseness, could move the 2nd paragraph content to just after the 2nd sentence in the 1st paragraph in the Discussion section if desired.

      -Anesthesia techniques definition is very basic in Table 1. For readers who do not carefully read the manuscript and refer more to the Tables only, should more detail be given here or at least could note to see the manuscript text content? Similarly, no mention of anesthesia appears in Table 2, which is the actual checklist being presented. Since standardized reporting of anesthesia-related variables is critical for IONM diagnostic accuracy studies, should anesthesia reporting information appear in the Table 2 checklist?

      -Should studies be asked to more clearly state how it was determined that adequate baseline evoked potential signals were present (reporting of IONM baseline data is recommended in Table 2 #20, which is good). What about in the case of intracranial surgery and the concern for stimulation occurring below the area where ischemia could occur (potentially leading to false negatives)?

      -I do not see the supplementary material currently noted in the document on Medrxiv for review, so I have not reviewed this supplementary content.

      -Minor typographical/grammatical errors noted by me have been directly submitted to one of the working group members.

      Sincerely,<br /> Laura Hemmer, M.D.

    1. On 2020-04-14 11:29:44, user Andrea Nicoletto wrote:

      I am not sure whether I am commenting a scientific article, a political statement or something in between. This article came to my attention because, even if it has not been peer-reviewed yet, has already been referenced in a press release of Italy's central health agency (ISS), which comments the results as a matter of fact and calls them "published".

      Skimming the full text the following points came to my attention.

      1 - On page 4, you state that "Due to the high concordance (99%) among confirmation results with the engaged laboratories, thepolicy was then changed allowing selected Regions with demonstrated confirmation capacity to directly confirm COVID-19 cases (17)." Reference 17 contains only an internal note specifying what a "case" is, but does not contain the data supporting the statement that there is high concordance. Previous publications coming from ISS (e.g. [1]) state that "99% of the samples analized by the national reference lab of ISS result POSITIVE", which suggests a selection bias in sending samples to the central laboratory and completely invalidates the confirmation process. False positive/false negative rates of the cross-analysis has not been reported.

      2 - In your introduction, you state that "extensive contact tracing and testing of close contacts unveiled ongoing transmission in several municipalities of the Lombardia region". The difficulties in testing and tracking cases in Lombardia region is well known, with several papers (including your references) and statements from authorities highlighting the fact that (a) the number of tested people is little w.r.t. the number of potential cases; (b) the classification of potential case varies on a regional basis, and a potential case in any other Italian region might not qualify as a potential case in Lombardia region, thus not getting tested; (c) testing protocols in Lombardia region do not include the testing of people living within the same household of the confirmed case, thus it is unclear in which way the contact tracing has been "extensive"; (d) the delay between the collection of the swab and the communication of the result is long and with a high variance. All these facts shall be taken into account when analyzing statistical data.

      3 - In your conclusions (!), you state that "Further, we observe that as of March 8 2020, the Rt it is still abovethe epidemic threshold. The progressively harsh physical distancing measures enacted since then may have enhanced the decreasing trend in transmissibility as happened in China". You support this statement using a paper which analyses the transmission of Ebola in West Africa. This seems to me like a political statement which shall have no place in a scientific paper, let alone on its conclusions. You have no data to back this statement, since your analysis terminates on the 12th of March, i.e. three days later of the enacment of the lockdown. You even show that there is a decreasing trend in R_t starting in the last decade of February, which puts the R_t at the beginning of the national lockdown slightly above 1 with a strongly declining trend. While writing this sentence has no scientific value, of course, it allows the MoH and ISS to defend their decision-making because "science said that".

      I do not have the specific background to carry out a review of the quantitative data, but I cannot ignore the use of poorly-backed statements to dignify as "science" what are only political decisions. Unsurprisingly, these statements are those which will fit into press releases and official statements.

      [1] https://www.epicentro.iss.i...

    1. On 2020-04-16 18:03:55, user Sinai Immunol Review Project wrote:

      This retrospective study evaluated the use of intravenous methylprednisolone to treat severe COVID-19 pneumonia in a cohort of 46 patients. The severity of the disease was determined according to version 5 of the Coronavirus Pneumonia Diagnosis and Treatment Plan by the National Health Committee of the People's Republic of China. The percentage of patients with comorbidities was 32%, and it was reported similar between methylprednisolone treated and untreated groups. The results showed that the group of patients that received methylprednisolone (n=26) had a shorter number of days with fever than patients that did not received methylprednisolone (n=20); they also had a faster improvement of peripheral capillary oxygen saturation (SpO2), and better outcomes in follow-up CT scans of the lungs. The dosage of methylprednisolone was reported to be 1-2mg/kg/d for 5-7 days, although there is no information about the concrete dosages for each patient. From the 46 patients, 43 recovered and were discharged, while 3 cases were fatal. Patients without administration of methylprednisolone needed longer periods of supplemental oxygen therapy, though there is no reference to the number of patients requiring mechanical ventilation. Interestingly, there were no significant differences in leucocyte and lymphocyte counts nor in the levels of IL-2, IL-4, IL-6 or IL-10 after treatment with methylprednisolone.

      Some of our main criticisms to this study are also pointed-out by the authors themselves: it is a retrospective single-center study with no validation cohort and without mid- and long-term follow-ups. The reported mortality was 7% (3/46) and did not appear to be affected by corticosteroid treatment: one patient died in the group that did not receive methylprednisolone, while two patients died in the methylprednisolone treatment group. Additionally, although the authors mention that patients received cotreatments, such as antiviral therapy and antibiotics, there is no mention of differences between the prevalence of other medications between the two groups. Unfortunately, there is also no indication on whether the patients receiving methylprednisolone were discharged earlier; the authors merely refer that the symptoms and signs improved faster.

      Corticosteroid have been widely used as therapy for acute respiratory distress syndrome (ARDS), including in infections by SARS-CoV, so these findings in COVID-19 patients are not unexpected. The implications of this study for the current pandemic due to SARS-CoV-2 require evaluation in future clinical trials, especially in a randomized way and in combination with and comparison to other immunosuppressive and immunomodulatory agents, including hydroxychloroquine. Nevertheless, based on this report, the intravenous application of methylprednisolone with the intention of strengthening the immunosuppressive treatment and controlling the cytokine storm appears to be safe in COVID-19 patients, and it might successfully shorten the recovery period.

      This review was undertaken by Alvaro Moreira, MD as part of a project by students, postdocs and faculty at the Immunology Institute of the Icahn School of Medicine, Mount Sinai

    1. On 2020-07-20 17:29:44, user Kamran Kadkhoda wrote:

      The following finding <br /> Using the pre-defined cutoffs, the sensitivity of IgG antibodies rose from 7% (<=7days) to<br /> 7% after 14 days of symptoms. The sensitivity of IgA and IgM rose to 91% and 81% 2-4 weeks<br /> post-symptom onset but dropped after 4 weeks to 57% and 40%, respectively.<br /> ...is classic for an anamnestic immune response especially given<br /> IgG showing up early similar to other studies and the half-life of<br /> IgG-plasmablasts suggesting response to previous response to common CoVs.

    1. On 2020-07-21 22:11:37, user BiotechObserver wrote:

      "Screened patients either had confirmed SARS-CoV-2 infections by PCR, or suspected disease, defined as being told by a physician that symptoms may be related to SARS-CoV-2 or exposure to someone with confirmed SARS-CoV-2 infection... In addition to screening potential donors, Mount Sinai also offered the Mount Sinai ELISA antibody test to all employees within our health system on a voluntary basis."

      It would be helpful to know what percentage of each of these 4 subsets of screened patients (out of the 51,829 total screened) were positive vs. negative on the ELISA test. <br /> Your sensible subsequent explanation on sensitivity findings notwithstanding, (it seems plausible you are detecting positive in the ELISA test >95% of those with a confirmed past infection), this breakdown would still be valuable from an epidemiology perspective.

      "The vast majority of symptomatic cases that were screened experienced mild-to-moderate disease, with less than 5% requiring emergency department evaluation or hospitalization."

      It would be helpful to visualize in a few additional figures the breakdown of various measures (titer ranges, decline/increase, neutralizing activity) stratified by severity of symptoms (asymptomatic vs. mild vs. moderate vs. hospitalized, or asymptomatic vs. mild/moderate vs. hospitalized).

      Thank you.

    1. On 2020-07-24 16:23:32, user David Gagnon wrote:

      Was any data collected on the numbers for the ages of the people in the same household?This section was badly written, is missing something, or I just don't understand something in the language. 15.5% is the lowest of those numbers and that seems odd, since the I would guess the majority of 2 person households to be couples without kids, both young and old, and in the latter case the secondary infection should have been notably high, right?<br /> There are also three number in the section that correspond to three groups:<br /> "The secondary infection risk for study participants living in the same household increased from 15.5% to 43.6%, to 35.5% and to 18.3% for households with two, three or four people respectively (p<0.001). "<br /> Is the 18.3% for households with more than 4 people? Is it then 18% per person?

    1. On 2020-07-09 21:41:42, user kpfleger wrote:

      Thank you for this study! Suggestions to improve the manuscript:<br /> (1) In the PDF version linked here from medRxiv (as of 2020/7/9), p.1 states the multivariate infection OR as 1.45 but p.5 & table 3 list it is 1.50. Minor discrepancy but good to get the p.1 results summary correct.<br /> (2) You imply in the conclusion that most of the 25OHD test results were recent, such as upon presentation to health services for illness, but it would be helpful for you to characterize the dates of the 25OHD tests in your cohort. (Eg, so we know they aren't as old as the 10+ year old UK Biobank results.)<br /> (3) It would be helpful to have the descriptive statistics for demographic and clinical characteristics for the hospitalized vs. non-hospitalized COVID-19-P patients---analogs of tables 1 & 2 stratified by hospitalized or not. I'm not even sure you say how many were hospitalized.<br /> (4) You gave the multivariate adjusted OR for infection and the OR for hospitalization given infection. It would be nice to state the adjusted OR for absolute risk of hospitalization (not specifically given infection) as this is perhaps more meaningful for public policy.

      I look forward to a couple more analyses like this from other large HMOs or government insurers around the world. I also look forward to data on post-hospitalization measures of severity (eg, ICU/ITU admission, fatality) in population cohorts this large.

    1. On 2020-08-28 07:43:07, user Hilda Bastian wrote:

      At several points, the authors rely on what's described as the "historical efficacy of passive antibody therapy for infectious diseases". This is based on a small amount of data, much of it from the pre-intensive care era, and none from a publication later than 2010. As a result, no randomized trial is included, as they were published after 2010: 2 NIH randomized trials of convalescent plasma in influenza and 2 randomized trials of IVIG for influenza. Meta-analysis shows no benefit. [1] This also fails to consider the post-2010 ebola outbreak, and the failure of convalescent plasma to improve mortality from that disease. [1] Thus, there is no historically proof of efficacy of convalescent plasma, and what randomized data exists, does not suggest there has been important benefit in the past.

      In addition to relying on this biased assessment of historical evidence to support a conclusion of effectiveness in this study, the authors cite this claim as a reason for not conducting a randomized trial: "Many COVID-19 patients would likely have been distrustful of being randomized to a placebo based on historical precedent". However, if they were accurately informed, prospective participants would be told there was no evidence of benefit. In a randomized trial in the Netherlands stopped because it was determined no benefit was likely in the study as designed, the authors reported that only 1 in 4 eligible patients declined, and that was typically because of fear of adverse events. [2] The requirement for adequate trial recruitment has more to do with doctors and patients in outbreaks not being misled about the state of uncertainty of this treatment.

      The authors argue that the patients in the Expanded Access Program are diverse. However, it is important to point out that their diversity is not representative of the people severely ill with Covid-19 and at risk of dying. For example, 19% of the group are Black, whereas the CDC reports that they are over 30% of those hospitalized with Covid-19 and twice as likely to die. [3,4]

      In respect of the representativeness of the small non-random sample described as "pseudo-randomized" in this preprint, no data is provided on the hospitals providing those samples.

      In addition, as others have already pointed out in a discussion linked here, [5] critical information on timing of deaths is not provided. Those transfused earlier in the "epochs" have far longer follow-up for deaths than the larger number more recently. Given that since early in the outbreak, it's been observed that deaths occur across 2 to 8 weeks from the onset of symptoms, [6] the impact of this could be substantial, as participation in the EAP was higher later. In the group on the Diamond Princess cruise, for example, per Wikipedia's tallying, half the deaths may have occurred in that second month [7], and assessment of mortality appropriately included censoring for this. [8] Case series in the US typically report substantial proportions of people still in intensive care at study's end.

      The authors' interpretation of their subgroup analysis based on a non-random set of blood samples preserved for blood bank quality assurance proceeds as though the safety of convalescent plasma for Covid-19 has been established, based on the data of their own uncontrolled study. However, controlled study is required to be certain, for example, whether plasma with lower levels of antibodies trigger antibody dependent disease enhancement. [5] As the FDA's memorandum reports that the results are also dependent on which assay results are used, this should be reported in any discussion of this subgroup analysis. [9]

      In the absence of adequate controlled study of convalescent plasma establishing that it does more good than harm in infectious respiratory disease generally in contemporary medical settings, and Covid-19 in particular, the authors' claim that their uncontrolled study provides "strong evidence" is unjustified.

      Disclosure: I have written about this study for the general public at WIRED, and am in the process of doing so at PLOS Blogs.

      [1] Devasenapathy (2020). https://www.cmaj.ca/content...

      [2] Gharbharan (2020). https://www.medrxiv.org/con...

      [3] CDC COVID-Net (2020). https://gis.cdc.gov/grasp/C...

      [4] CDC surveillance data (2020). https://www.cdc.gov/coronav...

      [5] Harrell (2020). https://discourse.datametho...

      [6] WHO (2020). https://www.who.int/docs/de...

      [7] Wikipedia (2020). https://en.wikipedia.org/wi...

      [8] Russell (2020). https://www.medrxiv.org/con...

      [9] FDA Clinical Memorandum (2020). https://www.fda.gov/media/1...

    1. On 2020-12-05 01:38:02, user ACE NYPD wrote:

      I have been using Betadine Gargle (.05% Povidine Iodine) as a gargle & nasal spray for months at 3 or 4 times a day. My wife, who has comorbidities also uses it. I have been exposed to Covid at least 4 times by others at work, and have always come back negative. Since I am in Tech Support, I have used keyboards and mice of infected persons. I am currently working from home until my latest exposure is 14 days since exposure. I took a rapid test that came back negative 6 days after the exposure, but I am still waiting on the PCR test I took the same day.

      I also take a vitamin D supplement.

      Do I think that the Betadine Gargle is preventing me from getting Covid, yes I do, but of course talk to your physician first. I have found these articles about Povidine Iodine and Covid:

      https://www.pulmonologyadvi...

      https://doi.org/10.1177/014...

      https://www.thailandmedical...

      Stay safe and informed.

    1. On 2020-12-29 00:33:03, user Olga Matveeva wrote:

      Several recent preprints support some of this manuscript findings.<br /> 1. Authors from Sweden and China in a study entitled “Pulmonary stromal expansion and intra-alveolar coagulation are primary causes of Covid-19 death” demonstrated that “The virus was replicating in the pneumocytes and macrophages but not in bronchial epithelium, endothelial, pericytes or stromal cells. doi: https://doi.org/10.1101/202...<br /> 2. Researchers in China concluded that “Collectively, these results demonstrate that SARS-CoV-2 directly neutralizes human spleens and LNs through infecting tissue-resident CD169+ macrophages.” They published a preprint entitled “The Novel Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) Directly Decimates Human Spleens and Lymph Node” doi: https://doi.org/10.1101/202...<br /> 3. Researchers in France demonstrated “that SARS-CoV-2 efficiently infects monocytes and macrophages without any cytopathic effect.” Their findings are reported in the preprint entitled “Monocytes and macrophages, targets of SARS-CoV-2: the clue for Covid-19 immunoparalysis” doi: https://doi.org/10.1101/202...<br /> 4. Researchers in Brasil investigated SARS-CoV-2 infection of PBMCs and found that in vitro infection of whole PBMCs from healthy donors was productive of virus progeny. They also found that “SARS-CoV-2 was frequently detected in monocytes and B lymphocytes from COVID-19 patients, and less frequently in CD4+T lymphocytes” The preprint is entitled “Infection of human lymphomononuclear cells by SARS-CoV-2”. <br /> doi: https://doi.org/10.1101/202...

    1. On 2021-01-13 16:49:11, user Ezequiel Petrillo wrote:

      Suppl. Table 2 has an error. Where it says that the Custom-made qPCR mix has 8 ul of MgCl2, it should say 4 ul (4mM final concentration), adding the extra volume with ultrapure water. We will fix this error in an updated version soon.

    1. On 2021-06-11 13:44:12, user Jay Alan Erdman wrote:

      Where to begin? 1) This is an abstract; not yet peer reviewed; so it's just four guys saying this. 2)They don't say how they got their population. 3) They provide no data. 4) There is no control group either randomized, case control, or cohort. 5) They really do not specify their methods. 6) There is no treatment protocol so we don't know what additional treatment they may have received. Overall this doesn't meet any scientific standard. But even if it had been well done; these patients presented in the Spring of 2020 when treatment protocols were very different and outcomes much worse. This study says nothing about whether HCQ would be of any benefit to patients receiving treatment in Spring 2021.

    1. On 2021-07-01 03:15:16, user Subhajit Biswas wrote:

      Really excited to see that our original observation that pre-exposure to dengue may be cross-protective against COVID-19, has been further supported by the following study from Brazil!

      Title of paper: Previous Dengue Infection and Mortality in Coronavirus Disease 2019 (COVID-19)

      Abstract: We studied 2351 participants with coronavirus disease 2019; 1177 (50%) reported previous dengue infection. Those without previous dengue had a higher risk of death (hazard ratio: .44; 95% confidence interval: .22–.89; P = .023) in 60-day follow-up. These findings raise the possibility that dengue might induce immunological protection against severe acute respiratory syndrome coronavirus 2.

      Link: https://doi.org/10.1093/cid...

      This perhaps explains why mortality in dengue endemic regions like the Indian subcontinent, Africa and SE Asia is about 10-times less compared to dengue non-endemic regions.

      Why Brazil is an exception?

      Read following publications:

      1. https://www.medrxiv.org/con... by Prof Miguel A L Nicolelis

      2. https://doi.org/10.1016/j.c... by our Group

      As of today, Worldometer says mortality per million population is 287 for India compared to 1863 for US & 1878 for UK (1st July, 2021). Indian population is almost 4-times the population of US with 0.4 million deaths compared to 0.62 million deaths in US.

    1. On 2021-08-31 01:53:43, user William Brooks wrote:

      The results of the proposed model rely on three questionable assumptions: 1) masks are effective at preventing infection [1]; 2) infection risk decreases as mask usage increases [2]; and 3) masks are more effective than ventilation [3].

      However, the authors ignore real-world data challenging these assumptions even though they reference the UK's Events Research Programme (ERP), which found little difference between Phase 1 events with and without mask requirements [4]. Moreover, recent ERP data for large-scale sporting events without mask requirements "demonstrate that mass participation events can be conducted safely, with case numbers comparable to, or lower than community prevalence" [5].

      In short, the authors should base their models on real-world data rather than unproven assumptions.

      [1] https://www.acpjournals.org...<br /> [2] https://escipub.com/irjph-2...<br /> [3] https://aip.scitation.org/d...<br /> [4] https://www.gov.uk/governme...<br /> [5] https://www.gov.uk/governme...

    1. On 2022-09-23 15:15:43, user Yu Li wrote:

      It is important to regularly check the primers and probe sequences of a PCR or qPCR assay against GenBank because newly generated sequences may cause erosions or failures of a published assay. The article Wide mismatches in the sequences of primers and probes for Monkeypox virus diagnostic assays | medRxiv attempted the in silico analysis of published monkeypox virus (MPXV) specific qPCR assays. However, the article contains numerous errors in its results, lacks experimental data to support its conclusions, and can impair the 2022 monkeypox outbreak response.

      The genome sequences of monkeypox virus (MPXV) are highly similar (~95% identical) to that of other species of orthopoxviruses (OPXV). The similarities between MPXV clade I and clade II are over 99%. Therefore, identifying a qPCR targeting site for primer and probe design that perfectly matches MPXV and contains enough sequence differences to differentiate other OPXV can be very challenging. The probe sequence of a qPCR assay is often given priority for target selection in assay development. Multiple studies have reported that PCR primer mismatches do not necessarily affect performance of a PCR assay. For example, Kwok S et al (1) and Christopherson C et al (2) showed that up to 4 mismatches in the primer-template duplexes (28 and 30 base primers) did not have a significant effect on RT-PCR (the sequence similarity is as low as 80%). The mismatch positions and type of nucleotides involved in the mismatch play important roles. The buffer and annealing temperature used in a PCR assay can also be critical in determining the assay’s performance. A single base mismatch in the reverse primer of the Orthopoxvirus generic OPX3 assay led to a 100-fold decrease of the sensitivity of this assay in detecting the 2022 monkeypox outbreak predominate strain (clade IIb, lineage B.1) in one buffer (3) but switching to a different PCR buffer nearly reversed this lost sensitivity. This example highlights the critical nature of performing laboratory validation testing to ensure specificity and sensitivity. The published MPXV qPCR assays have largely been validated by inclusivity and exclusivity panels (4), and the MPXV_G2R generic assay has been used extensively without sensitivity issues in detecting different clades of MPXV. This article made claims that “Our results show that the current MPV real-time generic assay may be unsuitable to accurately detect MPV” without any supporting experimental data. In addition, the title of the article is misleading without supporting data and can lead to uncertainty surrounding MPXV diagnostics.

      The authors performed sequence similarity analysis of 8 published MPXV qPCR assays, including three CDC qPCR assays specifically designed to detect all MPXV isolates (generic assay), only clade I isolates (MPXV clade I assay) and only clade II isolates (MPXV clade II assay). In Figure 1, the detailed sequences alignment of MPXV generic assay MPXV_G2R were presented relative to the sequence of MPXV clade I. The authors showed two sets of primers; one set of primers, MPV-F-mu/MPV-R-mu, perfectly matches with MPXV clade IIb, lineage B.1 and contain a single mismatch for both the forward and reverse primers compared to originally published primer sequences. The MPXV_G2R generic assay was designed to detect both monkeypox clade I and clade II (4), and the primer sequences were designed using the MPXV clade I sequence. The publication of the MPXV G2R generic assay showed that this assay detects both clade I and clade II of MPXV (4). The MPXV G2R generic assay has been used for MPXV diagnostics since its publication in our laboratory and demonstrates no differences in the sensitivity of detecting MPXV clade I and clade II. Clinical diagnostic data confirmed that the limited primer mismatches have little effect on the performance of the MPXV_G2R generic assay under current protocols.

      In Figure 2 panel A, the authors claimed that the MPV_G2R_WA-P, the probe sequence of MPXV clade II specific assay, contains the Mutation1 sequences, which are in 4.2% of 683 MPXV genome sequences the authors have included in their analysis. However, there are no genome sequences from MPXV clade II containing the Mutation1 sequences by the BLAST analysis of GenBank database. It is likely that the authors mistakenly used the sequences from MPXV clade I (MPXV Congo basin) as the Mutation1 sequences of clade II (West Africa clades). MPV_G2R_WA-P was designed to specifically detect MPXV clade II; the probe targeting sequences contain a 3 base deletion compared to clade I. <br /> If the authors have sequence data supporting their claims of genome sequences of MPXV clade II containing the Mutation1 sequences, they should make these available for others to analyze.

      We are deeply concerned about the errors in this article and the lack of experimental data to support the authors’ conclusions. The authors should promptly address the issues raised here and consider the potential negative impact of this article on the MPXV diagnostics in 2022 monkeypox outbreak responses.

      References<br /> 1. Kwok S, Kellogg DE, McKinney N, Spasic D, Goda L, Levenson C, Sninsky JJ. Effects of primer-template mismatches on the polymerase chain reaction: human immunodeficiency virus type 1 model studies. Nucleic Acids Res. 1990 Feb 25;18(4):999-1005. doi: 10.1093/nar/18.4.999. PMID: 2179874; PMCID: PMC330356.<br /> 2. Cindy Christopherson, John Sninsky, Shirley Kwok, The Effects of Internal Primer-Template Mismatches on RT-PCR: HIV-1 Model Studies, Nucleic Acids Research, Volume 25, Issue 3, 1 February 1997, Pages 654–658, https://doi.org/10.1093/nar...<br /> 3. Crystal M. Gigante, Bette Korber, MatthewH. Seabolt, Kimberly Wilkins, Whitni Davidson, Agam K. Rao, Hui Zhao, Christine M. Hughes, Faisal Minhaj, Michelle A. Waltenburg, James Theiler, Sandra Smole, GlenR. Gallagher, David Blythe, Robert Myers, Joann Schulte, Joey Stringer, Philip Lee, Rafael M. Mendoza, LaToya A. Griffin-Thomas, Jenny Crain, Jade Murray, Annette Atkinson, AnthonyH. Gonzalez, June Nash, Dhwani Batra, Inger Damon, Jennifer McQuiston, Christina L. Hutson, Andrea M. McCollum, Yu Li. Multiple lineages of Monkeypox virus detected in the United States, 2021- 2022 bioRxiv 2022.06.10.495526; doi: https://doi.org/10.1101/202...<br /> 4. Li Y, Zhao H, Wilkins K, Hughes C, Damon IK. Real-time PCR assays for the specific detection of monkeypox virus West African and Congo Basin strain DNA. J Virol Methods. 2010 Oct;169(1):223-7. doi: 10.1016/j.jviromet.2010.07.012. Epub 2010 Jul 17. PMID: 20643162

    1. On 2022-12-15 10:49:12, user Author wrote:

      We would like to reply to a comment entitled “Japan preprint on myocarditis used inadequate methods to suggest COVID-19 vaccines cause more myocarditis deaths”: a review by Health Feedback (Editor: Ms. Flora Teoh). <br /> https://healthfeedback.org/...

      We thank them for commenting on our paper. We understand their main points of criticism were three summarised as followings:

      1. Comparison of pre-pandemic and post-pandemic rates of myocarditis death (their 2nd point)

      2. No examination of history of myocarditis death and ignored COVID-19 as the cause (their 1st point)

      3. Sample size was too small to discuss causality

      1. comparison of pre-pandemic and post-pandemic rates of myocarditis death (their 2nd point)

      Their 2nd point is based on the fundamental misunderstanding on the methods of our study. They erroneously stated "The authors' association of change in the risk of myocarditis death associated with COVID-19 vaccines was based on comparing pre-pandemic and post-pandemic rates of myocarditis death". <br /> We compared myocarditis mortality in the SARS-CoV-2 VACCINATED population with that of the 2017-2019 (pre-pandemic period: reference) population; we did NOT compare myocarditis mortality between POST-PANDEMIC and pre-pandemic periods.<br /> Because of the misunderstanding the fundamental methods of our study, the following criticism have no sense:<br /> “But this assumes that the only thing that changed between the two periods is the availability of the COVID-19 vaccines. It excludes, without justification, the possibility that COVID-19 itself could produce an increase in myocarditis deaths. No reason was given by the authors for excluding COVID-19 as a potential explanation, despite the fact that COVID-19 is a more likely explanation than COVID-19 vaccines for an increase.” “This is because we know—based on previous published studies—that COVID-19 is more likely to lead to cardiac complications than the vaccines [1,2]. Therefore, the alleged causal association rests on the assumption that only COVID-19 vaccines can explain the change in myocarditis mortality, which isn’t true.”<br /> However, we would like to comments on “COVID-19 is more likely to lead to cardiac complications than the vaccines” referring reports by Block et al [1], and Patone et al [2,3].<br /> It is important to consider following three points; vaccines are not given to dying persons and to persons with fever or other acute diseases. Hence vaccinated people are relatively healthier than the non-vaccinated (healthy vaccinee effect) [4]. Conversely, vulnerable persons (frail, suppressed immunity due to stress or sleep debt etc) are more likely to be infected with SARS-CoV-2 (vulnerability confounding bias: VCB) [5].

      Patone et al. [1] stated in the discussion section as follows: “Of note, the estimated IRRs were consistently <1 in the pre-exposure period before vaccination. ---- This was expected because events are unlikely to happen shortly before vaccination (relatively healthy people are receiving the vaccine).” This is exactly the same as the healthy vaccinee effect [4] and it is the lowest at day 0 of vaccination [2]: for example, IRR of arrhythmia at day 0 of BNT162b2 vaccination was 0.33 (0.29 to 0.37) compared with 0.72(0.70 to 0.73) during -28 to -1 days before vaccination [2]. <br /> Paton et al [1] also discussed that the estimated IRRs were consistently >1 in the pre-risk period before a SARS-CoV-2–positive test. They thought that events are more likely to happen before a SARS-CoV-2–positive test (as a standard procedure, patients admitted to the hospital are tested for SARS-CoV-2). But they missed to discuss that IRRs on day 0 of vaccination are the most prominent (with 10 times more than that in the pre-risk period, because standard testing of SARS-CoV-2 is mostly done on the day of admission). Hence, constant IRR >1 during -28 to -1 days before vaccination may be another cause. It may be explained by the vulnerability confounding bias [5].<br /> We estimated the effect of vulnerable person’s susceptibility to infection (vulnerability confounding bias: VCB) from the pre-risk period (-28 to -1 days) of the SARS-CoV-2 test-positive group: 2.84 (1.89 to 4.28) for myocarditis and 4.82 (4.68 to 4.97) for arrhythmia. When applied these data for the index of VCB, VCB-adjusted IRRs are 3.44 (2.11 to 5.59) and 1.11 (1.07 to 1.16) which are similar to or less than the healthy vaccinee effect adjusted IRRs of myocarditis (3.97: 3.05 to 5.16) and arrythmia (2.70: 2.38 to 3.05) respectively [4].<br /> It is not possible to estimate the healthy-vaccinee effect and VCB directly from the report of Block et al [3], however, post-SARS-CoV-2 infection/post-vaccination myocarditis risk ratios may be less than 1.00 in almost half of those listed when above adjustments were applied.

      2. No examination of history of myocarditis death and ignored COVID-19 as the cause (their 1st point)

      This point is also derived from the fundamental misunderstanding on the methods of our study. We did NOT compare myocarditis mortality between POST-PANDEMIC and pre-pandemic periods BUT compared SARS-CoV-2 VACCINATED population for 28 DAYS after vaccination with pre-pandemic periods. <br /> Therefore, as a rule, deaths following SARS-CoV-2 infection were not included in this study. In fact, none had COVID-19 listed in the death cause column of cases included in this analysis.<br /> Moreover, in the MHWL list we referred; most deaths included brief medical history as well as the cause of death. We clearly stated that “these were myocarditis death cases reported by physicians as serious adverse reactions to the vaccine” in the Methods section.<br /> Furthermore, as we stated in the discussion section, myocarditis deaths in the 2017-2019 (reference) population were also based on a doctor's diagnosis, with no other medical history known. Mevorach et al [6] also analysed using the same methodology and already published as a peer reviewed paper.

      3 Sample size is too small to discuss causality

      This point is also derived from the fundamental misunderstanding on the methods of our study. We compared SARS-CoV-2 VACCINATED population for 28 DAYS after vaccination with pre-pandemic periods. Hence this sample size was enough to demonstrate increased myocarditis mortality rate ratio after vaccination.<br /> As we stated in the end of the discussion section and in supplemental Table S6, all of the Modified US Surgeon General criteria for causal were satisfied.

      Sincerely,<br /> Watanabe and Hama.

      References<br /> [1] Block JP, Boehmer TK, Forrest CB, et al. Cardiac Complications After SARS-CoV-2 Infection and mRNA COVID-19 Vaccination - PCORnet, United States, January 2021-January 2022. MMWR Morb Mortal Wkly Rep 2022; 71:517-23. DOI: http://dx.doi.org/10.15585/...<br /> [2] Patone M, Mei XW, Handunnetthi L, et al. Risk of Myocarditis After Sequential Doses of COVID-19 Vaccine and SARS-CoV-2 Infection by Age and Sex. Circulation. 2022; 146(10):743-54. doi:10.1161/CIRCULATIONAHA.122.059970<br /> [3] Patone M, Mei XW, Handunnetthi L, et al. Risks of myocarditis, pericarditis, and cardiac arrhythmias associated with COVID-19 vaccination or SARS-CoV-2 infection. Nat Med. 2022; 28(2):410-22. doi:10.1038/s41591-021-01630-0<br /> [4] Hama R and Watanabe S. The risk of vaccination may be higher by considering “healthy vaccinee effect” Response to Husby et al: https://doi.org/10.1136/bmj... (Published 16 December 2021)<br /> Available at: https://www.bmj.com/content...<br /> (Accessed 30 November 2022)<br /> [5] Hama R and Watanabe S. Vulnerability confounding bias should be taken into account in assessing risk of post SARS-CoV-2 infection: an opposite concept of healthy-vaccinee effect (Under submission)<br /> [6] Mevorach D, Anis E, Cedar N, et al. Myocarditis after BNT162b2 mRNA Vaccine against Covid-19 in Israel. N Engl J Med. 2021; 385(23):2140-49. doi:10.1056/NEJMoa2109730

    1. On 2023-10-22 23:07:06, user CDSL JHSPH wrote:

      Hello! Thank you for sharing your work with us. I believe that your work in identifying barriers of transitioning from acute care of substance use disorder (SUD) to community-based treatment is a big first step to making a change in providing impactful support to SUD patients. I wanted to start off with saying I think the title of the topic is well framed, it conceptualizes exactly what to expect in the paper including the research focus of transitions of SUD patients from acute-care settings to community-based setting, it also gives an insight to the methods and understanding that the paper will aim to categorize the strategies. There were a few comments and questions that I think may help the paper and my understanding of this paper.<br /> 1) The Abstract: I really like the breakdown structure of the abstract, it makes it easier to read. I do believe an extra line could be added to the background section of the abstract that indicates a direct connection of the research results to its direct use in the bigger issue. I think adding something like the sentence on Line 4, page 5 would help the reader make this connection. <br /> 2) Results and Figures: I felt as through a pie chart could be used to summarize a few things in this section. It would make it easier to read in a way and represent what portion if the category was taken from the whole picture. An example of this could be during the Additional IntervenntionC Components across Care Continuum. The Table is very helpful, but a graphic figure may help readers understand the results in a better way.<br /> 3) Discussion: The need for more literary review was repeated multiple times throughout the discussion and I was wondering if there was a way of indicating this limitation’s importance without the repetition of it. <br /> Overall, I really enjoyed reading this paper. It was well-written and easy to follow. I hope that this paper makes the effect it intends to, and I hope to follow up with future research in which these strategies, barriers and facilitators are put to the test. I think this is a great step to making a big difference in addiction medicine.

    1. On 2021-08-29 21:38:00, user MANISH JOSHI wrote:

      We must stop ignoring natural immunity - it’s now long overdue<br /> Manish Joshi, MD

      This article by Gazit et al is another addition to a growing body of literature supporting the conclusion that natural immunity confers robust, durable, and high-level protection against COVID-19 (1-4). Yet some scientific journals, media outlets, and public policy messaging continue to cast doubt. That doubt has real-world consequences, particulary for resource limited countries. We would like to review available data.

      Infection generates immunity. The “SIREN” study in the Lancet addressed the relationships between seropositivity in people with previous COVID-19 infection and subsequent risk of severe acute respiratory syndrome due to SARS-CoV-2 infection over the subsequent 7-12 months (1). Prior infection decreased risk of symptomatic re-infection by 93%. A large cohort study published in JAMA Internal Medicine looked at 3.2 million US patients and showed that the risk of infection was significantly lower (0.3%) in seropositive patients v/s those who are seronegative (3%) (2).

      Perhaps even more important to the question of duration of immunity is a recent study that has demonstrated the presence of long-lived memory immune cells in those who have recovered from COVID-19 (3). This implies a prolonged (perhaps years) capacity to respond to new infection with new antibodies.

      In contrast to this collective data demonstrating both adequate and long-lasting protection in those who have recovered from COVID-19, the duration of vaccine-induced immunity is not fully known- but breakthrough infections in Israel, Iceland and in the US suggests few months. Before CDC decided to stop collecting data on all breakthrough infections at the end of April, 2021, it reported >10,000 breakthrough infections (2 weeks after completion of vaccination) in the US, with a mortality of ~2% (5). Booster COVID vaccine recommendations have been already announced in Israel and in the US proving ineffectiveness within 6 months.

      How should we use the collective data to prioritize vaccination? These new data support simple and logical concepts. The goal of vaccination is to generate memory cells that can recognize SARS-CoV-2 and rapidly generate neutralizing antibodies that either prevent or mitigate both infection and transmission. Those who have survived COVID-19 must almost by definition have mounted an effective immune response; it is not surprising that the evolving literature shows that prior infection decreases vulnerability. In our view, the data suggest that people confirmed to have been infected with SARS-CoV-2 may not need vaccination. We should not be debating the implications of prior infection; we should be debating how to confirm prior infection (6).

      Manish Joshi, MD<br /> Thaddeus Bartter, MD<br /> Anita Joshi, BDS, MPH

      1. Hall VJ, Foulkes S, Charlett A et al. SARS-CoV-2 infection rates of antibody-positive compared with antibody-negative health-care workers in England: large, multicentre, prospective cohort study (SIREN). Lancet. 2021
      2. Harvey RA, Rassen JA, Kabelac CA, et al. Association of SARS-CoV-2 Seropositive Antibody Test With Risk of Future Infection. JAMA Intern Med.
      3. Turner, J.S., Kim, W., Kalaidina, E. et al. SARS-CoV-2 infection induces long-lived bone marrow plasma cells in humans. Nature 2021
      4. Wang, Z., Yang, X., Zhong, J. et al. Exposure to SARS-CoV-2 generates T-cell memory in the absence of a detectable viral infection. Nat Commun 12, 1724 (2021).
      5. https://www.cdc.gov/mmwr/vo...
      6. Kuehn BM. High-Income Countries Have Secured the Bulk of COVID-19 Vaccines. JAMA. 2021;325(7):612
    1. On 2020-09-18 20:29:54, user David C. Norris, MD wrote:

      This paper is fundamentally misconceived:

      Biostatistically

      This paper apparently arises out of the biostatistical perspective which presently dominates the design and analysis of dose-finding trials in oncology. Yet even by purely statistical standards, it suffers serious shortcomings. Most notably, it looks for an interaction (viz., dose-response) without first demonstrating or ensuring the existence of a main effect. Reference #153 in this paper (Hazim et al. 2020) reported a 5% median response rate in a systematic review of recent dose-finding trials. Would the authors venture to estimate what fraction of their 93 ‘analysis series’ employed a drug with a substantial therapeutic effect? Some indication might be found in what fraction of the treatments unequivocally demonstrated a therapeutic effect in subsequent phase 2 or 3 trials. Adashek et al. (2019) document a secular trend in overall response rate (ORR) observed in phase 1 trials which is “now almost 20%, or even higher (~42%) when a genomic biomarker is used for patient selection.”

      Also arguably well within the purview of biostatistics would have been a decision-theoretic framing of phase 1 cancer trials. These trials may be understood as the earliest clinical steps in a learn-as-you-go (adaptive) drug-development process (Palmer 2002; Berry 2004). On such an understanding, aiming to treat early-phase participants at maximum tolerated doses (MTDs) in no way “dictates that an assumption is made … that higher doses are always more efficacious” (p. 4; italics in original). The authors’ use of “dictates” suggests they see something of logical necessity in this, and their further insertion of the logical quantifier “always” only exacerbates their overreach in formulating this central tenet of their study. Even the distinction between a logical assumption and a statistical prior gets lost in the shuffle. To remedy all this, the authors might consider attempting to state formally their understanding of the individual phase 1 trial participant’s decision-problem, complete with its essential uncertainties and some plausible utilities. (Within the community of investigators whom they address in the final paragraph of their Discussion, there is, I believe, broad agreement on the doctrine that these trials have therapeutic intent (Weber et al. 2016; Burris 2019). The authors would do well to take this patient-centered view as their starting point, as opposed to the dose-centered and unitary goal they proclaim at the end of their current Discussion.)

      Furthermore, statistics is nothing if not a discipline for “mastering variation” (Senn 2016), and a paper that sets out to question the strict monotonicity of dose-efficacy ought also enquire as to the presence of inter-individual heterogeneity in dose-response. Note that such heterogeneity would tend to attenuate the maximum slope of a convex dose-response in aggregate.

      Finally, the absence-of-evidence fallacy is widely appreciated among professional statisticians, yet seems to have been indulged liberally here without any safeguards such as are usually provided by power calculations.

      Pharmacologically

      Within statistics, there is a doctrine that statistical analysts should always engage ‘subject-matter experts’. But one sees in this paper no sign that any pharmacological concepts—let alone expertise—have been brought to bear on what would seem to be a pharmacological question. At a minimum, in any serious challenge to the ‘MTD heuristic’—as I have called it—one expects to find distinctions between on-target and off-target toxicities. In an analysis that invokes dose-response plateaus (whether these are conceived as approximate or absolute in this paper remains unclear), we ought to find discussion of receptor occupancy and saturation as underlying realistic mechanisms.

      To some extent, a neglect of subject-matter knowledge may be embedded in the very form of the present analysis, which tries to deal with its question in aggregate (through statistical techniques such as standardization) rather than in its particulars.

      Clinically

      In the final paragraph of their Discussion, the authors proffer advice to clinical investigators. In light of the limitations—statistical, logical, subject-matter—catalogued above, this is premature and should be omitted. Any given phase 1 clinical investigator will be considering a candidate drug in its particulars, conditional on a great deal of preclinical data and perhaps even nontrivial PKPD and systems-pharmacology modeling. The authors acknowledge as much (p. 16), seeming to appreciate that they have conducted an unconditional analysis of highly conditioned decision-making. To investigators thus intimately engaged with pharmacologic particulars, the null conclusions from a marginal analysis such as this one can contribute little useful guidance. If it were proposed to submit this work for peer review in substantially its present form, only a statistical audience should be addressed—and then solely with a cautionary note that the finding of a dose-response interaction will not leap out at a statistician from a convenience sample of phase 1 studies in which a therapeutic main effect remains dubious and unexamined. The main lesson of this work is that statisticians ought to investigate questions of pharmacology in their particulars, and with recourse to subject-matter concepts and expertise.

      References

      Adashek, Jacob J., Patricia M. LoRusso, David S. Hong, and Razelle Kurzrock. 2019. “Phase I Trials as Valid Therapeutic Options for Patients with Cancer.” Nature Reviews Clinical Oncology, September. https://doi.org/10.1038/s41....

      Berry, Donald A. 2004. “Bayesian Statistics and the Efficiency and Ethics of Clinical Trials.” Statistical Science 19 (1): 175–87. https://doi.org/10.1214/088....

      Burris, Howard A. 2019. “Correcting the ASCO Position on Phase I Clinical Trials in Cancer.” Nature Reviews Clinical Oncology, December. https://doi.org/10.1038/s41....

      Hazim, Antonious, Gordon Mills, Vinay Prasad, Alyson Haslam, and Emerson Y. Chen. 2020. “Relationship Between Response and Dose in Published, Contemporary Phase I Oncology Trials.” Journal of the National Comprehensive Cancer Network 18 (4): 428–33. https://doi.org/10.6004/jnc....

      Palmer, C. R. 2002. “Ethics, Data-Dependent Designs, and the Strategy of Clinical Trials: Time to Start Learning-as-We-Go?” Statistical Methods in Medical Research 11 (5): 381–402. https://doi.org/10.1191/096....

      Senn, Stephen. 2016. “Mastering Variation: Variance Components and Personalised Medicine.” Statistics in Medicine 35 (7): 966–77. https://doi.org/10.1002/sim....

      Weber, Jeffrey S., Laura A. Levit, Peter C. Adamson, Suanna S. Bruinooge, Howard A. Burris, Michael A. Carducci, Adam P. Dicker, et al. 2016. “Reaffirming and Clarifying the American Society of Clinical Oncology’s Policy Statement on the Critical Role of Phase I Trials in Cancer Research and Treatment.” Journal of Clinical Oncology 35 (2): 139–40. https://doi.org/10.1200/JCO....

    1. On 2021-07-29 07:51:21, user Portal Cedip wrote:

      I am surprised that a country that was punished so badly by COVID-19, due to its nihilism, purely academic debates which misled the point even after recognizing that SARS-Cov-2, get the boy sick kills children and young people, but -you know Winston- their finest hour will not come until they get sick and die in numbers that do not even represent the TOTAL burden of the disease (just 6,340 boys, of which 700 got the PICU and oh, maybe 13 died, eventually more. Who cares? Just another non caucasic problem. Sense of safety for a far away condition that colonize, infects, make CYP get hospitalized, complicates 10% and kills with a lethality of 2% ONLY. I saw my pediatric unit got exhausted due to the large number of teleconferences with boys we could not hospitalize. The crisis was burning out or infecting our teams. We were under attack but the non-traslational sweatless sirs were complaining about us being hysterical and overplaying our hands with our small patients. And our government made the impossible. A country ranked 27 in Health Services got top 10 in number of cases and deaths / 100, 000. We did not see our boys dying in front of us. But got overwhelmed at all ages. Our 19 million people´s country got 130.000 CYP infected, Three thousand were hospitalized, half of them had a critical trajectory or came back from home with TIMPS. One hundred died. Eighteen had less than 1 yo. That´s crude data. Most of it occurred during the second wave, after we naivly thought we had gotten rid of the virus (Christmas 2020). But the virus gave itself a gift from England: the variant Delta, which seized the country for 4 additional months. Now is calmed again. You trust that it got surrended to vaccination, a plan that already involves more than 65% of the population. <br /> NO<br /> I do not.

      My best wishes. With personal regards from the very south of the world,

      Ricardo

    1. On 2021-11-16 11:57:33, user disqus_aUdf6iYESf wrote:

      This is an interesting study, and not an easy one to do. I congratulate the authors on their work.

      I agree with the authors that the study is hypothesis generating.

      A few questions/comments:

      1) The authors describe no delay as being "score >2 SDs above the population mean". If no delay is the inverse of delay, I think this should be "a score higher than the cutoff for delay of 2 SD below population mean." A score >2 SD above population mean would include a very small proportion of children (about 2.5% in the population) of developmentally advanced children.

      2) As the authors note, using a questionnaire (Age and Stages) by phone is not ideal for evaluation, and responses could be biased by parental knowledge of maternal SARS-CoV-2 infection.

      3) The numbers with infection in the first trimester are small (only 5 children), but 4/5 (80%) had developmental delays, as compared to 6/20 in second trimester (30%), and 20/273 with infection in third trimester (7.3%). Those are striking differences, with a "dose-response" type pattern by trimester, but the numbers are small, so this study would need to be replicated by other groups, ideally with testing with the Bayley scales or other administered instrument.

      4) A control group without SARS-CoV-2 infection would be important as an additional comparison group, and was not present. This would give a sense of whether in the population who responded and were assessed by phone questionnaire, the rate of developmental delay (score < mean - 2SD) was similar to that expected in the general population.

      For all of these reasons, I think further studies are required to definitively state that maternal SARS-CoV-2 infection in the first or second trimester is associated with developmental delay, but this study provides preliminary data that this might be the case. It appears other studies in progress propose to prospectively address this question (e.g., PROUDEST study in Brazil), and such studies are required for a more definite answer as to whether SARS-CoV-2 infection early in pregnancy affects child neurodevelopment outcomes.

    1. On 2024-12-01 14:50:07, user xPeer wrote:

      Courtesy review from xpeerd.com

      Summary

      The preprint titled "Financial incentives to motivate treatment for hepatitis C with direct acting antivirals among Australian adults" investigates how financial incentives influence the initiation of direct-acting antiviral (DAA) therapy for untreated hepatitis C virus patients in Australia. Utilizing Bayesian adaptive design, the study assigns participants varying levels of financial incentives to observe which incentive levels effectively promote treatment initiation. The study is thorough in detailing statistical methods, including primary and secondary analysis plans, making it potentially influential for public health policy.

      Major Revisions

      1. Methodological Concerns:
      2. Futility Stopping Rules: The document briefly mentions futility stopping rules for eliminating less effective incentive levels. More detailed explanations and specific thresholds for these rules should be provided to ensure transparency and reproducibility (Page 2, Abstract).
      3. Bias and Confounding Variables: While the study employs Bayesian adaptive design and randomization, there is insufficient discussion on potential biases and confounding variables that could affect the study's results, such as differences in demographic variables, healthcare access, or socioeconomic status (Page 9, Study Design).

      4. Data Accessibility:

      5. Availability of Data for Replication: The document should explicitly state how and where the data will be made available for replication purposes, adhering to good scientific practices (Page 12, Data Availability Statement).

      6. Outcome Measures and Analysis:

      7. Primary Outcome Definition: There is a need for a more precise definition and justification of the primary outcome measure, namely DAA initiation within 12 weeks (Page 3, Primary analysis).
      8. Secondary Outcomes and Analysis: The description of secondary outcomes such as the number of missed DAA days and HCV PCR test results should be more detailed with clear operational definitions and analysis plans (Page 7, Secondary analyses).

      Recommendations

      1. Clarify Methodology: Provide a more in-depth explanation of the futility stopping rules, including specific criteria and decision-making processes to increase transparency (Page 2, Abstract).
      2. Address Potential Biases: Incorporate a section that addresses potential biases and confounding variables, explaining how these will be managed in the analysis (Page 9, Study Design).
      3. Enhance Data Accessibility: Ensure that the data is accessible to other researchers for replication, and clearly state the data-sharing mechanisms (Page 12, Data Availability Statement).
      4. Refine Outcome Measures: Define the primary and secondary outcome measures more clearly and provide detailed plans for their analysis (Page 3, Primary analysis and Page 7, Secondary analyses).

      Minor Revisions

      1. Typographical Errors:
      2. Replace "DAA’s" with "DAAs" (Page 9, Background).
      3. Replace "payment amounts are made" with "payments are made" (Page 2, Abstract).

      4. Formatting Issues:

      5. Standardize the presentation format of equations and mathematical notations to enhance readability (Page 6, Effect of co-incentives).

      6. AI-Generated Content Analysis:

      7. There is no explicit indication of AI-generated text. However, ensuring all elements are rigorously checked for epistemic accuracy and coherence is crucial, given the post-2021 publication date.
    1. On 2025-02-12 19:59:20, user Aron Troen wrote:

      Review Part II

      Methodological shortcomings<br /> Study population and period: The population demographics used as the denominator of per capita caloric requirement rely on census data from 2017 and UN OCHA reports on movement and displacement of the population between Gaza governates during the war. The study states that no adjustments were made for out-migration or excess deaths. However, approximately 150,000 people left the Gaza strip from the beginning of the war until the Rafah crossing was closed in May. When added to casualties and a natural death rate of ~5500 people per year, this means that the population denominator used to calculate the food supply in Kcal per person-day (Figure 4) was overestimated by ~ 200,000 people, which would result in the underestimation of the food supply by approximately 10%.

      The authors acknowledge the limitation that “There remains considerable uncertainty about our population denominators in the north, and even moderate error in these would have affected our Kcal per capita estimates. Gaza’s population has probably decreased due to high mortality and out-migration…”. Nevertheless, they shrug off this limitation by asserting that “…we expect this to have only marginally affected our estimates.” without explaining why.

      Data on truck deliveries

      The comparison between UN and Israeli shipping data is superficial and inadequate for supporting the decision to dismiss and exclude the data from the analysis. The authors fail to discuss the literature, of which they surely must be aware, which addresses the high-profile controversy over the number of trucks supplying aid to Gaza and the discrepancies between the UN and COGAT data, and which notes the under-reporting of private sector food shipments by the UN (see for example, Rosen, Bruce and Nitzan, Dorit, Humanitarian Food Aid for Gaza: Making Sense of Recent Data (June 02, 2024). Available at http://dx.doi.org/10.2139/ssrn.4851635) "http://dx.doi.org/10.2139/ssrn.4851635)") .

      Although the authors note the "large discrepancy between UN and Israeli government data" on the entrance of goods into Gaza, they erroneously assert that UNRWA monitored the composition of “ALL trucks” crossing into Gaza, despite the partial coverage of non-UN food consignments, and despite disclaimers published by UNRWA and recorded by the authors, that the data from May-August are incomplete. The authors make little effort to help the reader understand the reason for the discrepancy nor to explain how they reached the conclusion that UNRWA's dataset "appeared highly complete and well-curated, but may be biased by systematic under- or over-reporting unknown to us". Instead of making a serious effort to include COGAT data to improve the accuracy of their simulation, they perform a perfunctory comparison of the UN and COGAT data and justify the summary dismissal of the Israeli registry, using the categorical listing of truck weight registered by COGAT as “evidence of digit heaping or crude approximation”. This is a peculiar choice, given the importance of the COGAT dataset, which is included in the June IPC report and in a working paper that the authors cite that analyzes the caloric content of food supplied to Gaza, including private sector shipments that are missing from the UN data (now published at https://ijhpr.biomedcentral.com/articles/10.1186/s13584-025-00668-6) "https://ijhpr.biomedcentral.com/articles/10.1186/s13584-025-00668-6)") . An alternative choice might have been to simulate the weight and contents of the COGAT data like the authors did for incomplete WFP data, or to perform a sensitivity analysis and compare how caloric supply estimates might differ based on the data and assumptions used.

      Instead, the study implies that the discrepancy has to do more with weight of aid reported rather than the number of trucks. However, significant gaps are also evident in the number of trucks reported. For example, in February, UNRWA reported 1,857 trucks carrying food while COGAT's figure is 15% higher (2,117). In January the gap is equally large, with COGAT's number of trucks 13% higher than UNRWA's (3,364 and 2,990 respectively). According to COGAT, between January and May 2024, "as a result of the UN’s partial counting… there are 3,406 trucks missing from their Kerem Shalom data and 2,198 trucks missing from their Nitzana/Rafah data." ( https://govextra.gov.il/media/dtmhzmtn/discrepancies-in-un-aid-to-gaza-data-2.pdf) "https://govextra.gov.il/media/dtmhzmtn/discrepancies-in-un-aid-to-gaza-data-2.pdf)") . Furthermore, the period analyzed covers several unexplained changes in UNRWA's dashboard ( https://honestreporting.com/how-unrwa-covers-up-its-faulty-gaza-food-data/) "https://honestreporting.com/how-unrwa-covers-up-its-faulty-gaza-food-data/)") , apparently following data-driven criticism about its methodology and lack of transparency on social media ( https://x.com/AviBittMD/status/1780052840930578499) "https://x.com/AviBittMD/status/1780052840930578499)") . According to a FEWS NET report, "on September 8… UNRWA’s dashboard was updated with additional supply data for August, as well as for previous months, including commercial truck entries as reported to UNRWA." UNRWA has not disclosed where the new data on commercial trucks came from or how far back the data update had gone.

      The subsequent calculation of caloric availability includes a mix of registered and simulated data, in which the simulation parameters extremely underestimate the caloric supply. The model derives the simulated distribution of estimated Kcal per truck as described in the methods and shown in supplementary figure A1: “We reconstructed the number of these trucks over time based on published information and data shared by WFP . As no data on content were available, we simulated their caloric equivalent by repeatedly sampling from the empirical distribution of calories per truck obtained from the UNRWA dataset.“ There are several problems with this approach. First, it is unclear which specific truck data “shared by WFP” were used for this simulation, and whether they are publicly available. This should be clearly indicated in the uploaded github data files. Moreover, the WFP records the contents of their shipments. Why were their contents omitted in this case? Presenting summary tables in the article would help the orient the reader to the source data for the truck counts used, distinguishing between simulated or assumed and actual contents. An implicit assumption underlying the simulation of WFP contents according to estimated distribution of calories by UNRWA trucks, is that the contents of UNRWA and WFP shipments are the same. This needs to be documented or the assumption should be made explicit. Given that the study appears to significantly underestimate the weight of the UNRWA pallets, the procedure used would be expected to propagate biased estimates lower than the actual weights to the WFP data as well.

      The most critical problem in the model is with the ASSUMED weights that the authors assign to the consignments. They assume mean pallet weights to be 637.5 kg per pallet, with a minimum to maximum weight of 510-765 kg per pallet (gaza_food_data.xlsx, general tab), based on citations 23, 30 and 31. Citation 23 does not provide supporting data and refers to IPC reports in general. Citations 30 and 31 are standard operating procedures for the Egyptian Red Crescent (ERC) from October and November 2023, which REQUIRE an 18% higher palletization weight of 750 kg. However, even this value is considerably lower than UN aid REQUIREMENTS that specify pallet weights for wheat flour (1125-1200 kg/pallet), sugar (1200 kg/pallet), chickpeas 1200 kg/pallet), red lentils (1200 kg/pallet), rice (1200 kg/pallet), SF oil (910-1213 kg/pallet) or milk (655 kg/pallet) (UNRWA Special Shipping Instructions for Shipments by Sea Air and Land – April 2024 - page 6; https://unrwa.org/sites/default/files/emergency_gaza_2023-_rfq-pskh-42-24-the_provision_of_man_trucks_for_gfo-tender_doc.pdf) "https://unrwa.org/sites/default/files/emergency_gaza_2023-_rfq-pskh-42-24-the_provision_of_man_trucks_for_gfo-tender_doc.pdf)") . Examination of “dataset 20240911_Commodities Received.xlsx” reveals that consignments attributed to ERC alone or with other agencies (including UNRWA) account for only 90,009 of the total of 531,175 food line items (17%) and 8085 of the total of 22,833 mixed line items (35%). Therefore, even if the mean value of 637.5 kg/pallet were correct for the ERC-associated consignments, the weights assigned to the foods supplied are unreasonably low, giving an extreme underestimation of the calories supplied.

      This unreasonably low distribution of the estimated Kcal per truck can be seen in the simulated truck weights. The histogram in Appendix figure A1 shows a distribution that is heavily skewed to the left with the vast majority of trucks carrying less than 50 Million Kcal and perhaps a third carrying less than 25 Million Kcal. The simulated lower end of the distribution, which begins with 600 trucks carrying zero Kcal/truck, is highly unlikely to be accurate. Even if one takes the assumed mean weight per truck assigned by the researchers as 14,500 kg, multiplying by the calorie content of wheat flour (3,640 Kcal/kg) would give a mean calorie content per truck of 52.8 M Kcal. Even if a lower calorie food calorie density of circa 3200 Kcal/kg were used, based on visual inspection of Figure 3A (Kcal/kg food consignments between Oct 21 2023 – May 4 2024), the assumed mean caloric content of the food trucks should be 46.4 Million Kcal. These values, are hard to reconcile with the histogram, even if the assumed and simulated truck weights in the model are true. Thus, the validity of the model assumptions and their potential for propagating error and uncertainty in the results should be carefully revisited.

      Data on other food sources

      Estimates of the available existing food supply before the war combine the household stocks of humanitarian food aid, data provided to the researchers by UNRWA giving the exact stocks in UNRWA warehouses and the range of minimum-maximum capacity of WFP warehouses before the war; estimates of existing private stores, and of agriculture and livestock production, discounted for gradual depletion and destruction during the war’s early months. The model does not account for potential Hamas stockpiles ( https://www.nytimes.com/2023/10/27/world/middleeast/palestine-gazans-hamas-food.html) "https://www.nytimes.com/2023/10/27/world/middleeast/palestine-gazans-hamas-food.html)") .

      The spreadsheet “gaza_food_data.xlsx” tab “warehouses” lists total UNRWA and WFP warehouse capacity before the war as a range with a minimum to maximum capacity of 7,900-21,479 MT or 28.7 – 78.1 billion Kcal, whereas presumably, the “exact” contents of the food in UNRWA warehouses are those data listing a total of 38.3 billion Kcal of food in tab “unrwa_stocks”. No further information is provided to ascertain that the data given to the researchers by UNRWA and WFP is complete and accurate.

      Existing private stores/Caloric balance and consumption: The text describes the assumptions used in estimating the existing stores and their depletion during the war. The text defines model parameters (eg. I0, I0,m, etc.) but does not spell out the full model equation. Doing so would help the readers better understand the explicit logic of the simulation. <br /> The model discounts agriculture and livestock production using estimates of the rate and extent of damage to agricultural infrastructure citing UNOSTAT remote sensing data published by FAO (references 11, 40-42). The validity of estimates derived from image analysis depends heavily on the control conditions selected for a reference and on the quality of validation and calibration in the field. The percent damage arrived at by automated image analysis algorithms, depends on the selected reference conditions, whose rationale and validity are not given. Field validation is impossible in a war zone which is why the cited reports carry important disclaimers such as: ”This assessment has been conducted based on available satellite imagery, ancillary data and remote sensing analysis for the period 7 October - 31 December 2023 without field validation. Land cover data from 2021 was used as baseline data due to limited availability for data collection in the area of interest and time constraints related to the nature of the report.“ ( https://openknowledge.fao.org/server/api/core/bitstreams/f2ad2f59-0c29-472e-978b-54cef347c642/content) "https://openknowledge.fao.org/server/api/core/bitstreams/f2ad2f59-0c29-472e-978b-54cef347c642/content)") . The limitations of these estimates used in the model should be acknowledged.

      Estimating Baseline and Recommended per-capita caloric intake

      The per-capita caloric intake for emergency-affected populations is given by the WHO guide and is stratified by age and sex. Given the age and sex distribution of the population of Gaza (gaza_food_data.xlsx, tab prop_age_sex), the mean daily per capita calorie requirement for the population is 2,065 Kcal/person-day. This threshold shown in yellow in Figure 4, is the appropriate criterion for evaluating the adequacy of the food supplied by the humanitarian food cluster. <br /> However, the researchers go beyond this consensus humanitarian requirement, and derive a much higher Gaza-specific estimate “I0“ for the population intake at baseline. The baseline value of I0 appears to be just under 2,800 Kcal per person-day according to figure 4 (blue line value on October 7th, 2023). The paper does not give the baseline value “I0“ explicitly. However, it is nearly identical to the weighted average caloric intake (2,837 Kcal/person-day) observed in a population of obese older Gazan adults (mean age 57, weighted mean BMI 31.4) with a high prevalence of noncommunicable diseases, in a survey conducted during the COVID pandemic between March and July 2020, which was used to impute the daily intake of the overall population. The weighted intake and BMI may be calculated based on the data provided in the gaza_food_data.xlsx spreadsheet, tab prop_age_sex. The estimated pre-war intake, is roughly 33% higher than the humanitarian requirement, or “recommended daily intake”. The model derives the weekly available per person food supply, by subtracting this pre-war intake estimate, from the estimated weekly available daily per-capita food supply (from the sum of private stores and warehouses, agriculture and delivered food-aid, discounted for reported consumption and damage). The model makes the questionable assumption that the emergency-affected population would continue to consume the same amount of food that it did during the war, as it did before the war. Even before examining the validity of the method used to derive “I0“, this assumption forces the model to deplete the available food supply significantly more rapidly (about 33% sooner) than if the recommended humanitarian food requirement were used to simulate the adequacy of the available food supply.

      The logic behind the method of imputation to the whole population is not clearly explained (“we sampled random values from each age-sex stratum distribution…” Appendix A, Figure A2). <br /> Supplementary figure A2, entitled “Baseline adult caloric intake” shows simulated untransformed and log-transformed, age and sex specific distributions of energy intake, from Abu Hamad et al., J Hum Hypertens 2023. That reference describes a health survey conducted in Gaza between March and July 2020 among adults aged 40 and older, and using the semi-quantitative Food Frequency Questionnaire for Palestinian Populations which was developed by Hamdan et al., in a population of Palestinian women in Hebron, and published in Public Health Nutrition 17(11) in 2013. While such survey tools may be useful for epidemiological studies, they are intended to classify populations into categories of relative nutritional intake, rather than for deriving valid absolute individual nutrient intakes. In the case of the specific instrument used, Hamdan et al. write that studies like theirs “can be considered a calibration and correlation rather than a validation procedure”. The correlation that they obtained in that study between three repeat 24 hr food recall questionnaires and the semi quantitative FFQ was 0.601 and was not statistically significant (in other words the FFQ gives a similar but poorly concordant result to the reference standard). Moreover, it is doubtful, if the high average food intake of an obese, older and unhealthy population (which was obtained during a health crisis that increased sedentary behavior due to social distancing and isolation), provides a sound basis for imputing routine intakes for a population that is predominantly younger (82% of Gaza’s population are below age 40 – see gaza_food_data.xlsx, tab prop_age_sex), healthier, and not affected by a pandemic. It would be helpful if the researchers clarified these limitations and presented the age-and sex stratified per-person daily caloric derived intake and compared it with the consensus humanitarian requirements.

    1. On 2025-03-17 22:11:06, user Dr.PayamVaraee wrote:

      Critical Review: "The Threat of Populism to Science and Global Public Health: Lessons from Iran"<br /> A. Critique of Content and Main Claims<br /> 1. Claim: Populist Science Increased Mortality in Iran<br /> The article asserts that populist policies delayed vaccination efforts in Iran, leading to excess mortality. However, data comparisons with countries such as the US, UK, and Germany reveal similar trends, challenging the uniqueness of Iran’s case.

      Issues:<br /> Overlooking Key Variables: The analysis does not account for factors such as economic sanctions, healthcare infrastructure, and demographic differences.<br /> Post-Vaccination Mortality Decline: The significant drop in mortality following mass vaccination aligns with global patterns, suggesting that other factors played a role beyond populist decision-making.<br /> Flawed Comparisons: The article contrasts Iran with Bahrain and the UAE, despite major differences in population size, vaccine availability, and healthcare systems.<br /> 2. Claim: Iranian Data on COVID-19 Mortality is Unreliable<br /> The article utilizes the Prophet model to argue that Iranian mortality statistics were manipulated.

      Issues:<br /> Limitations of the Prophet Model: Originally designed for economic and social trend forecasting, this model is not optimized for analyzing health crises.<br /> Weak Evidence for Data Manipulation: The assumption that discrepancies between projections and reported data indicate fraud is flawed. Factors such as improved treatment strategies and emerging herd immunity are not considered.<br /> Selective Application: The same predictive model is not used to assess data accuracy in other countries, raising concerns about bias.<br /> B. Critique of Data Analysis Methods<br /> 1. Misuse of ANOVA<br /> The article employs a one-way ANOVA to compare vaccination delays across countries. However, this method does not sufficiently account for assumptions of normality and homogeneity of variance, potentially leading to misleading conclusions.

      Better Alternatives:<br /> Time-Series Models (ARIMA, VAR): These would provide a more accurate assessment of trends over time.<br /> Multivariate Regression: This method would allow for the inclusion of additional variables influencing vaccination delays and mortality rates.<br /> 2. Absence of Confounding Variable Control<br /> The article does not adjust for important factors such as:

      The proportion of elderly populations.<br /> Hospitalization rates and healthcare capacity.<br /> Lockdown policies and mobility restrictions.<br /> Neglecting these variables weakens the argument that Iran’s excess mortality was driven primarily by populist policies.

      C. Logical and Argumentative Flaws<br /> 1. Selective Data Use<br /> The article emphasizes evidence that supports its argument while disregarding counterexamples—such as similar mortality patterns in Western countries—leading to confirmation bias.

      1. Correlation vs. Causation Fallacy<br /> It assumes a direct causal link between delayed vaccinations and excess mortality without considering other influencing factors, such as economic restrictions, healthcare efficiency, and prior infection rates.

      2. Oversimplification of a Complex Issue<br /> By attributing Iran’s COVID-19 response largely to populism, the article overlooks the fact that mortality spikes occurred in Germany, the US, and other non-populist-led countries. A more nuanced analysis is needed.

      D. Broader Issues with the Scope of the Article<br /> 1. Disproportionate Focus on Iran<br /> If populist science is a global issue, why is Iran the only case study? A comparative approach—including countries like the US, Brazil, and Poland—would strengthen the argument.

      1. Lack of Practical Solutions<br /> The article critiques Iran’s handling of the pandemic but does not propose strategies to combat misinformation and improve public health responses globally.

      2. Limited and Selective Data Sources<br /> The article relies heavily on The Economist and WHO while neglecting independent organizations such as the CDC and regional research institutions. A broader range of data sources would improve credibility.

      E. Additional Criticism of the Core Argument<br /> 1. Populism Beyond Iran<br /> Research, including the PANCOPOP study, shows that right-wing populism influenced pandemic responses in the US, Brazil, Poland, and Serbia. The article’s exclusive focus on Iran suggests political bias rather than an objective analysis of populism in global public health.

      1. Contradictions in the Populism Model<br /> The article argues that Iran exhibited both the denialist model (seen in the US and Brazil) and the authoritarian control model (similar to Poland and Serbia). These models, however, are distinct and mutually exclusive in the PANCOPOP framework, making this assertion contradictory.

      2. Absence of Comparative Analysis<br /> The study lacks a global perspective on how different forms of populism shaped pandemic policies, weakening its claim that Iran’s case is uniquely alarming.

      3. Misattribution of Vaccination Delays Solely to Populism<br /> The article ignores other major contributing factors, such as:

      Economic Sanctions: Restrictions on vaccine imports.<br /> Vaccine Hesitancy: Public resistance to certain vaccines.<br /> Domestic Vaccine Development: Initial reliance on homegrown vaccines before shifting to imports.<br /> By overlooking these aspects, the article oversimplifies the reasons behind Iran’s vaccination timeline.

      1. Failure to Address Global Media Influence<br /> Studies have demonstrated that misinformation on COVID-19 spread across multiple countries, yet the article singles out Iran without discussing similar issues in other regions.

      2. Statistical Flaws<br /> The ANOVA and Prophet model are misapplied, limiting the validity of conclusions.<br /> A lack of multivariate regression fails to control for external factors influencing pandemic outcomes.

      3. Contradictory Use of Data<br /> The article doubts the credibility of Iran’s official statistics yet uses those same statistics to support its claims. This undermines its argument and suggests inconsistent reasoning.

      Conclusion<br /> The article presents a flawed and unbalanced analysis of how populism influenced Iran’s COVID-19 response.

      Key Weaknesses:<br /> Selective use of data that aligns with the author's argument while ignoring broader trends.<br /> Lack of comparative analysis, failing to place Iran’s case within a global context.<br /> Misuse of statistical methods, leading to questionable conclusions.<br /> Recommendations for a Stronger Study:<br /> A multi-country analysis incorporating nations with varying political ideologies.<br /> Consideration of alternative explanations for mortality trends, such as healthcare infrastructure and economic factors.<br /> A transparent and methodologically sound approach to data interpretation.<br /> A truly robust and objective study would examine multiple countries, account for confounding variables, and avoid overgeneralizing populism’s impact on public health outcomes.

    1. On 2025-10-20 15:14:25, user xPeer wrote:

      Courtesy Peer Review Simulation from xPeerd :

      Summary<br /> This manuscript examines the experiences of mothers of autistic children within UK child-protection services, with a particular focus on the prevalence and nature of social services' involvement and allegations of Fabricated or Induced Illness (FII). Using a survey of 242 mothers (diagnosed autistic, self-identified autistic, and non-autistic), the authors investigate whether mothers with autism face greater scrutiny or risk of having their children removed compared to others. The findings suggest high levels of investigation for all groups, but no significant differences between groups. However, a markedly elevated rate of FII allegations is identified among mothers of autistic children compared to general epidemiological estimates. The methodology integrates participatory approaches but is limited by its sample scope, lack of a typically developing comparison group, and exploratory design.

      Potential Major Revisions

      1. Methodological Scope and Representativeness
      2. The sample lacks a control group of mothers with typically developing children: “we did not actively recruit mothers of typically developing children due to practical considerations…” (p. 18, Limitations). This hinders interpretation of whether findings are unique to mothers of autistic children or represent broader social service dynamics.
      3. The design is exploratory, and as stated: “the questions in the survey were exploratory and therefore we did not enquire in detail about social service involvement” (p. 18, Limitations). More granular data (e.g., timelines, outcomes, types of interventions) would strengthen the work’s empirical claims.

      4. Statistical Analysis and Power

      5. Some subgroup analyses are based on small subsamples (e.g., N = 21 for autistic mothers called into a meeting), reducing statistical power (Table 3, p. 13).
      6. The manuscript acknowledges no statistically significant differences between diagnostic groups in key outcomes such as child protection registration or FII allegations (e.g., “no statistical difference emerged”; p. 14), suggesting caution is required in interpreting implications for policy or discrimination.

      7. Interpretive Overreach

      8. The discussion interprets elevated rates of investigation as evidence of systemic discrimination, but alternative explanations (e.g., increased service contact for autistic children, reporting biases) are not fully interrogated: “our results suggest a significant increase in inquiries and registrations...compared to the general population” (p. 17).
      9. The text could benefit from a more critical posture towards causal inference.

      10. Ethical and Legal Framing

      11. The work alludes to ethical and human rights implications but does not provide a detailed ethical analysis or legal context, which are crucial for claims concerning state intervention and discrimination (see discussion, pp. 17–19).

      Potential Minor Revisions

      • Typographical and Grammatical Errors
      • Occasional word repetition and typographic slips (e.g., “we are separately reportingly the results here...”; p. 8).
      • Consistent usage of terms (e.g., sometimes “non-autistic”, sometimes “nonautistic”).
      • Formatting Issues
      • The document is interspersed with license and preprint notices that disrupt readability.
      • Table captions and labels (e.g., Table 3, Table 4) lack uniform placement and can be confusing in the PDF layout.
      • Section headers could be standardized for clarity.
      • References
      • All references are recent and field-appropriate. No missing citations identified. All URLs and DOIs appear correct.

      • AI Content Analysis

      • The writing style, structure, and nuanced argumentation are consistent with human-authored academic research. Estimated AI-generated content: <5%. No sections strongly flagged as AI-generated; narrative voice and academic conventions are maintained throughout. No epistemic inconsistencies or abrupt shifts in style detected.

      Recommendations

      1. Include a Wider Comparison Group
      2. For greater generalizability, future iterations should incorporate mothers of typically developing children. This would clarify whether the experiences described are unique to mothers of autistic children.
      3. Deepen the Methodological Rigor
      4. Enrich the survey to collect more detailed information on the nature, duration, and outcome of social services’ engagement.
      5. Where possible, triangulate self-report data with administrative records or interviews with professionals (subject to ethical approval).
      6. Clarify Causal Inferences
      7. Approach claims about systemic discrimination with caution—consider and analytically address alternative explanations or confounds.
      8. Expand the Legal and Ethical Analysis
      9. A more thorough excursus on UK legal standards and the ethical principles governing child-protection interventions would enhance the policy relevance of the manuscript.
      10. Address Subsample Limitations
      11. Explicitly acknowledge and discuss the implications of small subsample sizes for statistical inference throughout the results sections.
      12. Improve Readability and Consistency
      13. Edit for grammar, typographic errors, and ensure formatting consistency between tables, figures, and narrative text.
    1. On 2020-05-19 04:00:43, user Sinai Immunol Review Project wrote:

      Main Findings:<br /> During the unprecedented COVID-19 pandemic, identifying patients at high risk for mortality is critical so as to guide clinical decisions on early intervention and patient care. To identify factors associated with risk of death from COVID-19, the study developed a secure and pseudonymized analytics platform, OpenSAFELY, that links the UK National Health Service (NHS) patient electronic health records (EHR) with COVID-19 in-hospital death notifications. This platform enabled the rapid analysis of by far the largest cohort to date from any country, comprising 17,425,445 multi-ethnic adults and 5,683 COVID-19 deaths. The analyses were based on hazard ratio generated by cox-regression and were adjusted for demographics and co-morbidities.<br /> Increased risk of COVID-19 hospital death was associated with male gender, older age, certain clinical conditions (uncontrolled diabetes, severe asthma, other respiratory diseases, history of haematological malignancy or recent non-haematological cancer, obesity, cardiovascular disease, kidney, liver, neurological diseases, autoimmune conditions, organ transplant and splenectomy). Notably, the association of asthma with higher risk of COVID-19 related death is contradictory to previous findings of no increased risk of death or even protective association. This effect was even stronger with recent use of oral corticosteroids (i.e. more severe asthma). In addition, people of lower socio-economic background (i.e. deprivation) or black and Asian origin were identified at high risk. However, this association could not be entirely attributed to pre-existing health conditions or other risk factors, which warrants further exploration into drivers of these associations. The open source analytics code is available at OpenSAFELY.org.

      Limitations:<br /> 1) There are few drawbacks in data source and collection. The study did not account for COVID-19 deaths in false-negative/ untested individuals, relied on EHR from specific software and dealt with incomplete EHR information. <br /> 2) Additional discussion regarding the reasons behind the associations would be insightful. Specifically, recent studies have shown that risk factors including asthma, hypertension, and diabetes impact the expression of ACE2 gene, which is the entry receptor for SARS-CoV-2. However, while asthma with type 2 inflammation has been associated with lower ACE2 expression and thereby potentially protective effect, this association has not been observed for nonatopic asthma in these studies. In the current study, asthma is only categorized in terms of severity (recent oral corticosteroid use vs not). Further categorization in terms of subtype would have been helpful.<br /> 3) Understanding the underlying causes of high risk in people of black and Asian origin is important for public health and mitigation of the spread. In this study, the most common assumptions of high burden of underlying comorbidities and lower socio-economic status are shown to contribute only partly to the risk. However, other factors, such as occupational exposure, neighborhood and household-density and possible influence of genetic or other biological factors still need to be explored. <br /> 4) The study suggests increased risk in former smokers and slight protective effect in current smokers. More in-depth analyses into whether the protective effect of current smoker status is an artifact of over-adjustment, selection protocol of healthy controls or a true correlation are needed. <br /> 5) It would be helpful to have the p-value along with the reported hazards ratio and 95% confidence intervals.

      Significance: <br /> Overall, the OpenSAFELY platform allows secure and real-time analyses of clinical data stored in situ. As this global pandemic progresses, outcomes and data are expected to expand, revealing more insights to the effects of medical treatments and less common risk factors on COVID-19 infection, spread and death. This approach can help better identify additional factors that affect disease severity and immune response. Finally, this rapid and massive study was only possible because of the detailed longitudinal data already available through General Practitioners within the UK National Health Service (NHS), replication of which at a similar massive scale would be daunting within the highly fragmented healthcare system of the USA. While even within UK NHS many data integration issues remain, the findings from this study is a testament to the global model we need to follow to increase our power to rapidly answer crucial questions related to COVID-19 epidemiology. Such an approach will also open new avenues for increased understanding of other diseases.

      Reviewed by Myvizhi Esai Selvan as part of a project by students, postdocs and faculty at the Immunology Institute of the Icahn School of Medicine, Mount Sinai

    1. On 2020-06-19 03:12:29, user AMM wrote:

      1. I would like to know if serological tests were performed on the 118 healthcare workers who did have COVID, and what that data shows.

      2. The methods state that 2 separate tests for IgG/M were used: MCLIA and colloidal gold. Were both used on all subjects, did some subjects get just one or the other? This is important because they have different sensitivities and specificities.

      3. The paper states that the LAST TEST RESULT for each person was used (for RT-PCR and IgG/M). For the hospitalized COVID patients, did they ever have a positive test? If so, when?

      4. You assume that all COVID+ patients must have developed the antibodies, but 10% lost them. You base this from a paper that said 100% of patients developed IgG by day 17-19. However, the population tested in that paper has much milder cases than in your population. It would not be wise to assume that every patient develops antibodies from a limited population study.

      5. It could easily be that the 10% of hospitalized COVID patients who tested negative for IgG/M just represent the false negatives. The colloidal gold test you used only has a sensitivity of 88.6%. So a 10% false negative would be expected from that test.

      6. You cited another research article that tested for IgG in 1021 people returning to work. This was also in Wuhan. They found 10% of their population tested positive for antibodies, while your general population group showed 4.6% positive. I believe this is more reason to perform your own sensitivity tests of the kit you used.

      7. One observation not mentioned about your data is that it shows 4% of healthcare workers (who had much more exposure to COVID) tested positive for IgG, compared to 4.6% of non-healthcare workers. One would expect their numbers to be higher, not lower, than non-healthcare workers.

      I do not believe there is sufficient scientific evidence here to support the claim that “after SARS-CoV-2 infection, people are unlikely to produce long-lasting protective antibodies against this virus.”

    1. On 2020-06-19 07:54:01, user Dr. Sebastian Boegel wrote:

      Thank you very much for this huge community effort and the very nice results. Congrats to the team. This is a very important study and in analogy to what have been proposed in cancer a while ago: https://www.ncbi.nlm.nih.go...

      I have a couple of questions:<br /> 1.) I am not sure, if I understand that right: the clusters are derived from patients with immunemodulating treatment, such as glucocortocoids, MMF, etc.. In order to make sure that the defined clusters reflect the underlying disease and not the medication, you applied the same model to newly diagnosed patients, of which only a minority received prior treatment. And what you find is roughly the same proportions of diseases in each module. Is that right? If not, my question is: could you describe clearer why you think that these groups reflect the disease itself and not the treatment.

      2.) If 1.) is correct, than i am wondering, that untreated and treated patients cluster in the same way as I would except that immunemodulating treatment affects gene expression of many, esp. immune related genes, systemically, such that the blood transcriptome ist totally different. How do you explain that?

      3.) In the last sentence of the discussion, you wrote that this study will be usefule for a personalized medicine. From a clinical point of view, can you describe how this will help (maybe some examples, what does that study mean for a clinician? and for a diagnostics company?)

      4.) This is a multi center study. How did you normalize the sequencing data, such that the data doesnt cluster according to site? Did you check that? See also TCGA or GTEX.

      5.) How and when is it possible to access the raw data? Will RNA-Seq fastqs also shared? And are clinical information for each patient available?

      Thanks again for this very informative and well structured study. I acknowledge the hard work. This will be )once published peer reviewed) a seminal study in this field.

      Sebastian

    1. On 2020-04-05 18:23:54, user Sinai Immunol Review Project wrote:

      Summary: Authors evaluate clinical correlates of 10 patients (6 male and 4 female) hospitalized for severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). All patients required oxygen support and received broad spectrum antibiotics and 6 patients received anti-viral drugs. Additionally, 40% of patients were co-infected with influenza A. All 10 patients developed lymphopenia, two of which developed progressive lymphopenia (PLD) and died. Peripheral blood (PB) lymphocytes were analyzed – low CD4 and CD8 counts were noted in most patients, though CD4:CD8 ratio remained normal.

      Critical analysis: The authors evaluated a small cohort of severe SARS-CoV-2 cases and found an association between T cell lymphopenia and adverse outcomes. However, this is an extremely small and diverse cohort (40% of patients were co-infected with influenza A). These findings need to be validated in a larger cohort. Additionally, the value of this data would be greatly increased by adding individual data points for each patient as well as by adding error bars to each of the figures.

      Significance: This study provides a collection of clinical data and tracks evolution of T lymphocyte in 10 patients hospitalized for SARS-CoV-2, of which 4 patients were co-infected with influenza A.

      Review by Katherine E. Lindblad as part of a project of students, postdocs and faculty at the Immunology Institute of the Icahn School of Medicine at Mount Sinai.

    1. On 2020-04-10 17:40:13, user Sinai Immunol Review Project wrote:

      Key findings<br /> The authors investigated the use of a commercially available form of heparin, low molecular weight heparin (LMWH), as a therapeutic drug for patients with COVID-19. Previous studies showed that in addition to its anticoagulant properties, LMWH exerts anti-inflammatory effects by reducing the release of IL-6 and counteracting IL-6.

      This was a retrospective single-center study conducted in Wuhan, China. Forty-two (42) hospitalized patients with coronavirus pneumonia were included, of which 21 underwent LMWH treatment (heparin group) and 21 did not (control). The general characteristics of the two groups of patients were statistically comparable. Both control and LMWH had the same hospitalization time and there were no critical cases in either group.

      This study found that treatment with LMWH significantly reduced IL-6 levels in patients in the heparin group compared to the control group. However, LMWH treatment did not have an effect on the levels of other inflammatory factors: CRP, IL-2, IL-4, IL-10, TNF-?, and IFN-?. Compared with the control group, patients in the heparin group had a significantly increased percentage of lymphocytes after treatment, further suggesting that LMWH treatment has anti-inflammatory effects and can reduce the lymphopenia associated with COVID-19.

      Consistent with other studies in COVID-19 patients, they found that LMWH treatment can improve hypercoagulability. D-dimer and FDP levels (biomarkers of coagulation) in the heparin group significantly decreased from baseline after treatment, whereas there was no significant change in levels for the control group. Of note however, patients in the heparin group had a significantly higher level of D-dimer and FDP at baseline compared to the control group.

      Importance<br /> Many studies have shown that severely ill COVID-19 patients have significantly higher levels of IL-6 compared to patients with mild cases and it has been proposed that progression to severe disease may be caused by lymphopenia and cytokine storms. The anti-inflammatory effects of heparin may help prevent or reverse a cytokine storm caused by the virus and thus delay COVID-19 progression and improve overall condition in patients. The pleiotropic effects of heparin may have a greater therapeutic effect than compounds that are directed against a single target, such as an anti-IL-6 therapy. This is because COVID-19 patients commonly have complications such as coagulopathy and endothelial dysfunction leading to cardiac pathology that may be mitigated by heparin treatment (Li J, et.al; Wojnicz et.al).

      Limitations<br /> This study is limited by a small sample size (n=44) and a single-center design. Double-blinded, randomized, placebo controlled clinical trials of LMWH treatment are needed to understand the possible benefit of the treatment. Additionally, this study was unable to control for the dose and days of treatment of LMWH. Identifying the correct dose and timing of LMWH is a matter of immediate interest. Of note, patients in the heparin group received two types of LMWH, enoxaparin sodium or nadroparin calcium, which have been reported to have differing anticoagulant activity. The use of different LMWHs in the heparin group warrants further explanation.

      Another caveat of this study is that the levels of D-dimer and fibrinogen degradation products were significantly higher at baseline for patients in the heparin group compared to those in the control group. Therefore, it is difficult to decipher whether some of the positive effects of heparin treatment were due to its anti-coagulation effects or direct anti-inflammatory effects. Future studies are that delineate the anti-inflammatory functions of heparin independently of its anticoagulant properties in cases of COVID-19 would be useful.

      Lastly, this study did not discuss any side-effects of heparin, such as the risk of bleeding. Moreover, coagulation can help to compartmentalize pathogens and reduce their invasion, therefore anticoagulant treatments like heparin may have risks and it remains to be determined which patients would benefit from this therapy.

      References:<br /> Li J, Li Y, Yang B, Wang H, Li L. Low-molecular-weight heparin treatment for acute lung injury/acute respiratory distress syndrome: a meta-analysis of randomized controlled trials. Int J Clin Exp Med 2018;11(2):414-422

      Wojnicz R, Nowak J, Szygula-Jurkiewicz B, Wilczek K, Lekston A, Trzeciak P, et al. Adjunctive therapy with low-molecular-weight heparin in patients with chronic heart failure secondary to dilated cardiomyopathy: oneyear follow-up results of the randomized trial. Am Heart J. 2006;152(4):713.e1-7

      Review by Jamie Redes as part of a project by students, postdocs and faculty at the Immunology Institute of the Icahn School of Medicine, Mount Sinai.

    1. On 2020-08-10 11:48:29, user memini wrote:

      The assumption of constant case ascertainment (Line 474) in these countries from March through June is a critical one. It is probably most accurate in Portugal where health systems were least overwhelmed and which has the most testing per covid-19 death. Portugal's dashboard reports <100 tests processed per day to >10,000 tests per day over this period, so it is unlikely that the assumption of constant case ascertainment is sound in Portugal or in the other three countries.

      The authors could show whether similar parameters fit a more realistic time series of infections (estimated from deaths). Alternatively, simulated infection trajectories could be filtered by a reasonable estimate of how case ascertainment varied over time before estimating parameters.

      Also, Fig S1 shows gamma distributions with CV=0.5, 1, 2; but best fits give CV = 2, 3 or 4 for these countries (Table S1). The relative susceptibility of the median person for those distributions is 0.17, .01, and 0.0001. Given the narrow confidence intervals in Table S1, authors could hypothesize how susceptibility in Portugal could be so much more variable than England, for instance.

    1. On 2021-02-06 06:40:50, user David Epperly wrote:

      Here's something that addresses Pfizer and Moderna and I agree that the 2nd dose is important. "While durability is improved with a 2 or more dose regimen, dose timing is subject to optimization."<br /> Evidence For COVID-19 Vaccine Deferred Dose 2 Boost Timing<br /> 1. Good efficacy of dose 1<br /> 2. Greater than 3 month durability of dose 1<br /> 3. Double vaccinated population<br /> 4. Dramatically reduce hospitalizations<br /> 5. Save ~ 90K US lives in 2021<br /> https://doi.org/10.2139/ssr...

    1. On 2021-03-02 18:20:29, user Martin Hepp wrote:

      Ok, this is only a preprint. However, a wording like "provides a precise estimate of the true underlying SARS-CoV-2 transmission risk in schools and day-care centres." in the introduction sets all alarm bells of any scientist ringing. "precise" and "true" are bold words, rarely used in serious academic publications (where typically a prominent "threats to validity" section would highlight and discuss the limitations of the findings) - in particular, if the underlying method is relatively weak. Some limitations are discussed on pp.12 and 13, but in a rather superficial way.

      Just a few major questions that challenge the overall contribution:

      1. During the major part of the period of the analysis, the incidence was very low, in particular among young people. See https://corona-data.eu/medi... for a heatmap. Of the total duration of the study of ca. 17 weeks, only the last 5 - 6 weeks and thus less a mere 30 % had a significant incidence in the age-groups 0-4, 5-9, and 10-14, and it was lower than in the general population.

      2. As children are less likely to be symptomatic and the testing regime has a strong bias towards symptomatic patients, it is a valid assumption that the share of undetected infections is higher among students and children than in the general population. As the authors' entire analysis and model for transmission is based on test-confirmed public health cases, the authors should have tested this hypothesis, e.g. by random PCR tests in areas and during periods with a sufficient community incidence. If you miss asymptomatic cases, you are not only invalidating your aggregate statistics, but of course also the entire graph of infections becomes incomplete and questionable.

      3. On pp. 6 an 7, the authors cite the official definitions for cases and procedures; however, there is no information whether the theoretical guidelines for contact tracing, testing, non-pharmaceutical interventions like social distancing, masks, ventilation etc. were actually followed, and if the compliance remained stable over the course of the analysis and representative for the different groups. For instance, one could hypothesize that the effect of wearing mask in classrooms after November 20 is partially obscured by a reduction in ventilation due to cool weather and in general more time spent indoors. Taking the textbook definition of a characteristic of an observation and then assuming it to match the data is a significant threat to validity.

      4. The same holds for the approach of instructing the DPHAs on how to use the questionnaire but not testing the quality of the results statistically or by cross-validation. How do you know that the DPHAs understood and applied your instructions properly? And even if they did, how do you know that the data they were using was correct? it is not a lot of effort to rule out or estimate the margin of error of a potential weakness.

      5. The entire statistical analysis method is only a bit over half a page of largely spaced text (p. 8).

      6. The claim that children are less likely to produce a sufficient viral load to infect others is highly disputed in the literature, see e.g. https://zoonosen.charite.de... these findings are not uniformly agreed (see e.g. https://www.sciencemediacen... "https://www.sciencemediacentre.org/expert-reaction-to-a-preprint-looking-at-the-amount-of-virus-from-those-with-covid-19-in-different-age-groups/)"), but it is not commonly accepted that children are unlikely to infect others. This challenges the assumption that asymptomatic individuals are unlikely to infect others even if they are themselves infected.

      7. The authors state on p.12 that the rate of asymptomatic infections was relatively low with ca. 17%. Unfortunately, this population aggregate used by the authors obscures the influence of age on the likelihood of asymptomatic infections and hence on the number of undetected infections in school settings. A recent meta-study https://www.frontiersin.org... suggests that the rate is higher in children (p=0.5, CI 0.21 - 0.79) than in adults (p=0.3, CI 0.13 - 0.56). There is a lot of variance observed in the underlying studies, but the order of magnitude could explain a major share of the reported higher likelihood of infections originating from teachers than from students alone.

      8. The focus on "hygiene practices" (p.13) as a recommendation conflicts with the widely accepted view that SARS-CoV-2 transmission is largely airborne and that sustained social contact in indoor environments is a high-risk setting, even with masks.

      9. If the risk of students in school infecting teachers is so low, one should immediately stop the priority vaccination of teachers. I think the priority vaccination is justified.

      For lay people: If children are less likely to show symptoms than adults, and testing and hence becoming an index case is more likely for symptomatic individuals, it will be no surprise that teachers, who are adults, are more often identified as index cases than children. If the data graph of humans interacting in the pandemic is incomplete, and there is a systematic bias that leads to more missing index patients being children, your findings can easily be a simple artifact resulting from the chosen approach.

      Now, all science is tentative; we all know our papers could be improved, the evidence or data be more convincing, additional aspects be considered. The problem arises when this is combined with politics. The introduction (p. 5, 2nd paragraph) is heavily focussed on a positive view on re-opening school. The arguments raised are not wrong per se, but they are also not balanced - in a pandemic with a novel virus against which the majority of the human population seems to be immunologically naïve, other societal risks should be given the same space. If you motivate your research with the wish to reopen schools, readers have reason to assume that you are not neutral as to the outcome.

      This is all common in the daily struggle of anybody in research and academia.

      But when you combine such very preliminary work with substantial threats to validity with a bold claim in the intro and a conclusion in which you report with certainty that only 1 in 100 infected students will infect another person in school, knowing that there is a lot of heated debate in the society, then your "Ethical Statement" should be amended by "We knowingly accept that populist media like BILD, interest groups, and decision makers will use our fragile findings and our wording as solid evidence for a risk-prone opening strategy. Since we are so confident in our research, we take full responsibility for the societal consequences."

      Doing preliminary research is unavoidable. Distributing it in a form that is the perfect bait for media and decision makers is unethical.

      This

      https://www.bild.de/ratgebe...

      is the direct effect of your work.

      More than ca. 3 million daily visitors on bild.de (likely largely from the German population) have seen their variant of your message.

    1. On 2021-08-13 17:27:45, user Chuck Crane wrote:

      If you look at the questionnaire (the "supplementary materials" link) you find that the MD's and DVM's are "professional degree," and there is no "PhD" classification at all. It says "Doctorate," which includes Jill Biden's Ed.D. and so on. So the chart is deceptive.

      D8 What is the highest degree or level of school you have completed?<br /> 1. Less than high school<br /> 2. High school graduate or equivalent (GED)<br /> 3. Some college<br /> 4. 2 year degree<br /> 5. 4 year degree<br /> 6. Master’s degree<br /> 7. Professional degree (e.g. MD, JD, DVM)<br /> 8. Doctorate

      The paper is not in sync with the questionnaire, saying, e.g., "Those with professional degrees (e.g., JD, MBA) and PhDs were the only education groups without a decrease in hesitancy, and by May, those with PhDs had the highest hesitancy." I can't see how an MBA could look at the question and check "Professional degree" instead of "Master's Degree."

      Think the paper needs a good proofreading.

      Participation bias is a big issue. They asked a lot of people to participate, but only a small percentage did. The rather inane attempt to correct for this is to assume that if a particular class of respondents is under-represented, just assign responses from that class more weight, according to their proportion of the population ("post stratification adjustment").

    1. On 2021-10-11 18:40:32, user Andrew T Levin wrote:

      Comment #1: Research in Context

      1. Diamond Princess Cruise Ship. The manuscript makes no reference to any epidemiological analysis of this episode, which informed seminal assessments of the age-specific infection fatality rate (IFR) of COVID-19.[1-4] Nonetheless, that evidence is particularly relevant, because the cruise ship’s passengers included 1231 individuals ages 70+ who were not merely “community-dwelling” but healthy enough to embark on a multi-week grand tour of southeast Asia. Following extensive RT-PCR testing, 335 passengers ages 70+ were confirmed to have been infected with SARS-Cov-2, and 13 of those passengers died from COVID-19 – an IFR of about 4%. Moreover, the strong link to age is underscored by the even higher IFR of 8% for passengers ages 80+. Given the size of that sample (which meets the 1000+ threshold used here), this evidence should certainly be incorporated into this meta-analysis.

      2. Comprehensive Tracing Programs. The manuscript makes no reference to countries that succeeded in containing the first wave of the pandemic in spring 2020 through systematic tracing and testing of all contacts of infected individuals.[5] Such evidence is particularly relevant here, because the virus was contained within the “community-dwelling” populations of those locations and never spread to any elderly care facilities. For example, in the case of New Zealand, there were 256 infections and 19 deaths among adults ages 70+ -- an IFR of about 7%.

      3. Hospitalized Patients. The manuscript cites a single study (published in July 2020) that examined the association between comorbidities and mortality risk of COVID-19.[6] However, that study was not able to distinguish whether comorbidities were linked to greater prevalence (the probability of getting infected) or to a higher IFR (the risk of mortality conditional on infection). Unfortunately, the manuscript makes no reference to any subsequent studies on this issue. In particular, a large-scale study of U.K. BioBank participants found that measures of frailty were indeed associated with higher mortality rates in the overall panel but not linked to mortality within the subset of hospitalized COVID-19 patients.[7] In effect, the prevalence of COVID-19 was markedly higher among residents of U.K. nursing homes compared to individuals of similar age living in the community, but the IFR was not significantly different. Those findings directly contradict a key assertion made at the start of this manuscript.

      4. Prior Meta-Analysis of Community-Dwelling Populations. The introduction of this manuscript neglects to mention that an existing meta-analysis study (published in Nature in November 2020) was specifically focused on assessing IFRs excluding deaths in nursing homes.[8] That study estimated the link between age and IFR using seroprevalence and fatality data for adults less than 65 years old, and then showed that the model predictiions were consistent with data on fatalities among community-dwelling adults ages 65+. Moreover, that study used seroprevalence data adjusted for assay characteristics, and the results were obtained using a rigorous Bayesian statistical model that incorporated random variations in the time lags between infection, seropositivity, and fatal outcomes – a striking contrast to this manuscript, which uses rudimentary assumptions to address those issues.

      5. Other Meta-Analyses. The introduction of this manuscript briefly refers to two other meta-analysis studies of the link between age and IFR.[5, 9] However, the manuscript then asserts: “Importantly, the vast majority of seroprevalence studies include very few elderly people.” (p.5) That assertion is supported by a single citation to the SeroTracker database, which provides comprehensive coverage of all existing national, regional, and local seroprevalence studies across the globe.[10] However, this assertion is completely incorrect as a characterization of the preceding meta-analysis of age-specific IFRs. As indicated in Levin et al. (2020, figure 5), that meta-analysis study included seroprevalence data on older adults (including narrow brackets for ages 60-69, 65-74, 70-79, and 75-84 as well as open-ended brackets for ages 60+, 65+, 70+, 80+, and 85+) from nine national studies (Belgium, France, Hungary, Italy, Netherlands, Portugal, Spain, Sweden, and the U.K.) and eight regional locations (Ontario, Canada; Geneva, Switzerland; Connecticut, Indiana, Louisiana, Miami, Missouri, and San Francisco, USA).[5]

    1. On 2023-12-19 12:39:03, user Christos Proukakis wrote:

      Response to: “Is Gauchian genotyping of GBA1 variants reliable?”

      Marco Toffoli1,2, Anthony HV Schapira1,2, Fritz J Sedlazeck2,3,4, Christos Proukakis1,2 *

      1. Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, UK
      2. Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
      3. Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
      4. Department of Molecular and Human Genetics, Baylor College of Medicine, TX, USA

      * To whom correspondence should be addressed: c.proukakis@ucl.ac.uk

      We recently described two methods for GBA1 analysis, which is hampered by the adjacent highly homologous pseudogene: Gauchian, a novel algorithm for analysis of short-read WGS, and targeted long-read sequencing 1. Tayebi et al have applied the former to WGS from 95 individuals, and compared it to Sanger sequencing 2. They report concordant genotypes in 85, while 11 had discrepant calls (we note that this leads to a total of 96). In addition, they report 28 false Gauchian calls in 1000 Genomes Project (1kGP) samples. Gauchian was developed because the homology of the GBA region requires a short read variant caller that does not rely solely on read alignments, and can identify specific variants known to be pathogenic. To understand the cause of these discrepancies, we reviewed their data, and conclude that they are mis-interpreting Gauchian results in 8 of the 11 discrepant samples, and incorrectly using Gauchian to analyze low-coverage 1kGP samples.

      Among the 11 (11.5%) samples with inconsistent calls with Sanger (Table 1), four (Pat_08, Pat_26, Pat_28 and Pat_58) were not called as the variants are not on Gauchian’s target list, which includes all ClinVar variants in December 2021. These variants, and any others, can be easily added (see Supplementary Information). Three other samples (Pat_75, Pat_76 and Pat_79) had low data quality resulting in large variation in sequencing depth across the genome, as shown by the median absolute deviation (MAD) of genome coverage: 0.269, 0.128 and 0.127 (three highest values among all samples). Gauchian recommends trusting calls in samples with MAD values <0.11, and produces a warning message if this is exceeded. In all three samples, the GBA1+GBAP1 copy number was a no-call (marked as “None” in the output file), indicating that Gauchian could not determine the copy number due to high coverage variation. Variants were not called because no further analysis was done beyond copy number calling. These should not be viewed as false negatives, as the warning message and the report of no-calls should prompt the user to obtain higher quality data or consider alternative sequencing. Among the remaining 4 samples with inconsistent results: Pat_03 had a Gauchian call of heterozygosity for p.Asn409Ser, while Sanger reports this as homozygous. Review of the IGV trace (Tayebi et al. Supp Figure 1) shows that at least 10 reads (around a fifth of the total) have the reference base, and therefore it is hard to conclude this is homozygous. Review of the Sanger trace (not provided) could determine whether there is a low peak representing the reference allele. We cannot provide a conclusion, and additional analysis is recommended. Mosaicism could be a plausible explanation, and this has been reported in GBA1 3,4, albeit not at this position. Pat_47 had a false negative p.Leu483Pro call. Pat_16 was indeed wrongly genotyped as homozygous for p.Asn409Ser, related to the adjacent c.1263del+RecTL deletion. Pat_92 had all expected variants called, but the heterozygous p.Asp448His was mis-genotyped as homozygous. In summary, there is one false negative and two wrongly genotyped variants (heterozygous variants called homozygous). Gauchian’s precision is therefore 98.9% (175 out of 177 calls are correct). Its allele-level recall/sensitivity is 99.4% after excluding alleles not on Gauchian’s target list, and samples which could not be analyzed due to high coverage variation. Alternatively, it can be calculated as 97.2% if only samples with high coverage variation are excluded, 96.2% if only alleles not on the target list are excluded, and 94.1% if all these samples are considered .

      Tayebi et al. concluded that Gauchian is not able to call recombinant variants without providing orthogonal evidence. In Pat_95, Pat_71 and Pat_16, they examined alignments in IGV and reported absence of supporting reads for Gauchian calls, but all recombinant alleles called by Gauchian were consistent with Sanger. This highlights that read mapping in this region is unreliable (variant supporting reads may align to the pseudogene), making interpretation of alignments in IGV very challenging. Gauchian is designed to untangle ambiguous alignments, locally phase haplotypes and make correct calls. Particularly, in Pat_95, they claimed that Gauchian called the expected RecNciI variant but got the mechanism of the recombinant allele wrong (gene conversion vs. gene fusion). This claim appears to be based on incorrect interpretation of IGV alignments, i.e. seeing 3’ UTR mismatches associated with GBAP1 does not necessarily indicate gene fusion, as they can be misalignments, or even part of the gene conversion. The RecNciI in Pat_95 is a gene conversion, as indicated by the normal copy number between GBAP1 and GBA1. Tayebi et al. claimed that this is a gene fusion without orthogonal evidence. In addition, they claimed that Gauchian misreported copy numbers in Pat_92, Pat_42 and Pat_72, again without orthogonal evidence. We validated Gauchian copy number gains by digital PCR in four cases 1. While particular recombinants could be prone to erroneous copy number calling, we do not know what “other techniques'' identified a different copy number in Pat_92. Orthogonal validation using digital PCR would resolve this. Finally, it is true that Gauchian does not have all possible recombinants on its target list, as it is designed to focus on recombinant variants in exons 9-11, because others are rare and detectable with standard callers.

      Tayebi et al. reported 4 samples where Gauchian missed variants in GRCh38 compared to GRCh37. Among these, two (Pat_35, Pat_75) were due to incorrect alignment settings that resulted in abnormally low mapping quality throughout the region. It is likely that ALT-aware alignment was on for all samples except these two. The remaining two (Pat_16, Pat_78) reflected an area of improvement for Gauchian to better call p.Asn409Ser, which is not a GBAP1-like variant, and can thus be called well by standard callers.

      We reported Gauchian calls of 1000 Genomes Project (1kGP) samples, validating some by targeted long reads 1. Gauchian called zero samples with biallelic variant in exons 9-11. However, Tayebi et al. reported a completely different set of Gauchian calls in the same samples (in their Table 4). This was caused by incorrect use of Gauchian on old low coverage WGS (median coverage <10X, https://ftp.1000genomes.ebi... "https://ftp.1000genomes.ebi.ac.uk/vol1/ftp/phase3/data/)"), rather than 30X (https://ftp-trace.ncbi.nlm.... "https://ftp-trace.ncbi.nlm.nih.gov/1000genomes/ftp/1000G_2504_high_coverage/data/)").

      We are grateful to Tayebi et al for assessing Gauchian analysis of this very challenging gene 2, but note that most discrepancies were due to incorrect use or misinterpretation of results. “No call” samples due to inadequate data quality cannot be considered false negative, as no calls are provided, and warnings of noisy coverage are given where applicable. Samples with inadequate coverage should obviously be avoided, as Gauchian is expected to perform at coverage >30X. Gauchian does not call variants not on its target list, which can be expanded. We provide updated recall (99.4%) and precision (98.9%) values. We have not seen any evidence of the alleged inability of Gauchian to call recombinant variants, and would welcome orthogonal copy number assessment of discrepancies. We show that Gauchian can be used for GBA1 assessment when coverage and data quality are adequate. We do note a limitation in genotyping p.Asn409Ser, a non-recombinant variant that can be called by standard variant callers, which we recommend running together with Gauchian for a complete call set. Finally, in clinical cases where absolute certainty is required, Sanger sequencing could be considered, with targeted long read sequencing another option 1,5–7.

      Table 1. Details on the 11 samples where Gauchian and Sanger are inconsistent.

      Gauchian calls Sanger Assessment,Tayebi et al. Our assessmentSample Copy Number of GBA1 and GBAP1 GBAP1-like variant in exons 9-11 Other unphased variants Genotype Prediction

      Pat_08 4 None p.Asn409Ser p.Asn409Ser/p.Gln389Ter False Negative Missed variant is not on Gauchian's target list

      Pat_28 4 None p.Arg535His p.Arg535His/Cys381Tyr False Negative Missed variant is not on Gauchian's target list

      Pat_58 4 None p.Asn409Ser, p.Arg296Ter p.Asn409Ser, p.Arg296Ter, c.203delC False Negative Missed variant is not on Gauchian's target list

      Pat_26 4 None p.Asn409Ser p.Asn409Ser/p.Arg502Cys False Negative Missed variant is not on Gauchian's target list.

      Tayebi et al.’s Supplementary Figure 1 shows no variant at p.Arg502Cys (c.1504C>T), but a different variant at the neighboring position, p.Arg502His (c.1505G>A), which is not on Gauchian's target list.

      Pat_75 None (No Call) NA NA p.Arg502Cys/p.Arg159Trp Missed Copy number is a no-calldue to high variation in depth so no further variant calling was performed. Coverage MAD 0.269

      Pat_76 None (No Call) NA NA p.Asn409Ser/p.Asn409Ser Missed Copy number is a no-call due to high variation in depth so no further variant calling was performed. Coverage MAD 0.128

      Pat_79 None (No Call) NA NA p.Leu483Pro/p.Arg502Cys Missed Copy number is a no-call due to high variation in depth so no further variant calling was performed. Coverage MAD 0.127

      Pat_03 4 None p.Asn409Ser p.Asn409Ser/p.Asn409Ser False Negative Gauchian call is supported by reads, see Tayebi et al.’s Supplementary Figure 1.

      Pat_47 4 None p.Asn409Ser p.Asn409Ser/p.Leu483Pro False Negative True false negative

      Pat_16 3 c.1263del+RecTL/ p.Asn409Ser, p.Asn409Ser p.Asn409Ser, c.1263del+RecTL False Positive Heterozygous p.Asn409Ser misgenotyped as homozygous as Gauchian did not know the exact breakpoint of the c.1263del+RecTL deletion, which is very close to p.Asn409Ser.

      Pat_92 7 p.Asp448His/p.Leu483Pro,p.Asp448His p.Asp448His/ p.Leu483Pro+Rec7 False Negative There is no false negative. Rec7 is reflected in the copy number call (copy number gain). This GBAP1 duplication does not have any functional impact on GBA, so Gauchian does not report it as a GBA variant. Heterozygous p.Asp448His misgenotyped as homozygous.

      Acknowledgements

      We are grateful to Xiao Chen and Michael Eberle for helpful comments. They are former employees of Illumina and current employees of Pacific Biosciences. This research was funded in in part by Aligning Science Across Parkinson's [Grant numbers 000430 and 000420] through the Michael J. Fox Foundation for Parkinson's Research (MJFF).

      Competing interests

      FJS receives research support from PacBio and Oxford Nanopore. AHVS has received consulting fees from AvroBio, Auxilius, Coave, Destin, Enterin, Escape Bio, Genilac, and Sanofi and speaking fees from Prada Foundation.

      Supplementary Information

      Add new variants to Gauchian’s config file

      The four new variants can be added to Gauchian’s config file as follows.

      For hg38, add the following lines to gauchian/data/GBA_target_variant_38.txt

      chr1 155236304 A GBAP G c.1165C>T(p.Gln389Ter)<br /> chr1 155236327 T GBAP C c.1142G>A(p.Cys381Tyr)<br /> chr1 155239989 CGGGGGT GBAP CGGGGGGT c.203delC(p.Thr69fs)

      Add the following line to gauchian/data/GBA_target_variant_homology_region_38.txt<br /> chr1 155235195 T 155214568 C c.1505G>A(p.Arg502His)

      For GRCh37, add the following lines to gauchian/data/GBA_target_variant_37.txt<br /> 1 155206095 A GBAP G c.1165C>T(p.Gln389Ter)<br /> 1 155206118 T GBAP C c.1142G>A(p.Cys381Tyr)<br /> 1 155209780 CGGGGGT GBAP CGGGGGGT c.203delC(p.Thr69fs)

      Add the following line to gauchian/data/GBA_target_variant_homology_region_37.txt<br /> 1 155204986 T 155184359 C c.1505G>A(p.Arg502His)

      Bibliography

      1. Toffoli, M. et al. Comprehensive short and long read sequencing analysis for the Gaucher and Parkinson’s disease-associated GBA gene. Commun. Biol. 5, 670 (2022).

      2. Tayebi, N., Lichtenberg, J., Hertz, E. & Sidransky, E. Is Gauchian genotyping of GBA1 variants reliable? medRxiv (2023) doi:10.1101/2023.10.26.23297627.

      3. Filocamo, M. et al. Somatic mosaicism in a patient with Gaucher disease type 2: implication for genetic counseling and therapeutic decision-making. Blood Cells Mol. Dis. 26, 611–612 (2000).

      4. Hagege, E. et al. Type 2 Gaucher disease in an infant despite a normal maternal glucocerebrosidase gene. Am. J. Med. Genet. A 173, 3211–3215 (2017).

      5. Pachchek, S. et al. Accurate long-read sequencing identified GBA1 as major risk factor in the Luxembourgish Parkinson’s study. npj Parkinsons Disease 9, 156 (2023).

      6. Graham, O. E. E. et al. Nanopore sequencing of the glucocerebrosidase (GBA) gene in a New Zealand Parkinson’s disease cohort. Parkinsonism Relat. Disord. 70, 36–41 (2020).

      7. Leija-Salazar, M. et al. Evaluation of the detection of GBA missense mutations and other variants using the Oxford Nanopore MinION. Mol. Genet. Genomic Med. 7, e564 (2019)

    1. On 2022-07-25 16:31:06, user Dr. D. Miyazawa MD wrote:

      Please also refer to previous studies.

      Hypothesis that hepatitis of unknown cause in children is caused by adeno-associated virus type 2 (08 May 2022)<br /> https://www.bmj.com/content...

      Daisuke Miyazawa. Possible mechanisms for the hypothesis that acute hepatitis of unknown origin in children is caused by adeno-associated virus type 2. Authorea. May 16, 2022.<br /> DOI: 10.22541/au.165271065.53550386/v2

    1. On 2020-05-29 18:32:49, user Sinai Immunol Review Project wrote:

      Main Findings<br /> The authors analyzed and compared the stability of viable SARS-COV-2 and SARS-CoV-1 inoculums in five environmental conditions (aerosol, copper, cardboard, steel, and plastic) by using Bayesian regression model. It was reported that SARS-COV-2 was still detected in aerosols at 3 hours, with an exponential reduction in infectious titer that was similarly observed for SARS-CoV-1. The study also concluded that both SARS-COV-2 and SARS-CoV-1 are more stable on stainless steel and plastic than cardboard and copper. Viable SARS-CoV-2 was detected up to 72 hours on stainless steel and plastic. On copper and cardboard, SARS-COV-2 was viable up to 4 hours and 24 hours, respectively, compared to SARS-CoV-1 which could be detected up to 8 hours on both material types. The half-lives between both viruses are similar, except for on cardboard.

      Limitation of the study<br /> The strain used in the study was SARS-COV-2 nCoV-WA1-2020 (MN985325.1) from the first case of 2019 novel coronavirus in the US. However, mutation throughout the course of the pandemic is inevitable and may cause unpredictable consequences on its transmissibility and disease severity. Thus, follow-up on samples from various patients in different geographic and temporal time points should be conducted.

      Significance<br /> The results support that modes of SARS-COV-2 transmission can be attributed to both aerosol and fomites, due to extended viability for hours in aerosol and up to 72 hours on stainless steel surfaces. The types of plastic, cardboard, copper, and stainless materials were selected to reflect typical hospital and household situations. It is important to compare with the SARS-CoV-1 as similarities between the two suggests methods of mitigating the pandemic by abrogating transmission both in the community and hospital.

      Review by Joan Shang as part of a project by students, postdocs and faculty at the Immunology Institute of the Icahn School of medicine, Mount Sinai.

    1. On 2020-06-10 01:57:51, user Sinai Immunol Review Project wrote:

      Main findings<br /> To improve understanding of the cellular changes in the T and B cell compartments of COVID-19 patients, both during and after disease, Fan et al. analyzed lymphocytes isolated from the PBMCs of 4 severe COVID-19 patients (n=4), 6 COVID-19 recovered patients (n=6), and 3 healthy controls (n=3). Of note, 3 recovered patients' samples were collected 7 days after a negative SARS-CoV-2 test (recovery-early stage; RE) and absence of clinical symptoms, whereas the other 3 samples were collected 20 days after these criteria (recovery-late stage; RL). The authors used single-cell RNA sequencing and single-cell V(D)J sequencing to perform their analysis.

      The authors identified 9 classes of T cells, which included 4 sub-classes of CD4+ T cells and 5 sub-classes of CD8+ T cells. Not surprisingly, across severe COVID-19 patients, the proportion of T cells was reduced, compared to healthy controls. However, differential gene expression analysis revealed that T cells from severe COVID-19 patients highly expressed inflammatory markers, including IFNG and GZMA. Interestingly, when compared to these patients with active disease, RE samples showed significant enrichment of ICOS+ TH2-like follicular helper T cells (TFH), whereas RL samples showed a reportedly significant enrichment of a cluster identified as TH1 cells, though this result should be revisited for review (See biological limitations). These cell types were, in fact, reduced in severe COVID-19 patients. Generally, these T cells from recovering patients continued to indicate persistent activation and counter-regulation, based on expression of TCR activation-associated genes, including RNF125 and PELI1. Subsequent trajectory analyses of transcriptional dynamics indicated transition of effector CD8+ T cells to central memory T cells in RL patients. Ligand-receptor analysis revealed potential interactions between TH1 cells and CD14+ monocytes in severe COVID-19 patients. Finally, TCR sequencing identified several VJ combinations in high frequencies in severe COVID-19 patients, but not others.

      Within the B cell compartments across patients, the authors identified 9 clusters of naive B cells, 2 clusters of memory B cells, 2 clusters of plasma B cells, and a cluster of plasmablasts. Of these clusters, one, in particular, expressed genes characteristic of FCRL5+ atypical memory B cells, which have been described to be induced by viral infections. Interestingly, ligand-receptor analyses of the clusters in each group of patient samples identified different degrees of TFH cell and B cell interactions, suggesting different stages of T cell help for B cell activation. Subsequent BCR characterizations revealed the presence of homogenous monoclonal and heterogeneous clonally expanded B cell populations; the latter population exhibited an enrichment of B cell activation genes. The authors, then, compare across patients to evaluate T and B cell clonality based on V(D)J recombination analyses of RE and RL patient samples (See technical limitations).

      Interestingly, cytokine expression analysis revealed IL-6 expression by B cells. In contrast, B cells expressed IL12A in RE patients, while effector memory CD8, proliferative CD8, and CD4 T cells and plasma B cells highly expressed IL16 in RL patients. The authors report additional cytokine (and cellular) characteristics that distinguish severe COVID-19 patients and recovering patients.

      Limitations<br /> Technical<br /> A primary technical limitation is the sample size of this study for each group. There is little clinical information about the patients and no details about disease severity in patients recruited after viral clearance. For example, age and CMV status have a huge impact on the TCR repertoire, therefore clinical data on the different groups should be presented. Moreover, without additional information on the clinical management of the severe COVID-19 patients and what therapies were given to the recovering COVID-19 patients, it is difficult to compare the cellular changes in the immune landscapes of the COVID-19 patients across samples. Longitudinal analysis would have been more informative especially with regards to repertoire analysis and how expanded clones during active infections might differentiate into particular phenotypes after viral clearance.CD8 expression should have been included in the violin plots, as it is usually more robust and reliable than CD4 expression.

      Biological<br /> An immediate concern is whether the authors mis-characterized cluster 13 as a TH1 cell cluster. The cluster exhibits a low expression of CD3G and CD4. It’s neighboring clusters within the hierarchy belong to monocyte groups, so it is unexpected that a T cell subtype would be belong to their branch of the hierarchy tree. Consider also cluster 38, which shows more robust expression of CD3G and NKG7 and is arranged with the B cell group.

      In addition, the authors did not highlight or discuss expression of co-inhibitory receptors that could elucidate the heterogeneity of T cell differentiation during COVID-19. As a result, it is difficult to truly assess the activation status of the CD8+ cytotoxic T cells and whether there are features of T cell exhaustion.

      Finally, the distinction between naïve and some subsets of memory T cells by scRNA analysis can be challenging. It would be important for the authors to explore whether cluster 26, classified as a naïve CD8 T cell cluster predominant in RL group could be actually memory cells. It would have been important to show clonal diversity of the different clusters.

      Significance<br /> In summary, Fan et al. provide a comparative analysis of lymphocyte changes between PBMCs of patients with ongoing COVID-19 progression and of patients recovering from the disease. Using a combination of single-cell RNA sequencing and V(D)J recombination sequencing, the authors describe specific changes in T and B cell subpopulations over the course of early and late-stage recovery.

      This review was undertaken by Matthew D. Park as part of a project by students, postdocs and faculty at the Immunology Institute of the Icahn school of medicine, Mount Sinai.

    1. On 2020-06-15 01:10:48, user Serge wrote:

      There are many inaccuracies in the report that may significantly affect the conclusions.<br /> 1. Diamond Princess analysis: the mortality data (in single digits) is not sufficient for a confident estimate of the mortality per jurisdiction (for some nations there was only a single case). Moreover, most countries started universal BCG vaccination around 1950s plus the effect of WWII would likely compromise any earlier program to a significant extent. That means that regardless of the country of origin, large part of over 70 population would not be protected and thus shouldn't be considered in verification of the hypothesis.<br /> 2. Certainly there can be no expectation that the protection effect would extend equally into a very advanced age, 60 years and longer after vaccination.<br /> 3. What is meant by the statement "BCG was provided mostly in Europe"? This is plain incorrect, please check "BCG World Atlas".<br /> 4. Country analysis: was the population taken into account? It is not clear from the description of diagrams. I would advise to attempt to calculate mortality per capita, from the most current data and compare it between jurisdictions at a similar period of exposure. Note that all countries with the highest M.p.c. adjusted for the time of exposure, never had a BCG program (or equivalent as in Spain where it was provided for 18 years out of 70) there's simply not a single exception.

    1. On 2020-04-20 15:36:38, user Philip Davies wrote:

      Interesting study, thank you.

      This is another study that attempts to ascertain if oral HCQ tablets can be of clinical use in patients more than one week into symptomatic disease, hospitalized with bilateral pneumonia and with evidence of established inflammatory reaction (cytokine storm). That's a big ask for any oral medication.

      The study is again small (both arms have less than 100 patients). The most significant outcome measured (death) is realized in very small numbers (3 and 4). The confidence levels are extremely wide.

      The are several problems with this study. There are marked differences in the two populations. The study honestly attempts to accommodate these confounding factors using a propensity score method (IPTW). Normally this method is valuable but here I can’t see that it has been well applied.

      It pays to look at the raw data. There is a significant difference (between the two arms) in the initial intensity of disease.

      At baseline (admission), HCQ arm comprises 78.3% men (>20% more of these higher risk patients than control arm with 64.9%); HCQ arm has 21.9% patients with more severe disease in the form of CT showing >50% lung affected). This is >80% more than in control arm (12.1%). HCQ arm has 90.5% patients with CRP > 40mg/l (CRP is a good indicator of impending/current severity). This is 10% higher than control arm (81.9%). HCQ arm had median O2 flow on admission = 3 litres/minute (50% higher than control arm at 2 litres / minute).

      So, at baseline, the HCQ arm had significantly more patients with severe disease than control arm. The O2 flow is actually more significant than first sight would suggest. 2 l/m is always the first step in O2 therapy. The data shows us that most patients in the control arm could hold their sats on this first step therapy. This also means they may have been OK on just 1 l/m. We don't know. But we do know that most patients in the HQN could not hold their sats at that first step and needed an increase (3 l/m ... so that's 50-300% more O2 than control arm).

      Admittedly there were other confounding factors which compromised the control arm more than HCQ arm (some chronic disease elements). But it's clear to me that disease severity was markedly more established in the HCQ arm.

      Another factor to note: the HCQ treatment was not initiated at the moment those baseline values were obtained (on admission). The HCQ was initiated within 48 hours. So let’s look again at the timelines. The median duration of symptoms at admission shows that the HCQ arm comprised patients who were further into worsening illness: they were admitted on D8 compared to control, D7. They may not have had HCQ initiated until D10.

      Then we look at outcomes: the raw data shows that the disadvantaged HCQ arm actually does better in the two most important outcomes, death and ICU admission. The HCQ delivers 12% less death and ICU admissions than the control arm. Admittedly the numbers are small so the confidence levels are very wide.

      So what does that tell us? The answer is not much. But even accepting the poorly aligned baseline for disease severity, the outcomes with their wide 95% confidence levels do deliver a mildly promising indication on the 'swingometer'. They point more towards benefit than harm when using HCQ in this advanced disease state.

      As a final comment on significant side effects (increased QT interval) from the use of HCQ. Once again, this trial used a particularly high dose of HCQ (600mg/day...right at ceiling dose for rheumatological use and much higher than the total antimalarial treatment dose). They also added azithromycin (another QT lengthening drug) to 20% of the HCQ patients. It’s not surprising at all to find such QT lengthening in a sick, more elderly population taking these medications in particularly high doses).

      Further trials should utilize conservative doses of CQ/HCQ which have been proven safe in many millions of patients.

      We don't yet know how this will pan out. We urgently need proper evidence. Statistically robust studies into prophylaxis and early intervention are likely to deliver the most interesting results.

      Dr Philip Davies<br /> Aldershot Centre For Health<br /> http://thevirus.uk

    1. On 2020-04-23 17:27:44, user Sinai Immunol Review Project wrote:

      Presence of SARS-CoV-2 reactive T cells in COVID-19 patients and healthy donors

      Braun J et al.; medRxiv 2020.04.17.20061440; https://doi.org/10.1101/202...

      Keywords

      • SARS-CoV-2 specific CD4 T cells

      • Human endemic coronaviruses

      • COVID-19

      Main findings

      In this preprint, Braun et al. report quantification of virus-specific CD4 T cells in 18 patients with mild, severe and critical COVID-19, including 10 patients admitted to ICU. Performing in vitro stimulation of PBMCs with two sets of overlapping SARS-CoV-2 peptide pools – the S I pool spanning the N-terminal region (aa 1-643) of the S protein, including 21 predicted SARS-CoV-1 MHC-II epitopes, and the C-terminal S II pool (aa 633-1273) containing 13 predicted SARS-CoV-1 MHC-II epitopes – the authors detected S-protein-specific CD4 T cells in up to 83% of COVID-19 patients based on intracellular 4-1BB (CD137) and CD40L (CD154) induction. Notably, peptide pool S II shares higher homology with human endemic coronaviruses (hCoVs) 229E, NL63, OC43, and HKU1 that may cause the common cold, but it does not include the SARS-CoV-2 receptor-binding domain (RBD), which has been identified as a critical target of neutralizing antibodies in both SARS-CoV-1 and SARS-CoV-2. S I-reactive CD4 T cells were found in 12 out of 18 (67%) patients, whereas CD4 T cells against S II were detected in 15 patients (83%). Intriguingly, S-specific CD4 T cells could also be found in 34% (n=23) of 68 SARS-CoV-2 seronegative donors, referred to as reactive healthy donors (RHD), with a preference for S II over S I epitopes. Only 6 of 23 RHDs also had detectable frequencies of S I-specific CD4 T cells, overall suggesting S II-reactive CD4 T cells had likely developed in response to prior infections with hCoVs. Of 18 out of 68 total healthy donors tested, all were found to have anti-hCoV antibodies, although this was independent of concomitant anti-S II CD4 T cell frequencies detected. This finding mirrors observations of declining numbers of specific CD4 T cells, but persistent humoral memory after certain vaccinations such as against yellow fever. The authors further speculate that these pre-existing virus-specific T cells against hCoVs might be one of the reasons why children and younger patients, usually considered to have a higher incidence of hCoV infections per year, are seemingly better protected against SARS-CoV-2. Unlike specific CD4 T cells found in RHDs, most S-specific CD4 T cells in COVID-19 patients displayed a phenotype of recent in vivo activation with co-expression of HLA-DR and CD38, as well as variable expression of Ki-67. In addition, a substantial fraction of peripherally found HLA-DR+/CD38+ bulk CD4 T cells was found to be refractory to peptide stimulation, potentially indicating cellular exhaustion.

      Limitations

      This is one of the first preprints reporting the detection of virus-specific CD4 T cells in COVID-19 (also cf. Dong et al., https://www.medrxiv.org/con... Weiskopf et al., https://www.medrxiv.org/con... "https://www.medrxiv.org/content/10.1101/2020.04.11.20062349v1.article-info)"). While it generally adds to our current knowledge about the potential role of T cells in response to SARS-CoV-2, a few limitations, some of which are discussed by the authors themselves, should be addressed. Findings in this study pertain to a relatively small cohort of patients of variable clinical disease. To corroborate the observations made here, larger studies including both more healthy donors and more patients of all clinical stages are needed to better assess the function of virus-specific CD4 T cells in COVID-19. Specifically, the presence of pre-existing, potentially hCoV-cross-reactive CD4 T cells in healthy donors needs to be explored in the context of COVID-19 immunopathogenesis. While the authors suggest a potentially protective role based on higher incidence of hCoV infection in children and younger patients, and therefore a presumably larger pool of pre-existing virus-specific memory T cells, the opposite could also be the case given cumulatively increased number of hCoV infections in older patients. In this context, it would therefore have been interesting to also measure anti-hCoV antibodies in COVID-19 patients. Furthermore, this study did not quantify virus-specific CD8 T cells. Based on observations in SARS-CoV-1, virus-specific memory CD8 T cells are more likely to persist long-term and confer protection than CD4 T cells, which were detected only at lower frequencies six years post recovery from SARS-CoV-1 (cf. Li CK et al., Journal of immunology 181, 5490-5500.) Morover, no other specifities such as against the N or M epitopes were evaluated. Robust generation of virus-specific T cells against the N protein was shown to be induced by SARS-CoV-2 in another pre-print by Dong et al. (Dong et al., https://www.medrxiv.org/con... "https://www.medrxiv.org/content/10.1101/2020.03.17.20036640v1)"), while Weiskopf et al. recently reported preference of both CD8 and CD4 T cells for S epitopes https://www.medrxiv.org/con... "https://www.medrxiv.org/content/10.1101/2020.04.11.20062349v1.article-info)"). Moreover, the authors seem to suggest that some of the virus-specific CD4 T cells detected could be potentially cross-reactive to predicted SARS-CoV-1 epitopes present in the peptide pools used. Indeed, this has been recently established for several SARS-CoV-2 binding antibodies, while it was found not to be the case for RBD-targeting neutralizing antibodies (cf. Wu et al., https://www.medrxiv.org/con... Ju et al., https://www.biorxiv.org/con... "https://www.biorxiv.org/content/10.1101/2020.03.21.990770v2)"). A similar observation has not been made for T cells so far and should be evaluated. Finally, since reactive healthy donors were only tested for anti-S1 IgG, however not for other more ubiquitous binding antibodies, e.g. against M, and only a fraction of these donors was additionally confirmed to be negative by PCR, there is, though unlikely, the possibility that some of the seronegative reactive donors had been previously exposed to SARS-CoV-2.

      Significance

      Quantification of virus-specific T cells in peripheral blood is a useful tool to determine the cellular immune response to SARS-CoV-2 both in acute disease and even more so post recovery. Ideally, once immunogenic T cell epitopes are better characterized, tetramer assays will allow for faster and more efficient detection of their frequencies. Moreover, assessing the potential role of pre-existing virus-specific CD4 T cells in healthy donors in the context of COVID-19 pathogenesis will be of particular importance. The observations made here are also highly relevant for the design and development of potential vaccines and should therefore be further explored in ongoing research on potential coronavirus therapies and prevention strategies.

      This review was undertaken by V. van der Heide as part of a project by students, postdocs and faculty at the Immunology Institute of the Icahn school of medicine, Mount Sinai.

    1. On 2020-06-11 13:53:48, user peter tofts wrote:

      please include: 1.) what type of corticosteroid was used (meythyleprednisolone) 2.) the dose (?1mg/kg or other v pulsed) duration etc... 3.) timing: the authors mention timing around 7 days from onset of symptoms- also Delay respect to Sx 13+/- 4.2 so I suppose maybe 6 days +/- 4 into their hospitalization? interesting paper thankyou

    1. On 2020-05-01 23:43:12, user Sinai Immunol Review Project wrote:

      Title A single-cell atlas of the peripheral immune response to severe COVID-19<br /> Wilk, A.J. et al. MedRxiv ; doi:10.1101/2020.04.17.20069930

      Keywords<br /> scRNAseq; Interferon-Stimulating Genes (ISGs); Activated granulocytes

      Main Findings<br /> The authors performed single-cell RNA-sequencing (scRNAseq) on peripheral blood from 6 healthy donors and 7 patients, including 4 ventilated and 3 non-ventilated patients. 5 of the patients received Remdesivir.

      scRNAseq data reveal 30 gene clusters, distributed among granulocytes, lymphocytes (NK, B, T cells), myeloid cells (dendritic cells DCs, monocytes), platelets and red blood cells. Ventilated patients specifically display cells containing neutrophil granule proteins that appear closer to B cells than to neutrophils in dimensionality reduction analyses. The authors named these cells “Activated Granulocytes” and suggest them to be class-switched B cells that have lost the expression of CD27, CD38 and BCMA and acquired neutrophil-associated genes, based on RNA velocity studies.

      SARS-CoV2 infection leads to decreased frequencies of myeloid cells, including plasmacytoid DCs and CD16+ monocytes. CD14+ monocyte frequencies are unchanged in the patients, though their transcriptome reveals an increased activated profile and a downregulation of HLAE, HLAF and class II HLA genes. NK cell transcriptomic signature suggests lower CD56bright and CD56dim NK cell frequencies in COVID-19 patients. NK cells from patients have increased immune checkpoint (Lag-3, Tim-3) and activation marker transcripts and decreased maturation and cytotoxicity transcripts (CD16, Ksp-37, granulysin). Granulysin transcripts are also decreased in CD8 T cells, yet immune checkpoint transcripts remain unchanged in both CD8 and CD4 T cells upon SARS-CoV2 infection. The frequencies of memory and naïve CD4 and CD8 T cell subsets seem unchanged upon disease, though gdT cell proportions are decreased. SARS-CoV2 infection also induces expansion of IgA and IgG plasmablasts that do not share Ig V genes.

      Interferon-signaling genes (ISGs) are upregulated in the monocyte, the NK and the T cell compartment in a donor-dependent manner. ISG transcripts in the monocytes tend to increase with the age, while decreasing with the time to onset disease. No significant cytokine transcripts are expressed by the circulating monocytes and IFNG, TNF, CCL3, CCL4 transcript levels remain unchanged in NK and T cells upon infection.

      Limitations<br /> The sample size of the patients is limited (n=7) and gender-biased, as all of them are men.<br /> The activating and resting signatures in monocytes should be further detailed. The authors did not detect IL1B transcripts in monocytes from the patients, though preliminary studies suggest increased frequencies of CD14+ IL1B+ monocytes in the blood of convalescent COVID-19 patients[1].<br /> Decreased NK cells, B cells, DCs, CD16+ monocytes and gdT cells observed in peripheral blood might not only reflect a direct SARS-CoV2-induced impairment, but also the migration of these cells to the infected lung, in line with preliminary data suggesting unchanged NK cell frequencies in the patient lungs[2].<br /> The authors identified platelets in their cluster analyses. Recent reports of pulmonary complications secondary to COVID-19 describe thrombus formation that is probably due, in part, to platelet activation[3, 4]. A targeted characterization of the platelet transcriptome may thus benefit an increased understanding of this phenomenon.<br /> The transcriptome of the Activated Granulocytes should be further detailed. As discussed by the authors, IL24 and EGF might be involved in the generation of the Activated Granulocytes, though these cytokines are poorly represented in the blood of the patients. The generation of these cells should therefore be further investigated in future studies.

      Significance<br /> The authors show a SARS-CoV2-induced NK cell dysregulation, in accordance with previous studies[5]. Alongside the upregulation of ISGs in NK cells, these findings suggest an impaired capacity of the NK cells to respond to activating signals in COVID-19 patients. The unchanged expression of immune checkpoints on CD4 and CD8 T cells suggest distinct SARS-CoV2 dysregulation pathways in the NK and the T cell compartments. In particular, the downregulation of transcripts encoding for class II HLA but not for the HLA-A, -B, -C molecules in monocytes suggest an impaired antigen presentation capacity to CD4 T cells, which should be further investigated.<br /> The authors provide preliminary results suggesting an age-related activation of the monocytes in the COVID-19 patients. Future studies will be needed to evaluate if the age impacts the involvement of the monocytes in the cytokine storm observed in COVID-19 patients.

      References<br /> 1. Wen, W., et al., Immune Cell Profiling of COVID-19 Patients in the Recovery Stage by Single-Cell Sequencing. MedRxiv, 2020.<br /> 2. Liao, M., et al., The landscape of lung bronchoalveolar immune cells in COVID-19 revealed by single-cell RNA sequencing. MedRxiv, 2020.<br /> 3. Giannis, D., I.A. Ziogas, and P. Gianni, Coagulation disorders in coronavirus infected patients: COVID-19, SARS-CoV-1, MERS-CoV and lessons from the past. J Clin Virol, 2020. 127: p. 104362.<br /> 4. Dolhnikoff, M., et al., Pathological evidence of pulmonary thrombotic phenomena in severe COVID-19. J Thromb Haemost, 2020.<br /> 5. Zheng, M., et al., Functional exhaustion of antiviral lymphocytes in COVID-19 patients. Cell Mol Immunol, 2020.

      Credit<br /> Reviewed by Bérengère Salomé and Zafar Mahmood as part of a project by students, postdocs and faculty at the Immunology Institute of the Icahn School of Medicine, Mount Sinai

    1. On 2020-05-04 18:15:10, user Dr SK Gupta wrote:

      High Dose Chloroquine with Poor patient selection are the culprits- not the drug <br /> Investigators were over enthusiastic in using a higher dose of chloroquine in elderly patients. In China National Health and Care Commission officially included the Chloroquine as medical agent on 19 Feb 2020 to be used in corona virus treatment plan. The dose of 500mg of chloroquine twice a day was decided following in vitro studies EC50 values, PBPK modeling and mice RLTEC data projected on Human beings (1). <br /> The initial recommended dose of 500 mg of chloroquine phosphate salt twice per day can quickly approach danger thresholds with sustained use at the maximum course of 10 days (Total chloroquine base 6gm). The lethal dose of chloroquine base in adults is about 5g. In China, On Feb 26, 2020, the treatment guidelines were revised, shortening the maximum course to 7 days to keep the total dose of chloroquine base 4.2 gm much lower than toxic dose (2). <br /> Elderly population is particularly prone to chloroquine toxicity especially at high doses. It is unfortunate in present study, that a base line ECG was not done to measure the QTc interval because the drug should be avoided if the QTc was more than 500ms especially in patients with severe disease prone to develop myocarditis due to primary disease Covid-19 per se(3). On the contrary we find that the higher dose regimen included Older age population with mean [SD] age, 54.7 [13.7] years vs 47.4 [13.3] years with more heart disease (5 of 28 [17.9%] vs 0) as compared to lower dose regimen. We in India having hige experience of using the drug would refrain from using such high doses.<br /> Gao et al reported results from more than 100 patients demonstrated that chloroquine phosphate is superior to the control treatment: in inhibiting the exacerbation of pneumonia, improving lung imaging findings, promoting a virus-negative conversion, shortening the disease course. Severe adverse reactions to chloroquine phosphate were not noted in the aforementioned patients (4). <br /> Poor patient selection and use of toxic doses of chloroquine seems to have brought disrepute to a promising drug. More studies are required before condemning the drug in present indication of Covid 19<br /> References:<br /> 1. Wang M, Cao R, Zhang L, et al. Remdesivir and chloroquine effectively inhibit the recently emerged novel coronavirus (2019-nCoV) in vitro. Cell Res 2020; 30:269–71<br /> 2. COVID-19: a recommendation to examine the effect of hydroxychloroquine in preventing infection and progression Dan Zhou, Sheng-Ming Dai and Qiang Tong J Antimicrob Chemother doi:10.1093/jac/dkaa114

      1. Cardiovascular risks of hydroxychloroquine in treatment and prophylaxis of COVID-19 patients: A scientific statement from the Indian Heart Rhythm Society<br /> Aditya Kapoor, Ulhas Pandurangi,Vanita Arora, Anoop Gupta, Aparna Jaswal et al. Indian Pacing and Electrophysiology Journal, https://doi.org/10.1016/j.i...

      2. Gao J, Tian Z, Yang X. Breakthrough: chloroquine phosphate has shown apparent efficacy in treatment of COVID-19 associated pneumonia in clinical studies. Bioscience Trends 2020; 14:72–3

    1. On 2020-03-28 22:38:39, user Sinai Immunol Review Project wrote:

      Summary of Findings: <br /> - Prospective cohort of 67 patients, clinical specimens taken and follow-up conducted. <br /> - Viral shedding, serum IgM, IgG antibody against NP evaluated and correlated to disease severity and clinical outcome <br /> - Viral RNA levels peaked at 1 week from febrile/cough symptom onset in sputum, nasal swabs, and stool samples. Shedding ranged from 12-19 days (median ranges) and was longer in severe patients. <br /> - IgM and IgG titers stratified patients into three archetypes as ‘strong vs weak vs non-responders’. Strong responders (with higher IgM/IgG titers) were significantly higher in severe patients.

      Limitations (specific for immune monitoring <br /> - Patient cohort is small for such a study and no individuals who were asymptotic were included; thus we cannot clearly interpret antibody titer associations with disease severity without "immunity" response.<br /> - Not clear if stool RNA captured from live infection in intestine/liver or from swallowed sputum. Transmission electron microscopy (TEM) carried out on sputum samples as proof of concept, but not stools. TEM unreasonable for actual clinical diagnosis. <br /> - Several patients had co-morbidities (such as pulmonary and liver disease) that were not accounted for when tracking antibody responses. Viral kinetics and IgM/IgG titers in subsets of patients with underlying conditions/undergoing certain medication would be informative.

      Relevance (specific for immune monitoring) <br /> - Three archetypes of antibody response to SARS-CoV-2 with different disease progression and kinetics is useful to stratify patients, and for future serological tests.

      • Strong spike-IgG levels often correlate with lymphopenia and CoVID-19 disease severity (https://doi.org/10.1101/202... ), similar to macaque studies in SARS (1). It would be critical to see if anti-NP or anti-Spike IgG antibodies for SARS-CoV-2 also elicit similar detrimental effects before clinical use.

      References: <br /> 1. Liu L, Wei Q, Lin Q, Fang J, Wang H, Kwok H, et al. JCI Insight 2019; 4(4): pii: 123158. <br /> Doi: 10.1172/jci.insight.123158

      Review by Samarth Hegde as part of a project by students, postdocs and faculty at the Immunology Institute of the Icahn school of medicine, Mount Sinai.

    1. On 2020-06-04 17:10:24, user Mandy Lyons wrote:

      "only 37.4% of suspected SARS-CoV-2 patients seroconverted"<br /> 1) What are the criteria for suspected SARS-CoV-2 patients?

      2) Do these suspected cases have SARS-CoV-2, or do they have an infection which mimics SARS-CoV-2?<br /> 3) Is the test testing for the test? I.e. is there something functionally different in the infection causing presumed cases which, if it actually is SARS-CoV-2, would cause the antibody test to be inaccurate?<br /> 4) Is there another illness circulating which mimics SARS-CoV-2 which has heretofore not been identified?<br /> 5) Is there any follow-up or investigation on these negative antibody cases in both the confirmed and suspected cases?

    1. On 2020-05-18 16:23:32, user Daniel Connelly wrote:

      From the beginning, it was known that Zinc is the active portion of the HCQ & Zinc combination. The HCQ was necessary to increase intracellular Zinc to block viral replication. The organizers of this study are both brave and brilliant.....but they also were not fully truthful. They called it a retrospective study when it is really a prospective study. The experimental arm was with Zinc and the control was without Zinc. The cohorts for the 2 arms were well matched and the regimen standardized. HCQ was not officially part of the study as it was dosed the same in both cohorts. <br /> Why would they need to organize the study this way? IMO, because they would have been blocked from doing a prospective study around HCQ. That is to dirty politics that good physicians are fighting against to save lives.<br /> What does this "prospective" study of "Zinc" show????<br /> 1. All other studies which did not use Zinc along with HCQ are at best, irrelevant and at worst, fraudulent.<br /> 2. HCQ with or without Zinc is useless in severely ill ICU patients.....as expected.<br /> 3. Zinc with HCQ was effective early to increase recovery and prevent death....as expected.<br /> 4. The study strongly supports the proposed mechanism of action of HCQ as a zinc ionophore.

      What we don't know:<br /> 1. How effective is HCQ + Zinc + Azithromycin when given in the ambulatory setting with the onset of symptoms?<br /> 2. How many hospitalizations would be avoided? Deaths?<br /> 3. How much is the transmission R0 value reduced for patients on this drug combo, especially in closed environments like nursing homes? How much is the environmental viral load decreased?

    1. On 2021-05-26 15:18:37, user Turki Bin Hammad wrote:

      The Astrazenca vaccine is reported to elicit much higher immune response when the second dose is given 2 to 3 months later compared to the 4-Week interval. Was the immune data for the AZD/Oxford vaccine in this study took into account the expected difference with various dosing schedules?

    1. On 2021-08-12 10:05:30, user Ken Sprenger wrote:

      There are a number of concerns with the methodology and consistency in this study:<br /> 1. There is inconsistency in the description of the population of patients to be enrolled. The study registered on ClinicalTrials.gov (NCT 044297411) indicates that the study would include patients with mild to moderate COVID-19, whereas the title of the study published on medRxiv indicates that patients would have mild COVID-19. Then under Study Design in Methods in the publication, it indicates that the study would include mild to moderate COVID-19 patients. Then under Study Population in the publication it includes “asymptomatic cases”. In Table 1 All patients (N=89) and Symptomatic =72, therefore 17 (19%) of patients were asymptomatic. It would appear, therefore, that the definition of the study population had been substantially amended to include asymptomatic patients. There are a number of considerations:<br /> a. This was therefore no longer a study of ivermectin in patients with mild to moderate or even mild COVID-19, as stated in the publication and elsewhere.<br /> b. It would be important to know when the decision was made to enrol the asymptomatic patients and the reasons for this change.

      1. Under Intervention in the Methods section of the publication, the authors say “Unexpectedly some patients who were isolated in the hotels as verified positive patients were found to be borderline or negative upon our RT-PCR test” and were withdrawn from the study. Figure 1 indicates that 7 patients assigned to the ivermectin arm and 14 patients assigned to the placebo arm had Ct values >35 in the “two first tests”. <br /> a. This means that 18% of patients were withdrawn after the start of the study as they had all been enrolled and randomized. <br /> b. The Ct values given in Figure 1 are all >35, and so it is difficult to understand why the text in the publication says “borderline or negative”. What did borderline mean in regard to Ct values? <br /> c. The authors state under Intervention in Methods that “… all RT-PCR tests, including verification that patients were positive on day zero, were conducted by the same lab, at the Israel Central Virology Laboratory of the Ministry Of Health (located at Sheba Medical Centre).”<br /> d. Withdrawing 18% of patients who initially qualified for the study (Ct confirmed by Ministry of Health laboratory) because they had reduced viral shedding (even if negative or approaching negative) in subsequent RT-PCR tests when reduction of viral shedding is the endpoint of the study, is problematic. Generally once patients are enrolled in a study they should not be withdrawn by the investigator except for safety reasons, or if the participant withdraws consent. An intention to treat analysis including these patients should have been performed.

      2. There is a further change to the protocol which defines the time from symptom onset to start of medication. Initially start of study drug dosing appears to have been within 3 days of symptom onset, but then because of “delay in getting results in the community” of the RT-PCR this time was increased to 7 days from symptom onset. <br /> a. There is no indication of when this amendment was introduced, but it appears to be after the start of the study, in which case some patient’s day 6 (3 days from symptom onset + 3 days of study drug), will be different to others which will be 7 days from symptoms + 3 days of study drug. <br /> b. As it is possible that the viral load will be a little lower in patients on day 7 compared to day 3 after symptom onset, and the patients who started treatment on day 7 might reach Ct >30 at the end of 3 days of treatment sooner than patients who started treatment on day 3. <br /> c. Measuring the endpoint Ct (which will be changing with time, unrelated to study drug) at different intervals from baseline is potentially introducing a bias.

      3. Medication is described differently in 3 places:<br /> a. Abstract in the publication: 0.2 mg/kg x 3 days.<br /> b. Randomisation section in Methods in the publication: in patients 40-69kg, 4 tablets (=12 mg daily) x 3 days and in patients >= 70 kg, 5 tablets (=15 mg daily) x 3 days.<br /> c. ClinicalTrials.com description: 3mg capsules, 15-20mg/day x 3 days.<br /> These three statements are all different doses and some are described as capsules, others as tablets. Consistency and the correct description of study medication is critical in a study which tests a drug against a placebo.

      4. Primary outcome. Under Outcomes in the publication it states “The primary endpoint was viral clearance following a diagnostic swab taken on the sixth day (third day after termination of treatment), in the intervention group compared to placebo.” A negative swab was defined as RT-PCR Ct >30. The authors point out that the standard for a negative swab in Israel is a Ct >40 and, although they don’t mention this, the manufacturer of the RT-PCR kit used in the study (Seegene Allplex CoV19) recommends a cut-off Ct >40 in their manual. Despite these exiting standards they made a decision to use a Ct >30 because for a Ct >40 “reaching this level may take a few weeks, and there is significant evidence that a non-infectious state is usually achieved at Ct level >30”. <br /> a. It seems inappropriate in a study such as this, to change a commonly accepted standard (Ct >40) in Israel to save a “few weeks” of time. <br /> b. The point at which the decision was made to use the Ct >30 as negative is not declared and one would be concerned if it was taken after the start of the study and particularly if it was after unblinding of the study!

      Conclusions:<br /> 1. There are numerous deviations from the ClinicalTrials.Gov description (amended version June 14, 2020) including: patient inclusion criteria, participant eligibility period post exposure (maximum 72 hours), study medication description, another 2 outcome measures which were not reported. Many deviations such as these raise concerns about the thoroughness with which the study was conducted and reported.<br /> 2. This was not a study of patients with mild to moderate COVID-19 as indicated in the publication title (or even with just mild COVID-19) as nearly 1 in 5 of randomized patients were asymptomatic. <br /> 3. A large number of patients (18%) were withdrawn from the study by the investigator as their RT-PCRs had Ct values >35) on day 2 and 4, after an initial positive. This is highly unusual and could have biased the study. An intention to treat analysis should have been performed. <br /> 4. The definition of a negative swab, which was really the endpoint, appears to have been made arbitrarily, as it did not align with Israel’s standards and the kit manufacturer’s manual, and there is no clarity as to when this decision was made. <br /> 5. A study amendment to the protocol which defined the time from symptom onset to start of study drug dosing was changed from 3 to 7 days. This could have biased the study.

      It might be true that Ivermectin does reduce viral shedding time in mild to moderate COVID-19. However this study is not rigorous enough, includes amendments which could have introduced bias, and has methodological issues. In my opinion it should not be accepted as good evidence that ivermectin reduces viral loads and could reduce isolation time in patients with COVID-19.

    1. On 2020-04-23 15:26:33, user Razvan Valentin Florian wrote:

      According to https://jcm.asm.org/content... , the concentration of virus RNA in feces is about 4 x 10^3 RNA/ml undiluted, i.e. of the order of 10^6 RNA/l. Assuming a factor of dilution of 10^-2 of feces in wastewater, a factor of disintegration of less than 10^-1 of RNA on the way from toilet to collection point, and a factor of prevalence in the population of less than 10^-2, this leads to less than 10^1 RNA/l in wastewater at the collection point. The preprint mentions that "the quantification limit was 10^3 equivalent viral genomes per liter" and the graph indicates that they found 10^4-10^7 eq/L (probably equivalent RNA/l). This seems implausible according to the previous back-of-the-napkin computation. I would be happy if this estimate is invalidated, since if measuring concentration of virus RNA in wastewater would be possible, this would be a great tool for the management of the epidemic.

    1. On 2020-04-30 14:34:37, user Ivan Berlin wrote:

      Rentsch CT et al. Covid-19 Testing, Hospital Admission, and Intensive Care Among 2,026,227 United States Veterans Aged 54-75 Years. <br /> medRxiv preprint doi: https://doi.org/10.1101/202... version posted April 14, 2020<br /> Review of the results concerning smoking related issues.<br /> Ivan Berlin<br /> The title is somewhat confusing. Only 3789 persons were tested for SARS-CoV-2.<br /> Data are extracted from the Veteran Administration (USA) Birth Cohort born between 1945 and 1965 electronic database. Between February 8 and March 30, 2020, 3789 persons were tested for SARS-CoV-2. Among them 585 were tested SARS-CoV-2 positive (15.4%) and 3204 SARS-CoV-2 negative. (Remark: the authors frequently confound testing for SARS-CoV-2 and having the disease: COVID-19.)<br /> Testing used nasopharyngeal swabs, 1% of the testing samples was from other unspecified sources. Testing was performed “in VA state public health and commercial reference laboratoires”, page 7. No further specification about the testing method is provided. Data are analyzed as if no between test-sources variability existed. However, it is unlikely that between test-source variability would influence the findings.<br /> Data extraction included diagnostics by diagnostic codes of comorbidities, non-steroid inflammatory drug (NSAID), angiotensin converting enzyme inhibitor (ACE) and angiotensin II receptor blocker (ARB) use, vital signs, laboratory results, hepatic fibrosis score, presence or absence of alcohol use disorder and smoking status.<br /> Smoking status data, never, former, current smokers were extracted using the algorithm described in McGinnis et al. Validating Smoking Data From the Veteran’s Affairs Health Factors Dataset, an Electronic Data Source. Nicotine & Tobacco Research, Volume 13, Issue 12, December 2011, Pages 1233–1239, https://doi.org/10.1093/ntr... used for HIV patients. According to this paper, the algorithm correctly classified 84% of never-smokers 95% of current smokers but only 43% of former smokers. The reported overall kappa statistic was 0.66. When categories were collapsed into ever/never, the kappa statistic was somewhat better: 0.72 (sensitivity = 91%; specificity = 84%), and for current/not current, 0.75 (sensitivity = 95%; specificity = 79%). Thus, classification error cannot be excluded in particular in classifying former smokers. <br /> In unadjusted analyses (Table 1) factors associated significantly with SARS-CoV-2 positivity were: male sex, black race, urban residence, chronic kidney disease, diabetes, hypertension, higher body mass index, vital signs but not NSAID or ACE/ARB exposure. It is to note, that among the laboratory findings, severity of hepatic fibrosis was associated with positive SARS-CoV-2 tests. <br /> Among those with alcohol use disorder, 48 (8.2%) tested positive versus 480 (15%) who tested negative (p<0.001).<br /> Among never smokers 216 (36.9%) tested positive vs 826 (25.8%) who tested negative. Among former smokers 179 (30.6%) tested positive vs 704 (22%) who tested negative. Among current smokers 159 (27.7%) tested positive vs 1444 (45.1%) who tested negative. Expressed otherwise, among SARS-CoV-2 negative individuals, there were less never smokers, less former smokers and more current smokers. To note: the reported OR for current smoking should be the inverse to that presented i.e. <1 and not >1. However, among individuals with SARS-CoV-2 positivity there were more persons with positive smoking history (former + current smokers): 57.8 % than with negative smoking history (never smokers): 36.9%. <br /> COPD, known to be strongly related to former or current smoking, was more frequent among SARS-CoV-2 negative (28.2%) than among SARS-CoV-2 positive (15.4%) individuals p<001).<br /> In multivariable analyses (Table 2), male sex, black ethnicity, urban residence, lower systolic blood pressure, prior use of NSAID but not ACE/ARB use and obesity were associated with SARS-CoV-2 positive test; current smoking (OR: 0.45, 91% CI: 0.35-057), alcohol use disorder (OR 0.58, 95%CI: 0.41-0.83) and COPD (OR: 0.67, 95%CI: 0.50-0.88) were associated with decreased likelihood of SARS-CoV-2 positive test. No association with age and SARS-CoV-2 positive test was observed. The association with hepatic fibrosis with SARS-CoV-2 positive tests remained significant in the multivariable analysis and the authors point out (page 15) that the “pronounced independent association with FIB-4 (fibrosis) and albumin suggest that virally induced haptic inflammation may be a harbinger of the cytokine storm.”, page 15. <br /> The main risk factors for hospitalization or ICU among SARS-CoV-2 positive persons are those that associated with a worse clinical signs (status). This is expected: clinical decision about severity is based on current clinical signs and not on previous history. <br /> Neither co-morbidities, nor smoking status or alcohol use disorder were associated with hospitalization/ICU. Surprisingly, age was inversely associated with hospitalization (Table 4) among SARS-CoV-2 positive individuals.<br /> Conclusion of the reviewer.<br /> This is the first report showing that there are less current smokers among SARS-CoV-2 positive persons. However, smoking history (former + current smoking) seems to be more frequent among SARS-CoV-2 positive individuals than never smoking. It is not known what is the percent of former smokers who were recent quitters; duration of previous abstinence from smoking is a crucial variable in assessing associations with smoking status. This raises the question of the validity of smoking status category classification. <br /> It is not known when smoking status is reported with respect of the SARS-CoV-2 testing. It is likely that individuals with clinical symptoms stopped smoking some days before testing and considered themselves as former smokers.

      The fact that alcohol use disorder, which is frequently associated with tobacco use disorder, is also less frequent among SARS-COV-2 positive individuals raises the question of the specificity of the smoking finding and the raises the contribution of substance use disorders overall i.e. the finding about current smoking is part of a cluster of various previous or current substance use disorders e.g. cannabis use, potentially associated with SARS-CoV-2 negative test. <br /> COPD as well as current smoking are being reported to be more frequent among SARS-CoV-2 negative individuals raising the possibility that reduced respiratory function (entry of SARS-CoV-2 is by the respiratory tract) is associated with lower likelihood of SARS-CoV-2 positive tests. This hypothesis may suggest that reduced respiratory function and not smoking itself is associated with higher likelihood of SARS-CoV-2 negative tests. <br /> The paper does not report on analyses of smoking by clinical signs/co-morbidities interactions. It is likely that former smokers or those with alcohol use disoders are more frequent among individuals with comorbidities. Based on previous knowledge about smoking associated health disorders, one can assume that more severe clinical signs were associated with current smoking or among recent quitters; the smoking x clinical signs interaction is not tested. <br /> The authors conclude on page 14 “To wit, we found that current smoking, COPD, and alcohol use disorder, factors that generally increase risk of pneumonia, were associated with decreased probability of testing positive. While they were not associated with hospitalization or intensive care, it is too early to tell if these factors are associated with subsequent outcomes such as respiratory failure or mortality.”<br /> The reduced current smoking rate among SARS-CoV-2 positive individuals is an interesting but preliminary finding. It is likely that it is part of a more complex symptomatology and not specific to current smoking. Smoking status should have been assessed on a more detailed manner. The current findings, from a retrospective cross sectional analysis, certainly not support the hypothesis that current smoking protects against SARS-CoV-2 positivity.

    1. On 2020-05-06 08:05:15, user Prof Pranab Kumar Bhattacharya wrote:

      Dear Editor<br /> In the world, Corona virus cases jumped up till 3rd May 2020 from December 2019 is 3,51,743 with death 2,45,617 (18%) and 31.5 death per one million people of infected.Almost 212 countries worldwide and most affected countries are USA,( death rate 304, followed by Spain (540),Itali 475, UK 414, France 379 per million population when in India total cases of positive by RT PCR is 40,266 death 1300 per one million people and in West Bengal province of India total infected is 963 with death 48 cases as per ministry of health government of India records on covid 19. The question is why such a huge percentage of death from this dangerous virus ( no more should be considered simple like influenza virus) inspite of lockdown, social distancing ventilation guided treatment protocol for mild moderate and severe pneumonia from covid 19?<br /> Mortality from covid 19 is higher in groups at higher risks of thromboembolism including hypertension, types 2DM, obesity, coronary artery disease ,cardiomyopathy, pre existing renal pathology as co morbid condition known to all. It has been also seen world wide that the risk of thromboembolism ( both venous and arterial) are more likely to occur when patients are admitted at ICU or in PEP ventilation, ànd in aged over 60 yrs( approximately 63% of death in India from covid 19).<br /> What did the autopsy studies revealed of these death, though very limited autopsy were performed with covid 19 death as the virus is HG 3 category virus. Brane Hanely (1) eral published in journal of clinical pathology of BMJ group showed histopathology of lungs on HE stain oedema, Type Ii pneumocytes hyperplasia,large pneumocytes with ground glass viral inclusions bodies focal inflammation, multinucleated giant cells,when no hyaline membrane ( a histopathological features of ARDS) diffuse alveolar damage. The pulmonary vessels showed hyaline necrosis with thrombus formation and capillary congestion.inflamatory infiltrate composed of alveolar macrophages in alveolar lumen and lymphocytes in interstitium. Zhe Xu et Al (2) in journal Lancet reported also one 50 year old man died on day 14 of covid 19 after being treated with lopinovir+retinovir+moxiflixain and high nasal cannula oxygen therapy and niddle autopsy of lungs liver and heart tissue showed diffuse alveolar damage with cellular fibrimyxiod exudate,dissquamation of pneumocytes and hyaline membrane formation (sign of ARDS) , interstitial mononuclear inflammatory infiltrate dominated by lymphocytes ( CD8) multi nucleated syncitial giant cells, atypical pneumocytes and microvascular thrombosis in pulmonary vessels (2).Sufang Tian et Al (3) did post mortem needle core autopsy of four patients who died of severe covid 19 pneumonia and patients age range were 59-81 years and time of death 15-52 days were in ventilation. Histology of their finding in lungs were again diffuse injury to alveolar epithelial cells, hyaline membrane formation, hyperplasia of type II pneumocytes , diffuse alveolar damage and consolidation by fibroblasts proliferation with extra cellular fibrin forming clusters.All these tour cases had vascular congestion with intravascular thrombus suggesting an acute phase components reaction and fibrinoid necrosis of blood vessels.The autopsy finding of heart was that endocardia and myocardia didn't contain inflammatory cellular infiltrate, although focally myocardium appeared irregular in shape with darkened cytoplasm and fibrinoid necrosis of blood vessels in myocardia.There were various degrees of focal oedema interstitial fibrosis and myocardial hypertrophy which suggests patients had underlying hypertension associated with hypertrophy or past ischemic injury. A large series of 38 cases of autopsy of lung by Luca carsana etall (5) showed from death cases of covid 19 in northern Itali on H&E stain showed also diffuse alveolar damage, capillary congratulations, necrosis, necrosis of pneumocytes, hyaline membrane, interstitial oedema,type II pneumocytes hyperplasia, platelet fibrin rich thrombus(5) .Electron microscopy showed viral particles within cytoplasmic vaccoule of pneumocytes.<br /> So from above post mortem studies, besides ARDS like pictures in terminal event , platelet fibrin rich thrombosis of pulmonary vessels, myocardial vessels, hyaline necrosis of blood vessels of both lungs and of myocardium are prominent picture along with endothelial dysfunction according to this author.The severe cases of pneumonia from covid 19 also shows increased D Dimer value (4) prognostically bad , increased c reactive protein, increased pro calcitonin and increased FDP value<br /> All these suggest to me that pathogenesis behind so many death in ventilation or at ICU of covid 19 patients are not ARDS itself but some kinds of coagulopathy or DIC occurred before death in severe pneumonia cases<br /> Though lymphopenia, inflammatory cytokine stroms ( raised IL6,raised TNF are for cytokine stroms)are typical abnormalities described in almost all literature in highly pathogenic Covid 19 infection with disease severity ,only one rapid response in BMJ (4) suggest , based on post mortem finding use of low molecular weight heparin (LMWH) to be included in the treatment modules of covid 19, particularly those who have high D Dimer high FDP value in serum though TT,APTT,PT,INR may not show any significant difference.use of heparin therapy with constant monitoring for bleeding manifestation should be instituted in patients showing clinical signs of turning towards severe pneumonia,along with antiviral therapy with remdesvir (within 7 days onset of symptoms at scheduled disease)<br /> If the pathology behind the death of severe pneumonia in covid 19 patients is DIC ( according to Autopsy finding the pneumocytes are not killed or destroyed by the virus nor by cytotoxic T cells, rather proliferation occur with much viral replication and virus load) there will be DIC , vascular congestion, thrombosis there will be AMI stroke ) then treatment at ICU with ventilation become useless unless if thromboembolism is not resolved first with LMWH infusion <br /> Referencs <br /> 1) Brain Hanley, Sebastian B Lucas,Esther youd,Benjamin swift,Michael Asbron "Autopsy in suspected covid 19 cases " JCP 73,(5):2020 http://dx.doi.org.10.1136/jclinpath-2020-20652<br /> 2) Xu Z,Shi L,Wang y eral "Pathological finding of covid 19 associated with acute respiratory distress syndrome " The Lancet respiratory medicine 8 (4):420-22 :2020<br /> 3) Sudan Tian, young xiong,Shu yuan xiao,Liu H et all "Study of 2019 novel Corona virus disease ( covid 19) through post mortem core biopsy" Modern pathology (Nature.com ) 14 th April 2020 http://doi.org/10.1038/s 41379-020-0536-<br /> 4) William Atenio ,Nadu Okonkwo "should prognostic models for covid 19 not also incorporate markers of thrombosis" Rapid Response published BMJ on 14th April 2020 to article"Prediction model for diagnosis and prognosis of covid 19 infection: systematic Review and clinical analysis" The BMJ 2020:369:m1328 published on 7th April 2020 https://doi.org/10.1136/bmj...<br /> 5)Luca carsana, Aurelo sanzogoni ,Ahmed Nast, Roberta Rossi etall"pulmonary post mortem finding in a large series of covid 19 cases from northern Itali" MedRxiv https://doi.org/1101/2020.0...

    1. On 2020-06-24 18:56:17, user André GILLIBERT wrote:

      Title : Proposal for improved reporting of the Recovery trial<br /> André GILLIBERT (M.D.)1, Florian NAUDET (M.D., P.H.D.)2<br /> 1 Department of Biostatistics, CHU Rouen, F 76000, Rouen, France<br /> 2 Univ Rennes, CHU Rennes, Inserm, CIC 1414 (Centre d’Investigation Clinique de Rennes), F- 35000 Rennes, France

      **Introduction**

      Dear authors,<br /> We read with interest the pre-print of the article entitled “Effect of Dexamethasone in Hospitalized Patients with COVID-19: Preliminary Report”. This reports the preliminary results of a large scale randomized clinical trial (RCT) conducted in 176 hospitals in the United Kingdom. To our knowledge it is the largest scale pragmatic RCT comparing treatments of the COVID-19 in curative intent. The 28-days survival endpoint is objective, clinically relevant and should not be influenced by the measurement bias that may be caused by the open-label design. While 2,315 study protocols have been registered on ClinicalTrials.gov about COVID-19, as of June 24th 2020, Recovery is, to our knowledge, the only randomized clinical trial on COVID-19 that succeeded to include more than ten thousands patients. The open-label design and simple electronic case report form (e-CRF) may have helped to include a non-negligible proportion of all COVID-19 patients hospitalized in the United Kingdom (UK). Indeed, as of June 24th 2020, approximatively 43,000 patients died of COVID-19 in hospital in the UK, of whom approximatively 0.24 × 11,500 = 2,760, that is more than 6% of all hospital deaths of COVID-19, where included in the Recovery study.<br /> Having read with interest version 6.0 of the publicly available study protocol (https://www.recoverytrial.n... "https://www.recoverytrial.net/files/recovery-protocol-v6-0-2020-05-14.pdf)") we had hoped for more details in the reporting of methods and results of this trial and take advantage of the open-peer review process offered by pre-prints servers to suggest improving some aspects of the reporting before the final peer-reviewed publication. Please, find below some easy to answer comments that may help to improve the article overall.

      **Interim analyses and multiple treatment arms**

      The first information would be about interim analyses. The protocol (version 6.0) specifies that it is adaptive and that randomization arms may be added removed or paused according to decisions of the Trial Steering Committee (TSC) basing its decision on interim analyses performed by the Data Monitoring Committee (DMC) and communicated when “the randomised comparisons in the study have provided evidence on mortality that is strong enough […] to affect national and global treatment strategies” (protocol, page 16, section 4.4, 2nd paragraph). The Supplementary Materials of the manuscript specifies that “the independent Data Monitoring Committee reviews unblinded analyses of the study data and any other information considered relevant at intervals of around 2 weeks”. This suggests that many interim analyses may have been performed from the start (March 9th) to the end (June 8th) of the study.<br /> Statistically, interim analyses not properly taken in account generate an inflation of the type I error rate which may be increased again by the multiple treatment arms. Methods such as triangular tests make it possible to control the type I error rate. Most methods of control of type I error rate in interim analyses require that the maximal sample size be defined a priori and that the timing and number of interim analyses be pre-planned. This protocol being adaptive, new arms were added, implying new statistical tests in interim analyses, and no pre-defined sample size as seen in page 2 of the protocol: “[...] it may be possible to randomise several thousand with mild disease [...], but realistic, appropriate sample sizes could not be estimated at the start of the trial.” This make control of the type I error rate difficult. The fact that the study has been stopped on the final analysis as we understand from the current draft rather than interim analysis does not remove the type I error rate inflation. The multiple treatment arms lead to another inflation of the type I error rate.<br /> The current manuscript does not specify any procedure to fix these problems. The Statistical Analysis Plans (SAP) V1.0 (in section 5.5) and V1.1 (in section 5.6) specify that “Evaluation of the primary trial (main randomisation) and secondary randomisation will be conducted independently and no adjustment be made for these. Formal adjustment will not be made for multiple treatment comparisons, the testing of secondary and subsidiary outcomes, or subgroup analyses.” and nothing is specified about interim analysis. Therefore, we conclude that no P-value adjustment for multiple testing has been performed, neither for multiple treatment arms nor for interim analysis. If an interim analysis assessing 4 to 6 treatment arms at the 5% significance level has been performed every 2 weeks from march to June, up to 50 tests may have been performed, leading to major inflation of type I error rate. In our opinion, the best way to assess and maybe fix the type I error rate inflation, is to report with maximal transparency every interim analysis that has been performed, with the following information:<br /> 1. Date of the interim analysis and number of patients included at that stage<br /> 2. Was the interim analysis planned (e.g. every 2 weeks as planned according to supplementary material) or unplanned (e.g. due to an external event, for instance the article of Mehra et al about hydroxychloroquine published in The Lancet, doi:10.1016/S0140-6736(20)31180-6), and if exceptional, why?<br /> 3. Which statistical analyzes, on which randomization arms, have been performed at each stage <br /> 4. If predefined, what criteria (statistical or not) would have conducted to early arrest of a randomization arm for inefficiency and what criteria would have conducted to arrest for proved efficacy?<br /> 5. If statistical criteria were not predefined, did the DMC provide a rationale for his choice to communicate or not the results to the TSC? If yes, could the rationale be provided?<br /> 6. The results of statistical analyzes performed at each step<br /> 7. The decision of the DMC to communicate or not the results to the TSC and which results have been reported as the case may be<br /> The information about interim analyses and multiple randomization arms will help to assess whether the inflation of type I error rate is severe or not. A post hoc multiple testing adjustment, taking in account the many randomized treatments and interim analyses, should be attempted, and discussed, even though there may be technical issues due to the adaptative nature of the protocol.

      **Adjustment for age**

      An adjustment for age (in three categories <70 years, 70-79, >= 80 years, see legend of table S2) in a Cox model was performed for the comparison of dexamethasone to standard of care in the article. This adjustment was not specified in the version 6.0 of the protocol but was, according to the manuscript “added once the imbalance in age (a key prognostic factor) became apparent”. This is confirmed by the addition of a words ““However, in the event that there are any important imbalances between the randomised groups in key baseline subgroups (see section 5.4), emphasis will be placed on analyses that are adjusted for the relevant baseline characteristic(s).” in section 5.5 page 16 of the SAP V1.1 of June 20th compared to the SAP V1.0 of June 9th which specified a log-rank test. The SAP V1.0 of the 9th June may have been written before the database has been analyzed (data cut June 10th) but the SAP of the 20th has probably been written after preliminary analysis have been performed. This is consistent with the words “became apparent” of the manuscript. Therefore, in our opinion, this adjustment must be considered as a post hoc analysis rather than as the main analysis. Moreover, even though the SAP V1.1 specifies that an “important imbalance” will lead to an “emphasis” on adjusted analyses, it does not change the primary analysis (see section 5.1.1 page 14). It is not clear what “important imbalance” means. To interpret that, we will perform statistical tests to assess balance of key baseline subgroups specified in SAP V1.1 (see section 5.4):<br /> 1. Risk group (three risk groups with approximately equal number of deaths based on factors recorded at randomisation). Its distribution is shown in figure S2. A chi-square tests on the distribution of risk groups in Dexamethasone 1255/500/349 and Usual care 2680/926/715 groups, lead to a P-value=0.092. A chi-square test for trend yields a P-value equal to 0.23.<br /> 2. Requirement for respiratory support at randomisation (None; Oxygen only; Ventilation or ECMO). P-value=0.89 for chi-square test and P-value=0.86 for chi-square for trend.<br /> 3. Time since illness onset (<=7 days; >7 days). P-value=0.17<br /> 4. Age (<70; 70-79; 80+ years). P-value=0.016 for chi-square test, p=0.019 for chi-square test for trend<br /> 5. Sex (Male; Female). P-value=0.97 for chi-square test<br /> 6. Ethnicity (White; Black, Asian or Minority Ethnic). No data found.<br /> The criteria to define “important imbalance” seems to be statistical significance at the 0.05 threshold, however that should have been stated and tests for all other variables should have been provided too.<br /> First, this adjustment, from a theoretical point-of-view, was not necessary since the study was randomized; if the exact condition of imbalance triggering the adjustment was pre-specified in the protocol or SAP before the imbalance was known, it could induce a very slight reduction of the type I error rate and power. However, as it was performed when the imbalance was known, there is a risk that the sign of the imbalance (i.e. higher age in the dexamethasone group) have influenced the choice of adjustment. Indeed, an adjustment conditional to a higher age in the dexamethasone group will increase the estimated effect of dexamethasone in these conditions, and so, provide an inflation of the type I error rate. If the same conditional adjustment were further considered for other prognostic variables, the inflation could even be higher. <br /> Unless there is strong evidence that the amendment to the SAP was performed without knowledge of the sign of the imbalance (higher age in the dexamethasone group), we suggest that the primary analysis be kept as originally planned, without adjustment, and that the age adjustment be performed in a sensitivity analysis only. The knowledge of the sign of the unbalance is unclear in the last version of the SAP (V1.1, June 20th) and in the manuscript. In addition, in an open label trial, it is always better to stick to the protocol.

      **Results in other treatment arms**

      The manuscript specifies that “the Steering Committee closed recruitment to the dexamethasone arm since enrolment exceeded 2000 patients.” It is not stated whether any other treatment arm has exceeded 2000 patients or not and whether the study is still ongoing. Results of treatment arms that have been stopped should be provided (all arms having enrolled more than 2000 patients?). If not, the number of patients randomized in other treatment arms should, at least, be reported. If the study is completely stopped, all treatments should be analyzed and reported, unless there is a specific reason not to do so; that reason should be stated as the case may be. This data would be useful to provide evidence on other molecules. It would also clarify the number of statistical tests that have been performed or not, providing more information about the overall inflation of alpha risk.

      **Sample size**

      The paragraph about the sample size suggests that inclusions were planned, at some time, to stop when 2000 patients were included in the dexamethasone arm. The amended protocol (May 14th), the SAP V1.0 (June 9th) and the SAP V1.1 (June 20th, 4 days after the results have been officially announced) all have a paragraph about the sample size but all specify that the sample size is not fixed and none specify any criteria of arrest of the research based on sample size. There are 2104 patients included in this arm, which is substantially larger than the target of 2000 patients. The exact chronology and methodology should be clarified: when was the sample size computed and what was the exact criteria to arrest the research? Could the document (internal report?) related to this sample size calculation and statistical or non-statistical decision of arrest of the research be published in supplementary material?<br /> Indeed, assessment of the type I error rate requires knowing exactly when and why the research has been arrested: arrest for low inclusion rate of new patients or for reaching target sample size cannot be interpreted the same as arrest for high efficacy observed on an interim analysis.

      **Future of the protocol**

      With the new evidence about dexamethasone, the protocol will probably be stopped or evolve. The future recruitment may slow as the peak of the epidemic curve in United Kingdom is passed. The past, present and future of the protocol needs also to be known to assess the actual type I error rate. Indeed, future analyses, that have not yet been performed influence the overall type I error rate. That is why we suggest that author’s provide the daily or weekly inclusion rate from March to June and discuss the future of the study.

      **Loss to follow-up**

      Table S1 shows that the follow-up forms have been received for 1940/2104 (92.2%) patients of the dexamethasone group and 3973/4321 patients of the usual care group (91.9%). The patients without follow-up forms (8.5% overall) may either be lost to follow-up or have been included in the 28 last days before June 10th 2020 (data cut). The manuscript mentions that 4.8% of patients “had not been followed for 28 days by the time of the data cut”, suggesting that 8.5%-4.8% = 3.7% of patients are lost to follow-up, but that is our own interpretation. We suggest that authors report the actual number of loss to follow-up and how their data have been imputed or analyzed. The number of loss to follow-up may differ for different outcomes. For instance, if the Office of National Statistics (ONS) data has been used for vital status assessment, there should be no loss to follow-up on that outcome.

      **Vital status**

      The current manuscript only specifies the data of the web-based case report (e-CRF) form, filled by hospital staff, as source of information, suggesting that it is the only source of information about the vital status. The document entitled “Definition and Derivation of Baseline Characteristics and Outcomes” provided at https://www.recoverytrial.n... specifies many other sources. For instance, the vital status had to be assessed from the Office of National Statistics (ONS). Other sources, including Secondary Use Service Admitted Patient Care (SUSAPC) and e-CRF could be used for interim analysis. The ONS was considered as the defining source (most reliable). Whether the ONS data has been used or not should be clarified. If the ONS data have been used, statistics of agreement of the two data sources (e-CRF and ONS) may be provided to help assessing the quality of data. If the ONS data have not been used, this deviation from the planned protocol should be documented.<br /> The manuscript as well as the recovery-outcomes-definitions-v1-0.pdf file specifies that the follow-up form of the e-CRF is completed at “the earliest of (i) discharge from acute care (ii) death, or (iii) 28 days after the main randomisation”. If the follow-up form is not updated further, patients discharged alive before day 28 (e.g. day 14) may have incomplete vital status information at day 28. The following information should be specified:<br /> 1. Whether the follow-up form of the e-CRF had to be updated by hospital staff at day 28 for these patients<br /> 2. If response to (1) is yes, whether there was a means to distinguish between a lost to follow-up at day 28 (form not updated) and a patient discharged and alive at day 28 (form updated to “alive at day 28”)<br /> 3. If response to (2) is yes, how many patients discharged before day 28 were lost to follow-up at day 28<br /> 4. If response to (2) is yes, how has their vital status at day 28 been imputed or managed in models with censorships (log-rank, Kaplan-Meier, Cox)<br /> Of course, this information is really needed if the ONS and SUSAPC data have not been used.<br /> The quality of the vital status information is critical in such a large scale open-label multi-centric trial, because there is a risk that one or more center selectively report death, biasing the primary analysis.

      **Inclusion distribution by center**

      A multicentric study provides stronger evidence than a single-center study but sometimes, few centers include most patients, with a risk of low-quality data or selection bias. The very high number of included patients in the Recovery trial suggests that many centers included many patients but the distribution of inclusions per center could be reported.

      **Randomization**

      The protocol specifies that “in some hospitals, not all treatment arms will be available (e.g. due to manufacturing and supply shortages); and at some times, not all treatment arms will be active (e.g. due to lack of relevant approvals and contractual agreements).” This is further clarified in the SAP V1 (section 2.4.2 Exclusion criteria, page 8) by the sentence “If one or more of the active drug treatments is not available at the hospital or is believed, by the attending clinician, to be contraindicated (or definitely indicated) for the specific patient, then this fact will be recorded via the web-based form prior to randomisation; random allocation will then be between the remaining (or indicated) arms.” Showing that randomization arms may be closed on an individual basis, when the patient is included, with the argument of contraindication or definitive indication. It seems that the “standard of care” group could not be removed and that at least another randomization arm had to be kept as suggested by the words “random allocation will then be between the remaining arms (in a 2:1:1:1, 2:1:1 or 2:1 ratio)” in section 2.9.1 page 11 of the SAP V1.0. Even exclusion of a single randomization arm can lead to imbalance between groups. For instance, if physicians believed that a treatment was contraindicated for the most severe patients, only non-severe patients could be randomized to the treatment’s arm, while most severe patients would be randomized to other arms. Several things can be done to assess and fix this bias. First, report how many times this feature has been used and which randomization arms have been most excluded. If it has been used many times, provide the pattern of use that help to assess whether this is a collective measure (e.g. 2-weeks period of shortage of a treatment in a center ? no major selection bias) or individual measure. If its use has been rare, a sensitivity analysis could simply exclude these patients. If it has been frequent, we suggest a statistical method to analyze this data without bias, based on the following principles: patients randomized between 3 randomization arms A, B and C (population X) are comparable for the comparisons of A to B. Patients randomized between A, B and D (population Y), are comparable for the comparisons of A to B. Population X and population Y may differ but, inside each population, A can be compared to B. Therefore, the within-X comparison of A to B and within-Y comparison of A to B are both valid and can be meta-analyzed to assess a global difference between A and B. This can be simply done with an adjustment on the population (X or Y) in a fixed effects multivariate model. Pooling of X and Y populations should not be performed without adjustment.<br /> A second problem with randomization exists although the dexamethasone arm is the least affected. Randomization arms have been added in this adaptative trial. When a new randomization arm is added, new patients may be randomized to this arm and fewer patients are randomized to other arms. Consequently, the distribution of dates of inclusion may differ between groups. This may have some impact on the mortality at two levels: (1) the medical prescription of hospitalization may have evolved as the epidemic evolved, with hospitalization reserved to most severe patients at the peak of epidemic and maybe wider hospitalization criteria at the start of epidemic and (2) evolution of patients included in the Recovery trial. Indeed, even if centers should have included as many patients as possible as soon as their inclusion criteria were met, it is possible that they have only included part of eligible patients and that this part evolved with time. This bias can be easily assessed and fixed: the curves of inclusions in the different arms and mortality rate in the Recovery trial can be drawn as a function of date (from March to June) and an adjustment on date of inclusion may be performed in a sensitivity analysis.

      **Conclusion**

      Recovery is the study with the best methodology that we have seen on COVID-19 treatments in curative intent and we salute the initiative of publishing transparently the protocol, its amendments, the statistical analysis plan and the first draft of the report. We hope that our reporting suggestions will be taken in account in the final version of the paper. We think that discussing these points will qualify the interpretation of results, further improve the transparent approach adopted by designers of the study and improve the reliability of the conclusions. We expect a high-quality reporting of these final results, with full transparency on interim analyses, statistical analysis plans and statistical analysis reports. We hope that these comments are helpful and again we acknowledge that this study is not solely outstanding in terms of importance of the results but is also a stellar example for the whole field of therapeutic research. We invite other researchers to provide comments to this article to engage in Open Science.