10,000 Matching Annotations
  1. Dec 2024
    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, the authors investigate the unique Mycobacteriaceae cell envelope using cryo-tomography/cryo-electron microscopy with Corynebacterium glutamicum as a model organism. Cryo-EM images of C. glutamicum cells successfully resolved previously observed densities corresponding to the MM, arabinogalactan, peptidoglycan, and inner membrane layers of the cell envelope along with the S-layer. The authors found that the S-layer is patchy in a manner dependent on growth phase (i.e. liquid versus solid growth). Intriguingly, when the S-layer was present, the leaflets of the MM appeared to be disrupted. The authors solved the structure of purified S-layer protein PS2 by cryo-EM, however they could not resolve the C-terminal membrane interaction domain. The authors found that PS2 is hexameric and different hexamers are linked by trimeric interface to create a porous structure. Phylogenetic analysis showed conservation of PS2 within corynebacteria and suggested a signature for MM-association.

      Major comments:

      1. The S-layer structure is porous and the authors suggest that it may function as a molecular sieve or permeability barrier. This hypothesis should either be tested experimentally, or further discussion is needed regarding what small molecules (chemical features, size) would be able to penetrate.
      2. The authors show cryo-EM images of dividing C. glutamicum cells but don't make any statements as to the presence, morphology, and measurements of the different cell envelope layers. This analysis should be included.
      3. The authors should include more discussion as to the patchiness or "wavy" MM near sites of PS2 contact. Cryo-EM of cells that express a variant of PS2 that lack the membrane anchoring domain would demonstrate that this is specific to PS2-membrane contacts. Minimally, providing some quantification for this phenotype would strengthen the claim (for instance, does the spacing between the perturbations match the expected scale of distance between S-layer membrane contacts).
      4. The authors speculate on complete conservation of certain residues in the C-terminal domain of PS2 and hypothesize that they may be important for maturation or targeting of MM-associated proteins. Two additional examples of proteins with this motif are mentioned as evidence. Authors should search for this motif in pre-existing lists of MM proteins in the literature to test if this hypothesis is robust. Experiments to test if the conserved C-terminal residues of PS2 are required for export or assembly into an S-layer are feasible but optional given the scope of the paper.
      5. The authors do not draw the distinction between MM-associated and integral MM proteins (that contain a transmembrane domain). Is the C-terminal membrane anchoring domain of PS2 likely to span the entire bilayer or just be associated by a few amino acids?

      Minor comments:

      1. The authors comment that the thickness of the MM both with and without the S-layer is the similar and conclude that there is no change in mycolic acid length. The resolution of the technique is not sufficient to make this statement.
      2. It would be helpful if the authors could comment if their membrane dimension measurements agree with previously published results in the main text of the manuscript. It is currently only included in the legend of Table S1.

      Significance

      The manuscript provides compelling images and structures of the C. glutamicum cell envelope and S-layer protein PS2, respectively. These cryo-EM images of the cell envelope appear to agree nicely with pre-existing studies in the field. The introduction of the manuscript was well-written and the data in the manuscript is of broad interest to those who study the Mycobacteriaceae cell envelope. There is a lot of compelling data included in the paper, but the study would be strengthened by further analysis of the data as well as additional experiments to support some of the hypotheses suggested.

      Reviewer expertise: bacterial genetics, bacterial cell envelope, protein transport

    1. RESEMAVAL_values %>% mutate(democratic_values = case_when(Q238 == 1 ~ "absolutely democratic", Q238 == 2 ~ "almost democratic", Q238 == 3 ~ "almost democratic", Q238 == 4 ~ "absolutely not democratic" ))

      Замечание: Не очень понимаю, а зачем было объединять категории? 2 и 3 категория были объединены в одну, но зачем? Если делаете такие действия, то описывайте логику, иначе не ясно.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1:

      Reviewer #1 was very appreciative of our results and commented “This is a novel result in ferredoxin and a significant contribution to the field”. We are very honored and pleased.

      Reviewer #2:

      (1) Changing the nomenclature of the models investigated to include the oxidation state being discussed. As they are now (CM, CMNA, etc), multiple re-reads were required to ascertain which redox state was being discussed for a particular model in a given section of the text. Appending "Ox" or "Red" for oxidized or reduced would be sufficient. 

      As you indicated there are several nomenclatures to distinguish the model systems in the text. On the other hand, the main issue discussed in the text is the ionization potential (IP), which is calculated by the difference in energies between oxidized and reduced states for each model. In other words, a discussion of the IP value on each model includes both the “Ox” and “Red” energies. In order to clarify the relationship between the nomenclature of models and redox states, we added sentences below.

      “Note that the IP value is obtained for each model by calculating both the Ox and Red state energies of the model.” (lines 195-196).

      On the other hand, we must specify the charge state when the geometry optimization is performed for CM and CMH models. Therefore, we revised the sentence as follows.

      “The decrease in |IP| value indicates that the relative stability of the Red state is suppressed compared with the CMH but is significantly larger than the CM, suggesting the importance of the protonation of Asp64 (Fig. S2B). 

      To consider the effect of the structural change caused by the redox on the IP, geometrical optimization of the 4Fe-4S core was performed for the CM (Red) and CMH (Red) models using the same level of theory to the single-point calculations. The optimized Cartesian coordinates are summarized in Table S3. As illustrated in Fig. S2A, the IP values of CM and CMH change from –3.27 to –2.38 eV (|DIP| = 0.89 eV), and from –1.06 to –0.19 eV (|DIP| = 0.87 eV), respectively, before and after the geometrical optimization.” (lines 224-232)

      (2) In addition to the very thorough DFT investigation of the different spin and charge combinations, did the authors try a broken-symmetry calculation to obtain the ground state description of the FeS cluster? Given the ubiquity of this approach in other FeS cluster studies, it was surprising that this approach was not taken here. Granted, the DFT investigation of each possible combination is sufficiently thorough and need not be redone. 

      Thank you for your comments. A term “spin-unrestricted method”, which is used in the manuscript in the text is synonym of “broken-symmetry method”. In order to emphasize this, we revised the manuscript as follows. 

      “All calculations were performed by using the spin-unrestricted (broken-symmetry) hybrid DFT method with the B3LYP functional set. As the basis set, 6-31G* and 6-31+G* were used for [Fe, C, N, O, H] and [S] atoms, respectively, for the IP calculations.” (Line 451)

      (3) Line 161 "an" to "a" 

      We corrected the mistake. Thank you so much. (Line 161)

      (4) Figure 4A seems a bit odd. Why do the traces eclipse the y-axis? And the traces between 330 and 370 nm are much noisier and appear thicker than the rest of the plot. Is this an issue with the monochromator grating used in wavelength selection? Reducing the thickness of the individual traces may help the data presentation in this figure. Also, the arrows on the plot have an opaque white background. Can this be removed so that the arrows do not eclipse the traces in the plot? 

      The spectrum in the Fig.4A seemed to be odd. The spectral figure has been revised to improve its appearance. (We have also corrected E53A in Figure 5B.) This reviewer also pointed out that “the traces between 330 and 370 nm are much noisier”. We are struggling with the noise caused by the grating (or the motor malfunction) of the monochromator as you pointed out. Once the monochromator is repaired and a smooth spectrum is obtained, we will upload further revisions.

      (5) Figure S9 is a very nice schematic illustrating the general findings of the study. Can this be moved to the main text?

      Thank you for your helpful comment. Accordingly, the Fig.9S and its legend are moved to the main text. (Lines 675-680)

    1. Reviewer #1 (Public review):

      This manuscript by Bai et al concerns the expression of Scleraxis (Scx) by muscle satellite cells (SCs) and the role of that gene in regenerative myogenesis. The authors report the expression of this gene associated with tendon development in satellite cells. Genetic deletion of Scx in SCs impairs muscle regeneration, and the authors provide evidence that SCs deficient in Scx are impaired in terms of population growth and cellular differentiation. Overall, this report provides evidence of the role of this gene, unexpectedly, in SC function and adult regenerative myogenesis.

      There are a few points of concern.

      (1) From the data in Figure 1, it appears that all of the SCs, assessed both in vitro and in vivo, express Scx. The authors refer to a scRNA-seq dataset from their lab and one report from mdx mouse muscle that also reveal this unexpected gene expression pattern. Has this been observed in many other scRNA-seq datasets? If not, it would be important to discuss potential explanations as to why this has not been reported previously.

      (2) A major point of the paper, as illustrated in Fig. 3, is that Scx-neg SCs fail to produce normal myofibers and renewed SCs following injury/regeneration. They mention in the text that there was no increased PCD by Caspase staining at 5 DPI. A failure of cell survival during the process of SC activation, proliferation, and cell fate determination (differentiation versus self-renewal) would explain most of the in vivo data. As such, this conclusion that would seem to warrant a more detailed analysis in terms of at least one or two other time points and an independent method for detecting dead/dying cells (the in vitro data in Fig. 4F is also based on assessment of activated Caspase to assess cell death). The in vitro data presented later in Fig. S4G,H do suggest an increase in cell loss during proliferative expansion of Scx-neg SCs. To what extent does cell loss (by whatever mechanism of cell death) explain both the in vivo findings of impaired regeneration and even the in vitro studies showing slower population expansion in the absence of Scx?

      (3) I'm not sure I understand the description of the data or the conclusions in the section titled "Basement membrane-myofiber interaction in control and Scx cKO mice". Is there something specific to the regeneration from Scx-neg myogenic progenitors, or would these findings be expected in any experimental condition in which myogenesis was significantly delayed, with much smaller fibers in the experimental group at 5 DPI?

      (4) The data presented in Fig. 4B showing differences in the purity of SC populations isolated by FACS depending on the reporter used are interesting and important for the field. The authors offer the explanation of exosomal transfer of Tdt from SCs to non-SCs. The data are consistent with this explanation, but no data are presented to support this. Are there any other explanations that the authors have considered and that could be readily tested?

      (5) The Cut&Run data of Fig. 6 certainly provide evidence of direct Scx targets, especially since the authors used a novel knock-in strain for analyses. The enrichment of E-box motifs provides support for the 207 intersecting genes (scRNA-seq and Cut&Run) being direct targets. However, the rationale elaborated in the final paragraph of the Results section proposing how 4 of these genes account for the phenotypes on the Scx-neg cells and tissues is just speculation, however reasonable. These are not data, and these considerations would be more appropriate in the Discussion in the absence of any validation studies.

      Comments on revisions:

      The authors have adequately addressed all of the concerns I raised regarding the original submission. I have no further issues to be addressed.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      This manuscript by Bai et al concerns the expression of Scleraxis (Scx) by muscle satellite cells (SCs) and the role of that gene in regenerative myogenesis. The authors report the expression of this gene associated with tendon development in satellite cells. Genetic deletion of Scx in SCs impairs muscle regeneration, and the authors provide evidence that SCs deficient in Scx are impaired in terms of population growth and cellular differentiation. Overall, this report provides evidence of the role of this gene, unexpectedly, in SC function and adult regenerative myogenesis.

      We appreciate the comments and thank her/him for the support.

      There are a few minor points of concern.

      (1) From the data in Figure 1, it appears that all of the SCs, assessed both in vitro and in vivo, express Scx. The authors refer to a scRNA-seq dataset from their lab and one report from mdx mouse muscle that also reveals this unexpected gene expression pattern. Has this been observed in many other scRNA-seq datasets? If not, it would be important to discuss potential explanations as to why this has not been reported previously.

      Thanks for this question regarding data in Fig.1. We did initially use immunofluorescence staining of Pax7 and GFP on muscle sections and primary myoblast cultures prepared from Tg-ScxGFP mice to conclude that Scx was expressed in satellite cells (SCs). In addition to the cited mdx RNA-seq data, we have included a re-analysis of a published scRNA-seq data set in Fig.2E (Dell'Orso et al., Development, 2019), and our own scRNA-seq data (Fig.S5D, F). We have now re-examined an additional scRNA-seq data set of TA muscles at various regeneration time points (De Micheli et al., Cell Rep. 2020), in which Scx expression was detected in MuSC progenitors and mature muscle cells. We have added the De Micheli et al. reference and the re-analysis of that scRNA-seq data set for Scx expression as an additional panel in Fig. 2E, with accompanying text (p. 7, ln. 4-6). Thus, our immunostaining results are consistent with scRNA-seq data from our and two other independent scRNA-seq data sets.

      We think that Scx expression in the adult myogenic lineage was not previously reported mainly because its expression level was low, and might be dismissed as spurious detection. Additionally, detecting such low expression levels requires sophisticated detection methods with high capture efficiency. Previous studies have noted limitations in transcript capture or transcription factor dropout in 10x Genomics-based datasets (Lambert et al., Cell, 2018; Pokhilko et al., Genome Res., 2021). The most likely and straightforward reason is that Scx was simply not a focus in prior studies amid so many other genes of interest. We have now added this last explanation in the text (p.7, ln. 8-9), following the re-analyses of Scx expression in published scRNA-seq data sets.

      (2) A major point of the paper, as illustrated in Fig. 3, is that Scx-neg SCs fail to produce normal myofibers and renewed SCs following injury/regeneration. They mention in the text that there was no increased PCD by Caspase staining at 5 DPI. A failure of cell survival during the process of SC activation, proliferation, and cell fate determination (differentiation versus self-renewal) would explain most of the in vivo data. As such, this conclusion would seem to warrant a more detailed analysis in terms of at least one or two other time points and an independent method for detecting dead/dying cells (the in vitro data in Fig. 4F is also based on an assessment of activated Caspase to assess cell death). The in vitro data presented later in Fig. S4G, H do suggest an increase in cell loss during proliferative expansion of Scx-neg SCs. To what extent does cell loss (by whatever mechanism of cell death) explain both the in vivo findings of impaired regeneration and even the in vitro studies showing slower population expansion in the absence of Scx?

      We appreciate these constructive suggestions. Based on the number of available control and cKO animals, we were limited to one additional time point at 3 dpi to assess PCD by TUNEL in vivo. We were disappointed again to find no appreciable levels of PCD at 3 dpi by TUNEL (new Fig.S4I), thus no quantifications were included. We also re-did the in vitro experiment using purified SCs and monitored PCD by staining for cleaved Caspase-3 using a validated tube of antibodies (positive staining after 6 h of treatment by 1 mM staurosporine of control and ScxcKO cells; included as new Fig. S4J and legend). We were pleased to find an increase of cleaved Caspase3 stained cells, i.e. PCD, of Scx-cKO SCs at day 4 in culture, compared to that of the control. We have now replaced the old Fig. 4F with new Fig.4F and 4G to document PCD. We also provided new text/legend for these new data (p.10. ln. 2-10; new legend for Fig. 4F and 4G).

      (3) I'm not sure I understand the description of the data or the conclusions in the section titled "Basement membrane-myofiber interaction in control and Scx cKO mice". Is there something specific to the regeneration from Scx-neg myogenic progenitors, or would these findings be expected in any experimental condition in which myogenesis was significantly delayed, with much smaller fibers in the experimental group at 5 DPI?

      We very much appreciate this comment. We agree that there is unlikely anything specific about the regeneration from Scx-negative myogenic progenitors. Unfilled or empty ghost fibers (basement membrane remnant) are expected due to small fiber and poor regeneration in the ScxcKO mice at 5 dpi. We have removed the subtitle and changed the content to an expected consequence rather than something special (p. 8, ln. 19-22).

      (4) The data presented in Fig. 4B showing differences in the purity of SC populations isolated by FACS depending on the reporter used are interesting and important for the field. The authors offer the explanation of exosomal transfer of Tdt from SCs to non-SCs. The data are consistent with this explanation, but no data are presented to support this. Are there any other explanations that the authors have considered and that could be readily tested?

      Thanks for highlighting this phenomenon. We struggled with the SC purity issue for a long time. The project started with using the R26RtdT reporter for tdT’s paraformaldehyde  resistant strong fluorescence (fixation) to aid visualization in vivo. Later, when we used the tdT signal to purify SCs by FACS, we found that only 80% sorted tdT+ cells are Pax7+. We then switched to the R26RYFP reporter, from which we achieved much higher purity (95%) of SCs (Pax7+) by FACS. As such, we also repeated and confirmed many in vivo experimental results using the R26RYFP reporter (included in the manuscript). Due to the low purity of tdT+SCs by FACS, we discontinued that mouse colony after we confirmed the superior utility of the R26RYFP reporter for SC isolation.

      We sincerely apologize for not being able to conduct further testable experiments on this intriguing phenomenon. However, this issue has since been addressed and published by Murach et al., iScience, (2021). Like our experience, they found non-satellite mononuclear cells with tdT fluorescence after TMX treatment when SCs were isolated via FACS. To determine this was not due to off-target recombination or a technical artifact from tissue processing, they conducted extensive analyses. They found that the tdT+ mononuclear cells included fibrogenic cells (fibroblasts and FAPs), immune cells/macrophages, and endothelial cells. Additionally, they confirmed the significant potential of extracellular vesicle (EV)-mediated cargo transfer, which facilitates the transfer of full-length tdT transcript from lineage-marked Pax7+ cells to those mononuclear cells. We have modified the text to emphasize and acknowledge their contribution to this important point, and explained the difference between YFP and tdT reporter alleles in more detail (p.9, ln. 11-17).

      (5) The Cut&Run data of Fig. 6 certainly provide evidence of direct Scx targets, especially since the authors used a novel knock-in strain for analyses. The enrichment of E-box motifs provides support for the 207 intersecting genes (scRNA-seq and Cut&Run) being direct targets. However, the rationale elaborated in the final paragraph of the Results section proposing how 4 of these genes account for the phenotypes on the Scx-neg cells and tissues is just speculation, however reasonable. These are not data, and these considerations would be more appropriate in the Discussion in the absence of any validation studies.

      We agree with this comment and have moved speculations into the Discussion (p. 15, ln. 4-15, and from p. 18, ln. 4 to p. 19, ln. 4).

      Reviewer #2 (Public Review):

      Summary:

      Scx is a well-established marker for tenocytes, but the expression in myogenic-lineage cells was unexplored. In this study, the authors performed lineage-trace and scRNA-seq analyses and demonstrated that Scx is expressed in activated SCs. Further, the authors showed that Scx is essential for muscle regeneration using conditional KO mice and identified the target genes of Scx in myogenic cells, which differ from those of tendons.

      Strengths:

      Sometimes, lineage-trace experiments cause mis-expression and do not reflect the endogenous expression of the target gene. In this study, the authors carefully analyzed the unexpected expression of Scx in myogenic cells using some mouse lines and scRNA-seq data.

      We appreciate the comments and thank her/him for noting the strengths of our manuscript.

      Weaknesses:

      Scx protein expression has not been verified.

      We are aware of this weakness. We had previously used Western blotting (WB) using cultured SCs from control and ScxcKO mice, but did not detect endogenous Scx protein even in the control. In response to this comment, we have re-done several WB experiments using new lysates from control and ScxcKO SCs and two commercial antibodies: anti-Scx antibody 1 from Abcam (ab58655) and anti-Scx antibody 2 from Invitrogen (PA5-23943). These antibodies have been reported to detect endogenous Scx protein in tendon cells in Spang et al., BMC Musculoskelet Disord (2016) and  Bochon et al., Int J Stem Cells (2021). Despite our best efforts, we were not able to detect a reliable Scx band. We have also conducted immunofluorescence using these two antibodies. Still, we failed to detect a difference of staining signals between control and cKO SCs using these antibodies. Lastly, we conducted immunofluorescence using the ScxTy1 myoblasts and we did not find the staining signal coinciding with the Ty1 signal (by double staining). We have been very frustrated by not knowing what caused this technical difficulty in our hands. Given that these were negative data, we did not include them. However, we do hope that the combined data from scRNA-seq, ScxCreERT2 lineage-tracing, Tg-ScxGFP expression, and ScxTy1 knock-in together are deemed sufficient to make up for the deficiency of data for endogenous Scx protein in regenerative myogenic cells.

      Response to Recommendations for the Authors:

      Reviewer #1 (Recommendations For The Authors):

      p. 8: The text refers to Fig. 3I, but this should be Fig. 3H.

      We apologize for the confusion. Please note that by keeping all 14 dpi data in the same row, we placed Fig.3I at an unconventional/unexpected position, i.e., next to 3D &3E, and above 3F-H. We were aware that this unconventional placement could cause confusion, and it did. With that said, we have now re-arranged the subfigures (same data content) so that the updated Fig.3 contains subfigures in the expected and proper spatial order. We double-checked the figure referral in the text (p. 8, ln. 16-17) and the text is correct – just that the original Fig.3I should have been at the original Fig.3H position and that is now corrected.

      Reviewer #2 (Recommendations For The Authors):

      (1) Given that Scx binds to the E-box and regulates gene expression, it is of interest to know the relevance between MyoD and Scx. If possible, the reviewer recommends to include some discussions.

      Thanks for the comment. MyoD1 is a well-known transcript factor regulating myogenesis, whereas Scx is primarily studied in tenocytes and other connective tissues. We agree that our new findings deserve a discussion regarding the relevance between MyoD1 and Scx.  We have added a description of their differences in the discussion and two new references (p.19, ln. 7-17).

      (2) Considering that Scx is a transcriptional factor, it is interesting that Scx-GFP was not detected in the nuclei of regenerated myofibers. Could the subcellular localization of Scx-GFP provide some insights into the function of Scx as a transcription factor during muscle regeneration?

      Tg-ScxGFP is a transgenic line generated by random insertion into the genome (Pryce et al., 2007; cited). The plasmid used for transgenesis was constructed by replacing most of Scx’s first exon with GFP, and including ~ 9Kb flanking regulatory sequences. As such, the ScxGFP is not a fusion gene, but rather that the GFP expression is regulated by Scx promoter and enhancer(s). This GFP reporter lacks a nuclear localization signal (NLS), hence it is mainly detected in the cytoplasm; some nuclear signal is detected, presumably due to GFP’s small size permitting passive diffusion into the nucleus. Thus, the GFP signal is used as a reporter for Scx expression, but GFP subcellular localization does not provide insight into Scx function per se. Conversely, ScxTy1/Ty1 is a knock-in allele created by fusing a triple-Ty1 tag (3XTy1) to the C-terminus of Scx, and we observed that Ty1 is located in the nucleus by the immunofluorescent staining. We used the Ty1 epitope to carry out CUT&RUN experiments to gain insight to the function of Scx as a transcription factor.

      (3) Fig1D The number of arrows in the Merge image is not matched with others. In addition, the star mark in the Pax7 image is likely an error.

      Apologies. We have now corrected these errors in the revised Fig.1D.

      (4) FigS1A Is there only one myofiber shown in the dashed line in this image? It is unclear why only this myofiber is surrounded by the dashed line.

      The dashed line encircles a single fiber because it was not visible in the provided image. However, there are 3 fibers in this image. Because we did not immuno-stain for myofibers here, we circled one fiber for illustration. For clarity, we brightened the background (of the entire original images) so the background signals from myofiber boundaries are discernable without outlines.

      (5) FigS1B There was no overlapped DAPI staining in the Myogenin+ cell. DAPI-staining should be present in Myogenin+ cells because myogenin is located in the nucleus.

      Fig.S1B is immuno-staining for MyoD , and we marked one MyoD+DAPI+GFP+ cell/nucleus. Fig.S1C is immune-staining for Myogenin, and we also marked one (cell/nucleus) that is triple positive.

      (6) The position of the asterisk for the ScxGFP in FigS1D is misaligned. In addition, the position is not matched with Fig1C. Because all myofibers are Scx-positive, it is strange that only one myofiber has an asterisk. The reviewer suggests removing the mark.

      Thank you for pointing out these errors. We have now corrected the misalignment and removed the unnecessary asterisk.

    1. Author response:

      The following is the authors’ response to the original reviews.

      eLife Assessment 

      This study presents valuable experimental and numerical results on the motility of a magnetotactic bacterium living in sedimentary environments, particularly in environments of varying magnetic field strengths. The evidence supporting the claims of the authors is solid, although the statistical significance comparing experiments with the numerical work is weak. The study will be of interest to biophysicists interested in bacterial motility. 

      We thank the reviewers and editors for their careful reading and the constructive comments. With respect to the statement about weak statistical significance, we think that this statement mixes two separate issues, the significance of the difference between experiments at 0 and 50µT and the comparison of experiments with simulations. We have amended our manuscript to address both points as described below. The difference between the experiments at 0 and 50µT is indeed significant, and the discrepancy between experiments and simulations can be explained by unavoidable differences in the way we quantify bacterial throughput.

      Public Reviews: 

      Reviewer #1 (Public Review): 

      Summary: 

      The authors present experimental and numerical results on the motility Magnetospirillum gryphiswaldense MSR-1, a magnetotactic bacterium living in sedimentary environments. The authors manufactured microfluidic chips containing three-dimensional obstacles of irregular shape, that match the statistical features of the grains observed in the sediment via microcomputer tomography. The bacteria are furthermore subject to an external magnetic field, whose intensity can be varied. The key quantity measured in the experiments is the throughput ratio, defined as the ratio between the number of bacteria that reach the end of the microfluidic channel and the number of bacteria entering it. The main result is that the throughput ratio is non-monotonic and exhibits a maximum at magnetic field strength comparable with Earth's magnetic field. The authors rationalize the throughput suppression at large magnetic fields by quantifying the number of bacteria trapped in corners between grains. 

      Strengths: 

      While magnetotactic bacteria's general motility in bulk has been characterized, we know much less about their dynamics in a realistic setting, such as a disordered porous material. The micro-computer tomography of sediments and their artificial reconstruction in a microfluidic channel is a powerful method that establishes the rigorous methodology of this work. This technique can give access to further characterization of microbial motility. The coupling of experiments and computer simulations lends considerable strength to the claims of the authors, because the model parameters (with one exception) are directly measured in the experiments. 

      Weaknesses: 

      The main weakness of the manuscript pertains to the discussion of the statistical significance of the experimental throughput ratio. Especially when comparing results at zero and 50 micro Tesla. The simulations seem to predict a stronger effect than seen in the experiments. The authors do not address this discrepancy. 

      We thank the reviewer for their positive assessment and the detailed constructive remarks. 

      The increase in bacterial throughput between 0 and 50 µT is indeed more pronounced in the simulations than in the experiments, partly due to the fact that there is considerably more variability in the experimental data. We did two things to address this issue: (1) We performed additional statistical test addressing the difference between the experimental results at 0 and 50 µT. Indeed, the difference is only weakly significant (in contrast to the difference of either to 500µT). The increase is however consistent with the observation in the absence of obstacles in the channel, where we see a monotonous increase from 0 to 500 µT (Supp. Figure S5). We have added the test results in the caption of Fig. 3. (2) To address the difference between simulations and experiments, we added a section in Methods on how we determine the throughput and a short discussion in the Results section. The key points are that the initial condition is different in simulations and experiments and that the throughput is therefore quantified differently. This difference is due to experimental limitations: we cannot track bacteria through the whole channel and we wanted to avoid pushing them into the channel with fluid flow to avoid effects of flow on the results. As a consequence, bacteria continue to enter the IN region of the channel from the inlet during the experiment, while in the simulation, they all start at the beginning of the channel simultaneously. We expect this to mostly affect the case with diffusive transport (B=0).

      Reviewer #2 (Public Review): 

      Summary: 

      simulation study of magnetotactic bacteria in microfluidic channels containing sediment-mimicking obstacles. The obstacles were produced based on micro-computer tomography reconstructions of bacteria-rich sediment samples. The swimming of bacteria through these channels is found experimentally to display the highest throughput for physiological magnetic fields. Computer simulations of active Brownian particles, parameterized based on experimental trajectories are used to quantify the swimming throughput in detail. Similar behavior as in experiments is obtained, but also considerable variability between different channel geometries. Swimming at strong field is impeded by the trapping of bacteria in corners, while at weak fields the direction of motion is almost random. The trapping effect is confirmed in the experiments, as well as the escape of bacteria with reducing field strength. 

      Strengths: 

      This is a very careful and detailed study, which draws its main strength from the fruitful combination of the construction of novel microfluidic devices, their use in motility experiments, and simulations of active Brownian particles adapted to the experiment. Based on their results, the authors hypothesize that magnetotactic bacteria may have evolved to produce magnetic properties that are adapted to the geomagnetic field in order to balance movement and orientation in such crowded environments. They provide strong arguments in favor of such a hypothesis. 

      Weaknesses: 

      Some of the issues touched upon here have been studied also in other articles. It would be good to extend the list of references accordingly and discuss the relation briefly in the text. 

      We thank the reviewer for the constructive comments. We answer to the point concerning previous literature in the response to the recommendations below.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors): 

      Here follows a list of points the authors should address. 

      (1) Are additional experiments feasible to decrease the statistical noise present in Fig. 3c? At the very least, the authors should discuss the statistical significance of the results at 50 muT vis-a-vis 0 T. 

      See our response to Strengths/Weaknesses above

      (2) The experimental setup is not immediately clear. I think that adding a panel from Fig. S1 (or a sketch thereof) would help clarify, especially in relation to the entry zone and end zone. 

      We are not sure what you mean. Fig. 3A already contains exactly such a panel. We have however added another supplementary figure that shows an additional detailed view of the setup (Fig. S3). In addition, we revised several figures: We have replaced Fig. S1 with a better version and exchanged the schematic view of the obstacle channel in Fig 1, removing the additional inlets that were not used in this study (also in Fig 3A), Instead we added a comment in Methods explaining their presence. Hopefully this makes the setup clear.

      (3) It should be also stated that there is no external flow imposed on the channel. 

      We have added such a statement in the description of the experiment (in section 2.2 Swimming of magnetotactic bacteria through sediment-mimicking obstacle channels.  

      (4) Fig. 3c and Fig. 6c are seemingly showing the same quantity (or closely related ones). The authors should use the same symbol and give an explicit mathematical definition. 

      The two quantities are not exactly the same, as we cannot directly quantify the flux of bacteria through the channel in our experiments. On the one hand, we cannot track bacteria through the whole channel, on the other hand, the initial conditions are not exactly the same as in the simulations. In the simulations all bacteria start at the same time at the entrance to the channel. In the experiments, they enter from the inlet and do so at different times (pushing them in with fluid flow would be possible, but carries the risk of perturbing the results due to induced flow through the channel). We have added a new section in the Methods section that explains this difference and describes the procedure used to obtain the throughput from the experiments in detail. We have also added a corresponding comment in the Result section, where the simulations are compared with the experiments. 

      Minor issues: 

      - Figures have different styles that should be unified. For example, the panel labels sometimes have round brackets and sometimes they don't.

      See above

      - Page 6, (muCT) should have the Greek letter mu 

      Thanks, corrected.

      - Fig. 3a is not very clear; see my point 2 above. 

      See above

      Reviewer #2 (Recommendations For The Authors): 

      I have only a few comments and questions, which the authors should address: 

      (1) The observed exponential dependence of decay time on the "well" depth could be related to the exponential density distribution of active particles in a gravitational field, which has been derived previously. Might be interesting to discuss such a possible connection. 

      Thank you for the suggestion, the two cases are indeed somewhat analogous with behaviors reminiscent of thermal processes with an effective temperature. Such a description is however not generally possible (even for sedimentation, only some features are described). We plan to address in future work whether it can be made more quantitative in our case of escape from the corner traps. We have included a short discussion of the analogy in the section on trapping and escape. 

      (2) The authors should consider the following relevant references, and discuss them briefly in their manuscript:

      - Sedimentation, trapping, and rectification of dilute bacteria J Tailleur, ME Cates EPL 86, 60002 (2009) 

      - Human spermatozoa migration in microchannels reveals boundary-following navigation P Denissenko, V Kantsler, DJ Smith, J Kirkman-Brown Proc. Natl. Acad. Sci. USA 109, 8007-8010 (2012) 

      - Wall accumulation of self-propelled spheres J Elgeti, G Gompper Europhysics Letters 101, 48003 (2013) 

      - Wall entrapment of peritrichous bacteria: a mesoscale hydrodynamics simulation study SM Mousavi, G Gompper, RG Winkler Son Maber 16 (20), 4866-4875 (2020) 

      - A Geometric Criterion for the Optimal Spreading of Active Polymers in Porous Media C Kurzthaler, S Mandal, T Bhabacharjee, H Löwen, SS Daba, HA Stone Nat. Commun. 12, 7088 (2021) 

      - Run-to-Tumble Variability Controls the Surface Residence Times of E. coli Bacteria G Junot, T Darnige, A Lindner, VA Martinez, J Arlt, A Dawson, WCK Poon, H Auradou, E Clement Phys. Rev. Leb. 128, 248101 (2022) 

      - Dynamics and phase separation of active Brownian particles on curved surfaces and in porous media P Iyer, RG Winkler, DA Fedosov, G Gompper Phys. Rev. Research 5, 033054 (2023) 

      We agree that there is a lot of literature on these aspects, specifically interaction of self-propelled objects with walls and motion of swimmers through porous media. We have slightly extended our overview of previous literature in the introduction and included most of these references.

    1. Reviewer #1 (Public review):

      Summary:

      In this excellent manuscript by Egan et al., the authors very carefully dissect the roles of inflammasome components in restricting Salmonella Typhimurium (STm) replication in human macrophages. They show that caspase-1 is essential to mediating inflammasome responses and that caspase-4 contributes to bacterial restriction at later time points. The authors show very clear roles for the host proteins that mediate terminal lysis, gasdermin D and ninjurin-1. The unique finding in this study is that in the absence of inflammasome responses, Salmonella hypereplicates within the cytosol of macrophages. These findings suggest that caspase-1 and possibly caspase-4 play roles in restricting the replication of Salmonella in the cytosol as well as in the Salmonella containing vacuole.

      Strengths:

      (1) The genetic and biochemical approaches have shown for the first time in human macrophages that the caspase-1-GSDMD-NINJ1 axis is very important for restricting intracellular STm replication. In addition, they demonstrate a later role for Casp4 in control of intracellular bacterial replication.

      (2) In addition, they show that in macrophages deficient in the caspase-1-GSDMD-NINJ1 axis that STm are found replicating in the cytosol, which is a novel finding. The electron microscopy is convincing that STm are in the cytosol.

      (3) The authors go on to use a chloroquine resistance assay to show that inflammasome signaling also restricts STm within SCVs in human macrophages.

      (4) Finally, they show that the Type 3 Secretion System encoded on Salmonella Pathogenicity Island 1 contributes to STm's cytosolic access in human macrophages.

      Weaknesses:

      (1) Their results with human macrophages suggest that there are differences between murine and human macrophages in inflammasome-mediated restriction of STm growth. For example, Thurston et al. showed that in murine macrophages that inflammasome activation controls the replication of mutant STm that aberrantly invades the cytosol, but only slightly limits replication of WT STm. In contrast, here the authors found that primed human macrophages rely on caspase-1, gasdermin D and ninjurin-1 to restrict WT STm. I wonder if the priming of the human macrophages in this study could account for the differences in these studies. Along those lines, do the authors see the same results presented in this study in the absence of priming the macrophages with Pam3CSK4. I think that determining whether the control of intracellular STm replication is dependent on priming is very important. Another difference with the Thurston et al. paper is the way that the STm inoculum was prepared - stationary phase bacteria that were opsonized. Could this also account for differences between the two studies rather than differences between murine and human macrophages in inflammasome-dependent control of STm?

      (2) The authors show that the pore-forming proteins GSDMD and Ninj1 contribute to control of STm replication in human macrophages. Is it possible that leakage of gentamicin from the media contributes to this control?

      (3) One major question that remains to be answered is whether casp-1 plays a direct role in the intracellular localization of STm. If the authors quantify the percentage of vacuolar vs. cytosolic bacteria at early time points in WT and casp-1 KO macrophages, would that be the same in the presence and absence of casp-1? If so, then this would suggest that there is a basal level of bacterial-dependent lysis of the SCV and in WT macrophages the presence of cytosolic PAMPS trigger cell death and bacteria can't replicate in the cytosol. However, in the inflammasome KO macrophages, the host cell remains alive and bacteria can replicate in the cytosol.

      Comments on revisions:

      The authors have addressed my previous concerns. The addition of the statements indicating the limitations of the study are an important addition.

    2. Reviewer #2 (Public review):

      Summary:

      This work addresses the question of how human macrophages restrict intracellular replication of Salmonella.

      Strengths:

      Through a series of genetic knockouts and using specific inhibitors, Egan et al. demonstrated that the inflammasome components caspase-1, caspase-4, gasdermin D (GSDMD), and the final lytic death effector ninjurin-1 (NINJ1) are required for control of Salmonella replication in human macrophages. Interestingly, caspase-1 proved crucial in restricting Salmonella early during infection, whereas caspase-4 was essential in the later stages of infection. Furthermore, using a chloroquine resistance assay and state-of-the-art microscopy, the authors found that NAIP receptor and caspase-1 mostly regulate replication of cytosolic bacteria, with smaller, yet significant, impact on the vacuolar bacteria.

      The finding that inflammasomes are critical in the restriction of replication of intracellular Salmonella in human macrophages contrasts with the published minimal role of inflammasomes in restriction of replication of intracellular Salmonella in murine macrophages. Some of these differences could be due to differences in the methodologies used in the two studies. However, the findings suggest yet another example of interspecies and intercellular differences in regulation of bacterial infections by the immune system.

      Comments on revisions:

      The authors may wish to comment that the measurements of released cytokines by ELISA do not discriminate between active and full-length forms of the cytokines.

    3. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1: 

      (1) Their results with human macrophages suggest that there are differences between murine and human macrophages in inflammasome-mediated restriction of STm growth. For example, Thurston et al. showed that in murine macrophages that inflammasome activation controls the replication of mutant STm that aberrantly invades the cytosol, but only slightly limits replication of WT STm. In contrast, here the authors found that primed human macrophages rely on caspase-1, gasdermin D and ninjurin-1 to restrict WT STm. I wonder if the priming of the human macrophages in this study could account for the differences in these studies. Along those lines, do the authors see the same results presented in this study in the absence of priming the macrophages with Pam3CSK4. I think that determining whether the control of intracellular STm replication is dependent on priming is very important.

      We thank the Reviewer for their careful attention to our manuscript and for their thoughtful comments. We have addressed this question about the impact of priming by repeating the bacterial intracellular burden assays in unprimed WT and CASP1-/- THP-1 cells. We have added additional figures to the manuscript to address this: Figure 1 – Figure Supplement 3. Under unprimed conditions, CASP1-/- cells still harbored significantly higher bacterial burdens at 6 hpi and a significant fold-increase in bacterial CFUs compared to WT cells. These results suggest that the caspase-1-mediated restriction of intracellular Salmonella replication in human macrophages is independent of priming. 

      (2) Another difference with the Thurston et al. paper is the way that the STm inoculum was prepared - stationary phase bacteria that were opsonized. Could this also account for differences between the two studies rather than differences between murine and human macrophages in inflammasome-dependent control of STm?

      We thank the Reviewer for this excellent suggestion. To address this possibility, we repeated the bacterial intracellular burden assays in WT and CASP1-/- THP-1 cells using stationary phase bacteria. We infected WT and CASP1-/- THP-1 cells with stationary phase Salmonella, and we subsequently assayed for intracellular bacterial burdens. These data have now been added to the manuscript in Figure 1 – Figure Supplement 4. Interestingly, we did not observe any fold-change in the bacterial colony forming units in both the WT and CASP1-/- THP-1 cells for the stationary phase Salmonella. These data indicate that by 6 hours postinfection, Salmonella do not replicate efficiently in human macrophages unless grown under SPI-1-inducing conditions. Furthermore, these results suggest that differences in how the Salmonella inoculum is prepared may contribute to the discrepancies between our study and previous studies, as noted by the Reviewer. 

      (3) The authors show that the pore-forming proteins GSDMD and Ninj1 contribute to control of STm replication in human macrophages. Is it possible that leakage of gentamicin from the media contributes to this control?

      Response: We thank the Reviewer for their insightful comment. We have addressed this question on the impact of gentamicin by repeating the bacterial intracellular burden assays using a lower concentration of gentamicin in combination with extensively washing the cells with RPMI media to remove the gentamicin. WT and CASP1-/- THP-1 cells were infected with WT Salmonella. Then, at 30 minutes post-infection, cells were treated with 25 μg/ml of gentamicin to kill any extracellular bacteria. At 1 hour post-infection (hpi), the cells were washed for a total of five times with fresh RPMI to remove the gentamicin, and then the media was replaced with fresh media containing no gentamicin. In parallel, we also treated cells with 100 μg/ml of gentamicin at 30 minutes post-infection, washed the cells five times with fresh RPMI at 1 hpi to remove the gentamicin, and then replaced the media with fresh media containing 10 μg/ml of gentamicin. This data has now been included in the manuscript as Figure 1 – Figure Supplement 5. We observed similar levels in the intracellular bacterial burdens at 1 hpi and 6 hpi and a fold-increase in bacterial colony forming units in CASP1-/- cells compared to WT cells across both gentamicin conditions, suggesting that gentamicin appears to not contribute to the intracellular control of Salmonella replication in human macrophages. Of note, we also tried repeating the bacterial intracellular burden assays without gentamicin, using only washes to remove extracellular at 1 hpi; however, under these experimental conditions, we observed high levels of extracellular Salmonella. Therefore, we relied on using a lower concentration of gentamicin to kill extracellular Salmonella in conjunction with extensive washing to remove the gentamicin for the remainder of the infection. 

      (4) One major question that remains to be answered is whether casp-1 plays a direct role in the intracellular localization of STm. If the authors quantify the percentage of vacuolar vs. cytosolic bacteria at early time points in WT and casp-1 KO macrophages, would that be the same in the presence and absence of casp-1? If so, then this would suggest that there is a basal level of bacterial-dependent lysis of the SCV and in WT macrophages the presence of cytosolic PAMPS trigger cell death and bacteria can't replicate in the cytosol. However, in the inflammasome KO macrophages, the host cell remains alive and bacteria can replicate in the cytosol.

      We thank this Reviewer for raising this important point. We have addressed this experimentally by quantifying the percentage of vacuolar vs. cytosolic Salmonella at 2 hpi in WT, NAIP-/-, and CASP1-/- THP-1 cells using a chloroquine (CHQ) resistance assay. This data has now been included in the manuscript in the new Figure 5A. The original subfigures of Figure 5 have consequently been rearranged. We did not observe any significant differences in vacuolar and cytosolic bacterial burdens at this early time point in WT, NAIP-/-, and CASP1-/- THP-1 cells. As noted by the Reviewer, these results suggest that the basal level of bacterialdependent lysis of the SCV in human macrophages is not dependent on caspase-1 or NAIP. 

      Reviewer #3: 

      (1) The main weaknesses of the study are the inherent limitations of tissue culture models. For example, to study interaction of Salmonella with host cells in vitro, it is necessary to kill extracellular bacteria using gentamicin. However, since Salmonella-induced macrophage cell death damages the cytosolic membrane, gentamicin can reach intracellular bacteria and contribute to changes in CFU observed in tissue culture models (major point 1). This can result in tissue culture "artefacts" (i.e., observations/conclusions that cannot be recapitulated in vivo). For example, intracellular replication of Salmonella in murine macrophages requires T3SS-2 in vitro, but T3SS-2 is dispensable for replication in macrophages of the spleen in vivo (Grant et al., 2012).  

      We thank the Reviewer for their helpful comments and insightful suggestions. We have addressed some of the concerns about gentamicin in our response to Reviewer #1 above. To address the Reviewer’s concerns further, we have included language to acknowledge the limitations of our study based on the artefacts of tissue culture models in our Discussion section: “In this study, we utilized tissue culture models to examine intracellular Salmonella replication in human macrophages. These in vitro systems allow for precise control of experimental conditions and, therefore, serve as powerful tools to interrogate the molecular mechanisms underlying inflammasome responses and Salmonella replication in both immortalized and primary human cells. Still, there are limitations of tissue culture models, as they lack the inherent complexity of tissues and organs in vivo. To assess whether our findings reflect Salmonella dynamics in the mammalian host, it will be important to complement our studies and extend the implications of our work using approaches that model more complex systems, such as organoids or organ explant models co-cultured with immune cells, and in vivo techniques, such as humanized mouse models.”

      (2) In Figure 1: are increased CFU in WT vs CASP1-deficient THP-1 cells due to Caspase 1 restricting intracellular replication or due to Caspase-1 causing pore formation to allow gentamicin to enter the cytosol thereby restricting bacterial replication? The same question arises about Caspase-4 in Figure 2, where differences in CFU are observed only at 24h when differences in cell death also become apparent. The idea that gentamicin entering the cytosol through pores is responsible for controlling intracellular Salmonella replication is also consistent with the finding that GSDMD-mediated pore formation is required for restricting intracellular Salmonella replication (Figure 3). Similarly, the finding that inflammasome responses primarily control Salmonella replication in the cytosol could be explained by an intact SCV membrane protecting Salmonella from gentamicin (Figure 5). 

      We thank the Reviewer for highlighting this important point regarding gentamicin.

      We have addressed this question in our response above to Review #1 and in Figure 1 – Figure Supplement 5. We observed caspase-1-mediated restriction of Salmonella in human macrophages even when cells were treated with a lower concentration of gentamicin (25 μg/ml) for 30 minutes and then extensively washed with RPMI media to remove any gentamicin for the remainder of the infection. These data suggest that gentamicin is likely not responsible for controlling intracellular Salmonella in human macrophages.

    1. Sommaire du webinaire FCPE avec timestamps

      https://vimeo.com/1012776402

      Ce sommaire couvre les points principaux abordés lors du webinaire FCPE sur l'équipement d'un enfant avec son premier smartphone.

      Introduction et contexte (0:00:00 - 0:03:24)

      • Le webinaire, animé par Axelle Desaint, directrice générale d'Internet sans crainte, est organisé par la FCPE Nationale et s'adresse à tous les parents. [1, 2]
      • L'objectif est d'aider les parents à accompagner leurs enfants dans l'utilisation de leur premier smartphone et de discuter des tensions que cela peut engendrer. [2, 3]
      • Internet sans crainte propose plus de 200 outils gratuits pour aider les parents à accompagner leurs enfants dans leur usage du numérique. [4]
      • L'association organise également le Safer Internet Day en France. [5]

      Premier équipement et points de vigilance (0:07:43 - 0:08:11)

      • Le premier téléphone est souvent un appareil récupéré de la famille. Il est crucial de supprimer toutes les données (photos, contenus) et applications inadaptées à l'enfant avant de le lui confier. [6, 7]

      Le contrôle parental : un allié pour la protection (0:12:08 - 0:13:37)

      • Le contrôle parental n'est pas de l'espionnage mais un outil pour encadrer et protéger l'enfant. [8]
      • Il permet de définir le temps d'écran, de limiter l'accès à certains contenus et de gérer les achats potentiels. [9-11]

      Le temps d'écran : un sujet de préoccupation majeur (0:15:04 - 0:17:59)

      • Le temps d'écran est la principale source de tension entre parents et enfants. [12, 13]
      • Il est important de discuter avec l'enfant des moments d'écran plutôt que du temps d'écran. [14, 15]
      • Les applications et plateformes sont conçues pour capter l'attention et inciter à rester connecté le plus longtemps possible. [16, 17]
      • Il est crucial d'identifier ces mécanismes avec l'enfant (scroll infini, vidéos enchaînées, notifications) pour mieux les gérer. [17-21]

      Poser un cadre et favoriser l'auto-régulation (0:19:43 - 0:24:23)

      • Plutôt que de parler de temps d'écran, il est plus constructif de discuter des moments d'écran avec l'enfant. [14]
      • Les enfants ont besoin d'un cadre pour s'auto-réguler. [22]
      • Il est important d'éviter les écrans avant le coucher pour garantir un sommeil de qualité. [23]
      • Le site "fa nu" propose un outil pour créer une charte numérique en famille. [24]
      • Les paramètres des smartphones proposent des fonctionnalités pour visualiser et gérer le temps d'écran (temps passé, temps par application, notifications, mode silencieux). [25-27]

      Impacts du temps d'écran sur la santé (0:25:11 - 0:30:05)

      • L'utilisation excessive des écrans peut avoir des conséquences négatives sur le sommeil, la santé physique et mentale. [28-30]
      • La surexposition à des contenus négatifs ("Doomscrolling" ou "obésité de scrolling") peut engendrer anxiété et stress. [31-33]
      • Il est important de veiller à ce que les écrans ne remplacent pas d'autres activités essentielles (interactions sociales, activités physiques, lecture, etc.). [34]

      Âge d'accès aux réseaux sociaux et aux messageries (0:31:48 - 0:38:41)

      • Il existe un âge légal d'accès aux réseaux sociaux et aux messageries, généralement 13 ans. [35, 36]
      • WhatsApp est le premier réseau social des enfants, offrant des fonctionnalités proches des réseaux sociaux classiques. [37-39]

      Conseils pour une utilisation responsable (0:40:41 - 0:53:00)

      • Privilégier les comptes privés, activer les messages de rappel pour faire des pauses, désactiver les notifications. [40]
      • Gérer ses contacts et avoir conscience avec qui on parle. [41, 42]
      • Définir quel est le bon cadre pour communiquer et éviter de partager sa vie privée sur les groupes familiaux. [43, 44]
      • Ne pas partager de photos intimes et oser parler d'intimité et de consentement avec son enfant. [45, 46]
      • Réfléchir avant de partager tout type de contenu. [47]

      Cyber-harcèlement et contenus choquants (0:53:58 - 1:12:45)

      • Les enfants sont exposés à des contenus choquants, violents et à caractère sexuel, notamment sur Telegram. [48-54]
      • Il est crucial de les sensibiliser aux risques du cyber-harcèlement et de leur apprendre à réagir face à ces situations. [55-59]
      • En cas de cyber-harcèlement, il est important de conserver des preuves (copies d'écran), bloquer l'agresseur, signaler le compte et demander le retrait des publications. [60-63]
      • Le site internet-signalement.gouv.fr permet de signaler les contenus illégaux. [64]

      Ressources et outils d'Internet sans crainte (1:12:48 - 1:31:50)

      • Le guide "Équiper son enfant d'un smartphone" [65, 66]
      • Guides pratiques sur les principaux réseaux sociaux [67, 68]
      • Dossier thématique sur le cyber-harcèlement, les réseaux sociaux, la parentalité numérique, etc. sur le site internet sans crainte [69-71]
      • Compte Instagram d'Internet sans crainte pour des conseils et informations [72, 73]

      Questions des participants et réponses d'Axelle Desaint (1:17:35 - 1:31:50)

      • Le danger de l'espionnage des enfants et l'importance de la confiance. [74]
      • La nécessité de dialoguer et d'explorer les réseaux sociaux avec son enfant. [75-78]
      • L'utilisation des applications de contrôle parental. [78-80]
      • La géolocalisation et ses impacts sur les jeunes. [81-83]
      • La méconnaissance de Telegram par les parents. [84]

      Ce sommaire n'inclut pas les remarques des participants qui n'apportent pas d'éléments clés à la compréhension du sujet principal.

    2. Webinaire FCPE : Équiper son enfant de son premier smartphone

      Sommaire du webinaire FCPE avec timestamps

      Ce sommaire couvre les points principaux abordés lors du webinaire FCPE sur l'équipement d'un enfant avec son premier smartphone.

      Introduction et contexte (0:00:00 - 0:03:24)

      • Le webinaire, animé par Axelle Desaint, directrice générale d'Internet sans crainte, est organisé par la FCPE Nationale et s'adresse à tous les parents. [1, 2]
      • L'objectif est d'aider les parents à accompagner leurs enfants dans l'utilisation de leur premier smartphone et de discuter des tensions que cela peut engendrer. [2, 3]
      • Internet sans crainte propose plus de 200 outils gratuits pour aider les parents à accompagner leurs enfants dans leur usage du numérique. [4]
      • L'association organise également le Safer Internet Day en France. [5]

      Premier équipement et points de vigilance (0:07:43 - 0:08:11)

      • Le premier téléphone est souvent un appareil récupéré de la famille. Il est crucial de supprimer toutes les données (photos, contenus) et applications inadaptées à l'enfant avant de le lui confier. [6, 7]

      Le contrôle parental : un allié pour la protection (0:12:08 - 0:13:37)

      • Le contrôle parental n'est pas de l'espionnage mais un outil pour encadrer et protéger l'enfant. [8]
      • Il permet de définir le temps d'écran, de limiter l'accès à certains contenus et de gérer les achats potentiels. [9-11]

      Le temps d'écran : un sujet de préoccupation majeur (0:15:04 - 0:17:59)

      • Le temps d'écran est la principale source de tension entre parents et enfants. [12, 13]
      • Il est important de discuter avec l'enfant des moments d'écran plutôt que du temps d'écran. [14, 15]
      • Les applications et plateformes sont conçues pour capter l'attention et inciter à rester connecté le plus longtemps possible. [16, 17]
      • Il est crucial d'identifier ces mécanismes avec l'enfant (scroll infini, vidéos enchaînées, notifications) pour mieux les gérer. [17-21]

      Poser un cadre et favoriser l'auto-régulation (0:19:43 - 0:24:23)

      • Plutôt que de parler de temps d'écran, il est plus constructif de discuter des moments d'écran avec l'enfant. [14]
      • Les enfants ont besoin d'un cadre pour s'auto-réguler. [22]
      • Il est important d'éviter les écrans avant le coucher pour garantir un sommeil de qualité. [23]
      • Le site "fa nu" propose un outil pour créer une charte numérique en famille. [24]
      • Les paramètres des smartphones proposent des fonctionnalités pour visualiser et gérer le temps d'écran (temps passé, temps par application, notifications, mode silencieux). [25-27]

      Impacts du temps d'écran sur la santé (0:25:11 - 0:30:05)

      • L'utilisation excessive des écrans peut avoir des conséquences négatives sur le sommeil, la santé physique et mentale. [28-30]
      • La surexposition à des contenus négatifs ("Doomscrolling" ou "obésité de scrolling") peut engendrer anxiété et stress. [31-33]
      • Il est important de veiller à ce que les écrans ne remplacent pas d'autres activités essentielles (interactions sociales, activités physiques, lecture, etc.). [34]

      Âge d'accès aux réseaux sociaux et aux messageries (0:31:48 - 0:38:41)

      • Il existe un âge légal d'accès aux réseaux sociaux et aux messageries, généralement 13 ans. [35, 36]
      • WhatsApp est le premier réseau social des enfants, offrant des fonctionnalités proches des réseaux sociaux classiques. [37-39]

      Conseils pour une utilisation responsable (0:40:41 - 0:53:00)

      • Privilégier les comptes privés, activer les messages de rappel pour faire des pauses, désactiver les notifications. [40]
      • Gérer ses contacts et avoir conscience avec qui on parle. [41, 42]
      • Définir quel est le bon cadre pour communiquer et éviter de partager sa vie privée sur les groupes familiaux. [43, 44]
      • Ne pas partager de photos intimes et oser parler d'intimité et de consentement avec son enfant. [45, 46]
      • Réfléchir avant de partager tout type de contenu. [47]

      Cyber-harcèlement et contenus choquants (0:53:58 - 1:12:45)

      • Les enfants sont exposés à des contenus choquants, violents et à caractère sexuel, notamment sur Telegram. [48-54]
      • Il est crucial de les sensibiliser aux risques du cyber-harcèlement et de leur apprendre à réagir face à ces situations. [55-59]
      • En cas de cyber-harcèlement, il est important de conserver des preuves (copies d'écran), bloquer l'agresseur, signaler le compte et demander le retrait des publications. [60-63]
      • Le site internet-signalement.gouv.fr permet de signaler les contenus illégaux. [64]

      Ressources et outils d'Internet sans crainte (1:12:48 - 1:31:50)

      • Le guide "Équiper son enfant d'un smartphone" [65, 66]
      • Guides pratiques sur les principaux réseaux sociaux [67, 68]
      • Dossier thématique sur le cyber-harcèlement, les réseaux sociaux, la parentalité numérique, etc. sur le site internet sans crainte [69-71]
      • Compte Instagram d'Internet sans crainte pour des conseils et informations [72, 73]

      Questions des participants et réponses d'Axelle Desaint (1:17:35 - 1:31:50)

      • Le danger de l'espionnage des enfants et l'importance de la confiance. [74]
      • La nécessité de dialoguer et d'explorer les réseaux sociaux avec son enfant. [75-78]
      • L'utilisation des applications de contrôle parental. [78-80]
      • La géolocalisation et ses impacts sur les jeunes. [81-83]
      • La méconnaissance de Telegram par les parents. [84]

      Ce sommaire n'inclut pas les remarques des participants qui n'apportent pas d'éléments clés à la compréhension du sujet principal.

    1. Reviewer #1 (Public review):

      Summary:

      This study introduces a novel therapeutic strategy for patients with high-risk HER2-positive breast cancer and demonstrates that the incorporation of pyrotinib into adjuvant trastuzumab therapy can improve invasive disease-free survival.

      Strengths:

      The study features robust logic and high-quality data. Data from 141 patients across 23 centers were analyzed, thereby effectively mitigating regional biases and endowing the research findings with high applicability.

      Weaknesses:

      (1) Introduction and Discussion: Update the literature regarding the efficacy of pyrotinib combined with trastuzumab in treating HER2-positive advanced breast cancer.

      (2) Did all the data have a normal distribution? Expand the description of statistical analysis.

      (3) The novelty and innovative potential of your manuscript compared to the published literature should be described in more detail in the abstract and discussion section.

      (4) Figure legend should provide a bit more detail about what readers should focus on.

      (5) P-values should be clarified for the analysis.

      (6) The order (A, B, and C) in Figure 3 should be labeled in the upper left corner of the Figure.

    2. Reviewer #2 (Public review):

      In this manuscript, Cao et al. evaluated the efficacy and safety of 12 months pyrotinib after trastuzumab-based adjuvant therapy in patients with high-risk, HER2-positive early or locally advanced breast cancer. Notably, the 2-year iDFS rate reached 94.59% (95% CI: 88.97-97.38) in all patients, and 94.90% (95% CI: 86.97-98.06) in patients who completed 1-year treatment of pyrotinib. This is an interesting and uplifting results, given that in ExteNET study, the 2-year iDFS rate was 93.9% (95% CI 92·4-95·2) in the 1-year neratinib group, and the 5-year iDFS survival was 90.2%, and 1-year treatment of neratinib in ExteNET study did not translate into OS benefit after 8-year follow-up. In this case, readers will be eagerly anticipating the long-term follow-up results of the current PERSIST study, as well as the results of the phase III clinical trial (NCT03980054).

      I have the following comments:

      (1) The introduction of the differences between pyrotinib and neratinib in terms of mechanism, efficacy, resistance, etc. is supposed to be included in the text so that authors could better highlight the clinical significance of the current trial.

      (2) Please make sure that a total of 141 patients were enrolled in the study, 38 patients had a treatment duration of less than or equal to 6 months, and a total of 92 and 31 patients completed 1-year and 6-month treatment of extended adjuvant pyrotinib, respectively, which means 7 patients had a treatment duration of fewer than 6 months.

      (3) The previous surgery history should be provided, and how many patients received lumpectomy, and mastectomy.

    1. Reviewer #1 (Public review):

      Summary:

      This study adapts a previously published model of the cat spinal locomotor network to make predictions of how phase durations of swing and stance at different treadmill speeds in tied-belt and split-belt conditions would be altered following a lateral hemisection. The simulations make several predictions that are replicated in experimental settings.

      Strengths:

      (1) Despite only altering the connections in the model, the model is able to replicate very well several experimental findings. This provides strong validation for the model and highlights its utility as a tool to investigate the operations of mammalian spinal locomotor networks.

      (2) The study provides insights about interactions between the left and right sides of the spinal locomotor networks, and how these interactions depend on the mode of operation, as determined by speed and state of the nervous system.

      (3) The writing is logical, clear, and easy to follow.

      Weaknesses:

      (1) Could the authors provide a statement in the methods or results to clarify whether there were any changes in synaptic weight or other model parameters of the intact model to ensure locomotor activity in the hemisected model?

      (2) The authors should remind the reader what the main differences are between state-machine, flexor-driven, and classical half-center regimes (lines 77-79).

      (3) There may be changes in the wiring of spinal locomotor networks after the hemisection. Yet, without applying any sort of plasticity, the model is able to replicate many of the experimental data. Based on what was experimentally replicated or not, what does the model tell us about possible sites of plasticity after hemisection?

      (4) Why are the durations on the right hemisected (fast) side similar to results in the full spinal transected model (Rybak et al. 2024)? Is it because the left is in slow mode and so there is not much drive from the left side to the right side even though the latter is still receiving supraspinal drive, as opposed to in the full transection model? (lines 202-203).

      (5) There is an error with probability (line 280).

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript uses the eye lens as a model to investigate basic mechanisms in the Fgf signaling pathway. Understanding Fgf signaling is of broad importance to biologists as it is involved in the regulation of various developmental processes in different tissues/organs and is often misregulated in disease states. The Fgf pathway has been studied in embryonic lens development, namely with regards to its involvement in controlling events such as tissue invagination, vesicle formation, epithelium proliferation, and cellular differentiation, thus making the lens a good system to uncover the mechanistic basis of how the modulation of this pathway drives specific outcomes. Previous work has suggested that proteins, other than the ones currently known (e.g., the adaptor protein Frs2), are likely involved in Fgfr signaling. The present study focuses on the role of Shp2 and Shc1 proteins in the recruitment of Grb2 in the events downstream of Fgfr activation.

      Strengths:

      The findings reveal that the juxtamembrane region of the Fgf receptor is necessary for proper control of downstream events such as facilitating key changes in transcription and cytoskeleton during tissue morphogenesis. The authors conditionally deleted all four Fgfrs in the mouse lens that resulted in molecular and morphological lens defects, most importantly, preventing the upregulation of the lens induction markers Sox2 and Foxe3 and the apical localization of F-actin, thus demonstrating the importance of Fgfrs in early lens development, i.e. during lens induction. They also examined the impact of deleting Fgfr1 and 2, on the following stage, i.e. lens vesicle development, which could be rescued by expressing constitutively active KrasG12D. By using specific mutations (e.g. Fgfr1ΔFrs lacking the Frs2 binding domain and Fgfr2LR harboring mutations that prevent binding of Frs2), it is demonstrated that the Frs2 binding site on Fgfr is necessary for specific events such as morphogenesis of lens vesicle. Further, by studying Shp2 mutations and deletions, the authors present a case for Shp2 protein to function in a context-specific manner in the role of an adaptor protein and a phosphatase enzyme. Finally, the key surprising finding from this study is that downstream of Fgfr signaling, Shc1 is an important alternative pathway - in addition to Shp2 - involved in the recruitment of Grb2 and in the subsequent activation of Ras. The methodologies, namely, mouse genetics and state-of-the-art cell/molecular/biochemical assays are appropriately used to collect the data, which are soundly interpreted to reach these important conclusions. Overall, these findings reveal the flexibility of the Fgf signaling pathway and its downstream mediators in regulating cellular events. This work is expected to be of broad interest to molecular and developmental biologists.

      Weaknesses:

      A weakness that needs to be discussed is that Le-Cre depends on Pax6 activation, and hence its use in specific gene deletion will not allow evaluation of the requirement of Fgfrs in the expression of Pax6 itself. But since this is the earliest Cre available for deletion in the lens, mentioning this in the discussion would make the readers aware of this issue. Referring to Jag1 among "lens-specific markers" (page 5) is debatable, suggesting changing to the lines of "the expected upregulation of Jag1 in lens vesicle". The Abstract could be modified to clearly convey the existing knowledge gap and the key findings of the present study. As it stands now, it is a bit all over the place. Some typos in the manuscript need to be fixed, e.g. "...yet its molecular mechanism remains largely resolved" - unresolved? "...in the development lens" - in the developing lens? In Figure 4 legend, "(B) Grb2 mutants Grb2 mutants displayed...", etc.

    2. Reviewer #2 (Public review):

      Summary:

      I have reviewed a manuscript submitted by Wang et al., which is entitled "Shc1 cooperates with Frs2 and Shp2 to recruit Grb2 in FGF-induced lens development". In this paper, the authors first examined lens phenotypes in mice with Le-Cre-mediated knockdown (KD) of all four FGFR (FGFR1-4), and found that pERK signals, Jag1, and foxe3 expression are absent or drastically reduced, indicating that FGF signaling is essential for lens induction. Next, the authors examined lens phenotypes of FGFR1/2-KD mice and found that lens fiber differentiation is compromised and that proliferative activity and cell survival are also compromised in lens epithelium. Interestingly, Kras activation rescues defects in lens growth and lens fiber differentiation in FGFR1/2-KD mice, indicating that Ras activation is a key step for lens development. Next, the authors examined the role of Frs2, Shp2, and Grb2 in FGF signaling for lens development. They confirmed that lens fiber differentiation is compromised in FGFR1/3-KD mice combined with Frs2-dysfunctional FGFR2 mutants, which is similar to lens phenotypes of Grb2-KD mice. However, lens defects are milder in mice with Shp2YF/YF and Shp2CS mutant alleles, indicating that the involvement of Shp2 is limited for the Grb2 recruitment for lens fiber differentiation. Lastly, the authors showed new evidence on the possibility that another adapter protein, Shc1, promotes Grb2 recruitment independent of Frs2/Shp2-mediated Grb2 recruitment.

      Strengths:

      Overall, the manuscript provides valuable data on how FGFR activation leads to Ras activation through the adapter platform of Frs2/Shp2/Grb2, which advances our understanding of complex modification of the FGF signaling pathway. The authors applied a genetic approach using mice, whose methods and results are valid to support the conclusion. The discussion also well summarizes the significance of their findings.

      Weaknesses:

      The authors eventually found that the new adaptor protein Shc1 is involved in Grb2 recruitments in response to FGF receptor activation. however, the main data for Shc1 are histological sections and statistical evaluation of lens size. So, my major concern is that the authors need to provide more detailed data to support the involvement of Shc1 in Grb2 recruitment of FGF signaling for lens development.

    3. Reviewer #3 (Public review):

      Summary:

      The manuscript entitled "Shc1 cooperates with Frs2 and Shp2 to recruit Grb2 in FGF-induced lens development" by Wang et al., investigates the molecular mechanism used by FGFR signaling to support lens development. The lens has long been known to depend on FGFR signaling for proper development. Previous investigations have demonstrated that FGFR signaling is required for embryonic lens cell survival and for lens fiber cell differentiation. The requirement of FGFR signaling for lens induction has remained more controversial as deletion of both Fgfr1 and Fgfr2 during lens placode formation does not prevent the induction of definitive lens markers such as FOXE3 or αA-crystallin. Here the authors have used the Le-Cre driver to delete all four FGFR genes from the developing lens placode demonstrating a definitive failure of lens induction in the absence of FGFR signaling. The authors focused on FGFR1 and FGFR2, the two primary FGFRs present during early lens development, and demonstrated that lens development could be significantly rescued in lenses lacking both FGFR1 and FGFR2 by expressing a constitutively active allele of KRAS. They also showed that the removal of pro-apoptotic genes Bax and Bak could also lead to a substantial rescue of lens development in lenses lacking both FGFR1 and FGFR2. In both cases, the lens rescue included both increased lens size and the expression of genes characteristic of lens cells.

      Significantly the authors concentrated on the juxtamembrane domain, a portion of the FGFRs associated with FRS2. Previous investigations have demonstrated the importance of FRS2 activation for mediating a sustained level of ERK activation. FRS2 is known to associate both with GRB2 and SHP2 to activate RAS. The authors utilized a mutant allele of Fgfr1, lacking the entire juxtamembrane domain (Fgfr1ΔFrs), and an allele of Fgfr2 containing two-point mutations essential for Frs2 binding (Fgfr2LR). When combining three floxed alleles and leaving only one functional allele (Fgfr1ΔFrs or Fgfr2LR) the authors got strikingly different phenotypes. When only the Fgfr1ΔFrs allele was retained, the lens phenotype matched that of deleting both Fgfr1 and Fgfr2. However, when only the Fgfr2LR allele was retained the phenotype was significantly milder, primarily affecting lens fiber cell differentiation, suggesting that something other than FRS2 might be interacting with the juxtamembrane domain to support FGFR signaling in the lens. The authors also deleted Grb2 in the lens and showed that the phenotype was similar to that of the lenses only retaining the Fgfr2LR allele, resulting in a failure of lens fiber cell differentiation and decreased lens cell survival. However, mutating the major tyrosine phosphorylation site of GRB2 did not affect lens development. The author additionally investigated the role of SHP2 lens development by making by either deleting SHP2 or by making mutations in the SHP2 catalytic domain. The deletion of the SHP2 phosphatase activity did not affect lens development as severely as the total loss of SHP2 protein, suggesting a function for SHP2 outside of its catalytic activity. Although the loss of Shc1 alone has only a slight effect on lens size and pERK activation in the lens, the authors showed that the loss of Shc1 exacerbated the lens phenotype in lenses lacking both Frs2 and Shp2. The authors suggest that SHC1 binds to the FGFR juxtamembrane domain allowing for the recruitment of GRB2 independently of FRS2.

      Strengths:

      (1) The authors used a variety of genetic tools to carefully dissect the essential signals downstream of FGFR signaling during lens development.

      (2) The authors made a convincing case that something other than FRS2 binding mediates FGFR signaling in the juxtamembrane domain.

      (3) The authors demonstrated that despite the requirement of both the adaptor function and phosphatase activity of SHP2 are required for embryonic survival, neither of these activities is absolutely required for lens development.

      (4) The authors provide more information as to why FGFR loss has a phenotype much more severe than the loss of FRS2 alone during lens development.

      (5) The authors followed up their work analyzing various signaling molecules in the context of lens development with biochemical analyses of FGF-induced phosphorylation in murine embryonic fibroblasts (MEFs).

      (6) In general, this manuscript represents a Herculean effort to dissect FGFR signaling in vivo with biochemical backing with cell culture experiments in vitro.

      Weaknesses:

      (1) The authors demonstrate that the loss of FGFR1 and FGFR2 can be compensated by a constitutive active KRAS allele in the lens and suggest that FGFRs largely support lens development only by driving ERK activation. However, the authors also saw that lens development was substantially rescued by preventing apoptosis through the deletion of BAK and BAX. To my knowledge, the deletion of BAK and BAX should not independently activate ERK. The authors do not show whether ERK activation is restored in the BAK/BAX deficient lenses. Do the authors suggest the FGFR3 and/or FGFR4 provide sufficient RAS and ERK activation for lens development when apoptosis is suppressed? Alternatively, is it the survival function of FGFR-signaling as much as a direct effect on lens differentiation?

      (2) The authors make the argument that deleting all four FGFRs prevented lens induction but that the deletion of only FGFR1 and FGFR2 did not. Part of this argument is the retention of FOXE3 expression, αA-crystallin expression, and PROX1 expression in the FGFR1/2 double mutants. However, in Figure 1E, and Figure 1F, the staining of the double mutant lens tissue with FOXE3, αA-crystallin, and PROX1 is unconvincing. However, the retention of FOXE3 expression in the FGFR1/FGFR2 double mutants was previously demonstrated in Garcia et al 2011. Also, there needs to be an enlargement or inset to demonstrate the retention of pSMAD in the quadruple FGFR mutants in Figure 1D.

      (3) Do the authors suggest that GRB2 is required for RAS activation and ultimately ERK activation? If so, do the authors suggest that ERK activation is not required for FGFR-signaling to mediate lens induction? This would follow considering that the GRB2 deficient lenses lack a problem with lens induction.

      (4) The increase in p-Shc is only slightly higher in the Cre FGFR1f/f FGFR2r/LR than in the FGFR1f/Δfrs FGFR2f/f. Can the authors provide quantification?

      (5) The authors have not shown directly that Shc1 binds to the juxtamembrane region of either Fgfr1 or Fgfr2.

      (6) The authors have used the Le-Cre strain for all of their lens deletion experiments. Previous work has documented that the Le-Cre transgene can cause lens defects independent of any floxed alleles in both homozygous and hemizygous states on some genetic backgrounds (Dora et al., 2014 PLoS One 9:e109193 and Lam et al., Human Genomics 2019 13(1):10. Are the controls used in these experiments Le-Cre hemizygotes?

    1. Reviewer #3 (Public review):

      This important study provides insights into the functional diversification of RIP family kinase proteins in vertebrate animals. The provided results, which combine bioinformatic and experimental analyses, will be of interest to specialists in both immunology and evolutionary biology. However, the computational part of the methodology is insufficiently covered in the paper and the experimental results would benefit from including data for additional species.

      (1) In the Methods section concerning gene loss analysis, the authors refer to the 'Phylogenetic analysis' section for details of RIPK sequence acquisition and alignment procedure. This section is missing from the manuscript as provided. In its absence, it is hard for the reviewer to provide relevant comments on gene presence/absence analysis.

      (2) In the same section, the authors state that gene sequences were filtered and grouped based on the initial gene tree pattern (lines 448-449). How exactly did the authors filter the non-RIP kinases and other irrelevant homologs from the gene trees? Did they consider the reciprocal best (BLAST) hit approach or similar approaches for orthology inference? Did they also encounter potential pseudogenes of genes marked as missing in Figure 1C? Will the gene trees mentioned be available as supplementary files?

      (3) The authors state the presence of additional RIPK2 paralog in non-therian vertebrates. The ramifications of this paralog loss in therians are not discussed in the text, although RIPK2 is also involved in NF-kB activation. In addition, the RIPK2B gene loss pattern is shunned from Figure 1C to Supplementary Figure 4, despite posing comparable interest to the reader.

      (4) The authors present evidence for (repeated) positive selection in both RIPK1 and RIPK3 in bats; however, neither bat RIPK1/3 orthologs nor bat-specific RHIM tetrad variants (IQFG, IQLG) are considered in the experimental part of the work.

      (5) The authors present gene presence/absence patterns for zebra mussels as an outgroup of vertebrate species analyzed. From the evolutionary perspective, adding results for a closer invertebrate group, such as lancelets, tunicates, or echinoderms, would be beneficial for reconstructing the evolutionary progression of RIPK-mediated immune functions in animals.

      (6) In the broader sense, the list of non-mammalian species included in the study is not explained or substantiated in the text. What was the rationale behind selecting lizards, turtles, and lampreys for experimental assays? Why was turtle RIPK3 but not turtle RIPK1CT protein used for functional tests? Which results do the authors expect to observe if amphibian or teleost RIPK1/3 are included in the analysis, especially those with divergent tetrad variants?

      (7) For lamprey RIPK3, the observed NF-kB activity levels still remain lower than those of mammalian and reptilian orthologs even after catalytic tetrad modification. In the same way, switching human RIPK3 catalytic tetrad to that of lamprey does not result in NF-kB activation. What are the potential reasons for the observed difference? Does it mean that lamprey's RIPK3 functions in NF-kB activation are, at least partially, delegated to RIPK1?

      (8) In lines 386-388, the authors state that 'only non-mammalian RIPK1CT proteins required the RHIM for maximal NF-kB activation', which is corroborated by results in Figure 4B. The authors further associate this finding with a lack of ZBP1 in the respective species (lines 388-389). However, non-squamate reptiles seem to retain ZBP1, as suggested by Supplementary Table 1. Given that, do the authors expect to observe RHIM-independent (maximal) NF-kB activation in turtles and crocodilians or respective RIPK1CT-transfected cells?

    1. Who Can Name the Bigger Number?by Scott Aaronson [Author's blog] [This essay in Spanish] [This essay in French] [This essay in Chinese] In an old joke, two noblemen vie to name the bigger number. The first, after ruminating for hours, triumphantly announces "Eighty-three!" The second, mightily impressed, replies "You win." A biggest number contest is clearly pointless when the contestants take turns. But what if the contestants write down their numbers simultaneously, neither aware of the other’s? To introduce a talk on "Big Numbers," I invite two audience volunteers to try exactly this. I tell them the rules: You have fifteen seconds. Using standard math notation, English words, or both, name a single whole number—not an infinity—on a blank index card. Be precise enough for any reasonable modern mathematician to determine exactly what number you’ve named, by consulting only your card and, if necessary, the published literature. So contestants can’t say "the number of sand grains in the Sahara," because sand drifts in and out of the Sahara regularly. Nor can they say "my opponent’s number plus one," or "the biggest number anyone’s ever thought of plus one"—again, these are ill-defined, given what our reasonable mathematician has available. Within the rules, the contestant who names the bigger number wins. Are you ready? Get set. Go. The contest’s results are never quite what I’d hope. Once, a seventh-grade boy filled his card with a string of successive 9’s. Like many other big-number tyros, he sought to maximize his number by stuffing a 9 into every place value. Had he chosen easy-to-write 1’s rather than curvaceous 9’s, his number could have been millions of times bigger. He still would been decimated, though, by the girl he was up against, who wrote a string of 9’s followed by the superscript 999. Aha! An exponential: a number multiplied by itself 999 times. Noticing this innovation, I declared the girl’s victory without bothering to count the 9’s on the cards. And yet the girl’s number could have been much bigger still, had she stacked the mighty exponential more than once. Take , for example. This behemoth, equal to 9387,420,489, has 369,693,100 digits. By comparison, the number of elementary particles in the observable universe has a meager 85 digits, give or take. Three 9’s, when stacked exponentially, already lift us incomprehensibly beyond all the matter we can observe—by a factor of about 10369,693,015. And we’ve said nothing of or . Place value, exponentials, stacked exponentials: each can express boundlessly big numbers, and in this sense they’re all equivalent. But the notational systems differ dramatically in the numbers they can express concisely. That’s what the fifteen-second time limit illustrates. It takes the same amount of time to write 9999, 9999, and —yet the first number is quotidian, the second astronomical, and the third hyper-mega astronomical. The key to the biggest number contest is not swift penmanship, but rather a potent paradigm for concisely capturing the gargantuan. Such paradigms are historical rarities. We find a flurry in antiquity, another flurry in the twentieth century, and nothing much in between. But when a new way to express big numbers concisely does emerge, it’s often a byproduct of a major scientific revolution: systematized mathematics, formal logic, computer science. Revolutions this momentous, as any Kuhnian could tell you, only happen under the right social conditions. Thus is the story of big numbers a story of human progress. And herein lies a parallel with another mathematical story. In his remarkable and underappreciated book A History of π, Petr Beckmann argues that the ratio of circumference to diameter is "a quaint little mirror of the history of man." In the rare societies where science and reason found refuge—the early Athens of Anaxagoras and Hippias, the Alexandria of Eratosthenes and Euclid, the seventeenth-century England of Newton and Wallis—mathematicians made tremendous strides in calculating π. In Rome and medieval Europe, by contrast, knowledge of π stagnated. Crude approximations such as the Babylonians’ 25/8 held sway. This same pattern holds, I think, for big numbers. Curiosity and openness lead to fascination with big numbers, and to the buoyant view that no quantity, whether of the number of stars in the galaxy or the number of possible bridge hands, is too immense for the mind to enumerate. Conversely, ignorance and irrationality lead to fatalism concerning big numbers. Historian Ilan Vardi cites the ancient Greek term sand-hundred, colloquially meaning zillion; as well as a passage from Pindar’s Olympic Ode II asserting that "sand escapes counting." ¨ But sand doesn’t escape counting, as Archimedes recognized in the third century B.C. Here’s how he began The Sand-Reckoner, a sort of pop-science article addressed to the King of Syracuse: There are some ... who think that the number of the sand is infinite in multitude ... again there are some who, without regarding it as infinite, yet think that no number has been named which is great enough to exceed its multitude ... But I will try to show you [numbers that] exceed not only the number of the mass of sand equal in magnitude to the earth ... but also that of a mass equal in magnitude to the universe. This Archimedes proceeded to do, essentially by using the ancient Greek term myriad, meaning ten thousand, as a base for exponentials. Adopting a prescient cosmological model of Aristarchus, in which the "sphere of the fixed stars" is vastly greater than the sphere in which the Earth revolves around the sun, Archimedes obtained an upper bound of 1063 on the number of sand grains needed to fill the universe. (Supposedly 1063 is the biggest number with a lexicographically standard American name: vigintillion. But the staid vigintillion had better keep vigil lest it be encroached upon by the more whimsically-named googol, or 10100, and googolplex, or .) Vast though it was, of course, 1063 wasn’t to be enshrined as the all-time biggest number. Six centuries later, Diophantus developed a simpler notation for exponentials, allowing him to surpass . Then, in the Middle Ages, the rise of Arabic numerals and place value made it easy to stack exponentials higher still. But Archimedes’ paradigm for expressing big numbers wasn’t fundamentally surpassed until the twentieth century. And even today, exponentials dominate popular discussion of the immense. Consider, for example, the oft-repeated legend of the Grand Vizier in Persia who invented chess. The King, so the legend goes, was delighted with the new game, and invited the Vizier to name his own reward. The Vizier replied that, being a modest man, he desired only one grain of wheat on the first square of a chessboard, two grains on the second, four on the third, and so on, with twice as many grains on each square as on the last. The innumerate King agreed, not realizing that the total number of grains on all 64 squares would be 264-1, or 18.6 quintillion—equivalent to the world’s present wheat production for 150 years. Fittingly, this same exponential growth is what makes chess itself so difficult. There are only about 35 legal choices for each chess move, but the choices multiply exponentially to yield something like 1050 possible board positions—too many for even a computer to search exhaustively. That’s why it took until 1997 for a computer, Deep Blue, to defeat the human world chess champion. And in Go, which has a 19-by-19 board and over 10150 possible positions, even an amateur human can still rout the world’s top-ranked computer programs. Exponential growth plagues computers in other guises as well. The traveling salesman problem asks for the shortest route connecting a set of cities, given the distances between each pair of cities. The rub is that the number of possible routes grows exponentially with the number of cities. When there are, say, a hundred cities, there are about 10158 possible routes, and, although various shortcuts are possible, no known computer algorithm is fundamentally better than checking each route one by one. The traveling salesman problem belongs to a class called NP-complete, which includes hundreds of other problems of practical interest. (NP stands for the technical term ‘Nondeterministic Polynomial-Time.’) It’s known that if there’s an efficient algorithm for any NP-complete problem, then there are efficient algorithms for all of them. Here ‘efficient’ means using an amount of time proportional to at most the problem size raised to some fixed power—for example, the number of cities cubed. It’s conjectured, however, that no efficient algorithm for NP-complete problems exists. Proving this conjecture, called P¹ NP, has been a great unsolved problem of computer science for thirty years. Although computers will probably never solve NP-complete problems efficiently, there’s more hope for another grail of computer science: replicating human intelligence. The human brain has roughly a hundred billion neurons linked by a hundred trillion synapses. And though the function of an individual neuron is only partially understood, it’s thought that each neuron fires electrical impulses according to relatively simple rules up to a thousand times each second. So what we have is a highly interconnected computer capable of maybe 1014 operations per second; by comparison, the world’s fastest parallel supercomputer, the 9200-Pentium Pro teraflops machine at Sandia National Labs, can perform 1012 operations per second. Contrary to popular belief, gray mush is not only hard-wired for intelligence: it surpasses silicon even in raw computational power. But this is unlikely to remain true for long. The reason is Moore’s Law, which, in its 1990’s formulation, states that the amount of information storable on a silicon chip grows exponentially, doubling roughly once every two years. Moore’s Law will eventually play out, as microchip components reach the atomic scale and conventional lithography falters. But radical new technologies, such as optical computers, DNA computers, or even quantum computers, could conceivably usurp silicon’s place. Exponential growth in computing power can’t continue forever, but it may continue long enough for computers—at least in processing power—to surpass human brains. To prognosticators of artificial intelligence, Moore’s Law is a glorious herald of exponential growth. But exponentials have a drearier side as well. The human population recently passed six billion and is doubling about once every forty years. At this exponential rate, if an average person weighs seventy kilograms, then by the year 3750 the entire Earth will be composed of human flesh. But before you invest in deodorant, realize that the population will stop increasing long before this—either because of famine, epidemic disease, global warming, mass species extinctions, unbreathable air, or, entering the speculative realm, birth control. It’s not hard to fathom why physicist Albert Bartlett asserted "the greatest shortcoming of the human race" to be "our inability to understand the exponential function." Or why Carl Sagan advised us to "never underestimate an exponential." In his book Billions & Billions, Sagan gave some other depressing consequences of exponential growth. At an inflation rate of five percent a year, a dollar is worth only thirty-seven cents after twenty years. If a uranium nucleus emits two neutrons, both of which collide with other uranium nuclei, causing them to emit two neutrons, and so forth—well, did I mention nuclear holocaust as a possible end to population growth? ¨ Exponentials are familiar, relevant, intimately connected to the physical world and to human hopes and fears. Using the notational systems I’ll discuss next, we can concisely name numbers that make exponentials picayune by comparison, that subjectively speaking exceed as much as the latter exceeds 9. But these new systems may seem more abstruse than exponentials. In his essay "On Number Numbness," Douglas Hofstadter leads his readers to the precipice of these systems, but then avers: If we were to continue our discussion just one zillisecond longer, we would find ourselves smack-dab in the middle of the theory of recursive functions and algorithmic complexity, and that would be too abstract. So let’s drop the topic right here. But to drop the topic is to forfeit, not only the biggest number contest, but any hope of understanding how stronger paradigms lead to vaster numbers. And so we arrive in the early twentieth century, when a school of mathematicians called the formalists sought to place all of mathematics on a rigorous axiomatic basis. A key question for the formalists was what the word ‘computable’ means. That is, how do we tell whether a sequence of numbers can be listed by a definite, mechanical procedure? Some mathematicians thought that ‘computable’ coincided with a technical notion called ‘primitive recursive.’ But in 1928 Wilhelm Ackermann disproved them by constructing a sequence of numbers that’s clearly computable, yet grows too quickly to be primitive recursive. Ackermann’s idea was to create an endless procession of arithmetic operations, each more powerful than the last. First comes addition. Second comes multiplication, which we can think of as repeated addition: for example, 5´3 means 5 added to itself 3 times, or 5+5+5 = 15. Third comes exponentiation, which we can think of as repeated multiplication. Fourth comes ... what? Well, we have to invent a weird new operation, for repeated exponentiation. The mathematician Rudy Rucker calls it ‘tetration.’ For example, ‘5 tetrated to the 3’ means 5 raised to its own power 3 times, or , a number with 2,185 digits. We can go on. Fifth comes repeated tetration: shall we call it ‘pentation’? Sixth comes repeated pentation: ‘hexation’? The operations continue infinitely, with each one standing on its predecessor to peer even higher into the firmament of big numbers. If each operation were a candy flavor, then the Ackermann sequence would be the sampler pack, mixing one number of each flavor. First in the sequence is 1+1, or (don’t hold your breath) 2. Second is 2´2, or 4. Third is 3 raised to the 3rd power, or 27. Hey, these numbers aren’t so big! Fee. Fi. Fo. Fum. Fourth is 4 tetrated to the 4, or , which has 10154 digits. If you’re planning to write this number out, better start now. Fifth is 5 pentated to the 5, or with ‘5 pentated to the 4’ numerals in the stack. This number is too colossal to describe in any ordinary terms. And the numbers just get bigger from there. Wielding the Ackermann sequence, we can clobber unschooled opponents in the biggest-number contest. But we need to be careful, since there are several definitions of the Ackermann sequence, not all identical. Under the fifteen-second time limit, here’s what I might write to avoid ambiguity: A(111)—Ackermann seq—A(1)=1+1, A(2)=2´2, A(3)=33, etc Recondite as it seems, the Ackermann sequence does have some applications. A problem in an area called Ramsey theory asks for the minimum dimension of a hypercube satisfying a certain property. The true dimension is thought to be 6, but the lowest dimension anyone’s been able is prove is so huge that it can only be expressed using the same ‘weird arithmetic’ that underlies the Ackermann sequence. Indeed, the Guinness Book of World Records once listed this dimension as the biggest number ever used in a mathematical proof. (Another contender for the title once was Skewes’ number, about , which arises in the study of how prime numbers are distributed. The famous mathematician G. H. Hardy quipped that Skewes’ was "the largest number which has ever served any definite purpose in mathematics.") What’s more, Ackermann’s briskly-rising cavalcade performs an occasional cameo in computer science. For example, in the analysis of a data structure called ‘Union-Find,’ a term gets multiplied by the inverse of the Ackermann sequence—meaning, for each whole number X, the first number N such that the Nth Ackermann number is bigger than X. The inverse grows as slowly as Ackermann’s original sequence grows quickly; for all practical purposes, the inverse is at most 4. ¨ Ackermann numbers are pretty big, but they’re not yet big enough. The quest for still bigger numbers takes us back to the formalists. After Ackermann demonstrated that ‘primitive recursive’ isn’t what we mean by ‘computable,’ the question still stood: what do we mean by ‘computable’? In 1936, Alonzo Church and Alan Turing independently answered this question. While Church answered using a logical formalism called the lambda calculus, Turing answered using an idealized computing machine—the Turing machine—that, in essence, is equivalent to every Compaq, Dell, Macintosh, and Cray in the modern world. Turing’s paper describing his machine, "On Computable Numbers," is rightly celebrated as the founding document of computer science. "Computing," said Turing, is normally done by writing certain symbols on paper. We may suppose this paper to be divided into squares like a child’s arithmetic book. In elementary arithmetic the 2-dimensional character of the paper is sometimes used. But such use is always avoidable, and I think it will be agreed that the two-dimensional character of paper is no essential of computation. I assume then that the computation is carried out on one-dimensional paper, on a tape divided into squares. Turing continued to explicate his machine using ingenious reasoning from first principles. The tape, said Turing, extends infinitely in both directions, since a theoretical machine ought not be constrained by physical limits on resources. Furthermore, there’s a symbol written on each square of the tape, like the ‘1’s and ‘0’s in a modern computer’s memory. But how are the symbols manipulated? Well, there’s a ‘tape head’ moving back and forth along the tape, examining one square at a time, writing and erasing symbols according to definite rules. The rules are the tape head’s program: change them, and you change what the tape head does. Turing’s august insight was that we can program the tape head to carry out any computation. Turing machines can add, multiply, extract cube roots, sort, search, spell-check, parse, play Tic-Tac-Toe, list the Ackermann sequence. If we represented keyboard input, monitor output, and so forth as symbols on the tape, we could even run Windows on a Turing machine. But there’s a problem. Set a tape head loose on a sequence of symbols, and it might stop eventually, or it might run forever—like the fabled programmer who gets stuck in the shower because the instructions on the shampoo bottle read "lather, rinse, repeat." If the machine’s going to run forever, it’d be nice to know this in advance, so that we don’t spend an eternity waiting for it to finish. But how can we determine, in a finite amount of time, whether something will go on endlessly? If you bet a friend that your watch will never stop ticking, when could you declare victory? But maybe there’s some ingenious program that can examine other programs and tell us, infallibly, whether they’ll ever stop running. We just haven’t thought of it yet. Nope. Turing proved that this problem, called the Halting Problem, is unsolvable by Turing machines. The proof is a beautiful example of self-reference. It formalizes an old argument about why you can never have perfect introspection: because if you could, then you could determine what you were going to do ten seconds from now, and then do something else. Turing imagined that there was a special machine that could solve the Halting Problem. Then he showed how we could have this machine analyze itself, in such a way that it has to halt if it runs forever, and run forever if it halts. Like a hound that finally catches its tail and devours itself, the mythical machine vanishes in a fury of contradiction. (That’s the sort of thing you don’t say in a research paper.) ¨ "Very nice," you say (or perhaps you say, "not nice at all"). "But what does all this have to do with big numbers?" Aha! The connection wasn’t published until May of 1962. Then, in the Bell System Technical Journal, nestled between pragmatically-minded papers on "Multiport Structures" and "Waveguide Pressure Seals," appeared the modestly titled "On Non-Computable Functions" by Tibor Rado. In this paper, Rado introduced the biggest numbers anyone had ever imagined. His idea was simple. Just as we can classify words by how many letters they contain, we can classify Turing machines by how many rules they have in the tape head. Some machines have only one rule, others have two rules, still others have three rules, and so on. But for each fixed whole number N, just as there are only finitely many distinct words with N letters, so too are there only finitely many distinct machines with N rules. Among these machines, some halt and others run forever when started on a blank tape. Of the ones that halt, asked Rado, what’s the maximum number of steps that any machine takes before it halts? (Actually, Rado asked mainly about the maximum number of symbols any machine can write on the tape before halting. But the maximum number of steps, which Rado called S(n), has the same basic properties and is easier to reason about.) Rado called this maximum the Nth "Busy Beaver" number. (Ah yes, the early 1960’s were a more innocent age.) He visualized each Turing machine as a beaver bustling busily along the tape, writing and erasing symbols. The challenge, then, is to find the busiest beaver with exactly N rules, albeit not an infinitely busy one. We can interpret this challenge as one of finding the "most complicated" computer program N bits long: the one that does the most amount of stuff, but not an infinite amount. Now, suppose we knew the Nth Busy Beaver number, which we’ll call BB(N). Then we could decide whether any Turing machine with N rules halts on a blank tape. We’d just have to run the machine: if it halts, fine; but if it doesn’t halt within BB(N) steps, then we know it never will halt, since BB(N) is the maximum number of steps it could make before halting. Similarly, if you knew that all mortals died before age 200, then if Sally lived to be 200, you could conclude that Sally was immortal. So no Turing machine can list the Busy Beaver numbers—for if it could, it could solve the Halting Problem, which we already know is impossible. But here’s a curious fact. Suppose we could name a number greater than the Nth Busy Beaver number BB(N). Call this number D for dam, since like a beaver dam, it’s a roof for the Busy Beaver below. With D in hand, computing BB(N) itself becomes easy: we just need to simulate all the Turing machines with N rules. The ones that haven’t halted within D steps—the ones that bash through the dam’s roof—never will halt. So we can list exactly which machines halt, and among these, the maximum number of steps that any machine takes before it halts is BB(N). Conclusion? The sequence of Busy Beaver numbers, BB(1), BB(2), and so on, grows faster than any computable sequence. Faster than exponentials, stacked exponentials, the Ackermann sequence, you name it. Because if a Turing machine could compute a sequence that grows faster than Busy Beaver, then it could use that sequence to obtain the D‘s—the beaver dams. And with those D’s, it could list the Busy Beaver numbers, which (sound familiar?) we already know is impossible. The Busy Beaver sequence is non-computable, solely because it grows stupendously fast—too fast for any computer to keep up with it, even in principle. This means that no computer program could list all the Busy Beavers one by one. It doesn’t mean that specific Busy Beavers need remain eternally unknowable. And in fact, pinning them down has been a computer science pastime ever since Rado published his article. It’s easy to verify that BB(1), the first Busy Beaver number, is 1. That’s because if a one-rule Turing machine doesn’t halt after the very first step, it’ll just keep moving along the tape endlessly. There’s no room for any more complex behavior. With two rules we can do more, and a little grunt work will ascertain that BB(2) is 6. Six steps. What about the third Busy Beaver? In 1965 Rado, together with Shen Lin, proved that BB(3) is 21. The task was an arduous one, requiring human analysis of many machines to prove that they don’t halt—since, remember, there’s no algorithm for listing the Busy Beaver numbers. Next, in 1983, Allan Brady proved that BB(4) is 107. Unimpressed so far? Well, as with the Ackermann sequence, don’t be fooled by the first few numbers. In 1984, A.K. Dewdney devoted a Scientific American column to Busy Beavers, which inspired amateur mathematician George Uhing to build a special-purpose device for simulating Turing machines. The device, which cost Uhing less than $100, found a five-rule machine that runs for 2,133,492 steps before halting—establishing that BB(5) must be at least as high. Then, in 1989, Heiner Marxen and Jürgen Buntrock discovered that BB(5) is at least 47,176,870. To this day, BB(5) hasn’t been pinned down precisely, and it could turn out to be much higher still. As for BB(6), Marxen and Buntrock set another record in 1997 by proving that it’s at least 8,690,333,381,690,951. A formidable accomplishment, yet Marxen, Buntrock, and the other Busy Beaver hunters are merely wading along the shores of the unknowable. Humanity may never know the value of BB(6) for certain, let alone that of BB(7) or any higher number in the sequence. Indeed, already the top five and six-rule contenders elude us: we can’t explain how they ‘work’ in human terms. If creativity imbues their design, it’s not because humans put it there. One way to understand this is that even small Turing machines can encode profound mathematical problems. Take Goldbach’s conjecture, that every even number 4 or higher is a sum of two prime numbers: 10=7+3, 18=13+5. The conjecture has resisted proof since 1742. Yet we could design a Turing machine with, oh, let’s say 100 rules, that tests each even number to see whether it’s a sum of two primes, and halts when and if it finds a counterexample to the conjecture. Then knowing BB(100), we could in principle run this machine for BB(100) steps, decide whether it halts, and thereby resolve Goldbach’s conjecture. We need not venture far in the sequence to enter the lair of basilisks. But as Rado stressed, even if we can’t list the Busy Beaver numbers, they’re perfectly well-defined mathematically. If you ever challenge a friend to the biggest number contest, I suggest you write something like this: BB(11111)—Busy Beaver shift #—1, 6, 21, etc If your friend doesn’t know about Turing machines or anything similar, but only about, say, Ackermann numbers, then you’ll win the contest. You’ll still win even if you grant your friend a handicap, and allow him the entire lifetime of the universe to write his number. The key to the biggest number contest is a potent paradigm, and Turing’s theory of computation is potent indeed. ¨ But what if your friend knows about Turing machines as well? Is there a notational system for big numbers more powerful than even Busy Beavers? Suppose we could endow a Turing machine with a magical ability to solve the Halting Problem. What would we get? We’d get a ‘super Turing machine’: one with abilities beyond those of any ordinary machine. But now, how hard is it to decide whether a super machine halts? Hmm. It turns out that not even super machines can solve this ‘super Halting Problem’, for the same reason that ordinary machines can’t solve the ordinary Halting Problem. To solve the Halting Problem for super machines, we’d need an even more powerful machine: a ‘super duper machine.’ And to solve the Halting Problem for super duper machines, we’d need a ‘super duper pooper machine.’ And so on endlessly. This infinite hierarchy of ever more powerful machines was formalized by the logician Stephen Kleene in 1943 (although he didn’t use the term ‘super duper pooper’). Imagine a novel, which is imbedded in a longer novel, which itself is imbedded in an even longer novel, and so on ad infinitum. Within each novel, the characters can debate the literary merits of any of the sub-novels. But, by analogy with classes of machines that can’t analyze themselves, the characters can never critique the novel that they themselves are in. (This, I think, jibes with our ordinary experience of novels.) To fully understand some reality, we need to go outside of that reality. This is the essence of Kleene’s hierarchy: that to solve the Halting Problem for some class of machines, we need a yet more powerful class of machines. And there’s no escape. Suppose a Turing machine had a magical ability to solve the Halting Problem, and the super Halting Problem, and the super duper Halting Problem, and the super duper pooper Halting Problem, and so on endlessly. Surely this would be the Queen of Turing machines? Not quite. As soon as we want to decide whether a ‘Queen of Turing machines’ halts, we need a still more powerful machine: an ‘Empress of Turing machines.’ And Kleene’s hierarchy continues. But how’s this relevant to big numbers? Well, each level of Kleene’s hierarchy generates a faster-growing Busy Beaver sequence than do all the previous levels. Indeed, each level’s sequence grows so rapidly that it can only be computed by a higher level. For example, define BB2(N) to be the maximum number of steps a super machine with N rules can make before halting. If this super Busy Beaver sequence were computable by super machines, then those machines could solve the super Halting Problem, which we know is impossible. So the super Busy Beaver numbers grow too rapidly to be computed, even if we could compute the ordinary Busy Beaver numbers. You might think that now, in the biggest-number contest, you could obliterate even an opponent who uses the Busy Beaver sequence by writing something like this: BB2(11111). But not quite. The problem is that I’ve never seen these "higher-level Busy Beavers" defined anywhere, probably because, to people who know computability theory, they’re a fairly obvious extension of the ordinary Busy Beaver numbers. So our reasonable modern mathematician wouldn’t know what number you were naming. If you want to use higher-level Busy Beavers in the biggest number contest, here’s what I suggest. First, publish a paper formalizing the concept in some obscure, low-prestige journal. Then, during the contest, cite the paper on your index card. To exceed higher-level Busy Beavers, we’d presumably need some new computational model surpassing even Turing machines. I can’t imagine what such a model would look like. Yet somehow I doubt that the story of notational systems for big numbers is over. Perhaps someday humans will be able concisely to name numbers that make Busy Beaver 100 seem as puerile and amusingly small as our nobleman’s eighty-three. Or if we’ll never name such numbers, perhaps other civilizations will. Is a biggest number contest afoot throughout the galaxy? ¨ You might wonder why we can’t transcend the whole parade of paradigms, and name numbers by a system that encompasses and surpasses them all. Suppose you wrote the following in the biggest number contest: The biggest whole number nameable with 1,000 characters of English text Surely this number exists. Using 1,000 characters, we can name only finitely many numbers, and among these numbers there has to be a biggest. And yet we’ve made no reference to how the number’s named. The English text could invoke Ackermann numbers, or Busy Beavers, or higher-level Busy Beavers, or even some yet more sweeping concept that nobody’s thought of yet. So unless our opponent uses the same ploy, we’ve got him licked. What a brilliant idea! Why didn’t we think of this earlier? Unfortunately it doesn’t work. We might as well have written One plus the biggest whole number nameable with 1,000 characters of English text This number takes at least 1,001 characters to name. Yet we’ve just named it with only 80 characters! Like a snake that swallows itself whole, our colossal number dissolves in a tumult of contradiction. What gives? The paradox I’ve just described was first published by Bertrand Russell, who attributed it to a librarian named G. G. Berry. The Berry Paradox arises not from mathematics, but from the ambiguity inherent in the English language. There’s no surefire way to convert an English phrase into the number it names (or to decide whether it names a number at all), which is why I invoked a "reasonable modern mathematician" in the rules for the biggest number contest. To circumvent the Berry Paradox, we need to name numbers using a precise, mathematical notational system, such as Turing machines—which is exactly the idea behind the Busy Beaver sequence. So in short, there’s no wily language trick by which to surpass Archimedes, Ackermann, Turing, and Rado, no royal road to big numbers. You might also wonder why we can’t use infinity in the contest. The answer is, for the same reason why we can’t use a rocket car in a bike race. Infinity is fascinating and elegant, but it’s not a whole number. Nor can we ‘subtract from infinity’ to yield a whole number. Infinity minus 17 is still infinity, whereas infinity minus infinity is undefined: it could be 0, 38, or even infinity again. Actually I should speak of infinities, plural. For in the late nineteenth century, Georg Cantor proved that there are different levels of infinity: for example, the infinity of points on a line is greater than the infinity of whole numbers. What’s more, just as there’s no biggest number, so too is there no biggest infinity. But the quest for big infinities is more abstruse than the quest for big numbers. And it involves, not a succession of paradigms, but essentially one: Cantor’s. ¨ So here we are, at the frontier of big number knowledge. As Euclid’s disciple supposedly asked, "what is the use of all this?" We’ve seen that progress in notational systems for big numbers mirrors progress in broader realms: mathematics, logic, computer science. And yet, though a mirror reflects reality, it doesn’t necessarily influence it. Even within mathematics, big numbers are often considered trivialities, their study an idle amusement with no broader implications. I want to argue a contrary view: that understanding big numbers is a key to understanding the world. Imagine trying to explain the Turing machine to Archimedes. The genius of Syracuse listens patiently as you discuss the papyrus tape extending infinitely in both directions, the time steps, states, input and output sequences. At last he explodes. "Foolishness!" he declares (or the ancient Greek equivalent). "All you’ve given me is an elaborate definition, with no value outside of itself." How do you respond? Archimedes has never heard of computers, those cantankerous devices that, twenty-three centuries from his time, will transact the world’s affairs. So you can’t claim practical application. Nor can you appeal to Hilbert and the formalist program, since Archimedes hasn’t heard of those either. But then it hits you: the Busy Beaver sequence. You define the sequence for Archimedes, convince him that BB(1000) is more than his 1063 grains of sand filling the universe, more even than 1063 raised to its own power 1063 times. You defy him to name a bigger number without invoking Turing machines or some equivalent. And as he ponders this challenge, the power of the Turing machine concept dawns on him. Though his intuition may never apprehend the Busy Beaver numbers, his reason compels him to acknowledge their immensity. Big numbers have a way of imbuing abstract notions with reality. Indeed, one could define science as reason’s attempt to compensate for our inability to perceive big numbers. If we could run at 280,000,000 meters per second, there’d be no need for a special theory of relativity: it’d be obvious to everyone that the faster we go, the heavier and squatter we get, and the faster time elapses in the rest of the world. If we could live for 70,000,000 years, there’d be no theory of evolution, and certainly no creationism: we could watch speciation and adaptation with our eyes, instead of painstakingly reconstructing events from fossils and DNA. If we could bake bread at 20,000,000 degrees Kelvin, nuclear fusion would be not the esoteric domain of physicists but ordinary household knowledge. But we can’t do any of these things, and so we have science, to deduce about the gargantuan what we, with our infinitesimal faculties, will never sense. If people fear big numbers, is it any wonder that they fear science as well and turn for solace to the comforting smallness of mysticism? But do people fear big numbers? Certainly they do. I’ve met people who don’t know the difference between a million and a billion, and don’t care. We play a lottery with ‘six ways to win!,’ overlooking the twenty million ways to lose. We yawn at six billion tons of carbon dioxide released into the atmosphere each year, and speak of ‘sustainable development’ in the jaws of exponential growth. Such cases, it seems to me, transcend arithmetical ignorance and represent a basic unwillingness to grapple with the immense. Whence the cowering before big numbers, then? Does it have a biological origin? In 1999, a group led by neuropsychologist Stanislas Dehaene reported evidence in Science that two separate brain systems contribute to mathematical thinking. The group trained Russian-English bilinguals to solve a set of problems, including two-digit addition, base-eight addition, cube roots, and logarithms. Some subjects were trained in Russian, others in English. When the subjects were then asked to solve problems approximately—to choose the closer of two estimates—they performed equally well in both languages. But when asked to solve problems exactly, they performed better in the language of their training. What’s more, brain-imaging evidence showed that the subjects’ parietal lobes, involved in spatial reasoning, were more active during approximation problems; while the left inferior frontal lobes, involved in verbal reasoning, were more active during exact calculation problems. Studies of patients with brain lesions paint the same picture: those with parietal lesions sometimes can’t decide whether 9 is closer to 10 or to 5, but remember the multiplication table; whereas those with left-hemispheric lesions sometimes can’t decide whether 2+2 is 3 or 4, but know that the answer is closer to 3 than to 9. Dehaene et al. conjecture that humans represent numbers in two ways. For approximate reckoning we use a ‘mental number line,’ which evolved long ago and which we likely share with other animals. But for exact computation we use numerical symbols, which evolved recently and which, being language-dependent, are unique to humans. This hypothesis neatly explains the experiment’s findings: the reason subjects performed better in the language of their training for exact computation but not for approximation problems is that the former call upon the verbally-oriented left inferior frontal lobes, and the latter upon the spatially-oriented parietal lobes. If Dehaene et al.’s hypothesis is correct, then which representation do we use for big numbers? Surely the symbolic one—for nobody’s mental number line could be long enough to contain , 5 pentated to the 5, or BB(1000). And here, I suspect, is the problem. When thinking about 3, 4, or 7, we’re guided by our spatial intuition, honed over millions of years of perceiving 3 gazelles, 4 mates, 7 members of a hostile clan. But when thinking about BB(1000), we have only language, that evolutionary neophyte, to rely upon. The usual neural pathways for representing numbers lead to dead ends. And this, perhaps, is why people are afraid of big numbers. Could early intervention mitigate our big number phobia? What if second-grade math teachers took an hour-long hiatus from stultifying busywork to ask their students, "How do you name really, really big numbers?" And then told them about exponentials and stacked exponentials, tetration and the Ackermann sequence, maybe even Busy Beavers: a cornucopia of numbers vaster than any they’d ever conceived, and ideas stretching the bounds of their imaginations. Who can name the bigger number? Whoever has the deeper paradigm. Are you ready? Get set. Go. References Petr Beckmann, A History of Pi, Golem Press, 1971. Allan H. Brady, "The Determination of the Value of Rado’s Noncomputable Function Sigma(k) for Four-State Turing Machines," Mathematics of Computation, vol. 40, no. 162, April 1983, pp 647- 665. Gregory J. Chaitin, "The Berry Paradox," Complexity, vol. 1, no. 1, 1995, pp. 26- 30. At http://www.umcs.maine.edu/~chaitin/unm2.html. A.K. Dewdney, The New Turing Omnibus: 66 Excursions in Computer Science, W.H. Freeman, 1993. S. Dehaene and E. Spelke and P. Pinel and R. Stanescu and S. Tsivkin, "Sources of Mathematical Thinking: Behavioral and Brain-Imaging Evidence," Science, vol. 284, no. 5416, May 7, 1999, pp. 970- 974. Douglas Hofstadter, Metamagical Themas: Questing for the Essence of Mind and Pattern, Basic Books, 1985. Chapter 6, "On Number Numbness," pp. 115- 135. Robert Kanigel, The Man Who Knew Infinity: A Life of the Genius Ramanujan, Washington Square Press, 1991. Stephen C. Kleene, "Recursive predicates and quantifiers," Transactions of the American Mathematical Society, vol. 53, 1943, pp. 41- 74. Donald E. Knuth, Selected Papers on Computer Science, CSLI Publications, 1996. Chapter 2, "Mathematics and Computer Science: Coping with Finiteness," pp. 31- 57. Dexter C. Kozen, Automata and Computability, Springer-Verlag, 1997. ———, The Design and Analysis of Algorithms, Springer-Verlag, 1991. Shen Lin and Tibor Rado, "Computer studies of Turing machine problems," Journal of the Association for Computing Machinery, vol. 12, no. 2, April 1965, pp. 196- 212. Heiner Marxen, Busy Beaver, at http://www.drb.insel.de/~heiner/BB/. ——— and Jürgen Buntrock, "Attacking the Busy Beaver 5," Bulletin of the European Association for Theoretical Computer Science, no. 40, February 1990, pp. 247- 251. Tibor Rado, "On Non-Computable Functions," Bell System Technical Journal, vol. XLI, no. 2, May 1962, pp. 877- 884. Rudy Rucker, Infinity and the Mind, Princeton University Press, 1995. Carl Sagan, Billions & Billions, Random House, 1997. Michael Somos, "Busy Beaver Turing Machine." At http://grail.cba.csuohio.edu/~somos/bb.html. Alan Turing, "On computable numbers, with an application to the Entscheidungsproblem," Proceedings of the London Mathematical Society, Series 2, vol. 42, pp. 230- 265, 1936. Reprinted in Martin Davis (ed.), The Undecidable, Raven, 1965. Ilan Vardi, "Archimedes, the Sand Reckoner," at http://www.ihes.fr/~ilan/sand_reckoner.ps. Eric W. Weisstein, CRC Concise Encyclopedia of Mathematics, CRC Press, 1999. Entry on "Large Number" at http://www.treasure-troves.com/math/LargeNumber.html. Back to Writings page Back to Scott's homepage Back to Scott's blog

      What even is the largest number that has real world use what would be the point of bigger numbers if we cant use the big numbers we have now for real world calculations?

    1. References

      Probably interesting for background:

      Eklund, A., Frank, J., Nilsson, L., Zetterberg, A., & Mansson, J. 2024. Times of trouble - Seasonal variation in number and severity of attacks on sheep caused by large carnivores and eagles in Sweden. European Journal of Wildlife Research, 70(9): 2-11. DOI: https://doi.org/10.1007/s10344-023-01761-4

      Kvalshaug, O.J. 2013. Inter-specific patterns of depredation on domestic sheep and semi-domestic reindeer in Norway, by a large predator guild. Master Thesis, Norwegian University of Life Sciences, 36.

      Linnell, J.D.C., Nilsen, E.B., Lande, U., Herfindal, I., Odden, J., & Skogen, K. 2005. Zoning as a means of mitigating conflicts with large carnivores: Principles and reality. Conservation Biology Series-Cambridge, 9: 163-175. DOI: https://doi.org/10.1017/cbo9780511614774.011

      Mabille, G., Stien, A., Tveraa, T., Mysterud, A., Brøseth, H., & Linnell, J.D.C. 2015. Sheep farming and large carnivores: What are the factors influencing claimed losses? Ecosphere, 6(5): 1-17. DOI: https://doi.org/10.1890/es14-00444.1

      Strand, G., Hansen, I., De Boon, A., & Sandström, C. 2019. Carnivore Management Zones and their Impact on Sheep Farming in Norway. Environmental Management, 64: 537-552. DOI: https://doi.org/10.1007/s00267-019-01212-4

      Strand, G. 2020. The combined effects of centralization and carnivore management on sheep farmers and sheep farming in Norway. Human Dimensions of Wildlife, 26(4): 321-336. DOI: https://doi.org/10.1080/10871209.2020.1818895

    2. Primer Validation

      More detail is needed here, include how you determined the limit of detection of your assay. State that you used standard curves to estimate limit of detection (LOD), but see Klymus et al. (2020). Given that your assays are for qualitative purposes, the limit of quantification (LOQ) is likely not relevant in your case. Please verify this to clarify in the main text why the qPCR efficiency may be irrelevant for your assays, but the LOD is.

      Depending on who you will get as an examiner, it may be worthwhile to also mention that you did the testing according to the MIQE guidelines, which I think were incorporated into this paper (see thier Appendix S1 for the checklist):

      • Thalinger, B., Deiner, K., Harper, L. R., Rees, H. C., Blackman, R. C., Sint, D., ... & Bruce, K. (2021). A validation scale to determine the readiness of environmental DNA assays for routine species monitoring. Environmental DNA, 3(4), 823-836.

      • Bustin, S. A. (2024). Improving the quality of quantitative polymerase chain reaction experiments: 15 years of MIQE. Molecular aspects of medicine, 96, 101249.

      • Klymus, K. E., Merkes, C. M., Allison, M. J., Goldberg, C. S., Helbing, C. C., Hunter, M. E., Jackson, C. A., Lance, R. F., Mangan, A. M., Monroe, E. M., Piaggio, A. J., Stokdyk, J. P., Wilson, C. C., & Richter, C. A. (2020). Reporting the limits of detection and quantification for environmental DNA assays. Environmental DNA, 2, 271–282. https://doi.org/10.1002/edn3.29

    Annotators

    1. The number of generated instructions per step. Computing a mini-batch of gradients reducesthe variance of a stochastic gradient descent procedure. Similarly, generating multiple instructionsin each step improves the optimization stability with LLMs. On the other hand, to achieve betterperformance with a fixed budget for the number of instructions to evaluate, the number of per-stepinstructions should not be too large, so as to allow more optimization steps to incorporate richerinformation of past instructions with their accuracies. Taking both aspects into consideration, Figure 8compares the optimization performance of sampling 1 / 2 / 4 / 8 (default) / 16 instructions in eachstep, showing that sampling 8 instructions at each step overall achieves the best performance

      Số lượng chỉ dẫn được tạo ra ở mỗi bước: Việc tính toán một mini-batch của đạo hàm làm giảm phương sai của việc xuống đồi đạo hàm ngẫu nhiên. Tương tự như vậy, việc tạo ra nhiều chỉ dẫn dúng làm tăng khả năng tối ưu của LLM. Mặt khác

    2. Table 4 summarizes top instructions found on GSM8K with different scorer and optimizer LLMs.We observe that:• The styles of instructions found by different optimizer LLMs vary a lot: PaLM 2-L-IT andtext-bison ones are concise, while GPT ones are long and detailed.• Although some top instructions contain the “step-by-step” phrase, most others achieve a compa-rable or better accuracy with different semantic meanings.10

      Bảng 4 tổng hợp các chỉ dẫn tốt nhất trên bài toán GSM8K với các scorer và optimizer LLM khác nhau. Nhận xét: - Các phong cách tạo chỉ dẫn của các LLM có sự khác nhau lớn: PaLM 2-L IT và text-bison thường ngắn gọn, còn GPT thì dài và nhiều chi tiết. - Mặc dù một số chỉ dẫn tốt nhất chứ cụm từ "step-by-step", tất cả các cụm khác đều đạt kết quả tương tự hoặc cao hơn với các cụm có ý nghĩa khác.

    3. We present the results in Table 3. We randomly generate 5 problem instances for each number ofnodes n. In addition to measuring the optimality gap, on problems where the LLM finds the optimalsolutions, we also show the number of optimization steps taken to reach the global optimum. First,we observe that gpt-4 significantly outperforms gpt-3.5-turbo and text-bison across allproblem sizes. Specifically, on smaller-scale problems, gpt-4 reaches the global optimum about 4×faster than other LLMs. On larger-scale problems, especially with n = 50, gpt-4 still finds solutionswith a comparable quality to heuristic algorithms, while both text-bison and gpt-3.5-turboget stuck at local optima with up to 20× worse optimality gaps.

      Với mỗi số lượng n điểm, 5 tập điểm khác nhau sẽ được tạo ngẫu nhiên. Ngoài việc đánh giá dựa trên khoảng cách tối ưu, bài báo còn đánh giá dựa trên số lượng bước tối ưu cần thực hiện để đạt được tối ưu toàn cục. Nhận xét: - gpt-4 tốt hơn nhiều so với 2 mô hình còn lại ở tất cả các kích thước bài toán. Cụ thể, với các bài toán có kích thước nhỏ, gpt-4 đạt tối ưu toàn cục nhanh gấp 4 lần so với các mô hình khác.

    4. We generate the problem instances by sampling n nodes with both x and y coordinates in [−100, 100].We use the Gurobi solver (Optimization et al., 2020) to construct the oracle solutions and compute theoptimality gap for all approaches, where the optimality gap is defined as the difference between thedistance in the solution constructed by the evaluated approach and the distance achieved by the oraclesolution, divided by the distance of the oracle solution. Besides evaluating OPRO with differentLLMs including text-bison, gpt-3.5-turbo and gpt-4, we also compare OPRO to thefollowing heuristics

      Cài đặt của bài toán: - Lấy mẫu ngẫu nhiên n điểm với 2 giá trị tọa độ x và y đều nằm trong khoảng [-100, 100]. - Gurobi solver được sử dụng để tạo giải pháp ground-truth. Điểm đánh giá được sử dụng cho các giải pháp được sinh ra là khoảng cách tối ưu (optimality gap). Trong đó, khoảng cách tối ưu được định nghĩa là hiệu giữa giải pháp được tạo sinh và giải pháp ground-truth sau đó chia cho giá trị của giải pháp ground-truth.

    5. Models. The LLMs we use as the optimizer and the scorer are:• Optimizer LLM: Pre-trained PaLM 2-L (Anil et al., 2023), instruction-tuned PaLM 2-L(denoted PaLM 2-L-IT), text-bison, gpt-3.5-turbo, and gpt-4.• Scorer LLM: Pre-trained PaLM 2-L and text-bison.With pre-trained PaLM 2-L as the scorer, the optimizer LLM generates A_begin instructions.Since text-bison has been instruction-tuned, the optimizer LLM generates Q_begin and Q_endinstructions when text-bison is used as the scorer.

      Các mô hình được sử dụng trong thực nghiệm: - Optimizer LLM: pretrained PaLM 2-L, instruction-tuned PaLM 2-L, text-bison. gpt-3.5 turbo và gpt-4 - Scorer LLM: pretrained PaLM 2-L và text-bison

      Với pretrained PaLM 2-L là scorer LLM, các optimizer LLM sẽ tạo sinh các chỉ dẫn A_begin, Còn text-bison là mô hình đã dược instruction-tuned, các optimizer LLM sẽ tạo sinh các chỉ dẫn Q_begin và Q_end khi text-bison làm scorer LLM

    1. Dominic system (after Dominic O'Brien is a Person-Action image association system for numbers in specific order. Uses it to turn two digit numbers into famous people. Also associates an action with a famous person, representing the same two digit number. Now you can imagine a 4 digit number as a person doing an action A mobile number would be 2 person action combinations in sequence. The upfront work is remembering the persons and actions as images for each of the 00-99 two digit numbers. Then putting a four digit number together requires putting them in sequence e.g. in a one of your preselected [[Memory palaces 20201007192310]], The act of remembering is constructing the images and placing them in the memory palace of choice.

      There is also a Person-Action-Object system PAO, which allows you to do three pairs of digits in one image. Allowing 1 million numbers to remember.

    1. Tim Ferris posting a text by Gabriel Wyner from 2014 on learning a new language in several steps 1) hear the novel sounds in the language and how to spell them 2) learn a list of basic words by connecting them to their image not their translatiojn 3) learn (simplified) grammar 4) continue the game (adding focused vocab, reading, listening speaking etc)

    1. Tip! Financiële regulering is dus noodzakelijk om een stabiele en betrouwbare financiële sector te waarborgen. Het is van belang voor (1) het behouden van vertrouwen, (2) het corrigeren van marktimperfecties, (3) het beheersen van systeemrisico’s, (4) het beschermen van consumenten, (5) het beperken van moral hazard en (6) het bestrijden van onethisch gedrag. Zonder deze regulering zou de financiële sector niet in staat zijn om effectief bij te dragen aan de groei en stabiliteit van de reële economie.

      Stamp

    1. References

      Probably interesting for background:

      Eklund, A., Frank, J., Nilsson, L., Zetterberg, A., & Mansson, J. 2024. Times of trouble - Seasonal variation in number and severity of attacks on sheep caused by large carnivores and eagles in Sweden. European Journal of Wildlife Research, 70(9): 2-11. DOI: https://doi.org/10.1007/s10344-023-01761-4

      Kvalshaug, O.J. 2013. Inter-specific patterns of depredation on domestic sheep and semi-domestic reindeer in Norway, by a large predator guild. Master Thesis, Norwegian University of Life Sciences, 36.

      Linnell, J.D.C., Nilsen, E.B., Lande, U., Herfindal, I., Odden, J., & Skogen, K. 2005. Zoning as a means of mitigating conflicts with large carnivores: Principles and reality. Conservation Biology Series-Cambridge, 9: 163-175. DOI: https://doi.org/10.1017/cbo9780511614774.011

      Mabille, G., Stien, A., Tveraa, T., Mysterud, A., Brøseth, H., & Linnell, J.D.C. 2015. Sheep farming and large carnivores: What are the factors influencing claimed losses? Ecosphere, 6(5): 1-17. DOI: https://doi.org/10.1890/es14-00444.1

      Strand, G., Hansen, I., De Boon, A., & Sandström, C. 2019. Carnivore Management Zones and their Impact on Sheep Farming in Norway. Environmental Management, 64: 537-552. DOI: https://doi.org/10.1007/s00267-019-01212-4

      Strand, G. 2020. The combined effects of centralization and carnivore management on sheep farmers and sheep farming in Norway. Human Dimensions of Wildlife, 26(4): 321-336. DOI: https://doi.org/10.1080/10871209.2020.1818895

    2. Primer Validation

      More detail is needed here, include how you determined the limit of detection of your assay. State that you used standard curves to estimate limit of detection (LOD), but see Klymus et al. (2020). Given that your assays are for qualitative purposes, the limit of quantification (LOQ) is likely not relevant in your case. Please verify this to clarify in the main text why the qPCR efficiency may be irrelevant for your assays, but the LOD is.

      Depending on who you will get as an examiner, it may be worthwhile to also mention that you did the testing according to the MIQE guidelines, which I think were incorporated into this paper (see thier Appendix S1 for the checklist):

      • Thalinger, B., Deiner, K., Harper, L. R., Rees, H. C., Blackman, R. C., Sint, D., ... & Bruce, K. (2021). A validation scale to determine the readiness of environmental DNA assays for routine species monitoring. Environmental DNA, 3(4), 823-836.

      • Bustin, S. A. (2024). Improving the quality of quantitative polymerase chain reaction experiments: 15 years of MIQE. Molecular aspects of medicine, 96, 101249.

      • Klymus, K. E., Merkes, C. M., Allison, M. J., Goldberg, C. S., Helbing, C. C., Hunter, M. E., Jackson, C. A., Lance, R. F., Mangan, A. M., Monroe, E. M., Piaggio, A. J., Stokdyk, J. P., Wilson, C. C., & Richter, C. A. (2020). Reporting the limits of detection and quantification for environmental DNA assays. Environmental DNA, 2, 271–282. https://doi.org/10.1002/edn3.29

    Annotators

    1. Методика Фейнмана: 1. Определиться с тем, что хочешь изучить 2. Выучить это, либо научить кого то другого 3. Если есть пробелы в понимании - вернутся к материалу и найти их. 4. Упростить понятное и привести аналогии, примеры

    1. 2. der auf der andren Seite befindliche Wert oder vergegenständlichte Arbeit muß eine Akkumulation von Gebrauchswerten sein, hinreichend groß, um die gegenständlichen Bedingungen zu liefern nicht bloß zur Produktion der Produkte oder Werte, nötig um das lebendige Arbeitsvermögen zu reproduzieren oder zu erhalten, sondern um Surplusarbeit zu absorbieren – das ||48|objektive Material für sie herzugeben;

      Marx, Grundrisse, Heft 4

    1. the recommendations focus on four categories: in-clusive design and democratic innovation, meaningful participation in AI governance, transparency and account-ability for harm prevention, and effective access to justice.

      Categorías:

      1. Diseño inclusivo e innovación democrática

      2. Participación significativa en la gobernanza de la IA

      3. Transparencia y rendición de cuentas para la prevención de daños

      4. Acceso efectivo a la justicia

    1. 4. When material in the mantle is heated at Earth's core, it rises towards Earth's surface. As it rises, it cools, moves laterally, becomes denser, and sinks, creating a ___ cell.

      Convection

    2. 4. When material in the mantle is heated at Earth's core, it rises towards Earth's surface. As it rises, it cools, moves laterally, becomes denser, and sinks, creating a ___ cell.

      convection

    1. Standard care includes diet therapyto limit phenylalanine and tyrosine intake and lifelong treatmentwith 2-(2-nitro-4-trifluoromethylbenzoyl)-1,3-cyclohexanedione(NTBC; also known as nitisinone), a potent inhibitor of the up-stream enzyme HPD (Alvarez et al. 2017), to prevent toxic metab-olite accumulation in the liver and kidneys [Grompe et al. 1995;Lindstedt et al. 1992; Larochelle et al. 2012 (Fig. 1a)]. Importantly,non-compliance with NTBC and diet treatment is a serious chal-lenge for the clinical management of HT-I and results in higherrisks of patients developing hepatocellular carcinoma, as well aspainful corneal lesions due to high circulating tyrosine levelsand neurological crises due to high SA levels (

      From what I gather, people with this mutation have to go this lifelong treatment. My question is how intense is this treatment? I am not too familiar with the intake of these chemicals, are they harmful, or are they similar to simple vitamins? Can people with this condition live "normal" lives or not? I think these questions are good to know if the researchers are trying to find more effective treatment methods.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1:

      - Summary: 

      Recordings were made from the dentate nucleus of two monkeys during a decision-making task. Correlates of stimulus position and stimulus information were found to varying degrees in the neuronal activities. 

      We agree with this summary.

      - Strengths: 

      A difficult decision-making task was examined in two monkeys.

      We agree with this statement.

      - Weaknesses: 

      One of the monkeys did not fully learn the task. The manuscript lacked a coherent hypothesis to be tested, and no attempt was made to consider the possibility that this part of the brain may have little to do with the task that was being studied. 

      We understand the reviewers concern. It is correct that one of the monkeys (Mi) did not perform at a high level, but it should be noted that both monkeys learned significantly above chance level. Therefore, we would argue that both monkeys in fact did learn the task but Mi’s performance was suboptimal. This difference in the performance levels gave us a rare opportunity to dive deeper into the reasons why some animals perform better than the others and we show that Mi (the lower performing monkey) paid more attention to the outcome of the previous trial – this is evident from our behavioural and decoding models.

      We tested the overall hypothesis that neurons of the nucleus dentate can dynamically modulate their activity during a visual attention task, comprising not only sensorimotor but also cognitive attentional components. Many neurons in the dentate are multimodal (Figure 3C-D) which was something that was theorized. One of the specific hypotheses that we tested is that the dentate cells can be direction-selective for both the sensorimotor and cognitive component. Given that many of the recorded cells showed direction-selectivity in their firing rate modulation for gap directions and/or stimulus directions, we provide strong evidence that this hypothesis is correct. We have now spelled out this hypothesis more explicitly in the introduction of the revised version. We now also explain better why we tested this specific hypothesis. Indeed, earlier studies in primates such as those by Herzfeld and colleagues (2018, Nat. Neuro.) and van Es and colleagues (2019, Current Biol) have indicated that direction-selectivity of cerebellar activity may occur in various sensorimotor domains.

      We also appreciate the comment of this Reviewer that in our original submission we did not show our attempt to consider the possibility that this part of the brain may have little to do with the task that was being studied. We in fact did consider this possibility in that we successfully injected 3 ml of muscimol (5 μg/ml, Sigma Aldrich) into the dentate nucleus in vivo in one of the monkeys (Mo). This application resulted in a reduction of more than 10% in correct responses of the covert attention task after 45 minutes, whereas the performance remained the same following saline injections. Unfortunately, due to the timing of the experiments and Covid19-related laboratory restrictions we were unable to perform these experiments in the other monkey or repeat them in Mo. We aim to replicate this in future experiments and publish it when we have full datasets of at least two monkeys available. For this paper we have prioritized our tracing experiments, highlighting the connections of the dentate nucleus with attention related areas in brainstem and cortex in both monkeys, following perfusion.

      - Perhaps the large differences in performance between the two subjects can be used as a way to interpret the neural data's relationship to behavior, as it provided a source of variance. This is what we would hypothesize if we believed that this area of the brain is playing a significant role in the task. If one animal learns much more poorly, and this region of the brain is important for that behavior, then shouldn't there be clear, interpretable differences in the neural data? 

      We thank the Reviewer for this comment. We have added a new Supplementary Figure 2, in which we present the data for both monkeys separately in the revised manuscript. Comparing the two datasets however, we see more commonalities related to the significant learning in both monkeys than differences that might be related to their different levels of learning. We have therefore decided to show the different datasets transparently in the new Supplementary Figure 2, but to stay on the conservative side in our interpretations.

      - How should we look for these differences? A number of recent papers in mice have uncovered a large body of data showing that during the deliberation period, when the animal is interpreting a sensory stimulus (often using the whisker system), there is ramping activity in a principal component space among neurons that contribute to the decision. This ramping activity is present (in the PCA space) in the motor areas of the cortex, as well as in the medial and lateral cerebellar nuclei. Perhaps a similar computational approach would benefit the current manuscript. 

      We also appreciate this point. We have done the principal component analysis accordingly, and we indeed do find the ramping activity in several components of the dentate activity of both monkeys (Mi and Mo). We have now added a new Supplementary Figure 3 with the first three components of both correct and incorrect trials for Mi and Mo, highlighting their potential contribution.

      - What is the hypothesis that is being tested? That is, what do you think might be the function of this region of the cerebellum in this task? It seems to me that we are not entirely in the dark, as previous literature on mice decision-making tasks has produced a reasonable framework: the deliberation period coincides with ramping activity in many regions of the frontal lobe and the cerebellum. Indeed, the ramp in the cerebellum appears to be a necessary condition for the ramp to be present in the frontal lobe. Thus, we should see such ramping activity in this task in the dentate. When the monkey makes the wrong choice, the ramp should predict it. If you don't see the ramping activity, then it is possible that the hypothesis is wrong, or that you are not recording from the right place. 

      It is indeed one of our specific hypotheses that the dentate cells can be direction-selective for the preparing cognitive component and/or sensorimotor response. We provide evidence that this hypothesis may be correct when we analyze the regular time response curves (see Figure 2 and the new Supplementary Figure 2 where the data of both monkeys are now presented separately). Moreover, we have now verified this by analysing the ramping curves of PCA space (new Supplementary Figure 3) and firing frequency of DN neurons that modulated upon presentation of the C-stimulus (new Supplementary Figure 4). These figures and findings are now referred to in the main text.

      - As this is a difficult task that depends on the ability of the animals to understand the meaning of the cues, it is quite concerning that one of the monkeys performed poorly, particularly in the early sessions. Notably, the disparity between the two subjects is rather large: one monkey at the start of the recordings achieved a performance that was much better than the second monkey did at the end of the recording sessions. You highlighted the differences in performance in Figure 1D and mentioned that you started recording once the animals reached 60% performance. However, this did not make sense to me as the performance of Mi even after the final day of recording did not reach the performance of Mo on the first day of recording. Thus, in contrast to Mo, Mi appeared to be not ready for the task when the recording began.

      We understand this point. However, please note that the learning performance of the monkeys concerned retraining sessions after they had had several weeks of vacation. So, even though it is correct that one of the two monkeys had a very good consolidation and started already at a relatively high level on the first retraining session, the other one also started and ended at a level above chance level (the y-axis starts at 0.5). We now highlight this point better in the Results section.

      - One objective of having two monkeys is to illustrate that what is true in one animal is also true in the other. In some figures, you show that the neural data are significantly different, while in others you combine them into one. Thus, are you confident that the neural data across the animals should be combined, as you have done in Figure 2? Perhaps you can use the large differences in performance as a source of variance to find meaning in the neural data. 

      This is a valid question; as highlighted above, we have now addressed this point in the new Supplementary Figure 2, where the data for both monkeys are presented separately. Given the sample sizes and level of variances, it is in general difficult to draw conclusions about the potential differences and contributions, but the data are sufficiently transparent to observe common trends. With regard to linking differences in the neural data to the differences in performance level, please also consider Figure 4, the new Supplementary Figure 3 (with the ramping PCA component) and new Supplementary Figure 4 (with the additional analysis of the ramping activity of DN neurons that modulated upon presentation of the C-stimulus), which suggests that the ramping stage of Mo starts before that of Mi. This difference highlights the possibility that injecting accelerations of the simple spike modulations of Purkinje cells in the cerebellar hemispheres into the complex of cerebellar nuclei may be instrumental in improving the performance of responses to covert attention, akin to what has been shown for the impact of Purkinje cells of the vestibulocerebellum on eye movement responses to vestibular stimulation (De Zeeuw et al. 1995, J Neurophysiol). This possibility is now also raised in the Discussion.

      - How do we know that these neurons, or even this region of the brain, contribute to this task? When a new task is introduced, the contributions of the region of the brain that is being studied are usually established via some form of manipulation. This question is particularly relevant here because the two subjects differed markedly in their performance, yet in Figure 3 you find that a similar percentage of neurons are responding to the various elements of the task.

      We appreciate this question. As highlighted above, we are refraining from showing our muscimol manipulation (3 ml of 5 μg/ml muscimol, Sigma Aldrich), as it only concerns 1 successful dataset and 1 control experiment. We hope to replicate this reversible lesion experiment in the future and publish it when we have full new datasets of at least two monkeys available. As explained above, for this paper we have sacrificed both monkeys following a timed perfusion, so as to have similar survival times for the transport of the neuro-anatomical tracer involved.  

      - Behavior in both animals was better when the gap direction was up/down vs. left/right. Is this difference in behavior encoded during the time that the animal is making a decision? Are the dentate neurons better at differentiating the direction of the cue when the gap direction is up/right vs. left/right? 

      These data have now been included in the new Supplementary Figure 2; we did not observe any significant differences in this respect.

      Reviewer #2:

      - The authors trained monkeys to discriminate peripheral visual cues and associate them with planning future saccades of an indicated direction. At the same time, the authors recorded single-unit neural activity in the cerebellar dentate nucleus. They demonstrated that substantial fractions of DN cells exhibited sustained modulation of spike rates spanning task epochs and carrying information about stimulus, response, and trial outcome. Finally, tracer injections demonstrated this region of the DN projects to a large number of targets including several known to interconnect the visual attention network. The data compellingly demonstrate the authors' central claims, and the analyses are well-suited to support the conclusions. Importantly, the study demonstrates that DN cells convey many motor and nonmotor variables related to task execution, event sequencing, visual attention, and arguably decision-making/working memory. 

      We thank the Reviewer for this positive and constructive feedback.

      - The study is solid and I do not have major concerns, but only points for possible improvement. 

      We thank the Reviewer for this positive feedback.

      - A key feature of this data is the extended changes/ramps in DN output across epochs (Figure 2). Crudely, this presents a challenge for the view that DN output mainly drives motor effectors, as the saccade itself lasts only a tiny fraction of the overall task. Some discussion of this dichotomy in thinking about the function(s) of the cerebellum, vis a vis the multifarious DN targets the authors demonstrate here, etc., would be helpful. 

      We agree with the Reviewer and we have expanded our Discussion on this point, also now highlighting the outcome of the new PCA analysis recommended by Reviewer 1 (see the new Supplementary figure Figure 3).

      - A high-level suggestion on the data: the presentation of the data focuses (sensibly) on the representation of the stimulus and response epochs (Figures 2-3). Yet, the authors then show that from decoding, it is, in fact, a trial outcome that is best represented in the population (Figure 4). While there is nothing 'wrong' with this, it reads slightly incongruously, and the reader does a bit of a "double take" back to the previous figures to see if they missed examples of the trial-outcome signals, but the previous presentations only show correct trials. Consider adding somewhere in the first 3 main figures some neural data showing comparisons with incorrect trials. This way, the reader develops prior expectations for the outcome decoding result and frame of reference for interpreting it. On a related note, the text contains an earlier introduction of this issue (p24 last sentence) and p25 paragraph 1 cites Figure 3D and 3E for signals "related to the absence of reward" - but the caption says this includes only correct trials? 

      We thank the Reviewer for bringing up these points. We have addressed the textual suggestions. Moreover, we have done the PCA analysis suggested by Reviewer 1 for both the correct and incorrect trials (see Supplementary material).

      - P29: The discrepancy in retrograde labeling between monkeys (2 orders of magnitude): I realize the authors can't really do anything about this, but the difference is large enough to warrant concerns in the interpretation (how did the tracer spread over the drastically larger area? Isotropically? Could it cross more "hard boundaries" and incorporate qualitatively different inputs/outputs?). A small discussion of possible caveats in interpreting the outcomes would be helpful. 

      We fully agree with this comment. As highlighted in the text, in both monkeys we first identified the optimal points for injection in the dentate nucleus electrophysiologically and we used the same pump with the same settings to carry out the injections, but even so the differences are substantial. We suspect that the larger injection might have been caused by an air bubble trapped in the syringe or a deviation in the stock solution, but we can never be sure of that. We have added a potential explanation for the caveat that might have played a role.

      - And a list of quick points: 

      We have addressed all points listed below; we want to thank the Reviewer for bringing them up.

      P3 paragraph 2 needs comma "in daily life,". 

      P4 paragraph 2 "C-gap" terminology not previously defined. 

      P4 paragraph 2 "animals employed different behavioral strategies". Grammatically, you should probably say "each animal employed a different behavioral strategy," but also scientifically the paragraph doesn't connect this claim to anything about the DN (whereas, e.g., the abstract does make this connection clear). 

      P5 paragraph 1 "theca" should be "the". 

      P6 paragraph 1 problem with ignashenkova citation insert. 

      P10 paragraph 1 I think the spike rate "difference between highest and lowest" is not exactly the same as "variance," you might want to change the terminology. 

      P10 paragraph 1 should probably say "To determine if a cell preferentially modulated". 

      P10 paragraph 1 last sentence the last clause could be clearer. 

      P17 paragraph 2 should be something like "as well as those by Carpenter and..."? 

      P20 caption: consider "...directionality in the task: only one C-stim...". 

      P20 caption: consider "to the left and right in the [L/R] task...to the top/bottom in the [U/D] task". 

      Fig1E and S1 - is there a physical meaning of the "weight" unit, and if none, can this be transformed into a more meaningful unit? 

      P21 paragraph 1 consider "activity was recorded for 304 DN neurons...". 

      P21 paragraph 1 "correlations with the temporal windows" it's not clear how activity can "correlate" with a time window, consider rephrasing (activity levels changed during these time epochs, depending on stimulus identity). 

      P21 paragraph 1 should be "by comparing the number of spikes in a bin...". 

      P22 paragraph 2 "when we aligned the neurons to the time of maximum change" needs clarification. The maximum change of what? And per neuron? Across the population? 

      P22 paragraph 2 "than that of the facilitating" should be "than did the facilitating units". 

      P24 paragraph 1 needs a comma and rewording "Within each direction, trials are sorted by the time of saccade onset". 

      P24 paragraph 1 should probably say "Same as in G, but for suppressed cells". 

      P24 paragraph 2 should say "more than one task event" not "events". 

      P24 paragraph 2 needs a comma "To fully characterize the neural responses, we fitted". 

      P25 paragraph 1 should probably say "we sampled from similar populations of DN". 

      P34 paragraph 3 consider rephrasing the sentence that contains both "dissociation" and "dissociate". 

      P37 last line: consider "coordination of cerebellum and cerebral cortex *in* higher order mental..."? 

      P38 paragraph 1 citation needed for "kinematics of goal-directed hand actions of others"? 

      P38 paragraph 1 commas probably not needed "map visual input, from high-level visual regions, onto..." 

      References

      - Herzfeld D.J., Kojima Y, Soetedjo R, Shadmehr R (2018) Encoding of error and learning to correct that error by the Purkinje cells of the cerebellum. Nat Neurosci 21:736–743.

      - van Es, D.M., van der Zwaag W., and Knapen T. (2019) Topographic Maps of Visual Space in the Human Cerebellum. Current Biol Volume 29, Issue 10p1689-1694.e3May 20.

      - De Zeeuw CI, Wylie DR, Stahl JS, Simpson JI. (1995) Phase relations of Purkinje cells in the rabbit flocculus during compensatory eye movements. J Neurophysiol. Nov;74(5):2051-64. doi: 10.1152/jn.1995.74.5.2051.

    1. References

      Probably interesting for background:

      Eklund, A., Frank, J., Nilsson, L., Zetterberg, A., & Mansson, J. 2024. Times of trouble - Seasonal variation in number and severity of attacks on sheep caused by large carnivores and eagles in Sweden. European Journal of Wildlife Research, 70(9): 2-11. DOI: https://doi.org/10.1007/s10344-023-01761-4

      Kvalshaug, O.J. 2013. Inter-specific patterns of depredation on domestic sheep and semi-domestic reindeer in Norway, by a large predator guild. Master Thesis, Norwegian University of Life Sciences, 36.

      Linnell, J.D.C., Nilsen, E.B., Lande, U., Herfindal, I., Odden, J., & Skogen, K. 2005. Zoning as a means of mitigating conflicts with large carnivores: Principles and reality. Conservation Biology Series-Cambridge, 9: 163-175. DOI: https://doi.org/10.1017/cbo9780511614774.011

      Mabille, G., Stien, A., Tveraa, T., Mysterud, A., Brøseth, H., & Linnell, J.D.C. 2015. Sheep farming and large carnivores: What are the factors influencing claimed losses? Ecosphere, 6(5): 1-17. DOI: https://doi.org/10.1890/es14-00444.1

      Strand, G., Hansen, I., De Boon, A., & Sandström, C. 2019. Carnivore Management Zones and their Impact on Sheep Farming in Norway. Environmental Management, 64: 537-552. DOI: https://doi.org/10.1007/s00267-019-01212-4

      Strand, G. 2020. The combined effects of centralization and carnivore management on sheep farmers and sheep farming in Norway. Human Dimensions of Wildlife, 26(4): 321-336. DOI: https://doi.org/10.1080/10871209.2020.1818895

    2. Primer Validation

      More detail is needed here, include how you determined the limit of detection of your assay. State that you used standard curves to estimate limit of detection (LOD), but see Klymus et al. (2020). Given that your assays are for qualitative purposes, the limit of quantification (LOQ) is likely not relevant in your case. Please verify this to clarify in the main text why the qPCR efficiency may be irrelevant for your assays, but the LOD is.

      Depending on who you will get as an examiner, it may be worthwhile to also mention that you did the testing according to the MIQE guidelines, which I think were incorporated into this paper (see thier Appendix S1 for the checklist):

      • Thalinger, B., Deiner, K., Harper, L. R., Rees, H. C., Blackman, R. C., Sint, D., ... & Bruce, K. (2021). A validation scale to determine the readiness of environmental DNA assays for routine species monitoring. Environmental DNA, 3(4), 823-836.

      • Bustin, S. A. (2024). Improving the quality of quantitative polymerase chain reaction experiments: 15 years of MIQE. Molecular aspects of medicine, 96, 101249.

      • Klymus, K. E., Merkes, C. M., Allison, M. J., Goldberg, C. S., Helbing, C. C., Hunter, M. E., Jackson, C. A., Lance, R. F., Mangan, A. M., Monroe, E. M., Piaggio, A. J., Stokdyk, J. P., Wilson, C. C., & Richter, C. A. (2020). Reporting the limits of detection and quantification for environmental DNA assays. Environmental DNA, 2, 271–282. https://doi.org/10.1002/edn3.29

    Annotators

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      Summary:

      The pituitary gonadotropins, FSH and LH, are critical regulators of reproduction. In mammals, synthesis and secretion of FSH and LH by gonadotrope cells are controlled by the hypothalamic peptide, GnRH. As FSH and LH are made in the same cells in mammals, variation in the nature of GnRH secretion is thought to contribute to the differential regulation of the two hormones. In contrast, in fish, FSH and LH are produced in distinct gonadotrope populations and may be less (or differently) dependent on GnRH than in mammals. In the present manuscript, the authors endeavored to determine whether FSH may be independently controlled by a distinct peptide, cholecystokinin (CCK), in zebrafish.

      Strengths:

      The authors demonstrated that the CCK receptor is enriched in FSH-producing relative to LH-producing gonadotropes, and that genetic deletion of the receptor leads to dramatic decreases in gonadotropin production and gonadal development in zebrafish. Also, using innovative in vivo and ex vivo calcium imaging approaches, they show that LH- and FSH-producing gonadotropes preferentially respond to GnRH and CCK, respectively. Exogenous CCK also preferentially stimulated FSH secretion ex vivo and in vivo.

      Weaknesses:

      The concept that there may be a distinct FSH-releasing hormone (FSHRH) has been debated for decades. As the authors suggest that CCK is the long-sought FSHRH (at least in fish), they must provide data that convincingly leads to such a conclusion. In my estimation, they have not yet met this burden. In particular, they show that CCK is sufficient to activate FSH-producing cells, but have not yet demonstrated its necessity. Their one attempt to do so was using fish in which they inactivated the CCK receptor using CRISPR-Cas9. While this manipulation led to a reduction in FSH, LH was affected to a similar extent. As a result, they have not shown that CCK is a selective regulator of FSH.

      Our conclusion regarding the necessity of CCK signaling for FSH secretion is based on the following evidence:

      (1) CCK-like receptors are expressed in the pituitary gland predominantly on FSH cells.

      (2) Application of CCK to pituitaries elicits FSH cell activation and to a much lesser degree activation of LH cells.  (calcium imaging assays)

      (3) Application of CCK to pituitaries and by injections in-vivo significantly increased only FSH release.

      (4) Mutating the FSH-specific CCK receptor in a different species of fish (medaka) also causes a complete shutdown of FSH production and phenocopies a fsh-mutant phenotype (Uehara, Nishiike et al. 2023).

      Taken together, we believe that this data strongly supports the conclusion that CCK is necessary for FSH production and release from the fish pituitary. Admittedly, the overlapping effects of CCK on both FSH and LH cells in zebrafish (evident in both our calcium imaging experiments and especially in the KO phenotype) complicates the interpretation of the phenotype. We speculate that the effect of CCK on LH cells in zebrafish can be caused either by paracrine signaling within the gland or by the effects of CCK on GnRH neurons that were shown to express CCK receptors .

      In the current version, we emphasize that CCK also induces LH secretion. Although it does not affect LH to the same extent as FSH, an overlap does exist. This is mentioned in the abstract and discussion.

      Moreover, they do not yet demonstrate that the effects observed reflect the loss of the receptor's function in gonadotropes, as opposed to other cell types.

      Although there is evidence for the expression of CCK receptor in other tissues, we do show a direct decrease of FSH and LH expression in the gonadotrophs of the pituitary of the mutant fish; taken together with its significant expression in FSH cells compared to the rest of the cells of the pituitary in the cell specific transcriptomic, it is the most reasonable explanation for the mutant phenotype.

      Unfortunately, unlike in mice, technologies for conditional knockout of genes in specific cell types are not yet available for our model and cell types. Additional tissue distribution of the three receptors types of CCK was added in supplementary figure 1, from this tissue distribution it can be appreciated how in the pituitary only CCKBRA (our identified CCK receptor) is expressed, while in other tissues it is either not expressed or expressed with the additional CCK receptors that can compensate its activity.

      It also is not clear whether the phenotypes of the fish reflect perturbations in pituitary development vs. a loss of CCK receptor function in the pituitary later in life. Ideally, the authors would attempt to block CCK signaling in adult fish that develop normally. For example, if CCK receptor antagonists are available, they could be used to treat fish and see whether and how this affects FSH vs. LH secretion.

      While the observed gonadal phenotype of the KO (sex inversed fish) should have a developmental origin since it requires a long time to manifest, the effect of the KO on FSH and LH cells is probably more acute. Unfortunately a specific antagonist that affect only CCKRBA and not the other CCK receptors wasn’t identified yet.

      In the Discussion, the authors suggest that CCK, as a satiety factor, may provide a link between metabolism and reproduction. This is an interesting idea, but it is not supported by the data presented. That is, none of the results shown link metabolic state to CCK regulation of FSH and fertility. Absent such data, the lengthy Discussion of the link is speculative and not fully merited.

      In the revised manuscript, we provided data to link cck with metabolic status in supplementary figure 1 and modified the discussion to tone down the link between metabolic status to and reproductive state.

      Also in the Discussion, the authors argue that "CCK directly controls FSH cells by innervating the pituitary gland and binding to specific receptors that are particularly abundant in FSH gonadotrophs." However, their imaging does not demonstrate innervation of FSH cells by CCK terminals (e.g., at the EM level).

      Innervation of the fish pituitary does not imply a synaptic-like connection between axon terminals and endocrine cells. In fact, such connections are extremely rare, and their functionality is unclear. Instead, the mode of regulation between hypothalamic terminals and endocrine cells in the fish pituitary is more similar to "volume transmission" in the CNS, i.e. peptides are released into the tissue and carried to their endocrine cell targets by the circulation or via diffusion. A short explanation was added in lines 395-398 in the discussion

      Moreover, they have not demonstrated the binding of CCK to these cells. Indeed, no CCK receptor protein data are shown.

      Our revised manuscript  includes detailed experiments showing the activation of the receptor by its homologous ligand, supplementary Figure 1 includes a transactivation  assay of CCK to its receptor and the effect of the different mutants on the activation of the receptor. Unfortunately, no antibody is available against this fish specific receptor (one of the caveats of working with fish models); therefore, we cannot present receptor protein data.

      The calcium responses of FSH cells to exogenous CCK certainly suggest the presence of functional CCK receptors therein; but, the nature of the preparations (with all pituitary cell types present) does not demonstrate that CCK is acting directly in these cells.

      We agree with the reviewer that there are some disadvantages in choosing to work with a whole-tissue preparation. However, we believe that the advantages of working in a more physiological context far outweigh the drawbacks as it reflects the natural dynamics more precisely. Since our transcriptome data, as well as our ISH staining, show that the CCK receptor is exclusively expressed in FSH cells, it is improbable that the observed calcium response is mediated via a different pituitary cell type.

      Indeed, the asynchrony in responses of individual FSH cells to CCK (Figure 4) suggests that not all cells may be activated in the same way. Contrast the response of LH cells to GnRH, where the onset of calcium signaling is similar across cells (Figure 3).

      The difference between the synchronization levels of LH and FSH cells activity stems from the gap-junction mediated coupling between LH cells that does not exist between FSH cells(Golan, Martin et al. 2016). Therefore, the onset of calcium response in FSH cells is dependent on the irregular diffusion rate of the peptide within the preparation, whereas the tight homotypic coupling between LH cells generates a strong and synchronized calcium rise that propagates quickly throughout the entire population

      The differences in connectivity between LH and FSH cells is mentioned in lines 194-195

      Finally, as the authors note in the Discussion, the data presented do not enable them to conclude that the endogenous CCK regulating FSH (assuming it does) is from the brain as opposed to other sources (e.g., the gut).

      We agree with the reviewer that, for now, we are unable to determine whether hypothalamic or peripheral CCK are the main drivers of FSH cells. While the strong innervation of the gland by CCK-secreting hypothalamic neurons strengthens the notion of a hypothalamic-releasing hormone and also fits with the dogma of the neural control of the pituitary gland in fish (Ball 1981), more experiments are required to resolve this question.

      Reviewer #2 (Public Review):

      Summary:

      This manuscript builds on previous work suggesting that the CCK peptide is the releasing hormone for FSH in fishes, which is different than that observed in mammals where both LH and FSH release are under the control of GnRH. Based on data using calcium imaging as a readout for stimulation of the gonadotrophs, the researchers present data supporting the hypothesis that CCK stimulates FSH-containing cells in the pituitary. In contrast, LH-containing cells show a weak and variable response to CCK but are highly responsive to GnRH. Data are presented that support the role of CCK in the release of FSH. Researchers also state that functional overlap exists in the potency of GnRH to activate FSH cells, thus the two signalling pathways are not separate. The results are of interest to the field because for many years the assumption has been that fishes use the same signalling mechanism. These data present an intriguing variation where a hormone involved in satiation acts in the control of reproduction.

      Strengths:

      The strengths of the manuscript are that researchers have shed light on different pathways controlling reproduction in fishes.

      Weaknesses:

      Weaknesses are that it is not clear if multiple ligand/receptors are involved (more than one CCK and more than one receptor?). The imaging of the CCK terminals and CCK receptors needs to be reinforced.

      Reviewer consultation summary: 

      The data presented establish sufficiency, but not necessity of CCK in FSH regulation. The paper did not show that CCK endogenously regulates FSH in fish. This has not been established yet.

      This is a very important comment, also raised by reviewer 1. To avoid repetition, please see our detailed response to the comment above.

      The paper presents the pharmacological effects of CCK on ex vivo preparations but does not establish the in vivo physiological function of the peptide. The current evidence for a novel physiological regulatory mechanism is incomplete and would require further physiological experiments. These could include the use of a CCK receptor antagonist in adult fish to see the effects on FSH and LH release, the generation of a CCK knockout, or cell-specific genetic manipulations.

      As detailed in the responses to the first reviewer, we cannot conduct conditional, cellspecific gene knockout in our model. However we did conducted KO and show the direct effect on FSH and LH secretion together with physiological characterisation of the mutant.

      Zebrafish have two CCK ligands: ccka, cckb and also multiple receptors: cckar, cckbra and cckbrb. There is ambiguity about which CCK receptor and ligand are expressed and which gene was knocked out.

      In the revised manuscript, we clarified which of the receptors are expressed (CCKRBA) and which receptor is targeted. We also provided data showing the specificity of the receptors (both WT and mutant) to the ligands. Supplementary 1 shows receptor cross-activation. The method also specifies the exact NCBI ID numbers of the targeted receptor and the antibody used for the immunostaining.

      Blocking CCK action in fish (with receptor KO) affects FSH and LH. Therefore, the work did not demonstrate a selective role for CCK in FSH regulation in vivo and any claims to have discovered FSHRH need to be more conservative.

      We agree with the reviewer that the overlap in the effect of CCK measured in the calcium activation of cells and in the KO model does not allow us to conclude selectivity. In this context, it is crucial to highlight that CCKRBA exhibits high expression on FSH cells but not on LH cells. Therefore, the effect of CCK on LH cells is likely paracrine or through GnRH neurons that were shown to express CCK receptors. In the current version, we emphasize that CCK also induces LH secretion. Although it does not affect LH to the same extent as FSH, an overlap does exist. This is mentioned in the abstract and discussion.

      The labelling of the terminals with anti-CCK looks a lot like the background and the authors did not show a specificity control (e.g. anti-CCK antibody pre-absorbed with the peptide or anti-CCK in morphant/KO animals).

      Figures colours had been updated to better visualise the specific staining of the antibody. Also, The same antibody had been previously used to mark CCK-positive cells in the gut of the red drum fish(Webb, Khan et al. 2010) , where a control (pre-absorbed with the peptide) experiment had been conducted.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      Abstract:

      The authors have not yet established that CCK is the primary regulator of FSH in vivo.

      In the new version, we highlight the leading effect of CCK on the reproductive axis, which includes FSH and LH.

      Introduction:

      The authors need to make clear earlier in the Introduction that fish have two types of gonadotropes. This information comes too late (last paragraph) currently.

      Added in line 42

      They should discuss relevant data on the differential regulation of FSH and LH in fish, as a rationale for looking for different releasing factors.

      This has been discussed in the first paragraph of the introduction

      In the last sentence of the penultimate paragraph, the authors assume that it must be a hypothalamic factor that regulates FSH. Why is this necessarily the case? Are there data indicating that a hypothalamic factor is required for FSH production in fish?

      This has been mentioned in the discussion, we do not deny that circulating CCK or CCK from other brain areas might affect FSH secretion in the pituitary (line 402-404). However, as the hypothalamus serves as the main gateway from the brain to the pituitary and contains hypophysiotropic CCK neurons it is the most reasonable assumption.

      Results:

      In the first paragraph, the authors reference three types of CCK receptors, only one of which is expressed in the pituitary. The specific receptor should be named here.

      The receptor name and NCBI id had been added in this paragraph.

      Figure 1: What specificity controls were used for the ISH in Figure 1?

      HCR- The method used to identify RNA expression and developed by Molecular Instruments (https://www.molecularinstruments.com/hcr-rnafish-protocols), do not require specific control as had been previously done with older ISH methods. The use of multiple short probes assure the specificity to the RNA.More over the expression is specific to the targeted cells.

      In Figure 1D, the red square is missing in the KO fish (at low magnification).

      This was fixed in the updated version.

      In Figure 1G, the number of dots does not correspond to the number of animals described in the figure legend. Does each point represent an animal?

      Each dot represent a fish. The order of the numbers in the legend didn’t match the order in the graph, this had been fixed in the last version

      Figure 2A: It is not clear that all FSH (GFP) cells are double-labeled. Should all double-labeled cells appear white? Many appear as green. Some quantification of the proportion of co-labeling is needed. Also, the scale bars are too small to read. Perhaps add the size of the scale bars to the legend.

      They are all double-labeled, as can be seen by the single-color images, since GFP fluorescence is stronger than RCaMP fluorescence, the double-labelling might be seen a green cells; a scale bar was added.

      Figure 2C: Is the synchronous activity of LH cells here dependent on endogenous GnRH? Can these events be blocked with a GnRH receptor antagonist?

      We currently do not have enough data to support this hypothesis and the in vivo 2 photon system is not optimal to answer these questions since these are spontaneous events which are difficult to predict. This is the main reason we moved to an ex vivo system. The similar response we receive when applying GnRH in the ex vivo system support it is GnRH activation.

      Figure 4C: As some LH cells respond to CCK, can the authors really claim that CCK is a selective regulator of FSH? What explains the heterogeneity in the response of LH cells to CCK?

      In this version, we highlight that CCK directly activates FSH but it is also affecting LH to some extent. However it is clear that the effect on FSH cells is more significant.

      Figures 5A and B: With larger Ns, some of the trends might be significant (e.g., GnRH stimulated FSH release and CCK stimulated LH release).

      Though there is a trend, the values in the Y axis reveal that the trend of response of FSH to GnRH and LH to CCK is lower then the distribution of the basal response (the before) in all of the graphs. Hence we do not believe a larger N will affect those results. We added the range of the secreted hormones concentrations in the result description to emphasize the difference in values,

      Figures 5C and D: What explains the lack of an increase in LH secretion following GnRH treatment?

      We did not measure LH Secretion in the plasma as we didn’t have enough blood, we do see an increase in LH transcription (see supplementary figure 5 – figure supplement 1)

      Also, as mRNA levels were measured (in C), reference should be made to expression rather than transcription. Not all changes in mRNA levels reflect changes in transcription.Also, remove transcription from the legend. Reference to supplementary Figure 4 in the legend should be supplementary Figure 6. Finally, in C and D, distinguish males from females (as in 5A and B).

      Modifications had been done according to the reviewer suggestions.

      Figure legends:

      The figure legends are very long. One way to shorten them is to remove descriptions of the results. The legends should indicate what is in each figure, not the results of the experiments.

      Modifications had been done according to the reviewer suggestions.

      Sample sizes should be spelled out in the legends, as they are not in the M&M.

      We made sure all sample sizes are mentioned in the legend

      Materials and Methods:

      Section 1.1 can be removed as it repeats content presented elsewhere.

      This section was removed

      Section 1.5: It is unclear what this means: "blinding was not applied to ensure tractability" Please clarify.

      This section was removed

      Reviewer #2 (Recommendations For The Authors):

      It appears that zebrafish have two ligands: ccka, cckb. Also multiple receptors: cckar, cckbra and cckbrb. Authors need to discuss this and clearly state which ligand and which receptor they are referring to in the manuscript.

      We discussed the receptor type in the first paragraph of the results, the exact synthetic peptide used is described in the methods. The 8 amino acids of the mature CCK peptide are the same between CCKa and CCKb. A sentence regarding the specificity of the antibody to the mature CCK peptide was added in line 101.

      "to GnRH puff application (300 μl of 30 μg/μl)"; (250 μl of 30 μg/ml CCK)

      Please give the final concentration to make it easy on the readers of the data.

      The molarity of the final concentration was added.

      (2.4) Differential calcium response underlies differential hormone. This section is a bit confusing to read, for example:

      "For that, we collected the medium perfused through our ex vivo system (Fig. 2a) and measured LH and FSH levels using a specific ELISA validated for zebrafish [31] while monitoring the calcium activity of the cells."

      So the authors did the ELISA while monitoring the activity (?). This sentence does not make sense: please rewrite it.

      We modified this sentence  in line 308-311

      To functionally validate the importance of CCK signalling we used CRISPR-cas9 to generate loss-of-function (LOF) mutations in the pituitary- CCK receptor gene.

      The authors need to clearly state WHICH gene they inactivated: Zebrafish have three CCK-receptors, so "the pituitary receptor gene" needs to be defined.

      Was added again in line 107, and is mentioned in the methods

      Figure 3 is a crucial figure!

      Figure 3B: The data are not very convincing. Please state how thick the sections are in the figure legend (assuming these are adult pituitaries),

      Added in the legend (figure 1C in the new version), slice thickness and adult fish.

      Please show at least the merged image a high magnification view of the co-localization of the receptor with the cells.

      This is figure 1 in the new revision, a magnified figure was added

      Please give the scale bar size for 3B.

      Scales for all images were added

      Figure 3C: the co-localization of the terminals of the CCK and FSH cells shows very few cells expressing close to terminals.

      Important: Because the labelling of the terminals with anti-CCK looks a lot like the background, it is very important to show the control (anti-CCK antibody pre-absorbed with the peptide). The authors should have these data. The photo needs to have been taken at the same gain (contrast) and the photo showing the terminals.

      This is  a commercial antibody that had been previously validated for CCK in fish. The co-localization pattern resembles GnRH innervation in the pituitary. In fish when hypothalamic neurons innervate the pituitary they do not innervate all the cells, as this is an endocrine system, the peptide can travel to neighbouring cells via diffusion or aided blood flow (Golan, Zelinger et al. 2015) ).  The images reveal the direct innervation of CCK in the pituitary and its proximity to FSH cells.

      Figure 4c, on right. The text seems to be stretched as if the photo was adjusted without locking the aspect ratio. Please check the original images.

      This has been fixed

      Can the authors use different pseudo colours? Differentiating a double label of white versus yellow is very difficult, and thus the photo is not very convincing.

      This had been changed to green and magenta

      What is meant by "CCK-AB" antibody? Perhaps anti-CCK would be a better label

      This has been fixed

      Figure 5A: increase the magnification of the insets; the structure of the gonads is very difficult to see with clarity in these low mag images. The most obvious way to improve this figure is to reduce or eliminate the pie graph (not really necessary) and show a high magnification (and larger) image of the gonadal structure.

      This is figure 1 in the new version, with magnification of the gonad next to each body section.

      Discussion:

      " Moreover, in the zebrafish, as well as in other species, the functional overlap in gonadotropin signalling pathways is not limited to the pituitary but is also present in the gonad, through the promiscuity of the two gonadotropin receptors"<br /> The reasoning of this sentence is not clear: zebrafish do not use GnRH to control reproduction: they lack GnRH1 through genomic rearrangement (see Whitlock, Postlethwait and Ewer 2019) and KO of GnRH2/GnRH3 does not affect reproduction.

      While GnRH KO model indicate a redundancy of GnRH in this axis in zebrafish, there is also ample evidence for its importance in regulating reproduction such as its effect on gonadotropin (Golan, Martin et al. 2016) and its use in spawning inductions in fish (Mizrahi and Levavi-Sivan 2023). We believe it is currently too soon to conclude that GnRH signalling is completely non relevant to reproduction in cyprinids.  

      Reviewing Editor (Recommendations For The Authors):

      It would be interesting to see calcium imaging experiments in the CCKR receptor mutants to establish a more direct connection between peptide action and activity.

      We added a receptor assay that reflect the non-activation of the mutated receptors by CCK (supplementary figure 1) , and compared it to the wild type that is activated. This show that: 1) CCK directly activate our identified receptor in FSH cells. 2) the mutated receptors are non-active.

      "all homozygous fish (CCKR+12/+7/-1/ CCKR+12/+7/-1, n=12)"

      It may be better to write the genotype of fish separately as CCKR+12/+12, CCKR+7/+7 and CCKR-1/-1, n=12) otherwise it seems as if all alleles occurred together in the same fish.

      Modified according to the reviewer request

      In Figure 1 scale bar legends are very small. 

      Description of the scale bars were added to the all the legends

      Figure 1 legend "On the top right of each panel is the gender distribution" - fish have no gender but sex.

      Modified according to the reviewer request

      The authors should endeavour to improve the presentation of the figures. They should use a sans-serif font and check that text is not cut at the edge of figure panels, that scale bars are uniform and clearly labelled and fonts are of similar size and clearly legible. E.g. labels of the fish brain of Fig3A are very small.

      We modified all the figures to adapt the font and the scales, we increased the size of the image in Figure 3a to make the labels clearer.

      Please use the elife format to name supplementary figures, as Figure X - Figure Supplement Y (each supplement associated with one of the main figures).

      Fixed

      Peptide concentrations in the ex vivo experiments should also be given as molar concentrations not only as '250 μl of 30 μg/ml CCK'.

      Fixed

      "In contrast, FSH cells responded with a very low calcium rise in hormonal secretion in response to GnRH" - a very low rise in hormonal secretion

      Fixed

      Please clarify why you used a GnRH synthetic agonist and not the native peptide.

      It is commonly used for spawning induction in fish (line 245); it has also been shown to directly affect the secretion of LH and FSH (Biran, Golan et al. 2014, Biran, Golan et al. 2014, Mizrahi, Gilon et al. 2019) , added to line 245.

      References

      Ball, J. (1981). "Hypothalamic control of the pars distalis in fishes, amphibians, and reptiles." General and comparative endocrinology 44(2): 135-170.

      Biran, J., M. Golan, N. Mizrahi, S. Ogawa, I. S. Parhar and B. Levavi-Sivan (2014). "Direct regulation of gonadotropin release by neurokinin B in tilapia (Oreochromis niloticus)." Endocrinology 155(12): 4831-4842.

      Biran, J., M. Golan, N. Mizrahi, S. Ogawa, I. S. Parhar and B. Levavi-Sivan (2014). "LPXRFa, the Piscine Ortholog of GnIH, and LPXRF Receptor Positively Regulate Gonadotropin Secretion in Tilapia (Oreochromis niloticus)." Endocrinology 155(11): 4391-4401.

      Golan, M., A. O. Martin, P. Mollard and B. Levavi-Sivan (2016). "Anatomical and functional gonadotrope networks in the teleost pituitary." Scientific Reports 6: 23777.

      Golan, M., E. Zelinger, Y. Zohar and B. Levavi-Sivan (2015). "Architecture of GnRH-Gonadotrope-Vasculature Reveals a Dual Mode of Gonadotropin Regulation in Fish." Endocrinology 156(11): 4163-4173.

      Mizrahi, N., C. Gilon, I. Atre, S. Ogawa, I. S. Parhar and B. Levavi-Sivan (2019). "Deciphering Direct and Indirect Effects of Neurokinin B and GnRH in the Brain-Pituitary Axis of Tilapia." Front Endocrinol (Lausanne) 10: 469.

      Mizrahi, N. and B. Levavi-Sivan (2023). "A novel agent for induced spawning using a combination of GnRH analog and an FDA-approved dopamine receptor antagonist." Aquaculture 565: 739095.

      Uehara, S. K., Y. Nishiike, K. Maeda, T. Karigo, S. Kuraku, K. Okubo and S. Kanda (2023). "Cholecystokinin is the follicle-stimulating hormone (FSH)-releasing hormone." bioRxiv: 2023.2005.2026.542428.

      Webb, K. A., Jr., I. A. Khan, B. S. Nunez, I. Rønnestad and G. J. Holt (2010). "Cholecystokinin: molecular cloning and immunohistochemical localization in the gastrointestinal tract of larval red drum, Sciaenops ocellatus (L.)." Gen Comp Endocrinol 166(1): 152-159.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews: 

      Reviewer #1 (Public Review):

      The authors introduce a computational model that simulates the dendrites of developing neurons in a 2D plane, subject to constraints inspired by known biological mechanisms such as diffusing trophic factors, trafficked resources, and an activity-dependent pruning rule. The resulting arbors are analyzed in terms of their structure, dynamics, and responses to certain manipulations. The authors conclude that 1) their model recapitulates a stereotyped timecourse of neuronal development: outgrowth, overshoot, and pruning 2) Neurons achieve near-optimal wiring lengths, and Such models can be useful to test proposed biological mechanisms- for example, to ask whether a given set of growth rules can explain a given observed phenomenon - as developmental neuroscientists are working to understand the factors that give rise to the intricate structures and functions of the many cell types of our nervous system. 

      Overall, my reaction to this work is that this is just one instantiation of many models that the author could have built, given their stated goals. Would other models behave similarly? This question is not well explored, and as a result, claims about interpreting these models and using them to make experimental predictions should be taken warily. I give more detailed and specific comments below.  

      We thank the reviewer for the summary of the work. But the criticism “that this is one instantiation of many models [we] could have built” is unfair as it can apply to any model. We chose one of the most minimalistic models which implements known biological mechanisms including activity-independent and -dependent phases of dendritic growth, and constrained parameters based on experimental data. We compare the proposed model to other alternatives in the Discussion section. In the revised manuscript, we additionally investigate the sensitivity of model output to variations of specific parameters, as explained below.

      Point 1.1. Line 109. After reading the rest of the manuscript, I worry about the conclusion voiced here, which implies that the model will extrapolate well to manipulations of all the model components. How were the values of model parameters selected? The text implies that these were selected to be biologically plausible, but many seem far off. The density of potential synapses, for example, seems very low in the simulations compared to the density of axons/boutons in the cortex; what constitutes a potential synapse? The perfect correlations between synapses in the activity groups is flawed, even for synapses belonging to the same presynaptic cell. The density of postsynaptic cells is also orders of magnitude of, etc. Ideally, every claim made about the model's output should be supported by a parameter sensitivity study. The authors performed few explorations of parameter sensitivity and many of the choices made seem ad hoc.  

      We have performed detailed sensitivity analysis on the model parameters mentioned by the reviewer, including (I) the density of postsynaptic cells (somatas), (II) the density of potential synapses, and (III) the level of correlations between synapses. 

      (I) While the density of postsynaptic cells in our baseline model seems a bit low, at least when compared to densities observed in adulthood (Keller et al., 2018), we explored how altering this value affects the model dynamics. We found that the postsynaptic cell density does not affect the timing of dendritic outgrowth, overshoot and synaptic pruning. It only changes the final size of the dendritic arbor and the resulting number of connected synapses. This analysis is now included in Supplementary Figure 3-2.

      (II) The density of potential synapses and the density of connected synapses that we used in the manuscript are already in the range of densities that can be found in the literature (Leighton et al., 2024; Ultanir et al., 2007; Glynn et al., 2011; Yang et al., 2014), some of which we already cited in the original submission.

      A potential concern might be that the rapid slowing down of growth in the model could be due to a depletion of potential synapses. To illustrate that this is not the case, we showed that the number of available potential synapses over the time course of the simulations remains high (Figure 3, new panel e). Therefore, the initial density of potential synapses is sufficient and does not affect the final density of connected synapses.

      To further illustrate the robustness of our model dynamics to longer simulation times, we added a new supplementary figure (Supplementary Figure 3-1).

      These new figure additions (Figure 3e, Supplementary Figure 3-1, and Supplementary Figure 3-2) and their implications for the model dynamics are discussed in the Results section of the revised paper:

      p.9 line 198, “After the initial overshoot and pruning, dendritic branches in the model stay stable, with mainly small subbranches continuing to be refined (Figure 3-Figure Supplement 1). This stability in the model is achieved despite the number of potential synaptic partners remaining high (Figure 3e), indicating a balance between activity-independent and activitydependent mechanisms. The dendritic growth and synaptic refinement dynamics are independent of the postsynaptic somata densities used in our simulations (Figure 3-Figure Supplement 2). Only the final arbor size and the number of connected synapses decrease with an increase in the density of the somata, while the timing of synaptic growth, overshoot and pruning remains the same (Figure 3-Figure Supplement 2).”

      We also added more details to the description of our model in the Methods section:

      p.24 line 615, “For all simulations in this study, we distributed nine postsynaptic somata at regular distances in a grid formation on a 2-dimensional 185 × 185 pixel area, representing a cortical sheet (where 1 pixel = 1 micron, Figure 4). This yields a density of around 300 neurons per 𝑚𝑚2 (translating to around 5,000 per 𝑚𝑚3, where for 25 neurons in Figure 3Figure Supplement 2 this would be around 750 neurons per 𝑚𝑚2 or 20,000 per 𝑚𝑚3). The explored densities are a bit lower than compared to neuron densities observed in adulthood (Keller et al., 2018). In the same grid, we randomly distributed 1,500 potential synapses, yielding an initial density of 0.044 potential synapses per 𝜇𝑚2 (Figure 3e). At the end of the simulation time, around 1,000 potential synapses remain, showing that the density of potential synapses is sufficient and does not significantly affect the final density of connected synapses. Thus, the rapid slowing down of growth in our model is not due to a depletion of potential synaptic partners. The resulting density of stably connected synapses is approximately 0.015 synapses per 𝜇𝑚2 (around 60 synapses stabilized per dendritic tree, Figure 3b). This density compares well to experimental findings, where, especially during early development, synaptic densities are described to be within a range similar to the one observed in our model (Leighton et al., 2024; Ultanir et al., 2007; Glynn et al., 2011; Yang et al., 2014; Koshimizu et al., 2009; Tyler and Pozzo-Miller, 2001).”

      (III) Lastly, we investigated how the correlation between synapses of the same activity group might affect our conclusions. As correlations in our model mainly arise from patterns of spontaneous activity which are abundant in early postnatal development (retinal waves (Ackman et al., 2012) or endogenous activity in the form of highly synchronized events involving a large fraction of the cells (Siegel et al., 2012), we explored varying the correlations within each activity group, across activity groups and combinations of both. While this analysis supported our previously described intuition on how competition between synaptic activities should drive activity-dependent refinement, recently a study found direct evidence for such subcellular refinement of synaptic inputs specifically dependent on spontaneous activity between retinal ganglion cell axons and retinal waves in the superior colliculus (Matsumoto et al., 2024). The new analysis confirmed our earlier results that the competition between activity groups leads to activity-dependent refinement and yielded further insight into how the studied activity correlations can affect the competition. Those results are presented in a completely new figure (new Figure 5, supported by the Supplementary Figure 5-1 and 5-2) and discussed in the Results section:

      p.11 line 249, “Group activity correlations shape synaptic overshoot and selectivity competition across synaptic groups.

      Since correlations between synapses emerge from correlated patterns of spontaneous activity abundant during postnatal development (Ackman et al., 2012; Siegel et al., 2012), we explored a wide range of within-group correlations in our model (Figure 5a). Although a change in correlations within the group has only a minor effect on the resulting dendritic lengths (Figure 5b) and overall dynamics, it can change the density of connected synapses and thus also affect the number of connected synapses to which each dendrite converges throughout the simulations (Figure 5c,e). This is due to the change in specific selectivity of each dendrite which is a result of the change in within-group correlations (Figure 5d). While it is easier for perfectly correlated activity groups to coexist within one dendrite (Figure 5-Figure Supplement 1a, 100%), decreasing within-group correlations increases the competition between groups, producing dendrites that are selective for one specific activity group (60%, Figure 5d, Figure 5-Figure Supplement 1a). This selectivity for a particular activity group is maximized at intermediate (approximately 60%) within-group correlations, while the contribution of the second most abundant group generally remains just above random chance levels (Figure 5-Figure Supplement 1a). Further reducing within-group correlations (20%, Figure 5a) causes dendrites to lose their selectivity for specific activity groups due to the increased noise in the activity patterns (20%, Figure 5a). Overall, reducing within-group correlations increases synapse pruning (Figure 5f, bottom), also found experimentally (Matsumoto et al., 2024) as dendrites require an extended period to fine-tune connections aligned with their selectivity biases. This phenomenon accounts for the observed reduction in both the density and number of synapses connected to each dendrite.

      In addition to the within-group correlations, developmental spontaneous activity patterns can also change correlations between groups as for example retinal waves propagated in different domains (Feller et al., 1997) (Figure 5-Figure Supplement 2). An increase in between-group correlations in our model intuitively decreases competition between the groups since fully correlated global events synchronize the activity of all groups (Figure 5-Figure Supplement 2). The reduction in competition reduces pruning in the model, which can be recovered by combining cross-group correlations with decreased within-group correlations (Figure 5-Figure Supplement 2). Our simulations show that altering the correlations within activity groups increases competition (by lowering the within-group correlations) or decreases competition (by raising the across-group correlations). Hence, in our model, competition between activity groups due to non-trivially structured correlations is necessary to generate realistic dynamics between activity-independent growth and activity-dependent refinement or pruning.

      In sum, our simulations demonstrate that our model can operate under various correlations in the spike trains. We find that the level of competition between synaptic groups is crucial for the activity-dependent mechanisms to either potentiate or depress synapses and is fully consistent with recent experimental evidence showing that the correlation between spontaneous activity in retinal ganglion cells axons and retinal waves in the superior colliculus governs branch addition vs. elimination (Matsumoto et al., 2024)."

      Precise details on the implementation of the changed activity correlations were added to the Methods section:

      p. 25 line 638, “Within-group and across-group activity correlations. For the decreased withingroup correlations, we generated parent spike trains for each individual group with the firing rate 𝑟𝑖𝑛 = 𝑟𝑡𝑜𝑡𝑎𝑙 ∗ 𝑃𝑖𝑛 (e.g., 𝑃𝑖𝑛 = 100%; 60%; 20%, Figure 5). All the synapses of the same group share the same parent spike train and the remaining spikes for each synapse are uniquely generated with the firing rate 𝑟𝑟𝑒𝑠𝑡 = 𝑟𝑡𝑜𝑡𝑎𝑙 ∗ (1 − 𝑃𝑖𝑛) (e.g., (1 − 𝑃𝑖𝑛) = 0%; 40%; 80%), resulting in the desired firing rate 𝑟𝑡𝑜𝑡𝑎𝑙 (see Table 1). For the increase in across-group correlations, we generated one master spike train with the firing rate 𝑟𝑐𝑟𝑜𝑠𝑠 = 𝑟𝑡𝑜𝑡𝑎𝑙 ∗ 𝑃𝑐𝑟𝑜𝑠𝑠 for all the synapses of all groups (e.g., 𝑃𝑐𝑟𝑜𝑠𝑠 = 5%; 10%; 20%, Figure 5-Figure Supplement 2). This master spike train is shared across all groups and then filled up according to the within-group correlation (if not specified differently 𝑃𝑖𝑛 = 1 − 𝑃𝑐𝑟𝑜𝑠𝑠 to maintain the rate 𝑟𝑡𝑜𝑡𝑎𝑙). In all the cases, also in those where the change in across-group correlations is combined with the change in within-group correlations, the remaining spikes for each synapse are generated with a firing rate 𝑟𝑟𝑒𝑠𝑡 = 𝑟𝑡𝑜𝑡𝑎𝑙 ∗ (1 − 𝑃𝑖𝑛 − 𝑃𝑐𝑟𝑜𝑠𝑠) to obtain an overall desired firing rate of 𝑟𝑡𝑜𝑡𝑎𝑙.”

      Point 1.2. Many potentially important phenomena seem to be excluded. I realize that no model can be complete, but the choice of which phenomena to include or exclude from this model could bias studies that make use of it and is worth serious discussion. The development of axons is concurrent with dendrite outgrowth, is highly dynamic, and perhaps better understood mechanistically. In this model, the inputs are essentially static. Growing dendrites acquire and lose growth cones that are associated with rapid extension, but these do not seem to be modeled. Postsynaptic firing does not appear to be modeled, which may be critical to activity-dependent plasticity. For example, changes in firing are a potential explanation for the global changes in dendritic pruning that occur following the outgrowth phase.  

      Thanks to the reviewer for bringing up these important considerations. We do indeed write in the Introduction (e.g. lines 36-76) which phenomena we include in the model and why. The Discussion also compares our model to others (lines 433-490), pointing out that most models either focus on activity-independent or activity-dependent phases. We include both, combining the influence of both molecular gradients and growth factors as well as activity-dependent connectivity refinements instructed by spontaneous activity. We consider our model a tractable, minimalist mechanistic model which includes both activity-independent and activity-dependent aspects. 

      Regarding postsynaptic firing, this is indeed super relevant and an important point to consider. In one of our recent publications (Kirchner and Gjorgjieva, 2021), we studied only an activity-dependent model for the organization of synaptic inputs on non-growing dendrites which have a fixed length. There, we considered the effect of postsynaptic firing (via a back-propagating action potential) and demonstrated that it plays an important role in establishing a global organization of synapses on the entire dendritic tree of the neuron. For example, we showed that it could lead to the emergence of retinotopic maps on the dendritic tree which have been found experimentally (Iacaruso et al., 2017). Since we use the same activity-dependent plasticity model in this paper, we expect that the somatic firing will have the same effect on establishing synaptic distributions on the entire dendritic tree. This is now also discussed in the Discussion section of the revised manuscript:

      p. 21 line 491, “Although we did not explicitly model postsynaptic firing, our previous work with static dendrites has shown that it can play an important role in establishing a global organization of synapses on the entire dendritic tree of the neuron (Kirchner and Gjorgjieva, 2021). For example, we showed that it could lead to the emergence of retinotopic maps on the dendritic tree which have been found experimentally (Iacaruso et al., 2017). Since we use the same activity-dependent plasticity model in this paper, we expect that the somatic firing will have the same effect on establishing synaptic distributions on the entire dendritic tree.”

      Including the concurrent development of axons in the model is indeed very interesting. In fact, a recent tour-de-force techniques paper found similar to what we assume. Hebbian activity-dependent dynamics of axonal branches of retinal ganglion cells experiencing spontaneous activity in relation to retinal waves in the superior colliculus (Matsumoto et al., 2024). New branches tend to be added at the locations where spontaneous activity of individual branches is more correlated with retinal waves, whereas asynchronous activity is associated with branch elimination. We suspect the same Hebbian activity-dependent dynamics to apply also to dendritic growth. 

      To address simultaneous dynamic axons to our growing dendrites, in the revised version of the manuscript, we included a simplified form of axonal dynamics by allowing changes in the lifetime and location of potential synapses, which come from axons of presynaptic partners. We explored different median lifetimes of synapses in combination with several distances with which a synapse can move in the simulated space (new Supplementary Figure 3-3). Our results show that dynamically moving synapses only affect the dynamics and stability of our model when the rate of moving synapses combined with the distance of moving synapses is faster than the dendritic growth. In scenarios in which synapses can move across large distances, dendrites get further destabilized due to synapses transferring from one dendrite to another, perturbing the attractor fields of the potential synapses even in late phases of the simulations. Besides such non-biological scenarios, dynamically moving synapses do not affect the model dynamics too much. Thus, they mostly add additional noise and variability to the growth and pruning without changing the timing and amplitude of the dynamics. These results are discussed in the results section of the revised manuscript:

      p.9 line 207, “The development of axons is concurrent with dendritic growth and highly dynamic Matsumoto et al. (2024). To address the impact of simultaneously growing axons, we implemented a simple form of axonal dynamics by allowing changes in the lifetime and location of potential synapses, originating from the axons of presynaptic partners (Figure 3-Figure Supplement 3). When potential synapses can move rapidly (median lifetime of 1.8 hours), the model dynamics are perturbed quite substantially, making it difficult for the dendrites to stabilize completely (Figure 3–Figure Supplement 3c). However, slowly moving potential synapses (median lifetime of 18 hours) still yield comparable results (Figure 3-Figure Supplement 3). The distance of movement significantly influenced results only when potential synaptic lifetimes were short. For extended lifetimes, the moving distance had a minor impact on the dynamics, predominantly affecting the time required for dendrites to stabilize. This was the result of synapses being able to transfer from one dendrite to another, potentially forming new long-lasting connections even at advanced stages of synaptic refinement. In sum, our results show that potential axonal dynamics only affect the stability of our model when these dynamics are much faster than dendritic growth.”

      Precise details on the implementation of the dynamically moving synapses and their synaptic lifetimes are now in the Methods section:

      p. 25 line 650, “Dynamically moving synapses. For the moving synapses we introduced lifetimes for each synapse, randomly sampled from a log-normal distribution with median 1.8h (for when they move frequently), 4.5h or 18h (for when they move rarely) and variance equal to 1 (Figure 3-Figure Supplement 3b). The lifetime of a synapse decreases only when the synapse is not connected to any of the dendrites (i.e., is a potential synapse). When the lifetime of a synapse expires, the synapse moves to a new location with a new lifetime sampled from the same log-normal distribution. This enables synapses to move multiple times throughout a simulation. The exact locations and distances to which each synapse can move are determined by a binary matrix (dimensions: 𝑝𝑖𝑥𝑒𝑙𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒 × 𝑝𝑖𝑥𝑒𝑙𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒) representing a ring (annulus) with the inner radius 𝑑/4 and outer radius 𝑑/2 , where the synapse location is at the center of the matrix. All the locations of the matrix within the ring boundaries (between the inner radius and outer radius) are potential locations to which the synapse can move. The synapse then moves randomly to one of the possible locations where no other synapse or dendrite is located. For the movement distances, we chose the ring dimensions 3 × 3, 25 × 25 and 101 × 101, yielding the moving distances (radii) of 1 pixel per movement, 12 pixels per movement and 50 pixels per movement (𝑟 = (𝑑−1)/2). These pixel distances represent small movements, as much as a dendrite can grow in one step (1 micron), and larger movements which are far enough so that the synapse will not attract the same branches again (12 microns) or far enough so that it might attract a completely different dendrite (50 microns, Figure 3-Figure Supplement 3a).”

      Point 1.3. Line 167. There are many ways to include activity -independent and -dependent components into a model and not every such model shows stability. A key feature seems to be that larger arbors result in reduced growth and/or increased retraction, but this could be achieved in many ways (whether activity dependent or not). It's not clear that this result is due to the combination of activity-dependent and independent components in the model, or conceptually why that should be the case.

      We never argued for model uniqueness. There are always going to be many different models (at different spatial and temporal scales, at different levels of abstraction). We can never study all of them and like any modeling study in systems neuroscience we have chosen one model approach and investigated this approach. We do compare the current model to others in the Discussion. If the reviewers have a specific implementation that we should compare our model to as an alternative, we could try, but not if this means doing a completely separate project.

      Point 1.4. Line 183. The explanation of overshoot in terms of the different timescales of synaptic additions versus activity-dependent retractions was not something I had previously encountered and is an interesting proposal. Have these timescales been measured experimentally? To what extent is this a result of fine-tuning of simulation parameters?  

      We found that varying the amount of BDNF controls the timescale of the activity-dependent plasticity (see our Figure 6c). Hence, changing the balance between synaptic additions vs. retractions is already explored in Figure 6e and f. Here we show that the overshoot and retraction does not have to be fine-tuned but may be abolished if there is too much activity-dependent plasticity. 

      Regarding the relative timescales of synaptic additions vs. retractions: since the first is mainly due to activity-independent factors, and the second due to activity-dependent plasticity, the questions is really about the timescales of the latter two. As we write in the Introduction (lines 61-63), manipulating activity-dependent synaptic transmission has been found to not affect morphology but rather the density and specificity of synaptic connections (Ultanir et al. 2007), supporting the sequential model we have (although we do not impose the sequence, as both activity-independent and activitydependent mechanisms are always “on”; but note that activity-dependent plasticity can only operate on synapses that have already formed).

      The described results are robust to parameter variations (performed on the postsynaptic density, potential synapse density, and within- and across-group correlations) as described in the reply to reviewer #1 point 1.1.

      Point 1.5. Line 203. This result seems at odds with results that show only a very weak bias in the tuning distribution of inputs to strongly tuned cortical neurons (e.g. work by Arthur Konnerth's group). This discrepancy should be discussed.  

      First, we note that the correlated activity experienced by our modeled synapses (and resulting synaptic organization) does not necessarily correspond to visual orientation, or any stimulus feature, for that matter, but is rather a property of correlated spontaneous activity. 

      Nonetheless, there is some variability in what the experimental data show. Many studies have shown that synapses on dendrites are organized into functional synaptic clusters: across brain regions, developmental ages and diverse species from rodent to primate (Kleindienst et al., 2011; Takahashi et al., 2012; Winnubst et al., 2015; Gökçe et al., 2016; Wilson et al., 2016; Iacaruso et al., 2017; Scholl et al., 2017; Niculescu et al., 2018; Kerlin et al., 2019; Ju et al., 2020, Hedrick et al., 2022, Hedrick et al., 2024). Interestingly, some in vivo studies have reported lack of fine-scale synaptic organization (Varga et al., 2011; X. Chen et al., 2011; T.-W. Chen et al., 2013; Jia et al., 2010; Jia et al., 2014), while others reported clustering for different stimulus features in different species. For example, dendritic branches in the ferret visual cortex exhibit local clustering of orientation selectivity but do not exhibit global organization of inputs according to spatial location and receptive field properties (Wilson et al. 2016; Scholl et al., 2017). In contrast, synaptic inputs in mouse visual cortex do not cluster locally by orientation, but only by receptive field overlap, and exhibit a global retinotopic organization along the proximal-distal axis (Iacaruso et al., 2017). We proposed a theoretical framework to reconcile these data: combining activity-dependent plasticity similar to the BDNF-proBDNF model that we used in the current work, and a receptive field model for the different species (Kirchner and Gjorgjieva, 2021). This is now also discussed in the Discussion section of the revised manuscript:

      p. 20 line 471, “The correlated activity experienced by our modeled synapses (and resulting synaptic organization) does not necessarily correspond to visual orientation, or any stimulus feature, for that matter, but is rather a property of spontaneous activity. Nonetheless, there is some variability in what the experimental data show. Many have shown that synapses on dendrites are organized into functional synaptic clusters: across brain regions, developmental ages and diverse species from rodent to primate (Kleindienst et al., 2011; Winnubst et al., 2015; Iacaruso et al., 2017; Scholl et al., 2017; Niculescu et al., 2018; Takahashi et al., 2012; Gökçe et al., 2016; Wilson et al., 2016; Kerlin et al., 2019; Ju et al., 2020; Hedrick et al., 2022, 2024). Other studies have reported lack of fine-scale synaptic organization (Chen et al., 2013; Varga et al., 2011; Chen et al., 2011; Jia et al., 2010, 2014). Interestingly, some of these discrepancies might be explained by different species showing clustering with respect to different stimulus features (orientation or receptive field overlap) (Scholl et al., 2017; Wilson et al., 2016; Iacaruso et al., 2017). Our prior work proposed a theoretical framework to reconcile these data: combining activity-dependent plasticity as we used in the current work, and a receptive field model for the different species (Kirchner and Gjorgjieva, 2021).”

      Point 1.6. Line 268. How does the large variability in the size of the simulated arbors relate to the relatively consistent size of arbors of cortical cells of a given cell type? This variability suggests to me that these simulations could be sensitive to small changes in parameters (e.g. to the density or layout of presynapses).  

      We again thank the reviewer for the detailed explanation and feedback on parameters that should be tested in more detail. We have explored several of the suggested model parameters and believe that we have managed to explain and illustrate their effects on the model's dynamics clearly. The precise changes are explained in the reply to point 1.1 and are now available in the revised version of the manuscript.

      Point 1.7. The modeling of dendrites as two-dimensional will likely limit the usefulness of this model. Many phenomena- such as diffusion, random walks, topological properties, etc - fundamentally differ between two and three dimensions.  

      Indeed, there are many differences between two and three dimensions. We have ongoing work that extends the current model to 3D but is beyond the scope of the current paper. In systems neuroscience, people have found very interesting results making such simplified geometric assumptions about networks, for instance the one-dimensional ring model has been used to uncover fundamental insights about computations even though highly simplified and abstracted. We are convinced that our model, especially with the new sensitivity analysis, makes interesting and novel contributions and predictions.

      Point 1.8. The description of wiring lengths as 'approximately optimal' in this text is problematic. The plotted data show that the wiring lengths are several deviations away from optimal, and the random model is not a valid instantiation of the 2D non-overlapping constraints the authors imposed. A more appropriate null should be considered.  

      We appreciate the reviewer’s feedback regarding the use of the term “approximately optimal” in describing wiring lengths. We acknowledge that our initial terminology was imprecise and could be misleading. We had previously referred to the minimal wiring length as the optimal wiring length, which does not fully capture the nuances of neuronal wiring optimization. As noted in prior literature, such as the work by Hermann Cuntz (Cuntz et al., 2010 & 2012), neurons can optimize their wiring beyond simply minimizing dendritic length.

      To address this issue, to better capture the balance between wiring minimization and functional constraints, such as conduction delays, we have developed a new modeling approach based on minimum spanning trees with a balancing factor (Cuntz et al., 2010 & 2012). This factor modulates the trade-off between minimizing wiring length and accounting for conduction delays from synapses to the soma. Specifically, the model assumes a balance between minimizing the total dendritic length and minimizing the tree distance between synapses and the site of input integration, typically the soma. This balance is illustrated in Figure 8 (Figure 7 in the original manuscript), where we demonstrate that the deviation from the theoretical minimum length arises because direct paths to synapses often require longer dendrites in our models.

      Together with the new result, which we added as the new panels f, g and h to Figure 8 (originally Figure 7), we also adjusted panel a of Figure 8, to now illustrate the difference between random wiring, minimal wiring and minimal conductance delay. The updated Figure 8 and its new findings are discussed in the results section of the revised manuscript:

      p.17 line 387, “This deviation is expected given that real dendrites need to balance their growth processes between minimizing wire while reducing conduction delays. The interplay between these two factors emerges from the need to reduce conduction delays, which requires a direct path length from a given synapse to the soma, consequently increasing the total length of the dendritic cable. (Cuntz et al., 2010, 2012; Ferreira Castro et al., 2020).

      To investigate this further, we compared the scaling relations of the final morphologies of our models with other synthetic dendritic morphologies generated using a previously described minimum spanning tree (MST) based model. The MST model balances the minimization of total dendritic length and the minimization of conduction delays between synapses and the soma. This balance results in deviations from the theoretical minimum length because direct paths to synapses often require longer dendrites (Cuntz et al., 2008, 2010). The balance in the model is modulated by a balancing factor (𝑏𝑓 ). If 𝑏𝑓 is zero, dendritic trees minimize the cable only, and if 𝑏𝑓 is one, they will try to minimize the conduction delays as much as possible. It is important to note that the MST model does not simulate the developmental process of dendritic growth; it is a phenomenological model designed to generate static morphologies that resemble real cells.

      To facilitate the comparison of total lengths between our simulated and MST morphologies, we generated MST models under the same initial conditions (synaptic spatial distribution) as our models and simulated them to match several morphometrics (total length, number of terminals, and surface area) of our grown morphologies. This allowed us to create a corresponding MST tree for each of our synthetic trees. Consequently, we could evaluate whether the branching structures of our models were accurately predicted by minimum spanning trees based on optimal wiring constraints. We found that the best match occurred with a trade-off parameter 𝑏𝑓 = 0.9250 (Figure 8f). Using the morphologies generated by the MST model with the specified trade-off parameter (𝑏𝑓 ), we showed that the square root of the synapse count and the total length (𝐿) in both our model generated trees and the MST trees exhibit a linear scaling relationship (Figure 8g; 𝑅2 = 0.65). The same linear relationship can be observed for the square root of the surface area and the total length 𝐿 of our model trees and the MST trees (Figure 8h; 𝑅2 = 0.73). Overall, these results indicate that our model generate trees are wellfitted by the MST model and follow wire optimization constraints.

      We acknowledge that the value of the balancing factor 𝑏𝑓 in our model is higher than the range of balancing factors that is typically observed in the biological dendritic counterparts, which generally ranges between 0.2 and 0.4 (Cuntz et al., 2012; Ferreira Castro et al., 2020; Baltruschat et al., 2020). However, it is still remarkable that our model, which does not explicitly address these two conservation laws, achieves approximately optimal wiring. Why do we observe such a high 𝑏𝑓 value? We reason that two factors may contribute to this. First, in our models, local branches grow directly to the nearest potential synapse, potentially taking longer routes instead of optimally branching to minimize wiring length (Wen and Chklovskii, 2008). Second, the growth process in our models does not explicitly address the tortuosity of the branches, which can increase the total length of the branches used to connect synapses. In the future, it will be interesting to add constraints that take these factors into account. Taken together, combining activity-independent and -dependent dendrite growth produces morphologies that approximate optimal wiring.”

      Further details on the fitted MST model and the corresponding analysis were added to the methods section:

      p.26 line 669, “Comparison with wiring optimization MST models. To evaluate the wire minimization properties of our model morphologies (n=288), we examined whether the number of connected synapses (N), total length (L), and surface area of the spanning field (S) conformed to the scaling law 𝐿 ≈ 𝜋−1/2 ⋅ 𝑆1/2 ⋅ 𝑁1/2 (Cuntz et al., 2012). Furthermore, to validate that our model dendritic morphologies scale according to optimal wiring principles, we created simplified models of dendritic trees using the MST algorithm with a balancing factor (bf). This balancing factor adjusts between minimizing the total dendritic length and minimizing the tree distance between synapses and the soma (Cost = 𝐿 + 𝑏𝑓 ⋅ 𝑃 𝐿) (MST_tree; best bf = 0.925) (Cuntz et al., 2010); TREES Toolbox http://www.treestoolbox.org).

      Initially, we generated MSTs to connect the same distributed synapses as our models. We performed MST simulations that vary the balancing factor between 𝑏𝑓 = 0 and 𝑏𝑓 = 1 in steps of 0.025 while calculating the morphometric agreement by computing the error (Euclidean distance) between the morphologies of our models and those generated by the MST models. The morphometrics used were total length, number of terminals, and surface area occupied by the synthetic morphologies.”

      Point 1.9. It's not clear to me what the authors are trying to convey by repeatedly labeling this model as 'mechanistic'. The mechanisms implemented in the model are inspired by biological phenomena, but the implementations have little resemblance to the underlying biophysical mechanisms. Overall my impression is that this is a phenomenological model intended to show under what conditions particular patterns are possible. Line 363, describing another model as computational but not mechanistic, was especially unclear to me in this context.  

      What we mean by mechanistic is that we implement equations that model specific mechanisms i.e. we have a set of equations that implement the activity-independent attraction to potential synapses (with parameters such as the density of synapses, their spatial influence, etc) and the activitydependent refinement of synapses (with parameters such as the ratio of BDNF and proBDNF to induce potentiation vs depression, the activity-dependent conversion of one factor to the other, etc). This is a bottom-up approach where we combine multiple elements together to get to neuronal growth and synaptic organization. This approach is in stark contrast to the so-called top-down or normative approaches where the method would involve defining an objective function (e.g. minimal dendritic length) which depends on a set of parameters and then applying a gradient descent or other mathematical optimization technique to get at the parameters that optimize the objective function. This latter approach we would not call mechanistic because it involves an abstract objective function (who could say what a neuron or a circuit should be trying to optimize?) and a mathematical technique for how to optimize the function (we don’t know if neurons can compute gradients of abstract objective functions). 

      Hence our model is mechanistic, but it does operate at a particular level of abstraction/simplification. We don’t model individual ion channels, or biophysics of synaptic plasticity (opening and closing of NMDA channels, accumulation of proteins at synapses, protein synthesis). We do, however, provide a biophysical implementation of the plasticity mechanism through the BDNF/proBDNF model which is more than most models of plasticity achieve, because they typically model a phenomenological STDP or Hebbian rule that just uses activity patterns to potentiate or depress synaptic weights, disregarding how it could be implemented. To the best of our understanding, this is what is normally considered mechanistic in the field (in contrast to, for example, biophysical).

      Reviewer #2 (Public Review): 

      This work combines a model of two-dimensional dendritic growth with attraction and stabilisation by synaptic activity. The authors find that constraining growth models with competition for synaptic inputs produces artificial dendrites that match some key features of real neurons both over development and in terms of final structure. In particular, incorporating distance-dependent competition between synapses of the same dendrite naturally produces distinct phases of dendritic growth (overshoot, pruning, and stabilisation) that are observed biologically and leads to local synaptic organisation with functional relevance. The approach is elegant and well-explained, but makes some significant modelling assumptions that might impact the biological relevance of the results. 

      Strengths: 

      The main strength of the work is the general concept of combining morphological models of growth with synaptic plasticity and stabilisation. This is an interesting way to bridge two distinct areas of neuroscience in a manner that leads to findings that could be significant for both. The modelling of both dendritic growth and distance-dependent synaptic competition is carefully done, constrained by reasonable biological mechanisms, and well-described in the text. The paper also links its findings, for example in terms of phases of dendritic growth or final morphological structure, to known data well. 

      Weaknesses: 

      The major weaknesses of the paper are the simplifying modelling assumptions that are likely to have an impact on the results. These assumptions are not discussed in enough detail in the current version of the paper. 

      (1) Axonal dynamics. 

      A major, and lightly acknowledged, assumption of this paper is that potential synapses, which must come from axons, are fixed in space. This is not realistic for many neural systems, as multiple undifferentiated neurites typically grow from the soma before an axon is specified (Polleux & Snider, 2010). Further, axons are also dynamic structures in early development and, at least in some systems, undergo activity-dependent morphological changes too (O'Leary, 1987; Hall 2000). This paper does not consider the implications of joint pre- and post-synaptic growth and stabilisation.  

      We thank the reviewer for the summary of the strengths and weaknesses of the work. While we feel that including a full model of axonal dynamics is beyond the scope of the current manuscript, some aspects of axonal dynamics can be included and are now implemented and tested in the revised manuscript. Since this feedback covers similar aspects of the model that were also pointed out by reviewer #1, we refer here to our detailed reply to their comments 1.1 and 1.2, where we list and discuss all the analyses performed to address the raised issues.

      (2) Activity correlations 

      On a related note, the synapses in the manuscript display correlated activity, but there is no relationship between the distance between synapses and their correlation. In reality, nearby synapses are far more likely to share the same axon and so display correlated activity. If the input activity is spatially correlated and synaptic plasticity displays distance-dependent competition in the dendrites, there is likely to be a non-trivial interaction between these two features with a major impact on the organisation of synaptic contacts onto each neuron.  

      We have explored the amount of correlation (between and within correlated groups) in the revised manuscript (see also our reply to reviewer comment 1.1).

      However, previous experimental work, (e.g. Kleindienst et al., 2011) has provided anatomical and functional analyses that it is unlikely that the functional synaptic clustering on dendritic branches is the result of individual axons making more than one synapse (see pg. 1019).

      (3) BDNF dynamics 

      The models are quite sensitive to the ratio of BDNF to proBDNF (eg Figure 5c). This ratio is also activity-dependent as synaptic activation converts proBDNF into BDNF. The models assume a fixed ratio that is not affected by synaptic activity. There should at least be more justification for this assumption, as there is likely to be a positive feedback relationship between levels of BDNF and synaptic activation.  

      The reviewer is correct. We used the BDNF-proBDNF model for synaptic plasticity based on our previous work (Kirchner and Gjorgjieva, 2021).  

      There, we explored only the emergence of functionally clustered synapses on static dendrites which do not grow. In the Methods section (Parameters and data fitting) we justify the choice of the ratio of BDNF to proBDNF from published experimental work. We also performed sensitivity analysis (Supplementary Fig. 1) and perturbation simulations (Supplementary Fig. 3), which showed that the ratio is crucial in regulating the overall amount of potentiation and depression of synaptic efficacy, and therefore has a strong impact on the emergence and maintenance of synaptic organization. Since we already performed all this analysis, we expect that the same results will also apply to the current model which includes dendritic growth, as it involves the same activity-dependent mechanism.

      A further weakness is in the discussion of how the final morphologies conform to principles of optimal wiring, which is quite imprecise. 'Optimal wiring' in the sense of dendrites and axons (Cajal, 1895; Chklovskii, 2004; Cuntz et al, 2007, Budd et al, 2010) is not usually synonymous with 'shortest wiring' as implied here. Instead, there is assumed to be a balance between minimising total dendritic length and minimising the tree distance (ie Figure 4c here) between synapses and the site of input integration, typically the soma. The level of this balance gives the deviation from the theoretical minimum length as direct paths to synapses typically require longer dendrites. In the model this is generated by the guidance of dendritic growth directly towards the synaptic targets. The interpretation of the deviation in this results section discussing optimal wiring, with hampered diffusion of signalling molecules, does not seem to be correct. 

      We agree with this comment. We had wrongly used the term “optimal wiring” as neurons can optimize their wiring not only by minimizing their dendritic length but other factors as noted by the reviewer. In the revised manuscript we replaced the term “optimal wiring” with “minimal wiring” wherever it was incorrectly used. On top of that, we performed further analysis and discussed these differences, as pointed out in the reply to reviewer #1 point 1.8.

      To summarize, we want to again thank the reviewer for their in-depth review and all the suggestions that helped us improve the analysis and implementation of our model.

      Reviewer #3 (Public Review): 

      The authors propose a mechanistic model of how the interplay between activity-independent growth and an activity-dependent synaptic strengthening/weaken model influences the dendrite shape, complexity and distribution of synapses. The authors focus on a model for stellate cells, which have multiple dendrites emerging from a soma. The activity independent component is provided by a random pool of presynaptic sites that represent potential synapses and that release a diffusible signal that promotes dendritic growth. Then a spontaneous activity pattern with some correlation structure is imposed at those presynaptic sites. The strength of these synapses follow a learning rule previously proposed by the lab: synapses strengthen when there is correlated firing across multiple sites, and synapses weaken if there is uncorrelated firing with the relative strength of these processes controlled by available levels of BDNF/proBDNF. Once a synapse is weakened below a threshold, the dendrite branch at that site retracts and loses its sensitivity to the growth signal 

      The authors run the simulation and map out how dendrites and synapses evolve and stabilize. They show that dendritic trees growing rapidly and then stabilize by balancing growth and retraction (Figure 2). They also that there is an initial bout of synaptogenesis followed by loss of synapses, reflecting the longer amount of time it takes to weaken a synapse (Figure 3). They analyze how this evolution of dendrites and synapses depends on the correlated firing of synapses (i.e. defined as being in the same "activity group"). They show that in the stabilized phase, synapses that remain connected to a given dendritic branch are likely to be from same activity group (Figure 4). The authors systemically alter the learning rule by changing the available concentration of BDNF, which alters the relative amount of synaptic strengthening, which in turn affects stabilization, density of synapses and interestingly how selective for an activity group one dendrite is (Figure 5). In addition the authors look at how altering the activity-independent factors influences outgrowth (Figure 6). Finally, one of the interesting outcomes is that the resulting dendritic trees represent "optimal wiring" solutions in the sense that dendrites use the shortest distance given the distribution of synapses. They compare this distribute to one published data to see how the model compared to what has been observed experimentally.  

      There are many strengths to this study. The consequence of adding the activity-dependent contribution to models of synapto- and dendritogenesis is novel. There is some exploration of parameters space with the motivation of keeping the parameters as well as the generated outcomes close to anatomical data of real dendrites. The paper is also scholarly in its comparison of this approach to previous generative models. This work represented an important advance to our understanding of how learning rules can contribute to dendrite morphogenesis.

      We thank the reviewer for the positive evaluation of the work and the suggestions below.

      To improve the clarity of the manuscript, we adjusted and fixed some figures and corresponding paragraphs as follows:

      (1) We increased the number of ticks and their corresponding numbers in all the figures to make them easier to read and interpret.

      (2) In Figure 3 panel d, showing the evolution of synaptic weight, we corrected the upper limit at the yaxis to 1 (from previously 2).

      (3) Due to a typo in the implementation of the BDNF concentration, we had to correct the used BDNF concentrations from 49%, 45% and 40%, to 49%, 46.5% and 43% respectively.

      (4) The y-axis labels of Figure 6 (old Figure 5) panel e and f were changed to make the plots clearer (e: “morphology change explained (%)” to "effect on morphology (%)", and f: “synapse connection explained (%)” to "effect on connected synapses (%)").

      (5) The values for the eta and tau-w in the supplementary Table were corrected. Previously tau-w was falsely 6000 time steps which was corrected to 3000 time steps, and eta was 45% and is now 46.5%.

      We believe that all the changes to the manuscript will address the reviewer’s concerns and enhance the clarity and accuracy of the findings described in the manuscript.

    1. Analyse des angles morts de la feuille de route

      La feuille de route de la santé mentale et de la psychiatrie, bien qu'ambitieuse et dotée de moyens importants, présente quelques points de vigilance, que l'on pourrait qualifier d'angles morts.

      Difficultés de recrutement: Malgré les efforts financiers consentis pour renforcer les équipes soignantes, notamment en pédopsychiatrie, la mise en œuvre de certaines mesures se heurte à la pénurie de professionnels [1, 2].

      Ce manque de personnel qualifié limite l'efficacité des actions et retarde l'amélioration de l'accès aux soins, en particulier dans certaines zones géographiques [1].

      Adaptation aux spécificités des personnes âgées: Si la feuille de route prévoit des actions dédiées aux personnes âgées, l'adaptation des dispositifs aux particularités de cette population, notamment en matière de suicide, reste à approfondir [3].

      L'inclusion dans VigilanS des personnes âgées sur des critères autres qu'une tentative de suicide est envisagée, mais sa mise en œuvre effective n'est pas encore actée [3].

      Prise en charge de la souffrance psychique au travail:

      Bien que la feuille de route aborde la question de la souffrance psychique au travail, les actions mises en place se concentrent principalement sur les professionnels de santé [4].

      Il serait pertinent d'élargir ces actions à l'ensemble des secteurs d'activité pour mieux répondre aux besoins de la population active.

      Efficacité de MonSoutienPsy:

      Le dispositif MonSoutienPsy, bien qu'ayant enregistré un nombre important de bénéficiaires, soulève des interrogations quant à son efficacité réelle [5].

      L'adhésion des psychologues reste à consolider et une évaluation du dispositif est en cours [5].

      Manque d'indicateurs d'impact: Pour certaines actions, l'absence d'indicateurs d'impact précis rend difficile l'évaluation de leur efficacité [6, 7].

      La mise en place d'un système de suivi et d'évaluation plus robuste permettrait de mieux mesurer les progrès réalisés et d'ajuster les actions si nécessaire.

      En plus de ces angles morts, il est important de souligner que la réussite de la feuille de route dépendra également de facteurs externes:

      Évolution des déterminants de la santé mentale:

      La feuille de route se focalise sur l'accès aux soins et l'accompagnement des personnes souffrant de troubles psychiques, mais accorde une importance moindre aux déterminants de la santé mentale, tels que la précarité, les violences, la discrimination, etc. [8].

      Une action plus forte sur ces déterminants est essentielle pour améliorer durablement la santé mentale des Français.

      Mobilisation de l'ensemble des acteurs:

      La mise en œuvre de la feuille de route nécessite la collaboration de nombreux acteurs, notamment les ministères, les agences régionales de santé, les professionnels de santé, les associations, etc. [9, 10].

      Une coordination efficace et une mobilisation de tous les acteurs sont cruciales pour assurer la réussite des actions.

    2. La scolarité dans la feuille de route de la santé mentale et de la psychiatrie

      La feuille de route met l'accent sur le rôle crucial de la scolarité dans la promotion de la santé mentale des enfants et des jeunes.

      Plusieurs actions spécifiques ciblent le milieu scolaire:

      Renforcement des compétences psychosociales (CPS):

      L'action 1 de la feuille de route et la mesure 11 des Assises visent à diffuser le plus largement possible les interventions renforçant les CPS.

      Ces compétences sont considérées essentielles pour la promotion du bien-être mental et peuvent être mises en place dans tous les milieux de vie, y compris l'école. [1]

      Une stratégie intersectorielle de déploiement 2022-2027, co-portée par la Direction Générale de la Santé (DGS) et la Direction générale de l’enseignement scolaire (DGESCO) est en cours. [2]

      L'objectif est de créer un environnement continu de soutien au développement des CPS pour les enfants nés en 2037. [3]

      Prévention de la souffrance psychique chez les étudiants: La population étudiante est exposée à de nombreux stress et doit bénéficier de repérage et d'interventions précoces. [4]

      Le déploiement du secourisme en santé mentale dans les milieux étudiants vise à former 150 000 secouristes d'ici fin 2025. [5]

      En 2023, 2 646 étudiants ont été formés aux premiers secours en santé mentale. [6]

      Adressage par les services de médecine scolaire pour MonSoutienPsy:

      Le dispositif MonSoutienPsy permet aux personnes souffrant de troubles psychiques d’intensité légère à modérée de bénéficier de séances d’accompagnement psychologique. [7]

      La loi de financement de la sécurité sociale (LFSS) pour 2024 prévoit la possibilité d’adressage par les services de médecine scolaire, facilitant l'accès des élèves à ce dispositif. [8]

      Sensibilisation des médecins scolaires :

      Des conseils pour les parents sur le bon usage des écrans et le sommeil ont été ajoutés au carnet de santé de l’enfant, ainsi que des questions dans le cadre des examens obligatoires. [9, 10]

      Ces ajouts visent à inciter les médecins, y compris les médecins scolaires, à échanger avec les familles sur ces sujets importants pour la santé mentale des enfants.

      En plus de ces actions spécifiques, la feuille de route aborde des thématiques qui ont un impact direct sur le milieu scolaire:

      Lutte contre la stigmatisation:

      La lutte contre la stigmatisation des troubles psychiques est un élément important de la feuille de route.

      Des actions de sensibilisation et d'information du grand public sont prévues, ce qui peut contribuer à créer un environnement plus inclusif et bienveillant dans les écoles. [11]

      Amélioration de l'accès aux soins en pédopsychiatrie:

      La feuille de route prévoit des mesures pour renforcer l'offre de soins en pédopsychiatrie, notamment par le financement de projets dans les territoires sous-dotés. [12]

      Un meilleur accès aux soins spécialisés peut permettre une prise en charge plus précoce et plus efficace des troubles psychiques chez les enfants et les adolescents scolarisés.

      Il est important de noter que la feuille de route ne mentionne pas explicitement de mesures pour :

      **Former les enseignants à la détection des troubles psychiques: **

      La formation des enseignants à la détection des signes de souffrance psychique chez les élèves pourrait permettre une intervention plus précoce et une meilleure orientation vers les professionnels de santé.

      Adapter les programmes scolaires pour inclure des modules d'éducation à la santé mentale :

      L'inclusion de modules d'éducation à la santé mentale dans les programmes scolaires pourrait contribuer à sensibiliser les élèves dès le plus jeune âge à l'importance du bien-être mental et à la prévention des troubles psychiques.

      En conclusion, la feuille de route de la santé mentale et de la psychiatrie accorde une place importante à la scolarité, en reconnaissant son rôle clé dans la promotion de la santé mentale des enfants et des jeunes.

      Toutefois, certains aspects pourraient être renforcés, notamment la formation des enseignants et l'adaptation des programmes scolaires, pour une prise en compte plus globale de la santé mentale dans le milieu éducatif.

    1. Reviewer #1 (Public review):

      Summary:

      This work uses a novel, ethologically relevant behavioral task to explore decision-making paradigms in C. elegans foraging behavior. By rigorously quantifying multiple features of animal behavior as they navigate in a patch food environment, the authors provide strong evidence that worms exhibit one of three qualitatively distinct behavioral responses upon encountering a patch:<br /> (1) "search", in which the encountered patch is below the detection threshold;<br /> (2) "sample", in which animals detect a patch encounter and reduce their motor speed, but do not stay to exploit the resource and are therefore considered to have "rejected" it; and<br /> (3) "exploit", in which animals "accept" the patch and exploit the resource for tens of minutes.<br /> Interestingly, the probability of these outcomes varies with the density of the patch as well as the prior experience of the animal. Together, these experiments provide an interesting new framework for understanding the ability of the C. elegans nervous system to use sensory information and internal state to implement behavioral state decisions.

      Strengths:

      (1) The work uses a novel, neuroethologically-inspired approach to studying foraging behavior.

      (2) The studies are carried out with an exceptional level of quantitative rigor and attention to detail.

      (3) Powerful quantitative modeling approaches including GLMs are used to study the behavioral states that worms enter upon encountering food, and the parameters that govern the decision about which state to enter.

      (4) The work provides strong evidence that C. elegans can make 'accept-reject' decisions upon encountering a food resource.

      (5) Accept-reject decisions depend on the quality of the food resource encountered as well as on internally represented features that provide measurements of multiple dimensions of internal state, including feeding status and time.

      Weaknesses:

      (1) The authors repeatedly assert that an individual's behavior in the foraging assay depends on its prior history (particularly cultivation conditions). While this seems like a reasonable expectation, it is not fully fleshed out. The work would benefit from studies in which animals are raised on more or less abundant food before the behavioral task.

      (2) The authors convincingly show that the probability of particular behavioral outcomes occurring upon patch encounter depends on time-associated parameters (time since last patch encounter, time since last patch exploitation). There are two concerns here. First, it is not clear how these values are initialized - i.e., what values are used for the first occurrence of each behavioral state? More importantly, the authors don't seem to consider the simplest time parameter, the time since the start of the assay (or time since worm transfer). Transferring animals to a new environment can be associated with significant mechanical stimulus, and it seems quite possible that transferring animals causes them to enter a state of arousal. This arousal, which certainly could alter sensory function or decision-making, would likely decay with time. It would be interesting to know how well the model performs using time since assay starts as the only time-dependent parameter.

      (3) Similarly, Figures 2L and M clearly show that the probability of a search event occurring upon a patch encounter decreases markedly with time. Because search events are interpreted as a failure to detect a patch, this implies that the detection of (dilute) patches becomes more efficient with time. It would be useful for the authors to consider this possibility as well as potential explanations, which might be related to the point above.

      (4) Based on their results with mec-4 and osm-6 mutants, the authors assert that chemosensation, rather than mechanosensation, likely accounts for animals' ability to measure patch density. This argument is not well-supported: mec-4 is required only for the function of the six non-ciliated light-touch neurons (AVM, PVM, ALML/R, PLML/R). In contrast, osm-6 is expected to disrupt the function of the ciliated dopaminergic mechanosensory neurons CEP, ADE, and PDE, which have previously been shown to detect the presence of bacteria (Sawin et al 2000). Thus, the paper's results are entirely consistent with an important role of mechanosensation in detecting bacterial abundance. Along these lines, it would be useful for the authors to speculate on why osm-6 mutants are more, rather than less, likely to "accept" when encountering a patch.

      (5) While the evidence for the accept-reject framework is strong, it would be useful for the authors to provide a bit more discussion about the null hypothesis and associated expectations. In other words, what would worm behavior in this assay look like if animals were not able to make accept-reject decisions, relying only on exploit-explore decisions that depend on modulation of food-leaving probability?

    2. Reviewer #3 (Public review):

      Summary:

      In this study by Haley et al, the authors investigated explore-exploit foraging using C. elegans as a model system. Through an elegant set of patchy environment assays, the authors built a GLM based on past experience that predicts whether an animal will decide to stay on a patch to feed and exploit that resource, instead of choosing to leave and explore other patches.

      Strengths:

      I really enjoyed reading this paper. The experiments are simple and elegant, and address fundamental questions of foraging theory in a well-defined system. The experimental design is thoroughly vetted, and the authors provide a considerable volume of data to prove their points. My only criticisms have to do with the data interpretation, which I think is easily addressable.

      Weaknesses:

      (1) Sensing vs. non-sensing

      The authors claim that when animals encounter dilute food patches, they do not sense them, as evidenced by the shallow deceleration that occurs when animals encounter these patches. This seems ethologically inaccurate. There is a critical difference between not sensing a stimulus, and not reacting to it. Animals sense numerous stimuli from their environment, but often only behaviorally respond to a fraction of them, depending on their attention and arousal state. With regard to C. elegans, it is well-established that their amphid chemosensory neurons are capable of detecting very dilute concentrations of odors. In addition, the authors provide evidence that osm-6 animals have altered exploit behaviors, further supporting the importance of amphid chemosensory neurons in this behavior.

      (2) Search vs. sample & sensing vs. non-sensing

      In Figures 2H and 2I, the authors claim that there are three behavioral states based on quantifying average velocity, encounter duration, and acceleration, but I only see three. Based on density distributions alone, there really only seem to be 2 distributions, not 3. The authors claim there are three, but to come to this conclusion, they used a QDA, which inherently is based on the authors training the model to detect three states based on prior annotations. Did the authors perform a model test, such as the Bayesian Information Criterion, to confirm whether 2 vs. 3 Gaussians is statistically significant? It seems like the authors are trying to impose two states on a phenomenon with a broad distribution. This seems very similar to the results observed for roaming vs. dwelling experiments, which again, are essentially two behavioral states.

      (4) History-dependence of the GLM

      The logistic GLM seems like a logical way to model a binary choice, and I think the parameters you chose are certainly important. However, the framing of them seems odd to me. I do not doubt the animals are assessing the current state of the patch with an assessment of past experience; that makes perfect logical sense. However, it seems odd to reduce past experience to the categories of recently exploited patch, recently encountered patch, and time since last exploitation. This implies the animals have some way of discriminating these past patch experiences and committing them to memory. Also, it seems logical that the time on these patches, not just their density, should also matter, just as the time without food matters. Time is inherent to memory. This model also imposes a prior categorization in trying to distinguish between sensed vs. not-sensed patches, which I criticized earlier. Only "sensed" patches are used in the model, but it is questionable whether worms genuinely do not "sense" these patches.

      (5) osm-6

      The osm-6 results are interesting. This seems to indicate that the worms are still sensing the food, but are unable to assess quality, therefore the default response is to exploit. How do you think the worms are sensing the food? Clearly, they sense it, but without the amphid sensory neurons, and not mechanosensation. Perhaps feeding is important? Could you speculate on this?

      (7) Impact:

      I think this work will have a solid impact on the field, as it provides tangible variables to test how animals assess their environment and decide to exploit resources. I think the strength of this research could be strengthened by a reassessment of their model that would both simplify it and provide testable timescales of satiety/starvation memory.

    3. Author response:

      We thank the reviewers for their thoughtful comments. We are working to revise our manuscript and address each of the reviewers comments. A summary of our planned revisions and responses to some of the reviewers’ major concerns are included below.

      Cultivation Density: Reviewers #1 and #2 suggested that additional studies testing the effects of varying bacterial density during animal development (cultivation) would strengthen our findings. While we agree with the reviewers that this is a very interesting experiment, it is not feasible. Indeed, we attempted this experiment but found it nontrivial to maintain stable bacterial density conditions over long timescales as this requires matching the rate of bacterial growth with the rate of bacterial consumption. Despite our best efforts, we have not been able to identify conditions that satisfy these requirements. We will focus our revised manuscript to include only assertions about the effects of recent experiences.

      Transfer Method: Reviewers #1 and #2 expressed concern that the stress of transferring animals to a new plate may have resulted in an increased arousal state and thus a greater probability of rejecting patches. We thank the reviewers for this thoughtful remark and plan to conduct additional analyses to address this hypothesis. We did, however, anticipate this possibility and, to mitigate the stress of moving, we used an agar plug method where animals were transferred using the flat surface of small cylinders of agar. Importantly, the use of agar as a medium to transfer animals provides minimal disruption to their environment as all physical properties (e.g. temperature, humidity, surface tension) are maintained. Qualitatively, we observe no marked change in behavior from before to after transfer with the agar plug method, especially as compared to the often drastic changes observed when using a metal or eyelash pick.

      Time Parameter: Related to the transfer method, Reviewer #1 expressed concern that the simplest time parameter (time since start of the assay) might better predict animal behavior. We thank the reviewer for pointing out the need to specifically test whether the time-dependent change in explore-exploit decision-making corresponds better with satiety (time off patch) or arousal (time since transfer/start of assay) state. We will conduct additional analyses to address these alternative hypotheses.

      Parameter Initialization: Reviewer #1 pointed out an oversight in our methods section regarding the model parameter values used for the first encounter. We plan to clarify the initialization of parameters in the manuscript. In short, for the first patch encounter where k = 1:

      ρk is the relative density of the first patch.

      τs is the duration of time spent off food since the beginning of the recorded experiment. For the first patch, this is equivalent to the total time elapsed.

      ρh is the approximated relative density of the bacterial patch on the acclimation plates (see Assay preparation and recording in Methods). Acclimation plates contained one large 200 µL patch seeded with OD600 = 1 and grown for a total of ~48 hours. As with all patches, the relative density was estimated from experiments using fluorescent bacteria OP50-GFP as described in Bacterial patch density estimation in Methods.

      ρe is equivalent to ρh.

      Sensing vs. non-sensing: Reviewer #3 suggested that the term “non-sensing” may not be ethologically accurate. We thank the reviewer for their comment and agree that we do not know for certain whether the animals sensed these patches or were merely non-responsive to them. We are, however, confident that these encounters lack evidence of sensing. Specifically, we note that our analyses used to classify events as sensing or non-sensing examined whether an animal’s slow-down upon patch entry could be distinguished from either that of events where animals exploited or that of encounters with patches lacking bacteria. We found that  “non-sensing” encounters are indeed indistinguishable from encounters with bacteria-free patches where there are no bacteria to be sensed (see Figure 2 - Supplement 7C-D and Patch encounter classification as sensing or non-sensing in Methods). Regardless, we agree with the reviewer that all that can be asserted for certain about these events is that animals do not respond to the bacterial patch in any way that we measured. Therefore, we will replace the term “non-sensing” with “non-responding” to better indicate the ethological interpretation of these events.

      Time-dependent changes in sensing vs. non-sensing: Reviewer #1 remarked that the sensation of dilute patches increases with time. We agree with the reviewer that we observe increased responsiveness to dilute patches with time. Although this is interesting, our primary focus was on what decision an animal made given that they clearly sensed the presence of the bacterial patch. Nonetheless, we will add this observation to the discussion as an area of future work to investigate the sensory mechanisms behind this effect.

      Classification of sensing vs. non-sensing: Reviewers #2 and #3 expressed concerns about the validity of the two clusters identified using the semi-supervised QDA approach described. We are grateful to the reviewers for pointing out the difficulty in visualizing the clusters and the need for additional clarity in explaining the supervised labeling. We will use additional visualizations and methods to validate the clusters we have discovered. Specifically, we aim to provide additional evidence that the sensing vs. nonsensing data is bi-modal (i.e. a two-cluster classification method fits best). Further, it seems that there may be some confusion as to how we arrived at 3 encounter types (i.e. search, sample, exploit) that we plan to clarify in the manuscript. Specifically, it’s important to note that two methods were used on two different (albeit related) sets of parameters. We first used a two-cluster GMM to classify encounters as explore or exploit. We then used a two-cluster semi-supervised QDA to classify encounters as sensing or non-sensing (to be changed to “non-responding”, see above response) using a different set of parameters. We thus separated the explore cluster into two (sensing and non-sensing exploratory events) resulting in three total encounter types: exploit, sample (explore/sensing), and search (explore/non-sensing). We will clarify this in the text. Additionally, we will clarify the labelling used for “supervising” QDA. Specifically, we made two simple assumptions: 1) animals must have sensed the patch if they exploited it and 2) animals must not have sensed the patch if there were no bacteria to sense. Thus, we labeled encounters as sensing if they were found to be exploitatory as we assume that sensation is prerequisite to exploitation; and we labeled encounters as non-sensing for events where animals encountered patches lacking bacteria (OD600 = 0). All other points were non-labeled prior to learning the model. In this way, our labels were based on the experimental design and results of the GMM, an unsupervised method; rather than any expectations we had about what sensing should look like. The semi-supervised QDA method then used these initial labels to iteratively fit a paraboloid that best separated these clusters, by minimizing the posterior variance of classification.

      Accept-reject vs. stay-switch: Reviewers #1 and #2 ask for additional discussion on how the accept-reject decision-making framework differs from the stay-switch framework. We thank the reviewers for alerting us to this gap in our discussion. We intend to clarify that these frameworks ask two different types of questions (i.e. “Do you want to eat it?” versus “If so, how long do you want to eat it for?”). These concepts are well described in canonical foraging theory literature (see Pyke, Pulliam & Charnov 1977 for a review on the subject) and are easily distinguishable for animals that forage using the following framework: 1) search for prey, 2) encounter prey from a distance, 3) identify prey type, 4) decide to pursue (accept-reject decision), 5) pursue and capture the prey, 6) exploit prey, and 7) decide to stop exploiting and start searching again (stay-switch decision). In this case, it is easy to see the distinction between accept-reject and stay-switch decisions. However, in some scenarios, animals must physically encounter prey prior to identification and then must make an accept-reject decision. In these cases where pursuit and capture are not visualized, it is harder to distinguish between accept-reject and stay-switch decisions. In our experiments, we find significant bimodality in encounter duration (see Figure 2H) where short duration (exploratory) encounters appear to represent a lower bound where animals spend the minimum amount of time possible on a patch (less than 2 minutes), which we interpret as a rejection of the patch. On the other hand, exploitatory encounters span a large range of durations from 2 to 60+ minutes which we interpret as an initial acceptance of the patch followed by a series of stay-switch decisions which determine the overall duration of the encounter. While one could certainly model our data using only stay-switch decision-making, we ascertain that an encounter of minimal duration is better interpreted ethologically as a rejection than as an immediate switch decision. We will revise the text to further extrapolate upon our point of view on this somewhat philosophical distinction and what it predicts about C. elegans behavior.

      Sensory mutant behavior: Reviewers #1 and #3 ask for further speculation on the observed behavior of osm-6 and mec-4 animals. We will further elaborate on our findings, how they relate to previous studies, and what they suggest about the mechanisms behind these foraging decisions.

      Model design: Reviewer #3 suggested several alterations to the behavioral model. While the proposed model seems entirely reasonable and could aid in elucidating the time component of how prior experience affects decision-making, we chose the present model based on our experience with model selection using these data. Indeed, as the reviewer suggested, we did a great number of analyses involving model selection including model selection criteria (AIC, BIC) and optimization with regularization techniques (LASSO and elastic nets). We found that the problem of model selection was compounded by the enormous array of highly correlated variables we had to choose from. Additionally, we found that both interaction terms and non-linear terms of our task variables could be predictive of accept-reject decisions but that the precise set of terms selected depended sensitively on which model selection technique was used and generally made rather small contributions to prediction. The diverse array of results and combinatorial number of predictors to possibly include failed to add anything of interpretable value. We therefore chose to take a different approach to this problem. Rather than trying to determine what the “best” model was we instead asked whether a minimal model could be used to answer a set of core questions. Indeed, our goal was not maximal predictive performance but rather to distinguish between the effects of different influences enough to determine if encounter history had a significant, independent effect on decision making. We thus chose to only include task variables that spanned the most basic components of behavioral mechanisms to ask very specific questions. For example, we selected a time variable that we thought best encapsulated satiety. While we could have included many additional terms, or made different choices about which terms to include, based on our analyses these choices would not have qualitatively changed our results. Further, we sought to validate the parameters we chose with additional studies (i.e. food-deprived and sensory mutant animals). We regard our study as an initial foray into demonstrating accept-reject decision-making in nematodes. The exact mechanisms and, consequently, the best model design is therefore beyond the scope of this study. Lastly, Reviewer #3 criticized the use of only sensed patches in the model. While we acknowledge that we are not certain as to whether the “non-sensing” encounters are truly not sensed, we find qualitatively similar results when including all exploratory patches in our analyses. In fact, when all encounters are used, we find stronger correlations between our task variables and the accept-reject decision. However, we take the position that sensation is necessary for decision-making and thus believe that while our model’s predictive performance may be better using all encounters, the interpretation of our findings is stronger when we only include sensing events.

    1. Reviewer #2 (Public review):

      Summary:

      The manuscript investigates the role of the Mid1 gene in hippocampal (HPC) development and its contribution to Opitz G/BBB syndrome (OS), which is characterized by neurological deficits and structural abnormalities. The authors use a knockout mouse model (Mid1-/y) to elucidate the underlying molecular mechanisms that contribute to learning and memory impairments. They demonstrate that Mid1 gene deletion leads to reduced synaptic plasticity, abnormal neural rhythms, and decreased cognitive functions, providing a mechanistic explanation for the neurological deficits seen in OS patients. This study addresses an important gap in understanding the neural mechanisms underlying Opitz G/BBB syndrome and provides substantial evidence that the Mid1 gene plays a critical role in hippocampal function and cognition.

      Strengths:

      Understanding the role of Mid1 in HPC development could have broader implications for neurodevelopmental disorders beyond OS, particularly in conditions associated with synaptic dysfunction or memory impairments. The study's focus on the impact of Mid1 on the cAMP signaling pathway, BDNF expression, and synaptic plasticity offers novel mechanisms relevant to both neurodevelopment and neurodegeneration. Moreover, the combination of RNA-seq, electrophysiological measurements, and histological staining provides a multidimensional approach to understanding how Mid1 influences neuronal function and structure.

      Weaknesses:

      (1) The introduction is insufficient, and the number of references is too low. With only nine references, there isn't enough context to adequately explain the background and previous evidence.

      (2) The specificity of behavioral deficits is lacking. The authors indicate learning and memory dysfunction, yet the Y-maze and Morris water maze primarily assess spatial memory. Additional behavioral tests, such as the novel object recognition test for recognition memory or fear conditioning for associative learning, should be included to provide a more comprehensive assessment.

      (3) The manuscript mentions decreased synaptic plasticity but lacks thorough investigation; a more detailed analysis of long-term potentiation (LTP) or depression (LTD) would strengthen the claims. Additionally, while spine morphology is analyzed, incorporating electrophysiological measurements of synaptic strength would better correlate structural changes with functional outcomes.

      (4) The authors performed H&E staining to count the number of hippocampal pyramidal neurons; however, H&E lacks specificity for identifying pyramidal neurons. Neuronal-specific IHC staining would be more appropriate for this quantification. Additionally, the manuscript does not mention the counting method used, which should be clarified.

      (5) Information on the knockout mice used in the study is missing from the Methods section. Additionally, the sex of the mice should be specified, as exploring potential sex-specific differences in the impact of Mid1 deletion could significantly enhance the study's findings.

    1. Primer Validation

      More detail is needed here, include how you determined the limit of detection of your assay. State that you used standard curves to estimate limit of detection (LOD), but see Klymus et al. (2020). Given that your assays are for qualitative purposes, the limit of quantification (LOQ) is likely not relevant in your case. Please verify this to clarify in the main text why the qPCR efficiency may be irrelevant for your assays, but the LOD is.

      Depending on who you will get as an examiner, it may be worthwhile to also mention that you did the testing according to the MIQE guidelines, which I think were incorporated into this paper (see thier Appendix S1 for the checklist):

      • Thalinger, B., Deiner, K., Harper, L. R., Rees, H. C., Blackman, R. C., Sint, D., ... & Bruce, K. (2021). A validation scale to determine the readiness of environmental DNA assays for routine species monitoring. Environmental DNA, 3(4), 823-836.

      • Bustin, S. A. (2024). Improving the quality of quantitative polymerase chain reaction experiments: 15 years of MIQE. Molecular aspects of medicine, 96, 101249.

      • Klymus, K. E., Merkes, C. M., Allison, M. J., Goldberg, C. S., Helbing, C. C., Hunter, M. E., Jackson, C. A., Lance, R. F., Mangan, A. M., Monroe, E. M., Piaggio, A. J., Stokdyk, J. P., Wilson, C. C., & Richter, C. A. (2020). Reporting the limits of detection and quantification for environmental DNA assays. Environmental DNA, 2, 271–282. https://doi.org/10.1002/edn3.29

    2. References

      Probably interesting for background:

      Eklund, A., Frank, J., Nilsson, L., Zetterberg, A., & Mansson, J. 2024. Times of trouble - Seasonal variation in number and severity of attacks on sheep caused by large carnivores and eagles in Sweden. European Journal of Wildlife Research, 70(9): 2-11. DOI: https://doi.org/10.1007/s10344-023-01761-4

      Kvalshaug, O.J. 2013. Inter-specific patterns of depredation on domestic sheep and semi-domestic reindeer in Norway, by a large predator guild. Master Thesis, Norwegian University of Life Sciences, 36.

      Linnell, J.D.C., Nilsen, E.B., Lande, U., Herfindal, I., Odden, J., & Skogen, K. 2005. Zoning as a means of mitigating conflicts with large carnivores: Principles and reality. Conservation Biology Series-Cambridge, 9: 163-175. DOI: https://doi.org/10.1017/cbo9780511614774.011

      Mabille, G., Stien, A., Tveraa, T., Mysterud, A., Brøseth, H., & Linnell, J.D.C. 2015. Sheep farming and large carnivores: What are the factors influencing claimed losses? Ecosphere, 6(5): 1-17. DOI: https://doi.org/10.1890/es14-00444.1

      Strand, G., Hansen, I., De Boon, A., & Sandström, C. 2019. Carnivore Management Zones and their Impact on Sheep Farming in Norway. Environmental Management, 64: 537-552. DOI: https://doi.org/10.1007/s00267-019-01212-4

      Strand, G. 2020. The combined effects of centralization and carnivore management on sheep farmers and sheep farming in Norway. Human Dimensions of Wildlife, 26(4): 321-336. DOI: https://doi.org/10.1080/10871209.2020.1818895

    Annotators

    1. Primer Validation

      More detail is needed here, include how you determined the limit of detection of your assay. State that you used standard curves to estimate limit of detection (LOD), but see Klymus et al. (2020). Given that your assays are for qualitative purposes, the limit of quantification (LOQ) is likely not relevant in your case. Please verify this to clarify in the main text why the qPCR efficiency may be irrelevant for your assays, but the LOD is.

      Depending on who you will get as an examiner, it may be worthwhile to also mention that you did the testing according to the MIQE guidelines, which I think were incorporated into this paper (see thier Appendix S1 for the checklist):

      • Thalinger, B., Deiner, K., Harper, L. R., Rees, H. C., Blackman, R. C., Sint, D., ... & Bruce, K. (2021). A validation scale to determine the readiness of environmental DNA assays for routine species monitoring. Environmental DNA, 3(4), 823-836.

      • Bustin, S. A. (2024). Improving the quality of quantitative polymerase chain reaction experiments: 15 years of MIQE. Molecular aspects of medicine, 96, 101249.

      • Klymus, K. E., Merkes, C. M., Allison, M. J., Goldberg, C. S., Helbing, C. C., Hunter, M. E., Jackson, C. A., Lance, R. F., Mangan, A. M., Monroe, E. M., Piaggio, A. J., Stokdyk, J. P., Wilson, C. C., & Richter, C. A. (2020). Reporting the limits of detection and quantification for environmental DNA assays. Environmental DNA, 2, 271–282. https://doi.org/10.1002/edn3.29

    Annotators

    1. Reviewer #2 (Public review):

      Summary:

      The Flower protein is expressed in various cell types, including neurons. Previous studies in flies have proposed that Flower plays a role in neuronal endocytosis by functioning as a Ca2+ channel. However, its precise physiological roles and molecular mechanisms in neurons remain largely unclear. This study employs C. elegans as a model to explore the function and mechanism of FLWR-1, the C. elegans homolog of Flower. This study offers intriguing observations that could potentially challenge or expand our current understanding of the Flower protein. Nevertheless, further clarification or additional experiments are required to substantiate the study's conclusions.

      Strengths:

      A range of approaches was employed, including the use of a flwr-1 knockout strain, assessment of cholinergic synaptic activity via analyzing aldicarb (a cholinesterase inhibitor) sensitivity, imaging Ca2+ dynamics with GCaMP3, analyzing pHluorin fluorescence, examination of presynaptic ultrastructure by EM, and recording postsynaptic currents at the neuromuscular junction. The findings include notable observations on the effects of flwr-1 knockout, such as increased Ca2+ levels in motor neurons, changes in endosome numbers in motor neurons, altered aldicarb sensitivity, and potential involvement of a Ca2+-ATPase and PIP2 binding in FLWR-1's function.

      Weaknesses:

      (1) The observation that flwr-1 knockout increases Ca2+ levels in motor neurons is notable, especially as it contrasts with prior findings in flies. The authors propose that elevated Ca2+ levels in flwr-1 knockout motor neurons may stem from "deregulation of MCA-3" (a Ca2+ ATPase in the plasma membrane) due to FLWR-1 loss. However, this conclusion relies on limited and somewhat inconclusive data (Figure 7). Additional experiments could clarify FLWR-1's role in MCA-3 regulation. For instance, it would be informative to investigate whether mutations in other genes that cause elevated cytosolic Ca2+ produce similar effects, whether MCA-3 physically interacts with FLWR-1, and whether MCA-3 expression is reduced in the flwr-1 knockout.

      (2) In silico analysis identified residues R27 and K31 as potential PIP2 binding sites in FLWR-1. The authors observed that FLWR-1(R27A/K31A) was less effective than wild-type FLWR-1 in rescuing the aldicarb sensitivity phenotype of the flwr-1 knockout, suggesting that FLWR-1 function may depend on PIP2 binding at these two residues. Given that mutations in various residues can impair protein function non-specifically, additional studies may be needed to confirm the significance of these residues for PIP2 binding and FLWR-1 function. In addition, the authors might consider explicitly discussing how this finding aligns or contrasts with the results of a previous study in flies, where alanine substitutions at K29 and R33 impaired a Flower-related function (Li et al., eLife 2020).

      (3) A primary conclusion from the EM data was that FLWR-1 participates in the breakdown, rather than the formation, of bulk endosomes (lines 20-22). However, the reasoning behind this conclusion is somewhat unclear. Adding more explicit explanations in the Results section would help clarify and strengthen this interpretation.

      (4) The aldicarb assay results in Figure 3 are intriguing, indicating that reduced GABAergic neuron activity alone accounts for the flwr-1 mutant's hyposensitivity to aldicarb. Given that cholinergic motor neurons also showed increased activity in the flwr-1 mutant, one might expect the flwr-1 mutant to display hypersensitivity to aldicarb in the unc-47 knockout background. However, this was not observed. The authors might consider validating their conclusion with an alternative approach or, at the minimum, providing a plausible explanation for the unexpected result. Since aldicarb-induced paralysis can be influenced by factors beyond acetylcholine release from cholinergic motor neurons, interpreting aldicarb assay results with caution may be advisable. This is especially relevant here, as FLWR-1 function in muscle cells also impacts aldicarb sensitivity (Figure S3B). Previous electrophysiological studies have suggested that aldicarb sensitivity assays may sometimes yield misleading conclusions regarding protein roles in acetylcholine release.

      (5) Previous studies have suggested that the Flower protein functions as a Ca²⁺ channel, with a conserved glutamate residue at the putative selectivity filter being essential for this role. However, mutating this conserved residue (E74Q) in C. elegans FLWR-1 altered aldicarb sensitivity in a direction opposite to what would be expected for a Ca²⁺ channel function. Moreover, the authors observed that E74 of FLWR-1 is not located near a potential conduction pathway in the FLWR-1 tetramer, as predicted by Alphafold3. These findings raise the possibility that Flower may not function as a Ca2+ channel. While this is a potentially significant discovery, further experiments are needed to confirm and expand upon these results.

      (6) Phrases like "increased excitability" and "increased Ca2+ influx" are used throughout the manuscript. However, there is no direct evidence that motor neurons exhibit increased excitability or Ca2+ influx. The authors appear to interpret the elevated Ca2+ signal in motor neurons as indicative of both increased excitability and Ca2+ influx. However, this elevated Ca2+ signal in the flwr-1 mutant could occur independently of changes in excitability or Ca2+ influx, such as in cases of reduced MCA-3 activity. The authors may wish to consider alternative terminology that more accurately reflects their findings.

    1. Reviewer #1 (Public review):

      Summary:

      This paper is a relevant overview of the currently published literature on low-intensity focussed ultrasound stimulation (TUS) in humans, with a meta-analysis of this literature that explores which stimulation parameters might predict the directionality of the physiological stimulation effects.

      The pool of papers to draw from is small, which is not surprising given the nascent technology. It seems nevertheless relevant to summarize the current field in the way done here, not least to mitigate and prevent some of the mistakes that other non-invasive brain stimulation techniques have suffered from, most notably the theory- and data-free permutation of the parameter space.<br /> The meta-analysis concludes that there are, at best, weak trends toward specific parameters predicting the direction of the stimulation effects. The data have been incorporated into an open database, that will ideally continue to be populated by the community and thereby become a helpful resource as the field moves forward.

      Strengths:

      The current state of human TUS is concisely and well summarized. The methods of the meta-analysis are appropriate. The database is a valuable resource.

      Weaknesses:

      These are not so much weaknesses but rather comments and suggestions that the authors may want to consider.

      (1) I may have missed this, but how will the database be curated going forward? The resource will only be as useful as the quality of data entry, which, given the complexity of TUS can easily be done incorrectly.

      (2) It would be helpful to report the full statistics and effect sizes for all analyses. At times, only p-values are given. The meta-analysis only provides weak evidence (judged by the p-values) for two parameters having a predictive effect on the direction of neuromodulation. This reviewer thinks a stronger statement is warranted that there is currently no good evidence for duty cycle or sonication direction predicting outcome (though I caveat this given the full stats aren't reported). The concern here is that some readers may gallop away with the impression that the evidence is compelling because the p-value is on the correct side of 0.05.

      (3) This reviewer thinks the issue of (independent) replication should be more forcefully discussed and highlighted. The overall motivation for the present paper is clearly and thoughtfully articulated, but perhaps the authors agree that the role that replication has to play in a nascent field such as TUS is worth considering.

      (4) A related point is that many of the results come from the same groups (the so-called theta-TUS protocol being a clear example). The analysis could factor this in, but it may be helpful to either signpost independent replications, which studies come from the same groups, or both.

      (5) The recent study by Bao et al 2024 J Phys might be worth including, not least because it fails to replicate the results on theta TUS that had been limited to the same group so far (by reporting, in essence, the opposite result).

      (6) The summary of TUS effects is useful and concise. Two aspects may warrant highlighting, if anything to safeguard against overly simplistic heuristics for the application of TUS from less experienced users. First, could the effects of sonication (enhancing vs suppressing) depend on the targeted structure? Across the cortex, this may be similar, but for subcortical structures such as the basal ganglia, thalamus, etc, the idiosyncratic anatomy, connectivity, and composition of neurons may well lead to different net outcomes. Do the models mentioned in this paper account for that or allow for exploring this? And is it worth highlighting that simple heuristics that assume the effects of a given TUS protocol are uniform across the entire brain risk oversimplification or could be plain wrong? Second, and related, there seems to be the implicit assumption (not necessarily made by the authors) that the effects of a given protocol in a healthy population transfer like for like to a patient population (if TUS protocol X is enhancing in healthy subjects, I can use it for enhancement in patient group Y). This reviewer does not know to which degree this is valid or not, but it seems simplistic or risky. Many neurological and psychiatric disorders alter neurotransmission, and/or lead to morphological and structural changes that would seem capable of influencing the impact of TUS. If the authors agree, this issue might be worth highlighting.

    1. Reviewer #2 (Public review):

      Park et al. set out to test two competing hypotheses about the role of the medial prefrontal cortex (PFC) in cognitive control, the ability to use task-relevant cues and ignore task-irrelevant cues to guide behavior. The "central computation" hypothesis assumes that cognitive control relies on computations performed by the PFC, which then interacts with other brain regions to accomplish the task. Alternatively, the "local computation" hypothesis suggests that computations necessary for cognitive control are carried out by other brain regions that have been shown to be essential for cognitive control tasks, such as the dorsal hippocampus and the thalamus. If the central computation hypothesis is correct, PFC lesions should disrupt cognitive control. Alternatively, if the local computation hypothesis is correct, cognitive control would be spared after PFC lesions. The task used to assess cognitive control is the active place avoidance task in which rats must avoid a section of a rotating arena using the stationary room cues and ignoring the local olfactory cues on the rotating platform. Performance on this task has previously been shown to be disrupted by hippocampal lesions and hippocampal ensembles dynamically represent the room and arena depending on the animal's proximity to the shock zone. They found no group (lesion vs. sham) differences in the three behavioral parameters tested: distance traveled, latency to enter the shock zone, and number of shock zone entries for both the standard task and the "conflict" task in which the shock zone was rotated by 180 degrees. The only significant difference was the savings index; the lesion group entered the new shock zone more often than the sham group during the first 5 minutes of the second conflict session. This deficit was interpreted as a cognitive flexibility deficit rather than a cognitive control failure. Next, the authors compared cytochrome oxidase activity between sham and lesion groups in 14 brain regions and found that only the amygdala showed significant elevation in the lesion vs. sham group. Pairwise correlation analysis revealed a striking difference between groups, with many correlations between regions lost in the lesion group (between reuniens and hippocampus, reuniens and amygdala and a correlation between dorsal CA1 and central amygdala that appeared in the lesion group and were absent in the sham group. Finally, the authors assessed dorsal hippocampal representations of the spatial frame (arena vs. room) and found no differences between lesion and sham groups. The only difference in hippocampal activity was reduced overdispersion in the lesion group compared to the sham group on the pretraining session only and this difference disappeared after the task began. Collectively, the authors interpret their findings as supporting the local computation hypothesis; computations necessary for cognitive control occur in brain regions other than the PFC.

      Strengths:

      (1) The data were collected in a rigorous way with experimental blinding and appropriate statistical analyses.

      (2) Multiple approaches were used to assess differences between lesion and sham groups, including behavior, metabolic activity in multiple brain regions, and hippocampal single-unit recording.

      Weaknesses:

      (1) Only male rats were used with no justification provided for excluding females from the sample.

      (2) The conceptual framework used to interpret the findings was to present two competing hypotheses with mutually exclusive predictions about the impact of PFC lesions on cognitive control. The authors then use mainly null findings as evidence in support of the local computation hypothesis. They acknowledge that some people may question the notion that the active place avoidance task indeed requires cognitive control, but then call the argument "circular" because PFC has to be involved in cognitive control. This assertion does not address the possibility that the active place avoidance task simply does not require cognitive control.

      (3) The authors did not link the CO activity with the behavioral parameters even though the CO imaging was done on a subset of the animals that ran the behavioral task nor did they make any attempt to interpret these findings in light of the two competing hypotheses posed in the introduction. Moreover, the discussion lacks any mechanistic interpretations of the findings. For example, there are no attempts to explain why amygdala activity and its correlation with dCA1 activity might be higher in the PFC lesioned group.

      (4) Publishing null results is important to avoid wasting animals, time, and money. This study's results will have a significant impact on how the field views the role of the PFC in cognitive control. Whether or not some people reject the notion that the active place avoidance task measures cognitive control, the findings are solid and can serve as a starting point for generating hypotheses about how brain networks change when deprived of PFC input.

    1. Reviewer #1 (Public review):

      Summary:

      The study by Xu and colleagues provides a useful study of brainstem circuits involved in evoked respiratory reflexes that they define to be cough or cough-like in nature. The study is conducted in mice which has the benefit of allowing for the use of modern transgenic tools, although many of the experiments end up using viral vector-based approaches that could be deployed in any species. The disadvantage of the mouse model is understanding the true identity of the respiratory event that is defined as cough. This limitation requires careful interrogation in order to understand the biology of the circuit under investigation. In this respect, the authors provide an incomplete description of a putative cough pathway linking the caudal spinal trigeminal nucleus with the ventral respiratory group. Neurons assigned as CaMKII+ with putative inputs from the paratrigeminal nucleus are central to this circuit, although the evidence for each of these claims is relatively weak or non-existent. Overall, the study employs interesting methods but limitations in methods and details of methods reduce interpretation of the study outcomes.

      Strengths:

      The use of modern methods to investigate brainstem circuits involved in an essential respiratory reflex.

      Weaknesses:

      (1) The most significant issue that needs careful consideration is the exact respiratory response, which is called a cough. The authors show a trace from their plethysmography recordings and superimpose the 3 phases of cough (inspiration, compression, expiration) with confidence, yet the parameters used to delineate these phases are unclear. Of more concern, an identical respiratory trace was reported recently as a sneeze in Jiang et al Cell 2024 (PMID 39243765). Comparing Figure 1 in the Xu study with Figure 5 in the Jiang study, it is impossible to see any difference in the respiratory trace that would allow the assignment of one as cough and the other as sneeze. The audio signals also look remarkably similar and the purported cough signal in the Jiang study is quite different. Gannot et al Nat Neurosci 2024 (PMID 38977887) seems to agree with Xu in the identity of a cough signal, but Li et al Cell 2021 (PMID 34133943) again labels these as sneezes. One of the older studies that tried to classify respiratory signals in mice (Chen et al PlosONE 2013) labeled the Jiang cough trace as a deep inspiration, while sneeze looks different again. To add further confusion, Zhang et al AJP 2017 (PMID 28228416 ) provide yet another respiratory plethysmography trace that they define as a cough, and label responses discussed above as expiration reflexes. This begs the question - who, if anyone, is correct? Interpreting the circuits underlying these peculiar mouse responses depends on accuracy in defining the response in the first instance.

      (2) The involvement of the causal nSp5 in cough is an unexpected finding. Some understanding of if and how vagal afferent inputs reach this location would help strengthen the manuscript. The authors claim in the discussion that the nucleus of the solitary tract is not the source of inputs, but rather they may arise from the paratrigeminal nucleus (although no data is presented to support this claim). This could fit with the known jugular vagal afferent pathway, which is embryologically distinct and terminates in trigeminal regions, rather than the NTS. But if this is correct, what does this finding then say about the purported involvement of NTS neurons in cough in mice, for example, the recent study by Gannot et al Nat Neurosci where Tac1-expressing NTS neurons were integral for what they call cough in mice? Xu and colleagues are encouraged to resolve their input circuitry so that we can better understand the pathway under investigation and how it relates to the NTS pathway. Related to this, and the issues differentiating cough-like responses from sneeze, the authors will need to consider how to differentiate their cough-like circuitry from the sneeze pathway from the caudal nSp5 to the cVRG as reported by Li et al Cell 2021. It seems highly possible that the two groups are studying the same circuitry, yet the interpretation is confounded by an inability to agree on the identity of the evoked response.

      (3) Injection volumes and titres for AAV transductions are not stated anywhere. The methods (line 484) indicate that different volumes were used for different purposes, but nowhere is this information stated properly. Looking at representative images suggests that volumes were very large, with most of the brainstem often transduced. As single slices are only ever shown it becomes a concern as to how extensive transductions truly are. The authors need to provide complete maps of viral transduction so that readers can understand exactly what regions could contribute to responses, thereby confounding interpretation.

      (4) The authors do not provide any data to explore the impacts of manipulations on basal breathing. This is important as impacts on the respiratory patterning will likely have profound effects on evoked responses that are not related to the specific pathway under investigation. For example, in Figure 2b. breathing looks to be severely compromised in the TKO animals and disrupted in the M4 DREADD animals. Figure 3 also shows the effects of optical stimulation on breathing patterns, which appear like apnea with several breakthrough augmented breaths (some labeled as cough?), although hard to see properly in the traces provided. Figure 5, one would expect VRG inhibition to have impacts on breathing, and the traces supplied appear to suggest this is the case. Please include data showing breathing effects and consider how these may confound your study interpretation.

    2. Reviewer #2 (Public review):

      Summary:

      This study employs a combination of state-of-the-art experimental approaches in mice to identify components of the brainstem circuits involved in the cough reflex in a freely behaving mouse model. The cough reflex is an important respiratory airway defense mechanism, and there has been longstanding interest in defining the neural circuits involved in the mammalian brainstem. Consistent with other recent studies, the present results provide multiple lines of evidence indicating that mice are a suitable model for studying neural mechanisms generating cough behavior. The main novel finding of this study is the authors' results indicating that the caudal spinal trigeminal nucleus (SP5C) nucleus plays a role in generating cough-like behaviors in response to inhaled tussigen. The supporting data presented for this role includes the authors' findings that: (1) neural activity in the SP5C is strongly correlated with tussigen-evoked cough-like behaviors, (2) impairing synaptic outputs or chemogenetic inhibition of SP5C neurons effectively abolished these cough-like reflexes, (3) optogenetically activating a specific subpopulation of excitatory neurons in the SP5C triggers cough-like behaviors, (4) SP5C neurons project monosynaptically to ventral medullary regions containing respiratory circuits that exhibit cough-related neural activity, and (5) specific activation of the SP5C-ventral respiratory circuitry induces robust cough-like behavior without tussive stimuli. This study will be valuable to respiratory neurobiologists studying mechanosensory control of breathing in mammals.

      Strengths:

      (1) The authors developed an experimental paradigm in mice that combines whole-body plethysmography (WBP), audio, and video tracking to assess breathing and putative cough-like behaviors in conscious animals.

      (2) The mouse model enables optogenetic, chemogenetic, virus-based circuit tracing and manipulation, and in vivo fiber photometry to analyze neural activity and define circuity in the medulla-producing cough-like behavior.

      (3) Multiple lines of evidence from these experimental approaches support the conclusion that the SP5C nucleus plays a role in the respiratory reflex behaviors studied in mice, but there is uncertainty that these behaviors are definitively cough.

      Weaknesses:

      (1) This paper lacks essential quantitative details about the number of animals studied explicitly for many of the experimental paradigms presented and for statistical analyses as well as to verify replication of the neuroanatomical data presented.

      (2) The authors' evidence is incomplete that the reflex behavior produced in their mouse model is definitively cough, limiting functional interpretation of the putative circuit identified and requiring more thorough experimental interrogation of the behavior studied.

      (3) The medullary circuit described conveys afferent sensorimotor signals to downstream respiratory circuits to coordinate cough-like motor behavior, but how the circuits that typically mediate the cough reflex, which involve airway-related vagal sensory neurons, operate in conjunction or parallel with the SP5C circuit described has not been determined, which is a significant gap in understanding how the present results fit into the neural control of the cough reflex.

    1. Reviewer #3 (Public review):

      Summary:

      This paper points out an inconsistency of the roles of the striatal spiny neurons projecting to the indirect pathway (iSPN) and the synaptic plasticity rule of those neurons expressing dopamine D2 receptors and proposes a novel, intriguing mechanisms that iSPNs are activated by the efference copy of the chosen action that they are supposed to inhibit.

      The proposed model was supported by simulations and analysis of the neural recording data during spontaneous behaviors.

      Strengths:

      Previous models suggested that the striatal neurons learn action-value functions, but how the information about the chosen action is fed back to the striatum for learning was not clear. The author pointed out that this is a fundamental problem for iSPNs that are supposed to inhibit specific actions and its synaptic inputs are potentiated with dopamine dips.

      The authors propose a novel hypothesis that iSPNs are activated by efference copy of the selected action which they are supposed to inhibit during action selection. Even though intriguing and seemingly unnatural, the authors demonstrated that the model based on the hypothesis can circumvent the problem of iSPNs learning to disinhibit the actions associated with negative reward errors. They further showed by analyzing the cell-type specific neural recording data by Markowitz et al. (2018) that iSPN activities tend to be anti-correlated before and after action selection.

      Weaknesses:

      (1) It is not correct to call the action value learning using the externally-selected action as "off-policy." Both off-policy algorithm Q-learning and on-policy algorithm SARSA update the action value of the chosen action, which can be different from the greedy action implicated by the present action values. In standard reinforcement learning terminology, on-policy or off-policy is regarding the actions in the subsequent state, whether to use the next action value of (to be) chosen action or that of greedy choice as in equation (7).

      It is worth noting that this paper suggested that dopamine neurons encode on-policy TD errors:<br /> Morris G, Nevet A, Arkadir D, Vaadia E, Bergman H (2006). Midbrain dopamine neurons encode decisions for future action. Nat Neurosci, 9, 1057-63. https://doi.org/10.1038/nn1743

      (2) It is also confusing to contract TD learning and Q-learning, as the latter is considered as one type of TD learning. In the TD error signal by state value function (6) is dependent on the chosen action a_{t-1} implicitly in r_t and s_t based on the reward and state transition function.

      (3) It is not clear why interferences of the activities for action selection and learning can be avoided, especially when actions are taken with short intervals or even temporal overlaps. How can the efference copy activation for the previous action be dissociated with the sensory cued activation for the next action selection?

      (4) Although it may be difficult to single out the neural pathway that carries the efference copy signal to the striatum, it is desired to consider their requirements and difference possibilities. A major issue is that the time delay from actions to reward feedback can be highly variable.

      An interesting candidate is the long-latency neurons in the CM thalamus projecting to striatal cholinergic interneurons, which are activated following low-reward actions:<br /> Minamimoto T, Hori Y, Kimura M (2005). Complementary process to response bias in the centromedian nucleus of the thalamus. Science, 308, 1798-801. https://doi.org/10.1126/science.1109154

      (5) In the paragraph before Eq. (3), Eq. (1) should be Eq. (2) for the iSPN.

    1. Reviewer #2 (Public review):

      Summary:

      The authors analyze parameters related to anisotropy and gyrification in the developing human brain and describe an increase in tissue fraction (TF) across development. They correlate TF and sulcal depth in the CP and SP across local neighborhoods, describing a negative correlation. Also, they perform age-mismatched correlation of tissue fraction at early stages with sulcal depth at later ones and show correlation inside sulci, which they interpret as indicating the presence of minor structural changes in the brain that precede the development of sulci.

      Strengths:

      The study compiles a large cohort of cases through different developmental ages and performs sophisticated data analysis. Overall, the work is interesting.

      Weaknesses:

      I have some questions. What is the potential meaning of TF? It seems to be an estimator of anisotropy highly related to fractional anisotropy (FA), but it behaves in a complementary manner, increasing along gestation, in sharp contrast with the decrease observed in FA in this study (suppl. fig 3) and by others. Please clarify how it is calculated, what is the potential biological meaning of TF and how it differs from FA.

      The correlations between TF and sulcal depth do not seem to provide much novelty, since as mentioned by the authors, previous evidence has pointed in that direction. The other concept in the paper relates to detecting structural changes in prospective sulcal areas in the cortex, which the authors analyze through the age-mismatched correlation of TF and subsequent sulcation. However, the results do not show a robust correlation as detailed below and do not seem particularly useful, as they require the inclusion of post-hoc information in the model, limiting the strength of the relationship and the predictive value. My main point of criticism is that if TF is a good marker of the structural modifications that will favor the development of sulci later in development, TF should show a map predictive of those sulci (e.g. at GW 25), that is however not the case. It is not necessary to correlate with future sulcal depth, as we know exactly where the primary sulci will develop. Conversely, it seems that TF decreases along the gyrification process, and it might just be a measure of the structural changes accompanying it.

      In Figure 2 it illustrates the increase in TF across GA, but no R values or significance values are provided. Please add them to evaluate the robustness of the correlation.

      In previous work of the authors, the subplate is not clearly distinguished from the subcortical white matter after 31 GW, as it starts to disintegrate (Kostovic et al., 2002; Calixto et al., 2024). However, in this manuscript, the SP is differentiated at those later ages. The methods section describes a 2 mm thick compartment below the cortical plate. However, if that is the case, it seems quite arbitrary (to coincide with the resolution of the diffusion imaging) and risks analyzing a compartment that is no longer present. Please explain the criteria followed for such distinction and more importantly, how such distinction is reliable considering the low detectability described in previous studies. In this regard, the discussion described that a rapid increase in TF was only seen in the SP after 30 GW, but maybe this increase would reflect the dissipation of the SP and the transformation of that space in subcortical white matter, with a much more expected anisotropy. The authors should review this.

      The analysis describes a negative correlation between tissue fraction and sulcal depth when gyrification proceeds and the authors find that an age-mismatched correlation between tissue fraction in young embryos and sulcal depth in older embryos also shows a negative correlation in future sites of sulcation. However, for the correlation to exist, the tissue fraction in young individuals should already show low values in the prospective sulci, but no clear changes can be seen at GW 25 or 27 in lissencephalic areas that will bear sulci later on, as is the case of the central sulcus at GW 25 or the STS at GW 27, the latter showing very high tissue fraction (instead of the expected low).

      Another question refers to Figures 3b and c. The graphs represent specific neighborhoods in the central sulcus at 27 and 35 GW. It can be argued that those neighborhoods might not be representative of the brain or of the whole sulcus. Please show the graph with all neighborhoods, which will provide more definitive evidence of the existence of the correlation. In this regard, the average graphs represented in Figure 3F seem to show a clear correlation at 27 GW in the subplate, but the correlation seems to fade at later stages (in both SP and CP), with both sulci and gyri exhibiting a negative correlation while other sulcal areas do not exhibit correlation. I think all points should be included in the correlation to better support the hypothesis.

      Figure 4 shows the age-mismatched correlations, but they do not seem convincing particularly when they should be more useful, at 25 GW. Indeed, as seen in both Figures A and C, the central sulcus shows a negative correlation only in a few spots on one hemisphere, while (in C) most of the prospective sulcus shows a positive correlation, contrary to the hypothesis.

      Lastly, the authors performed an age-mismatched correlation between TF at different ages and the sulcal depth at 35W, when it is maximal. This maximal depth might be "pushing" the correlation to significant territory. The authors should provide correlation also with the sulcal depth at other GAs, such as P29, P31, or P33, and analyze how the correlations hold.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewers

      We thank all three reviewers for their time and engagement, for their generally supportive comments, and for raising some important concerns. We are pleased to submit a significantly revised manuscript where we tried to accommodate all suggested changes and extensions. Importantly, we have included additional experiments that support the relevance of FACT for the overall stability of the inner kinetochore. Below is a detailed point-to-point response. Changes to the manuscript relative to the original submission have been highlighted at the end of this response.


      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      Summary: The authors investigated molecular interactions between CCAN and FACT complexes. They revealed contact domains in FACT and the cognate subcomplexes of CCAN by in vitro reconstitution from recombinant proteins followed by SEC and pull-down assay.

      They also revealed a couple of potential means to control interactions between FACT and the CCAN. They conclude that phosphorylation of FACT by CK2 is essential for binding to the CCAN; and CENP-A nucleosomes or DNA prevent CCAN from interacting with FACT.

      Major comments:

      The authors show that phosphorylation of FACT is essential for interaction with CCAN.

      They argue that this phosphorylation is partly catalysed by CK2.

      My concerns are:

      -1- The authors assume that the sites phosphorylated in insect cell are also phosphorylated in human cells. However, it is not demonstrated which residues are phosphorylated in human cells and whether they match those from insect cells. Whether phosphorylation of recombinant proteins in insect cells is physiologically relevant to mammalian is uncertain. Kinetochore components are not very well conserved evolutionarily, thus their regulation may be different.

      We thank the reviewer for these remarks, which we answer together with point 2 below.

      -2- They identify several residues which are phosphorylated by CK2 in vitro. However, these are not necessarily the same sites as those phosphorylated in insect cells or more importantly in human cells. The in vitro phosphorylation by CK2 did not restore binding affinity in full, suggesting phosphorylation at other sites may be critical for interaction with CCAN. Further evidence is required to support the claim that those sites are phosphorylated in vivo and important for integrity of kinetochores in mitosis.

      Our analysis of FACT phosphorylation represents a relatively small part of a very data-rich paper, and was not meant to be exhaustive. Nonetheless, the reviewer's comments are important and well received. We agree that we have no definitive evidence that the same sites are phosphorylated in insect cells, in vitro, and in human cells. However, it is quite remarkable, and supports specificity, that the interaction with FACT, lost after dephosphorylation in vitro, is restored with CK2 and not with three additional mitotic kinases (CDK1, Aurora B, and PLK1 - Figure S8D). We also note that S437, S444 and S667 of SSRP1, which were phosphorylated by CK2 in vitro, were also detected as phosphorylated sites on recombinant FACT purified from insect cells (Table S1). So collectively, while we agree with the reviewer that the analysis of FACT phosphorylation is not complete, it does significantly add to the manuscript and more generally to the FACT field.

      Minor comments:

      Figure 1H

      I am confused with 4 stars shown at the top of the right plot. If the 4 stars are meant to show a significant difference, then the statement in the text (line 123) is not correct.

      "SSRP1 localization was also largely unaffected ..."

      Similar discrepancies are found in Figures 3H (line 212), Figures S2 (line 122), S5I (line 197), and S6I (line209). Figure S6H is not referred to anywhere.

      There is no description for the numbers at the top. Are they mean values? Do red bars represent S.D.?

      We thank the reviewer for these comments. In this revised version of the manuscript, we have substantially improved the quantification and statistical analysis. The main problem with the previous automated analysis is that the non-circular shape of the CREST-staining led to inconsistencies with the statistical analysis and the statement. In contrast, the same analysis works well when the CENP-C signal was used for KT identification (e.g. in Figure 3), as CENP-C staining yields well separated circular signals ideally suited for our automated identification of individual KTs and subsequent retrieval of fluorescence intensities. We have therefore modified our analysis macro for all experiments where CREST was used as a reference. We used Othsu-thresholding of the DAPI signal for generating a segmentation mask per each cell. Then, integrated cell intensities were calculated for each fluorescence channel based on the DAPI reference mask. With these adjustments, the statistical analyses (Figures 1, S2, S3) support the claim presented. We have updated the Methods and Results sections to reflect the revised analysis.

      The numbers on top of the graphs are median values, bars represent interquartile ranges. We have now included the description in figure legends.

      We appreciate your feedback, which prompted us to clarify and enhance the rigor of our approach.

      We are now referring to Fig. S6H in the text.

      Figure 1D

      There is no description of R* to the right of gels.

      We have added a description of R* to the relevant figure legend.

      Figure S2

      A 4 hour nocodazole treatment is too short to drive all cells into mitosis. Is the data taken from mitotic cells only?

      Yes, the data are taken only from the mitotic population. We have now clarified this in the figure legend.

      Reviewer #1 (Significance (Required)):

      The interaction of FACT with kinetochore components has been known for several years. However how FACT contributes to architecture or function of kinetochore is not very well understood. How the FACT complex, which is known for its established role as a histone chaperone, is involved in kinetochore assembly/architecture will attract interest in several fields of basic research including epigenetics, mitosis, structural biology.

      We are grateful to the reviewer for this supportive statement that recognizes the broad potential interest of the manuscript.

      Identification of CCAN subunits that interact with FACT is important for future analysis to understand the kinetochore function of FACT. The authors identified OPQRU and CHIKM subcomplex in addition to TW as FACT-interacting domains. These subcomplexes are geographically scattered in a 3D model of CCAN holocomplex. Stoichiometry of CCAN and FACT might be informative whether a single or multiple FACT binds to the multiple sites of CCAN. The authors do not address whether these multiple sites are occupied simultaneously, separately or sequentially.

      We thank the reviewer for raising this point. As mentioned in the discussion, we have not yet been able to perform a structural analysis of the FACT/CCAN complex to determine its stoichiometry. However, we have now added a newexperiment (Figure S1B,C) where we quantified in-gel tryptophan fluorescence after analytical size-exclusion chromatography. This strongly suggests that FACT and CCAN form a complex with a 1:1 stoichiometry. Nevertheless, we cannot comment on which sites are occupied.

      The statement at the end of Abstract (lines 23-25) is a speculative hypothesis without evidence for "a pool of CCAN that is not stably integrated into chromatin", "chaperoning CCAN", and "stabilisation of CCAN".

      We agree with the reviewer that this is speculative, and have therefore modified the Abstract to clearly indicate this point.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      FACT is a histone chaperone and is involved in various events on chromatin such as transcription and replication. In addition, FACT interacts with various kinetochore components, suggesting potential functions at the kinetochore. However, it is largely unclear how FACT functions in the kinetochore. Authors of this MS took the biochemical approach to understand roles of FACT in the kinetochore.

      Authors demonstrated that FACT forms a complex with the constitutive centromere associated network (CCAN), which contains 16 subunits on centromeric chromatin, using multiple binding sites. They also showed that casein kinase II (CK2) phosphorylated FACT and dephosphorylated FACT did not bind to CCAN. Furthermore, they displayed that DNA addition disrupt the stable FACT-CCAN complex.

      Overall, while authors have done solid and high-quality biochemical analyses (these are elegant), it is still unclear how FACT plays its roles in the kinetochore. Simple knockout or knockdown study on FACT might be complicated, because FACT has multi-functions. If authors can identify specific regions of FACT for interaction with CCAN, they would put specific mutations into FACT to analyze phenotype. Although they did not reach a high-resolution structure for the FACT-CCAN complex, they can utilize AlphaFold and test specific interaction regions, biochemically. Then, using such information, significance of FACT-CCAN interaction might be testable in cells. Such a kind of study would be expected. In summary, biochemical parts are beautiful, but the paper did not address significance of FACT-CCAN interaction.

      We thank the reviewer for praising the biochemical work in our manuscript. The reviewer, however, also underscored the limits of our functional analysis. The reviewer proposes generating separation-of-function mutants in a minimal kinetochore-binding region. Indeed, we have identified the minimal domain for the interaction of FACT with kinetochores. However, this information is insufficient for a reliable functional analysis at this stage, as the region we identified encompasses the AIDs and the phosphorylation-rich region, both of which have been previously shown to be important for transcription and other functions. Furthermore, any suitable mutant should be tested in cells devoid of endogenous FACT, raising the concern that the resulting phenotype may be indirect.

      Nonetheless, as we wanted to provide at least an initial answer to the reviewer's concern, we enriched the manuscript by adding experiments in a recently published cell line (K562-SSRP1-dTAG) where FACT levels can be controlled with a small molecule (Žumer et al. Mol Cell., 2024) and that the authors generously shared with us. In this line, which grows in suspension and that we had to adapt to grow on a substrate for imaging, we were able to deplete FACT while cells were arrested in mitosis. We are glad to report that we found a significant reduction in the kinetochore levels of CENP-TW after this treatment, which is consistent with other conclusions from our study. These experiments add an initial functional characterization of the interaction of FACT with kinetochores, and extend the significance of the manuscript. We refer to these results again below in response to specific point 5.

      Specific point

      Authors showed nice mitotic localization of FACT. Can they observe this localization by a usual IF? Using GFP fusion, do they observe kinetochore localization like IF experiments?

      The localization of FACT was observed using pre-extraction and fixation followed by antibody staining. We have now added a panel demonstrating mitotic localization of GFP-SSRP1 at the kinetochore in transiently transfected RPE-1 cells (Fig. S2A).

      On page 7, authors tested CENP-C binding to FACT and they conclude that C-teminal region of CENP-C preferentially binds to FACT. However, they used N-terminal region of CENP-C (2-545) for CCAN-FACT complex formation in entire MS. therefore, this is complicated, and story on CENP-C N-terminal region can be removed from this MS.

      We were only able to purify full-length CENP-C with tags at the N- and C-terminus, including an MBP tag with a stabilizing effect. At the time of our first successful purification of full-length CENP-C, we had already established the solid phase assay using MBPFACT as a bait on amylose beads and CENP-C2-545HIKM as one of the preys. As we cannot obtain stable full-length CENP-C without MBP, this form of CENP-C is incompatible with our pull-down assay. Nevertheless, CENP-C2-545 still has low affinity for FACT, influencing the FACT/CCAN interaction independent of the PEST-rich region. We, therefore, opted for keeping this information in the manuscript.

      On page 9, authors suddenly focus on N-terminal tails of CENP-Q and CENP-U. Why did they focus on this region. They should explain this. If they perform a structural prediction, they should describe this point.

      Thanks for raising this point. We focused on the N-terminal tails of CENP-QU because they are known interaction hubs. We have now added a sentence to introduce this concept and citing the appropriate literature.

      I agree the fact that FACT phosphorylation is required for FACT-CCAN interaction. They may explain how the phosphorylation contributes to stable FACT-CCAN interaction.

      We have added a sentence explaining that FACT is known to mimic DNA, and negative charges due to phosphorylation could drive this effect. A more detailed mechanistic understanding will require identifying specific phosphorylation sites required for the interaction.

      Readers really want to know phenotype, if FACT-CCAN interaction was compromised without disruption pf CCAN assembly in cells. Although I agree that FACT has some functions in the kinetochore, it is still unclear what FACT does in the kinetochore.

      We wholeheartedly agree with the reviewer. As depletion of FACT by RNAi required 48 h, an unreasonably long time for this multifunctional protein. We therefore turned to engineering RPE-1 cells for rapid degradation of SSRP1. While these attempts were unsucessful, earlier this year, Žumer et al. Mol Cell., 2024 reported generating a K562-SSRP1-dTAG cell line growing in suspension. As already reported, this cell line now allowed to demonstrate a significant effect on the kinetochore stability of CENP-TW upon mitotic depletion of FACT.

      Reviewer #2 (Significance (Required)):

      As mentioned above, biochemical parts are beautiful, but the paper did not address significance of FACT-CCAN interaction.

      We thank the reviewer for this positive assessment. In this revision, we have obtained initial evidence that FACT contributes to kinetochore stability.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      Main findings:

      The major findings of this paper are:

      Detailed dissection of CCAN subunit interactions and requirements to bind the FACT complex using in vitro reconstituted components Binding of FACT and nucleosomes to CENP-C are mutually exclusive FACT phosphorylation by CK2 enhances interaction with CCAN FACT localization in mitosis depends on the CCAN CCAN binding to FACT is outcompeted by DNA and CENP-A nucleosomes The claims and conclusions of the paper are supported by the data and do not require additional experiments. All experiments include biological replicates and appropriate controls.

      We are thankful to the reviewer for this very positive assessment of our work.

      Minor comments

      Intro: • Line 81: In humans [...], here it is worth mentioning that in Drosophila, FACT subunits have been shown to interact directly with the CENP-A assembly factor CAL1 (Ref 61). This paper is perfunctorily cited once in the context of its implication of FACT in CENP-A deposition, but it merits more consideration when setting up the foundational context for the present work.

      We have extended the Introduction and discuss the specified paper more thoroughly.

      Figure 1:

      1F: Add insets.

      Done.

      1G and all other figures containing IFs: Avoid red/green color scheme (red-green colorblindness is fairly common, affecting about 8% of men).

      Done.

      1E: Please add a table summarizing interactions.

      We have included this table as Fig. S1E.

      Results: • It's fine to direct readers to previous work in which you reconstituted the CCAN, but the text should mention how proteins are exogenously expressed and purified (as done for FACT in line 247).

      Done.

      Line 113: FACT has been shown to localize to the mitotic kinetochore also in Drosophila (Ref 61).

      We have included this information now.

      Line 132: The authors should cite work from the Drosophila system as well when they mention centromere transcriptional activity in mitosis (e.g. https://doi.org/10.1083/jcb.201404097; https://doi.org/10.1083/jcb.201611087; and Ref 61).

      We have added these citations.

      Figure 2F: The authors could use a line to mark the region interacting with FACT and that interacting with CENP-A to help summarize the data in this diagram.

      Done.

      Figure 4: Highlight constructs n.2 (FACT^TRUNC) since these are sufficient for interaction (e.g., use a box around them).

      Done.

      Line 276: "CCAN decodes CENP-A^NCP..." What do the authors mean by "decodes"? This whole sentence would benefit from clearer language.

      We thank the reviewer for this suggestion and have aimed for clearer language.

      Figure 6: There's a lot of information in these experiments that would benefit from two schematics, one showing the mechanism of FACT + CCAN binding with DNA and one with CENP-A nucleosomes.

      Done.

      Discussion: The authors discuss FACT localization at kinetochores in mitosis. In Drosophila Schneider cells, FACT is observed enriched at the centromeres in both mitosis and interphase (Ref 61). The authors mention their inability to detect FACT in interphase in the discussion, but I did not find this mentioned in the results. The authors state that FACT "redistributes to the entire chromosome" upon entry into interphase. They cite Figure 1F in reference to this statement, but the staining in the early G1 panel is difficult to interpret with the low signal/noise scaling of CENP-C and the lack of zoom insets. Their protocol uses a pre-extraction step with Triton prior to fixation. Apparently, this was not enough to reveal FACT in interphase, but better images and a brief description are warranted.

      We have now added a staining of SSRP1 in interphase in the panel.

      It is unlikely that FACT would change its localization pattern in mitosis. A more likely possibility is that in mitosis FACT is not redistributed, but rather more tightly bound (and thus less easily extracted by Triton treatment) at kinetochores, while along the arms FACT is more readily removed by extraction because at this time transcription is repressed and FACT is likely less engaged in transcription-mediated histone destabilization.

      We thank the reviewer for these remarks and have updated the Discussion.

      Given the well-known function of FACT in transcription and the many studies linking transcription to centromere maintenance, including with the involvement of FACT, the model that "the localization of FACT at the kinetochore coincides with active centromeric transcription in mitosis and interphase" is very tempting. A speculative model would go a long way to help the reader visualize all these complex aspects of FACT's interactions and possible functions.

      We agree with the reviewer that such a model is tempting. However, we also feel that it would be rather speculative at this stage and we feel that the manuscript does not provide sufficient data to support the model.

      Reviewer #3 (Significance (Required)):

      The strongest aspect of the study is the detailed characterization of protein-protein interactions, as well as competition with DNA and CENP-A nucleosomes. The siRNA experiments in cells complement this largely in vitro study. However, a limitation of the study is that it does not shed light on what FACT might be doing at the centromere. Additionally, it does not sufficiently provide context for these findings in relation to previous studies that have demonstrated the roles of FACT at the centromere in budding yeast, fission yeast, and Drosophila. Nonetheless, this study provides valuable insights into the details of FACT interactions at the kinetochore and will be of interest to readers interested in centromeres and kinetochore. I am a centromere biologist with molecular and cell biology expertise.

      We are very grateful to the reviewer for his/her support.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Main findings:

      The major findings of this paper are:

      • Detailed dissection of CCAN subunit interactions and requirements to bind the FACT complex using in vitro reconstituted components
      • Binding of FACT and nucleosomes to CENP-C are mutually exclusive
      • FACT phosphorylation by CK2 enhances interaction with CCAN
      • FACT localization in mitosis depends on the CCAN
      • CCAN binding to FACT is outcompeted by DNA and CENP-A nucleosomes The claims and conclusions of the paper are supported by the data and do not require additional experiments. All experiments include biological replicates and appropriate controls.

      Minor comments

      Intro:

      • Line 81: In humans [...], here it is worth mentioning that in Drosophila, FACT subunits have been shown to interact directly with the CENP-A assembly factor CAL1 (Ref 61). This paper is perfunctorily cited once in the context of its implication of FACT in CENP-A deposition, but it merits more consideration when setting up the foundational context for the present work.

      Figure 1:

      • 1F: Add insets.
      • 1G and all other figures containing IFs: Avoid red/green color scheme (red-green colorblindness is fairly common, affecting about 8% of men).
      • 1E: Please add a table summarizing interactions.

      Results:

      • It's fine to direct readers to previous work in which you reconstituted the CCAN, but the text should mention how proteins are exogenously expressed and purified (as done for FACT in line 247).
      • Line 113: FACT has been shown to localize to the mitotic kinetochore also in Drosophila (Ref 61).
      • Line 132: The authors should cite work from the Drosophila system as well when they mention centromere transcriptional activity in mitosis (e.g., https://doi.org/10.1083/jcb.201404097; https://doi.org/10.1083/jcb.201611087; and Ref 61).
      • Figure 2F: The authors could use a line to mark the region interacting with FACT and that interacting with CENP-A to help summarize the data in this diagram.
      • Figure 4: Highlight constructs n.2 (FACT^TRUNC) since these are sufficient for interaction (e.g., use a box around them).
      • Line 276: "CCAN decodes CENP-A^NCP..." What do the authors mean by "decodes"? This whole sentence would benefit from clearer language.
      • Figure 6: There's a lot of information in these experiments that would benefit from two schematics, one showing the mechanism of FACT + CCAN binding with DNA and one with CENP-A nucleosomes.

      Discussion:

      The authors discuss FACT localization at kinetochores in mitosis. In Drosophila Schneider cells, FACT is observed enriched at the centromeres in both mitosis and interphase (Ref 61). The authors mention their inability to detect FACT in interphase in the discussion, but I did not find this mentioned in the results. The authors state that FACT "redistributes to the entire chromosome" upon entry into interphase. They cite Figure 1F in reference to this statement, but the staining in the early G1 panel is difficult to interpret with the low signal/noise scaling of CENP-C and the lack of zoom insets. Their protocol uses a pre-extraction step with Triton prior to fixation. Apparently, this was not enough to reveal FACT in interphase, but better images and a brief description are warranted. It is unlikely that FACT would change its localization pattern in mitosis. A more likely possibility is that in mitosis FACT is not redistributed, but rather more tightly bound (and thus less easily extracted by Triton treatment) at kinetochores, while along the arms FACT is more readily removed by extraction because at this time transcription is repressed and FACT is likely less engaged in transcription-mediated histone destabilization. Given the well-known function of FACT in transcription and the many studies linking transcription to centromere maintenance, including with the involvement of FACT, the model that "the localization of FACT at the kinetochore coincides with active centromeric transcription in mitosis and interphase" is very tempting. A speculative model would go a long way to help the reader visualize all these complex aspects of FACT's interactions and possible functions.

      Significance

      The strongest aspect of the study is the detailed characterization of protein-protein interactions, as well as competition with DNA and CENP-A nucleosomes. The siRNA experiments in cells complement this largely in vitro study. However, a limitation of the study is that it does not shed light on what FACT might be doing at the centromere. Additionally, it does not sufficiently provide context for these findings in relation to previous studies that have demonstrated the roles of FACT at the centromere in budding yeast, fission yeast, and Drosophila. Nonetheless, this study provides valuable insights into the details of FACT interactions at the kinetochore and will be of interest to readers interested in centromeres and kinetochore.

      I am a centromere biologist with molecular and cell biology expertise.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      FACT is a histone chaperone and is involved in various events on chromatin such as transcription and replication. In addition, FACT interacts with various kinetochore components, suggesting potential functions at the kinetochore. However, it is largely unclear how FACT functions in the kinetochore. Authors of this MS took the biochemical approach to understand roles of FACT in the kinetochore.

      Authors demonstrated that FACT forms a complex with the constitutive centromere associated network (CCAN), which contains 16 subunits on centromeric chromatin, using multiple binding sites. They also showed that casein kinase II (CK2) phosphorylated FACT and dephosphorylated FACT did not bind to CCAN. Furthermore, they displayed that DNA addition disrupt the stable FACT-CCAN complex.

      Overall, while authors have done solid and high-quality biochemical analyses (these are elegant), it is still unclear how FACT plays its roles in the kinetochore. Simple knockout or knockdown study on FACT might be complicated, because FACT has multi-functions. If authors can identify specific regions of FACT for interaction with CCAN, they would put specific mutations into FACT to analyze phenotype. Although they did not reach a high-resolution structure for the FACT-CCAN complex, they can utilize AlphaFold and test specific interaction regions, biochemically. Then, using such information, significance of FACT-CCAN interaction might be testable in cells. Such a kind of study would be expected. In summary, biochemical parts are beautiful, but the paper did not address significance of FACT-CCAN interaction.

      Specific point

      1. Authors showed nice mitotic localization of FACT. Can they observe this localization by a usual IF? Using GFP fusion, do they observe kinetochore localization like IF experiments?
      2. On page 7, authors tested CENP-C binding to FACT and they conclude that C-teminal region of CENP-C preferentially binds to FACT. However, they used N-terminal region of CENP-C (2-545) for CCAN-FACT complex formation in entire MS. therefore, this is complicated, and story on CENP-C N-terminal region can be removed from this MS.
      3. On page 9, authors suddenly focus on N-terminal tails of CENP-Q and CENP-U. Why did they focus on this region. They should explain this. If they perform a structural prediction, they should describe this point.
      4. I agree the fact that FACT phosphorylation is required for FACT-CCAN interaction. They may explain how the phosphorylation contributes to stable FACT-CCAN interaction.
      5. Readers really want to know phenotype, if FACT-CCAN interaction was compromised without disruption pf CCAN assembly in cells. Although I agree that FACT has some functions in the kinetochore, it is still unclear what FACT does in the kinetochore.

      Significance

      As mentioned above, biochemical parts are beautiful, but the paper did not address significance of FACT-CCAN interaction.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The authors investigated molecular interactions between CCAN and FACT complexes. They revealed contact domains in FACT and the cognate subcomplexes of CCAN by in vitro reconstitution from recombinant proteins followed by SEC and pull-down assay.

      They also revealed a couple of potential means to control interactions between FACT and the CCAN. They conclude that phosphorylation of FACT by CK2 is essential for binding to the CCAN; and CENP-A nucleosomes or DNA prevent CCAN from interacting with FACT.

      Major comments:

      The authors show that phosphorylation of FACT is essential for interaction with CCAN. They argue that this phosphorylation is partly catalysed by CK2.

      My concerns are:

      1. The authors assume that the sites phosphorylated in insect cell are also phosphorylated in human cells. However, it is not demonstrated which residues are phosphorylated in human cells and whether they match those from insect cells. Whether phosphorylation of recombinant proteins in insect cells is physiologically relevant to mammalian is uncertain. Kinetochore components are not very well conserved evolutionarily, thus their regulation may be different.
      2. They identify several residues which are phosphorylated by CK2 in vitro. However, these are not necessarily the same sites as those phosphorylated in insect cells or more importantly in human cells. The in vitro phosphorylation by CK2 did not restore binding affinity in full, suggesting phosphorylation at other sites may be critical for interaction with CCAN. Further evidence is required to support the claim that those sites are phosphorylated in vivo and important for integrity of kinetochores in mitosis.

      Minor comments:

      Figure 1H

      I am confused with 4 stars shown at the top of the right plot. If the 4 stars are meant to show a significant difference, then the statement in the text (line 123) is not correct. "SSRP1 localization was also largely unaffected ..." Similar discrepancies are found in Figures 3H (line 212), Figures S2 (line 122), S5I (line 197), and S6I (line209). Figure S6H is not referred to anywhere. There is no description for the numbers at the top. Are they mean values? Do red bars represent S.D.?

      Figure 1D

      There is no description of R* to the right of gels.

      Figure S2

      A 4 hour nocodazole treatment is too short to drive all cells into mitosis. Is the data taken from mitotic cells only?

      Significance

      The interaction of FACT with kinetochore components has been known for several years. However how FACT contributes to architecture or function of kinetochore is not very well understood. How the FACT complex, which is known for its established role as a histone chaperone, is involved in kinetochore assembly/architecture will attract interest in several fields of basic research including epigenetics, mitosis, structural biology.

      Identification of CCAN subunits that interact with FACT is important for future analysis to understand the kinetochore function of FACT. The authors identified OPQRU and CHIKM subcomplex in addition to TW as FACT-interacting domains. These subcomplexes are geographically scattered in a 3D model of CCAN holocomplex. Stoichiometry of CCAN and FACT might be informative whether a single or multiple FACT binds to the multiple sites of CCAN. The authors do not address whether these multiple sites are occupied simultaneously, separately or sequentially.

      The statement at the end of Abstract (lines 23-25) is a speculative hypothesis without evidence for "a pool of CCAN that is not stably integrated into chromatin", "chaperoning CCAN", and "stabilisation of CCAN".

    1. Author response:

      The following is the authors’ response to the previous reviews.

      Reviewer #1 (Public Review):

      In this revision, the authors significantly improved the manuscript. They now address some of my concerns. Specifically, they show the contribution of end-effects on spreading the inputs between dendrites. This analysis reveals greater applicability of their findings to cortical cells, with long, unbranching dendrites than other neuronal types, such as Purkinje cells in the cerebellum.

      They now explain better the interactions between calcium and voltage signals, which I believe improve the take-away message of their manuscript. They modified and added new figures that helped to provide more information about their simulations.

      However, some of my points remain valid. Figure 6 shows depolarization of ~5mV from -75. This weak depolarization would not effectively recruit nonlinear activation of NMDARs. In their paper, Branco and Hausser (2010) showed depolarizations of ~10-15mV.

      More importantly, the signature of NMDAR activation is the prolonged plateau potential and activation at more depolarized resting membrane potentials (their Figure 4). Thus, despite including NMDARs in the simulation, the authors do not model functional recruitment of these channels. Their simulation is thus equivalent to AMPA only drive, which can indeed summate somewhat nonlinearly.

      In the current study, we used short sequences of 5 inputs, since the convergence of longer sequences is extremely unlikely in the network configurations we have examined. This resulted in smaller EPSP amplitudes of ~5mV (Figure 6 - Supplement 2A, B). Longer sequences containing 9 inputs resulted in larger somatic depolarizations of ~10mV (Figure 6 - Supplement 2E, F). Although we had modified the (Branco, Clark, and Häusser 2010) model to remove the jitter in the timing of arrival of inputs and made slight modifications to the location of stimulus delivery on the dendrite, we saw similar amplitudes when we tested a 9-length sequence using (Branco, Clark, and Häusser 2010)’s published code (Figure 6 - Supplement 2I, J). In all the cases we tested (5 input sequence, 9 input sequence, 9 input sequence with (Branco, Clark, and Häusser 2010) code repository), removal of NMDA synapses lowered both the somatic EPSPs (Figure 6 - Supplement 2C,D,G,H,K,L) as well as the selectivity (measured as the difference between the EPSPs generated for inward and outward stimulus delivery) (Figure 6 Supplement 2M,N,O). Further, monitoring the voltage along the dendrite for a sequence of 5 inputs showed dendritic EPSPs in the range of 20-45 mV (Figure 6 - Supplement 2P, Q), which came down notably (10-25mV) when NMDA synapses were abolished (Figure 6 - Supplement 2R, S). Thus, even sequences containing as few as 5 inputs were capable of engaging the NMDA-mediated nonlinearity to show sequence selectivity, although the selectivity was not as strong as in the case of 9 inputs.

      Reviewer #1 (Recommendations for the authors):

      Minor points:

      Figure 8, what does the scale in A represent? I assume it is voltage, but there are no units. Figure 8, C, E, G, these are unconventional units for synaptic weights, usually, these are given in nS / per input.

      We have corrected these. The scalebar in 8A represents membrane potential in mV. The units of 8C,E,G are now in nS.

      Reviewer #2 (Public Review):

      Summary:

      If synaptic input is functionally clustered on dendrites, nonlinear integration could increase the computational power of neural networks. But this requires the right synapses to be located in the right places. This paper aims to address the question of whether such synaptic arrangements could arise by chance (i.e. without special rules for axon guidance or structural plasticity), and could therefore be exploited even in randomly connected networks. This is important, particularly for the dendrites and biological computation communities, where there is a pressing need to integrate decades of work at the single-neuron level with contemporary ideas about network function.

      Using an abstract model where ensembles of neurons project randomly to a postsynaptic population, back-of-envelope calculations are presented that predict the probability of finding clustered synapses and spatiotemporal sequences. Using data-constrained parameters, the authors conclude that clustering and sequences are indeed likely to occur by chance (for large enough ensembles), but require strong dendritic nonlinearities and low background noise to be useful.

      Strengths:

      (1) The back-of-envelope reasoning presented can provide fast and valuable intuition. The authors have also made the effort to connect the model parameters with measured values. Even an approximate understanding of cluster probability can direct theory and experiments towards promising directions, or away from lost causes.

      (2) I found the general approach to be refreshingly transparent and objective. Assumptions are stated clearly about the model and statistics of different circuits. Along with some positive results, many of the computed cluster probabilities are vanishingly small, and noise is found to be quite detrimental in several cases. This is important to know, and I was happy to see the authors take a balanced look at conditions that help/hinder clustering, rather than to just focus on a particular regime that works.

      (3) This paper is also a timely reminder that synaptic clusters and sequences can exist on multiple spatial and temporal scales. The authors present results pertaining to the standard `electrical' regime (~50-100 µm, <50 ms), as well as two modes of chemical signaling (~10 µm, 100-1000 ms). The senior author is indeed an authority on the latter, and the simulations in Figure 5, extending those from Bhalla (2017), are unique in this area. In my view, the role of chemical signaling in neural computation is understudied theoretically, but research will be increasingly important as experimental technologies continue to develop.

      Weaknesses:

      (1) The paper is mostly let down by the presentation. In the current form, some patience is needed to grasp the main questions and results, and it is hard to keep track of the many abbreviations and definitions. A paper like this can be impactful, but the writing needs to be crisp, and the logic of the derivation accessible to non-experts. See, for instance, Stepanyants, Hof & Chklovskii (2002) for a relevant example.

      It would be good to see a restructure that communicates the main points clearly and concisely, perhaps leaving other observations to an optional appendix. For the interested but time-pressed reader, I recommend starting with the last paragraph of the introduction, working through the main derivation on page 7, and writing out the full expression with key parameters exposed. Next, look at Table 1 and Figure 2J to see where different circuits and mechanisms fit in this scheme. Beyond this, the sequence derivation on page 15 and biophysical simulations in Figures 5 and 6 are also highlights.

      We appreciate the reviewers' suggestions. We have tightened the flow of the introduction. We understand that the abbreviations and definitions are challenging and have therefore provided intuitions and summaries of the equations discussed in the main text.

      Clusters calculations

      Our approach is to ask how likely it is that a given set of inputs lands on a short segment of dendrite, and then scale it up to all segments on the entire dendritic length of the cell.

      Thus, the probability of occurrence of groups that receive connections from each of the M ensembles (PcFMG) is a function of the connection probability (p) between the two layers, the number of neurons in an ensemble (N), the relative zone-length with respect to the total dendritic arbor (Z/L) and the number of ensembles (M).

      Sequence calculations

      Here we estimate the likelihood of the first ensemble input arriving anywhere on the dendrite, and ask how likely it is that succeeding inputs of the sequence would arrive within a set spacing.

      Thus, the probability of occurrence of sequences that receive sequential connections (PcPOSS) from each of the M ensembles is a function of the connection probability (p) between the two layers, the number of neurons in an ensemble (N), the relative window size with respect to the total dendritic arbor (Δ/L) and the number of ensembles (M).

      (2) I wonder if the authors are being overly conservative at times. The result highlighted in the abstract is that 10/100000 postsynaptic neurons are expected to exhibit synaptic clustering. This seems like a very small number, especially if circuits are to rely on such a mechanism. However, this figure assumes the convergence of 3-5 distinct ensembles. Convergence of inputs from just 2 ense mbles would be much more prevalent, but still advantageous computationally. There has been excitement in the field about experiments showing the clustering of synapses encoding even a single feature.

      We agree that short clusters of two inputs would be far more likely. We focused our analysis on clusters with three of more ensembles because of the following reasons:

      (1) The signal to noise in these clusters was very poor as the likelihood of noise clusters is high.

      (2) It is difficult to trigger nonlinearities with very few synaptic inputs.

      (3) At the ensemble sizes we considered (100 for clusters, 1000 for sequences), clusters arising from just two ensembles would result in high probability of occurrence on all neurons in a network (~50% in cortex, see p_CMFG in figures below.). These dense neural representations make it difficult for downstream networks to decode (Foldiak 2003).

      However, in the presence of ensembles containing fewer neurons or when the connection probability between the layers is low, short clusters can result in sparse representations (Figure 2 - Supplement 2). Arguments 1 and 2 hold for short sequences as well.

      (3) The analysis supporting the claim that strong nonlinearities are needed for cluster/sequence detection is unconvincing. In the analysis, different synapse distributions on a single long dendrite are convolved with a sigmoid function and then the sum is taken to reflect the somatic response. In reality, dendritic nonlinearities influence the soma in a complex and dynamic manner. It may be that the abstract approach the authors use captures some of this, but it needs to be validated with simulations to be trusted (in line with previous work, e.g. Poirazi, Brannon & Mel, (2003)).

      We agree that multiple factors might affect the influence of nonlinearities on the soma. The key goal of our study was to understand the role played by random connectivity in giving rise to clustered computation. Since simulating a wide range of connectivity and activity patterns in a detailed biophysical model was computationally expensive, we analyzed the exemplar detailed models for nonlinearity separately (Figures 5, 6, and new figure 8), and then used our abstract models as a proxy for understanding population dynamics. A complete analysis of the role played by morphology, channel kinetics and the effect of branching requires an in-depth study of its own, and some of these questions have already been tackled by (Poirazi, Brannon, and Mel 2003; Branco, Clark, and Häusser 2010; Bhalla 2017). However, in the revision, we have implemented a single model which incorporates the range of ion-channel, synaptic and biochemical signaling nonlinearities which we discuss in the paper (Figure 8, and Figure 8 Supplement 1, 2,3). We use this to demonstrate all three forms of sequence and grouped computation we use in the study, where the only difference is in the stimulus pattern and the separation of time-scales inherent in the stimuli.

      (4) It is unclear whether some of the conclusions would hold in the presence of learning. In the signal-to-noise analysis, all synaptic strengths are assumed equal. But if synapses involved in salient clusters or sequences were potentiated, presumably detection would become easier? Similarly, if presynaptic tuning and/or timing were reorganized through learning, the conditions for synaptic arrangements to be useful could be relaxed. Answering these questions is beyond the scope of the study, but there is a caveat there nonetheless.

      We agree with the reviewer. If synapses receiving connectivity from ensembles had stronger weights, this would make detection easier. Dendritic spikes arising from clustered inputs have been implicated in local cooperative plasticity (Golding, Staff, and Spruston 2002; Losonczy, Makara, and Magee 2008). Further, plasticity related proteins synthesized at a synapse undergoing L-LTP can diffuse to neighboring weakly co-active synapses, and thereby mediate cooperative plasticity (Harvey et al. 2008; Govindarajan, Kelleher, and Tonegawa 2006; Govindarajan et al. 2011). Thus if clusters of synapses were likely to be co-active, they could further engage these local plasticity mechanisms which could potentiate them while not potentiating synapses that are activated by background activity. This would depend on the activity correlation between synapses receiving ensemble inputs within a cluster vs those activated by background activity. We have mentioned some of these ideas in a published opinion paper (Pulikkottil, Somashekar, and Bhalla 2021). In the current study, we wanted to understand whether even in the absence of specialized connection rules, interesting computations could still emerge. Thus, we focused on asking whether clustered or sequential convergence could arise even in a purely randomly connected network, with the most basic set of assumptions. We agree that an analysis of how selectivity evolves with learning would be an interesting topic for further work.

      References

      Bhalla, Upinder S. 2017. “Synaptic Input Sequence Discrimination on Behavioral Timescales Mediated by Reaction-Diffusion Chemistry in Dendrites.” Edited by Frances K Skinner. eLife 6 (April):e25827. https://doi.org/10.7554/eLife.25827.

      Branco, Tiago, Beverley A. Clark, and Michael Häusser. 2010. “Dendritic Discrimination of Temporal Input Sequences in Cortical Neurons.” Science (New York, N.Y.) 329 (5999): 1671–75. https://doi.org/10.1126/science.1189664.

      Foldiak, Peter. 2003. “Sparse Coding in the Primate Cortex.” The Handbook of Brain Theory and Neural Networks. https://research-repository.st-andrews.ac.uk/bitstream/handle/10023/2994/FoldiakSparse HBTNN2e02.pdf?sequence=1.

      Golding, Nace L., Nathan P. Staff, and Nelson Spruston. 2002. “Dendritic Spikes as a Mechanism for Cooperative Long-Term Potentiation.” Nature 418 (6895): 326–31. https://doi.org/10.1038/nature00854.

      Govindarajan, Arvind, Inbal Israely, Shu-Ying Huang, and Susumu Tonegawa. 2011. “The Dendritic Branch Is the Preferred Integrative Unit for Protein Synthesis-Dependent LTP.” Neuron 69 (1): 132–46. https://doi.org/10.1016/j.neuron.2010.12.008.

      Govindarajan, Arvind, Raymond J. Kelleher, and Susumu Tonegawa. 2006. “A Clustered Plasticity Model of Long-Term Memory Engrams.” Nature Reviews Neuroscience 7 (7): 575–83. https://doi.org/10.1038/nrn1937.

      Harvey, Christopher D., Ryohei Yasuda, Haining Zhong, and Karel Svoboda. 2008. “The Spread of Ras Activity Triggered by Activation of a Single Dendritic Spine.” Science (New York, N.Y.) 321 (5885): 136–40. https://doi.org/10.1126/science.1159675.

      Losonczy, Attila, Judit K. Makara, and Jeffrey C. Magee. 2008. “Compartmentalized Dendritic Plasticity and Input Feature Storage in Neurons.” Nature 452 (7186): 436–41. https://doi.org/10.1038/nature06725.

      Poirazi, Panayiota, Terrence Brannon, and Bartlett W. Mel. 2003. “Pyramidal Neuron as Two-Layer Neural Network.” Neuron 37 (6): 989–99. https://doi.org/10.1016/S0896-6273(03)00149-1.

      Pulikkottil, Vinu Varghese, Bhanu Priya Somashekar, and Upinder S. Bhalla. 2021. “Computation, Wiring, and Plasticity in Synaptic Clusters.” Current Opinion in Neurobiology, Computational Neuroscience, 70 (October):101–12. https://doi.org/10.1016/j.conb.2021.08.001.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      (1) The technology requires a halo-tagged derivation of the active compound, and the linked position will have a huge impact on the potential "target hits" of the molecules. Given the fact that most of the active molecules lack of structure-activity relationship information, it is very challenging to identify the optimal position of the halo tag linkage.

      We appreciate your insightful comment. While finding the optimal position to attach a chemical linker to a small molecule of interest is indeed a challenging but necessary step, this is a common difficulty across all target-ID methods, except for those that are modification-free, as we described in Discussion. However, modification-free approaches such as DARTS, CETSA, and TPP have their own limitations, such as low sensitivity and a high false-positive rate. Additionally, DARTS and SPROX are limited to use with cell lysates. Please refer to the introduction in our manuscript for more details on these approaches. On the other hand, synthesizing HTL derivatives is relatively straightforward compared to other modifications, and we provide helpful guidelines for chemical linker design, provided the optimal chemical moiety has been identified, which is crucial for target identification. We selected dasatinib and HCQ/CQ as model compounds because previous studies offered insights into their derivative synthesis. Our data also show that DH5 retains strong kinase inhibitory activity (Figure 4—figure supplement 2), and DC661-H1 demonstrates potent inhibition of autophagy (Figure 6—figure supplement 1). For novel compounds, conducting a thorough structure-activity relationship (SAR) study is essential to determine the optimal position for HTL derivative synthesis.

      (2) Although POST-IT works in zebrafish embryos, there is still a long way to go for the broad application of the technology in other animal models.

      Thank you for your constructive comment. Yes, there is still a long way to go in developing the POST-IT system for broader applications in other animal models, especially in mice. However, we hope that our study provides valuable insights and inspiration to scientists and experts for applying the POST-IT system in various models. We are also committed to further improving its applicability.

      (3) The authors identified SEPHS2 as a new potential target of dasatinib and further validated the direct binding of dasatinib with this protein. However, considering the super strong activity of dasatinib against c-Src (sub nanomolar IC50 value), it is hard to conclude the contribution of SEPHS2 binding (micromolar potency) to its antitumor activity.

      Thank you for your insightful comment. We agree that the anticancer activity of dasatinib primarily results from inhibiting tyrosine kinases such as SRC and ABL. However, SEPHS2 contains an “opal" termination codon, UGA, at the 60th amino acid residue, which codes for selenocysteine. Due to the technical challenge of expressing selenoproteins in E. coli, we mutated it to cysteine for expression in E. coli to avoid premature translation termination, as described in the Materials and Methods section. Although the purified recombinant SEPHS2 shows a Kd of about 10 µM for dasatinib, the binding affinity to endogenous SEPHS2 may be higher since selenocysteine is larger and more electronegative than cysteine. This presents an interesting area for future investigation. Furthermore, our study of dasatinib’s binding to SEPHS2 could help facilitate the development of new SEPHS2 inhibitors, potentially targeting the active site of SEPHS2.

      Reviewer #3 (Public review):

      (1) Target Specificity: It is crucial for the authors to differentiate between the primary targets of the POST-IT system and those identified as side effects. This distinction is essential for assessing the specificity and utility of the technology.

      Thank you for your insightful comment. Drugs inevitably bind to various proteins with differing affinities, which can contribute to both side effects and beneficial outcomes. Typically, the primary targets exhibit high affinities. In this manuscript, we ranked the identified protein targets of DH5 based on affinity from mass spectrometry and p-values (Fig. 5A), and for DC661-H1, we used the SILAC ratio (Fig. 6A). We also individually assessed many drug-protein binding affinities using the MST assay, as well as in vitro and in cellulo assays, demonstrating their specificity. Moreover, we believe it is essential to identify as many protein targets as possible at physiological drug concentrations to better understand the drug’s side effects. Of course, further investigation is required to assess the roles and effects of these target proteins.

      (2) In Vivo Target Identification: The manuscript lacks detailed clarity on which specific targets were successfully identified in the in vivo experiments. Expanding on this information would provide a clearer view of the system's effectiveness and scope in complex biological settings.

      Thank you for your insightful comment regarding in vivo target identification. In this manuscript, we utilized a cell line as the primary method for in vivo target identification and validation after optimizing our system in test tubes. We successfully validated many of the targets identified using our POST-IT system (Figure 6—figure supplement 3). To demonstrate the proof of principle for in vivo application, we employed zebrafish embryos as an in vivo model, showing that endogenous SRC can be effectively pulled down by DH5 treatment (Fig. 7). While we could have explored the entire proteome to identify endogenous target proteins in zebrafish that bind to DH5 or dasatinib, we felt this would extend beyond our original scope, given that we have already demonstrated POST-IT’s ability to identify target proteins for dasatinib. Specific target identification and validation are crucial when using zebrafish for drug discovery. Additionally, we acknowledge that drugs likely interact with a range of protein targets in living organisms and may undergo metabolism and interactions within the circulatory system, which we address in our discussion.

      (3) Reproducibility and Scalability: Discussion on the reproducibility of the POST-IT system across various experimental setups and biological models, as well as its scalability for larger-scale drug discovery programs, would be beneficial.

      Thank you for the suggestion. While our system has shown  high reproducibility in our experiments, further improving both reproducibility and scalability would be advantageous. One potential approach to address this is through the generation of stable-expressing cell lines and transgenic zebrafish lines, which we have discussed in the revised manuscript. Establishing stable cell lines with robust POST-IT expression could enhance scalability for drug discovery applications.

      (4) Quantitative Analysis: A more detailed quantitative analysis of the protein interactions identified by POST-IT, including statistical significance and comparative data against other technologies, would enhance the manuscript.

      Thank you for your suggestion. In our assessment of drug-protein affinity, we included Kd values as quantitative measures using MST assays. The protein targets of dasatinib identified through mass spectrometry are also accompanied by p-values for quantitative analysis (Fig. 5A), and the detailed procedures are described in the Material and methods section. While it is challenging to provide direct comparative data against other technologies, our system successfully identified many known target proteins for dasatinib, as well as SEPHS2 and VPS37C as new targets for dasatinib and for HCQ/CQ, respectively, which were not detected by other methods.

      (5) Technological Limitations: The authors should discuss any limitations or potential pitfalls of the POST-IT system, which would be crucial for future users and for guiding subsequent improvements.

      Thank you for your insightful suggestion We agree that clearly defining the technological limitations is important. Therefore, we have expanded our original discussion on the limitations of our POST-IT system (Discussion section, paragraph 6).

      (6) Long-Term Stability and Activity: Information on the long-term stability and activity of the POST-IT components in different biological environments would ensure the reliability of the system in prolonged experiments.

      Yes, this is an important question. We did not notice any stability or toxicity issues with Halo-PafA and Pup substrates in HEK293T cells or zebrafish, which is an important factor for stable cell lines and transgenic zebrafish lines. However, HTL derivatives of the drug could be toxic or unstable due to the nature of the drug or its metabolism, which needs to be taken into account when designing experiments, and we have included this in the Discussion.

      (7) Comparison with Existing Technologies: A detailed comparison with existing proximity tagging and target identification technologies would help position POST-IT within the current landscape, highlighting its unique advantages and potential drawbacks.

      We appreciate your valuable feedback and agree that such comparisons are crucial. We have included a detailed overview and comparison of existing proximity-tagging systems and their related target identification technologies in the Introduction (lines 78-100) and Discussion (lines 391-412), highlighting their respective pros and cons. Additionally, we have expanded the discussion to further compare these technologies with our POST-IT system, addressing its advantages and limitations (lines 378-390, lines 448-467). We hope this provides sufficient context and information to effectively position POST-IT among the landscape of proximity-tagging target identification technologies.

      (8) Concerns Regarding Overexposed Bands: Several figures in the manuscript, specifically Figure 3A, 3B, 3C, 3F, 3G, Figure 4D, and the second panels in Figure 7C as well as some figures in the supplementary file, exhibit overexposed bands.

      We appreciate your astute observation regarding the overexposed bands and apologize for any confusion. The “overexposed” bands represent the unpupylated proteins, while the bands above them correspond to the pupylated proteins. We intended to clearly show both pupylated and unpupylated bands, although the latter are generally much weaker. We are currently working on further improving our POST-IT system to enhance pupylation efficiency.

      (9) Innovation Concern: There is a previous paper describing a similar approach: Liu Q, Zheng J, Sun W, Huo Y, Zhang L, Hao P, Wang H, Zhuang M. A proximity-tagging system to identify membrane protein-protein interactions. Nat Methods. 2018 Sep;15(9):715-722. doi: 10.1038/s41592-018-0100-5. Epub 2018 Aug 13. PMID: 30104635. It is crucial to explicitly address the novel aspects of POST-IT in contrast to this earlier work.

      Thank you for bringing this to our attention. Proximity-tagging systems like BioID, TurboID, NEDDylator, and PafA (Lui Q et al., Nat Methods 2018) were initially developed to study protein-protein interactions or identify protein interactomes, as these applications are of broader interest and generally easier to implement. However, applying proximity-tagging systems for small molecule target identification requires significant optimization. As described in the introduction (lines 78-100), target protein identification systems have since been developed using TurboID and NEDDylator (Tao AJ et al., Nat Commun 2023; Hill ZB et al., J Am Chem Soc 2016). It is conceivable that a PafA-based proximity-tagging system could also be adapted for target-ID, and other groups may pursue this approach in the future. Although the PafA-Pup system shows great promise for target-ID applications, extensive optimization was needed to enable its use for this purpose. Finally, we demonstrate that POST-IT offers distinct advantages over other proximity-tagging-based target-ID systems. For more details, please refer to the introduction and discussion sections.

      Recommendations for the authors:

      Reviewer #2 (Recommendations for the authors):

      (1) Figure 1- Figure Supplement 1A: The Pup substrate "HB-Pup" is mentioned, but the main text or figure legend provides no introduction or description.

      We appreciate your astute observation. We have added a description in the main text and figure legend as follows: “…and used HB-Pup as a control, which contains 6´His and BCCP at the N terminus of Pup” in the main text (line 142) and “HB, TS, and SBP refer to 6´His and BCCP, twin-STII (Strep-tag II), and streptavidin binding peptide, respectively.” in the Figure 1-figure supplement 1A.

      (2) Figure 1 - Figure Supplement 3B: The authors used TS-sPupK61R as a substrate but did not explain why. The main text mentions that mutating sPup alone did not affect polypupylation, raising the question of why TS-sPupK61R was used in this figure. Furthermore, while the authors state that polypupylation becomes evident after 1 hour of incubation (more pronounced after 2 or 3 hours), the reactions here were conducted for only 30 minutes.

      Thank you for your question. Figure 1 - Figure Supplement 3B was conducted to test self-pupylation levels in the different Halo-PafA derivatives. For this purpose, we could use any Pup substrate such as SBP-sPup and SBPK4R-sPupK61R, instead of Ts-sPup and TS-sPupK61R, as they do not show any differences in pupylation activity. We chose Ts-sPup and TS-sPupK61R simply because any Pup substrates could be used for this purpose. Similarly, we did not need to incubate the reaction for a longer time to detect polypupylation, as our intention was to test “self-pupylation”. We demonstrated in Figure 1 – figure supplement 2 that polypupylation is dependent on the number or position of lysine residues in Pup substrate or tags. The results clearly showed that self-pupylation was almost completely abolished by the Halo8KR mutation. To clarify this, we added the following description in lines 168-169: “Ts-sPup and TS-sPupK61R were chosen as sPup substrates for this experiment, although any Pup substrates could have been used. The levels of self-pupylation were assessed.”

      (3) Line 156: The statement that "the TS-tag completely abolished polypupylation in TS-sPup" is inaccurate. Using TSK8R-sPupK61R as the substrate, several bands appear, which likely represent Halo-PafA with varying degrees of polypupylation. Some bands also appear to correspond to those seen when using TS-sPup as a substrate. The authors should clarify how they distinguish between multipupylation and polypupylation in this case.

      We sincerely appreciate your insight into clarifying the distinction between multipupylation and polypupylation. Polypupylation refers to the addition of a new Pup onto a previously linked Pup on the target protein, akin to polyubiquitination. In contrast, multipupylation involves multiple single pupylations at different positions on the target proteins. Since pupylation occurs exclusively at lysine residues in tag-Pup substrates, mutating all lysine residues to arginine, as in TSK48R-sPupK61R, prevents the mutant tag-Pup from linking to another Pup. This means that only single pupylation can proceed with this type of mutant Pup substrate. If multiple pupylated bands are observed with this mutant substrate, it indicates “multipupylation” rather than “polypupylation”, as shown in Figure 1-figure supplement 2D. The same applies to the pupylation bands in Figure 1-figure supplement 2E and F, as sSBP-sPupK61R and SBPK4R-sPupK61R lack lysine residues. By comparing these multipupylation bands, it is also possible to distinguish them from polypupylation bands, which are marked by yellow arrows. However, after 2-3 pupylation bands, higher-order bands become increasingly difficult to distinguish.

      To clarify the mutation in the TS-tag, we revised the sentence in line 156 from “However, further mutations within the TS-tag completely abolished polypupylation in TS-sPup” to “However, further mutations of two lysine residues within the TS-tag, creating TSK8R-sPupK61R, completely abolished polypupylation in TS-sPup”. Additionally, we have inserted sentences in line 152 to define polypupylation and multipupylation, as described here.

      (4) Line 160: Similar to the above concern about line 156, the claim that SBPK4R and sSBP completely prevented polypupylation is unconvincing and requires more supporting evidence.

      Thank you for raising this concern. As mentioned above, both SBPK4R and sSBP lack lysine residues required for pupylation. As a result, these mutants can only undergo multiple single pupylations on the lysine residues of the target protein, which leads to “multipupylation”. In Figure 1-figure supplement 2E and F, pupylation bands by sSBP-sPupK61R or SBPK4R-sPupK61R do not display doublet bands (one from multipupylation and the other from polypupylation), as seen with SBP-sPup, marked by yellow arrows. Notably, Halo-PafA containing polypupylated branches migrates more slowly than one with an equal number of multipupylation events. To clarify this point, we have added the phrase “as shown in sSBP-sPupK61R and SBP4KR-sPupK61R” at the end of the sentence in line 160.

      (5) Lines 176-177: The authors claim that PafAS126A exhibited reduced polypupylation compared to PafA, but given that PafAS126A may reduce depupylase activity, how could it reduce polypupylation levels? Moreover, it is hard to find any data supporting this conclusion in Figure 1 - Figure Supplement 3B.

      We appreciate your insightful comment. At this point, we do not fully understand how the mutation that reduces depupylase activity also decreases polypupylation. It is possible that PafAS126A has a lower preference for pupylated Pup as a prey, which is required for polypupylation, since depupylase activity depends on recognizing pupylated Pup as a prey to remove it. Nonetheless, Halo-PafAS126A shows reduced levels of higher molecular weight bands compared to Halo-PafA, as shown in Figure 1-figure supplement 3B, while exhibiting increased pupylation in lower molecular weight bands, which represent either multipupylation or low-degree polypupylation. Since higher molecular weight bands (> 150 kD) are likely due to polypupylation, this result suggests reduced polypupylation and increased multipupylation in Halo-PafAS126A. To clarify this in the main text, we have added the following description in line 177: “as evidenced by the decreased levels of high molecular weight bands and an increase in low molecular weight bands”

      (6) POST-IT system in cellulo validation: The system was developed using the Halo-tag, yet the in-cell validation uses FRB and FKBP instead, without explaining this switch. This inconsistency makes the logic of the experiment unclear.

      We appreciate your insightful comment. The interaction between rapamycin and FRB or FKBP is known to be highly specific and robust, making this system useful in various biological contexts. Due to this property, rapamycin can induce interaction between two proteins when one is fused with FRB and the other with FKBP. Before testing or optimizing the POST-IT system in cells, we hypothesized that using the rapamycin-induced interaction between FRB and FKBP could introduce pupylation of the target protein, provided that PafA is fused with FRB or FKBP and the target protein is fused with the other. The results demonstrate that PafA can introduce pupylation of the target protein in a proximity-dependent manner via this chemically induced interaction. To further clarify this in the main text, we modified the original sentence in lines 214-216 as follows: “To mimic drug-target interaction-induced pupylation in live cells and assess the potential of PafA as a proximity-tagging system for target-ID, we incorporated the rapamycin-induced interaction between FRB and FKBP into our PL system, as this interaction between a small molecule and a protein is known to be highly specific and robust (Figure 3—figure supplement 1A).”

      (7) Line 209: The authors decided to use the SBP-tag for further studies due to better performance, but in Figure 3 - Figure supplement 1, they still used the unintroduced HB-Pup as the substrate, which is confusing and lacks explanation.

      Thank you for raising your question. The SBP-tag is not superior to the TS-tag in terms of pupylation activity. However, the TSK8R mutant cannot bind to Strep-Tactin beads, while the SBP mutants, SBPK4R and sSBP, can bind to streptavidin. Therefore, we chose the SBP-tag instead of the TS-tag for further studies as a Pup substrate in POST-IT system, as we needed to pull down the target proteins. HB-Pup is consistently used as a control throughout various experiments, as it is the original Pup substrate. In Figure 3-figure supplement 1B and C, HB-Pup was used to test chemically induced pupylation by PafA. In these cases, it was not so critical which Pup substrate was chosen. Furthermore, we compared HB-Pup and different SBP-sPup substrates in Figure 3-figure supplement 1D, where HB-Pup was used as a control or for comparison. Although pupylation bands with HB-Pup appear more robust, this substrate contains multiple lysine residues, leading to high levels of polypupylation. To make it clear, we modified the sentence in line 209 to “Therefore, we decided to use the SBP-tag as a Pup substrate in the POST-IT system for further studies.”.

      (8) Line 220: Both SBP-sPup and SBPK4R-sPupK61R are described as exhibiting efficient pupylation, but the data show mostly self-pupylation and little to no pupylation of the target protein.

      Thank you for your concern. However, pupylation of the target protein is actually quite substantial, as the intensities of the free form and pupylated proteins are relatively similar, as shown in the upper panel of Figure 3-figure supplement 1D. Self-pupylation is always much higher than target pupylation, because PafA constantly pupylates itself, whereas pupylation of the target protein occurs only through interaction. Furthermore, V5-FRB-mKate2-PafA contains many lysine residues, which increases the levels of self-pupylation.

      (9) Lines 222-224: The authors chose SBPK4R-sPupK61R to avoid polypupylation, although SBP-sPup did not cause detectable polypupylation. Neither substrate caused pupylation of the target protein, so the rationale behind this choice is unclear.

      Thank you for raising your question. Similar to the above comment (#8), please refer to the pupylation bands of the target protein, as shown in the upper panel of Figure 3-figure supplement 1D. The pupylation band of the target protein is quite remarkable, as the intensities of the free form and pupylated proteins are comparable. Additionally, there are no multiple pupylation bands in either case, except for one additional weak multipupylation band, indicating no polypupylation by SBP-sPup, which does not have K-to-R mutations. Of course, SBPK4R-sPupK61R can only undergo single pupylation, as it does not contain lysine residues. Although we did not observe polypupylation by SBP-sPup in this experimental condition, it is possible that SBP-sPup may cause polypupylation under different experimental conditions or with other target proteins. Since SBPK4R-sPupK61R exhibits comparable pupylation of the target protein at least in this experiment setting as SBP-sPup, we selected SBPK4R-sPupK61R as the Pup substrate for POST-IT system to avoid any potential polypupylation that could be caused by SBP-sPup in other cases. We believe that polypupylation can introduce bias into the analysis and hinder the comprehensive discovery of additional target proteins for small molecules.

      (10) Line 224: The authors conclude that rapamycin greatly reduced self-pupylation, but the supporting data are unclear.

      Thank you for your constructive comments on our manuscript. Please refer to the lower panel of Figure 3-figure supplement 1D. When using either SBPK4R-sPupK61R or SBP-sPup, rapamycin treatment results in reduced levels of self-pupylation compared to the no-treatment control. However, we did not observe this reduction with HB-Pup and do not know the reason. To clarify this in the main text, we added the following description to the end of the sentence: “when using either SBPK4R-sPupK61R or SBP-sPup, as shown in the lower panel of Figure 3—figure supplement 1D”

      (11) Line 234: The authors selected an 18-amino acid linker, but given that linkers longer than 10 amino acids enhance labeling, this choice should be explained.

      Thank you for raising your question. In fact, a linker of 10 amino acids (aa) or longer is likely to behave similarly. We chose an 18 aa linker instead of a 40 aa linker primarily for the convenience of cloning and to reduce the potential for DNA sequence recombination associated with longer repeats. Additionally, a longer, flexible linker may behave like an intrinsically disordered protein (Harmon et al., 2017), which can lead to unwanted protein-protein interactions or phase separation. To elaborate on this, we added the following sentences after the sentence in line 233-235: “We chose the 18-amino acid linker instead of the 40-amino acid linker for easier cloning and to lower the risk of DNA recombination from longer repeats. Additionally, a longer, flexible linker may behave like an intrinsically disordered protein (Harmon et al., 2017), an unwanted feature for target-ID.”

      (12) S126A and K172R mutations: The authors claim that these mutations additively enhanced pupylation under cellular conditions, but in Figure 3B, the band intensities appear similar for the wild-type and mutant versions.

      Thank you for raising your concern. Although a single pupylation band appears similar among the three different Halo-PafA proteins, multipupylation bands are slightly but noticeably increased by the S126A and K172R mutations compared to Halo8KR-PafA. Since we used SBPK4R-sPupK61R as a Pup substrate, all higher molecular weight bands result from multipupylation rather than polypupylation. This illustrates why it is preferable to use SBPK4R-sPupK61R over SBP-sPup, as the pupylation bands with SBP-sPup are mixtures of poly- and multipupylation, making it difficult to assess levels of target labeling. To clarify this in the main text, we added the following description after the sentence in line 236: “as the higher molecular weight multipupylation bands are slightly but noticeably increased with these mutations compared to Halo8KR-PafA”

      (13) Line 263: The authors selected DH5 for further experiments due to its efficiency, but the data suggest that the performance of DH1 to DH5 is similar.

      We appreciate your question about the different dasatinib HTL derivatives. However, our data clearly show that DH2-5 derivatives bind significantly more effectively to Halo-PafA in vitro and in live cells compared to DH1 (Figure 4A and B). Additionally, the DH2-5 derivatives result in dramatically increased pupylation of the target protein in vitro and noticeable enhancement in live cells (Figure 4C and D). Among DH2 to DH5, there is no obvious difference in binding to Halo-PafA or pupylation of the target protein. Therefore, we chose DH5, as we believe that the longer linker in DH5 may facilitate the binding of a more diverse range of target proteins to dasatinib, enabling the discovery of additional target proteins.

      (14) Line 309: The authors introduce HCQ and CQ as important drugs but then investigate the mechanism using DC661 without introducing or justifying the choice of this compound.

      Thank you for your point. We explained the reason to choose DC661, a dimer form of CQ, instead of CQ for the synthesis of an HTL derivative in line 310. “assuming that a dimer would enhance binding affinity as previously described.” As the dimer forms of a drug or a small molecule such as testosterone dimers, estrogen dimers, and numerous anticancer drug dimers have been often developed to enhance drug effects (Paquin A et., Molecules 2021). Similarly, dimer forms of HCQ/CQ have been introduced and shown to be more potent (Hrycyna CA et al., ACS Chem Biol 2014; Rebecca VW et al., Cancer Discovery 2019). We expected that using a dimer form might offer higher probability to identify target proteins for HCQ/CQ.

      (15) The authors suggest that multipupylation levels were enhanced but do not explain whether this might benefit the system or introduce other issues. Clarifying this point would provide valuable insight for potential users of this system.

      Thank you for your thoughtful suggestion. Polypupylation likely leads to biased enrichment of a limited set of target proteins, and its levels may not correlate with the binding affinity of target proteins to the small molecule of interest, features that can negatively impact target-ID. In contrast, multipupylation may be correlated with binding affinity or interaction frequency, as we observed increased levels of multipupylation with higher Pup concentrations and longer incubation times. This suggests that target proteins with multiple lysines in proximity to PafA can be sequentially pupylated, starting with the most accessible lysine. However, if a target protein has only one accessible lysine, pupylation will occur only once, regardless of the protein’s affinity to the small molecule. In summary, while polypupylation may be a drawback for target-ID, multipupylation could be useful for both target-ID and understanding binding mode. To elaborate on this, we added the following additional explanation after the sentence in line 152: “, whereas multipupylation is more likely correlated with binding affinity or interaction frequency.”

      (16) The author should address whether the Halotag ligand modification of the drug alters the binding properties between the drug and targets. That may be causing artifact binding of the drug and other proteins.

      Thank you for your insightful comment. Yes, it is true that chemical modifications of the small molecule of interest, such as linker derivatization (e.g., HTL) or photo-affinity labeling, generally lead to reduced activity or affinity compared to the original molecule. Synthesizing a derivative is a common challenge across all target-ID methods, except for modification-free approaches, as we mentioned in the Discussion. However, modification-free methods like DARTS, CETSA, and TPP have their own limitations, including low sensitivity or high false positive rates. Identifying the optimal position for chemical modification on the small molecule of interest is critical. We chose dasatinib and HCQ/CQ as model compounds, because previous studies provided insights into their derivative synthesis. In addition, our data show that DH5 retains robust kinase inhibitory activity (Figure 4-figure supplement 2), and DC661-H1 exhibits potent autophagy inhibition (Figure 6-figure supplement 1). For novel compounds, a thorough structure-activity relationship study is essential to identify the optimal position for HTL derivative synthesis.

      (17) The author stated there is no observable toxicity in zebrafish without providing a detailed analysis or enough data. Further analysis of the expression of Halo-PafA and its substrate sPup influence on toxicity or side effects to the living cells or animals would be needed. It is important for in vivo applications.

      Thank you for your constructive suggestion. We have now included additional experimental data in Figure 7-figure supplement 1, showing no toxicity in zebrafish embryos expressing the POST-IT system. We assessed toxicity in two ways: by injecting the POST-IT DNA plasmid into one-cell-stage embryos for acute expression, and by using embryos from transgenic zebrafish expressing POST-IT under a heat-shock inducible promoter. Neither the injection nor the heat-shock activation of POST-IT expression resulted in any noticeable toxicity.

    1. Reviewer #3 (Public review):

      Gatt et al. present a novel take on single-cell RNA-sequencing from complex planktonic samples, introducing an approach they aptly named Ukiyo-e-Seq. This work combines environmental sampling with cell picking, microscopic imaging, and Smart-seq2 single-cell RNA sequencing to profile uncultured eukaryotic plankton. Developing single-cell approaches for such ecosystems is critical, given the poor representation of many planktonic species in cultures and reference databases. This work could help bridge existing technological gaps between morphological and molecular studies of aquatic microeukaryotes

      The authors argue that microscopy does not provide information on the biochemistry of species under consideration. At best, it provides taxonomic labeling of species within a sample, yet imaging fails to assess their metabolic state or to disentangle cryptic species. In a standard metatranscriptomic setup, the sequence pool is described by aligning assembled contigs with reference databases to obtain functional and taxonomic information. This complex community-level data is impossible to parse at the single-organism level. Moreover, by relying on reference datasets, a lot of potential information can be missed. The aim of the approach is to combine the strengths of both methods, generating single-cell transcriptomic data linked to individual plankton images.

      Strengths:

      Ukiyo-e-Seq generated a valuable dataset by combining imaging and transcriptomics for individual planktonic organisms from environmental samples. This multimodal approach has the potential to improve taxonomic predictions and functional insights at the single-organism level. This manuscript demonstrates the technical feasibility of such an approach. Data of this type is rare and thus represents a valuable resource to further advance single-cell sequencing of planktonic species from environmental samples.

      Weaknesses:

      (1) The merge-split strategy, where single-cell reads are pooled prior to assembly, is counterintuitive. Pooling obscures the single-organism resolution that single-cell methods aim to achieve. The approach might be useful for assembling low-coverage contigs, but risks masking unique expression profiles for transcripts unique to a given well. As an alternative, the authors could assemble each well independently to obtain well-specific transcriptomic bins. Assemblies could then be clustered based on sequence similarity, thereby imposing strict clustering parameters to maintain resolution, to create a common reference for downstream analysis if needed. In my opinion, better results would be obtained by implementing a per-well assembly and read mapping.

      (2) The focus on the top five most expressed contigs throughout the manuscripts' data analysis is a limiting choice, as it excludes most contigs. In the preprint, we are presented with a very narrow view of the data. Visualising the entire range of assembled contigs would provide a better picture of the transcriptomic composition and diversity per well. It would be interesting to assess if the full information could be used to preliminary bin transcriptomic sequences from individual wells, for example, by gathering all 'private' contigs with high read coverage in a single well. Does such a set represent a single complete eukaryotic transcriptome?

      (3) I missed a verification with (broad-scale) taxonomic assessments based on the associated microscopic images. In their goals, the authors state that a joint approach has the potential to discover new taxonomic biodiversity. I agree, and to me, this is what is exciting about the preprint, yet I miss an example or the right bioinformatic implementation to drive home this claim. Are there organisms in wells where poor taxonomic annotations, based on alignment to a reference database or the LCA approach implemented in Kraken2, would usually result in ignoring the species in classic metatranscriptomics? Can you advance the taxonomic annotation by referring back to the organisms' picture? Can manual assessment of taxonomy advance the results from the LCA approach?

      (4) The current use of AlphaFold to predict protein structures does not convincingly add to the study's core objectives.

      Overall, Ukiyo-e-Seq presents a promising method for studying single-cell diversity in environmental samples, though the bioinformatic pipeline requires refinement to support some of the claims made by the authors. Additionally, the manuscript would benefit from clarity and additional details in its methods and a more consistent approach to presenting results and summary statistics across all assembled contigs and all sampled wells, rather than focusing on selected wells.

    2. Author response:

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      The authors aim to elucidate the diversity and gene expression patterns of marine plankton using innovative collection and sequencing methodologies. Their work investigates the taxonomic and functional profiles of planktonic communities, providing insights into their ecological roles and responses to environmental changes.

      Strengths:

      The methodology utilized in this study, particularly the combination of single-cell sequencing and advanced bioinformatics techniques, represents a significant advancement in the field of plankton research. The application of the Smart-seq2 protocol for cDNA synthesis, followed by rigorous quality control measures, ensures high-quality data generation. This comprehensive approach not only enhances the resolution of the obtained genetic information but also allows for a more detailed exploration of the diversity and functional potential of the phytoplankton community.

      One of the major strengths of this study is the rigorous methodological approach, including precise sampling techniques and robust data analysis protocols, which enhance the reliability of the results. The use of advanced sequencing technologies allows for a comprehensive assessment of gene expression, significantly contributing to our understanding of plankton diversity and its implications for marine ecosystems.

      Weaknesses:

      While the evidence presented is solid, there are areas where the analysis could be expanded. The authors could further explore the ecological interactions within plankton communities, which would provide a more holistic view of their functional roles. Additionally, a broader discussion of the implications of their findings for marine conservation efforts could enhance the manuscript's impact.

      The choice of both the plankton net and filter pore size during the plankton collection process is critical, as these factors directly impact the types of phytoplankton collected. The use of a 25 μm filter paper, in particular, may result in the omission of many eukaryotic phytoplankton species. This limitation, combined with the characteristics of the plankton net, could affect the comprehensiveness and accuracy of the results, potentially influencing the study's conclusions regarding phytoplankton diversity.

      The timing of fixation is crucial, as it directly affects whether the measured transcriptome accurately represents the organisms' actual transcriptional state in their native water environment. If fixation occurred a significant time after sample collection, the transcriptomic data may not reflect their true in situ transcriptional activity, which greatly reduces the relevance of this method.

      Thank you for your time, effort, and expertise.

      We agree that additional analyses could improve our understanding of the plankton communities sampled. We have conducted an array of alternative analyses that were not included in the current manuscript and plan to perform new analyses over the next few months as part of a deeper revision of the manuscript. We are especially interested in “providing a more holistic view of the functions” of individual plankton within the community.

      As for the protocol details, the pore size of the filter paper was chosen to focus on ~100 micron-sized organisms as a starting point: they are likely to contain more RNA than smaller organisms, making them well suited for an initial proof of concept of the methodology. That choice, however, is not particularly tightly constrained, therefore smaller plankton could be captured. This is supported by the lack of correlation, in our data, between organismal size and number of detected sequencing reads.

      Timing to cell death/fixation is a common question we receive not just in this manuscript but any RNA-Seq from primary samples. In this case, plankton were seen swimming until picking, and after picking each organism was deposited within two seconds into a lysis buffer for fixation. Therefore, we do not have reason to believe that the transcriptional activity sampled in the sequencing reads differs in any major way from the one in living plankton. Nonetheless, a study specifically testing the effect of time between ocean sampling and reverse transcription would provide more quantitative information on this point.

      Reviewer #2 (Public review):

      Summary:

      The paper introduces Ukiyo-e-Seq, a novel method integrating microscopy with single-cell transcriptomics to study individual, uncultured eukaryotic plankton cells. By combining microscopic imaging with transcriptomic analysis, the approach links plankton morphology to gene expression, enabling taxonomic identification and functional protein exploration. Ukiyo-e-Seq was tested on 66 microbial eukaryotic cells, revealing taxonomic diversity across four superkingdoms and allowing analysis of protein complexes and developmental genes in individual species. According to the authors, this method has the potential to advance single-cell marine biodiversity studies by addressing limitations in traditional taxonomy and metatranscriptomics, especially for rare or uncultured organisms.

      However, the study's conclusions are often weakly supported by data, particularly given that this is not the first study to combine microscopy and single-cell transcriptomics of eukaryotic plankton using Smart-seq2.

      Strengths:

      A notable strength is the authors' generation of several single-cell transcriptomes for the diatom Chaetoceros, which could benefit from greater focus rather than broadly addressing eukaryotic single cells.

      Weaknesses:

      The study lacks comparison with other single-cell transcriptomics studies and it was presented as the first study that combines imaging and single-cell transcriptomics (smart-seq2) of eukaryotic plankton while in fact it is not. The sampling methodology is not replicable as the authors used a tea strainer instead of standard plankton collection equipment to filter larger cells. Terminology throughout the paper is unconventional, such as "public and private contigs," "single-organism genomics," "highly expressed contigs," and "optical methods." Additionally, the authors did not specify which database was used for taxonomic assignments. These issues may stem from the authors' limited background in microbial ecology. Overall, the study has many drawbacks and it could benefit from complete rewriting and focusing mainly on single-cell transcriptomics of diatoms.

      Thank you for your time, effort, and expertise.

      There might be a bit of confusion between single-cell and single-organism sequencing, likely due to lack of clarity in our initial submission. In particular, in this manuscript no effort was spent trying to dissociate oligocellular plankton into individual cells before sequencing. While probably feasible, we expect that to be technically much harder than single-organism sequencing as performed here. The reviewer does not reference a published paper where combined imaging and RNA-Seq of individual uncultured plankton has been achieved, and we were unable to find one in the scientific literature. As stated in the manuscript, others have already performed some work on cultured plankton and single-organism sequencing (without matching images) of uncultured environmental microorganisms.

      The suggestion to focus on a smaller biological niche such as diatoms and adopt language more familiar to that specific community is well received. Indeed, given that organisms as diverse as fish larvae and diatoms could be profiled with Ukiyo-e-Seq, future studies could use the same method to address specific questions with a deeper and more narrow scope. However, this manuscript is demonstrating the feasibility of Ukiyo-e-Seq and its ability to produce usable data for a broad spectrum of organisms: part of the scientific audience might not have a specific interest in diatoms.

      The tea strainer was used for coarse pre-filtering: the exact pore size, geometry and factory tolerance on those measurements are inconsequential because each organism is later chosen (or not) based on a high-resolution microscopy image (or multiple, if fluorescence is considered). This really is a strength of Ukiyo-e-Seq over FACS or droplet-based sorters, which can only collect coarse optical information from each organism for (typically) less than 1 millisecond. In Ukiyo-q-Seq, while the actual decision to pick an individual is currently manual (by the operator of the picker), it can be automated in principle. For instance, one could build a machine learning model of plankton taxonomy based on a large collection of labelled images and use predictions from such a model to automatically drive the picker (e.g. focussing on diatoms), increasing throughput. Even in that case, however, the initial filtering stages using tea strainers, plankton nets, filter paper etc. would not be critical for the final selection of individuals as long as they are not too restrictive.

      The database used for taxonomic assignment was the NCBI non-redundant nucleotide database, accessed through the reference library provided by Kraken2 (nt).

      Reviewer #3 (Public review):

      Gatt et al. present a novel take on single-cell RNA-sequencing from complex planktonic samples, introducing an approach they aptly named Ukiyo-e-Seq. This work combines environmental sampling with cell picking, microscopic imaging, and Smart-seq2 single-cell RNA sequencing to profile uncultured eukaryotic plankton. Developing single-cell approaches for such ecosystems is critical, given the poor representation of many planktonic species in cultures and reference databases. This work could help bridge existing technological gaps between morphological and molecular studies of aquatic microeukaryotes

      The authors argue that microscopy does not provide information on the biochemistry of species under consideration. At best, it provides taxonomic labeling of species within a sample, yet imaging fails to assess their metabolic state or to disentangle cryptic species. In a standard metatranscriptomic setup, the sequence pool is described by aligning assembled contigs with reference databases to obtain functional and taxonomic information. This complex community-level data is impossible to parse at the single-organism level. Moreover, by relying on reference datasets, a lot of potential information can be missed. The aim of the approach is to combine the strengths of both methods, generating single-cell transcriptomic data linked to individual plankton images.

      Strengths:

      Ukiyo-e-Seq generated a valuable dataset by combining imaging and transcriptomics for individual planktonic organisms from environmental samples. This multimodal approach has the potential to improve taxonomic predictions and functional insights at the single-organism level. This manuscript demonstrates the technical feasibility of such an approach. Data of this type is rare and thus represents a valuable resource to further advance single-cell sequencing of planktonic species from environmental samples.

      Weaknesses:

      (1) The merge-split strategy, where single-cell reads are pooled prior to assembly, is counterintuitive. Pooling obscures the single-organism resolution that single-cell methods aim to achieve. The approach might be useful for assembling low-coverage contigs, but risks masking unique expression profiles for transcripts unique to a given well. As an alternative, the authors could assemble each well independently to obtain well-specific transcriptomic bins. Assemblies could then be clustered based on sequence similarity, thereby imposing strict clustering parameters to maintain resolution, to create a common reference for downstream analysis if needed. In my opinion, better results would be obtained by implementing a per-well assembly and read mapping.

      (2) The focus on the top five most expressed contigs throughout the manuscripts' data analysis is a limiting choice, as it excludes most contigs. In the preprint, we are presented with a very narrow view of the data. Visualising the entire range of assembled contigs would provide a better picture of the transcriptomic composition and diversity per well. It would be interesting to assess if the full information could be used to preliminary bin transcriptomic sequences from individual wells, for example, by gathering all 'private' contigs with high read coverage in a single well. Does such a set represent a single complete eukaryotic transcriptome?

      (3) I missed a verification with (broad-scale) taxonomic assessments based on the associated microscopic images. In their goals, the authors state that a joint approach has the potential to discover new taxonomic biodiversity. I agree, and to me, this is what is exciting about the preprint, yet I miss an example or the right bioinformatic implementation to drive home this claim. Are there organisms in wells where poor taxonomic annotations, based on alignment to a reference database or the LCA approach implemented in Kraken2, would usually result in ignoring the species in classic metatranscriptomics? Can you advance the taxonomic annotation by referring back to the organisms' picture? Can manual assessment of taxonomy advance the results from the LCA approach?

      (4) The current use of AlphaFold to predict protein structures does not convincingly add to the study's core objectives.

      Overall, Ukiyo-e-Seq presents a promising method for studying single-cell diversity in environmental samples, though the bioinformatic pipeline requires refinement to support some of the claims made by the authors. Additionally, the manuscript would benefit from clarity and additional details in its methods and a more consistent approach to presenting results and summary statistics across all assembled contigs and all sampled wells, rather than focusing on selected wells.

      Thank you for your time and effort, and for your expertise on the matter.

      The suggestions to conduct additional bioinformatic analyses to explore more fully the criticality and potential of various design choices (e.g. meta-assembly) are well received. We have tried some of those ideas already (e.g. assembling individual wells) and we have considered but not yet conducted or polished others (e.g. a more thorough taxonomic verification). We will endeavour to carry out as many of those analyses as possible during the deeper revision process in the coming months.

      AlphaFold 3’s use was designed to demonstrate the ability to investigate protein-protein interactions from individual species. When two peptide sequences are detected within the same well, they are more likely to be potential interacting partners than in a metatranscriptomic study, because the compartmentalisation of reads into tens or hundreds of wells greatly reduces the search space of potential interaction partners (which has a baseline runtime complexity of n squared, where n is the number of peptide sequences identified).

      ----------

    1. Reviewer #3 (Public review):

      This work brings a computational approach to the study of promoters and transcription. The paper is improved but there are still factual errors and implausible explanations. I am not convinced by the response from the authors, concerning the promoter -35 element, in their rebuttal.

      Comments on author rebuttal:

      - We respectfully but strongly disagree that our analysis has misrepresented the true nature of -35 boxes. First, accounting for more A's at position 5 in the PWM is not going to lead to a "critical error." This is because positions 4-6 of the motif barely have any information content (bits) compared to positions 1-3 (see Fig 1A).

      The analysis does misrepresent the consensus -35 element, which is, unequivocally, TTGACA. I agree that positions 4-6 of the element are less well-conserved.

      - This assertion is not just based on our own PWM, but based on ample precedent in the literature. In PMID 14529615, TTG is present in 38% of all -35 boxes, but ACA only in 8%.

      This does not mean that TTGACA is not the consensus, or that "ACA" is not important at promoters where it's present.

      - In PMID 29388765, with the -10 instance TATAAT, the -35 instance TTGCAA yields stronger promoters compared to the -35 instance TTGACA (See their Figure 3B).

      This is a known phenomenon and results from "perfect" promoters being limited at the point of RNA polymerase promoter escape (because the RNAP struggles to "let go" of perfect promoters). This does not mean the TTGACA is not the consensus. Indeed, and this is a key point, it is evident in the figure the authors refer to that TTGACA stimulates more transcription than alternative -35 sequences when -10 elements are not perfect.

      - In PMID 29745856 (Figure 2), the most information content lies in positions 1-3, with the A and C at position 5 both nearly equally represented, as in our PWM.

      The motif shown in this paper suffers from exactly the same issue as the paper under review; the variable spacing between the -35 hexamer and -10 element isn't taken into account by MEME.

      - In PMID 33958766 (Figure 1) an experimentally-derived -35 box is even reduced to a "partial" -35 box which only includes positions 1 and 2, with consensus: TTnnnn.

      This paper does not show an "experimentally-derived -35 box" in Figure 1 (or anywhere else, as far as I can see).

      - In addition, we did not derive the PWMs as the reviewer describes. The PWMs we use are based on computational predictions that are in excellent agreement with experimental results. Specifically, the PWMs we use are from PMID 29728462, which acquired 145 -10 and -35 box sequences from the top 3.3% of computationally predicted boxes from Regulon DB.

      The paper mentioned states "for the genomic RNAP logo, sequences were taken from computationally predicted RNAP binding sites on RegulonDB" so these are not experimentally defined promoters? It's not obvious from the paper, or regulon DB, which sequences these are or how they were predicted.

      - Thank you for pointing out that our original submission was incomplete in this regard. We address these concerns by new analyses, including some new experiments. First, Rho dependent termination is associated with the RUT motif, which is very rich in Cytosines (PMID: 30845912). Given that our sequences confer between 65%-78% of AT-content, canonical rho dependent termination is unlikely. However, we computationally searched for rho-dependent terminators using the available code from PMID: 30845912, but the algorithm did not identify any putative RUTs. Because this analysis was not informative, we did not include it in the paper.

      I don't believe it is the case that Rho absolutely requires a RUT sequence. My understanding is that, if an RNA is not translated, Rho will intervene (e.g. see PMID: 18487194).

      - We respectfully disagree that the reviewer's point is pertinent because what the reviewer is referring to is the likelihood that the sequence is a promoter, which indeed increases with AT content, but we are focused on the likelihood that a sequence becomes a promoter through DNA mutation

      I disagree that this distinction is relevant. An AT-rich sequence will much more closely resemble a promoter by chance than a GC rich sequence. As an extreme example, the sequence TTTTTT can be converted into a reasonable -10 element by one change (to TATTTT) but the sequence GGGGGG can't.

    1. Reviewer #1 (Public review):

      Summary:

      This study from Abssy et al. aims to determine if different non-invasive peripheral stimulation techniques - such as magnetic and electrical stimulations - may influence pain intensity, unpleasantness, and secondary hyperalgesia using a 4-arm parallel-group study. They observed no effect on pain intensity and unpleasantness. Also, they reported that only the TENS (electrical stimulation) did not impact secondary hyperalgesia. They hypothesized that the effects were probably due to the sound emitted by RPMS (magnetic stimulation). In a follow-up study, they tried to determine if covering the sound of RPMS would abolish the effect on secondary hyperalgesia using a single-arm design. They observed no effect of RPMS.

      Strengths:

      (1) The research team recruited a relatively large sample size for this type of study.

      (2) The phasic heat pain protocol appears rigorous and well-described.

      (3) The Figures are helpful in facilitating the understanding of the study design and results.

      (4) The statistical analyses appear sound.

      Weaknesses:

      (1) The proposed design is not sufficient to answer the research question. The rationale of the study proposed in the introduction is that auditory stimulation may explain the analgesic effects of RPMS. To answer this question, the authors should have used a factorial design using 4 groups (active RPMS + sound; active RPMS + no sound; sham RPMS + sound; sham RPMS + no sound). Using this design, it would have been possible to determine if the sound, the afferent stimulation, or both are necessary to produce analgesia. Rather, they tested two types of RPMS (iTBS, cTBS) without real rationale, one electrical stimulation and a placebo.

      (2) There are multiple ways that the current design could have introduced biases. The study was not randomized but pseudo-randomised. What does that mean? Was their allocation concealment? Was the assessor and data analyst blinded to group allocation? Did an intention to treat analyses were performed? Did the participants were adequately blinded (was it measured)?

      (3) The TENS parameters used were not optimal and are not those commonly used in clinical practice. This could have explained the lack of TENS effects. The lack of TENS effects has not been discussed and it is concerning. If TENS had been effective (as expected), the story about the auditory effects would not have been presented as the primary mechanisms underlying the current results.

      (4) No primary outcome has been identified. It is important to mention that the interpretation of results is based on the presence of only one statistically significant result. Pain intensity and pain unpleasantness are not affected. This was not properly addressed in the Discussion. What does that mean that secondary hyperalgesia is affected but not pain?

      (5) The use of secondary hyperalgesia as a variable requires further clarification. How is it possible to measure secondary hyperalgesia if there is no lesioned tissue? If heat creates secondary hyperalgesia without lesion, what does that mean physiologically? Is it a valid and reliable "pain" variable?

      (6) The follow-up study has been designed to cover the RPMS sound using pink noise. However, the pink noise was also present during the PHP measurement. How can we determine whether the absence of change is due to the pink noise during the RPMS or the presence of pink noise during PHP? I don't think this is possible to discriminate.

      Appraisal:

      (7) Despite all these potential issues, authors interpret their data with high confidence and with several overstatements in the Title, Abstract, and Discussion. The results do not support their conclusions. The fact that auditory stimulation may produce an analgesic effect is a hypothesis, but the current study cannot ascertain it.

    2. Reviewer #2 (Public review):

      Summary:

      In this article, Abssy, Osokin, Osborne, et al. aimed to demonstrate the effect of Peripheral Magnetic Stimulation (PMS) as a pain relief tool, studying its effects in an experimentally induced pain paradigm applied over healthy subjects. This is a relevant objective, as it will give a proxy indication of its utility as a clinical intervention to treat pain. Shockingly, in the first experiment, the authors found that this effect existed, not only in the active PMS groups but also in the sham PMS. With a clever second experiment, the authors used pink noise to mask the clicking sound and the PMS: this modification abolished the hypoalgesic effect of PMS.

      Strengths:

      This study presents an adequately calculated sample size (n = 100 for study 1 and n = 32 for study 2). This gives trustability to the results and allows for a correct disaggregated analysis to assess gender effects, which correct execution does not often occur. Nuisance variables are adequately addressed, figures and writing are clear, and I especially liked figures 4 and 5 for their easiness of interpretation. They explore two different stimulation protocols for the PMS, extending their results beyond parametrization. Secondary hyperalgesia is a particularly relevant measurement, as it is a common symptom in many relevant painful conditions. Pseudorandomization and counterbalanced design are also appreciated, as well as reinforcement of the results through Bayesian statistical approaches. Regarding the scientific content, the main result (auditory modulation of pain in PMS) is exciting and very interesting by itself and will be relevant for the pain community, granting further research, both from a fundamental and clinical perspective. Personally, I respect that they recognize that results did not match their a priori hypothesis, instead of committing HARKing. And it is a very thrilling mismatch for sure!

      It will be especially interesting for those among us dedicated to neural stimulation for pain treatment.

      Weaknesses:

      Although the study presents solid results, some specific concerns make me reluctant to accept the interpretations that the authors take from said results. I list the most important here.

      (1) My biggest concern in this paper is that the stimulation protocols are not applied after pain was induced in the subjects, but before. This is not bad in itself, but as the paper presents the stimulations as potential "treatments" it generates a severe mismatch between the objective, context (introduction), and impact (discussion) presented for the experiments, and how they are actually designed. This adds to the fact that healthy volunteers are used here to generate a study with low translational capability, that aims to be translational and provide an indication for clinics (maybe this is why the reduction in pain intensity caused by PMS when applied in patients, reported in references [29, 35 and 39], is not observed here).

      (2) TENS treatment duration is simply too short (90s) to be considered a therapeutic TENS intervention. I get that this duration was chosen to match the one of PMS, but TENS is never applied like this in the clinics, in which the duration varies from 10 minutes to an hour (or more). This specific study comparing different durations recommends 40 minutes for knee osteoarthritis pain relief (PMID: 12691335). Under these conditions, this stimulation is more similar to a sham TENS than to a real TENS treatment: I would suggest interpreting it as such. As the paper is right now, it could give the impression that PMS could produce clinical effects not observed in TENS, but while the PMS application resembles a clinical one, the TENS application does not (due to its extremely short duration). As an example, giving paracetamol at a dose 10 times below its effective dose is a placebo, not a paracetamol treatment.

      (3) This study measured pain, not central sensitization. Specifically, the effects refer to the area of secondary hyperalgesia. The IASP definition for central sensitization is "Increased responsiveness of nociceptive neurons in the central nervous system to their normal or subthreshold afferent input." (PMID: 32694387). No neuronal results are reported in this article. Therefore, central sensitization is not measured here, and we do not know if it is reduced by sound. This frontally clashes with the title of the article and with many interpretations of the results. For a deep review on this topic, I recommend PMID: 39278607 and the short article PMID: 30416715.

      (4) There is no mention of blinding/masking/concealing in this manuscript. Was the therapist blind to whether they applied one protocol, another, or a placebo? Were the evaluators blind, as this can heavily influence their measurements? And the volunteers? Was allocation concealed? Was this blinding measured afterwards? Blinding is, together with randomization, the most important methodological feature for those interventional studies. For example, not introducing blinding and concealing directly makes a study lose 4 out of 10 points in the PEDro scale, failing to fulfill criteria 3, 5, 6, and 7 (https://pedro.org.au/english/resources/pedro-scale/). Continuing with methodological considerations, the dropout percentage is high (18% for the first and 25% for the second study), both above the 15% cutoff for criterion 8 of the PEDro, losing another point. It is not mentioned whether the statistical analysis was intention-to-treat or per-protocol. Assuming the second, criterion 9 is failed too. Also, although between-group comparisons are done for study 1, they are not for study 2. Criterion 10 depends on this, so I would recommend doing it to avoid failing it. As it is right now, the study will be a 3/10 on the PEDro scale, being therefore considered "low-quality level evidence". As some of these criteria can be fulfilled in this study, I will recommend doing so to increase its quality level to medium (more in "recommendations for authors").

      (5) Data reporting and statistical treatment can be improved, as only differences are reported and regression to the mean is not accounted for in this study. Moreover, baseline levels for the dependent variables (control session) are not accessible for evaluation and they are not compared statistically, making it impossible to know if the groups were similar at baseline. This will imply failing criterion 3 of the PEDro, for a total of 2/10 points.

    3. Author response:

      Reviewer 1 (Public Review)

      (1) The proposed design is not sufficient to answer the research question. The rationale of the study proposed in the introduction is that auditory stimulation may explain the analgesic effects of RPMS. To answer this question, the authors should have used a factorial design using 4 groups (active RPMS + sound; active RPMS + no sound; sham RPMS + sound; sham RPMS + no sound). Using this design, it would have been possible to determine if the sound, the afferent stimulation, or both are necessary to produce analgesia. Rather, they tested two types of RPMS (iTBS, cTBS) without real rationale, one electrical stimulation and a placebo.

      We will clarify that the study design employed was originally designed to determine whether iTBS or cTBS would be more effective to reduce pain. We included TENS as a positive control, and sham as a negative control. We were indeed surprised by the findings, and present them herein. Future RCTs should be performed to reproduce these findings.

      (2) There are multiple ways that the current design could have introduced biases. The study was not randomized but pseudo-randomised. What does that mean? Was their allocation concealment? Was the assessor and data analyst blinded to group allocation? Did an intention to treat analyses were performed? Did the participants were adequately blinded (was it measured)?

      This study was not designed as an RCT, but rather as experimental study. The study was pseudo-randomized to ensure that the groups had equal allocation and distribution of sexes.

      The groups were blinded to the other stimulations (they were not informed of the various arms of the study, through different consent forms).

      It was not possible to blind the experimenter as the iTBS and cTBS protocols are very different: iTBS has multiple bursts separated by brief intervals, whereas cTBS is continuous). The data were masked for analysis, and only unblinded at the final stage. We will update the manuscript to reflect these changes.

      (3) The TENS parameters used were not optimal and are not those commonly used in clinical practice. This could have explained the lack of TENS effects. The lack of TENS effects has not been discussed and it is concerning. If TENS had been effective (as expected), the story about the auditory effects would not have been presented as the primary mechanisms underlying the current results.

      We acknowledge that this is a limitation of the study. A future study should address this. However, we will not remove the arm for transparency.

      (4) No primary outcome has been identified. It is important to mention that the interpretation of results is based on the presence of only one statistically significant result. Pain intensity and pain unpleasantness are not affected. This was not properly addressed in the Discussion. What does that mean that secondary hyperalgesia is affected but not pain?

      We reiterate that this study was not designed as an RCT, but rather an experimental study with The primary outcomes measures that capture change in  were measures of pain sensitivity (pain intensity NRS, pain unpleasantness NRS, and secondary hyperalgesia). We will clarify this in the revised manuscript.

      We will now include discussion of the effects being solely on secondary hyperalgesia, and not on pain intensity and unpleasantness.

      (5a) The use of secondary hyperalgesia variable is concerning. How is it possible to measure secondary hyperalgesia if there is no lesioned tissue?

      Secondary hyperalgesia refers to hyperalgesia assessed in an area adjacent to or remote of the site of stimulation. In general, it is not required to lesion a tissue to activate the nociceptive system or to induce pain. We have cited other studies that have employed secondary hyperalgesia as a pain outcome measure without inducing a lesion.

      Hyperalgesia reflects increased pain on suprathreshold stimulation. Then, one measures the subjective response to a painful (i.e. suprathreshold) stimulation, then applies a conditioning stimulation (e.g. heat), and measures the subjective response to the same original stimulus. If the response after conditioning is higher than the baseline measure, hyperalgesia has been induced. Secondary hyperalgesia just refers to hyperalgesia assessed in an area adjacent to or remote of the site of stimulation. In general, it is not required to lesion a tissue to activate the nociceptive system or to induce pain.

      (5b) If heat creates secondary hyperalgesia without lesion, what does that mean physiologically?

      Secondary hyperalgesia is normally interpreted as a perceptual correlate of central sensitization.

      (5c) Is it a valid and reliable "pain" variable?

      Yes and yes. A noxious heat stimulus can reliably elicit secondary hyperalgesia (see section 3.2 from Quesada et al. 2021). We also cite several studies that have used secondary hyperalgesia as an outcome measure of central sensitization in pain.

      (6) The follow-up study has been designed to cover the RPMS sound using pink noise. However, the pink noise was also present during the PHP measurement. How can we determine whether the absence of change is due to the pink noise during the RPMS or the presence of pink noise during PHP? I don't think this is possible to discriminate.

      We will add a third study that performs the control analysis with the sound of the rPMS masked, but no pink noise otherwise. The study will be performed in two groups: one with pink noise, and one without pink noise.

      Appraisal

      (7) Despite all these potential issues, authors interpret their data with high confidence and with several overstatements in the Title, Abstract, and Discussion. The results do not support their conclusions. The fact that auditory stimulation may produce an analgesic effect is a hypothesis, but the current study cannot ascertain it.

      We believe that the chief concern with the interpretation lies with concerns with the second study. The proposed third experiment will address these concerns.

      Reviewer 2 (Public Review):

      (1) My biggest concern in this paper is that the stimulation protocols are not applied after pain was induced in the subjects, but before. This is not bad in itself, but as the paper presents the stimulations as potential "treatments" it generates a severe mismatch between the objective, context (introduction), and impact (discussion) presented for the experiments, and how they are actually designed. This adds to the fact that healthy volunteers are used here to generate a study with low translational capability, that aims to be translational and provide an indication for clinics (maybe this is why the reduction in pain intensity caused by PMS when applied in patients, reported in references [29, 35 and 39], is not observed here).

      We will reframe these as prophylaxis, rather than treatment. This study was an experimental study originally designed to determine which stimulation parameters (cTBS or iTBS) would be better suited to modulate pain. We performed the study in healthy individuals undergoing acute pain, akin to a person undergoing painful procedure, which could lead to central sensitization and pain persistence (e.g., post-surgical pain). However, before testing this in individuals undergoing actual procedures, it is essential to determine efficacy in people before translation.

      Khan et al [29] is a case study with neuropathic pain, whereas our study uses a nociceptive pain model. Lim et al [35] employed 10 sessions of rPMS stimulation in patients with acute low back pain. Similar to our study, the change in VAS driven by rPMS was no different than the sham stimulation. We notice that there is no reference 39, and will correct this.

      (2) TENS treatment duration is simply too short (90s) to be considered a therapeutic TENS intervention. I get that this duration was chosen to match the one of PMS, but TENS is never applied like this in the clinics, in which the duration varies from 10 minutes to an hour (or more). This specific study comparing different durations recommends 40 minutes for knee osteoarthritis pain relief (PMID: 12691335). Under these conditions, this stimulation is more similar to a sham TENS than to a real TENS treatment: I would suggest interpreting it as such. As the paper is right now, it could give the impression that PMS could produce clinical effects not observed in TENS, but while the PMS application resembles a clinical one, the TENS application does not (due to its extremely short duration). As an example, giving paracetamol at a dose 10 times below its effective dose is a placebo, not a paracetamol treatment.

      We acknowledge that this is a limitation, and will address this in the Discussion of the revised manuscript.

      (3) This study measured pain, not central sensitization. Specifically, the effects refer to the area of secondary hyperalgesia. The IASP definition for central sensitization is "Increased responsiveness of nociceptive neurons in the central nervous system to their normal or subthreshold afferent input." (PMID: 32694387). No neuronal results are reported in this article. Therefore, central sensitization is not measured here, and we do not know if it is reduced by sound. This frontally clashes with the title of the article and with many interpretations of the results. For a deep review on this topic, I recommend PMID: 39278607 and the short article PMID: 30416715.

      It is widely accepted that central sensitization is the neurophysiological basis of secondary hyperalgesia (see PMID: 11313449; PMID: 10581220).

      The reviewer is conflating secondary hyperalgesia due to central sensitization and chronic pain. Whether chronic pain is driven or maintained by central sensitization is not the goal of our study. However, there is ample evidence that nociceptive drive can induce plasticity in the CNS, which alters pain sensitivity, and that these changes facilitate pain.

      (4a) There is no mention of blinding/masking/concealing in this manuscript. Was the therapist blind to whether they applied one protocol, another, or a placebo? Were the evaluators blind, as this can heavily influence their measurements? And the volunteers? Was allocation concealed? Was this blinding measured afterwards? Blinding is, together with randomization, the most important methodological feature for those interventional studies. For example, not introducing blinding and concealing directly makes a study lose 4 out of 10 points in the PEDro scale, failing to fulfill criteria 3, 5, 6, and 7 (https://pedro.org.au/english/resources/pedro-scale/).

      This study was not designed as an RCT, but rather as experimental study. The study was pseudo-randomized to ensure that the groups had equal allocation and distribution of sexes.

      The groups were blinded to the other stimulations (they were not informed of the various arms of the study, through different consent forms). However, blinding was not measured afterwards (again, this was not meant to be an RCT).

      It was not possible to blind the experimenter as the iTBS and cTBS protocols are very different: iTBS has multiple bursts separated by brief intervals, whereas cTBS is continuous). The data were masked for analysis, and only unblinded at the final stage. We will update the manuscript to reflect these changes.

      (4b) Continuing with methodological considerations, the dropout percentage is high (18% for the first and 25% for the second study), both above the 15% cutoff for criterion 8 of the PEDro, losing another point.

      In the study, only 2 withdrew after feeling the heat, 2 were lost to follow up, and 2 had incomplete data. That totals 6/123 in Study 1. In study 2, none of the participants that met inclusion/exclusion criteria, and who were ‘allocated’ to the study were included (0% dropout/data loss).

      We are unsure how to address this point, as we had clear inclusion/exclusion criteria, and these could only be measured after consenting. As this is an experimental study performed on healthy individuals in a university setting, we are not able to collect any study related data prior to consent.

      We openly reported individuals who did not meet the criteria, and thus were excluded. These criteria are a combination of what is required to collect good quality data, and what we are ethically permitted to do. We understand that in an interventional trial where >15% drop out due to intolerance, or adverse events would indeed be concerning.

      (5) Data reporting and statistical treatment can be improved, as only differences are reported and regression to the mean is not accounted for in this study. Moreover, baseline levels for the dependent variables (control session) are not accessible for evaluation and they are not compared statistically, making it impossible to know if the groups were similar at baseline. This will imply failing criterion 3 of the PEDro, for a total of 2/10 points.

      This only concerns study 1, as study 2 is a within subject study design. Study 1 provides the raw data in Figure 4. We will provide the raw data for each of the primary outcome measures in a supplemental table in the revision.

    1. Background Multiplexing single-cell RNA sequencing experiments reduces sequencing cost and facilitates larger scale studies. However, factors such as cell hashing quality and class size imbalance impact demultiplexing algorithm performance, reducing cost effectivenessFindings We propose a supervised algorithm, demuxSNP, leveraging both cell hashing and genetic variation between individuals (SNPs). The supervised algorithm addresses fundamental limitations in demultiplexing with only one data modality. The genetic variants (SNPs) of the subset of cells assigned with high confidence using a probabilistic hashing

      Reviewer 1: Lei Li Reviewer Comments: Lynch et. al developed demuxSNP, a supervised demultiplexing approach for single-cell cell hashing data in a multi-modal (hashtag expression and SNP profiles) fashion. They utilized a probabilistic method to infer sample identities of cells using cell hashing modality, and then build a KNN model using SNPs of high cofinance ones from previous step. They then use this KNN model to predict cell identities for cells assigned as uncertain or negative by cell hashing.They have demonstrated the performance through a comparison with existing single-modal methods using both real data and simulated data. They have published an R package for the research community. It is interesting and encouraging to see another study focuses on multi-modal demultiplexing for cell hashing data. Below are some major and minor points from my side:1. I am not surprised that a multi-modal demultiplexing beats single-modal methods across both real and simulated datasets. To my knowledge, there are at least two groups proposed multi-modal demultiplexing approach for cell hashing data. Both were uploaded to bioRxiv last year and get published recently. One called hadge (https://link.springer.com/article/10.1186/s13059-024-03249-z ), and another called HTOreader hybrid (https://academic.oup.com/bib/article/25/4/bbae254/7686601), which is discussed by this study. Hadge is a comprehensive framework that integrated popular cell hashing-based and SNP-based methods, allowing for a joint deconvolution by combining best method from each modality. HTOreader hybrid proposed an improved demultiplexing method for cell hashing signals, and then also integrates demultiplexing results from both modality for a better deconvolution in a hybrid fashion. Indeed, this work has implemented different method for the same purpose. I tried both methods, and there're some major updates between bioRxiv version and published version. Thus, even one of them has been discussed, I think it's still necessary to include these two published methods into comparison, to reveal pros and cons of different methods, therefore provide useful information for users to select the method according to their specific experiment configurations.2. demuxSNP method picked top N commonly expressed genes for SNP calculation. In the tutorial on Github, the N was set to 100. I am wondering in a more heterozygous dataset, the N = 100 still sufficient or not. Is there a way for users to determine the N for their specific dataset more systematically? Or the authors can show some data to demonstrate that N = 100 is robust across different datasets?3. The dataset GSE267835 is private. Please provide reviewer token in the Data Availability statement during submission process.4. Color of uncertain cells in Fig1-B is a bit misleading cause in Fig1-A the same color was used to represent "background staining". Even A and B and different panels, however, a big black arrow makes readers thought they're the same data. Therefore, change the color of uncertain cells into another color would be good to avoid confusions.5. In Fig2-A and B, what are the units for the X axis? Are they log2 or log2 hashtag counts? Please add that information to the figure and legend.6. For Fig-2 C and D, please use the formal spell of names of existing methods like you did in Fig2E.7. Please add line numbers to the draft for reviewers' convenience8. Some minor format issues exist. For example, the "Result" section should a header format instead of normal text.

    1. performance of stMMR in multiple analyses, including spatial domain identification, pseudo-spatiotemporal analysis, and domain-specific gene discovery. In chicken heart development, stMMR reconstruct the spatiotemporal lineage structures indicating accurate developmental sequence. In breast cancer and lung cancer, stMMR clearly delineated the tumor microenvironment and identified marker genes associated with diagnosis and prognosis. Overall, stMMR is capable of effectively utilizing the multi-modal information of various SRT data to explore and characterize tissue architectures of homeostasis, development and tumor.

      Reviewer 2: Hongzhi Wen Reviewer Comments: The paper introduces stMMR, a multi-modal graph learning method designed to integrate gene expression, spatial location, and histological information for accurate spatial domain identification from spatially resolved transcriptomics (SRT) data. The method employs graph convolutional networks (GCN) and self-attention modules, along with cross-modal contrastive learning, to enhance feature integration and representation.Strengths:1. Using GCN to capture local spatial dependency is natural and effective. Introducing attention mechanism for capturing global relations intuitively make senses, however, need more justification. Contrastive learning for cross-modal feature fusion is also a natural choice in multimodal learning. Overall, the methodology is novel and solid.2. Extensive benchmark analysis across various types of spatial data and tissues demonstrates superior performance of the method in spatial domain identification, pseudo-spatiotemporal analysis, and domain-specific gene discovery. The empirical evidence is very convincing.3. The method's application to chicken heart development, breast cancer, and lung cancer showcases its potential in reconstructing spatiotemporal lineage structures and delineating tumor microenvironments, highlighting its value in clinical research.Weaknesses:1. In Figure 4, SpaceFlow is the only baseline for the case study. However, the performance of SpaceFlow is not topranked in other experiments. There should be a justification for why SpaceFlow is highlighted here.2. The contribution of the global attention mechanism to the whole framework is not very clear. The authors may provide more intuition and empirical justification (e.g., ablation study) if they would like to highlight this design.3. By introducing the hyperparameters $\alpha$, $\beta$ and $\gamma$ in Eq. (11), the method has a significantly larger search space than other methods. It is important to note how these hyperparameters are chosen in practice, more importantly, whether the test performance is referred when adjusting these hyperparameters. This might result in an unfair evaluation.

    2. AbstractDeciphering spatial domains using spatially resolved transcriptomics (SRT) is of great value for the characterizing and understanding of tissue architecture. However, the inherent heterogeneity and varying spatial resolutions present challenges in the joint analysis of multi-modal SRT data. We introduce a multi-modal geometric deep learning method, named stMMR, to effectively integrate gene expression, spatial location and histological information for accurate identifying spatial domains from SRT data. stMMR uses graph convolutional networks (GCN) and self-attention module for deep embedding of features within unimodal and incorporates similarity contrastive learning for integrating features across modalities. Comprehensive benchmark analysis on various types of spatial data shows superior

      Reviewer 1: Shihua Zhang Reviewer Comments: In this paper, the authors developed a multi-modal deep learning method for identifying spatial domains from ST data by integrating gene expression, spatial location and histological information. This method adopts the graphconvolutional networks and self-attention module for deep embedding of features within unimodal and incorporates similarity contrastive learning for integrating features across modalities. They did several typical analysis to valid this this method. Generally, the wiring of this paper is OK. More specific comments:1. Spatial domain has been overwhelmingly studied recently. The authors need to pay more attention to why it is needed to introduce a new method. The novelty of the current method should be carefully clarified. For example, how the histological information help to improve the performance? Does the "geometric" deep learning really help?2. This method has been applied to some stereotypical data. The authors should applied it to some recently generated data by some new ST techniques.3. Figure 3 stMMR enhances spatial gene expression profiles. It is hard to see how the method enhance the spatial gene expression (e.g., LPL).4. With the accumulation of multi-slice spatial transcriptome data, the integration and alignment of spatial transcriptome data will be essential. Can this method be extended for this situation like STAGATE (Nat Comput Sci.2023 Oct; 3(10):894-906)? This will be valuable for ST analysis.5. The scalability of this method should be carefully explored.6. The authors should provide a detailed tutorial for users.

    1. Conclusions The chromosome-level genome of piauçu exhibits high quality, establishing a valuable resource for advancing research within the group. Our discoveries offer insights into the evolutionary dynamics of Z and W sex chromosomes in fish, emphasizing ongoing degenerative processes and indicating complex interactions between Z and W sequences in specific genomic regions. Notably, amhr2 and bmp7 are potential candidate genes for sex determination in M. macrocephalus.

      Reviewer 2: Changwei Shao Reviewer Comments: The authors reported the M. macrocephalus reference genome with a highly degenerated ZW sex chromosome and analyzed the expression pattern of sex chromosomes. In a word, this work extends our understanding of the mechanisms of sex chromosome evolution of fish species. The interpretation of the results is sound for the most part, and gives enough proof to verify their results. I just have few concerns as followed.1.On line 54, please confirm it. In the tongue-sole, the size of Z chromosome (21.91Mb) is larger than the W chromosome(16.45Mb).2.On line 88, 89 and 116, the numbers mentioned do not correspond with the results in Figure 1A. Please confirm it.3.In the section on "Gene Prediction and Annotation", a more comprehensive prediction of gene structure can be achieved by combining three methods: de novo prediction, transcriptome prediction, and homology prediction. The results obtained from these three approaches can be integrated using the EVM software, followed by annotation assessment with BUSCO. The method section is somewhat vague and lacks clear logic. For protein prediction, it is advisable to utilize multiple databases, such as SwissProt, InterPro, and Nr, to corroborate evidence from various sources.4.On line 210, there is an error in the caption of Figure 3. Figure 3B should be a colinearity map of the linkage groups and chromosomes.5.The SNP sites identified in females may include those from the Z chromosome, linkage group 23 (LG23) will contain SNP information from both the Z and W chromosomes. This could potentially affect the demarcation of the region of sex conflict.6.On the sex chromosomes, are there candidate genes related to sex differentiation in regions with a high enrichment of specific SNPs? please provide a detailed explanation.7.What is the distribution of genes in the Z and W chromosome-specific regions, and what is the gene loss rate?

    2. AbstractBackground Megaleporinus macrocephalus (piauçu) is a Neotropical fish within Characoidei that presents a well-established heteromorphic ZZ/ZW sex-determination system and thus, constitutes a good model for studying W and Z chromosomes in fishes. We used PacBio reads and Hi-C to assemble a chromosome-level reference genome for M. macrocephalus. We generated family segregation information to construct a genetic map, pool-seq of males and females to characterize its sex system, and RNA-seq to highlight candidate genes of M. macrocephalus sex determination.Results M. macrocephalus reference genome is 1,282,030,339 bp in length and has a contig and scaffold N50 of 5.0 Mb and 45.03 Mb, respectively. Based on patterns of recombination suppression, coverage, Fst, and sex-specific SNPs, three major regions were distinguished in the sex chromosome: W-specific (highly differentiated), Z-specific (in degeneration), and PAR. The sex chromosome gene repertoire was composed of genes from the TGF-β family (amhr2, bmp7) and Wnt/β-catenin pathway (wnt4, wnt7a), and some of them were differentially expressed.

      Reviewer1: Yusuke Takehana Reviewer Comments: The authors assembled a chromosome-level genomic sequence and identified the sex chromosomes of the fish Megaleporinus macrocephalus. This manuscript is potentially interesting because evolution of sex chromosomes and sex-determining genes are one of the most fundamental and popular topics in the evolutionary biology. However, the conceptual advance and the novelty of this study are quite limited. It is another paper adding now one more species to the list of assembled genomes in this fish family. In addition, there is nothing new about the description of the sex chromosomes such as their degenerative signature. Such studies have already been conducted many times and similar conclusions have been reported. Furthermore, the experimental evidence presented appears rather preliminary and is not sufficient to support the claims and interpretations presented in discussion. I am therefore afraid that I have to say that the manuscript does not provide new insights into evolution of sex chromosomes, and thus will not be of sufficient interest to the readers of Gigascience.1. Overall, the paper was very difficult to read due to a lack of logic structure and many errors, such as confusing between males and females, between chromosomes and linkage groups, and so on.2. The introduction is not logically written. It is unclear what is known and to what extent, and why the genome of this species is being determined.3. I did not understand why the authors concluded that Chr13 is the W chromosome and not the Z chromosome. They should assemble the Z and W chromosomes separately and confirm them from different information. It is also unclear how they rule out the possibility that the sequences are chimeric. If they really want to reveal the evolutionary process of sex chromosomes, they should use all the data (Hi-C, linkage analysis, Pool-seq, gene information) to compare the structure of Z and W in detail, including synteny with closely related species.4. The analysis on sex chromosome gene candidates is too poor. Basic analyses have not been conducted on whether these genes are W-specific, whether they are in both Z and W, whether they have paralogs or not on autosomes, how much sequence variation there is, when and in which cells they are expressed, etc.5. All of the discussions are superficial and lacking in logic, and it is unclear what they want to discuss.6. The figures legends are poorly explained, and contain incorrect information, so I don't understand the meaning of the data at all.7. This manuscript contained many grammatical errors leading to many confusing statements, and some sentences that were grammatically correct but awkward meaning. I strongly recommend that the authors seek advice of someone with a good knowledge of English, preferably a native speaker.

    1. Conclusions We applied CAT Bridge to experimentally obtained Capsicum chinense (chili pepper) and public human and Escherichia coli (E. coli) time-series transcriptome and metabolome datasets. CAT Bridge successfully identified genes involved in the biosynthesis of capsaicin in C. chinense. Furthermore, case study results showed that the convergent cross mapping (CCM) method outperforms traditional approaches in longitudinal multi-omics analyses. CAT Bridge simplifies access to various established methods for longitudinal multi-omics analysis, and enables researchers to swiftly identify associated gene-metabolite pairs for further validation.

      Reviewer2: JITENDRA KUMAR Barupal Reviewer Comments: To the authors,Thank you for the opportunity to review the manuscript GIGA-D-24-00083. The authors created a tool to predict association between genes and metabolites using various algorithms. The authors provide the tool as a web application, and as a python package. To get the reciprocal relationship between gene and metabolites, i.e. which metabolites can change which gene or vice versa, this tool can be a toolkit for the biologist or bioinformatician.The tool has application specially the relationship between changes in genes and metabolites is not direct, many complex mechanisms exist e.g. epigenetic or polymorphism. So the tool can be alternate to other available tools.Also, the manuscript brings the community focus on causal relationships instead of just correlation based relationships. The tool used temporal causality algorithms for predicting relationships between genes and metabolites.However, I recommend major revisions before publication. Here are my reasons and comments for the revisions:General issues with web accessibility and package installation :1. There are concerns about web accessibility, as indicated by web browsers flagging the connection as insecure. This may stem from geographical restrictions or the absence of HTTPS certification. Addressing these issues would ensure secure access to the server.2. Despite successful initiation of the client application from the git repository as a python module, no results were generated upon launching. It is suggested that the authors distribute the tool as a Docker image to facilitate seamless usage, eliminating concerns regarding dependencies and version compatibility.Other comments :1. There are inconsistencies regarding data preprocessing. While the manuscript mentions that the tool will handle preprocessing, it also indicates that users need to provide processed files. Clarification is needed on whether preprocessing is required. It seems, the tool required preprocessed data.2. For clarity use "causality and correlation" instead of "causality/correlation" algorithms.3.Can the tool process any new temporal numerical data series, or does it specifically filter for genes? For instance, if I provide a list of proteins along with a list of genes, will I receive the association between them? It is suggested to include this in the discussion section.4.Does the tool offer the capability to generate a causal diagram or network from these vectors, thereby providing visual support for their assertion regarding the causal relationship between metabolites and genes? If the author is working in this direction, it is suggested that information can be added in the discussion section.5. What definition of causal relationship did the author use, and could they provide a citation for their definition. Predictability or any other criteria were used for causal relationships. Please include the definition or criteria in the introduction and method section.6. What are the minimum or maximum time points (interval) for input files? e.g. will the tool work if I provide only two times points or If I provide 48 times points. Please include the information in the method section.7. What is the influence of the number of time points on the vector relationship presented in the paper? Have any studies by the authors addressed this question? Please include the results and discussion.8. Could the authors clarify which heuristic algorithm was employed for ranking the genes? Additionally, can they elaborate on how their approach to gene ranking is heuristic rather than relying on mathematical optimization or algorithmic methods? Clarification on the term "heuristic" would be beneficial.9. Could the authors offer an example from studies conducted on yeast, E. coli, or other simple organisms, demonstrating how changes in gene sequences have readily been observed to affect metabolite levels? Please include that in the results section.10. Does the tool generate a vector indicating many-to-many relationships or one-to-one relationships? In other words, does it reveal whether one gene is associated with many metabolites, and vice versa, or if it establishes a single genemetabolite relationship? Please include this in the results section. Also, in the discussion section please include examples of application of these relationships in various fields e.g. metabolic engineering or cancer metabolism.11. Table 1 compares the features of CAT Bridge with other available methods. It should encompass features provided by other tools that are not available in the author's tool, such as knowledge-driven integration or integration with a third-party database. Additionally, it should address the limitation posed by the requirement of time series data, which is not just a strength but also a challenge, particularly for epidemiology studies where multiple time series for gene expression may not be feasible.12. Please use alternative phrases to "Self-generated data," such as "experimentally obtained data," to clarify that the author is utilizing data acquired in the lab to validate the tool. (e.g. line 42, 223, and 492).

    2. AbstractBackground With advancements in sequencing and mass spectrometry technologies, multi-omics data can now be easily acquired for understanding complex biological systems. Nevertheless, substantial challenges remain in determining the association between gene-metabolite pairs due to the non-linear and multifactorial interactions within cellular networks. The complexity arises from the interplay of multiple genes and metabolites, often involving feedback loops and time-dependent regulatory mechanisms that are not easily captured by traditional analysis methods.Findings Here, we introduce Compounds And Transcripts Bridge (abbreviated as CAT Bridge, available at https://catbridge.work), a free user-friendly platform for longitudinal multi-omics analysis to efficiently identify transcripts associated with metabolites using time-series omics data. To evaluate the association of gene-metabolite pairs, CAT Bridge is a pioneering work benchmarking a set of statistical methods spanning causality estimation and correlation coefficient calculation for multi-omics analysis. Additionally, CAT Bridge features an artificial intelligence (AI) agent to assist users interpreting the association results.

      Reviewer 1: Tara Eicher Reviewer Comments: The authors introduce a useful tool (CAT Bridge) for integrating multiple causal and correlative analyses for multi-omics integration, which also includes a visualization and LLM component. The authors further provide two case studies (human and plant) illustrating the utility of CAT Bridge. I believe that this work should be published, as it contributes to the field of multi-omics analysis.However, I am very concerned about the lack of description regarding the LLM. As explained by Mittelstadt et al (https://www.nature.com/articles/s41562-023-01744-0), LLMs do not always provide factual answers. The authors need to justify the use of the LLM to determine the relevance of a gene-metabolite association. In particular, the authors should add to the main text (or at least the supplementary) a detailed description of the prompt construction and should justify why this prompt is expected to result in factual information. Furthermore, the authors should discuss the caveats of using LLMs in this context, starting with the linked article above. I believe that the manuscript will only be publishable once this concern is addressed.In addition, the authors are recommended to address the following more minor concerns:Implementation:1. Your "example file" links at https://catbridge.work are broken. Please fix this.Abstract:1. Line 32: "Nevertheless, substantial challenges remain in determining the association between gene-metabolite pairs due to the complexity of cellular networks." This is not a clear statement. What about the complexity of cellular networks presents challenges in determining the associations?2. Make sure you are using present tense consistently, not past tense (Line 39).3. Please use the scientific name with the common name in parentheses as follows: Capsicum chinense (chili pepper). Use only the scientific name throughout the rest of the document (Line 41).Background:1. Line 56: "Background" should not be plural.2. Lines 59-60: More comprehensive than what? Please elaborate here.3. In Line 60, please include and familiarize yourself with the following reference: Eicher, T., G. Kinnebrew, A. Patt, K. Spencer, K. Ying, Q. Ma, R. Machiraju and E. A. Mathé (2020). "Metabolomics and Multi-Omics Integration: A Survey of Computational Methods and Resources." Metabolites 10: 202.4. Lines 67-68: Citation needed.5. Line 72: Please use the scientific name with the common name in parentheses.6. Lines 74-77: Citations needed.7. Lines 77-78: Give an example of biologically naïve conclusions from purely data-driven strategies.8. Line 78: Discuss how the machine learning models could address the drawbacks of the correlation models and vice-versa.Materials and Methods:1. It seems that CAT Bridge needs to be run on one metabolite at a time. In this case, I would not use the term "gene-metabolite pair association" in Line 104, but rather "associations between genes and the target metabolite".2. Line 115: Clearly state which of these methods are non-linear and which address the lag issue.3. Line 136: Your figures are out of order (Figure 1B comes after Figure 2B).4. Please take a look at the Minimum Standards Reporting Checklist (https://academic.oup.com/gigascience/pages/Minimum_Standards_of_Reporting_Checklist). In particular:a. In the section starting at Line 153, list the number of seedlings used.b. Were all timepoints collected from all seedlings? List the total number of samples.c. How many mg were collected per sample (can use a range here)?d. 3 biological replicates per seedling? Give more detail here.e. What machine was used for the ultrasonic process? If frequency settings are permitted by the machine, list the settings used.f. How many of the 28 younger and 54 older adults had both transcriptome and metabolome data?5. Line 209: "Younger" and "older" are better terms.Results:1. Line 248: How does the AI agent analyze the functional annotations?2. Lines 281-282: "This illustrates the advantage of causal relationship modeling methods over traditional methods".3. Line 290: Please also include the updated IntLIM paper (IntLIM 2.0): Eicher, T., K. D. Spencer, J. K. Siddiqui, R. Machiraju and E. A. Mathe (2023). "IntLIM 2.0: identifying multi-omic relationships dependent on discrete or continuous phenotypic measurements." Bioinformatics Advances 3(1): vbad009.4. Make sure the colors are consistent in Table 1.5. Line 156: The scientific name of the pepper species is inconsistent with other areas of the text.Figures:1. S1 should be provided as a table, not a figure.2. Please make S2 larger. It is difficult to read.3. S3 needs labels (x axis, y axis, legend).

    1. (μχνσ δρξσ)

      score για ένα χωροδιακό πολυφωνικό (ένα ένα τα γραμματικά είδη) choir 1: διαβάζει τις ρίζες των λέξεων choir 2: διαβάζει τις καταλήξεις των λέξεων choir 3: διαβάζει τα ρήματα choir 4: διαβάζει τις λέξεις από ένα μέχρι 3 γράμματα διαβάζει τα άρθρα/αντωνυμίες κλπ

    1. Reviewer #3 (Public review):

      Summary:

      Chen et al. develop and characterize a new approach for screening drugs for epilepsy. The idea is to increase the ability to study seizures in animals with epilepsy because most animal models have rare seizures. Thus, the authors use the existing intrahippocampal kainic acid (IHKA) mouse model, which can have very unpredictable seizures with long periods of time between seizures. The authors employ an additional method to trigger seizures in the IHKA model. This method is closed-loop optogenetic stimulation of area CA1. There are several assumptions: area CA1 is the best location, triggered seizures are the same as spontaneous seizures, and this method will be useful despite requiring a great deal of effort. Regarding the latter, using a mouse model with numerous seizures (such as the pilocarpine model) might be more efficient than using a modified IHKA protocol that requires viral injection for optogenetics, fiber insertion requiring additional surgery, and accurate targeting to reliably trigger seizures on-demand. Aside from these caveats, the authors do succeed in studying seizures more readily in a mouse model of rare seizures. However, the seizures are evoked, not spontaneous. As currently presented, it is not clear how the triggered seizures can be used to investigate if antiseizure medication can reduce seizure burden as measured by seizure severity and seizures per day.

      The authors modified the IHKA model to inject KA into CA3 instead of CA1 in order to preserve the CA1 pyramidal cells that they will later stimulate. To express the excitatory opsin channelrhodopsin (ChR2) in area CA1, they use a virus that expresses ChR2 in cells that express the Thy-1 promoter. The authors demonstrate that CA3 delivery of KA can induce a very similar chronic epilepsy phenotype to the injection of KA in CA1 and show that optical excitation of CA1 can reliably induce seizures. These are the strengths of the study.

      While the authors show that electrophysiological signatures of induced vs spontaneous seizures are similar in many ways, the authors also show several differences and it is not clear if these differences are meaningful. Notably, the induced seizures are robustly inhibited by the antiseizure medication levetiracetam and variably but significantly inhibited by diazepam, similar to many mouse models with chronic recurrent seizure activity. I agree with the authors that this modified IHKA model will be of most value for higher throughput screening of potential antiseizure therapies, but with the caveat that the data may not generalize to other epilepsy models or humans. The authors evaluate the impact of repeated stimulation on the reliability of seizure induction and show that seizures can be reliably induced by CA1 stimulation for as long as 16 days, but the utility of the model would be better demonstrated if seizures could be shown to be inducible over the range of weeks to months.

      Strengths:

      (1) The authors show that the IHKA model of chronic epilepsy can be modified to preserve CA1 pyramidal cells (but at a cost of CA3 cells), allowing on-demand optogenetic stimulation of CA1 that appears to lower seizure threshold and thus trigger a seizure event.

      (2) The authors show that repeated reactivation of CA1 even in untreated mice can promote kindling and induction of seizure activity, indeed generating two mouse models in total.

      (3) Many electrophysiological signatures are similar between the induced and spontaneous seizures, and induced seizures reliably respond to treatment with antiseizure medications.

      (4) Given that more seizures can be observed per mouse using on-demand optogenetics, this model enhances the utility of each individual mouse.

      Weaknesses:

      (1) Evaluation of seizure similarity using the SVM modeling and clustering is not sufficiently explained to show if there are meaningful differences between induced and spontaneous seizures. SVM modeling did not include analysis to assess the overfitting of each classifier since mice were modeled individually for classification.

      (2) The difference between seizures and epileptiform discharges or trains of spikes (which are not seizures) is not made clear.

      (3) The utility of increasing the number of seizures for enhancing statistical power is limited unless the sample size under evaluation is the number of seizures. However, the standard practice is for the sample size to be the number of mice.

      (4) Seizure burden is not easily tested.

      (5) It is unlikely that long-term adaptation to CA1-stimulated seizure induction is absent in these mice. A duration of evaluation longer than 16 days is warranted in light of the downward slope at days 13-16 for induced seizures in Figure 4C.

      (6) Human epilepsy is extensively heterogeneous in both etiology and individual phenotype, and it may be hard to generalize the approach.

      (7) No mention or assessment of mouse sex as a biological variable.

    2. Author response:

      In this initial response to the public review, we outline our plan to address the major concerns raised. Below, we provide a general categorization of the suggestions and our corresponding responses

      Weakness #1: Statistical Concerns - using the number of seizures (rather than the number of animals) may identify small effects that could be insignificant. Effect size should be taken into consideration.

      Reviewer 1:

      “While the data generally supports the authors' conclusions, a weakness of this manuscript lies in their analytical approach where EEG feature-space comparisons used the number of spontaneous or evoked seizures as their replicates as opposed to the number of IHK mice; these large data sets tend to identify relatively small effects of uncertain biological significance as being highly statistically significant.”

      Reviewer 2:

      “In several sections of the paper, the authors argue that two different groups are similar on the basis that no statistical difference was found between the two groups (i.e., p > 0.05); however, the failure to find a statistically significant difference, particularly with relatively small sample sizes, is not rigorous evidence that the two groups are actually similar - they are just "not significantly different.”

      Reviewer 3:

      “(3) The utility of increasing the number of seizures for enhancing statistical power is limited unless the sample size under evaluation is the number of seizures. However, the standard practice is for the sample size to be the number of mice.”

      Reviewer 3:

      “(1) Evaluation of seizure similarity using the SVM modeling and clustering is not sufficiently explained to show if there are meaningful differences between induced and spontaneous seizures. SVM modeling did not include analysis to assess the overfitting of each classifier since mice were modeled individually for classification.”

      We understand the reviewers’ concerns. In this work, we used linear mixed effect model to address two levels of variability –between animals and within animals. The interactive linear mixed effect model shows that most (~90%) of the variability in our data comes from within animals (Residual), the random effect that the model accounts for, rather than between animals. Since variability between animals are low, the model identifies common changes in seizure propagation across animals, while accounting for the variability in seizures within each animal. Therefore, the results we find are of changes that happen across animals, not of individual seizures. We will make text edits to enhance understanding of the linear mixed effect model.

      To address the point raised about similarity, we will explain how the SVM classifier was trained. The purpose of the SVM is not to identify meaningful differences between induced and spontaneous seizures. Rather, it is to classify EEG sections as “seizures” or non-seizures, demonstrating the gross similarity between induced and spontaneous seizures despite minor differences. We will make text clarifications for the SVM model.

      Weakness #2: Clinical and biological significance is unclear.

      Reviewer 1:

      “Furthermore, the clinical relevance of similarly small differences in EEG feature space measurements between seizure-naïve and epileptic mice is also uncertain.”

      Reviewer 2:

      “While the paper may be relevant for the ETSP and contract research organizations (CROs), the paper was not written to attract the interest of biological scientists, even those in this specific area of epilepsy research. It may be of low interest to other neuroscientists… The key issue the authors aim to address is the 30-40% of patients with DRE, but the real problem with DRE patients is not that these people have seizures with no effect of the ASDs; rather, although ASD may reduce seizure burden, these patients continue to have some remaining seizures even after high doses of ASDs, which often leads to adverse effects from the particular ASDs… It remains unclear that the optogenetically induced seizures in this model are better than similarly induced seizures in a naïve animal, and there is no evidence that the model will be useful for finding new ASDs to treat DRE.”

      Reviewer 3:

      “(6) Human epilepsy is extensively heterogeneous in both etiology and individual phenotype, and it may be hard to generalize the approach.”

      Reviewer 2:

      “The authors state that this approach should be used to test for and discover new ASDs for DRE, and also used for various open/closed loop protocols with deep-brain stimulation; however, the paper does not actually discuss rigorously or critically the background literature on other published studies in these areas or how this approach will improve future research for a broader audience than the ETSP and CROs. Thus, it is not clear whether the utility will apply more widely and how extensive a readership will be attracted to this work.”

      We appreciate the reviewer’s concerns. We will revise the manuscript to better emphasize the potential significance of our approach. The on-demand seizure model can be applied to address biologically and clinically relevant questions beyond its utility in drug screening. For example, crossing the Thy1-ChR2 mouse line with genetic epilepsy models, such as Scn1a mutants, could reveal how optogenetic stimulation differentially induces seizures in mutant versus non-mutant mice, providing insights into seizure generation and propagation in Dravet Syndrome. Due to the cellular specificity of optogenetics, we also envision this approach being used to study circuit-specific mechanisms of seizure generation and propagation. Regarding drug-resistant epilepsy (DRE) and anti-seizure drug (ASD) screening, we agree with the reviewer that probing new classes of ASDs for DRE represents the critical goal. However, we believe a full exploration of additional ASD classes and/or modeling DRE lies outside the scope of this manuscript.

      Weakness #3: Definition of Seizure is unclear

      Reviewer 2:

      “Although the figures provide excellent examples of individual electrographic seizures and compare induced seizures in epileptic and naïve animals, it is unclear which criteria were used to identify an actual seizure induced by the optogenetic stimulus, versus a hippocampal paroxysmal discharge (HPD), an "afterdischarge", an "electrophysiological epileptiform event" (EEE, Ref #36, D'Ambrosio et al., 2010 Epilepsy Currents), or a so-called "spike-wave-discharge" (SWD). Were HPDs or these other non-seizure events ever induced using stimulation in animals with IH-KA? A critical issue is that these other electrical events are not actual seizures, and it is unclear whether they were included in the column showing data on "electrographic afterdischarges" in Figure 5 for the studies on ASDs”

      Reviewer 3:

      “(2) The difference between seizures and epileptiform discharges or trains of spikes (which are not seizures) is not made clear.”

      Reviewer 2:

      “The differences between the optogenetically evoked seizures in IH-KA vs naïve mice are interpreted to be due to the "epileptogenesis" that had occurred, but the lesion from the KA-induced injury would be expected to cause differences in the electrically and behaviorally recorded seizures - even if epileptogenesis had not occurred. This is not adequately addressed.”

      Thank you for pointing out the unclear definition of the seizures analyzed. We agree and will revise the text to clarify this issue. In this manuscript, we focused on tonic-clonic seizures. We analyzed animal behavior during evoked events, and a high percentage of induced electrographic events were accompanied by behavioral seizures with a Racine scale of three or above. Regarding epileptogenesis, our model is based on the IHK model, in which spontaneous tonic-clonic seizures occur a few to several days after KA injection. These mice are, by definition, epileptogenic. We will further clarify this methodology in the text.

      Weakness #4: Similarity/Difference with Kindling Not Clear

      Reviewer 2:

      “The authors did not test whether an apparent "kindling" effect, apparently seen in naïve controls, also occurred in animals micro-injected with kainic acid (KA). This effect could cause model instability that might result in variability in response to ASDs. It is not clear whether the number of optogenetically induced seizures in epileptic animals would affect the response to drugs. It is also unclear how much of an improvement the animal model in the present work is over other similar models of TLE, where electrically triggered seizures could simply be applied to one of them.”

      Reviewer 3:

      “(5) It is unlikely that long-term adaptation to CA1-stimulated seizure induction is absent in these mice. A duration of evaluation longer than 16 days is warranted in light of the downward slope at days 13-16 for induced seizures in Figure 4C.”

      We appreciate the reviewer’s comments regarding the “kindling effect” as well as its similarity to the kindling model. We will carefully assess the data and address this in the revised manuscript. In electrical kindling, the activated cellular population is non-specific, including both excitatory and inhibitory neurons. In our model, we specifically activate predominantly excitatory neurons (Thy1-positive neurons), which we observed to participate in convulsant-induced seizures (as demonstrated in Thy1-GCaMP experiments). We consider this specificity an improvement over the kindling model, making our approach more biologically relevant.

      Weakness #5: Time needed to generate model is significant. Unclear if animals were pre-selected

      Reviewer 1:

      “Finally, the multiple surgeries and long timetable to generate these mice may limit the value compared to existing models in drug-testing paradigms.

      Reviewer 2:

      “The authors offer little mention of other research using animal models of TLE to screen ASDs, of which there are many published studies - many of them with other strengths and/or weaknesses. For example, although Grabenstatter and Dudek (2019, Epilepsia) used a version of the systemic KA model to obtain dose-response data on the effects of carbamazepine on spontaneous seizures, that work required use of KA-treated rats selected to have very high rates of spontaneous seizures, which requires careful and tedious selection of animals. The ETSP has published studies with an intra-amygdala kainic acid (IA-KA) model (West et al., 2022, Exp Neurol), where the authors claim that they can use spontaneous seizures to identify ASDs for DRE; however, their lack of a drug effect of carbamazepine may have been a false negative secondary to low seizure rates. The approach described in this paper may help with confounds caused by low or variable seizure rates. These types of issues should be discussed, along with others.”

      We appreciate the reviewer’s insights. In an existing model investigating spontaneous tonic-clonic seizures (such as the intra-amygdala kainate injection model), the time investment is back-loaded, requiring two to three weeks per condition while counting spontaneous seizures, which may occur only once a day. In contrast, our model requires a front-loaded time investment. Once the animals are set up, we can test multiple drugs within a few weeks, providing significant time savings. Additionally, we did not pre-screen animals in our study. Existing models often pre-select mice with high rates of spontaneous seizures, whereas in our model, seizures can be induced even in animals with few spontaneous seizures. We believe that bypassing the need for pre-screening is a key advantage of our induced seizure model.

      Reviewer 3:

      “(7) No mention or assessment of mouse sex as a biological variable.”

      Thank you for pointing this out. Both female and male animals were included in this study: Epileptic cohort: 7 males, 3 females; Naïve cohort: 3 males, 4 females

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      Summary:

      Wilson's Disease (WD) is an inherited rare pathological condition due to a mutation in ATP7B that alters mitochondrial structure and dysfunction. Additionally, WD results in dysregulated copper metabolism in patients. These metabolic abnormalities affect the functions of the liver and can result in cholecystitis. Understanding the immune component and its contribution to WD and cholecystitis has been challenging. In this work, the authors have performed single-cell RNA sequencing of mesenchymal tissue from three WD patients and three liver hemangioma patients.

      Strengths:

      The authors describe the transcriptomic alterations in myeloid and lymphoid compartments.

      Weaknesses:

      In brief, this manuscript lacks a clear focus, and the writing needs vast improvement. Figures lack details (or are misrepresented), the results section only catalogs observations, and the discussion needs to focus on their findings' mechanistic and functional relevance. The major weakness of this manuscript is that the authors do not provide a mechanistic link between the absence of ATP7B and NK cells' impaired/altered functions. While the work is of high clinical relevance, there are various areas that could be improved.

      In this study, we reported for the first time that ATP7B mutation and the resulting metabolic abnormalities in hepatocytes cause functional alteration of immune cells in WD patients. We dissected the transcriptional profiles of liver mesenchymal cells and delineated the functional differences of main immune cells in WD patients through scRNA-seq. The NK cell exhaustion and its clinical significance were further demonstrated.

      The mechanism study is of our concern. Given that the ATP7B mutation is hepatocyte-specific, its effect on immune cells is most probably through intercellular communication rather than through the direct action of ATP7B protein. How ATP7B mutation disturbs the metabolic homeostasis in hepatocyte, how metabolic pathways regulate the release of signal substances, and how signal substances act on the NK cells need to be explained. These contents, together with this manuscript, are beyond the scope of a single article, so we put the novelty in this manuscript.

      We sincerely appreciate the comments. We have improved the manuscript based on your valuable suggestions. The mechanism study is our subsequent research topic. We are actively promoting it and have found that ATP7B mutation rewires a certain metabolism pathway in hepatocyte, and that a critical metabolite functions as the mediator causing NK cell exhaustion.

      Reviewer #2 (Public Review):

      Summary:

      Wilson's disease is a rare genetic disorder caused by mutations in the ATP7B gene. Previous studies have documented that ATP7B mutations can disrupt copper metabolism, affecting brain and liver function. In this paper, the authors performed a retrospective clinical study and found that Wilson's disease has a high incidence of cholecystitis. Single-cell RNA-seq analysis revealed changes in the immune microenvironment, including the activation of immune responses and the exhaustion of natural killer cells.

      Strengths:

      A key finding of this study is that the predominant ATP7B gene mutation in the Chinese population is the 2333G>T (p. R778L) mutation. The authors reported associations between Wilson's disease and cholecystitis, as well as the exhaustion of natural killer cells.

      Weaknesses:

      The underlying mechanisms linking ATP7B mutations to cholecystitis and natural killer cell exhaustion remain unclear. Specifically, it is not yet determined whether copper metabolism alterations directly cause cholecystitis and natural killer cell exhaustion, or if these effects are secondary to liver dysfunction.

      In this study, we reported for the first time that ATP7B mutation and the resulting metabolic abnormalities in hepatocytes cause functional alteration of immune cells in WD patients. We dissected the transcriptional profiles of liver mesenchymal cells and delineated the functional differences of main immune cells in WD patients through scRNA-seq, focusing on the NK cell exhaustion and its clinical significance.

      The mechanism study is of our concern. Given that the ATP7B mutation is hepatocyte-specific, its effect on immune cells is most probably through intercellular communication, so we prioritize the studying of this aspect. How ATP7B mutation disturbs the metabolic homeostasis in hepatocyte, how metabolic pathways regulate the release of signal substances, and how signal substances act on the NK cells need to be explained. These contents, together with this manuscript, are beyond the scope of a single article, so we put the novelty in this manuscript.

      We sincerely appreciate the comments. The mechanism study is the topic of our follow-up study. We are actively promoting the research and we have found that ATP7B mutation rewires a certain metabolism pathway in hepatocyte, and that a critical metabolite functions as the mediator causing NK cell exhaustion.

      Reviewer #1 (Recommendations For The Authors):

      Major:

      (1) Abstract. A major portion of this manuscript focuses on non-NK cells. Data that describes NK cell exhaustion is only minimal. Therefore, the authors should modify the abstract.

      Thank you for your valuable suggestion. We have supplemented the description of functional changes in other immune cells, and have modified the abstract (line 31-35).

      (2) Introduction. There are three paragraphs. The first paragraph discusses cholecystitis. However, there are too many repetitions, and the information is unclear. In the second part, the authors discuss NK cells and their exhaustion. The authors do not establish a clear rationale or logic linking NK cells to WD or cholecystitis. In the last paragraph, the authors describe their findings. Their correlation between NK cell exhaustion and the poor healing process of cholecystitis has no direct experimental proof.

      Thank you for your comments. We have deleted the repetitions and rephrased some sentences (line 72-74). Briefly, in the first paragraph, we proposed the significant prognostic value of immune cell dysfunction for cholecystitis. In the second paragraph, we introduced NK cell exhaustion and its potential to predict prognosis of certain diseases. In the third paragraph, we introduced that the liver is a central organ involved in metabolism and immunity, holding a large number of NK cells. Liver pathologies commonly impact the development and outcome of inflammation-associated diseases such as cholecystitis. WD was selected as a research model. In the last paragraph, we introduced our findings from clinical study, scRNA-seq, clinical samples, and bioinformatics analysis, and concluded at the end.

      (3) Results. Overall, the results section lacks clarity and a clear focus. Figure legends need to be significantly detailed. The authors make too many broad statements without any support. The authors also make too many overstatements.

      Thank you for your valuable suggestion. We have improved the inaccurate statements and made detailed refinement of figure legends. All the changes are marked in the manuscript, and related responses are described below.

      Figure 1: No information is provided about the functional impairment of ATP7B protein due to the mutation found in the cohort of Chinese patients. What does 'immune abnormalities' (line 127) mean? What is the relevance of showing liver fibrosis and copper accumulation in the eye in Figure 1c and d, respectively? Total cholesterol concentrations are still within the range in the plasma of WD patients, but the authors call it higher. ECAR has not changed in WD patients, but the authors claim it has (line 117).

      (1) All these gene mutations in WD disable the protein function and cause the same outcome. (2) We have deleted the inappropriate statement. (3) In clinical observation, we found that WD not only causes copper accumulation in hepatocytes, but also leads to a variety of diseases, including liver fibrosis, Kayser-Fleischer Ring, and lower risk of hyperglycemia. We showed these together with the data of cholecystitis incidence. We think these might suggest the significance of intercellular communication between hepatocytes and other cells in microenvironment. (4) We have deleted the inappropriate statement (line 108-110, 112-113).

      Figure 2: Did the authors use the liver mesenchymal tissue or mesenchymal cells? Figure 2 states that they used mesenchymal cells, different from liver mesenchymal tissue. Numbers within Figure 2b UMAP are not visible. Were the initial T and NK cells annotated as indicated in Figure S2 (CD3D, CD#E, CD3G)? If so, that does not include NK cells.

      (1) The liver mesenchymal cells were used for scRNA-seq. (2) It is possible that the image resolution was reduced due to the compression of files by the submission system during merging process. We confirm that the image resolution of all figures meets publishing requirements, and that all characters on the figures are visible. You can download figure files to view details. (3) It was our negligence that the incomplete cell markers were shown in Figure S2. We have updated the markers (CD3D, CD3E, NKG7), references (Ref #53, #55, and #56), and related figures (Figure 2e, and Figure S2c).

      Figure 3: The authors should change 'Case' to 'WD patients' both in the text and figures. DEGs in Figure 3C indicate a transcriptomic alteration in the B cell compartment, which the authors do not delineate. Also, the rationale and explanation for the CellChat analyses are minimal. Concluding that a change occurred within the TME with minimal data and explanations is unfair.

      Thank you for your comments. (1) We apologize for the confusion caused by the use of nomenclatures and abbreviations in the text and figures. In all scRNA-seq data analysis, presentation, and description, we used specific terms (CASE and CON) to refer to the group of WD patients and controls, as well as their cell population. We have now unified the use of nomenclature in full text and defined them when first appeared (line 126-127), avoiding using lowercase form to prevent confusion. (2) We have now compared the expression of key genes of B cell between the two group in the next section “The dysfunction of main immune cells in WD patients” (line 230-235, Figure 4e, Figure S4e). (3) We have described the results of cellular communication in more detail (line 188-194). (4) We have modified the conclusion and all the related statement in full text (line 29-31, 82-84, 149, 194-195).

      Figure 4: This section deals with multiple cell types with minimal explanations. This section discusses various cell types, but it lacks focus. In particular, the T cell section should be separated and elaborated more in detail.

      (1) In this section, we intended to show the comparison in function of main immune cells that account for a considerable proportion, instead of just showing differently expressed genes that provide minimal information. The evaluation of functional signature, based on the integration of multiple gene expression, allows a direct understanding of the final outcome owing to transcriptional changes. (2) Given that the main functions of T cells did not change significantly and there were more significant changes in innate immunity, the T cell section is relatively short and unsuitable as a separated part.

      Figure 5: What are the distinct subsets of NK cells authors have found in the WD patients and controls? How do these subsets differ between the two groups in numbers and their transcriptomes? The presentation and labeling of Figure 5 and Supplementary Figure 5 need to be vastly improved. The pseudotime presentation in Figure 5b should be presented separately for the patients and the controls. Are the changes in gene expression presented in Figure 5a due to the change in the subset compositions? Figure 5c immuno-staining is not at all visible. A clear explanation should be given for the differences between Figure 5c and Figure 5e, where NKG2A expressions are shown. A better explanation for Figure 5d is required. Did the authors use all the antibodies with the same fluorochrome? If so, what color is that? Can the authors include the individual samples in the bar diagram in Figure 5e? Again, the data in Figure 5 is insufficient to conclude that NK cells are exhausted in WD patients. While the role of changes in the expression of T-BET and EOMES can be related to dysfunction and cellular exhaustion of NK cells, the statement made by the authors needs to be toned down as they do not test with independent experiments.

      (1) The subsets of NK cell were clustered by gene expression profile and labeled by the characteristically expressed gene, using certain algorithm in the routine procedure. They cannot be distinguished in clinical samples by one or several genes or other sorting methods. Thus, we were not able to analyze these subsets in clinical samples. (2) We have supplemented the comparison of numbers and transcriptomes of three NK subtypes between the two groups (line 268-273). (3) We have checked the figures and confirmed that all characters on the figures are visible. (4) We have separately presented the plot in Figure S5d. (5) We compared the expression level of genes presented in Figure 5a between the two groups in three NK subtypes and supplemented this part (line 264-268). The results were very consistent across the three subtypes, suggesting that the results in total NK population were contributed by all three subtypes and not affected by a single composition. (6) KLRC1 is also known as NKG2A. We are sorry for not making a clear explanation, and now we use KLRC1 only in all text to avoid confusion. We have made a more clear and detailed description for Figure 5c, 5d, and 5e (now labeled as Figure 5b, 5c, and 5d), and have included the fluorochrome in Figure 5d (now labeled as Figure 5c) and the individual value in Figure 5e (now labeled as Figure 5d) (line 293-299). (7) In this section, we found the upregulated expression of inhibitory receptors, downregulated expression of effector molecules, and the impaired NK cell-mediated cytotoxicity in NK cell of WD patients from scRNA-seq. Then we validated the findings in clinical liver section samples and clinical blood samples by mIHC and flow cytometry, respectively. According to the recent articles, exhausted NK cells are characterized by decreased production of effector cytokines (e.g., IFNγ), as well as by impaired cytolytic activity, and downregulate expression of certain activating receptors and upregulate expression of inhibitory receptors (e.g., 10.3389/fimmu.2017.00760, 10.1038/s41590-018-0132-0, 10.1038/s41467-019-09212-y, 10.1080/2162402X.2016.1264562). Therefore, we concluded NK cell exhaustion in WD patients. (8) In the part about transcription factors, we kept the description of objective data and deleted the statement of the contribution of transcription factors to NK exhaustion.

      Figure 6: Data presented in Figure 6 and the conclusion made in this manuscript are predictive. There is no direct testing of ATP7B in NK cells to show the functions of this gene. Extension of this to patient survival is purely speculative. As long as authors state these facts clearly in their text, it can be acceptable. However, they do not extend their conclusions to similar liver diseases.

      ATP7B mutation is hepatocyte-specific, and it does not occur in any immune cells. The function of ATP7B in NK cell was not studied. We found the NK exhaustion and poor prognosis of cholecystitis in WD patients. Given that there were researches demonstrating that NK exhaustion is correlated with poor liver cancer prognosis, we hypothesized that NK exhaustion contributes to the poor prognosis of cholecystitis. Bioinformatics studies confirmed our hypothesis and supported the extension of this result to other inflammatory diseases. We had no experimental data, but this result was reliable in bioinformatics method.

      (4) Discussion: While the authors analyzed multiple cell types, the discussion is primarily focused on NK cells. There is no clear link between copper utilization, NK cell function, and exhaustion that the authors articulate.

      Thank you for your comments. The focus of our study is NK cell exhaustion, which is experimentally proven, so we discussed this aspect. We prioritize the effect of intercellular communication and metabolic alteration on the NK cell exhaustion in our follow-up study. Excess copper is released into the circulation in some circumstances in WD patients, but generally they receive long-term de-coppering therapy to maintain intracellular copper at a non-lethal level. Thus, we do not tend to consider copper as a critical factor in this study. In original manuscript, we mentioned the cuproptosis and its potential as a novel target. It is likely to lead to ambiguity and misunderstanding, so we deleted this part to put our point of view clearly.

      (5) Supplementary Figures: The presentation and labeling of these figures need to be changed.

      Thank you for your suggestions. We have modified the figures and confirmed that all characters on the figures are visible.

      Reviewer #2 (Recommendations For The Authors):

      It is better to test whether ATP7B mutation can directly affect immune functions.

      Thank you for your suggestions. Given that the ATP7B mutation is hepatocyte-specific, its effect on immune cells is most probably through intercellular communication. Thus, we prioritize the effect of intercellular communication on the NK cell exhaustion and we are actively promoting the research.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public reviews:

      Reviewer 1

      We would like to express our gratitude to Reviewer 1 for providing a thorough summary of our work and highlighting its strengths. With regards to the weaknesses, we are committed to improve the manuscript by performing the necessary changes. First, we will specify the exact p-value in all cases.

      Regarding the discussion section, we acknowledge the feedback regarding its potential confusion. In line with the reviewer's suggestion, we will reduce the literature review and highlight our findings.

      Finally, for the preprint we did not include cofounders such as HIV infection and ethnicity as our study population did not exhibit viral infections and comprised only Hispanic individuals. We will make a more thorough description of the population of study and address these characteristics explicitly in both the methods section and the initial part of the results.

      Reviewer 2

      We appreciate and thank reviewer 2 for the commentaries. Although it is true that several papers have described the role of microbiome in COVID-19 severity, we firmly believe that our current work stands out. There is not much information related to this association in Mediterranean countries, especially in the south of Spain. In addition, most of the studies only describe microbiota composition in stool or nasopharyngeal samples separately, without investigating any potential relationships between them as we do.

      (1) We agree with the reviewer idea of a limited sample size. We faced the challenge of collecting the samples during the peak of COVID-19 pandemia. Thus, doctors and nurses were overwhelmed and not always available for carrying out patient recruitment following the inclusion criteria. Despite these constraints, we ensured that all included samples met our specified inclusion criteria and were from subjects with confirmed symptomatology.

      In addition, our main goal was to identify whether severity of the disease could be assessed through microbiota composition. Therefore we did not include a healthy group. Despite not having a large N, our results should be reproducible as they are supported by statistical analysis.

      (2) We thank reviewer commentary, and since our original sentence may have lacked clarity, we intend to modify it to ensure it conveys the intended meaning more effectively.

      Nonetheless, we remain confident in the significance of our findings. Not only have we found correlation between microbiota and COVID severity, but we have also described how specific bacteria from each condition is associated with key biochemical parameters of clinical COVID infection.

      (3) We appreciate the feedback provided by the reviewer. In this case, we have performed 16S analysis due to its cost-effectiveness compared to metagenomic approaches. Furthermore, 16S analysis has undergone refinements that ensure comprehensive coverage and depth, along with standardized analysis protocols. Unlike 16S, metagenomic approaches lack software tools such as QIIME that facilitate standardization of analysis and, thus, reduce reproducibility of results.

      (4) We sincerely appreciate this insightful suggestion. simply listing associations between both microbiomes and COVID-19 severity could not be enough, we intend to discuss how microbiota composition may be linked to the mechanisms underlying COVID-19 pathogenesis in our discussion.

      (5) We are grateful for the constructive criticism and intend to rewrite our abstract to enhance clarity. Additionally, we will thoroughly review all figures and their descriptions to ensure accuracy and comprehensibility.

      Reviewer 3

      We acknowledge the annotations made by reviewer 3 and are committed to addressing all identified weaknesses to enhance the quality of our work. Our idea is to modify the methods section and figures to make them easier to understand.

      Specifically, in the case of Figure 1, we recognize an error in the description of the Bray-Curtis test. We appreciate the commentary and we will make the necessary changes. Moreover, there is another observation related to Figure 1 description. We are going to modify it in order to gain accuracy.

      For figure 2 we are planning to add a supplementary table showing the abundance of detected genus. Nevermind, we will also update the manuscript text to provide clarification on how we obtained this result.

      Regarding the clarification about "1% abundance," we want to emphasize that we are referring to relative abundance, where 1 represents 100%. To avoid confusion, we will explicitly state this in both the methods section and figure descriptions. Besides, it is true that the statistical test employed for the analysis is not mentioned in the figure description and we recognize that the image may be difficult to interpret. Therefore, we will modify the text and a supplementary table displaying the abundance and p values is going to be added.

      Furthermore, we agree with the reviewer's suggestion to investigate whether the bacteria identified as potential biomarkers for each condition are specific to their respective severity index or if there is a threshold. Thus, we will reanalyze the data and include a supplementary table with the abundance of each biomarker for each condition. We will also place greater emphasis on these results in our discussion.

      Finally, in response to the reviewer's suggestion, we are going to go through the nasopharyngeal-fecal axis part in the discussion. It is well described that COVID-19 induces a dysbiosis in both microbiomes. Consequently, we understand that the ratio we have described could be an interesting tool for assessing COVID severity development as it considers alterations in both environments. However, we acknowledge that there may be room for improvement in clarifying the significance of this intriguing finding and its implications.

  2. www.planalto.gov.br www.planalto.gov.br
    1. Da Gratuidade da Justiça

      O enquadramento na faixa de isenção de imposto de renda NÃO DEVE SER UTILIZADO COMO CRITÉRIO para o deferimento do benefício da assistência judiciária gratuita.

      AgInt no AREsp 2.441.809-RS , Rel. Ministro Herman Benjamin, Segunda Turma, por unanimidade, julgado em 8/4/2024, DJe 2/5/2024.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      This manuscript from Schwintek and coworkers describes a system in which gas flow across a small channel (10^-4-10^-3 m scale) enables the accumulation of reactants and convective flow. The authors go on to show that this can be used to perform PCR as a model of prebiotic replication.

      Strengths:

      The manuscript nicely extends the authors' prior work in thermophoresis and convection to gas flows. The demonstration of nucleic acid replication is an exciting one, and an enzyme-catalyzed proof-of-concept is a great first step towards a novel geochemical scenario for prebiotic replication reactions and other prebiotic chemistry.

      The manuscript nicely combines theory and experiment, which generally agree well with one another, and it convincingly shows that accumulation can be achieved with gas flows and that it can also be utilized in the same system for what one hopes is a precursor to a model prebiotic reaction. This continues efforts from Braun and Mast over the last 10-15 years extending a phenomenon that was appreciated by physicists and perhaps underappreciated in prebiotic chemistry to increasingly chemically relevant systems and, here, a pilot experiment with a simple biochemical system as a prebiotic model.

      I think this is exciting work and will be of broad interest to the prebiotic chemistry community.

      Weaknesses:

      The manuscript states: "The micro scale gas-water evaporation interface consisted of a 1.5 mm wide and 250 µm thick channel that carried an upward pure water flow of 4 nl/s ≈ 10 µm/s perpendicular to an air flow of about 250 ml/min ≈ 10 m/s." This was a bit confusing on first read because Figure 2 appears to show a larger channel - based on the scale bar, it appears to be about 2 mm across on the short axis and 5 mm across on the long axis. From reading the methods, one understands the thickness is associated with the Teflon, but the 1.5 mm dimension is still a bit confusing (and what is the dimension in the long axis?) It is a little hard to tell which portion (perhaps all?) of the image is the channel. This is because discontinuities are present on the left and right sides of the experimental panels (consistent with the image showing material beyond the channel), but not the simulated panels. Based on the authors' description of the apparatus (sapphire/CNC machined Teflon/sapphire) it sounds like the geometry is well-known to them. Clarifying what is going on here (and perhaps supplying the source images for the machined Teflon) would be helpful.

      We understand. We will update the figures to better show dimensions of the experimental chamber. We will also add a more complete Figure in the supplementary information. Part of the complexity of the chamber however stems from the fact that the same chamber design has also been used to create defined temperature gradients which are not necessary and thus the chamber is much more complex than necessary.

      We added the scheme of the whole PTFE Chip to Figure 2 in the top left corner, indicating the ROI shown in the fluorescence micrographs. Additionally, the channel walls are now clearly indicated by white dotted lines. The dimensions of the setup are now shown clearer, by showing the total width of the channel as well as its height until the gas flux channel, as well as its depth. Changed caption of the figure accordingly and it now reads: “[…] The PTFE chip cutout in the top left corner shows the ROI used for the micrographs. The color scale is equal for both simulation and experiment and Channel dimensions are 4 x 1.5 x 0.25 mm as indicated. Dotted lines visualize the location of the channel walls. […]“

      The data shown in Figure 2d nicely shows nonrandom residuals (for experimental values vs. simulated) that are most pronounced at t~12 m and t~40-60m. It seems like this is (1) because some symmetry-breaking occurs that isn't accounted for by the model, and perhaps (2) because of the fact that these data are n=1. I think discussing what's going on with (1) would greatly improve the paper, and performing additional replicates to address (2) would be very informative and enhance the paper. Perhaps the negative and positive residuals would change sign in some, but not all, additional replicates?

      To address this, we will show two more replicates of the experiment and include them in Figure 2.

      We are seeing two effects when we compare fluorescence measurements of the experiments.

      Firstly, degassing of water causes the formation of air-bubbles, which are then transported upwards to the interface, disrupting fluorescence measurements. This, however, mostly occurs in experiments with elevated temperatures for PCR reactions, such as displayed in Figure 4.

      Secondly, due to the high surface tension of water, the interface is quite flexible. As the inflow and evaporation work to balance each other, the shape of the interface adjusts, leading to alterations in the circular flow fields below.

      Thus the conditions, while overall being in steady state, show some fluctuations. The strong dependence on interface shape is also seen in the simulation. However, modeling a dynamic interface shape is not so easy to accomplish, so we had to stick to one geometry setting. Again here, the added movies of two more experiments should clarify this issue.

      We performed three more replicates of the experiment and included the averaged data points together with their respective standard deviation as error bars in Figure 2d. Additionally, the videos of each individual repeat are now added to the supplementary files for the reader to better understand where the strong fluctuations around half an hour come from. The Figure caption was adjusted to “ […] The maximum relative concentration of DNA increased within an hour to ~30 X the initial concentration, with the trend following the simulation. Error bars are the standard deviation from four independent measurements. […].

      The main text was also changed to better explain how the fluctuations impact the measurements: […] Water continuously evaporated at the interface, but nucleic acids remained in the aqueous phase accumulating near the interface. They could only escape downward either by diffusion or by the vortex induced by the gas flowing across the interface, pushing the molecules back deeper into the bulk (See the flow lines in Fig2(b) taken from the simulation).  As the gas flow continuously removed excess vapor, the evaporation rate remained constant. Thus, except for fluctuations, a stable interface shape should be expected. However, due to the high surface tension of water, the interface is very flexible. As the inflow and evaporation work to balance each other, the shape of the interface adjusts, likely in response to small fluctuations in gas pressure and spatial variations in water surface tension. This is leading to alterations in the circular flow fields below (Supplementary Movie 2).

      As these fluctuations are difficult to simulate, we decided to stick with one interface shape, matching evaporation and inflow speeds. The evaporation rate at the interface was therefore set to be proportional to the vapor concentration gradient and varied spatially along the interface between 5 and 10.5 µm/s (See Suppl. Fig. VI.1(d)). Using the known diffusion coefficient of 95 µm²/s for the 63mer[9]}, the simulation closely matched the experimental results. In both cases, DNA accumulated in regions with circular flow patterns driven by the gas flux (Fig.2(b), right panel).

      5 minutes after starting the experiment, the maximum DNA accumulation was 3-fold, while after one hour of evaporation, around 30-fold accumulation was observed. Due to molecules residing in very shallow volumes when directly at the interface, the fluorescence signal can vary drastically compared to measurements deeper in the bulk. This can be seen in the fluctuations between independent measurements (See Supplementary Movies 2b,2b,2c), especially around 0.5~h shown in Figure 2(d). The simulated maximum accumulation followed the experimental results and starts saturating after about one hour (Fig.2(d)). […]”

      The authors will most likely be familiar with the work of Victor Ugaz and colleagues, in which they demonstrated Rayleigh-Bénard-driven PCR in convection cells (10.1126/science.298.5594.793, 10.1002/anie.200700306). Not including some discussion of this work is an unfortunate oversight, and addressing it would significantly improve the manuscript and provide some valuable context to readers. Something of particular interest would be their observation that wide circular cells gave chaotic temperature profiles relative to narrow ones and that these improved PCR amplification (10.1002/anie.201004217). I think contextualizing the results shown here in light of this paper would be helpful.

      Thanks for pointing this out and reminding us. We apologize. We agree that the chaotic trajectories within Rayleigh-Bénard convection cells lead to temperature oscillations similar to the salt variations in our gas-flux system. Although the convection-driven PCR in Rayleigh-Bénard is not isothermal like our system, it provides a useful point of comparison and context for understanding environments that can support full replication cycles. We will add a section comparing approaches and giving some comparison into the history of convective PCR and how these relate to the new isothermal implementation.

      We added a main text paragraph after the last paragraph in section “Strand Separation Dynamics”: “[…]Rayleigh-Bénard convection cells generate similar patterns to those seen in Fig. 3(c) The oscillations in salt concentration resemble the temperature fluctuations observed in convection-based PCR reactions from earlier studies [32,33], which showed that chaotic temperature variations, compared to periodic ones, enhanced the efficiency of the PCR reaction.[…]

      Again, it appears n=1 is shown for Figure 4a-c - the source of the title claim of the paper - and showing some replicates and perhaps discussing them in the context of prior work would enhance the manuscript.

      We appreciate the reviewer for bringing this to our attention. We will now include the two additional repeats for the data shown in Figure 4c, while the repeats of the PAGE measurements are already displayed in Supplementary Fig. IX.2. Initially, we chose not to show the repeats in Figure 4c due to the dynamic and variable nature of the system. These variations are primarily caused by differences at the water-air interface, attributed to the high surface tension of water. Additionally, the stochastic formation of air bubbles in the inflow—despite our best efforts to avoid them—led to fluctuations in the fluorescence measurements across experiments. These bubbles cause a significant drop in fluorescence in a region of interest (ROI) until the area is refilled with the sample.

      Unlike our RNA-focused experiments, PCR requires high temperatures and degassing a PCR master mix effectively is challenging in this context. While we believe our chamber design is sufficiently gas-tight to prevent air from diffusing in, the high surface-to-volume ratio in microfluidics makes degassing highly effective, particularly at elevated temperatures. We anticipate that switching to RNA experiments at lower temperatures will mitigate this issue, which is also relevant in a prebiotic context.

      The reviewer’s comments are valid and prompt us to fully display these aspects of the system. We will now include these repeats in Figure 4c to give readers a deeper understanding of the experiment's dynamics. Additionally, we will provide videos of all three repeats, allowing readers to better grasp the nature of the fluctuations in SYBR Green fluorescence depicted in Figure 4c.

      The data from the triplicates are now added to Figure 4c, showing how air bubbles, forming through degassing at the high temperatures required for Taq polymerase, disrupt the measurement, as they momentarily dry off the channel and stop the reaction until the channel fills again. Figure caption has been adapted and now reads: “[…] Dotted lines show the data from independent repeats. Air bubbles formed through degassing can momentarily disrupt the reaction. […]”

      We additionally changed the main text to explain the reader the experimental difficulties: “[…] In other repetitions of the reaction, this increase was sometimes even observed earlier, around the one-hour mark (dotted lines). However, air bubbles nucleated by degassing events rise and temporarily dry out the channel, interrupting the reaction until the liquid refills the channel (Supplementary Movies 4,4b,4c\&5). Despite our best efforts, we were unable to fully prevent this, especially given the high temperatures required for Taq polymerase activity. In an identical setting when the gas- and water flux were switched off, no fluorescence increase was found (See Fig. 4(c) red lines). Fluorescence variations are additionally caused by fluctuations in the position of the gas-water interface, as discussed earlier. […]”

      I think some caution is warranted in interpreting the PCR results because a primer-dimer would be of essentially the same length as the product. It appears as though the experiment has worked as described, but it's very difficult to be certain of this given this limitation. Doing the PCR with a significantly longer amplicon would be ideal, or alternately discussing this possible limitation would be helpful to the readers in managing expectations.

      This is a good point and should be discussed more in the manuscript. Our gel electrophoresis is capable of distinguishing between replicate and primer dimers. We know this since we were optimizing the primers and template sequences to minimize primer dimers, making it distinguishable from the desired 61mer product. That said, all of the experiments performed without a template strand added did not show any band in the vicinity of the product band after 4h of reaction, in contrast to the experiments with template, presenting a strong argument against the presence of primer dimers.

      We added a main text section explaining this to the reader: “[…]Suppl. Fig. IX.2 shows all independent repeats of the corresponding experiments. No product was detected in any of these cases, ruling out reaction limitations such as primer dimer formation. Primer dimers would form even in the absence of a template strand and would be identifiable through gel electrophoresis. As Taq polymerase requires a significant overlap between the two dimers to bind, this would result in a shorter product compared to the 61mer used here.  […]”

      Reviewer #2 (Public review):

      Schwintek et al. investigated whether a geological setting of a rock pore with water inflow on one end and gas passing over the opening of the pore on the other end could create a non-equilibrium system that sustains nucleic acid reactions under mild conditions. The evaporation of water as the gas passes over it concentrates the solutes at the boundary of evaporation, while the gas flux induces momentum transfer that creates currents in the water that push the concentrated molecules back into the bulk solution. This leads to the creation of steady-state regions of differential salt and macromolecule concentrations that can be used to manipulate nucleic acids. First, the authors showed that fluorescent bead behavior in this system closely matched their fluid dynamic simulations. With that validation in hand, the authors next showed that fluorescently labeled DNA behaved according to their theory as well. Using these insights, the authors performed a FRET experiment that clearly demonstrated the hybridization of two DNA strands as they passed through the high Mg++ concentration zone, and, conversely, the dissociation of the strands as they passed through the low Mg++ concentration zone. This isothermal hybridization and dissociation of DNA strands allowed the authors to perform an isothermal DNA amplification using a DNA polymerase enzyme. Crucially, the isothermal DNA amplification required the presence of the gas flux and could not be recapitulated using a system that was at equilibrium. These experiments advance our understanding of the geological settings that could support nucleic acid reactions that were key to the origin of life.

      The presented data compellingly supports the conclusions made by the authors. To increase the relevance of the work for the origin of life field, the following experiments are suggested:

      (1) While the central premise of this work is that RNA degradation presents a risk for strand separation strategies relying on elevated temperatures, all of the work is performed using DNA as the nucleic acid model. I understand the convenience of using DNA, especially in the latter replication experiment, but I think that at least the FRET experiments could be performed using RNA instead of DNA.

      We understand the request only partially. The modification brought about by the two dye molecules in the FRET probe to be able to probe salt concentrations by melting is of course much larger than the change of the backbone from RNA to DNA. This was the reason why we rather used the much more stable DNA construct which is also manufactured at a lower cost and in much higher purity also with the modifications. But we think the melting temperature characteristics of RNA and DNA in this range is enough known that we can use DNA instead of RNA for probing the salt concentration in our flow cycling.

      Only at extreme conditions of pH and salt, RNA degradation through transesterification, especially under alkaline conditions is at least several orders of magnitude faster than spontaneous degradative mechanisms acting upon DNA [Li, Y., & Breaker, R. R. (1999). Kinetics of RNA degradation by specific base catalysis of transesterification involving the 2 ‘-hydroxyl group. Journal of the American Chemical Society, 121(23), 5364-5372.]. The work presented in this article is however focussed on hybridization dynamics of nucleic acids. Here, RNA and DNA share similar properties regarding the formation of double strands and their respective melting temperatures. While RNA has been shown to form more stable duplex structures exhibiting higher melting temperatures compared to DNA [Dimitrov, R. A., & Zuker, M. (2004). Prediction of hybridization and melting for double-stranded nucleic acids. Biophysical Journal, 87(1), 215-226.], the general impact of changes in salt, temperature and pH [Mariani, A., Bonfio, C., Johnson, C. M., & Sutherland, J. D. (2018). pH-Driven RNA strand separation under prebiotically plausible conditions. Biochemistry, 57(45), 6382-6386.] on respective melting temperatures follows the same trend for both nucleic acid types. Also the diffusive properties of RNA and DNA are very similar [Baaske, P., Weinert, F. M., Duhr, S., Lemke, K. H., Russell, M. J., & Braun, D. (2007). Extreme accumulation of nucleotides in simulated hydrothermal pore systems. Proceedings of the National Academy of Sciences, 104(22), 9346-9351.].

      Since this work is a proof of principle for the discussed environment being able to host nucleic acid replication, we aimed to avoid second order effects such as degradation by hydrolysis by using DNA as a proxy polymer. This enabled us to focus on the physical effects of the environment on local salt and nucleic acid concentration. The experiments performed with FRET are used to visualize local salt concentration changes and their impact on the melting temperature of dissolved nucleic acids.  While performing these experiments with RNA would without doubt cover a broader application within the field of origin of life, we aimed at a step-by-step / proof of principle approach, especially since the environmental phenomena studied here have not been previously investigated in the OOL context. Incorporating RNA-related complexity into this system should however be addressed in future studies. This will likely require modifications to the experimental boundary conditions, such as adjusting pH, temperature, and salt concentration, to account for the greater duplex stability of RNA. For instance, lowering the pH would reduce the RNA melting temperature [Ianeselli, A., Atienza, M., Kudella, P. W., Gerland, U., Mast, C. B., & Braun, D. (2022). Water cycles in a Hadean CO2 atmosphere drive the evolution of long DNA. Nature Physics, 18(5), 579-585.].

      (2) Additionally, showing that RNA does not degrade under the conditions employed by the authors (I am particularly worried about the high Mg++ zones created by the flux) would further strengthen the already very strong and compelling work.

      Based on literature values for hydrolysis rates of RNA [Li, Y., & Breaker, R. R. (1999). Kinetics of RNA degradation by specific base catalysis of transesterification involving the 2 ‘-hydroxyl group. Journal of the American Chemical Society, 121(23), 5364-5372.], we estimate RNA to have a half-life of multiple months under the deployed conditions in the FRET experiment (High concentration zones contain <1mM of Mg2+). Additionally, dsRNA is multiple orders of magnitude more stable than ssRNA with regards to degradation through hydrolysis [Zhang, K., Hodge, J., Chatterjee, A., Moon, T. S., & Parker, K. M. (2021). Duplex structure of double-stranded RNA provides stability against hydrolysis relative to single-stranded RNA. Environmental Science & Technology, 55(12), 8045-8053.], improving RNA stability especially in zones of high FRET signal. Furthermore, at the neutral pH deployed in this work, RNA does not readily degrade. In previous work from our lab [Salditt, A., Karr, L., Salibi, E., Le Vay, K., Braun, D., & Mutschler, H. (2023). Ribozyme-mediated RNA synthesis and replication in a model Hadean microenvironment. Nature Communications, 14(1), 1495.], we showed that the lifetime of RNA under conditions reaching 40mM Mg2+ at the air-water interface at 45°C was sufficient to support ribozymatically mediated ligation reactions in experiments lasting multiple hours.

      With that in mind, gaining insight into the median Mg2+ concentration across multiple averaged nucleic acid trajectories in our system (see Fig. 3c&d) and numerically convoluting this with hydrolysis dynamics from literature would be highly valuable. We anticipate that longer residence times in trajectories distant from the interface will improve RNA stability compared to a system with uniformly high Mg2+ concentrations.

      Added a new Supplementary section for this. We used the trace from Figure 3(c) and calculated the hydrolysis rate for each timestep by using literature values from RNA [Li, Y., & Breaker, R. R. (1999). Kinetics of RNA degradation by specific base catalysis of transesterification involving the 2 ‘-hydroxyl group. Journal of the American Chemical Society, 121(23), 5364-5372.]. We conclude that the conditions deployed for the experiment are not harsh on RNA, with hydrolysis rates in the E-6 1/min regime. The figure below (also now in the supplementary information) shows the hydrolysis of RNA deployed under the conditions of the experiment in Figure 3. RNA is not expected to hydrolyze under these conditions and timescales, in which a replication reaction would occur. With a half life of around 83 days, even a prebiotically plausible – very slow – replication reaction would not be constrained by hydrolysis boundary conditions in this scenario.

      Referenced to this section in the supplementary information in the maintext: […] In the experimental conditions used here, RNA would also not readily degrade, even if the strand enters the high salt regimes (See Suppl. Sec. IX). Using literature values for hydrolysis rates under the deployed conditions, we estimate dissolved RNA to have a half life of around 83 days. […]

      (3) Finally, I am curious whether the authors have considered designing a simulation or experiment that uses the imidazole- or 2′,3′-cyclic phosphate-activated ribonucleotides. For instance, a fully paired RNA duplex and a fluorescently-labeled primer could be incubated in the presence of activated ribonucleotides +/- flux and subsequently analyzed by gel electrophoresis to determine how much primer extension has occurred. The reason for this suggestion is that, due to the slow kinetics of chemical primer extension, the reannealing of the fully complementary strands as they pass through the high Mg++ zone, which is required for primer extension, may outcompete the primer extension reaction. In the case of the DNA polymerase, the enzymatic catalysis likely outcompetes the reannealing, but this may not recapitulate the uncatalyzed chemical reaction.

      This is certainly on our to-do list for future experiments in this setting. Our current focus is on templated ligation rather than templated polymerization and we are working hard to implement RNA-only enzyme-free ligation chain reaction, based on more optimized parameters for the templated ligation from 2’3’-cyclic phosphate activation that was just published [High-Fidelity RNA Copying via 2′,3′-Cyclic Phosphate Ligation, Adriana C. Serrão, Sreekar Wunnava, Avinash V. Dass, Lennard Ufer, Philipp Schwintek, Christof B. Mast, and Dieter Braun, JACS doi.org/10.1021/jacs.3c10813 (2024)]. But we first would try this at an air-water interface which was shown to work with RNA in a temperature gradient [Ribozyme-mediated RNA synthesis and replication in a model Hadean microenvironment, Annalena Salditt, Leonie Karr, Elia Salibi, Kristian Le Vay, Dieter Braun & Hannes Mutschler, Nature Communications doi.org/10.1038/s41467-023-37206-4 (2023)] before making the jump to the isothermal setting we describe here. So we can understand the question, but it was good practice also in the past to first get to know the setting with PCR, then jump to RNA.

      Recommendations for the authors:

      Reviewer #2 (Recommendations for the authors):

      (1) Could the authors comment on the likelihood of the geological environments where the water inflow velocity equals the evaporation velocity?

      This is an important point to mention in the manuscript, thank you for pointing that out. To produce a defined experiment, we were pushing the water out with a syringe pump, but regulated in a way that the evaporation was matching our flow rate. We imagine that a real system will self-regulate the inflow of the water column on the one hand side by a more complex geometry of the gas flow, matching the evaporation with the reflow of water automatically. The interface would either recede or move closer to the gas flux, depending on whether the inflow exceeds or falls short of the evaporation rate. As the interface moves closer, evaporation speeds up, while moving away slows it down. This dynamic process stabilizes the system, with surface tension ultimately fixing the interface in place.

      We have seen a bit of this dynamic already in the experiments, could however so far not yet find a good geometry within our 2-dimensional constant thickness geometry to make it work for a longer time. Very likely having a 3-dimensional reservoir of water with less frictional forces would be able to do this, but this would require a full redesign of a multi-thickness microfluidics. The more we think about it, the more we envisage to make the next implementation of the experiment with a real porous volcanic rock inside a humidity chamber that simulates a full 6h prebiotic day. But then we would lose the whole reproducibility of the experiment, but likely gain a way that recondensation of water by dew in a cold morning is refilling the water reservoirs in the rocks again. Sorry that I am regressing towards experiments in the future.

      We added a paragraph after the second paragraph in Results and Discussion.

      It now reads: […] For a real early Earth environment we envision a system that self-regulates the water column's inflow by automatically balancing evaporation with capillary flows. The interface adjusts its position relative to the gas flux, moving closer if the inflow is less than the evaporation rate, or receding if it exceeds it. When the interface nears the gas flux, evaporation accelerates, while moving it away slows evaporation. This dynamic process stabilizes the system, with surface tension ultimately fixing the interface's position. […]

      (2) Could the authors speculate on using gases other than ambient air to provide the flux and possibly even chemical energy? For example, using carbonyl sulfide or vaporized methyl isocyanide could drive amino acid and nucleotide activation, respectively, at the gas-water interface.

      This is an interesting prospect for future work with this system. We thought also about introducing ammonia for pH control and possible reactions. We were amazed in the past that having CO2 instead of air had a profound impact on the replication and the strand separation [Water cycles in a Hadean CO2 atmosphere drive the evolution of long DNA, Alan Ianeselli, Miguel Atienza, Patrick Kudella, Ulrich Gerland, Christof Mast & Dieter Braun, Nature Physics doi.org/10.1038/s41567-022-01516-z (2022)]. So going more in this direction absolutely makes sense and as it acts mostly on the length-selectively accumulated molecules at the interface, only the selected molecules will be affected, which adds to the selection pressure of early evolutionary scenarios.

      Of course, in the manuscript, we use ambient air as a proxy for any gas, focusing primarily on the energy introduced through momentum transfer and evaporation. We speculate that soluble gasses could establish chemical gradients, such as pH or redox potential, from the bulk solution to the interface, similar to the Mg2+ accumulation shown in Figure 3c. The nature of these gradients would depend on each gas's solubility and diffusivity. We have already observed such effects in thermal gradients [Keil, L. M., Möller, F. M., Kieß, M., Kudella, P. W., & Mast, C. B. (2017). Proton gradients and pH oscillations emerge from heat flow at the microscale. Nature communications, 8(1), 1897.] and finding similar behavior in an isothermal environment would be a significant discovery.

      Added a paragraph in the Conclusion to showcase this: [… ] Furthermore we expect that other gases, such as CO2, could establish chemical gradients in this environment. Such gradients have been observed in thermal gradients before [23] and finding similar behaviour in an isothermal environment would be a significant discovery.[…]

      (3) Line 162: Instead of "risk," I suggest using "rate".

      Thanks for pointing this out! Will be changed.

      Fixed.

      (4) Using FRET of a DNA duplex as an indicator of salt concentration is a decent proxy, but a more direct measurement of salt concentration would provide further merit to the explicit statement that it is the salt concentration that is changing in the system and not another hidden parameter.

      Directly observing salt concentration using microscopy is a difficult task. While there are dyes that change their fluorescence depending on the local Na+ or Mg2+ concentration, they are not operating differentially, i.e. by making a ratio between two color channels. Only then we are not running into artifacts from the dye molecules being accumulated by the non-equilibrium settings. We were able to do this for pH in the past, but did not find comparable optical salt sensors. This is the reason we ended up with a FRET pair, with the advantage that we actually probe the strand separation that we are interested in anyhow. Using such a dye in future work would however without a doubt enhance the understanding of not only this system, but also our thermal gradient environments.

      (5) Figure 3a: Could the authors add information on "Dried DNA" to the caption? I am assuming this is the DNA that dried off on the sides of the vessel but cannot be sure.

      Thanks to the reviewer for pointing this out. This is correct and we will describe this better in the revised manuscript.

      Added a sentence in the caption to address this: […] Fluctuations in interface position can dry and redissolve DNA repeatedly (see “Dried DNA” in right panel). […]

      (6) Figure 4b and c: How reproducible is this data? Have the authors performed this reaction multiple independent times? If so, this data should be added to the manuscript.

      The data from the gel electrophoresis was performed in triplicates and is shown in full in supplementary information. The data in c is hard to reproduce, as the interface is not static and thus ROI measurements are difficult to perform as an average of repeats. Including the data from the independent repeats will however give the reader insight into some of the experimental difficulties, such as air bubbles, which form from degassing as the liquid heats up, that travel upwards to the interface, disrupting the ongoing fluorescence measurements.

      This was also pointed out by reviewer 1 and addressed there.

      (7) Line 256: "shielding from harmful UV" statement only applies to RNA oligomers as UV light may actually be beneficial for earlier steps during ribonucleoside synthesis. I suggest rephrasing to "shielding nucleic acid oligomers from UV damage.".

      Will be adjusted as mentioned.

      Fixed.

      (8) The final paragraph in the Results and Discussion section would flow better if placed in the Conclusion section.

      This is a good point and we will merge results and discussion closer together.

      Fixed.

      (9) Line 262, "...of early Life" is slightly overstating the conclusions of the study. I suggest rephrasing to "...of nucleic acids that could have supported early life."

      This is a fair comment. We thank the reviewer for his detailed analysis of the manuscript!

      Changed the phrase to: […]In this work we investigated a prebiotically plausible and abundant geological environment to support the replication of nucleic acids. […]

      (10) In references, some of the journal names are in sentence case while others are in title case (see references 23 and 26 for example).

      Thanks - this will be fixed.

      Fixed.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      This study provides compelling evidence that RAR, rather than its obligate dimerization partner RXR, is functionally limiting for chromatin binding. This manuscript provides a paradigm for how to dissect the complicated regulatory networks formed by dimerizing transcription factor families.

      Dahal and colleagues use advanced SMT techniques to revisit the role of RXR in DNA-binding of the type-2 nuclear receptor (T2NR) RAR. The dominant consensus model for regulated DNA binding of T2NRs posits that they compete for a limited pool of RXR to form an obligate T2NR-RXR dimer. Using advanced SMT and proximity-assisted photoactivation technologies, Dahal et al. now test the effect of manipulating the endogenous pool size of RAR and RXR on heterodimerization and DNA-binding in live U2OS cells. Surprisingly, it turns out that RAR, rather than RXR, is functionally limiting for heterodimerization and chromatin binding. By inference, the relative pool size of various T2NRs expressed in a given cell, rather than RXR, is likely to determine chromatin binding and transcriptional output.

      The conclusions of this study are well supported by the experimental results and provide unexpected novel insights into the functioning of the clinically important class of T2NR TFs. Moreover, the presented results show how the use of novel technologies can put long-standing theories on how transcription factors work upside down. This manuscript provides a paradigm for how to further dissect the complicated regulatory networks formed by T2NRs or other dimerizing TFs. I found this to be a complete story that does not require additional experimental work. However, I do have some suggestions for the authors to consider.

      Reviewer #1 (Recommendations For The Authors):

      (1) Does the increased chromatin binding measured when the RAR levels are increased reflect a higher occupancy of a similar set of loci, or are additional loci bound? The authors could discuss this issue in the context of the published literature. Obviously, this could be addressed experimentally by ChIP-seq or a similar analysis, but this would extend beyond the main topic of this manuscript.

      We attempted to explore this experimentally using ChIP-seq with multiple RAR- and RXR-specific antibodies. Unfortunately, our results were inconclusive, as the antibody enrichment relative to the IgG control was insufficient for reliable interpretation. Specifically, our ChIP-seq enrichment levels were only around 1.5fold, while the accepted standard for meaningful ChIP enrichment is typically at least 2-fold. Due to these technical limitations, we decided to defer these experiments for now.

      However, we agree with the reviewer that understanding whether the increased chromatin binding of RAR reflects higher occupancy at the same set of loci or binding to additional loci is a key question. In similar experiments involving the transcription factor TFEB (Esbin et al., 2024, Genes Dev, doi: 10.1101/gad.351633.124) where an increase in the SMT bound fraction occurred, both scenarios—higher occupancy at known loci and binding to additional loci in ChIP-seq was observed. So, addressing this intriguing possibility in future studies focused on RAR and RXR would be interesting.

      (2) The results presented suggest convincingly that endogenous RXR is normally in excess to its binding partners (in U2OS cells). This point could be strengthened further by reducing RXR levels, e.g., by knocking out 1 allele or the use of shRNAs (although the latter method might be too hard to control). Overexpression of another T2NR might also help determine the buffer capacity of RXR.

      We appreciate the reviewers’ acknowledgment that our results convincingly demonstrate that endogenous RXR is typically in excess relative to its binding partners in U2OS cells. We agree that this conclusion could be further reinforced by experiments such as overexpression of another T2NR to test RXR's buffering capacity. We are actively pursuing follow-up experiments involving overexpression of additional T2NRs to address this question in more detail. These studies are ongoing, and we plan to explore the buffer capacity of RXR more extensively in a future manuscript.

      (3) The ~10% difference in fbound of RAR and RXR (in Figs 1 and 2), while they should be 1:1 dimers, is explained by invoking the expression of RXR isoforms. Can the authors be more specific concerning the nature of these isoforms?

      We have provided detailed information about different T2NRs expressed in U2OS cells according to the Expression Atlas and the Human Protein Atlas Database in Supplementary Table S1. Table S1 specifically shows that both isoforms of RXRα and RXRβ are expressed in U2OS cells. Additionally, the caption of Table S1 explicitly notes the presence of isoform RXRβ in U2OS cells. In the main text, we reference Table S1 when discussing the 10% difference in fbound between RARα and RXRα, and we have now suggested that the expression of RXRβ likely accounts for the observed discrepancy.

      Reviewer #2 (Public Review):

      Summary:

      In the manuscript "Surprising Features of Nuclear Receptor Interaction Networks Revealed by Live Cell Single Molecule Imaging", Dahal et al combine fast single molecule tracking (SMT) with proximity-assisted photoactivation (PAPA) to study the interaction between RARa and RXRa. The prevalent model in the nuclear receptor field suggests that type II nuclear receptors compete for a limiting pool of their partner RXRa. Contrary to this, the authors find that over-expression of RARa but not RXRa increases the fraction of RXRa molecules bound to chromatin, which leads them to conclude that the limiting factor is the abundance of RARa and not RXRa. The authors also perform experiments with a known RARa agonist, all trans retinoic acid (atRA) which has little effect on the bound fraction. Using PAPA, they show that chromatin binding increases upon dimerization of RARa and RXRa.

      Strengths:

      In my view, the biggest strength of this study is the use of endogenously tagged RARa and RXRa cell lines. As the authors point out, most previous studies used either in vitro assays or over-expression. I commend the authors on the generation of single-cell clones of knock-in RARa-Halo and Halo-RXRa. The authors then carefully measure the abundance of each protein using FACS, which is very helpful when comparing across conditions. The manuscript is generally well written and figures are easy to follow. The consistent color-scheme used throughout the manuscript is very helpful.

      Weaknesses:

      (1) Agonist treatment:

      The authors test the effect of all trans retinoic acid (atRA) on the bound fraction of RARa and RXRa and find that "These results are consistent with the classic model in which dimerization and chromatin binding of T2NRs are ligand independent." However, all the agonist treatments are done in media containing FBS. FBS is not chemically defined and has been found to have between 10 and 50 nM atRA (see references in PMID 32359651 for example). The addition of 1 nM or 100 nM atRA is unlikely to result in a strong effect since the medium already contains comparable or higher levels of agonist. To test their hypothesis of ligand-independent dimerization, the authors should deplete the media of atRA by growing the cells in a medium containing charcoal-stripped FBS for at least 24 hours before adding agonist.

      We acknowledge the reviewer's concern regarding the presence of atRA in FBS and agree that it may introduce baseline levels of agonist. However, in our experiments, both the 1 nM and 100 nM atRA treatments resulted in observable changes in RAR expression levels (Figure S3C). Additionally, the luciferase assays demonstrated that 100 nM atRA significantly increased retinoic acid-responsive promoter activity (Figure S1C). Given these clear responses to atRA, we believe the observed lack of effect on the chromatin-bound fraction cannot be attributed to the presence of comparable or higher levels of atRA in the FBS, as the reviewer suggests. Moreover, since our results align with the established literature and do not impact the core findings of our study, we decided not to pursue the suggested experiments with charcoal-stripped FBS in this manuscript.  

      (2) Photobleaching and its effect on bound fraction measurements:

      The authors discard the first 500 to 1000 frames due to the high localization density in the initial frames. This will preferentially discard bound molecules that will bleach in the initial frames of the movie and lead to an over-estimation of the unbound fraction.

      For experiments with over-expression of RAR-Halo and Halo-RXR, the authors state that the cells were pre-bleached and that these frames were used to calculate the mean intensity of the nuclei. When pre-bleaching, bound molecules will preferentially bleach before the diffusing population. This will again lead to an over-representation of the unbound fraction since this is the population that will remain relatively unaffected by the pre-bleaching. Indeed, the bound fraction for over-expressed RARa and RXRa is significantly lower than that for the corresponding knock in lines. To confirm whether this is a biological result, I suggest that the authors either reduce the amount of dye they use so that this pre-bleaching is not necessary or use the direct reactivation strategy they use for their PAPA experiments to eliminate the pre-bleaching step.

      As for the measurement of the nuclear intensity, since the authors have access to multiple HaloTag dyes, they can saturate the HaloTagged proteins with a high concentration of JF646 or JFX650 to measure the mean intensity of the protein while still using the PA-JFX549 for SMT. Together, these will eliminate the need to prebleach or discard any frames.

      The Janelia Fluor dyes used in our experiments are known for their high photostability (Grimm et al., 2021, JACS Au, doi: 10.1021/jacsau.1c00006). During the initial 80 ms imaging to calculate the mean nuclear intensity, the laser power was kept at very low intensity (~3%) for a brief duration (~10 seconds), in contrast to the high-intensity (~100%) used during the tracking experiments, which span around 3 minutes. This low-power illumination does not induce significant photobleaching but merely puts the dyes in a temporary dark state. Therefore, this pre-bleaching step closely resembles the direct reactivation strategy employed in our PAPA experiments.

      To further address the reviewer's concern, we performed a frame cut-off analysis for our SMT movies of endogenous RARα-Halo and over-expressed RARα-Halo (Figure S9B). The analysis shows no significant change in the bound fraction of either endogenous or over-expressed RARα-Halo when discarding the initial 1000 frames. Based on these results, we conclude that the pre-bleaching does not lead to an overestimation of the unbound fraction, and that our experimental approach is robust.

      (3) Heterogeneous expression of the SNAP fusion proteins:

      The cell lines expressing SNAP tagged transgenes shown in Fig S6 have very heterogeneous expression of the SNAP proteins. While the bulk measurements done by Western blotting are useful, while doing single-cell experiments (especially with small numbers - ~20 - of cells), it is important to control for expression levels. Since these transgenic stable lines were not FACS sorted, it would be helpful for the reader to know the spread in the distribution of mean intensities of the SNAP proteins for the cells that the SMT data are presented for. This step is crucial while claiming the absence of an effect upon over-expression and can easily be done with a SNAPTag ligand such as SF650 using the procedure outlined for the over-expressed HaloTag proteins.

      We agree with the reviewer that there is heterogeneity in SNAP protein expression across the transgenic lines. In response to the reviewer’s suggestion, we performed the proposed experiment to assess the distribution of mean intensities for two key experimental conditions: Halo-RXRα with overexpressed RARα-SNAP and HaloRXRα with overexpressed RARαRR-SNAP. These results again confirm that the increase in chromatin-bound fraction of Halo-RXRα is observed only in the presence of RARα capable of heterodimerizing with RXRα, supporting our main conclusion (Figure S9).

      For these experiments, we followed the same labelling procedure described in the methods section for tracking endogenous Halo-tagged proteins alongside transgenic SNAP proteins. As shown in Figure S9, for ~ 70 cell nuclei, the distribution of mean intensities is similar for both conditions, with the bound fraction of Halo-RXRα significantly increasing in the presence of RARα-SNAP compared to RARαRR-SNAP. This analysis underscores that the observed effects are indeed due to the functional differences between the two RARα variants rather than variability in expression levels.

      (4) Definition of bound molecules:

      The authors state that molecules with a diffusion coefficient less than 0.15 um2/s are considered bound and those between 1-15 um2/s are considered unbound. Clarification is needed on how this threshold was determined. In previous publications using saSPT, the authors have used a cutoff of 0.1 um2/s (for example, PMID 36066004, 36322456). Do the results rely on a specific cutoff? A diffusion coefficient by itself is only a useful measure of normal diffusion. Bound molecules are unlikely to be undergoing Brownian motion, but the state array method implemented here does not seem to account for non-normal diffusive modes. How valid is this assumption here?

      We acknowledge the inconsistency in the diffusion coefficient thresholds for defining the chromatin-bound fraction used across our group’s publications. The choice of threshold or cutoff (0.1 µm²/s vs 0.15 µm²/s) is largely arbitrary and does not significantly impact the results. To validate this, we tested the effect of different cutoffs on fbound (%) for endogenously expressed Halo-tagged RARα and RXRα (Figure S10). As shown in Figure S10, there was no substantial difference in fbound (%) calculated using a 0.1 µm²/s versus 0.15 µm²/s cutoff (e.g., RARα clone c156: 47±1% vs 49±1%; RXRα clone D6: 34±1% vs 35±1%). 

      Since we have consistently applied the 0.15 µm²/s cutoff throughout this manuscript across all experimental conditions, the comparative analysis of fbound (%) remains valid. While we agree that a Brownian diffusion model may not fully capture the motion of bound molecules, our state array model accounts for localization error, which likely incorporates some of the chromatin motion features. Moreover, the distinction between bound (<0.15 µm²/s) and unbound (1-15 µm²/s) populations is sufficiently large that using a normal diffusion model is reasonable for our analysis.

      (5) Movies:

      Since this is an imaging manuscript, I request the authors to provide representative movies for all the presented conditions. This is an essential component for a reader to evaluate the data and for them to benchmark their own images if they are to try to reproduce these findings.

      We have now included representative movies for all the SMT experimental conditions presented in the manuscript. Please see data availability section of the manuscript.

      (6) Definition of an ROI:

      The authors state that "ROI of random size but with maximum possible area was selected to fit into the interior of the nuclei" while imaging. However, the readout speed of the Andor iXon Ultra 897 depends on the size of the defined ROI. If the ROI was variable for every movie, how do the authors ensure the same sampling rate?

      We used the frame transfer mode on the Andor iXon Ultra 897 camera for our acquisitions, which allows for fast frame rate measurements without altering the exposure time between frames. Additionally, we verified the metadata of all our movies to ensure a consistent frame interval of 7.4 ms across all conditions. This confirms that the sampling rate was maintained uniformly, despite the variability in ROI size. 

      Reviewer #2 (Recommendations For The Authors):

      (1) 'Hoechst' is mis-spelled.

      We have now corrected this typo in the manuscript.

      (2) Cos7 appears in several places throughout the text. I assume this is a typo. If so, please correct it. If not, please explain if some experiments were done in Cos7 cells and kindly provide a justification for that.

      The use of Cos7 cells is intentional and not a typo. Cos7 cells have been previously utilized in studies investigating the interaction between T2NRs (Kliewer et al., 1992, Nature, doi: 10.1038/355446a0). In our study, due to technical issues with antibodies for coIP in U2OS cells, we initially used Cos7 cells for control experiments to verify that Halo-tagging of RARα and RXRα did not disrupt their interaction, by transiently expressing the constructs in Cos7 cells. Following these control experiments, we confirmed the direct interaction of endogenously expressed RAR and RXR in U2OS cells with their respective binding partners using the SMT-PAPA assay. Since these results confirmed that Halo-tagging did not interfere with RAR-RXR interactions, we chose not to repeat the coIP experiments in U2OS cells.

      Reviewer #3 (Public Review):

      Summary:

      This study aims to investigate the stoichiometric effect between core factors and partners forming the heterodimeric transcription factor network in living cells at endogenous expression levels. Using state-of-the-art single-molecule analysis techniques, the authors tracked individual RARα and RXRα molecules labeled by HALO-tag knock-in. They discovered an asymmetric response to the overexpression of counter-partners. Specifically, the fact that an increase in RARα did not lead to an increase in RXRα chromatin binding is incompatible with the previous competitive core model. Furthermore, by using a technique that visualizes only molecules proximal to partners, they directly linked transcription factor heterodimerization to chromatin binding.

      Strengths:

      The carefully designed experiments, from knock-in cell constructions to singlemolecule imaging analysis, strengthen the evidence of the stoichiometric perturbation response of endogenous proteins. The novel finding that RXR, previously thought to be a target of competition among partners, is in excess provides new insight into key factors in dimerization network regulation. By combining the cutting-edge single-molecule imaging analysis with the technique for detecting interactions developed by the authors' group, they have directly illustrated the relationship between the physical interactions of dimeric transcription factors and chromatin binding. This has enabled interaction analysis in live cells that was challenging in single-molecule imaging, proving it is a powerful tool for studying endogenous proteins.

      Weaknesses:

      As the authors have mentioned, they have not investigated the effects of other T2NRs or RXR isoforms. These invisible factors leave room for interpretation regarding the origin of chromatin binding of endogenous proteins (Recommendations 4). In the PAPA experiments, overexpressed factors are visualized, but changes in chromatin binding of endogenous proteins due to interactions with the overexpressed proteins have not been investigated. This might be tested by reversing the fluorescent ligands for the Sender and Receiver. Additionally, the PAPA experiments are likely to be strengthened by control experiments (Recommendations 5).

      We agree that this would be an interesting experiment. However, there are three technical challenges that complicate its implementation: First, as demonstrated in our original PAPA paper, dark state formation is less efficient when dyes are conjugated to Halo compared to SNAPf, making the reverse configuration less optimal. Second, SNAPf-tagged proteins have slower labeling kinetics than Halotagged proteins, often resulting in under-labeling of SNAPf. Third, our SNAPf transgenes were integrated polyclonally. Since background PAPA scales with the concentration of the sender-labeled protein, variable concentrations of the senderlabeled SNAPf proteins would introduce significant variability, complicating the interpretation of the background PAPA signal. Due to these concerns, we believe that performing reciprocal measurements with reversed fluorescent ligands may not yield reliable results. 

      Reviewer #3 (Recommendations For The Authors):

      (1) The term "Surprising features" in the title is ambiguous and may force readers to search for what it specifically refers to. Including a word that evokes specific features might be helpful.

      Our findings contradict previous work, which suggested that chromatin binding of T2NRs is regulated by competition for a limited pool of RXR. In contrast, we found that RAR expression can limit RXR chromatin binding, but not the other way around, which challenges the existing model. This unexpected result is what we refer to as a "surprising feature" in our title, and we believe it accurately reflects the novel insights our study provides. We also think that this is clearly conveyed in our manuscript abstract, supporting the use of "Surprising features" in the title. 

      (2) p.3, line 11 - The threshold of 0.15 μm2s-1 seems to be a crucial value directly linked to the value of fbound. What is the rationale for choosing this specific value? If consistent conclusions can be obtained using threshold values that are similar but different, it would strengthen the robustness of the results.

      Please refer to our response to Reviewer #2’s Public Review point 4. The threshold choice is arbitrary and doesn’t affect the overall conclusions. To test this, we compared fbound (%) values calculated using both 0.1 μm²s-1 and 0.15 μm²s-1 cutoffs. For example, with endogenously expressed Halo-tagged RARα (clone c156), we observed fbound values of 47±1% vs 49±1%, and for RXRα (clone D6), 34±1% vs 35±1%, respectively (Figure S10). Since we have consistently applied the 0.15 μm²s-1 cutoff across all experimental conditions in this manuscript, the comparisons of fbound (%) between different conditions are robust and valid.

      (3) p.4, line 13 - "the fbound of endogenous RARα-Halo (47{plus minus}1%) was largely unchanged upon expression of SNAP (47{plus minus}1%)" part of the sentence is not surprising. It would make more sense if it were expressed as "the fbound of endogenous RARα-Halo (47{plus minus}1%) was largely unchanged upon expression of RXRα-SNAP (49{plus minus}1%), consistent with the control SNAP (47{plus minus}1%).".

      We understand how the original phrasing may be confusing to the readers and have restructured the sentence as suggested by the reviewer for clarity.

      (4) p.6, line 26 - The discussion that "most chromatin binding of endogenous RXRα in U2OS cells depends on heterodimerization partners other than RARα" seems to contradict the top right figure in Figure 4. If that's the case, the binding partner for the bound red molecule might be yellow rather than blue. Given a decrease in the number of RARα molecules with an unchanged binding ratio, the total number of binding molecules has decreased. Could it be interpreted that the potential reduction in RXRα chromatin binding, accompanying the decrease in binding RARα, is compensated for by other partners?

      We agree with the reviewer that both the yellow and blue molecules in Figure 4 represent T2NRs that can heterodimerize with RXR. For simplicity, we chose to omit the depiction of RXR dimerization with other T2NRs (represented in yellow) in Figure 4. We have now included a note in the figure caption to clarify this. We plan to follow up on the buffer capacity of RXR with other T2NRs in a separate manuscript and will discuss this aspect in more detail once we have data from those experiments.

      (5) Fig. 3 - I expected that DR localizations always appear more frequently than PAPA localizations by the difference in the number of distal molecules. Why does the linear line for SNAP-RXRα in Fig. 3 B have a slope exceeding 1? Also, although the sublinearity is attributed to binding saturation, is there any possibility that this sublinearity originates from the PAPA system like the saturation of PAPA reactivation? Control samples like Halo-SNAPf-3xNLS might address these concerns.

      The number of DR and PAPA localizations depends on the arbitrarily chosen intensity and duration of green and violet light pulses. For any given protein pair, different experimental settings can result in PAPA localizations being greater than, less than, or equal to the number of DR localizations. Therefore, the informative metric is not the absolute number of DR and PAPA localizations, but rather how the ratio of PAPA to DR localizations changes between different conditions—such as between interacting pairs and non-interacting controls.

      Regarding the sublinearity, we agree that it is essential to consider whether the observed sublinearity might stem from saturation of the PAPA signal. We know of two ways in which this could occur:

      First, PAPA can be saturated as the duration of the green light pulse increases and dark-state complexes are depleted. However, this cannot explain the nonlinearity that we observe, because the duration of the green light pulse is constant, and thus the probability that a given complex is reactivated by PAPA is also constant. Likewise, holding the violet pulse duration constant yields a constant probability that a given molecule is reactivated by DR. PAPA localizations are expected to scale linearly with the number of complexes, while DR localizations are expected to scale linearly with the total number of molecules. Sublinear scaling of PAPA localizations with DR localizations thus implies that the number of complexes scales sublinearly with the total concentration of the protein.

      Second, saturation could occur if PAPA localizations are undercounted compared to DR localizations. While this is a valid concern, we consider it unlikely in this case because 1) our localization density is below the level at which our tracking algorithm typically undercounts localizations, and 2) we observe sublinearity for RXR → RAR PAPA even though the number of PAPA localizations is lower than the DR localizations; undercounting due to excessive localization density would be expected to introduce the opposite bias in this case.

      (6) Fig. 4 - The differences between A, B, and C on the right side of the model are subtle, making it difficult to discern where to see. Emphasizing the difference in molecule numbers or grouping free molecules at the top might help clarify these distinctions.

      We appreciate the reviewer’s feedback. In response, we have revised Figure 4 by grouping the free molecules on the top right side for panels A, B and C, as suggested.

      (7) While the main results are obtained through single-molecule imaging, no singlemolecule fluorescence images or trajectory plots are provided. Even just for representative conditions, these could serve as a guide for readers trying to reproduce the experiments with different custom-build microscope setups. Also, considering data availability, depositing the source data might be necessary, at least for the diffusion spectra.

      We have now included representative movies for all the presented SMT conditions as source data. Please see data availability section of the manuscript.

      (8) Tick lines are not visible on many of the graph axes. 

      We have revised the figures to ensure that the tick lines are now clearly visible on all graph axes.

      (9) Inconsistencies in the formatting are present in the methods, such as "hrs" vs. "hours", spacing between numbers and units, and "MgCl2". "u" should be "μ" and "x" should be "×". 

      We have corrected the formatting errors.

      (10) Table S4, rows 16 and 17 - Are "RAR"s typos for "RXR"s? 

      We have corrected this in the manuscript.

      (11) p.10~12 - Are three "Hoestch"s typos for "Hoechst"s? 

      This is now corrected in the manuscript.

      (12) p.11, line 17 - According to the referenced paper, the abbreviation should be "HILO" in all capital letters, not "HiLO". 

      This is now corrected in the manuscript.

      (13) "%" on p.3, line 18, and "." on p.6, line 27 are missing. 

      This missing “%”  and “.” are now added.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript by Yao S. and colleagues aims to monitor the potential autosomal regulatory role of the master regulator of X chromosome inactivation, the Xist long non-coding RNA. It has recently become apparent that in the human system, Xist RNA can not only spread in cis on the future inactive X chromosome but also reach some autosomal regions where it recruits transcriptional repression and Polycomb marking. Previous work has also reported that Xist RNA can show a diffused signal in some biological contexts in FISH experiments.

      In this study, the authors investigate whether Xist represses autosomal loci in differentiating female mouse embryonic stem cells (ESCs) and somatic mouse embryonic fibroblasts (MEFs). They perform a time course of ESC differentiation followed by Capture Hybridization of Associated RNA Targets (CHART) on both female and male ESCs, as well as pulldowns with sense oligos for Xist. The authors also examine transcriptional activity through RNA-seq and integrate this data with prior ChIP-seq experiments. Additional experiments were conducted in MEFs and Xist-ΔB repeat mutants, the latter fails to recruit Polycomb repressors.

      Based on this experimental design, the authors make several bold claims:

      (1) Xist binds to about a hundred specific autosomal regions.<br /> (2) This binding is specific to promoter regions rather than broad spreading.<br /> (3) Xist autosomal signal is inversely correlated with PRC1/2 marks but positively correlated with transcription.<br /> (4) Xist targeting results in the attenuation of transcription at autosomal regions.<br /> (5) The B-repeat region is important for autosomal Xist binding and gene repression.<br /> (6) Xist binding to autosomal regions also occurs in somatic cells but does not lead to gene repression.

      Together, these claims suggest that Xist might play a role in modulating the expression of autosomal genes in specific developmental and cellular contexts in mice.

      Strengths:

      This paper deals with an interesting hypothesis that Xist ncRNA can also function at autosomal loci.

      Weaknesses:

      The revised manuscript now includes many additional bioinformatic analyses to support the premise that Xist RNA targets a specific set of about 100 promoters and attenuates their expression in the early stages of differentiation. I have previously raised significant concerns about the bioinformatic analyses and the robustness of the data, especially those linked to CHART-seq datasets. Despite some improvements, fundamental problems with the analysis remain, precluding a conclusion on whether Xist RNA binds specific autosomal promoters. The main concerns include:

      (1) The authors nicely explain the use of biological replicates; however, they still fail to provide the sufficient analysis I requested on d0 and sense probes. While some quantification is presented in Figures 1E and 1F, the peak calling I asked for has still not been performed. In the response document, the authors report that about 600 peaks were identified in d0 female ESCs compared to about 100 in differentiated conditions. They explain this by the well-known phenomenon of having a background of differentiated cells in d0. In my opinion, this reasoning is flawed. With 98% of cells not inducing Xist in the culture, it is unimaginable why 600 peaks would be detected in the peak calling analysis. Rather, this demonstrates a high background in the CHART peak calling. To assess this further, I have reanalyzed d7 CHART datasets and found robust enrichment of the sense probe on promoters of genes, even stronger than the antisense probe. MACS peak calling also identifies a robust number of peaks on the sense probe. Indeed, even though Figure 1F shows low sense probe enrichment, this is because it focuses on the anti-sense peaks only. An opposite effect is observed when focusing on all genes or on sense-peaks. Thefore it is tough to decide which of the signal is truelly due to Xist binding and what is an inherent problem with the CHART signal. These results cast serious doubts on the biological conclusions of this work and point to a very high background level of promoter signal in both sense and antisense samples.

      (2) The authors do not address the conundrum of their results: how is it possible to have a genome-wide autosomal accumulation of Xist signal at promoters (see Figures 1A and 1B), while simultaneously specifically affecting only 100 promoters in the genome? The signal is either general (as Figures 1A and 1B suggest) or specific (as implied by the peak calling), but it cannot be both. Current data points to the fact that CHART has a bias for the most open parts of the chromatin.

      (3) The text is still very confusing when it comes to Polycomb. Some experiments point to the fact that there are few PRC1/2 marks at putative Xist autosomal binding sites (Figure 3C), while the use of X1 induces the loss of PRC2 marks. I still find this internally contradictory. The authors sadly do not address my concerns with additional analysis. Their current data indicate that upon Xist upregulation, Xist-RNA binds to autosomal regions that are highly expressed and devoid of Polycomb. These loci then become transcriptionally attenuated and gain some (but low) level of PRC2 in a Xist-dependent fashion. If this model is true, then all these regions should not have Xist in d0 of differentiation and should also have slightly lower levels of PRC2. The argument that there is a low level of Xist in 2-5% of cells should not be a problem because most of the signal will come from the 98% of cells not expressing Xist (as seen in Figure 1A). Without timepoint 0, the whole premise of the paper is difficult to interpret. Either the d0 samples are good enough, or the system is so leaky that it is nearly impossible to identify Xist-specific effects. Males are a useful control but are obviously a genetically very different line with distinct epigenetic and signaling statuses. It is crucial to compare the timing of repression/PRC accumulation to conclude if and how Xist is functional on these loci.

      (4) The authors did not address my concerns about the transcriptional analysis. I belive that the control genes are not selected properly. This analysis should not have been performed on just 100 randomly selected regions/genes. Instead, bootstrapping of 100 randomly selected regions/genes should be done, e.g., 1000 times. Additionally, one should only sample from expressed genes to have a comparable control gene set. For example, in Figures 4D and 4E, the distribution of control regions is entirely different. To stress again, relying on a set of 100 randomly selected genes/regions is not statistically robust; controls have to be matched, and bootstrapping has to be performed. Finally, each timepoint uses a different set of autosomal targets. There is a need to visualize the same set of genes across all timepoints (including d0). For example, are genes bound by Xist at d7 highly expressed at d0 and then attenuated only at d7? What happens to them at d14 (see points from 3)? The arguments about d0 heterogeneity are again not convincing (nor is Figure 3H, which shows a different set of genes).

      (5) Transcriptional analysis is often shown only as tracks however the reads for key example genes have to be quantified properly and not just visualized or amalgamated in a violin plot.

    2. Reviewer #2 (Public review):

      Summary:

      To follow-up on recent reports of Xist-autosome interaction the authors examine female (and male transgenic) mESCs and MEFs by CHARTseq. Upon finding that only 10% of reads map to X, they sought to identify reproducible alternative sites of Xist-binding, and identify ~100 autosomal Xist-binding sites in active chromatin regions. They demonstrate a transient down-regulation of autosomal expression. They utilize published male transgenic inducible Xist mESC data to support their findings. In their system, inhibition of Xist reduces autosomal impact.

      Strengths:

      The authors address a topical and interesting question with a series of models including developmental timepoints and utilize unbiased approaches (CHARTseq, RNAseq). For the CHARTseq they have controls of both sense probes and male cells; and indeed do detect considerable background with their controls. The use of 'metagene' plots provides a visual summation of genic impact. They compare with published data.

      Weaknesses:

      The revised text and rebuttal clarified my confusion of the 'follow-up' analyses (Figure 4) compared to published datasets. Further, the figure legends have been improved.

      While the controls were a strength, it appears that when focussed on bound regions, the background (from sense probes) is now also substantially higher than global background (compare 1E to 1A/B). Thus, why do these autosomal targets enrich for the sense probes, and how to distinguish from such background for the ∆B experiments? If male and sense are both controls, then why is sense lower for males than females, doesn't this suggest Xist impact? While authors note d0 might detect Tsix, the signal is only slightly reduced by d14 and never equivalent. Indeed, the new PCA (S1C) does show as noted that female Xist interactions are distinct from sense and male, but the male signal is even more distinct from sense probes.

      It would have been preferable to see the dispersion of the Xist RNA cloud in these ∆B cells, rather than a reference.

      Only 2 replicates were used, but there were multiple time-points: D0, D4, d7, d14; further, the correlation analysis showed good reproducibility, and in response to reviews they note that 2 replicates are standard of practice.

      The conclusion that RepB is "required for localization to the ~100 genes" is based on density (panel 2E); however, these autosomal targets retain enrichment at TSSs (panel 2A) and indeed the text suggests they are the same sites, suggesting that in fact the choice of autosomal region binding is not RepB dependent. Thus, this remains unresolved for me.

      The introduction is clear, and the senior author is a leader in the field; however, by this reviewer's count 19 of the 52 references include the senior author.

      Better descriptors for the supplemental Excel files would be helpful.

      Aim achievement: The authors do identify autosomal sites with enrichment of chromatin marks and evidence of silencing. Their revised text clarifies many issues, although this reviewer still remains unconvinced that the autosomal targeting is repB-dependent.

      The impact of Xist on autosomes is important for consideration of impact of changes in Xist expression with disease (notably cancers). Knowing the targets (if consistent) would enable assessment of such impact.

    3. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review):

      Summary:

      The manuscript by Yao S. and colleagues aims to monitor the potential autosomal regulatory role of the master regulator of X chromosome inactivation, the Xist long non-coding RNA. It has recently become apparent that in the human system, Xist RNA can not only spread in cis on the future inactive X chromosome but also reach some autosomal regions where it recruits transcriptional repression and Polycomb marking. Previous work has also reported that Xist RNA can show a diffused signal in some biological contexts in FISH experiments.

      In this study, the authors investigate whether Xist represses autosomal loci in differentiating female mouse embryonic stem cells (ESCs) and somatic mouse embryonic fibroblasts (MEFs). They perform a time course of ESC differentiation followed by Capture Hybridization of Associated RNA Targets (CHART) on both female and male ESCs, as well as pulldowns with sense oligos for Xist. The authors also examine transcriptional activity through RNA-seq and integrate this data with prior ChIP-seq experiments. Additional experiments were conducted in MEFs and Xist-ΔB repeat mutants, the latter fails to recruit Polycomb repressors.

      Based on this experimental design, the authors make several bold claims:

      (1) Xist binds to about a hundred specific autosomal regions.

      (2) This binding is specific to promoter regions rather than broad spreading.

      (3) Xist autosomal signal is inversely correlated with PRC1/2 marks but positively correlated with transcription.

      (4) Xist targeting results in the attenuation of transcription at autosomal regions.

      (5) The B-repeat region is important for autosomal Xist binding and gene repression.

      (6) Xist binding to autosomal regions also occurs in somatic cells but does not lead to gene repression.

      Together, these claims suggest that Xist might play a role in modulating the expression of autosomal genes in specific developmental and cellular contexts in mice.

      Strengths:

      This paper deals with an interesting hypothesis that Xist ncRNA can also function at autosomal loci.

      Weaknesses: The claims reported in this paper are largely unsubstantiated by the data, with multiple misinterpretations, lacking controls, and inadequate statistics. Fundamental flaws in the experimental design/analysis preclude the validity of the findings. Major concerns are listed below: (1) The entire paper is based on the CHART observation that Xist is specifically targeted to autosomal promoters. Overall, the data analysis is flawed and does not support such conclusions. Importantly the sense WT and the 0h controls are not used, nor are the biological replicates. 

      We respectfully disagree with Rev1 but nevertheless thank the reviewer for making some suggestions that helped to strengthen our manuscript.  We have provided new experiments and analyses in the revised manuscript. Please see responses below.

      Rev1 seems to have missed or misunderstood some key experiments. In fact, the sense WT and 0h controls were shown. Furthermore, we included at least two biological replicates for each experiment.

      We used both male ES cells (which do not express Xist) and sense probes as key negative controls, as outlined in Figure S1. Crucially, we only analyzed peaks that were reproducible between biological replicates. The Xist CHART peaks in differentiating female ES cells were significantly enriched above the “background” defined by the sense probe and male controls. Specifically, in comparison to undifferentiated female ES cells (day 0) where both X chromosomes are active and Xist is not induced, Xist CHART robustly pulled down the X chromosome during cell differentiation (day 4, day 7, and day 14). In contrast, male ES cells showed no significant pull-down of the X chromosome, and the sense group also exhibited markedly reduced binding (new Figure S1B). Furthermore, Principal Component Analysis (PCA) of CHART-seq reads (day 4 as an example) include Xist, sense, and input in WT and ΔRepB female, further confirmed that the sense probe CHART was clearly distinguishable from Xist CHART signals. Please see revised Figure S1C. Together, these findings underscore the specificity and robustness of our CHART results.

      Data is typically visualized without quantification, and when quantified, control loci/gene sets are erroneously selected. Firstly, CHART validation on the X in FigS1 is misleading and not based on any quantifications (e.g., see the scale on Kdm6a (0-190) compared to Cdkl5 (0-40)). If scaled appropriately, there is Xist signal on the escapee. 

      Rev1 may have misread the presented data. In the example raised by Rev1, Fig. S1 is inherently quantitative: e.g., a ratio is a number in Fig. S1A (now Fig. S1B) and all gene tracks in Fig. 1B-E are shown with scales. We showed X-linked genes in Fig. S1 (now Fig. S2) as a control to demonstrate that the CHART worked and that Xist accumulated over time from day 0 to day 14. Our new Figure 1B demonstrates the Xist accumulation in graph format. 

      Our paper focuses on Xist autosomal binding sites. Thus, the X-linked examples were placed in the supplement. Escapee genes do in fact accumulate Xist at their promoter regions and this finding is consistent with data published by Simon et al. (2013, Nature). It was therefore not desirable in this paper to reanalyze X-linked genes, including escapees. Nevertheless, to address the reviewer’s concerns, we present new data in new Figure S3A. Here we analyzed the density of Xist binding across X-linked genes, including both active and inactive genes, as well as escapee genes. From this quantitative analysis, it should be clear that escapees do bind Xist. However, from the metagene plots in Figure S3B, we confirm the previous conclusion that escapees bind Xist at high levels just upstream of the promoter and that there is a depletion of Xist in the escapee gene body, consistent with a barrier preventing Xist from moving into the active gene. 

      All X-linked loci should have been quantified and classified based on escape status; sense control should also be quantified, and biological replicates should be shown separately. 

      Please see above response.

      Additionally, in the revised manuscript, we have examined the Irreproducible Discovery Rate (IDR) to validate the reproducibility of peaks between the two replicates in the revised version, and we included a representative example from female WT ES cells at day 4 (revised Figure S4A). The results showed a strong correlation between the replicates, with an IDR threshold of 0.05 (red point > 0.05). As described in the Methods section, to ensure reliable and robust peak identification, we performed peak calling (MACS2) separately on each replicate, and then used bedtools intersect to identify peaks that overlapped between the two replicates. This stringent process, including strict q-value settings in MACS2, ensures the reliability and reproducibility of the peaks presented in this study.

      Secondly, and most importantly, Figure 1 does not convincingly show specific Xist autosomal binding. Panel A quantification is on extremely variable y-scales and actually shows that Xist is recruited globally to nearly all autosomal genes, likely indicating an unspecific signal. Again, the sense and 0h controls should have been quantified along with biological replicates. 

      Figure 1 shows heatmaps and corresponding metagenes for d0, d4, d7, and d14 female ES cells. Two biological replicates are analyzed. In our revised manuscript, we have used Pearson and Spearman correlation coefficients to measure the strength and direction of a relationship between two biological replicates and shown that the two replicates have high reproducibility (new Figure S1A). On d0, the Xist coverage on autosomes and X chromosome is low, but there is a clear increase on d4, d7, and d14, particularly at the TSS of autosomal genes, as shown by the metagene plots on in Figure 1A-B and the CHART density maps in new Figure 1E-F. We also show relative depletion of Xist signals in the male and sense negative controls.

      Upon inspecting genome browser tracks of all regions reported in the manuscript (Rbm14, Srp9, Brf1, Cand2, Thra, Kmt2c, Kmt2e, Stau2, and Bcl7b), the signal is unspecific on all sites with the possible exception of Kmt2e. On all other loci, there is either a strong signal in the 0h ESC controls or more signal in some of the sense controls. This implies that peak calling is picking up false positive regions. How many peaks would have been picked up if the sense or the 0h controls were used for peak calling? It is likely that there would be a lot since there are also possible "peaks" (e.g., Fzd9) in control tracks. 

      The analysis cannot be performed by visual inspection. A statistical analysis must be performed to call signal above noise. This is why we performed peak-calling on two biological replicates and identified overlapping peaks using bedtools intersect to improve reliability. Significant peaks are noted as black bars under each track. As mentioned above, for our analysis, we focused on the top 100 peaks based on peak scores to ensure robustness. Xist has significantly higher signal compared to the sense probe in the Xist-autosomal peak regions (revised Figure 1E-F). Additionally, we conducted peak calling on undifferentiated ES cells (d0) and detected a significantly higher number of peaks (~600) compared to the differentiated states (d4 or d7) (~100).

      Single-cell sequencing studies have shown that about 2% of undifferentiated mESCs express detectable Xist (Pacini et al., Nat Commun, 2021). The Xist peaks in “day 0” cells may be due to the differentiating population.

      Further inspection of the data was not possible as the authors did not provide access to the raw fastq files. When inspecting results from past published experiments {Engreitz, 2013 #1839} reported regions were not bound by Xist. 

      On the contrary, we deposited the raw data files to GEO prior to the submission of the paper and included the reviewer link to access them. As of August 24, 2024, GEO publicly released these files, allowing for full inspection of the data. 

      Regarding the Engreitz publication, it is not recommended to compare our current study to their analysis for the crucial reason that the Engreitz study was not conducted under physiological conditions. The authors overexpressed the Xist gene in male ES cells. Because Xist RNA can silence genes in male cells as well, this ectopic overexpression normally leads to cell death — thus forcing examination of effects in a narrow time window before Xist can fully spread and act across the genome. Comparing our experiments (endogenous Xist expression in female ES cells) to the ectopic overexpression in male ES cells of Engreitz et al. should therefore not be undertaken.

      Thirdly, contrary to the authors' claim, deleting the B repeat does not lead to a loss of autosomal signal. Indeed, comparing Fig1A and Fig2B side by side clearly shows no difference in the autosomal signal, likely because the autosomal signal is CHART background. Properly quantifying the signal with separate replicates as well as the sense and 0h controls is vital. Overall current data together with published results indicate that CHART peak calling on autosomes is due to technical noise or artefacts.

      In our revised manuscript, we have included the quantitative results as mentioned above in the main and supplementary figure (new Figure 1E-F, Figure 2E-F, and S3A). The data clearly show an enrichment in the Xist CHART samples in differentiating female ES cells.

      We believe the reviewer may be comparing the original Figure 1A and Figure 2A (not Figure 2B). As mentioned above, the analysis cannot be performed by visual inspection. Please see new Figure 2E and 2F. From these data, it should be clear that deleting RepB causes a decrease in Xist targeting to autosomal loci.

      (2) The RNA-seq analysis is also flawed and precludes strong statements. Firstly, the analysis frequently lacks statistical analysis (Fig3B, FigS2B-C) and is often based on visualizations (Fig 3D-G) without quantifications. Day 4 B-repeat deletion does not lead to a significant change in the expression of genes close to Xist signal (Fig3H, d14 does not fully show). 

      Please see new revised Figure 3B and Figures S2B-C (now revised as Figures S6A and S6B). 

      Secondly, for all transcriptional analysis, it is important to show autosomal non-target genes, which is not always done. 

      In the revised manuscript, we included non-target genes for each analysis (new Figure 4E-F, 5D and 5F, 7C and 7E, S7F, S8).

      Indeed, both males and B repeat deletion will lead to transcriptional changes on autosomes as a secondary effect from different X inactivation status. The control set, if used, is inappropriate as it compares one randomly selected set of ~100 genes. This introduces sampling error and compares different classes of genes. Since Xist signal targets more active genes, it is important to always compare autosomal target genes to all other autosomal genes with similar basal expression patterns.

      Please see new Figure S8. We included 100 randomly selected non-target sites on autosomes for this comparative analysis. For consistency, we applied the same flanking regions (10 kb) in the analysis of both target and non-target genes. We believe that this selection method for nontargets is appropriate for two reasons: first, it allows us to control for Xist binding and non-binding; second, it ensures a similar number of genes in both groups, providing a robust foundation for statistical analysis. 

      (3) The ChIP-seq analysis also has some problems. The authors claim that there is no positive correlation between genes close to Xist autosomal binding (10kb) compared to those 50kb away (Fig 3C, S2D); however, this analysis is based entirely on metagene visualization. Signal within the Xist binding sites should be quantified (not genes close by) and compared to other types of genomic loci and promoters. Focusing on the 50kb group only as controls is misleading.

      We believe the reviewer may have misunderstood our conclusions. As stated in the paper, we observed lower coverage of the histone marks H3K27me3 and H2AK119ub, associated with PRC2 and PRC1, respectively. Our conclusions regarding PRC1/2 support the RNA-seq results, indicating that Xist tends to bind to actively expressed genes. In other words, these genes exhibit lower levels of PRC-mediated silencing signals. This observation underscores the relationship between Xist binding and gene activity, highlighting that Xist preferentially associates with regions that are less subject to silencing by polycomb repressive complexes.

      Secondly, the authors only look at PRC mark signal upon differentiation; what about the 0h timepoint, i.e., is there pre-marking? 

      Day 0 is not an appropriate timepoint for this analysis because Xist is not yet induced. There is also a small fraction of cells (<5%) that spontaneously differentiate and start to undergo XCI. Because of these reasons, the day 0 timepoint is considered somewhat heterogeneous and it would be difficult to make conclusions regarding Xist peaks in these samples.

      Most worryingly, the data analysis is not consistent between figures (see Fig3C vs 5H-I). In Fig5, the group of Xist targets was chosen as those within 100kb of Xist binding, which would encompass all the control regions from Fig3C. In this analysis, the authors report that there is Xist-dependent H3K27me3 deposition, and in fact, here the Xist autosomal targets have more of it than the controls. Overall, all of this analysis is misleading, and clear conclusions cannot be made.

      We believe that the reviewer may have also misunderstood the analysis in Figure 5. Figure 5 shows the effect of the Xist inhibitor, X1, on H3K27me3 and gene expression. X1 blocks reduces PRC2 targeting and gene silencing — consistent with X1’s effect on RepA as published in Aguilar et al. 2022. 

      All in all, because the fundamental observation is not robust (see point 1), all subsequent analyses are also affected. There are also multiple other inconsistencies within the analysis; however, they have not been included here for brevity.

      We again respectfully disagree with Rev1 but thank the reviewer for making suggestions that helped to strengthen our manuscript.  We believe that the revised manuscript with new analyses is improved in part because of the reviewer’s critical comments.

      Reviewer #2 (Public review):

      Summary:

      To follow-up on recent reports of Xist-autosome interaction the authors examine female (and male transgenic) mESCs and MEFs by CHARTseq. Upon finding that only 10% of reads map to X, they sought to identify reproducible alternative sites of Xist-binding, and identify ~100 autosomal Xistbinding sites and show a transient impact on expression.

      Strengths:

      The authors address a topical and interesting question with a series of models including developmental timepoints and utilize unbiased approaches (CHARTseq, RNAseq). For the CHARTseq they have controls of both sense probes and male cells; and indeed do detect considerable background with their controls. The use of deletions emphasizes that intact functional Xist is involved. The use of 'metagene' plots provides a visual summation of genic impact.

      Reviewer 2 has made some excellent suggestions. We have revised the manuscript accordingly and are grateful to the reviewer for the recommendations.

      Weaknesses:

      Overall, the result presentation has many 'sample' gene presentations (in contrast to the stronger 'metagene' summation of all genes). The manuscript often relies on discussion of prior X chromosomal studies, while the data generated would allow assessment of the X within this study to confirm concordance with prior results using the current methodology/cell lines. 

      Many of the 'follow-up' analyses are in fact reprocessing and comparison of published datasets. The figure legends are limited, and sample size and/or source of control is not always clear. While similar numbers of autosomal Xist-binding sites were often observed, the presented data did not clarify how many were consistent across time-points/cell types. While there were multiple time points/lines assessed, only 2 replicates were generally done.

      We apologize for the deficiencies in the legend.  The revised manuscript has corrected them.

      We generated many new datasets with deep sequencing, with at least two biological replicates for each. Such experiments are extremely expensive by nature. Thus, two biological replicates are typically considered acceptable.

      Additionally, we performed reanalysis of published datasets to test whether — in the hands of other investigators — cell lines expressing Xist also supported autosomal targeting. Figure 4 is a case in point. Here we examined Tg1 and Tg2, which respond to doxycycline to overexpress Xist from an ectopic site. Transcriptomic analysis showed significant downregulation of autosomal Xist targets, as exemplified by Rbm14 and Bcl7b (new Figure 4C, S9B). In contrast, non-targets of Xist such as Stau1 did not demonstrate significant changes in gene expression (new Figure 4E and 4G). Looking across all autosomal target genes, we observed a significant decrease in mean expression in the Xist overexpressing cell lines (new Figure 4D). The fact that the autosomal changes were also observed in datasets generated by other investigators greatly strengthen our conclusions. 

      Aim achievement:

      The authors do identify autosomal sites with enrichment of chromatin marks and evidence of silencing. More details regarding sample size and controls (both treatment, and most importantly choice of 'non-targets' - discussed in comments to authors) are required to determine if the results support the conclusions.

      Specific scenarios for which I am concerned about the strength of evidence underlying the conclusion:

      I found the conclusion "Thus, RepB is required not only for Xist to localize to the X- chromosome but also for its localization to the ~100 autosomal genes " (p5) in constrast to the statement 2 lines prior: "A similar number of Xist peaks across autosomes in ΔRepB cells was observed and the autosomal targets remained similar". Some quantitative statistics would assist in determining impact, both on autosomes and also X; perhaps similar to the quintile analysis done for expression.

      We have added the Xist coverage panel for day 4 and 7 in the identified Xist-autosomal peak regions (new Figure 1E-F, Figure 2E-F), as mentioned above. The results clearly demonstrate that the deletion of RepB decreases Xist binding to autosomes. Also, we showed that ΔRepB increased X-linked genes expression in our revised Figure 3D. 

      It is stated that there is a significant suppression of X-linked genes with the autosomal transgenes; however, only an example is shown in Figure 4B. To support this statement, a full X chromosomal geneset should be shown in panels F and G, which should also list the number of replicates. 

      Please see new Figure 4B.

      As these are hybrid cells, perhaps allelic suppression could be monitored? Is Med14 usually subject to X inactivation in the Ctrl cells, and is the expression reduced from both X chromosomes or preferentially the active (or inactive) X chromosome?

      If Rev2 is referring to Figure 4, the dataset used in Figure 4 comes from another research group and was previously published (Loda, A. et al. Nat Commun, 2017).

      If Rev2 is referring to our ES cells, they are N2 cell lines.  The X chromosomes are fully hybridized (Cas/Mus), but the autosomes are not fully hybridized (Ogawa et al., Science, 2008). Med14 is subject to XCI and is expressed from the Xa, silenced on the Xi. 

      The expression change for autosomes after transgene induction is barely significant; and it was not clear what was used as the Ctrl? This is a critical comparator as doxycycline alone can change expression patterns.

      We agree that there was a modest change in expression after transgene induction, but it is a significant change. Again, the dataset is from a published study where the authors generated doxycycline-responsive Xist transgenes (see above). The control in this case is Dox-treated wildtype cells. We now clarify these points.

      In the discussion there is the statement. "Genetic analysis coupled to transcriptomic analysis showed that Xist down-regulates the target autosomal genes without silencing them. This effect leads to clear sex difference - where female cells express the ~100 or so autosomal genes at a lower level than male cells (Figure 7H)." This sweeping statement fails to include that in MEFs there is no significant expression difference, in transgenics only borderline significance, and at d14 no significant expression difference. The down-regulation overall seems to be transient during development while targeting is ongoing?

      Indeed, the Xist effects on autosomes seem to occur during cell differentiation in ES cells. While there is no apparent effect in MEFs, we cannot exclude effects on other somatic cells. Regardless of whether the effects are in early development or throughout life, the sex differences may have life-long effects in mammals. The study conducted in human cells by the Plath lab also concluded that the differences primarily affect stem cells.

      Finally, I would have liked to see discussion of the consistency of the identified genes to support the conclusion that the autosomal sites are not merely the results of Xist diffusion.

      We address this in the third paragraph of the Discussion. Our main argument is that if autosomal binding were caused by diffusion, then RepB deletion or X1 treatment would have led to increased binding at autosomal sites, as Xist would bind less to the X chromosome. However, as demonstrated in our study, both treatments resulted in reduced Xist binding on both the X chromosome and autosomes. This finding suggests that the binding is specific and reliant on Xist's RepA and RepB domains, rather than being a passive diffusion process.

      To examine overlap between the conditions (days of differentiation and WT/RepB cells), we generated Venn Diagrams as now shown in Figure S4E.

      The impact of Xist on autosomes is important for consideration of impact of changes in Xist expression with disease (notably cancers). Knowing the targets (if consistent) would enable assessment of such impact.

      We thank Rev2 for the very helpful review and for the forward-looking experiments. Indeed, the physiological changes brought on by autosomal targeting will be of future interest.

      Reviewer #3 (Public review):

      Summary:

      Yao et al use CHART to identify chromatin associated with Xist in female mouse ESCs, and, as control, male ESCs at various timepoints of differentiation. Besides binding of Xist to X chromosome regions they found significant binding to autosomes, concentrating mostly on promoter regions of around 100 autosomal genes, as elucidated by MACS. The authors went on to show that the RepB repeat is mostly responsible for these autosomal interactions using a female ESC line in which RepB is deleted. Evidence is provided that Xist interacts with active autosomal genes containing lower coverage of repressive marks H3K27me3 and H2AK119ub and that RepB dependent Xist binding leads to dampening of expression, but not silencing of autosomal genes. These results were confirmed by overexpression studies using transgenic ESCs with doxycycline-inducible Xist as well as via a small molecule inhibitor of Xist (X1), inducing/inhibiting the dampening of autosomal genes, respectively. Finally, using MEFs and Xist mutants RepB or RepE the authors provide evidence that Xist is bound to autosomal genes in cells after the XCI process but appears not to affect gene expression. The data presented appear generally clear and consistent and indicate some differences between human and mouse autosomal regulation by Xist. Thus, these results are timely and should be published.

      We thank Rev3 for the positive remarks and great suggestions.  We have amended the manuscript per below. 

      Strengths:

      Regulation of autosomal gene expression by Xist is a "big deal" as misregulation of this lncRNA causes developmental defects and human disease. Moreover, this finding may explain sexspecific developmental differences between the sexes. The results in this manuscript identify specific mouse autosomal genes bound by Xist and decipher critical Xist regions that mediate this binding and gene dampening. The methods used in this study are appropriate, and the overall data presented appear convincing and are consistent, indicating some differences between human and mouse autosomal regulation by Xist.

      Weaknesses:

      (1) The figure legends and/or descriptions of data are often very short lacking detail, and this unnecessarily impedes the reading of the manuscript, in particular the figures would benefit not only from more detailed descriptions/explanations of what has been done but also what is shown. 

      We have included more detailed descriptions in the figure legends and throughout the manuscript.

      This will facilitate the reading and overall comprehension by the reader. One out of many examples: In Fig S1B in the CHART data at d4 and d7 there is not only signal in female WT Xist antisense but also in female sense control. For a reader that is not an expert in XCI it would be helpful to point out in the legend that this signal corresponds to the lncRNA Tsix (I suppose), that is transcribed on the other strand.

      We thank the reviewer for this excellent point.  We have amended the Results section accordingly.

      (2) Different scales are used in the lower panels of Figures 1A and 2A, which makes it difficult to directly compare signals between the different differentiation stages.

      We have included a figure combining all timepoints — d0, d4, d7, and d14 WT female Xist CHART signals  — on the X chromosome and autosomes to support our thesis. Please see new Figure 1B.

      (3) In this study some of the findings on mouse cells contrast previously published results in human ESCs: 1) Xist binding occurs preferentially to promoters in mice, not in human. 2) Binding of Xist is mostly detected in polycomb-depleted regions in mice but there is a positive correlation between Xist RNA and PRC2 marks in human ESCs. These differences are surprising but may be very interesting and relevant. While I am aware that this might be a difficult task, it would be helpful to experimentally address this issue in order to distinguish whether species specific and/or methodological differences between the studies are responsible for these differences.

      Indeed, our findings in mouse cells contrast with those observed in humans. As discussed in the manuscript, this discrepancy may be attributed to factors such as cell type, differentiation methods, and the Xist pull-down technique employed (our CHART method utilizes a 20 nt oligo library, whereas RAP uses long oligos). We agree that future work should investigate the underlying causes of these differences between mouse and human systems.

      Recommendations for the authors:

      Reviewer #2 (Recommendations for the authors):

      For Figure 2: labelling ∆B on the panel A timeline (e.g. d0-∆B) would make the results clearer for the audience. Panel B makes most sense beside panel E of Figure 1, so combine here and skip in Figure 1?

      We have modified Figure 2A and thank Rev2 for this suggestion. As for the embedded tables: since we performed peak calling for WT and ∆B separately, we believe that showing both the peak numbers and their corresponding peak patterns provides a clearer representation of the data.

      I agree that at day 7 there appears to be a difference in X; but by day 14 this looks much more minimal - is it just time-shifted rather than altered? Perhaps this could be discussed. Autosomal binding sites show no change in number.

      Day 7 exhibits the strongest Xist binding on the X chromosome, consistent with the de novo establishment phase of XCI when Xist is expressed at the highest levels (300 copies/cell during de novo XCI versus ~100 copies/cell during maintenance [Sunwoo et al., 2015 as cited]. Per our RNA-seq analysis here, we also observed highest Xist expression on day 7 and reduced levels on day 14 (Fig. S5A). This expression difference explains the reduced Xist CHART levels on day 14 compared to day 7. 

      While the X has previously been examined, it would seem beneficial to conduct the same expression analyses (Figure 3) for the X (perhaps supplemental), as the authors have the data 'in hand'. I feel comparison to X in the main figure for panels A and B would fit, while a similar analysis for the X for panel C could be supplemental, presumably supporting the published data to which this data is currently compared. 

      This is a good suggestion. Please find the new data in Figures 2E-F and 3D, which demonstrate that the RepB deletion inhibits Xist binding on the X chromosome, resulting in increased X-linked gene expression, as previously mentioned. Since Xist binds across the X chromosome, we did not perform peak calling as we did for the autosomes. Therefore, applying a similar analysis as in Figures 3A-B may not be appropriate in this case.

      Such a direct comparison to X-data from the same study would be important. For panel H: How many replicates (2)? This should be in the legend. What is the change in median expression? Again, a supplemental figure showing impact on X-linked targets would be useful. Do male and female ESCs show an expression difference prior to differentiation (ie d0)? The data underlying this Figure should be in one of the supplementary tables, showing the full statistical tests and average change. The supplementary tables 8-12 list the WT target genes, not expression differences with the deletion. Again, given that the difference appears transient, might the ∆B cells be altered in rate of differentiation?

      Panel H (revised Figure 3G) includes two replicates, and this has been added to the legends. We have provided a supplementary figure demonstrating that RepB increases the expression levels of X-linked genes on days 4, 7, and 14 (revised Figure 3D). Male and female ESCs show differences in the expression of X-linked genes, as both X chromosomes are active in females at this stage prior to differentiation (revised Figure S5C). 

      A supplementary table with statistical tests and average change information has been included in our revised version (Table S11).

      On the other hand, these Xist-autosomal target genes displayed no significant differences between WT male, female, or ∆B female cells on day 0 — prior to onset of XCI and Xist expression. Please see new Figure 3H. 

      As for whether ∆B cells are altered in their rate of differentiation, the analysis by Colognori et al. 2019 indicates that ∆B cells differentiate similarly to WT cells. (In Figure 6 of Colognori et al. 2019, autosomal genes expressed similarly in WT and ∆B cells, whereas XCI is affected only in ∆B cells)

      We have also modified the legends for our supplementary tables.

      Why were the transgene lines examined upon neuronal differentiation rather than the same approach as in Figures 1-3? I would have thought neuronal differentiation might be more similar to d14, where limited changes remain? Could the authors clarify and discuss?

      We apologize for the confusion. The Tg lines in Figure 4 came from a previously published study. We performed reanalysis of published datasets because we wanted to test whether — in the hands of other investigators — cell lines expressing Xist also supported autosomal targeting. Here we examined Tg1 and Tg2, which respond to doxycycline to overexpress Xist from an ectopic site. Transcriptomic analysis showed significant downregulation of autosomal Xist targets, as exemplified by Bcl7b and Rbm14 (Figure 4C and S9B). In contrast, non-targets of Xist such as Stau1 did not demonstrate significant changes in gene expression (Figure 4E and 4F). Looking across all autosomal target genes, we observed a significant decrease in mean expression in the Xist overexpressing cell lines (Figure 4D). The fact that the autosomal changes were also observed in datasets generated by other investigators greatly strengthen our conclusions. We have clarified this in the Results section.

      Figure 5 - the legend should specify the number of replicates and clarify the blue/green (intuitive, but not specified). Are the 'target' / 'non-target' genes from d4 Chart (but the RNA from d5)? How are 'non-targets' defined - do they match the 'targets' in certain criteria (expression level, chromatin features, GC content)? Do they change per differentiation protocol?

      We have modified the legends to clarify that the 'target' and 'non-target' genes are derived from the day 4 CHART-seq data, while the RNA data is from day 5, as that study sequenced day 5 and not day 4. Non-targets were randomly chosen based on (i) the absence of Xist binding and (ii) similar expression levels. Please see revised Figure S8.

      It would be helpful to compare Xist expression levels across the various models, and the MEF model could be better described - are they polyploid as often happens?

      We have included the Xist expression levels of ES cells and MEF cells in the revised version (revised Figure S5A, 6D). The transformed MEFs are indeed tetraploid, as is typical.

      For 6A to be informative, one needs to know % mapping to X in ES timeline, which is in supplemental, so perhaps 6A should also be supplemental?

      We have moved 6A to the supplemental figure.

      It is odd that ∆B seems to have had more impact in MEFs, and I would like more discussion - but I also think I am missing something: "We observed that Xist signals were more substantially reduced on both the Xi and autosomal regions in ΔRepE MEFs compared to ΔRepB cells", yet in lower panel 6 G it looks like ∆B is LOWER than ∆E? Am I misinterpreting?

      We apologize for the confusing writing.  The revised text now reads:  “To investigate, we utilized a deletion of Xist’s Repeat E (∆RepE), which was previously demonstrated to severely abrogate localization of Xist to the Xi 41,42. We reasoned that the severe loss of Xist binding might unmask a transcriptomic difference. As expected, we observed that Xist signals were somewhat more reduced on the Xi in ΔRepE MEFs compared to ΔRepB cells (Figure 6E-6F). Despite this reduction, peak coverages in autosomal target genes did not increase in ΔRepE MEFs (Figure 6E-6F). However, there was an overall decrease in the number of significant autosomal peaks in ∆RepE MEFs relative to WT cells (Figure 6A). Regardless, we observed no significant transcriptomic differences in ∆RepE MEFs relative to WT MEFs (Figure 7A-7E). Additionally, further examination of RNA sequencing data from male and female MEF cells in two published studies 43,44 corroborated that the expression levels of these autosomal Xist targets did not exhibit significant changes (Figure 7F and 7G). Altogether, the analysis in MEFs demonstrates that Xist continues to bind autosomal genes in post-XCI somatic cells. However, autosomal binding of Xist in post-XCI cells does not overtly impact expression of the associated autosomal genes. Nonetheless, we cannot exclude more subtle changes that do not meet the significance cut-off.”

      Overall, I would like to see how consistent these autosomal peaks are - I shudder to suggest Venn diagrams, but something to show whether there are day/lineage specific peaks and/or ∆repeat B/E resistant peaks. 

      We now present Venn diagrams comparing MEF, ES_d4, and ES_d7, showing approximately 50% overlap between MEF and ES cells (revised Figure S10B). This may be expected, as each timepoint is a different developmental stage of XCI, with expected gene expression differences.

      Very minor comments:

      It would be easier if the supplemental tables were tabs in 1 file!

      We will defer to the editor on how best to format the supplemental tables.

      Similar to the text, could gene names be included in the supplemental?

      We have provided gene names in the supplemental files.

      Figure 3 legend: should 'representing' be representative?

      We have modified it.

      "Xist patterns identified in human cells" p 5; it is challenging to follow human versus mouse, so specify or ensure correct use of XIST/Xist Indeed, we edited the manuscript accordingly.

      Gene names should be italicized.

      We have italicized gene names in our manuscript.

      Ref. 38 lacks details (...).

      We have updated the reference.

      Peak-like characters - perhaps characteristics? P8

      We have modified this.

      Reviewer #3 (Recommendations for the authors):

      On page 6, the 6th sentence in the first paragraph needs correction. "Consistent with Xist's behavior on the X chromosome."

      We have modified the sentence. Thank you.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      The study by Longhurst et al. investigates the mechanisms of chemoresistance and chemosensitivity towards three compounds that inhibit cell cycle progression: camptothecin, colchicine, and palbociclib. Genome-wide genetic screens were conducted using the HAP1 Cas9 cell line, revealing compound-specific and shared pathways of resistance and sensitivity. The researchers then focused on novel mechanisms that confer resistance to palbociclib, identifying PRC2.1. Genetic and pharmacological disruption of PRC2.1 function, but not related PRC2.2, leads to resistance to palbociclib. The researchers then show that disruption of PRC2.1 function (for example, by MTF2 deletion), results in locus-specific changes in H3K27 methylation and increases in D-type cyclin expression. It is suggested that increased expression of D-type cyclins results in palbociclib resistance.

      Strengths:

      The results of this study are interesting and contribute insights into the molecular mechanisms of CDK4/6 inhibitors. Importantly, while CDK4/6 inhibitors are effective in the clinic, tumour recurrence is very high due to acquired resistance.

      Weaknesses:

      A key resistance mechanism is Rb loss, so it is important to understand if resistance conferred by PRC2.1 loss is mediated by Rb, and whether restoration of PRC2.1 function in Rb-deplete cells results in renewed palbociclib sensitivity. It is also important to understand the clinical implications of the results presented. The inclusion of these data would significantly improve the paper. However, besides some presentation issues and typos as described below, it is my opinion that the results are robust and of broad interest.

      Major questions:

      (1) Is the resistance to CDK4/6 inhibition conferred by mutation of MTF2 mediated by Rb?

      (2) Are mutations in PRC2.1 found in genetic analyses of tumour samples in patients with acquired resistance?

      We thank the reviewer for their editing and experimental suggestions, and have integrated their responses into our re-submitted manuscript.

      We also agree that understanding the role of RB1 in mediating palbociclib resistance to the proposed resistance mechanism is of particular interest. However, as there are three RB proteins expressed in human cells, this is a technically difficult question to probe genetically. Despite this technical challenge, we have provided multiple lines of evidence in our resubmitted manuscript that the resistance to palbociclib observed in our PRC2.1-deficent cells is mediated through the canonical CDK4/6-RB1 pathway. First, disruption of RB1 in HAP1 cells results in palbociclib resistance to a level comparable level to PRC2.1 disruption (Fig. 4E). Second, inactivation of SUZ12 or MTF2 increases the number of cells entering S-phase in palbociclib treatment (Fig. 4G) with no increase in basal rates of apoptosis (Fig. S2D), suggesting that any proliferation advantage observed in PRC2.1-defective cells is due to resistance to  palbociclib-induced cell cycle arrest. Third, we show that over expression of CCND1 and CCND2 is sufficient to drive resistance to palbociclib in wild-type HAP1 cells (Fig. S5F).  And finally, increased levels of CCND1 and CCND2 observed in cells lacking PRC2.1 activity results in higher CDK4/6 activity as measured by RB1 phosphorylation, despite palbociclib blockade (Fig. 6F). All these lines of evidence strongly suggest that MTF2-containing PRC2.1 regulates G1 progression in through the canonical CDK4/6RB1 pathway by repressing CCND1 and CCND2 expression. 

      Whether or not MTF2 deletion leads to palbociclib resistance in clinical samples is also of a question of particular interest. Currently, we are unaware of any reports that specifically mention MTF2 deletion as leading to palbociclib resistance, and we were unable to find another example in our own cancer database review. However, we have included references to other examples of MTF2 mutation resulting in chemotherapeutic resistance in our discussion. Additionally, although MTF2 is rarely observed to be mutated in cancers (Ngubo et al. 2023), it is highly differentially expressed and investigating decreased MTF2 transcription in palbociclib resistant tumors, though challenging, might prove fruitful.  However, as mechanisms of palbociclib resistance is an area of active investigation, we speculate that future studies might uncover additional examples of MTF2 mediating resistance to this clinically important chemotherapeutic.  

      Reviewer #2 (Public Review):

      Summary:

      Longhurst et al. assessed cell cycle regulators using a chemogenetic CRISPR-Cas9 screen in haploid human cell line HAP1. Besides known cell cycle regulators they identified the PRC2.1 subcomplex to be specifically involved in G1 progression, given that the absence of members of the complex makes the cells resistant to Palbociclib. They further showed that in HAP1 cells the PRC2.1, but not the PRC2.2 complex is important to repress the cyclins CCND1 and CCND2. This can explain the enhanced resistance to Palbociclib, a CDK4/6Inhibitor, after PRC2.1 deletion.

      Strengths:

      The initial CRISPR screen is very interesting because it uses three distinct chemicals that disturb the cell cycle at various stages. This screen mostly identified known cell cycle regulators, which demonstrates the validity of the approach. The results can be used as a resource for future research.

      The most interesting outcome of the experiment is the finding that knockouts of the PRC2.1 complex make the cell resistant to Palbociclib. In a further experiment, the authors focused on MTF2 and JARID2 as the main components of PRC2.1 and PRC2.2, respectively. Via extensive analyses, including genome-wide experiments, they confirmed that MTF2 is particularly important to repress the cyclins CCND1 and CCND2. The absence of MTF2 therefore leads to increased expression of these genes, sufficient to make the cell resistant to palociclib. This result will likely be of wide interest to the community.

      Weaknesses:

      The main weakness of the manuscript is that the experiments were performed in only one cell line. To draw more general conclusions, it would be essential to confirm some of the results in other cell lines.

      In addition, some of the findings, such as the results from the CRISPR screen as well as the stronger impact of the MTF2 KO on H3K27me3 and gene expression (compared to JARID2 KO), are not unexpected, given that similar results were already obtained before by other labs.

      We thank the reviewer for their suggestions and we believe that we have addressed their main concern about the generality of the MTF2 regulation of D-type cyclin expression in our resubmitted manuscript. We have now shown through shRNA knockdown that MTF2 represses CCND1 in two additional cell lines, the breast cancer MDA-MB-231 and immortalized monkey COS7 cell line (Fig. 6E). However, it is important to note that MTF2 did not control CCND1 expression in every cell line tested (Fig. 6D), underscoring the context-dependent nature of this regulation. Future studies will illuminate what cell or tumor types in which this regulation is observed.

      Additionally, while MTF2 has previously been shown to exert a greater effect on H3K27me3 levels in some circumstances (Loh et al. 2021, Rothberg et al. 2018), a number of notable reports in ES cell lines have concluded that PRC2 localization and H3K27me3 at the majority of genomic sites are dependent on both PRC2.1 and PRC2.2 activity (Healy et al. 2019, Højfeldt et al. 2019, Perino et al. 2020, Oksuz et al. 2018). Therefore, we think it is important to highlight the greater dependence on MTF2 for promoter proximal H3K27me3 levels in our transformed cell line context.  

      Reviewer #3 (Public Review):

      This study begins with a chemogenetic screen to discover previously unrecognized regulators of the cell cycle. Using a CRISPR-Cas9 library in HAP1 cells and an assay that scores cell fitness, the authors identify genes that sensitize or desensitize cells to the presence of palbociclib, colchicine, and camptothecin. These three drugs inhibit proliferation through different mechanisms, and with each treatment, expected and unexpected pathways were found to affect drug sensitivity. The authors focus the rest of the experiments and analysis on the polycomb complex PRC2, as the deletion of several of its subunits in the screen conferred palbociclib resistance. The authors find that PRC2, specifically a complex dependent on the MTF2 subunit, methylates histone 3 lysine 27 (H3K27) in promoters of genes associated with various processes including cell-cycle control. Further experiments demonstrate that Cyclin D expression increases upon loss of PRC2 subunits, providing a potential mechanism for palbociclib resistance.

      The strengths of the paper are the design and execution of the chemogenetic screen, which provides a wealth of potentially useful information. The data convincingly demonstrate in the HAP1 cell line that the MTF2-PRC2 complex sustains the effects of palbociclib (Figure 4), methylates H3K27 in CpG-rich promoters (Figure 5), and represses Cyclin D expression (Figure 6). These results could be of great interest to those studying cell-cycle control, resistance mechanisms to therapeutic cell-cycle inhibitors, and chromatin regulation and gene expression.

      There are several weaknesses that limit the overall quality and potential impact of the study. First, none of the results from the colchicine and camptothecin screens (Figures 1 and 2) are experimentally validated, which lessens the rigor of those data and conclusions. Second, all experiments validating and further exploring results from the palbociclib screen are restricted to the Hap1 cell line, so the reproducibility and generality of the results are not established. While it is reasonable to perform the initial screen to generate hypotheses in the Hap1 line, other cancer and non-transformed lines should be used to test further the validity of conclusions from data in Figures 4-6. Third, conclusions drawn from data in Figures 3D and 4D are not fully supported by the experimental design or results. Finally, there have been other similar chemogenetic screens performed with palbociclib, most notably the study described by Chaikovsky et al. (PMID: 33854239). Results here should be compared and contrasted to other similar studies.

      We thank the reviewer for their suggestions regarding our manuscript. While the genes recovered as mediating cellular responses to camptothecin and colchicine was never confirmed following our chemogenetic screens, we felt our primary findings were in the area of palbociclib resistance and decided focus our follow-up investigations on genes. We included the results camptothecin and colchicine chemogenetic screens as confirmation of the specificity of PRC2 mutation resulting in resistance to palbociclib (Fig. 4C) and for others in the community to use as a resource for future investigations. We have also clarified our results for Figure 3D and 4D in our revised manuscript, as well as included additional plots of these results (Fig. S1DS1F). And, with our resubmitted manuscript, we believe we have addressed their concern of the generality of our results by demonstrating our primary finding that MTF2 regulates D-type cyclins in additional cell lines other than HAP1. We feel these results indicate that while not “general”, there are additional cellular contexts that our main result holds true. In line with this, and to address how our chemogenetic screens fits into the landscape of previous studies, including Chaikosvsky et al., we have included the following lines to our discussion:  “Additionally, other chemogenetic screens utilizing palbociclib and have not identified that inactivation of PRC2 components as either enhancing or reducing palbociclib-induced proliferation defects, suggesting that PRC2 mutation is neutral in the cell lines studied. These observations not only underscore the context-dependent ramifications of mutation of these PRC2 complex members, but also may help inform the context in which CDK4/6 inhibitors are most efficacious.”

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      (1) "We found that only thirteen and twenty genes resulted in sensitivity or resistance, respectively, in every conditions tested and were deemed non-specific and excluded from any further analysis (see Table S2)." It's unclear to me why these genes were deemed 'nonspecific'. Are these genes functionally important for the general exclusion of xenobiotic molecules?

      By this, we simply meant that these effects were not specific to one condition. Such genes could affect drug half-life or a general stress response, but are less likely to have functions directly tied to the pathway targeted by a drug than are genes whose loss affects only one condition.  

      (2) "Given that increased CCND1 levels is sufficient to drive increased CDK4/6 kinase activity, upregulation of these D-type cyclins is likely to be a significant contributor to the palbociclib resistance in MTF2∆ cells." It's unclear to me what is the basis for this statement. This is only true if there is free CDK4/6. If CDK4/6 is already fully occupied by D-type cyclins, then increased CCND1 levels would not be expected to have an effect. 

      While we anticipated that increased levels of CCND1 would result in more CDK4/6-Dtype association, we now demonstrate in the new Figure S5F that there is more CCND1 in complex with CDK6 in both SUZ12∆ and MTF2∆ cell lines. Furthermore, we able to show in Figure S5G that overexpression of D-type cyclins results in resistant to palbociclib-induced proliferation defects in HAP1 cells.

      (3) The description of the results is very confusing in places, especially regarding "resistance" versus "sensitivity" genes. For example: "CCNE1, CDK6, CDK2, CCND2 and CCND1, all of which are integral to promoting the G1/S phase transition, ranked as the 2nd, 24th, 27th, 29th and 46th most important genes for palbociclib resistance, respectively (Figures 1F and 1G). CCND1 and CCND2 bind either CDK4 or CDK6, the molecular targets of palbociclib, whereas CDK2 and CCNE1 form a related CDK kinase that promotes the G1/S transition.

      Similarly, cells with sgRNAs targeting RB1, whose phosphorylation by CDK4/6 is a critical step in G1 progression, displayed substantial resistance to palbociclib." My reading of this paragraph suggests that disruption of the CDK6 locus is associated with palbociclib resistance - surely this is a typo and instead should have been sensitivity? Please explain.

      We thank the reviewer for pointing this out and have corrected this typo  

      (4) Sensitivity to palbociclib was enhanced in cells expressing sgRNAs targeting H4 acetylation, positive regulators of Pol II transcription, and regulators of the DNA Damage Response pathway (Figures 3A and 3B), although this sensitivity was much weaker than that seen with DNA damaging agents. This observation is consistent with long-term treatment with palbociclib inducing DNA damage, as has been suggested by a number of recent publications 65,66." This is also consistent with recent work on Cdk7 inhibitors (Wilson et al. Mol Cell 2023), as Cdk7 inhibition is expected to affect both CDK1/2/4/6 activities and Pol II transcription.

      We thank the reviewer for bringing this observation to our attention and we have added this citation to this passage in our manuscript.

      (5) Figure 3D - would it not make sense to plot the data such that palbo concentration is on the x-axis? It is also difficult to interpret since the data are normalized to starting "% proliferation" at the indicated palbo treatment, when it is likely that % proliferation changes significantly with palbo concentration. Indeed, this is the graphing format used for a later figure (Figure 4D). The data with rotenone suggests palbo antagonizes rotenone-mediated reduction in proliferation. But it's unclear to me whether the graph shows the converse - that rotenone treatment modulates palbo-induced cell cycle arrest.

      This reviewer is correct about the fact that increasing doses of palbociclib in the absence of oxidative phosphorylation do indeed have an effect on proliferation. However, it is helpful to normalize proliferation values to each initial dose of palbociclib and then compare this to the different oxidative phosphorylation inhibitors treatment combinations. To illustrate that the oxidative phosphorylation inhibitors do indeed antagonize palbociclib-induced proliferation defects, we have now included the data graphed as each oxidative phosphorylation inhibitor vs palbociclib as Supplemental Figures S1D-S1F.

      • The highest concentration of GSK126 tested (5µM) does not appear to confer resistance, but perhaps this is due to off-target effects or cytotoxicity?

      We agree with the reviewer that at the highest doses of dose of GSK126, low doses of palbociclib do not confer resistance to palbociclib. However, higher doses do appear to have this effect. We have included a statement in our results section to address this reviewer’s observations. 

      • Disruption of Emi1 leads to resistance (Figure 1F, FZR1), yet overexpression induces resistance (Mouery et al. bioRxiv 2023). Explain.

      We do not understand why EMI1 responds in this way, and therefore we cannot comment on this in the text. 

      Typos/stylistic comments:

      • Typo "However, the net result of these opposing effects on cell cycle progression, and the contribution of the individual subcomplexes to this regulation, rained unclear."

      We thank the reviewer for pointing this out, and we have corrected it.  

      • Use of the word "growth" - I think the authors should be more precise. Is "proliferation" meant here?

      We thank the reviewer for pointing this out, and we have corrected it.

      • n Figure 4G, two of the panels have 8.42%. Is this correct, or may it be a copy/paste error?

      This was an error, but is no longer relevant as we have reconducted and reanalyzed this experiment.

      Reviewer #2 (Recommendations For The Authors):

      Major Points

      (1) Some of the conclusions should be confirmed in additional cell lines. I would suggest testing the resistance to Palbociclib in several additional cell lines, where MTF2 and JARID2 are deleted. If the conclusion can be generalized, one would expect that the differential role of MTF2 versus JARID2 can be confirmed in more cell lines.

      While the PRC2.1-dependent repression of D-type cyclins does not appear to be general, we have now demonstrated in Figures 5SE and 6F that there are multiple different cellular contexts in which our observations are consistent. Specifically, we demonstrate that GSK126 causes upregulation of CCND1 in both immortalized nontumor cells (COS7 cells) and in the breast cancer cell line MDA-MB-231. Moreover, in both cases we showed that this effect is PRC2.1-dependent, as shRNA knockdown of MTF2 increases expression of CCND1.

      (2) In addition, it may be attractive to make use of publicly available RNA-seq data of MTF2 and JARID2 knockout/down cells, to investigate the generality of the finding that PRC2.1 regulates CCND1 and CCND2.

      While it would be useful to address this issue, Figure S5E demonstrates that the repression of D-type cyclin expression by PRC2.1 is context dependent. Furthermore, prior to identifying the lines shown in Figure 6F and 5SE, we were not aware of which lines to focus our investigations on. However, we have now demonstrated a few cellular contexts in which either chemical inhibition of PRC2 or knockdown of MTF2 results in de-repression of CCND1 expression.

      (3) At a bare minimum the authors should strongly discuss the limitations of the study, and tone down the conclusions.

      We would agree with this based upon the data in the original submitted manuscript, however, now that we have shown that this effect is more general, this is less critical. That said, we do not see this effect in all cell lines, and we have made this apparent in the final version of the manuscript.

      Minor point

      (1) In my view, Figures 1-3 should be shortened to the most essential points, and some data/figures should be moved to the supplementary figures. Especially the STING genenetwork graphs are in my view not particularly meaningful.

      While we understand the opinion of this reviewer, we feel that these data will be of significant interest to some readers.  

      (2) Figure 6E and 6F/G appear to be largely redundant. This can perhaps be made more concise.

      This has been addressed in the new version of Figure 6

      (3) Figure 5D should be enlarged. 

      We thank the reviewer for this suggestion and have enlarged the image.

      Reviewer #3 (Recommendations For The Authors):

      The manuscript could be edited to improve clarity. In several places, the scientific logic motivating an experiment is confusing, and there are several hypotheses and conclusions that seem opposite from what the data are suggesting. Some aspects of the figures were also unclear. Specific examples include the following:

      (1) Last sentence of abstract : "Our results demonstrate a role for PRC2.1, but not PRC2.2, in promoting G1 progression." Data show that knockout of PRC2.1 components promotes G1 progression through upregulation of CycD, so the conclusion here is the opposite.

      We thank the reviewer for catching this error. We have now changed this to “in antagonizing G1 progression”.

      (2) In the second paragraph of the results, CCNE1, CDK2, etc are described as scoring high for palbociclib resistance, but those genes scored as sensitizing. Also, in that paragraph, it is described that a drug is sensitizing cells to loss of a gene, which seems like incorrect logic. It should be clarified that knock-out of a gene either sensitizes or desensitizes cells to the drug.

      We thank the reviewer for catching this error. We have now corrected it.  

      (3) In the motivation for the experiment in Figure 3D, it is written: "we asked whether chemical inhibition of oxidative phosphorylation could rescue sensitivity to palbociclib". Considering that knock-out of genes that mediate oxidative phosphorylation confer resistance to palbociclib, it is confusing why it was expected that chemical inhibitors would restore sensitivity.

      We are sorry if the original wording was confusing. We have now changed this to “combined inhibition of oxidative phosphorylation and CDK4/6 activity mutually rescue the proliferation defect imposed by agents targeting the other process”.  

      (4) If the intention of Figure 3D is to test the hypothesis that chemical inhibition of oxidative phosphorylation modulates sensitivity to palbociclib, the clarity of Figure 3D would be improved if data were shown such that palbociclib concentration is on the x-axis and the different curves are different drug concentrations.

      It appears that there is some mutual suppression, which inhibition of each process rescues cells partly from inhibition of the other. In fact, with these drugs the stronger of the two is seen as the rescue of mitochondrial poisons by palbociclib. We have now discussed this in the text.  

      (5) The authors should check the units on the x-axis in Figure 4D, should they be log[uM Palbo] or log [nM Palbo]?

      We thank the reviewer for catching this error. We have now corrected it

      (6) It should be clarified which data are summarized in the graph to the right in Figure 4G, are these experiments with palbociclib?

      This is currently included in the figure legends.

      (7) The text suggests that the control CCNE1 knockout is shown in Figure 4E, but those data are missing.

      This has been corrected in Figure 4E.

      Several conclusions are not well supported by the data and should be revised or more data and analysis should be added.

      (1) The titular conclusion that the "PRC2.1 Subcomplex Opposes G1 Progression through Regulation of CCND1 and CCND2" has only been demonstrated in the context of a Cdk4/6 inhibitor in HAP1 cells. There is little evidence supporting this claim that is broadly applicable. For example, data in Figure 4G show small and not demonstrable significant differences in G1 and S phase populations in the mock experiments. Also, experiments in other cells are needed to support the rigor and generality of the conclusion.

      Our chemogenetic screen and competitive proliferation assay data in Figure 4A, 4C and 4E support the conclusion that PRC2.1 and PRC2.2 play opposing roles in G1 progression. Furthermore, we have repeated the initial BrdU incorporation experiments shown in Figure 4G and have been able to demonstrate that JARID2∆ cells do indeed display a significant decrease of cells entering into S-phase when treated with palbociclib. Most importantly, in the Figures 6D and 6E we show additional cell lines where this is the case.  Therefore, we feel that this title is valid in the current version of the manuscript, where we have shown it to be the case in multiple tumor-derived human cell lines as well as immortalized non-human primate cells.  

      (2) It is unclear how the data in Figure 3D support the conclusion that the administered inhibitors of oxidative phosphorylation influence response to palbociclib.

      As noted in the response to point 4, we have now discussed this mutual rescue more thoroughly in the text.  

      (3) In Figure 4D, the IC50 values should be calculated and statistical significance based on biological replicates should be determined. Also, the conclusion that "increasing doses of GSK126 withstood palbociclib-induced growth suppression" is overstated, as ultimately all drug conditions succumb to palbocilib suppression of proliferation, although there may be differences in sensitivity.

      We have now  included a statical analysis of each data point in Figure 4D.  

      Editorial comments:

      (1) The title does not seem to optimally capture the content of the paper. Please consider changing it, e.g. focusing on palbociclib resistance. 

      While we used this particular drug to make the original observation, we feel it is more general to discuss the underlying biology (cyclin gene control) than the pharmacological methodology. Moreover, we have now extended our findings about the regulation of D-type cyclins by PRC2.1 to several cell lines, derived from both cancers and primary cells, re-enforcing the fact that this effect is observed more broadly.   

      (2) Please indicate the biological system (haploid human HAP1 cells) in either title or abstract.

      The abstract now indicates that we have observed this in CML, breast cancer and immortalized primary cells.

    2. Reviewer #3 (Public review):

      This study begins with a chemogenetic screen to discover previously unrecognized regulators of the cell cycle. Using a CRISPR-Cas9 library in HAP1 cells and an assay that scores cell fitness, the authors identify genes that sensitize or desensitize cells to the presence of palbociclib, colchicine, and camptothecin. The results suggest that these three drugs inhibit proliferation through different mechanisms, and with each treatment, expected and unexpected pathways were found to affect drug sensitivity. The authors focus the rest of the experiments and analysis on the polycomb complex PRC2, as deletion of several of its subunits in the screen conferred palbociclib resistance. The authors find that PRC2, specifically a complex dependent on the MTF2 subunit, methylates histone 3 lysine 27 (H3K27) in promoters of genes associated with various processes including cell-cycle control. Further experiments demonstrate that Cyclin D expression increases upon loss of PRC2 subunits, providing a potential mechanism for palbociclib resistance.

      The strengths of the paper are the design and execution of the chemogenetic screen, which provides a wealth of potentially useful information. The data convincingly demonstrate in the HAP1 cell line that the MTF2-PRC2 complex sustains the effects of palbociclib (Fig. 4), methylates H3K27 in CpG-rich promoters (Fig. 5), and represses Cyclin D expression (Fig. 6). The correlation between MTF2-PRC2 inhibition and increased Cyclin D levels is shown in multiple cell lines using both genetic and chemical approaches. These results could be of great interest to those studying cell-cycle control, resistance mechanisms to therapeutic cell-cycle inhibitors, and chromatin regulation and gene expression.

      There are a few weaknesses that somewhat temper the overall quality and potential impact of the study. First, the results from the colchicine and camptothecin screens (Fig. 1 and 2) are not experimentally validated, which lessens the rigor of those data and conclusions. Second, some experiments validating and further exploring results from the palbociclib screen (Figs. 4 and 5) are restricted to the Hap1 cell line, so the generality of some conclusions is not established. Third, conclusions drawn from data in Fig. 4D are not fully supported by proper use of biological replicates and analysis of the results.

      Comments on revisions:

      Proper statistical analysis considering biological replicates is still not applied to determine whether differences in palbociclib IC50 values at different GSK126 concentrations are significant.

    1. Reviewer #2 (Public review):

      Summary:

      In this manuscript, Dr. Yao Li et al. documented the metabolomic profile of the aorta from OVX rats and that from OVX plus E2. These conditions mimic post-menopause hypertension and hormonal replacement therapy.

      Strengths:

      The authors state that this is probably the first study to examine the metabolic changes in the aorta of post-menopause hypertension.

      Weaknesses:

      There are several weaknesses, and a few of them are quite serious.

      (1) The aorta is not a resistant artery and has little to do with hypertension. The authors should have used resistant arteries for this study. The expression of several adrenergic receptors and cholinergic receptors in the aorta and resistant arteries are different. It is unknown whether the aorta metabolomic profile has any relevance to BP and whether they are similar to that of the resistant arteries. I understand the logistics issue of obtaining enough tissues from resistant arteries. At least, once some leads are discovered in the aorta, the authors should validate it in resistant arteries. This should be feasible.

      (2) The aorta and all the arteries have three layers. It is critically important to know whether the metabolic changes occur in the intima or in the media, while the adventitia probably has little to do with vasoconstriction and hypertension. If the authors want to use the aorta to conduct the preliminary study, they should completely remove the adventitia and then use samples with and without their endothelium stripped and then assess their metabolomic profiles. After the leads are obtained from this preliminary profiling, they should be validated in endothelium and smooth muscles of the resistant artery. The current experiments are not appropriately designed.

      (3) The tail-cuff BP measurement is a technique of the last century. The current gold standard of BP measurement is by telemetry. The tail-cuff method is particularly problematic in this study because the 1-2 h restraining of the rats for more than 10 times BP measurement will cause significant stress in the animal, and their stress hormone secretion might cause biased metabolomic profiles in the OVX versus shames operated mice. The problem can be totally avoided by using telemetry.

      (4) Although the L-AABA showed a high p-value (10^-4) of a decrease in the OVX rats, the fold change is small (2-3 folds). Such a small change should be validated using a different method to be convincing.

      (5) The authors claim (or hypothesize) that the reduced AABA level in OVX can cause vascular remodeling. This can be easily validated by the histology of the OVX-resistant artery, and they should do that during the revision. The authors should also examine the M1 macrophage function from the OVX mice to validate their claimed link of AABA to M1.

      (6) As mentioned above, the authors need to pinpoint the changes of AABA to target cells, i.e., endothelial cells, SMC, or M1, and then use in vitro or in vivo cell biology approaches to assess whether these cells in the OVX rat indeed have an abnormality in function and, indeed, such functional changes are responsible for the BP phenotype.

      (7) The results of the current study can be condensed into 1 or 2 figures that can serve as a base or a starting point for a deeper scientific study.

      Summary

      The experimental design of this manuscript is inappropriate, and the methods are not up to the current standards. The whole study is descriptive and rudimentary. It lacks validation and mechanism. The data from this manuscript might be of some value and can serve as the first step for more investigation of the mechanism of post-menopause hypertension.

    2. Author response:

      Reviewer #1 (Public review):

      Summary:

      The authors aim to investigate the relationship between low estrogen levels, postmenopausal hypertension, and the potential role of the molecule L-AABA as a biomarker for hypertension. By employing metabolomic analysis and various statistical methods, the study seeks to understand how estrogen deficiency affects blood pressure and identify key metabolites involved in this process, with a particular focus on L-AABA.

      Strengths:

      The study addresses a relevant and understudied area: the role of estrogen and metabolites in postmenopausal hypertension. It presents a novel hypothesis that L-AABA may serve as a protective factor against hypertension, which could have significant clinical implications if proven.

      We appreciate the acknowledgment of our study’s focus on an important and understudied area. Our hypothesis regarding L-AABA’s role as a possible protective factor against hypertension indeed holds promise for advancing clinical implications.

      Weaknesses:

      The evidence linking L-AABA to hypertension is largely correlative, lacking experimental validation or mechanistic proof. Key limitations, such as the inadequacy of the ovariectomy model in replicating human menopause, are acknowledged but not addressed with alternative approaches. In summary, while the study offers an intriguing hypothesis, its conclusions are premature and require further experimental validation and human data to substantiate the claims.

      We recognize the limitations regarding the correlative nature of our findings and the inadequacy of the OVX model in replicating human menopause. Future research will prioritize experimental validation and incorporate human studies to solidify our conclusions.

      Reviewer #2 (Public review):

      Summary:

      In this manuscript, Dr. Yao Li et al. documented the metabolomic profile of the aorta from OVX rats and that from OVX plus E2. These conditions mimic post-menopause hypertension and hormonal replacement therapy.

      Strengths:

      The authors state that this is probably the first study to examine the metabolic changes in the aorta of post-menopause hypertension.

      As pointed out by the reviewer, our study may be the first to investigate changes in aortic metabolism in postmenopausal hypertension. As an exploratory study, our goal is to depict the overall characteristics and explore possible research directions.

      Weaknesses:

      There are several weaknesses, and a few of them are quite serious.

      (1) The aorta is not a resistant artery and has little to do with hypertension. The authors should have used resistant arteries for this study. The expression of several adrenergic receptors and cholinergic receptors in the aorta and resistant arteries are different. It is unknown whether the aorta metabolomic profile has any relevance to BP and whether they are similar to that of the resistant arteries. I understand the logistics issue of obtaining enough tissues from resistant arteries. At least, once some leads are discovered in the aorta, the authors should validate it in resistant arteries. This should be feasible.

      We acknowledge the limitation of using the aorta and will aim to include studies on resistant arteries to validate our metabolomic findings.

      (2) The aorta and all the arteries have three layers. It is critically important to know whether the metabolic changes occur in the intima or in the media, while the adventitia probably has little to do with vasoconstriction and hypertension. If the authors want to use the aorta to conduct the preliminary study, they should completely remove the adventitia and then use samples with and without their endothelium stripped and then assess their metabolomic profiles. After the leads are obtained from this preliminary profiling, they should be validated in endothelium and smooth muscles of the resistant artery. The current experiments are not appropriately designed.

      Future studies will involve detailed profiling of specific arterial layers, focusing on the intima and media to enhance the relevance of our findings related to hypertension.

      (3) The tail-cuff BP measurement is a technique of the last century. The current gold standard of BP measurement is by telemetry. The tail-cuff method is particularly problematic in this study because the 1-2 h restraining of the rats for more than 10 times BP measurement will cause significant stress in the animal, and their stress hormone secretion might cause biased metabolomic profiles in the OVX versus shames operated mice. The problem can be totally avoided by using telemetry.

      We appreciate the suggestion and will consider telemetry for more accurate blood pressure measurements in future experiments to minimize stress-related bias.

      (4) Although the L-AABA showed a high p-value (10^-4) of a decrease in the OVX rats, the fold change is small (2-3 folds). Such a small change should be validated using a different method to be convincing.

      We plan to employ additional methods to validate the observed changes in L-AABA levels in the following research, ensuring robustness of our findings.

      (5) The authors claim (or hypothesize) that the reduced AABA level in OVX can cause vascular remodeling. This can be easily validated by the histology of the OVX-resistant artery, and they should do that during the revision. The authors should also examine the M1 macrophage function from the OVX mice to validate their claimed link of AABA to M1.

      We intend to conduct histological analyses and examine M1 macrophage function in OVX-resistant arteries to validate our hypothesis in the following research.

      (6) As mentioned above, the authors need to pinpoint the changes of AABA to target cells, i.e., endothelial cells, SMC, or M1, and then use in vitro or in vivo cell biology approaches to assess whether these cells in the OVX rat indeed have an abnormality in function and, indeed, such functional changes are responsible for the BP phenotype.

      Addressing these points, we aim to pinpoint specific cell types affected by AABA variations and conduct in vitro and in vivo studies to examine their physiological impacts in the following research.

      (7) The results of the current study can be condensed into 1 or 2 figures that can serve as a base or a starting point for a deeper scientific study.

      Thank you for your suggestion. As a omics research, our research approach may differ from traditional mechanism studies.

      Summary

      The experimental design of this manuscript is inappropriate, and the methods are not up to the current standards. The whole study is descriptive and rudimentary. It lacks validation and mechanism. The data from this manuscript might be of some value and can serve as the first step for more investigation of the mechanism of post-menopause hypertension.

      Reviewer #3 (Public review):

      Summary:

      The decrease in estrogen levels is strongly associated with postmenopausal hypertension. Dr. Yao Li and colleagues aimed to investigate the metabolomic mechanisms of underlying postmenopausal hypertension using OVX and OVX+E2 rat models. They successfully established a correlation between reduced estrogen levels and the development of hypertension in rats. They identified L-alpha-aminobutyric acid (AABA) as a potential marker for postmenopausal hypertension. The research explored the metabolic alterations in aortic tissues and proposed several potential mechanisms contributing to postmenopausal hypertension.

      Strengths:

      The group performed a comprehensive enrichment analysis and various statistical analyses of the metabolomics data.

      As summarized by the reviewer, our current study conducted a comprehensive analysis of metabolomics data. It is also a reliable foundation for further mechanism research.

      Weaknesses:

      (1) The manuscript is descriptive in nature, although they mentioned their primary objective is to explore the potential mechanisms linking low estrogen levels with postmenopausal hypertension. No mechanism insights have been interrogated in this study, which has been mentioned by the authors in the discussion. The connection between E2, AABA, and macrophage needs to be validated in endothelial cells, vascular smooth muscle cells, and other aortic tissue cells. Without such verification, the manuscript predominantly raises hypotheses only based on metabolomic data.

      We have proposed research hypotheses based on detailed omics data. Further research on the mechanisms involving endothelial and vascular smooth muscle cells to validate the pathway connections between E2, AABA, and macrophages is undoubtedly the future direction of this study.

      (2) The serum contains three forms of estrogen: Estradiol, Estrone, and Estriol. The authors used the Rat E2 ELISA kit. Ideally, all three forms of estrogen should be measured.

      Future assays will aim to measure Estradiol, Estrone, and Estriol to capture a more comprehensive picture of estrogen’s role in postmenopausal hypertension.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Corynebacterium glutamicum is an important organism with industrial applications, and it constitutes a model organism for the study of other Corynebacteriales, which include important pathogens such as Mycobacterium tuberculosis and Corynebacterium diphtheriae. This work provides with a thorough structural and functional characterization of the S-layer structure of C. glutamicum based on solid data acquired through protein engineering, structural biology, and cell microscopy/imaging studies, and phylogenetic analysis. The authors have determined an atomic structural model of the S-layer from C. glutamicum formed by the protein PS2 exhibiting a different degree of conservation between external and mycomembrane facing surfaces; and they show evidence in vivo of how the S-layer assemble at the cell poles in this organism in line with the actinobacterial elongasome. The authors also show that the presence of the S-layer provides resistance to lysozyme, elaborating several hypotheses that may explain this observation, and demonstrate PS2 S-layer as a feasible platform for covalent surface display both in vitro and in vivo.

      The conclusions are well supported by the data provided. When required, experiments have been performed with an adequate number of replicates and their statistical analysis is properly provided. The information provided in the data and methods´ section are sufficient for experimental reproducibility.

      No major comments

      Minor comments. Text and figures are clear but the following aspects/questions should be addressed/clarified:

      • While the authors indicate that "S-layers are two-dimensional monolayered crystals typically composed of a single (glyco)protein ..." (lines 23-24), I haven´t found a reference to whether PS2 is glycosylated or not in the text. If PS2 is glycosylated and its glycosylation sites are known, where would they locate in the glycosylated structure? Would glycosylation affect the size of the pores observed with the recombinant protein?
      • Related to the permeability of lysozyme through the S-layer. Have the authors considered the IP of the lysozyme, which (assuming it is hen white egg lysozyme) it´s > 9. Could the negatively charged PS2 nonspecifically capture/retain at the outward-facing surface? Could glycosylation have something to do with it too?
      • Clarify which AF version was used for predicting the structural model of PS2, AF2 (as described in the main text, lines 160 and 656) or AF3 (supplementary Fig. 2). If it was AF3, the corresponding reference should be updated.
      • Line 205. Replace "Analysis of its surface electrostatics reveals that PS2..." by "Analysis of the electrostatic potential surface of PS2 reveals that..."
      • Figure 2 - panels (e), (g), (i), (j). Cartoons and specially ball-and-stick representations colored in white are very hard to visualize or not visible. Please use a darker color or darken the edges to improve visibility. Labels in panel (g) are very small and difficult to see, please increase their size (panels (i) and (j) seem okay).
      • Line 215. "vivo and recombinant PS2AD" remove "and".
      • Line 1011-12. Replace "...positive in blue to negative in red" by "...positive and negative charges are colored in blue and red, respectively"
      • Supplementary figure 3. Reduce the label size in the why axis (probability)
      • Supplementary figure 5. It is labeled as "4" instead of "5". The view captured in the figure does not allow to visualize the interface clearly. Please consider another orientation or an alternative representation such an open-book view.
      • Supplementary table 1. Replace "BL21(DH3)" by "BL21 (DE3)"
      • Review abbreviations, italics, spaces between number and units... (page 517, C. glutamicum in italic; choose either cryoEM or cryo-EM abbreviation for consistency...)

      Significance

      From my point of view as structural biologist, the present work uncovers key aspects of PS2 S-layer architecture and assemble over the OM of C. glutamicum, together with evidences in vivo of how the S-layer incorporated at cell poles alongside with the bacterial elongasome, and data supporting an implication of the PS2 S-layer in cell envelop stability. These findings constitute an important advance in the structural and functional understanding of S-layer in corynebacteriales, at the time it opens new questions regarding the role of the S-layer in cell integrity and as an interaction interface with external factors. Moreover, the authors present data supporting the feasibility of the PS2 S-layer of C. glutamicum as a protein-based platform to anchor with potential application in bioengineering materials and synthetic biology, and which has the potential to expand the biotechnological/industrial applications of C. glutamicum. Thus, this work has a significant relevance for the scientific community investigating the structure, function and biogenesis of the S-layer and the cell envelop, in particular. But also in a more general outlook, for the scientific community working in the fields of host-pathogen interactions and bioengineering materials.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this study, the authors investigate the structure of the surface layer, or S-layer, in the Corynebacteriales organism Corynebacterium glutamicum. Studies of the S-layer in Corynebacteriales are lacking and both the function and assembly of the S-layer is unclear in the context of the unique cell envelope of these microbes. In other bacterial systems, the S-layer has been implicated in many critical biological processes. The authors report the ex vivo cryo-EM structure of the S-layer protein PS2 from C. glutamicum, which shows hexameric symmetry with additional trimeric interfaces. The authors show that the C-terminal membrane anchor domain, which is not resolved in the reported structure, is important for lipid binding based on heterologous expression experiments in E. coli. Used the Spytag/Spycatcher tagging system and fluorescence microscopy, the authors determine that S-layer assembly occurs at the cell poles and is likely coordinated the polar growth mechanism of C. glutamicum.

      Major comments:

      1. The description of the different types of symmetry within the PS2 structure was confusing and difficult to correlate with the structure as depicted in Fig. 2. Authors should clarify labeling and coloring in Fig. 2e, g-j.
      2. Investigation of the function of the S-layer as a permeability barrier (Fig. 3e) would be strengthened by testing susceptibility of cells +/- S-layer to different classes of antibiotics or osmotic shock (optional).
      3. Due to the probable importance of the membrane-anchoring domain on S-layer function, can the authors comment on potential predicted structure of the regions of the membrane anchoring domain that was not resolved in their structure? How does this region differ between different Corynebacteriales species (or in S-layer proteins in mycobacterial species) that have different mycomembrane dimensions?
      4. The authors need to clarify if the version of PS2 used in the live cell imaging experiments detailed in Fig. 4d-f are PS2AD or PS2FL. While they show that PS2AD-Spytag is able to self-assemble, it is possible that the dynamics of PS2AD assembly in vivo are very different from PS2FL due to the absence of the membrane-anchoring domain. Comparison of dynamics between these two constructs would also be a nice addition to the paper.
      5. The pulse labeling experiments using Spycatcher would be strengthened by including fluorescent D-amino acids within the same cells to show true co-localization of S-layer assembly and PG synthesis. This could also shed light on the timescale of S-layer assembly in relation to biogenesis of other layers of the cell envelope.

      Minor comments:

      1. line 71: authors should elaborate on terminology "P6 symmetry"
      2. In Fig. 1g, it is not immediately clear that there is lattice formation. Authors should consider including an inset zoomed in box to make this clearer.
      3. line 165, 171, 194: do the authors mean "protomers"? If not, use of the word "promoter" is confusing
      4. citation on line 374 is incorrect. The cited paper focuses on inner membrane proteins that transport mycolic acid. Also, many of the mycomembrane porins, especially in C. glutamicum do not have a beta-barrel structure (see Ziegler et al, JMC, 2008)
      5. Citations detailing polar assembly of other envelope layers would provide additional support for generalized polar assembly of Corynebacteriales cell envelope (arabinogalactan- Marando et al, JACS, 2022) (mycolic acid biosynthesis proteins- Thouvenel et al, Sci Reports, 2023)
      6. Some of the supplementary figures are not referenced in the text

      Significance

      This study is of broad interest to both the Corynebacteriales and S-layer fields. The study is thorough and detailed but could be strengthened by some clarification in how the structure is presented and further discussion of the biological implications of the membrane anchoring domain. There is a long-held interest in understanding how the unique Corynebacteriales cell envelope is assembled, and the work contributes substantially to the field.

      Reviewer expertise: bacterial genetics, bacterial cell envelope, protein transport

    1. Author response:

      The following is the authors’ response to the original reviews.

      eLife Assessment

      This useful study reports on the discovery of an antimicrobial agent that kills Neisseria gonorrhoeae. Sensitivity is attributed to a combination of DedA assisted uptake of oxydifficidin into the cytoplasm and the presence of a oxydifficidin-sensitive RplL ribosomal protein. Due to the narrow scope, the broader antibacterial spectrum remains unclear and therefore the evidence supporting the conclusions is incomplete with key methods and data lacking. This work will be of interest to microbiologists and synthetic biologists.

      General comment about narrow scope: The broader antibacterial spectrum of oxydifficidin has been reported previously (S B Zimmerman et al., 1987). The main focus of this study is on its previously unreported potent anti-gonococcal activity and mode of action. While it is true that broad-spectrum antibiotics have historically played a role in effectively controlling a wide range of infections, we and others believe that narrow-spectrum antibiotics have an overlooked importance in addressing bacterial infections. Their advantage lies in their ability to target specific pathogens without markedly disrupting the human microbiota.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      Kan et al. report the serendipitous discovery of a Bacillus amyloliquefaciens strain that kills N. gonorrhoeae. They use TnSeq to identify that the anti-gonococcal agent is oxydifficidin and show that it acts at the ribosome and that one of the dedA gene products in N. gonorrhoeae MS11 is important for moving the oxydifficidin across the membrane.

      Strengths:

      This is an impressive amount of work, moving from a serendipitous observation through TnSeq to characterize the mechanism by which Oxydifficidin works.

      Weaknesses:

      (1) There are important gaps in the manuscript's methods.

      The requested additions to the method describing bacterial sequencing and anti-gonococcal activity screening will be made. However, we do not think the absence of these generic methods reduces the significance of our findings.

      (2) The work should evaluate antibiotics relevant to N. gonorrhoeae.

      (1) It is not clear to us why reevaluating the activity of well characterized antibiotics against known gonorrhoeae clinical strains would add value to this manuscript. The activity of clinically relevant antibiotics against antibiotic-resistant N. gonorrhoeae clinical isolates is well described in the literature. Our use of antibiotics in this study was intended to aid in the identification of oxydifficidin’s mode of action. This is true for both Tables 1 and 2.

      (2) If the reviewer insists, we would be happy to include MIC data for the following clinically relevant antibiotics: ceftriaxone (cephalosporin/beta-lactam), gentamicin (aminoglycoside), azithromycin (macrolide), and ciprofloxacin (fluoroquinolone).

      (3) The genetic diversity of dedA and rplL in N. gonorrhoeae is not clear, neither is it clear whether oxydifficidin is active against more relevant strains and species than tested so far.

      (1) We thank the reviewer for this suggestion. We aligned the DedA sequence from strain MS11 with DedA proteins from 220 N. gonorrhoeae strains that have high-quality assemblies in NCBI. The result showed that there are no amino acid changes in this protein. Using the same method, we observed several single amino acid changes in RplL. This included changes at A64, G25 and S82 in 4 strains with one change per strain. These sites differ from R76 and K84, where we identified changes that provide resistance to oxydifficidin. Notably, in a similar search of representative Escherichia, Chlamydia, Vibrio, and Pseudomonas NCBI deposited genomes, we did not identify changes in RplL at position R76 or K84.

      (2) While the usefulness of screening more clinically relevant antibiotics against clinical isolates as suggested in comment 2 was not clear to us, we agree that screening these strains for oxydifficidin activity would be beneficial. We have ordered Neisseria gonorrhoeae strain AR1280, AR1281 (CDC), and Neisseria meningitidis ATCC 13090. They will be tested when they arrive.

      Reviewer #2 (Public Review):

      Summary:

      Kan et al. present the discovery of oxydifficidin as a potential antimicrobial against N. gonorrhoeae, including multi-drug resistant strains. The authors show the role of DedA flippase-assisted uptake and the specificity of RplL in the mechanism of action for oxydifficidin. This novel mode of action could potentially offer a new therapeutic avenue, providing a critical addition to the limited arsenal of antibiotics effective against gonorrhea.

      Strengths:

      This study underscores the potential of revisiting natural products for antibiotic discovery of modern-day-concerning pathogens and highlights a new target mechanism that could inform future drug development. Indeed there is a recent growing body of research utilizing AI and predictive computational informatics to revisit potential antimicrobial agents and metabolites from cultured bacterial species. The discovery of oxydifficidin interaction with RplL and its DedA-assisted uptake mechanism opens new research directions in understanding and combating antibiotic-resistant N. gonorrhoeae. Methodologically, the study is rigorous employing various experimental techniques such as genome sequencing, bioassay-guided fractionation, LCMS, NMR, and Tn-mutagenesis.

      Weaknesses:

      The scope is somewhat narrow, focusing primarily on N. gonorrhoeae. This limits the generalizability of the findings and leaves questions about its broader antibacterial spectrum. Moreover, while the study demonstrates the in vitro effectiveness of oxydifficidin, there is a lack of in vivo validation (i.e., animal models) for assessing pre-clinical potential of oxydifficidin. Potential SNPs within dedA or RplL raise concerns about how quickly resistance could emerge in clinical settings.

      (1) Spectrum/narrow scope: The broader antibacterial spectrum of oxydifficidin has been reported previously (S B Zimmerman et al., 1987). The focus of this study is on its previously unreported potent anti-gonococcal activity and its mode of action. While it is true that broad-spectrum antibiotics have historically played a role in effectively controlling a wide range of infections, we and others believe that narrow-spectrum antibiotics have an overlooked importance in addressing bacterial infections. Their advantage lies in their ability to target specific pathogens without markedly disrupting the human microbiota.

      (2) Animal models: We acknowledge the reviewer’s insight regarding the importance of in vivo validation to enhance oxydifficidin’s pre-clinical potential. However, due to the labor-intensive process needed to isolate oxydifficidin, obtaining a sufficient quantity for animal studies is beyond the scope of this study. Our future work will focus on optimizing the yield of oxydifficidin and developing a topical mouse model for subsequent investigations.

      (3) Potential SNPs: Please see our response to Reviewer #1’s comment 3. We acknowledge that potential SNPs within dedA and rplL raise concerns regarding clinical resistance, which is a common issue for protein-targeting antibiotics. Yet, as pointed out in the manuscript, obtaining mutants in the lab was a very low yield endeavor.

      Reviewer #3 (Public Review):

      Summary:

      The authors have shown that oxydifficidin is a potent inhibitor of Neisseria gonorrhoeae. They were able to identify the target of action to rplL and showed that resistance could occur via mutation in the DedA flippase and RplL.

      Strengths:

      This was a very thorough and clearly argued set of experiments that supported their conclusions.

      Weaknesses:

      There was no obvious weakness in the experimental design. Although it is promising that the DedA mutations resulted in attenuation of fitness, it remains an open question whether secondary rounds of mutation could overcome this selective disadvantage which was untried in this study.

      We thank the reviewer for the positive comment. We agree that investigating factors that could compensate for the fitness attenuation caused by DedA mutation would enhance our understanding of the role of DedA.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      (1) The use of the term "N. gonorrhoeae wildtype" should not be used. It is uninformative, as the species contains a large amount of diversity. Instead, please name the strain. From Figure 1, it looks like the authors used MS11. Since MS11 is a longstanding lab strain and likely does not reflect circulating N. gonorrhoeae, and since H041 is no longer in circulation, the authors should ideally test the compound against more representative strains of N. gonorrhoeae. This includes panels of isolates available through the CDC, for example (https://www.cdc.gov/drugresistance/resistance-bank/index.html). I encourage the authors to include FC428 or another recently identified isolate with the penA 60 allele to demonstrate oxydifficidin's activity against contemporary concerning isolates/lineages.

      (1) “N. gonorrhoeae MS11” is now used instead of “N. gonorrhoeae WT” in this manuscript.

      (2) In our revised manuscript, we have added MIC data for recently identified Neisseria gonorrhoeae isolates AR#1280 and AR#1281 which contain the penA 60 allele (Table 1). The data shows oxydifficidin maintains its potent activity against these multidrug-resistant strains. We also added a description of this data to the results section as shown below.

      Original text: “Oxydifficidin was more potent against N. gonorrhoeae MS11 than almost all other antibiotics we tested. In fact, it was only slightly less active than the highly optimized third-generation cephalosporin, ceftazidime.([18]) However, unlike third-generation cephalosporins, oxydifficidin retained activity against the multidrug resistant H041 clinical isolate (Table 1).([4]) H041 is resistant to the “standard of care” cephalosporin ceftriaxone (2 µg/mL) as well as a number of other antibiotics that are normally active against N. gonorrhoeae (penicillin G, 4 µg/mL; cefixime, 8 µg/mL; levofloxacin, 32 µg/mL).”

      Changed to: “Oxydifficidin was more potent against N. gonorrhoeae MS11 than most other antibiotics we tested. Notably, unlike clinically used antibiotics such as ceftriaxone, azithromycin, and ciprofloxacin, oxydifficidin retained activity against all multidrug-resistant clinical isolates we examined (Table 1).” (Line 77-79)

      (2) Does oxydifficidin have activity against N. meningitidis? It is the species most closely related to N. gonorrhoeae and the other pathogenic Neisseria.

      Oxydifficidin has potent activity against N. meningitidis ATCC 13090. In our revised manuscript, we have included its MIC data in Figure 1c.

      (3) Given claims that oxydifficidin activity in N. gonorrhoeae as compared to other Neisseria reflects N. gonorrhoeae's dedA and sensitive rplL, it would be good to assess the allelic diversity of these genes in N. gonorrhoeae. There are over 20,000 genomes from clinical isolates of N. gonorrhoeae in databases. It should be straightforward to check whether dedA and rplL allelic variants already exist in the population. Should variants be observed, oxydifficidin should be tested against the associated strains of N. gonorrhoeae.

      Response: We thank the reviewer for this suggestion. We aligned the DedA sequence from strain MS11 with DedA proteins from 220 N. gonorrhoeae strains that have high-quality assemblies in NCBI. The result showed that there are no amino acid changes in this protein. Using the same method, we observed several single amino acid changes in RplL. This included changes at A64, G25 and S82 in 4 strains with one change per strain. These sites differ from R76 and K84, where we identified changes that provide resistance to oxydifficidin. Notably, in a similar search of representative Escherichia, Chlamydia, Vibrio, and Pseudomonas NCBI deposited genomes, we did not identify changes in RplL at position R76 or K84.

      New text: “A survey of 220 N. gonorrhoeae strains with high-quality assemblies in NCBI found no mutations in the DedA protein.” (Line 104-105)

      “These two mutations were not found in the survey of the same collection of N. gonorrhoeae strains used to look for DedA mutations.” (Line 143-144)

      (4) Clinically relevant antibiotics for N. gonorrhoeae are penicillin, tetracycline, spectinomycin, gentamicin, ciprofloxacin, azithromycin, ceftriaxone; moreover, zoliflodacin and gepotidacin have reportedly successfully completed phase 3 trials. The authors should redo their MIC testing with these antibiotics (e.g., for Figures 1 and 2 and Tables 1 and 2), both because this will enable direct comparison with the many clinical isolates that have undergone testing and because these are the drugs most pertinent to clinical practice. Ampicillin, ceftazidime, chloramphenicol, bacitracin, and daptomycin are not relevant. Could the authors explain why they tested vancomycin, polymyxin B, irgasan, melittin, avilamycin, and thiostrepton?

      Our use of antibiotics with diverse modes of action (e.g. vancomycin, polymyxin B, irgasan, melittin, avilamycin, and thiostrepton) in this study was intended to aid in the identification of oxydifficidin’s mode of action. This is true for both Tables 1 and 2.

      To address the reviewer’s concern, in our revised manuscript, we have added MIC data for the following clinically relevant antibiotics: ceftriaxone (cephalosporin/beta-lactam), gentamicin (aminoglycoside), azithromycin (macrolide), and ciprofloxacin (fluoroquinolone) to Table 1.

      (5) Please describe the characteristics of the transposon library (finding four transposons in a single strain does seem unexpected, given how most transposon libraries aim for one transposon insertion per strain).

      We understand that one transposon insertion per strain is ideal for transposon libraries. This Bacillus strain proved to be recalcitrant to genetic manipulation. In the rare cases where we obtained resistance colonies upon electroporation with the transposon, all colonies contained multiple (≥ 4) transposon insertions. This made it impractical to build a library with one transposon insertion per library member.

      We assumed that the anti-N. gonorrhoeae activity most likely originated from a natural product BGC, which typically range from 10-100 kb in size.

      Based on the average of 50 kb per BGC, ~80 transposon insertions would be required to fully search the 4.2 Mb genome of Bacillus amyloliquefaciens BK for a BGC. At 4 mutations per transformant, 1x coverage of the genome would require only 20 library members.

      After extensive electroporation of transposon into Bacillus amyloliquefaciens BK, we were able to obtain a library of 50 members, including one mutant (Tn5-3) that lacked anti-N. gonorrhoeae activity.

      New text added to the methods section:

      “A library containing 50 transposon mutants was obtained. In the mutants examined, each strain contained ≥4 transposon insertions” (Line 337-339)

      (6) Please describe in the methods how you sequenced and annotated the genome of Bacillus amyloliquefaciens BK.

      The sequencing method is now described in “Genomic Sequencing and annotation of Bacillus amyloliquefaciens” section. The genome of Bacillus amyloliquefaciens BK was not fully annotated. Mutations were identified as described in the updated methods section below.

      New text:

      “Genomic Sequencing and annotation of Bacillus amyloliquefaciens

      Genomic DNA from Bacillus amyloliquefaciens BK WT and transposon mutant Tn5-3 was isolated using PureLink Microbiome DNA purification kit (Invitrogen) according to the manufacturer’s instructions.

      The Bacillus amyloliquefaciens BK WT genome was assembled by mapping its sequencing data onto the annotated genome of Bacillus amyloliquefaciens FZB42 using Geneious Prime. Differences in the mutant strain Tn5-3 were identified by mapping its sequencing data onto the assembled Bacillus amyloliquefaciens BK WT genome. The mutated genes were then annotated using NCBI BLAST. The oxydifficidin BGC was annotated using the antiSMASH online server.” (Line 253-260)

      (7) Please describe in the methods how you screened the library for strains that lacked anti-gonococcal activity.

      The method is added to our revised manuscript as section “Screening of Bacillus Strains Lacking Anti-N. gonorrhoeae Activity”.

      New text:

      “Screening of Bacillus Strains Lacking Anti-N. gonorrhoeae Activity

      The transposon mutants of Bacillus amyloliquefaciens BK were grown overnight in LB medium at 30 °C. Each overnight culture was then diluted 1:5000, and 1 μl of the diluted culture was spotted onto a GCB agar plate swabbed with N. gonorrhoeae cells. The plate was then incubated overnight at 37 °C with 5% CO2. The mutant strain (Tn5-3) lacking anti-N. gonorrhoeae activity was identified due to its failure to produce a zone of growth inhibition in the resulting N. gonorrhoeae lawn.” (Line 341-346)

      (8) Was only one strain found that was a 'non-producer' of anti-N. gonorrhoeae activity? Line 68 suggests that this was only one of multiple non-producers. Is that correct? If so, did you work up the others, and did they also have disruptions in the same biosynthetic gene cluster?

      Only one strain was identified as a “non-producer” of anti-N. gonorrhoeae activity. We have modified the text to clarify this point.

      Original text: “The sequencing of one non-producer strain revealed that it surprisingly contained four transposon insertions and one frame shift mutation.”

      Changed to: “The sequencing of the non-producer strain revealed that it surprisingly contained four transposon insertions and one frame shift mutation.” (Line 53-54 )

      (9) All sequences (including Bacillus amyloliquefaciens BK) must be deposited in a public database (e.g., NCBI) and the accession numbers reported in the manuscript.

      Genomic sequence data of Bacillus amyloliquefaciens BK has been deposited in GenBank, and its accession number (GCA_019093835.1) now appears in figure legend of Figure S1a.

      Figure S1a legend:

      “Genome-based phylogenetic tree containing Bacillus amyloliquefaciens BK and closely related Bacillus spp. The tree was built by Genome Clustering of MicroScope using neighbor-joining method. The NCBI accession numbers of Bacillus strains used in the tree are GCA_000196735.1, GCA_000204275.1, GCA_000015785.2, GCA_019093835.1, GCA_000009045.1, GCA_000011645.1, GCA_000172815.1, GCA_000008005.1, and GCA_000007845.1 (from top to bottom).”

      Minor

      (10) Statements in the article would benefit from fact-checking. For example:

      - gonorrhea is not the second most prevalent sexually transmitted infection worldwide; it is the second most reported bacterial sexually transmitted infection.

      - Treatment is ceftriaxone 500mg IM x1 in the US, but 1g IM x1 in the UK and Europe. The UK guidelines also permit ciprofloxacin, should sequencing indicate gyrA 91S. I suggest reviewing / specifying which treatment guidelines you're referring to.

      We appreciate the reviewer’s corrections. The word “prevalent” is now changed to “reported”.

      Original text: “Gonorrhea, which is caused by Neisseria gonorrhoeae, is the second most prevalent sexually transmitted infection worldwide.”

      Changed to: “Gonorrhea, which is caused by Neisseria gonorrhoeae, is the second most reported sexually transmitted infection worldwide.” (Line 2-3)

      Original text: “Gonorrhea is the second most prevalent sexually transmitted infection worldwide, its causative agent is the bacterium Neisseria gonorrhoeae.”

      Changed to: “Gonorrhea is the second most reported sexually transmitted infection worldwide, its causative agent is the bacterium Neisseria gonorrhoeae.” (Line 18-19)

      “In the USA” is now added to the sentence stating gonorrhea treatment.

      Original text: “The high dose (500 mg) of the cephalosporin ceftriaxone is currently the only recommended therapy for treating gonorrhea infections.”

      Changed to: “The high dose (500 mg) of the cephalosporin ceftriaxone is currently the only recommended therapy for treating gonorrhea infections in the USA.” (Line 20-22)

      (11) Please make sure all results are in the results section. The report of cell morphology, for example, should be in the results, not the discussion.

      In our revised manuscript, we have included the cell morphology data in the results section with the text changes below.

      Original text: “Interestingly, not only was dedA deficient N. gonorrhoeae less susceptible to oxydifficidin, oxydifficidin also kills this mutant more slowly (Figure 2b) than WT N. gonorrhoeae MS11.”

      Changed to: “Interestingly, not only was dedA deficient N. gonorrhoeae less susceptible to oxydifficidin, oxydifficidin also kills this mutant more slowly (Figure 2b) than WT N. gonorrhoeae MS11. The dedA deletion mutant also showed an altered cell morphology with reduced membrane integrity and lower formation of micro-colonies (Figure S4). (Line 100-104)

      Original text: “The dedA deletion mutant also showed an altered cell morphology with reduced membrane integrity and lower formation of micro-colonies (Figure S4), indicating that it should show reduced pathogenesis and fitness, and, as a result, not accumulate in a clinical setting, which adds to the therapeutic appeal of oxydifficidin.”

      Changed to: “The dedA deletion mutant exhibited altered cell morphology, characterized by diminished membrane integrity and reduced micro-colony formation, indicating that it should show reduced pathogenesis and fitness, and, as a result, not accumulate in a clinical setting, which adds to the therapeutic appeal of oxydifficidin” (Line 206-210)

      (12) Tables 1 and 2 should be combined and should address the most relevant antibiotics

      The MIC data of additional relevant antibiotics are now included in Table 1. However, we still believe that keeping Tables 1 and 2 separate enhances the clarity of the manuscript. Table 2 specifically focuses on diverse ribosomal targeting antibiotics, which highlights the unique binding site of oxydifficidin.

      (13) Supplemental Figure 1a. The tree could be better resolved, and there are four entries with the identical listing of "Bacillus amyloliquefaciens subsp. plantarum" on different branches. In the methods or the legend, please indicate the accession numbers for these genomes. Also please specify how this tree was made-is it a maximum likelihood tree? Something else?

      The tree is now better resolved and includes new entries. The requested information regarding accession numbers and tree construction method has been included in the figure legend.

      New supplemental Figure 1a legend:

      “a. Genome-based phylogenetic tree containing Bacillus amyloliquefaciens BK and closely related Bacillus spp. The tree was built by Genome Clustering of MicroScope using neighbor-joining method. The NCBI accession numbers of Bacillus strains used in the tree are GCA_000196735.1, GCA_000204275.1, GCA_000015785.2, GCA_019093835.1, GCA_000009045.1, GCA_000011645.1, GCA_000172815.1, GCA_000008005.1, and GCA_000007845.1 (from top to bottom).”

      Reviewer #2 (Recommendations For The Authors):

      The conclusions drawn in the manuscript are well-supported by the experimental data presented.

      I have the below minor comments:

      (1) "serendipitously identified" - I feel this wording should be avoided throughout the manuscript. The point of a research paper is to communicate methodology and experimental detail, and this language portrays the opposite.

      While we agree that methodology and experimental procedures are paramount in scientific reporting, we believe it is equally important to convey, particularly to younger generations, that a part of the scientific process is often unplanned and can benefit from chance observations. Therefore, we would like to keep this wording.

      (2) The introduction should include the biological roles/function of DedA proteins in bacteria.

      DedA proteins perform a wide array of biological roles and functions in bacteria. In the results section (Line 107-116), we have described the most well-established of these functions, particularly the flippase activity, which appears to be directly related to oxydifficidin sensitivity. We believe that introducing this information in the results section enhances the manuscript’s clarity and flow.

      (3) "When we screened this contaminant for antibacterial activity against lawns of other Gram-negative bacteria it did not produce a zone of growth of inhibition against any of the bacteria we tested (e.g., Escherichia coli, Vibrio cholerae, Caulobacter crescentus)." Can these data Figures be included in the Supplements?

      This result was recorded in the lead author’s notebook, but no image was saved.

      (4) Line 52: Was any base analyses performed on the Tn-mutants i.e., how many insertion-sites? Depth of mutants? Was a library constructed in this study or previously? Why were only BGC assessed?

      Please see our response to Reviewer #1’s comment (5). We focused on BGCs because we believed the anti-N. gonorrhoeae activity most likely resulted from a molecule encoded by a natural product BGC.

      (5) Line 98: Do the other 2 predicted DedA-like proteins also have a role in uptake of oxydifficidin? Is there some redundancy in uptake?

      We generated knockout mutants for two other predicted DedA-like proteins in N. gonorrhoeae MS11, and the MIC of oxydifficidin for these mutants remained the same as for the N. gonorrhoeae MS11 wild type strain. Therefore, we believe that the DedA protein discussed in this manuscript is the primary transporter of oxydifficidin. However, we cannot completely rule out the possibility of redundancy in oxydifficidin uptake by other DedA-like proteins.

      New text: “We also generated deletion mutants for two other predicted dedA-like genes, and the MIC of oxydifficidin for these mutants remained the same as for the N. gonorrhoeae MS11 wild type strain.” (Line 98-100)

      Reviewer #3 (Recommendations For The Authors):

      This is a well presented manuscript and I could not immediately see any issues with it.

      We appreciate the reviewer’s positive feedback.

    1. Author response:

      We are submitting a revised manuscript with major additions that address the main concerns in the initial reviews. At the highest level, this revision provides i) orthogonal biochemical measurements that yield concrete evidence of lysosomal protein aggregates, and ii) a plausible mechanism linking lysosomal lipid handling and protein aggregation through disruption of ESCRT function. We believe these additions significantly improve the completeness of this study and the conclusions that can be drawn from the data.

      Below are more specific highlights on the addition in this revision:

      -       We included orthogonal techniques (thioflavin-T staining and Lyso-IP followed by differential extraction) and confirmed the accumulation of RIPA-insoluble protein aggregates at the lysosomes in cells under lipid perturbation (Figure 3).

      -       We performed TMT-Proteomics and identified accumulation of insoluble ESCRT components at the lysosomes under lipid perturbation (Figure 4). Two new authors involved in this effort are added onto the manuscript.

      -       The ESCRT result prompted us to revisit lysosomal membrane integrity. With improved imaging conditions and analysis we were able to see increased membrane permeabilization under lipid perturbation. VPS4A overexpression partially rescued this phenotype, suggesting that lipid accumulation impairs ESCRT disassembly (Figure 5).

      -       Together, the results suggest that lipid perturbation impairs ESCRT function, compromising both lysosomal membrane repair and microautophagy, resulting in the accumulation of endogenous protein aggregates at the lysosomes (Graphical Abstract).

      Reviewer #1 (Recommendations For The Authors):

      (1) Perhaps the most prominent limitation of this work is the unilateral focus on native cells (i.e. cells under no endogenous or exogenous stress) as the model for protein aggregate formation. Furthermore, although the ProteoStat stain has been utilized by many investigators before, the sole reliance on this stain as the read-out for their assays is concerning. To compound the concern, the ProteoStat-positive puncta co-localize with lysosmal markers which was surprising even to the authors. All in all, it behooves the authors to test proteostasis in multiple parallel ways to actually define what they are studying. How is it possible that protein aggregates under native conditions are only co-localized with lysosomes? Are we really studying protein aggregates which should predominantly be cytoplasmic insoluble aggregates?

      (a) They need to get away from a simple stain like ProteoStat and conduct co-stainings with other markers such as poly-ubiquitin antibodies and other chaperones to define what and where else exactly are these aggregates.

      Co-staining with poly-ubiquitin was included in the original manuscript. We added orthogonal staining with another widely used amyloid dye, Thioflavin-T, and provided fine-grained quantification of lysosomal vs cytosolic localization of various signals (Figures S4A-C & 3A-B).

      (b) They need to do Immunoblots with and without triton insolubility to see if these aggregates are insoluble as most would predict. They can do lysosomal isolation vs cytoplasmic to see if the insoluble aggregates are really lysosomal.

      We performed Lyso-IP followed by differential detergent extraction to confirm the accumulation of insoluble proteins at the lysosomes (Figure 3C). Proteomic analysis identified some of these insoluble proteins as ESCRT subunits (Figure 4).

      (c) They should compare aggregate formation in the native state versus cells with lysosomal inhibition via Bafilomycin or chloroquine versus cells with proteosomal inhibition. The lysosomal inhibition experiments are particularly informative given the lysosomal relevance they have uncovered.

      We included other small molecule inhibitors and at different time points to compare the effect of different modes of proteostasis challenge (Figure S4A-D). Together with the ESCRT finding, our results suggest the role of microautophagy in our system, and provide a model of how ProteoStat- and/or ubiquitin- positive substrates become partitioned between the cytoplasm and lysosomes under different perturbations.

      (d) Many protein aggregates which are too bulky for proteosome degradation will traditionally be dealt with by aggrephagy. Why is this not observed?

      Knockdown of core macroautophagy components did not impact Proteostat intensity in our CRISPRi screen, suggesting that basal macroautophagy plays a negligible role in clearing endogenous amyloid-like structures in our experimental system. We provide an alternative model that these aggregates instead arrive at the lysosomes via microautophagy.

      (2) After addressing #1, they can validate if the genes they identified by CRISPR screens are also important in modulation of protein aggregate burden in other systems. For example, if they inhibit lysosomes by Bafilo or Chloroquine to obtain protein aggregates and then Knockdown the identified genes in the CRISPR screens, will they get the same results?

      We addressed the effect of different modes of proteostasis challenge as recommended above. Deacidifying the lysosomes alone causes intense protein aggregation (Figure S4A-D) and eventually cell death, and was thus not combined with other perturbations.

      (3) They identify lysosomal lipid metabolism genes/pathways as the culprit for inducing proteostasis. In particular sphingolipid and cholesteryl ester species appear to be operational here. However, there are no specific lipids species or specific lipid metabolism gene that is causative. Rather, you have to knockdown entire processes to have an effect. This suggests that the focus on lysosome health (i.e. permeability, proteolysis, etc) is rudimentary. When you have to knockdown entire classes of lipids, this would indicate more broad effects on cellular lipids (including membrane lipids beyond the lysosome) and related cellular health?

      We included data on the effect of knocking down MYLIP, PSAP, and as a comparison PSMD2 on the growth rate of K562 cells (Figure S5A). MYLIP and PSAP KDs, which cause predominantly an accumulation of lipids, do not impede cell growth. Increasing lipid uptake by MYLIP KD increases cell proliferation under our culture conditions, suggesting a general negative impact on cell health was not required for the association between lipid levels and protein aggregates.

      (a) They conduct a superficial methyl-beta-cyclodextrin experiment with equivocal results. The use of MBCD for different time-courses to deplete various membrane cholesterol pools including the plasma membrane pool is important to ascertain what aspect of the cellular cholesterol is affecting proteostasis. MBCD +/- cholesterol reintroduction time-courses for rescue will also be key to determine the culprit cellular cholesterol pool.

      The MBCD / Filipin experiment helped us determine that ProteoStat doesn’t directly stain cholesterol, nor any major plasma membrane components. Free cholesterol was implicated in neither the screen nor the lipidomics and was not the subject of targeted experiments.

      (b) The same concept can be applied to sphingolipids. There are sphingolipids in abundance in multiple membrane compartments. Which ones are causal here? More nuanced evaluation of this with sphingolipid staining/tracking can be conducted.

      We attempted experiments where sphingolipids were added back to cells grown in FBS-depleted media. Nevertheless, we were not able to consistently deliver these lipid species and doing so while ensuring the correct subcellular localization at physiologically relevant level would require substantial methods development.

      (c) As part of this, are lipid rafts and/or caveolae being affected by the perturbations in cholesterol and sphingolipids? Lipid rafts are highly enriched in these 2 lipids which could link to their preteostasis observation.

      Indeed, ceramides released from SM hydrolysis are proposed to self-assembled into microdomains with negative curvature that can promote the formation of intralumenal vesicles (Alonso and Goni, 2018; Niekamp et al 2022). We propose that SM accumulation may hinder this process by counteracting the negative membrane curvature and impede microautophagy.

      (d) How about ER membrane lipids? The UPR and subsequent effects on proteostasis are intricately involved with ER lipid bilayer composition.

      We did not perform lipidomics on ER membranes in this study, though we note that at steady state, sphingolipids and cholesterol esters are not expected to be enriched at the ER (Ikonen and Zhou, 2021). We checked whether lipid-related genetic perturbations induced the UPR in published perturb-seq data in K562 cells. Neither MYLIP nor PSAP knockdown induced a UPR.

      In conclusion, the manuscript is interesting but the excitement over a link between lysosome-related lipid metabolism and proteostasis needs to be tamped until a more robust experimental approach is employed to generate supportive and corroborating results.

      Reviewer #2 (Recommendations For The Authors):

      - The paper has a number of grammatically awkward sentences. Editing these would enhance clarity.

      - It is important to show the co-localization of aggregates with the lysosome. This is shown in supplements but should be in a main figure. Here the authors cite previous work indicating that ProteoStat puncta co-localize with ubiquitinated proteins and state that they do not see this, then essentially just move on. Is there an explanation for this discrepancy and can it be resolved? What do they think is really going on? What happens to levels of ubiquitinated proteins when lipid metabolism is perturbed as in these experiments?

      We have included the lipid-induced lysosomal protein aggregation data in the main text (Figure 3A-B), and provided fine-grained quantification of the cytosolic-vs-lysosomal ProteoStat / Ub / ThT signals under different aggregate-inducing conditions (Figure S4A-D). We discuss these results in the main text and propose a model involving ESCRT-mediated microautophagy in the main text. This is supported further by the LysoIP-proteomics and LMP analysis.

      - Please add an indicator of amino acid numbers to Fig. 3C.

      These annotations are now included (now Figure S3C).

      - The legend for 3D is mislabelled.

      We have corrected the legend (now Figure S3D).

      Reviewer #3 (Recommendations For The Authors):

      Protein homeostasis and lipid homeostasis are both are important for maintaining cellular functions. However, the crosstalk remains largely unknown. The manuscript entitled as "Impairment of lipid homoeostasis causes accumulation of protein aggregates in the lysosome" deals with this interesting topic. An important link between lysosomal protein aggregation and sphingolipids/cholesterol esters metabolism were discovered. The topic belonging to the Cell Biology domain also falls into the aims and scope of eLife. Here are the revisions I recommend:

      (1) From lipidomics analysis, a remarkable correlation between levels of sphingomyelin and cholesterol ester and ProteoStat staining was found. Could the authors explain how sphingomyelin and cholesterol ester are quantified? The two lipids are not included as internal standards from the lipidomics experiment.

      Sphingomyelin and cholesterol ester internal standards are included in the Avanti 330707 SPLASH® LIPIDOMIX® Mass Spec Standard, which was supplied at 3% v/v to the MeOH/H2O cell lysis buffer. We have amended the Methods section to clarify this.

      (2) Could the authors perhaps delete Figure 1B and show it on Figure 2A only? There is no need to show the same figure two times. The threshold of both False Discovery Rate and Median Enrichment needs to be added. From Figure 2A, the Lysosomal hydrolases (GBA, LIPA, GALC) seems located in statistically insignificant region. Based on previous studies, the GBA could have an effect on sphingolipid levels, then how to explain that sphingomyelin was highly correlated with ProteoSate staining?

      We have combined the two volcano plots into a single figure (now Figure 1D), and added a line to help visualize the gene effects while considering the combined contribution of FDR and enrichment. Individual lysosomal hydrolases indeed have insignificant effects on ProteoStat and this is discussed in the main text as having relatively constrained impacts on the general lipidome. For example, while GBA and GALC KDs can lead to accumulation of their immediate substrates (glucosylceramide and galactosylceramide, respectively), they do not directly impinge on sphingomyelin.

      (3) The authors show the corelation between ProteoState staining and different lipids/lipid classes in Figure 3B and Figure S3A. It is not necessary to show the corelation with individual lipids (such as sphingomyelin(d18:1/24:0) and cholesterol ester(18:2). The corelation with full collection of lipid classes would be more representative, which is only list in Figure 3B and Figure S3A. It is suggested to add the information of how many individual lipids in each chass are used for the correlation analysis. Replace Figure 3A to Figure S3A, and put Figure 3A as supplementary figure are suggested.

      We decided to retain the correlation of two individual lipids (a sphingomyelin and a cholesterol ester species) with ProteoStat as examples to illustrate with clarity how we obtained the class-wide comparison. The number of individual lipids included in each class for correlation analysis is now included in Figures 2F and S3A.

      (4) The authors state that lipid uptake and metabolism modulate proteostasis. However, only cholesterol and LDL were tested. It would be more precise to state as cholesterol uptake and metabolism modulate proteostasis. In addition, sphingolipids and cholesterol esters accumulate with increased lysosomal protein aggregation. It would be interesting to see the effects of sphingolipids uptake, since sphingolipids are correlated with proteostasis better than cholesterol.

      We attempted to add back specific sphingolipids to assess sufficiency. However, we found it challenging to ensure that these lipids were distributed to the correct subcellular locations at physiologically relevant levels. Without this crucial information, it was difficult to draw any conclusions about the sufficiency of the sphingolipids we tested to impair proteostasis.

      Alonso A, Goñi FM. 2018. The Physical Properties of Ceramides in Membranes. Annu Rev Biophys 47:633–654. doi:10.1146/annurev-biophys-070317-033309

      Ikonen E, Zhou X. 2021. Cholesterol transport between cellular membranes: A balancing act between interconnected lipid fluxes. Dev Cell 56:1430–1436. doi:10.1016/j.devcel.2021.04.025

      Niekamp P, Scharte F, Sokoya T, Vittadello L, Kim Y, Deng Y, Südhoff E, Hilderink A, Imlau M, Clarke CJ, Hensel M, Burd CG, Holthuis JCM. 2022. Ca2+-activated sphingomyelin scrambling and turnover mediate ESCRT-independent lysosomal repair. Nat Commun 13:1875. doi:10.1038/s41467-022-29481-4

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Anne Schreuder et al studied the genetic vulnerabilities of two previously described hypomorphic BRCA1 missense mutations- I26A and R1699Q, and compare these to a BRCA1-depleted setting to identify novel vulnerabilities of the two hypomorphic BRCA1 alleles. The authors showed that BRCA1I26A mutated cells have very similar vulnerabilities to BRCA1 wildtype cells, while the BRCA1R1699Q mutation induced a more similar phenotype to BRCA1-deficient cells. Then the authors unveiled a unique vulnerability to the loss of NDE1 with increased genomic instability specifically in BRCA1R1699Q mutated cells, but not BRCA1-proficient or -deficient cells. While the experiment design strategy is quite reasonable and the data are quite solid, some of the interpretations look not that convincing.

      Major concerns:

      1. According to your data, RAD51 IRIF were reduced to similar levels in BRCA1 depleted cells reconstituted with either BRCA1R1699Q or BRCA1I26A mutants,suggesting that both the two mutants have defects in HR (Line 110-117). And when you tested the Olaparib sensitivity, your results showed that unlike wildtype BRCA1, re-expression of BRCA1I26A only partially rescued the sensitivity, suggesting that BRCA1I26A still have the capacity to perform HR (Line 120-125). Since the function of BRCA1I26A is quite controversial in the field, the authors should explain in your experiment why RAD51 IRIF of BRCA1I26A is not correlated with its HR level, these two data should be consistent.

      Moreover, in line 194-195, you mentioned that this finding correlates with research showing that the I26A mutation does not affect tumour suppression and HR by BRCA1. It is hard to tell whether BRCA1I26A is defective or functional in HR, what's your opinion about it? If you agree that BRCA1I26A is functional in HR, then why it affects RAD51 IRIF. <br /> 2. In line 175, the authors validated the synthetic lethal interaction between BRCA1 and CSA in BRCA1-depleted RPE1 cells and BRCA1-mutated HCC1937 cells (Figure 2D, Supplemental figure 2A, B, C). Actually, HCC1937 is a both BRCA1 and FAM35A-mutated cell line (DOI: 10.15252/embj.201899543), it is a HR functional cell line that does not response to PARPi. According to your CRSIPR data in Table1 and others publications, loss of 53BP1 or its downstream factors such as C20ORF196, FAM35A are synthetic survival with BRCA1 loss. If CSA is synthetic lethal with BRCA1 loss in HCC1937, suggesting that CSA is not simply synthetic lethal with BRCA1 loss of function, at least not only synthetic lethal with HR deficiency. CSA maybe a promising drug target for treating with the PARPi resistant or the PARPi non-response patients. The authors should mention it in the manuscript. 3. RPE1 hTERT P53-/- BRCA1-/- cells have very severe cell growth defects compared with RPE1 hTERT P53-/- BRCA1+/+ cells. Did you see a growth defect or a certain cell death when you induce acute BRCA1 depletion? In Figure 1C, you only showed the survival rate compared with PARPi non-treatment group. Can you also show the growth curve of all these cell lines? 4. Based on your CRISPR screen results from Table 1,2,3, you made the conclusion that BRCA1I26A exhibits vulnerabilities similar to BRCA1-proficient cells and BRCA1R1699Q exhibits vulnerabilities similar to BRCA1-deficient cells. However, when looked at the data carefully, XRCC1 and several FA genes were all found as synthetic lethality hits with BRCA1-deficient, BRCA1I26A, BRCA1R1699Q. And the known genes such as TP53BP1 and ATMIN were found beneficial for survival in the all three screens. If BRCA1I26A exhibits vulnerabilities similar to BRCA1-proficient cells, then why the known hits in the screen are same with BRCA1-deficient cells. Loss of NDE1 is specifically toxic to cells expressing BRCA1R1699Q. Did you find any target that specifically toxic to cells expressing BRCA1I26A?

      It is hard to tell whether your conclusion is correct. Of course, the three cells have some same and also different genetic background, you may consider how to separate the difference. Separate the difference will benefit the treatment for patients with different BRCA1 mutation background.

      Minor concerns:

      1. RPE1 hTERT P53-/- BRCA1-/- cell line has very clear BRCA1 depletion background. Why at the beginning, you did not choose to use RPE1 hTERT P53-/- BRCA1-/- cells reconstituted with BRCA1 wildtype, BRCA1R1699Q or BRCA1I26A mutants to perform the CRISPR screens? What are the advantages of your strategy by first depletion of BRCA1 with auxin and then inducible express BRCA1 constructs? It looks much more complicated.
      2. In Supplemental Figure 2C, a blot to detect BRCA1 should be included.

      Significance

      If the conclusions are correct, the new findings will tell the importance to differentiate between patient-derived mutations when assessing novel treatment targets.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Exploiting synthetic lethality based on functional correlations has the potential to significantly improve the survival of cancer patients by reducing resistance to targeted therapies and increasing anti-tumour efficacy when combined with other treatment modalities. Schreuder et al., aim to identifying novel vulnerabilities of patient-derived mutations that could improve patient stratification based on a specific genetic background. Precisely, they established a model system to perform a genome-wide CRISPR-Cas9 KO screen to identify genomic vulnerabilities of BRCA1 variants with reported hypomorphic phenotypes, namely BRCA1 R1699Q and BRCA1 I26A in engineered RPE1 hTERT cells with AID tag. Using this system the authors were able to confirm known synthetic lethal genes reported in literature (e.g. APEX2, PARP1, POLQ) comparing cells with acute BRCA loss and BRCA1 deficiency. Moreover, the screen identified two genes, CSA and GPX4 that were not previously described as synthetic lethal with BRCA1 loss. What is potentially interesting, but marginally explored, is the identification of a unique synthetic vulnerability of cells expressing BRCA1 R1699Q mutant and NDE1 gene encoding for a dynamic scaffold protein essential in neocortical neurogenesis and heterochromatin patterning by H4K20me3, whose loss of function results in nuclear architecture aberrations and DNA double-strand breaks (Chomiak et al., iScience 2022). Accordingly, cells ablated of NDE1 and expressing BRCA1 R1699Q mutant show less proliferation of cells expressing either BRCA1 WT or BRCA1-depleted. Furthermore, cells lacking NDE1 show increased genomic instability by means of increased micronuclei and anaphase bridges compared to BRCA1 proficient and BRCA1 R1699Q mutant.

      Major comments:

      1. The authors claim that cells expressing BRCA1-I26A are largely HR-proficient, based on a milder effect on Olaparib sensitivity compared to cells expressing BRCA1-R1699A (Fig. 1C). However, I26A mutant cells are defective in RAD51 IRIF (Fig. 1B), indicative of an HR defect. Recently it has been shown that BRCA1 RING mutations that do not impact BARD1 binding, including I26A, render BRCA1 unable to accumulate to DNA damage sites and unable to form RAD51 foci when such mutation is combined with mutations that disable RAP80-BRCA1 interaction (Sherker et al., 2021). How do the authors explain this discrepancy with the literature?
      2. The reduction in survival following CSA depletion in BRCA1-proficient vs. -deficient cells is only 20% (Figure 2 and S2B). In my opinion, such a minor difference is not supporting the notion of a SL interaction between BRCA1 and CSA. To substantiate CSA as synthetic lethal hit, I would recommend the authors comparing the effect of CSA loss to that of EXO1 or BLM loss, both genes recently identified by the same group as SL partners of BRCA1 using the same experimental screening set up (van de Kooij et al, 2024). Moreover, validation data for GPX4 is missing.
      3. Similar to the minor effect observed for CSA, DOT1L and OTUD5 depletions caused rather mild and/or divergent phenotypes between the two sgRNAs used (Figure 4B), rather arguing against robust SL interactions between those genes and BRCA1 deficiency that could be therapeutically exploited.
      4. To strengthen their conclusion in Figure 4C the authors should perform complementation experiments with NDE1 WT and, ideally, with NDE1 mutant(s). On a related note, are NDE1 knock-out cells expressing BRCA1-R1699A more sensitive to PARPi?
      5. Graphs shown in Fig. 1A-C, Fig. 4B, S2D, S3B, S3E and S3F are lacking proper statistical analysis of the differences. Some experiments have only been repeated twice (e.g. Figure 1C), precluding running statistical tests.

      Minor comments:

      1. The authors should include representative images for results shown in Fig.1 A-C
      2. The authors should add immunoblots for BRCA1 in Fig. S2C to indicate successful BRCA1 cDNA complementation in HCC1937 cells.
      3. Most numbers in the Venn diagram shown in Figure 3A cannot be read when printed.
      4. In the western blots shown in Supplemental Figure 1A, the electrophoretic mobility of BRCA1 variants expressed in RPE1 is quite variable. Could the authors indicate in the Figure (e.g. with arrowheads) which bands represent endogenous and which transgenic BRCA1. Moreover, in BRCA-wt complemented cells there are two bands following auxin/DOX addition, whereas there is one band observed in cells expressing BRCA1 hypomorphic variants
      5. Line 229 please correct "BRCA1-proficient" to "BRCA1-depleted".

      Significance

      General assessment:

      This manuscript starts with an attractive hypothesis, which is the generation of a cellular system to study patient-derived hypomorphic BRCA1 missense mutations rather than using BRCA1 knockout cells. Performing CRISPR/Cas9-mediated genome-wide synthetic lethal screens in this system allowed uncovering genetic vulnerabilities of cells expressing BRCA1-R1699A, a pathogenic mutant identified in several cancer patients. The data are of good quality and the manuscript is coherent and generally well written (few typos). However, some data describe mainly negative results (e.g. BRCA1-I26A mutant) or weak phenotypes while other more interesting aspects are not rigorously exploited (e.g. NDE1 SL) and therefore need to be interpreted with more caution and extended by additional experiments.

      Advance:

      BRCA1-R1699Q is classified as a pathogenic variant despite its low penetrance and intermediate cancer risk in breast and ovarian cancer compared to other variants. A recent case report highlighted the unique clinical outcome of a patient with the BRCA1 R1699Q variant, suggesting that this variant may differ from others in terms of cancer risk and drug response (Saito et al., Cancer Treatment and Research Communications 2022). These findings underscore the need for further studies to confirm these observations and to elucidate the specific mechanisms underlying the response to platinum agents and PARP inhibitors in patients with the BRCA1 R1699Q variant. The manuscript proposed by the authors has the potential to help understanding how BRCA1 missense mutations can contribute to determine treatment sensitivity and pave the way to patient stratification.

      Audience:

      This manuscript is suitable for a specialized, basic research audience.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript constitutes further analysis of a dataset generated for a previously-published study from the same group. The experiments in the previous work use an RNA-DNA proximity assay to capture RNAs that interact with chromatin, especially beyond their site of transcription, by crosslinking-and proximity ligation. The previous work added one novel feature to this treatment, compared to other studies by the same group, where they treated the nuclei with RNase A prior to crosslinking. The initial study concluded that long-range chromatin interaction via chromatin looping is affected by RNase treatment. In the current manuscript, the group analyze the data from this experiment in more detail. They describe some notable features of RNAs that remain after RNase treatment and where they are associated within the genome. Overall, the further analyses are somewhat useful, with some exceptions for specific analyses that are not clear in the current manuscript. The work is very complementary to the previously published original study, to the point that it is surprising it was not included in that study.

      Strengths:

      (1) The analyses are a useful complement that fill in gaps from the Calandrelli et al paper. Some of the findings are suggestive of RNA-protein networks that operate at long distances to regulate promoters.

      Weaknesses:

      (1) The beginning of the Results section, and elsewhere, describes steps that likely were performed in the previous publication from which the data are being further analyzed and possibly partially reanalyzed. The current manuscript should more clearly describe if there are any aspects of the pipeline that have been modified from the Calandrelli study (which does not have much detail regarding iMARGI parameters in the published paper) for the further analysis in this manuscript.

      (2) The RNase treatment approach is similar to that addressed in recent papers from the Jenner and Davidovich groups (https://doi.org/10.1016/j.celrep.2024.113856; https://doi.org/10.1016/j.celrep.2024.113858) where these groups found RNase treatment significantly affected solubility of chromatin, causing aggregation. The authors should address this work and place it in light of their current study.

      (3) Figure 1f: it is not clear what it means for genes to be "non-differentially expressed" in this context. Isn't this also RNase-insensitive? And how is the "Ctrl specific" RNA set determined? This is confusing, since RNase is assumed to degrade most of the RNA in these samples.

      (4) Figure 2a: The results are somewhat surprising, given that protein-coding genes are depleted more in the RNase treatment. Is the Ctrl set the same as in 1f? This emphasizes the importance of defining that population better.

      (5) Figure 3a: The text references this figure in ways that do not match the figure, referencing at least nine column clusters when there are only six. Heatmaps of certain TFs and "RAH explained" percentages don't seem to match the Results section description, either. The authors claim EZH2 binding sites are the top TF overlap with RAHs and yet do not include EZH2 in Figure 3a. Suz12 (EZH2 binding partner) and H3K27me3 (EZH2 product) are also referenced in the text for this figure, but not included in the figure itself.

      (6) The manuscript uses the term "non-diffusive RNA-chromatin interactome" which is not directly supported by data. The authors use the term initially to describe the RNase-resistant species in their previous work, but through the current study, they support a model where the RNase resistance is simply due to protection by protein binding, not by any constraints on diffusion in particular chromatin environments.

    2. Reviewer #3 (Public review):

      Summary:

      The study investigated stable RNA-chromatin interactions by applying RNase treatment before the iMARGI (in situ mapping of RNA-genome interactome) procedure to remove promiscuous, unprotected RNA transcripts and selectively enrich for RNA-inaccessible, potentially functional RNA-chromatin interactions (RNA-Transcription factor and RNA-histone). The researchers found that short-range interactions (<1kb) are RNase resistant, possibly due to the protection from RNA polymerases. They noticed that long-range RNA-chromatin interactions (>2Mbp or interchromosomal) were also enriched after RNase treatment, hypothesizing that these interactions are stabilized by chromatin-binding proteins. They found that genic caRNAs were sensitive, while repeat-derived caRNAs, such as rRNA and satellite repeats, were resistant to RNase. Long non-coding RNAs (lncRNAs), particularly those associated with diseases, were over-represented among RNase-insensitive transcripts, indicating their potential regulatory significance. Additionally, RNase-insensitive caRNAs exhibited higher evolutionary conservation, implying that they are protected by protein complexes, especially in long-range interactions. RNA Attachment Hot Zones (RAHs) enriched post-RNase treatment were found to localize in functional genomic regions such as promoters, transcription factor binding sites (TFBS), and histone modification sites. Importantly, RNase treatment amplified specific RNA-transcription factor interactions, with caRNA signals being preserved at TFBS for factors with RNA-binding capabilities, suggesting that direct RNA-protein binding helps protect caRNAs from degradation. They also found that different TFs are enriched with specific caRNA species, distinguishing them from their genomic footprints. In addition, transcripts with higher abundance tend to enrich at more TFBS. Overall, the study highlights the role of RNase-inaccessible caRNAs in chromatin regulation and provides insight into their functional significance in genome organization.

      Strengths:

      This study involves rigorous and comprehensive data analysis involving datasets with very high sequencing depth and appropriate statistical tests (e.g., chi-square tests to validate the association between caRNAs and TFBS statistically). This analysis was further strengthened by comparing their results with orthogonal datasets, such as RedChIP and fRIP-seq, providing robust, cross-validated evidence for the caRNA-TFBS associations. In addition to examining broad interactions, the authors identified specific long-range RNA-chromatin interactions and pinpointed specific transcription factors and histone modification markers that are associated with these interactions. The authors explored the evolutionary implications of RNase-insensitive caRNAs and their potential medical relevance, particularly by identifying caRNAs linked to disease-associated genes and long non-coding RNAs (lncRNAs). This combination of detailed analysis, along with functional relevance, broadens the scope of the research, making it a significant contribution to chromatin biology.

      Weaknesses:

      However, I have the following concerns regarding the studies:<br /> (1) I don't understand the logic behind calling promoters, enhancers, and similar regions "functionally important regions" when describing the enrichment of RNase-insensitive interactions. Genic regions that are RNase-sensitive are also functionally relevant. So, what makes promoters, enhancers, etc, unique in terms of functionality?<br /> (2) First, while the study offers strong evidence for associations between caRNAs, transcription factors, and chromatin markers, it lacks direct functional validation experiments such as RNA knockdown or CRISPR interference, to confirm the specific roles of these RNAs in gene regulation or chromatin structure modifications.<br /> (3) Another limitation is the incomplete investigation of caRNAs with short-range interactions (<1kb). The authors hypothesized that these are protected by RNA polymerases but did not provide supporting experimental evidence or references to previous studies. Offering either experimental validation or a rationale for excluding these short-range interactions would strengthen this hypothesis. The conclusion that authors drew on that "chromatin-associated RNAs (caRNAs) involved in short- to middle-range interactions are more susceptible to RNase treatment" was unclear for the specific "short-range" distance. The data shown in Supplementary Figure 2a contradicted the conclusion in the discussion that "long-distance RNA-chromatin interactions are preferentially preserved after RNase treatment, while short-range interactions are depleted." as well as the suggestion made linking RNase inaccessibility to evolutionarily conserved in the paper.<br /> (4) The study heavily relies on RNase treatment to isolate stable RNA-chromatin interactions, which might neglect important transient or weak interactions and overlook the functional relevance of RNase-sensitive interactions, hence missing the dynamic nature of RNA-chromatin interactions.<br /> (5) Tthe analysis is limited to human embryonic stem cells (H1 cells), which might restrict the generalizability of the findings. Expanding the study to include a cell type that represents a broader range of cell types or tissues will strengthen the conclusions.<br /> (6) The term "RNase A treatment" in the methods section could be clearer if specified as "RNase-treated iMARGI," which encompasses the standard iMARGI protocol.<br /> (7) There is some ambiguity regarding whether the researchers generated new data or reanalyzed existing datasets. While it is mentioned early on that previously published RNase-treated iMARGI datasets were reanalyzed, the text later states that "three biological replicates were generated for the RNase-treated samples." Clarifying whether the data were newly generated in this study or obtained from public datasets would improve the clarity.<br /> (8) The color scheme should be the same for heatmaps for control, and RNase-treated samples in Figure 4.

    3. Author response:

      We thank the editors and reviewers for their thorough evaluation of our manuscript. We appreciate the constructive feedback and insights provided. 

      We acknowledge that some of our conclusions would benefit from more measured statements and additional computational controls. We will revise the manuscript to better reflect the scope and limitations of our analytical approach. While we cannot add new experimental validations at this stage, we will strengthen our computational analyses and clarify our methodology.

      Below, we outline our planned revisions to address the major points raised in the public reviews:

      Clarification of Terms and Definitions:

      (1) We will make it clearer in our manuscript to emphasize that we reuse the same raw datasets from our previous study as described in Calendrilli et al, 2023, and there is no modification to the experimental methods or data. 

      (2) We will provide clear definitions for:

      - "Non-differentially expressed" genes

      - "Ctrl specific" RNA sets

      - The composition of control populations in different analyses

      (3) We will revise the use of "non-diffusive RNA-chromatin interactome" and “RNase-resistant” terminology to better reflect our actual findings.

      (4) We will also improve clarity regarding:

      - The rationale for focusing on specific genomic regions

      - The interpretation of evolutionary conservation data

      (5) We will provide additional rationale on the exclusion of short-range interactions.

      Figure Revisions:

      (1) Figure 3a: We will correct any discrepancy between text references and figure content.

      (2) Figure 4: We will standardize the color scheme between control and RNase-treated samples.

      (3) We will follow the reviewer's suggestion to move figure 1g to the supplementary file. 

      Additional Computational Analyses:

      (1) We will consider adding controls for RNA length effects and integrate any existing knowledge on the protection extent variation across different RBP.

      Discussions:

      (1) We will carefully rephrase our conclusions to more accurately reflect the scope and limitations of our computational findings, ensuring we do not overstate the implications.

      (2) We will expand the discussion of limitations, including:

      - The focus on RNase-resistant interactions only

      - The cell-type specificity of our findings

      - The lack of functional validation

      - The limited ability to discern and study the transient or weak RNA-chromatin interactions using the current dataset

      (3) Regarding the recent papers from Jenner and Davidovich groups about RNase treatment effects on chromatin solubility:

      - We will discuss these findings in our revised manuscript

      - We will address potential limitations this may impose on our interpretations

    1. Reviewer #3 (Public review):

      Summary:

      Membrane-bound pyrophosphatases (mPPases) are homodimeric proteins that hydrolyze pyrophosphate and pump H+/Na+ across membranes. They are attractive drug targets against protist pathogens. Non-hydrolysable PPi analogue bisphosphonates such as risedronate (RSD) and pamidronate (PMD) serve as primary drugs currently used. Bisphosphonates have a P-C-P bond, with its central carbon can accommodate up to two substituents, allowing a large compound variability. Here the authors solved two TmPPase structures in complex with the bisphosphonates etidronate (ETD) and zoledronate (ZLD) and monitored their conformational ensemble using DEER spectroscopy in solution. These results reveal the inhibition mechanism of these compounds, which is crucial for developing future small molecule inhibitors.

      Strengths:

      The authors show that seven different bisphosphonates can inhibit TmPPase with IC50 values in the micromolar range. Branched aliphatic and aromatic modifications showed weaker inhibition.

      High-resolution structures for TmPPase with ETD (3.2 Å) and ZLD (3.3 Å) are determined. These structures reveal the binding mode and shed light on the inhibition mechanism. The nature of modification on the bisphosphonate alters the conformation of the binding pocket.

      The conformational heterogeneity is further investigated using DEER spectroscopy under several conditions.

      Weaknesses:

      The authors observed asymmetry in the TmPPase-ELD structure above the hydrolytic center. The structural asymmetry arises due to differences in the orientation of ETD within each monomer at the active site. As a result, loop5-6 of the two monomers is oriented differently, resulting in the observed asymmetry. The authors attempt to further establish this asymmetry using DEER spectroscopy experiments. However, the (over)interpretation of these data leads to more confusion than any further understanding. DEER data suggest that the asymmetry observed in the TmPPase-ELD structure in this region might be funneled from the broad conformational space under the crystallization conditions.

      DEER data for position T211R1 at the enzyme entrance reveal a highly flexible conformation of loop5-6 (and do not provide any direct evidence for asymmetry, Figure EV8). Similarly, data for position S521R1 near the exit channel do not directly support the proposed asymmetry for ETD. Despite the high quality of the data, they reveal a very similar distance distribution. The reported changes in distances are very small (+/- 0.3 nm), which can be accommodated by a change of spin label rotamer distribution alone. Further, these spin labels are located on a flexible loop, thereby making it difficult to directly relate any distance changes to the global conformation.

      The interpretations listed below are not supported by the data presented:

      (1) 'In the presence of Ca2+, the distance distribution shifts towards shorter distances, suggesting that the two monomers come closer at the periplasmic side, and consistent with the predicted distances derived from the TmPPase:Ca structure.'

      Problem: This is a far-stretched interpretation of a tiny change, which is not reliable for the reasons described in the paragraph above.

      (2) 'Based on the DEER data on the IDP-bound TmPPase, we observed significant deviations between the experimental and the in silico distances derived from the TmPPase:IDP X-ray structure for both cytoplasmic- (T211R1) and periplasmic-end (S525R1) sites (Figure 4D and Figure EV8D). This deviation could be explained by the dimer adopting an asymmetric conformation under the physiological conditions used for DEER, with one monomer in a closed state and the other in an open state.'

      Problem: The authors are trying to establish asymmetry using the DEER data. Unfortunately, no significant difference is observed (between simulation and experiment) for position 525 as the authors claim (Figure 4D bottom panel). The observed difference for position 112 must be accounted for by the flexibility and the data provide no direct evidence for any asymmetry.

      (3) 'Our new structures, together with DEER distance measurements that monitor the conformational ensemble equilibrium of TmPPase in solution, provide further solid experimental evidence of asymmetry in gating and transitional changes upon substrate/inhibitor binding.'

      Problem: See above. The DEER data do not support any asymmetry.

      (4) Based on these observations, and the DEER data for +IDP, which is consistent with an asymmetric conformation of TmPPase being present in solution, we propose five distinct models of TmPPase (Figure 7).

      Problem: Again, the DEER data do not support any asymmetry and the authors may revisit the proposed models.

      (5) 'In model 2 (Figure 7), one active site is semi-closed, while the other remains open. This is supported by the distance distributions for S525R1 and T211R1 for +Ca/ETD informed by DEER, which agrees with the in silico distance predictions generated by the asymmetric TmPPase:ETD X-ray structure'

      Problem: Neither convincing nor supported by the data

    2. Author response:

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      This work examines the binding of several phosphonate compounds to a membrane-bound pyrophosphatase using several different approaches, including crystallography, electron paramagnetic resonance spectroscopy, and functional measurements of ion pumping and pyrophosphatase activity. The work attempts to synthesize these different approaches into a model of inhibition by phosphonates in which the two subunits of the functional dimer interact differently with the phosphonate.

      Strengths:

      This study integrates a variety of approaches, including structural biology, spectroscopic measurements of protein dynamics, and functional measurements. Overall, data analysis was thoughtful, with careful analysis of the substrate binding sites (for example calculation of POLDOR omit maps).

      Weaknesses:

      Unfortunately, the protein did not crystallize with the more potent phosphonate inhibitors. Instead, structures were solved with two compounds with weak inhibitory constants >200 micromolar, which limits the molecular insight into compounds that could possibly be developed into small molecule inhibitors. Likewise, the authors choose to focus the spectroscopy experiments on these weaker binders, missing an opportunity to provide insight into the interaction between more potent binders and the protein.

      We acknowledge the reviewer concern regarding the choice of weaker inhibitors. We attempted co-crystallization with all available inhibitors, including those with higher potency. However, despite numerous efforts, these potent inhibitors yielded low-resolution crystals, making them unsuitable for detailed structural analysis. Therefore, we chose to focus on the weaker binders, as we were able to obtain high-quality crystal structures for these compounds. This allowed us to perform DEER spectroscopy with the added advantage of accurately analyzing the data against structural models derived from X-ray crystallography. Using these weaker inhibitors enabled a more precise interpretation of the DEER data, thus providing reliable insights into the conformational dynamics and inhibition mechanism. However, as suggested by the reviewer, in the revised version, we will perform DEER analysis on the more potent inhibitors to provide additional insight into their interactions.

      In general, the manuscript falls short of providing any major new insight into membrane-bound pyrophosphatases, which are a very well-studied system. Subtle changes in the structures and ensemble distance distributions suggest that the molecular conformations might change a little bit under different conditions, but this isn't a very surprising outcome. It's not clear whether these changes are functionally important, or just part of the normal experimental/protein ensemble variation.

      We respectfully disagree with the reviewer. The scale of motions seen in this study correspond to those seen in the full panoply of crystal structures of mPPases. Some proteins undergo very large conformational changes during catalysis – such as the rotary ATPase. This one doesn’t, meaning that the precise motions we describe are likely to be relevant. Conformational changes in the ensemble, whether large or small, represent essential protein motions which underlie key mPPase catalytic function. Our DEER spectroscopy data demonstrate the sensitivity and resolution necessary to monitor these subtle changes in equilibria, even if these are only a few Angstroms. For several of the conditions we investigated by DEER in solution, corresponding x-ray structures have been solved, with the derived distances agreeing well with the DEER distributions. This further validates the biological relevance of the structures, including serial time-resolved ones that indicate asymmetry.

      The ZLD-bound crystal structure doesn't predict the DEER distances, and the conformation of Na+ binding site sidechains in the ZLD structure doesn't predict whether sodium currents occur. This might suggest that the ZLD structure captures a conformation that does not recapitulate what is happening in solution/ a membrane.

      We agree with the reviewer that the ZLD-bound crystal structure does not predict the DEER distances. However, we believe this discrepancy arises from the effect of the bulkiness of ZLD inhibitor, which prevents the closure of the hydrolytic centre. Additionally, the absence of Na+ at the ion gate in the ZLD-bound structure suggests that Na+ transport does not occur, a conclusion further supported by our electrometric measurements. We agree with the reviewer, that the distances observed in the DEER experiments might represent a potential new conformation in solution, which may not be captured by the static X-ray structure, thereby offering insights into the dynamic nature of the protein under physiological conditions. Finally, the static x-ray structures have not captured the asymmetric conformations that must exist to explain half-of-the-sites reactivity.

      Reviewer #2 (Public review):

      Summary:

      Crystallographic analysis revealed the asymmetric conformation of the dimer in the inhibitor-bound state. Based on this result, which is consistent with previous time-resolved analysis, authors verified the dynamics and distance between spin introduced label by DEER spectroscopy in solution and predicted possible patterns of asymmetric dimer.

      Strengths:

      Crystal structures with inhibitor bound provide detailed coordination in the binding pocket thus useful information for the PPase field and maybe for drug development.

      Weaknesses:

      The distance information measured by DEER is advantageous for verifying the dynamics and structure of membrane protein in solution. However, regarding T211 data, which, as the authors themselves stated, lacks measurement precision, it is unclear for readers how confident one can judge the conclusion leading from these data for the cytoplasmic side.

      We thank the reviewer for acknowledging the advantageous use of the DEER methodology for identifying dynamic states of membrane proteins in solution. We used two sites in our analysis: S525 (periplasm) and T211 (cytoplasm). As we clearly stated in the original manuscript, S525R1 yielded high-quality DEER data, while T211R1 yielded weak (or no) visual oscillations, leading to broad, though different distributions for the several conditions tested. Our main conclusions are based on the S525R1 data. We included the T211R1 data because, although it does not provide definitive evidence, it is consistent with our proposed model and offers additional insights into biologically relevant conditions. Furthermore, the shifts in the centre of mass (Fig EV8D) of the broad T211R1 distributions show a trend that is consistent with our model; although not proving it, it does not exclude it either. Lastly, these data do indeed confirm an important structural feature of mPPase in solution conditions which is the intrinsically high dynamic state of the loop5-6 where T211 is located, and consistent with our previous (Kellosalo et al., Science,  2012; Li et al., Nat. Commun, 2016; Vidilaseris et al., Sci. Adv., 2019; Strauss et al., EMBO Rep., 2024) and current x-ray crystallography data.

      The distance information for the luminal site, which the authors claim is more accurate, does not indicate either the possibility or the basis for why it is the ensemble of two components and not simply a structure with a shorter distance than the crystal structure.

      We thank the reviewer for pointing out this possibility and alternative interpretation of our DEER data. In the revised version, we will show that our DEER data are consistent with (and do not exclude) asymmetry and rephrase to be inclusive of other possibilities. Importantly, this additional possibility does not affect the current interpretation of the data in our manuscript.

      Reviewer #3 (Public review):

      Summary:

      Membrane-bound pyrophosphatases (mPPases) are homodimeric proteins that hydrolyze pyrophosphate and pump H+/Na+ across membranes. They are attractive drug targets against protist pathogens. Non-hydrolysable PPi analogue bisphosphonates such as risedronate (RSD) and pamidronate (PMD) serve as primary drugs currently used. Bisphosphonates have a P-C-P bond, with its central carbon can accommodate up to two substituents, allowing a large compound variability. Here the authors solved two TmPPase structures in complex with the bisphosphonates etidronate (ETD) and zoledronate (ZLD) and monitored their conformational ensemble using DEER spectroscopy in solution. These results reveal the inhibition mechanism of these compounds, which is crucial for developing future small molecule inhibitors.

      Strengths:

      The authors show that seven different bisphosphonates can inhibit TmPPase with IC50 values in the micromolar range. Branched aliphatic and aromatic modifications showed weaker inhibition.

      High-resolution structures for TmPPase with ETD (3.2 Å) and ZLD (3.3 Å) are determined. These structures reveal the binding mode and shed light on the inhibition mechanism. The nature of modification on the bisphosphonate alters the conformation of the binding pocket.

      The conformational heterogeneity is further investigated using DEER spectroscopy under several conditions.

      Weaknesses:

      The authors observed asymmetry in the TmPPase-ELD structure above the hydrolytic center. The structural asymmetry arises due to differences in the orientation of ETD within each monomer at the active site. As a result, loop5-6 of the two monomers is oriented differently, resulting in the observed asymmetry. The authors attempt to further establish this asymmetry using DEER spectroscopy experiments. However, the (over)interpretation of these data leads to more confusion than any further understanding. DEER data suggest that the asymmetry observed in the TmPPase-ELD structure in this region might be funneled from the broad conformational space under the crystallization conditions.

      See also the response below - We respectfully disagree with the reviewer. The asymmetry was previously established using serial time crystallography (Strauss et al., EMBO Rep, 2024) and biochemical assays (e.g. Malinen et al., Prot. Sci., 2022; Artukka et al., Biochem J, 2018; Luoto et al., PNAS, 2013) and also partially seen in one static structure (Vidilaseris et al., Sci Adv 2019). DEER data only show that the previously proposed asymmetry could also be present within the conformational ensemble in solution conditions. Indeed, our data do not (and cannot) exclude this possibility.

      DEER data for position T211R1 at the enzyme entrance reveal a highly flexible conformation of loop5-6 (and do not provide any direct evidence for asymmetry, Figure EV8).

      Please see relevant response above. We acknowledge that T211 is indeed situated on a highly dynamic loop, which is important for gating and our DEER data confirm its high flexibility. Given we have not observed oscillations of this site, leading to broad distributions, we have stated in the original manuscript that we will not establish the presence of any asymmetry in solution on the basis of T211, rather relying on the S525 site, for which we have acquired high-quality DEER data, as was also pointed out and have been commented on by all reviewers.

      Similarly, data for position S521R1 near the exit channel do not directly support the proposed asymmetry for ETD.

      The reviewer appears to suggest that we hold the S525R1 DEER data as direct proof of asymmetry; this is combative on the grounds that to directly prove asymmetry would require time-resolved DEER measurements, far beyond the scope of this work. Rather, we have applied DEER measurements to explore whether asymmetry (observed previously via time-resolved X-ray crystallography) is also present (or indeed a possibility) in solution. We simply state that the DEER data are consistent with asymmetry (i.e., that the mean distance increases in the presence of ETD compared to the apo-state). This is a restrained interpretation of the data.

      Despite the high quality of the data, they reveal a very similar distance distribution. The reported changes in distances are very small (+/- 0.3 nm), which can be accommodated by a change of spin label rotamer distribution alone. Further, these spin labels are located on a flexible loop, thereby making it difficult to directly relate any distance changes to the global conformation

      We thank the reviewer for recognising the high quality of our DEER data for the S525R1, where visual oscillations in the raw traces, as in our case, reportedly lead to highly accurate and reliable distributions, able to separate (in fortuitous cases) helical movements of only a few Angstroms. The ability of DEER/PELDOR offering near Angstrom resolution was previously demonstrated by the acquisition and solution of high resolution multi-subunit spin-labelled membrane protein structures (Pliotas at al., PNAS, 2012; Pliotas et al., Nat Struct Mol Biol, 2015; Pliotas, Methods Enzymol, 2017) as well as it ability in detecting small (and of similar to mPPase magnitude) conformational changes in different integral membrane proteins systems (Kapsalis et al., Nature Comms, 2019; Kubatova et al., PNAS, 2023; Schmidt et al., JACS, 2024; Lane et al., Structure, 2024; Hett et al., JACS, 2021; Zhao et al., Nature, 2024), occurring under different conditions and/or stimuli in solution and/or lipid environment. The changes here are not very small (e.g. ~ 7 Angstroms between the two mean distance extremes (Ca vs IDP)) for DEER’s proven detection sensitivity, and with all other conditions showing changes between those extremes.

      These changes are relatively small, but they are expected for membrane ion pumps. Indeed, none of the mPPase structures show helical movements of greater than a half a turn, and that only in helices 6 and 12. There appear to be larger-scale loop closing motions of the 5-6 loop that includes T211, due to the presence of E217 which binds to one of the Mg2+ ions that coordinate the leaving group phosphate. (This is, inter alia, the reason that this loop is so flexible: it can not order before substrate is bound.) Here we have the resolution to detect such subtle differences by DEER, given there are clear shifts in our time domain data and these are reflected in the mean distances in the distributions. Therefore, our study demonstrates the sensitivity and resolution DEER offers in detecting subtle conformational transitions, key in membrane proteins pathways. To further belabour this point, we do not quantify the DEER data (for instance through parametric fitting) to extract populations of different conformational states and we appreciate that to do so would be highly prone to error; however we do (and can, we feel without overinterpretation) assert that the mean distances shift.

      The interpretations listed below are not supported by the data presented:

      (1) 'In the presence of Ca2+, the distance distribution shifts towards shorter distances, suggesting that the two monomers come closer at the periplasmic side, and consistent with the predicted distances derived from the TmPPase:Ca structure.' Problem: This is a far-stretched interpretation of a tiny change, which is not reliable for the reasons described in the paragraph above.

      While the authors overall agree with the reviewer assessment that ±0.3 nm is a small (not a minor) change, there are literature examples quantifying (or using for quantification) distribution peaks separated by similar Δr. (Kubatova et al., PNAS, 2023; Schmidt et al., JACS, 2024; Hett et al., JACS, 2021; Zhao et al., Nature, 2024). In particular, none of the mPPase structures show helical movements of greater than a half a turn (in helices 6 and 12 in particular). There appear to be larger-scale loop closing motions of the 5-6 loop that includes T211, due to the presence of E217 which binds to one of the Mg2+ ions that coordinate the leaving group phosphate. (This is, inter alia, the reason that this loop is so flexible: it can not order before substrate is bound.)

      Importantly, we have fitted Gaussians to the experimental distance distributions of 525R1 output by the Comparative Deer Analyzer 2.0 and observed a change in the distribution width in presence of Ca2+, implying the rotameric freedom of the spin label is restricted. However, the CW-EPR for 525R1 indicate that the rotational correlation time of the spin label is highly consistent between conditions (the spectra are almost identical); this cannot be explained simply by rotameric preference of the spin label (as asserted by the reviewer 3), as there is no (further) immobilisation observed from the CW-EPR of apo-state (Figure EV9) to that in presence of Ca2+. Furthermore, in the absence of conformational changes, it is reasonable to assume (and demonstrable from the CW-EPR data) that the rotamer cloud should not significantly change between conditions. However, Gaussian fits of the two extreme cases yielding the longest (i.e., in presence of IDP) and shortest (in presence of ZTD) mean distances for the 525R1 DEER data indicated significant (i.e., above the noise floor after Tikhonov validation) probability density for the IDP condition at 50 Å (P(r) = 0.18). This occurs at four standard deviations above the mean of the ZTD condition, which by random chance should occur with <0.007% probability. Indeed, one can say that to observe 18% probability density at four standard deviations above the mean by random chance would occur on the order of one in 4 x 10^6.

      As in previous response the method can detect changes of such magnitude which are not small, but physiologically relevant and expected for integral membrane proteins, such as mPPases. Indeed, even in equal (or more) complex systems such as heptameric mechanosensitive channel proteins DEER provided sub-Angstrom accuracy, when a spin labelled high resolution XRC structure was solved (Pliotas et al., PNAS, 2012; Pliotas et al., Nat Struct Mol Biol, 2015). Despite this is ideal case where DEER accuracy was experimentally validated another high resolution structural method on modified membrane protein and is not very common it demonstrates the power of the method , especially when strong oscillations are present in the raw DEER data (as here for mPPase 525R1), even when multiple distances are present, Angstrom resolution is achievable in such challenging protein classes.

      (2) 'Based on the DEER data on the IDP-bound TmPPase, we observed significant deviations between the experimental and the in silico distances derived from the TmPPase:IDP X-ray structure for both cytoplasmic- (T211R1) and periplasmic-end (S525R1) sites (Figure 4D and Figure EV8D). This deviation could be explained by the dimer adopting an asymmetric conformation under the physiological conditions used for DEER, with one monomer in a closed state and the other in an open state.'

      Problem: The authors are trying to establish asymmetry using the DEER data. Unfortunately, no significant difference is observed (between simulation and experiment) for position 525 as the authors claim (Figure 4D bottom panel). The observed difference for position 112 must be accounted for by the flexibility and the data provide no direct evidence for any asymmetry.

      Reviewer 3 is wrong in suggesting that we are trying to prove asymmetry through the DEER data. That is a well-known fact in the literature (eg Vidilaseris et al, Sci Adv 2019 where we show (1) that the exit channel inhibitor ATC (i.e., close to 525) binds better in solution to the TmPPase:PPi complex than the TmPPase:PPi2 complex, and (2) that ATC binds in an asymmetric fashion to the TmPPase:IDP2 complex with just one ATC dimer on one of the exit channels. We merely use the DEER data to support this well-established fact.

      However, we agree that the DEER data in presence of IDP does not provide direct proof for asymmetry; particularly mutant T211R1 yields in silico distributions too short for measurement by DEER. It is possible that the deviations observed (and particularly likely for T211R1) arise from conformational heterogeneity in solution. We will rephrase this paragraph accordingly: “Owing to the broad nature of the T211R1 (cytoplasmic site) distance distributions, we refrain from interpreting shifts in this data. For the 525R1 (periplasmic site) for which we obtained data of high quality (as also pointed out by both reviewers 2 and 3) we observed deviations between the experimental and the in-silico distances derived from the TmPPase:IDP X-ray structure. While this deviation is less pronounced than for the +ZTD condition, the deviation is consistent with an asymmetric conformation in solution.”

      (3) 'Our new structures, together with DEER distance measurements that monitor the conformational ensemble equilibrium of TmPPase in solution, provide further solid experimental evidence of asymmetry in gating and transitional changes upon substrate/inhibitor binding.'

      Problem: See above. The DEER data do not support any asymmetry.

      We feel that the reviewer comments here are somewhat unfounded. The DEER data (and we will limit discussion only to the 525R1 mutant in this regard) satisfy relevant criteria of the white paper (Schiemann et al., 2021, JACS) from the EPR community (signal-to-noise ratio w.r.t modulation depth of > 20 in all cases; replicates have been performed and will be added into the main-text or supplementary; near quantitative labelling efficiency (evidenced by lack of free spin label signal in the CW-EPR spectra); analysed using the CDA (now Figure EV10, this data we will promote to the main-text) to avoid confirmation bias).

      While the DEER data do not prove asymmetry, we do not claim proof of asymmetry in the above sentence. We concede to rephrase the offending sentence above as: “Our new structures, together with DEER distance measurements that monitor the conformational ensemble of TmPPase in solution, do not exclude asymmetry in gating and transitional changes upon substrate/inhibitor binding and are consistent with our proposed model.” We feel that this reframed conjecture of asymmetry is well founded; indeed, comparing the experimental apo-state 525R1 distance distribution with in-silico modelling performed on the hybridised asymmetric structure (i.e., comprised of one monomer bound to Ca2+ and another bound to IDP) yields an overlap coefficient (Islam and Roux, JPC B, 2015) of >0.97. This implies the envelope of the modelled distance distribution is quantitatively inside the envelope of the experimental distance distribution. Thus, the DEER data do not exclude asymmetry (previously observed by time-resolved XRC) in solution. While we appreciate that ideally one would measure time-resolved DEER to directly correlate kinetics of conformational changes within the ensemble to the catalytic cycle of mPPase,(and this is something we aim to do in the future), it is beyond the the scope of this study.

      Indeed, half-of-the-sites reactivity has been demonstrated in at least the following papers (Vidilaseris et al, Sci Acv. ,2019, Strauss et al, EMBO Rep. 2024, Malinen et al Prot Sci, 2022, Artukka et al Biochem J, 2018; Luoto et al, PNAS, 2013). Half-of-the sites activity requires asymmetry in the mechanism, and therefore asymmetric motions in the active site (viz 211) and exit channel (viz 525). As mentioned above, we have demonstrated this for other inhibitors (Vidilaseris et al 2019) and as part of a time-resolved experiment (Strauss et al 2024). In fact, given the wealth of evidence showing that the symmetrical crystal structures sample a non- or less-productive conformation of the protein, it would be quixotic to propose the DEER experiments - in solution - do not generate asymmetric conformations. It certainly doesn’t obey Occam’s razor of choosing the simplest possible explanation that covers the data.

      (4) Based on these observations, and the DEER data for +IDP, which is consistent with an asymmetric conformation of TmPPase being present in solution, we propose five distinct models of TmPPase (Figure 7).

      Problem: Again, the DEER data do not support any asymmetry and the authors may revisit the proposed models.

      We respectfully disagree with the reviewer. Please see our detailed response above. However, in the revised version, we will clarify that the proposed models are not solely based on the DEER data but are grounded in both current and previously solved structures, with the DEER data providing additional consistency with these models.

      (5) 'In model 2 (Figure 7), one active site is semi-closed, while the other remains open. This is supported by the distance distributions for S525R1 and T211R1 for +Ca/ETD informed by DEER, which agrees with the in silico distance predictions generated by the asymmetric TmPPase:ETD X-ray structure'

      Problem: Neither convincing nor supported by the data

      We respectfully disagree with the reviewer. However, owing to the conformational heterogeneity of T211R1, in the revised version, we will exclude it in the above sentence, to the effect: Please see our detailed response above.

    1. Reviewer #3 (Public review):

      Summary:

      The authors demonstrated MK2i could enhance the therapeutic efficacy of MTAs. With Tumor xenograft and migration assay, the author suggested that the p38-MK2 pathway may serve as a promising therapeutic target in combination with MTAs in cancer treatment.

      Strengths:<br /> The authors provided a potential treatment for breast cancer.

      Weaknesses:

      (1) In Figure 2, the authors used a human retinal pigment epithelial-1 (RPE1) cell line to show that breast cancer cells are more sensitive to CMPD1 treatment. MCF10A cells would be suggested here as a suitable control. Besides, to compare the sensitivity, IC50 indifferent cell lines should be measured.

      (2) The data of MDA-MB-231 in Figure 1D is not consistent with CAL-51 and T47D, also not consistent with the data in Figures 2B-C.

      (3) To support the authors' conclusion in Figure 5, an additional animal experiment performed by tail vein injection would be helpful.

      (4) Page 14, to evaluate the combination result of MK2i and vinblastine, an in vivo animal assay must be performed.

      (5) The authors used RNA-seq to show some pathways affected by CMPD1. What are the key/top genes that were affected? How about the mechanism?

      (6) Line 127, more experiments should be involved to support the conclusion.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. Point-by-point description of the revisions

      • *Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors present the use of previously identified biosensors in a single-molecule concentration regime to address lipid effector recruitment. Using controlled and careful single-cell based analysis, the study investigates how expression of the commonly used PIP3 sensor based on Akt-PH domain interferes with the native detection of PIP3. Predominantly live-cell fluorescence microscopy coupled to image analysis drives their studies.

      Conceptually, this manuscript carefully and quantitatively describes the influence of lipid biosensor overexpression and presents a means to overcome the inherent and long-recognized problems therein. This solution, namely employing low expression of the lipid biosensor, should be generally applicable. The work is of general interest to cell biologists focused on answering questions at membranes and organelles, including especially those interested in lipid-mediated signaling transductions.

      Reviewer 1 Major:

      #1.1 The terminology "single molecule biosensor" is not really appropriate. A protein is not "single-molecule". An enzyme does not "single molecule". Better is biosensors at single-molecule expression levels. In most cases, this should be changed. Single-molecule vs single-cell vs. bulk measurements are often poorly defined in quantifications and conflating these does not help the case, which is already supported by generally clear data.

      We appreciate the reviewer’s thoughtful critique of our grammatically incorrect use of jargon; we saw this as soon as they mentioned it! We have amended the manuscript where appropriate as detailed:

      • Title is now changed to “Lipid Biosensors Expressed at Single Molecule Levels Mitigates Inhibition of Endogenous Effector Proteins”
      • Last paragraph of the introduction on __ 2__ now reads “As well as alleviating inhibition of PI3K signaling, biosensors expressed at these low levels show improved dynamic range and report more accurate kinetics than their over-expressed counterparts."
      • The title of the results section on __ 6__ is now: Mitigating PIP3 competition using biosensors expressed at single molecule levels
      • Last paragraph of the results section on 6 now reads: “this showed that when expressed at single molecule levels, the biosensor has substantially better dynamic range”. #1.2 Figure 1D-F, images not as clearly describing quantitation as one would hope. Untransfected cells in 1E should demonstrate more translocated Akt-pS473 than transfected, but it is difficult for this reviewer to find. Consider inset images in addition to the wider field. Consider also moving the "negative" data of Fig 1B-C to Supplement.

      We regret not making this figure easier to interpret; we have substantially updated the figure, as comprehensively detailed in our point-by-point response to reviewer 2’s point 2.3. To specifically address this reviewer’s concerns:

      The older figure used non-confocal, low-resolution images that were used for quantification. Such an approach was employed to enable fluorescence from the entire cellular volume to be captured, which produces more robust quantification. However, to the reviewer’s point, it is not possible to see the translocation of PH-AKT1 nor translocated AKT-pS473 in these images. Fortunately, we had in parallel captured high resolution confocal images for some experiments. These are now shown in Fig 1D-E, which clearly shows translocated AKT-pS473 and PH-AKT-EGFP

      #1.3 The cell line being used is not clearly specified after the initial development of the NG1 followed by CRISPRed NG2 onto Akt. For example, for the Figure 3C experiments, the text states "complete ablation of endogenous AKT1-NG2" but this information is not apparent from the figure legend or figure. Throughout the cell line used and the aspects transfected need to be made explicitly clear.

      We are grateful to the reviewer for highlighting this ambiguity. We have now defined the gene-edited cells used throughout as “AKT1-NG2 cells” and expressly used this term when referring to experiments in figures 2-5.

      #1.4 Fig. 5 shows single cells. It is therefore unclear if broken promoters have resulted in decreased expression. This point is important because the expression plasmids should be made publicly available, and for their use to be understood properly, this must be clarified. The details of the plasmids are unclear. Perhaps listed in the table? - unclear. This aspect would be important for the field to effectively use the reagents.

      Thank you for drawing our attention to the lack of adequate detail here. We have now updated the results text to expressly reference Morita et al., 2022 where the origins of the truncated CMV promoters are detailed. We have also updated the plasmids table 1 to add pertinent details for these constructs: *pCMVd3 plasmids are based on the pEGFP-C1 backbone, with the CMV promoter truncated to remove 18 of the 26 putative transcription factor binding sites in the human Cytomegalovirus Major Intermediate Enhancer/Promoter (pCMV∆3 as described in Morita et al., 2012). The full sequences will be deposited with the plasmids on Addgene.

      We did not perform a formal comparison of full vs truncated promoters. Our only observation is that the truncated promoters greatly help in increasing the number of expressing cells presenting single-molecule resolvable expression levels (though the approach can still work with full promoters).

      #1.5 This manuscript speculates several times that with more abundant PIs like PI45P2, the observed saturation effect is probably not happening. This should be removed. While the back of envelope calculations may reflect an ideal scenario, the heterogeneity of distribution and multiple key cellular structures involved would seem to corral increased PI45P2 levels in certain regions. These factors amid multivalency and electrostatic mechanisms of lipid effector recruitment (e.g. MARCKS) suggest that speculation may be too strong. Moreover, Maib et al JCB 2024 demonstrated PI4P probe overexpression could directly mask the ability to detect PI4P post-fixation - not fully, but partially. Repeating the titration experiments of this manuscript for multiple PIs is entirely beyond the scope of reasonable, and hence, such experiments are not requested, in favor of adopting more conscientious speculation.

      The reviewer’s point is well taken. Whilst we still believe the overall argument for lipids is sounds (for example, PS or cholesterol are far too abundant for any expressed, stoichiometric binding protein to bind the majority of the population) even abundant phosphoinositides like PI4P and PI(4,5)P2 are an edge case. We have therefore undated the first paragraph of the introduction on __p. 1 __to be less explicit: One of the most prominent is the fact that lipid engagement by a biosensor occludes the lipid’s headgroup, blocking its interaction with proteins that mediate biological function. It follows that large fractions of lipid may be effectively outcompeted by the biosensor, inhibiting the associated physiology. We have argued that, in most cases, this is unlikely because the total number of lipid molecules outnumbers expressed biosensors by one to two orders of magnitude (Wills et al., 2018). However, for less abundant lipids, total molecule copy numbers may be in the order of tens to hundreds of thousands, making competition by biosensors a real possibility.

      We also removed the explicit discussion of PI(4,5)P2 from the introduction, and focus now solely on the PI3K lipids.

      Reviewer 1 Minor:

      1.6 Schematics throughout need simplification, enabling their enlargement.

      We have now enlarged the size of all schematics

      #1.7 Numerous spelling (Fig. 4 schemas) and capitalizations need fixing.

      Thank you for drawing our attention to these. We have thoroughly proof-read the figure panels and corrected errors.

      #1.8 Pg 1 Famous is not appropriate wording

      We respectfully beg to differ with the reviewer here. We believe it is perfectly accurate to state that PIP3 is a second messenger molecule that is known about by many people; we see this as the dictionary definition of the word “famous”.

      #1.9 Fig. 1A statistical testing of microscopy quantifications absent (generally, throughout) and should be included.

      This was indeed an oversight on our part. We have now added appropriate multiple comparisons tests to the data presented in figures 1F, 3F, 4C, 4F and 5C.

      #1.10 Fig.1. In a transient transfection, the protein expression is not uniform. Please explain how you normalized the quantification.

      We hope this is now clarified by the expanded “Image Analysis” part of the methods section on pp. 10-11 (relevant sentence is underlined): For immunofluorescence, we identified individual cells by auto thresholding the DAPI channel using the “Huang” method, followed by the Watershed function to segment bunched cells that appeared to touch. We then used the Voronoi function to generate boundary lines for the segmentation of the cells. To identify cytoplasm, auto thresholding of the CellMask channel using the “Huang” function was employed, with the cells segmented by adding the nuclear Voronoi boundaries. The “analyze particles” function was then used to identify individual cellular ROIs that were greater than 10 µm2 and were not touching the image periphery. These ROIs were used to measure the raw 12-bit intensity of the EGFP and AKT-pS473 channels. A cutoff of EGFP > 100 was used to define EGFP-positive cells, since this value was greater than the mean ± 3 standard deviations of the non-transfected cells’ EGFP intensity. Background intensity of AKT-pS473 was estimated from control cells subject to immunofluorescence in the absence of AKT-pS473 antibody; this value was subtracted from the measured values of all other conditions.

      #1.11 Fig. 1D. EGFP expression levels increased with EGF stimulation. How is this possible?

      There appeared to be a difference due to the presence of 5 strongly expressing cells in the chosen field in the original field for the EGF stimulated, EGFP cells. However, this arose just by chance. The new set of high-resolution images in the new figure 1 were selected to be more representative.

      #1.12 Fig. 1D. The images have pS473 whereas the y-axis label on box plots has p473. Can these box plots be labelled separately for consistency?

      Thank you. This has now been corrected in the revised Figure 1.

      #1.13 Fig.1. T308 phosphorylation is mentioned in Figure 1, but only pS473 data is shown.

      Both T308 and S473 phosphorylation are indicative of AKT activation. However, antibodies suitable for immunofluorescence are only available for pS473, hence why our experiments are restricted to this moiety.

      #1.14 Fig.1 legend. 'Over-expression of PH-AKT is hypothesised to outcompete the endogenous AKT's PH domain'. Why do you need to state a hypothesis in the legend?

      We included this statement for the benefit of the casual reader – i.e. one who looks at the pictures, but doesn’t read the main text!

      #1.15 Fig.1E You stated that the PH-AKT R25C-EGFP is stimulated by EGF addition. However, the GFP signal looks the same in both unstimulated and stimulated. Could you please clarify? Are you sure that the stimulation worked?

      We have clarified the second paragraph of the results section “Inhibition of AKT activation by PIP3 biosensor”__on __p. 4 as follows: In the non PIP3 binding PH-AKT1R25C-EGFP positive cells, we still observed an increase in pS473 intensity.

      The revised figure 1 images also show that PH-AKT1R25C does not translocate to the membrane with EGF stimulation.

      #1.16 You mention...that the AKT enzyme is activated by PDK1 and TORC2, which phosphorylate at residues T308 and S473, respectively. Phosphorylation is also known to occur on T450 at c-tail. Does this phosphorylation also contribute to its activation?

      Yes and no. Threonine 450 phosphorylation is thought to occur co-translationally and is important for AKT stability (see Truebestein et al as cited in the manuscript). It is not really relevant in the context for T308 and S473, which are phosphorylated acutely to activate the protein.

      #1.17 Fig. 1 scale bar in all images equivalent?

      We have now added scale bars to panels in both figure 1D and E to clarify.

      __#1.18 __Pg. 1 paragraph 1 "we have argued..." vs. paragraph 3"...consider that an..." feels like arguing with themselves.

      We believe the re-write we have done in response to major point #1.5 clarifies this point also.

      #1.19 Pg. 1 para 3 what is RFC score - must explain

      We have now defined this more clearly in third __paragraph of the __introduction on p. 1: PH domain containing PIP3effector proteins can be predicted based on sequence comparison to known PIP3 effectors vs non effectors using a recursive functional classification matrix for each amino acid (Park et al., 2008).

      #1.20 Discussion of numbers of PIP3 vs. effectors etc may not be appropriate for the introduction, as the points made by these calculations are already made in the previous paragraphs. May fit better in pg 6 Mitigating PIP3 titration... with an accompanying schematic.

      Respectfully, we prefer to keep this discussion of molecular concentrations, as this adds details and specifics to the pathway that is core to the paper.

      #1.21 Pg 2 "a neonGreen" not well defined, needs accurate description.

      We have clarified this in the sentence in the first paragraph of the results section “Genomic tagging of AKT1…” __on __p. 4, which includes the citation to the full description of the tag: To that end, we used gene editing to incorporate a bright, photostable neonGreen fluorescent protein to the C-terminus of AKT1 via gene editing using a split fluorescent protein approach (Kamiyama et al., 2016).

      #1.22 Fig 2C should give a unstimulated trajectory of puncta/100 um2 to compare with the stimulated

      Unfortunately, we did not record a full 5.5-minute video-rate time-lapse with unstimulated cells. However, we do not believe this control is essential for this experiment, since this example data is included to illustrate (1) the problem of photobleaching, which is clear in the 30-s pre-stimulus and (2) the variability in the raw molecule counts.

      #1.23 Fig 2C and F and G should be systematized for easier comparison. E.g. min vs seconds, 0 timepoint of EGF/rapa addition

      We have made the adjustment to figure 2C to be consistent with 2F and G:

      #1.24 Pg 5 "...and calibrated them..." unclear what is being calibrated, as the text later states that the histograms are fit to monomer/dimer/multimer model resulting in 98.1% in monomer. Minor point.

      We have clarified this point in the second paragraph of the results section “__Genomic tagging of AKT1…” __on __p. 4 __as follows: We analyzed the intensity of these spots and compared them to intensity distributions from a known monomeric protein localized to the plasma membrane (PM) and expressed at single molecule levels

      #1.25 Explain why baselines in Fig2CFG are different

      We did not comment on figure 2C; it is a single cell measurement, as opposed to the mean of 20 cells reported in F. However, we do now clarify the difference between figure 2F and G as the very end of the “Genomic tagging of AKT1…” results section on p 4: Notably, baseline AKT-NG2 localization increased from ~5 to ~15 per 100 µm2 in iSH2 cells, perhaps because the iSH2 construct does not contain the inhibitory SH2 domains of p85 regulatory subunits, producing higher basal PI3K activity.

      #1.26 Fig. 2 has quantification with images; Fig. 3 has it separate. Make consistent.

      We sometimes combine images with quantification, and other times separate the panel containing graphs. This is done deliberately, depending on whether the reader is directed to both together, or whether we consider the data separately in the results section.

      #1.27 Fig. 3B comes before images? Where are the images? Also, y-axis = Intensity (a.u.). Is intensity just full image field? Or per cell? All very unclear.

      We have modified both the graph y-axis label and the figure legend to clarify: (C) TIRF imaging of AKT1-NG2 cells from (B) stimulated with 10 ng/ml EGF

      #1.28 Fig. 3C missing images

      We believe the reviewer is referring to the mCherry channel for the “0 ng cDNA” condition. These images are missing because they do not exist. Since these cells were transfected with pUC19, there was no mCherry fluorescence to image.

      #1.29 Fig 3 C needs brightness/contrast adjusted as images are nearly entirely black (zero values).

      We believe the addition of insets addresses this concern. To the reviewer’s specific suggestion, we found that further increases in the brightness and contrast will bring up the camera noise, but this then occludes the signal from single molecules, such as those found after EGF stimulation of the 0 ng condition.

      #1.30 Fig 3C needs scale bar systemization

      We believe that the incorporation of scaled 6 µm insets addresses this point.

      #1.31 Fig 4 needs 4 panels A-D

      We have now added these individual panel labels to figure 4.

      #1.32 Pg 6 5-OH phosphatases needs reference

      We have added a citation to Trésaugues at the very end of the “Sequestration of PIP3 by lipid biosensors” results section on p. 6, which describes the activity of the whole 5-OH phosphatase activity against PIP3, not just the SHIP phosphatases.

      #1.33 Fig 5B, make images bigger

      Again, we trust that the addition of insets to all single molecule images has addressed this point.

      Reviewer 1 Referees cross-commenting**

      I have read the other reviews and find them entirely reasonable. My impression is we landed on similar general content that needs work, none of which is out of line. The importance and care taken in the author's work is uniformly lauded.

      We agree. At the risk of restoring to alliteration, we have been delighted to receive a trio of clear, concise and consistent comments on the manuscript! We believe it is now much improved.

      Reviewer #1 (Significance (Required)):

      This manuscript clearly and reasonably demonstrates that the commonly used PIP3 sensor can be titrated to low concentrations, at which it does not interfere with Akt translocation and activation. This work is a good technical reference for the field. Signal transduction and membrane biologists should be especially interested in the data. The reviewer/s have core expertise in phosphoinositides, protein biochemistry, cell biology, and membrane biophysics.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors characterize the inhibition of lipid second messenger mediated cell signaling through lipid biosensors that outcompete endogenous effector proteins. This is a very important study that as it quantitatively assesses an issue that many people suspected to exit, yet never properly characterized. This paper is therefore as much a service to the community as a research study in its own right and should be published without undue delay. I am glad that the authors decided to carry out this study & really appreciate their work.

      I do however, have a number of suggestions that I think will make the manuscript stronger and can be readily implemented, mostly by reformulating and/or re-analysis of exiting datasets. I've structured my comments by the datasets in the respective figures to follow the logic of the paper.

      Reviewer 2 Major:

      #2.1 Throughout the manuscript, statistical tests are missing, e.g. in figures 1C-F. This must be amended in the revised version. The authors are making a very quantitative point about buffering, data should be treated accordingly.

      We have now added appropriate multiple comparisons tests to figures 1F, 3F, 4C, 4F and 5C.

      #2.2 I do not think that "PIP3 titration" is the best term to describe the observed effect. "Titration" usually implies the controlled modulation of a concentration, e. g. in analytical chemistry. I think either "competitive binding of PIP3" or "buffering of free PIP3" are more adequate.

      This point is well taken. We have now replaced the word “titration” throughout, replacing it with either “competitive binding” or “sequestration”.

      #2.3 Specific comments: Figure 1

      #2.3a Why are data in 1D-Ff shown as median, with interquartile ranges and 10-90 percentile distance when everything else in the paper is mean +/- se? There might be a good reason for it, but I did not find it mentioned everywhere

      For consistency’s sake, we have changed figure 1F to show a bar graph, though as noted in the figure legend: Graphs show medians ± 95% confidence interval of the median from 82-160 cells pooled from three experiments (medians are reported since the data are not normally distributed).

      #2.3b The authors should test, whether the difference between the +EGF conditions in 1D (EGFP) and 1F (PH-AktR25C-EGFP) is indeed statistically significant. If this observation holds up, what does it mean? Is the mutant still competing with endogenous Akt despite the much-reduced binding affinity? The authors should discuss.

      We have re-analyzed the data in figure 1, with the quantitative data presented in figure 1F combined with statistical analysis. The new data shows no significant effect of the PH-AKT1R25C mutant in either resting or EGF stimulated condition

      There results are also described in the__ second paragraph__ of the first results section on pp. 3-4: This analysis showed that the R25C mutant had no substantial effect on pS473 levels, whereas wild-type PH-AKT greatly inhibited pS473 staining in EGF-stimulated cells as well as reducing basal levels in serum starved cells (Fig. 1F).

      #2.3c How were biosensor/GFP positive cells chosen? Did the authors choose a defined fluorescence intensity cut-off? I think that a pure manual selection is problematic from a methodological point of view as this may introduce biases. Since the authors use Fiji, they can also simply use the "Analyze particles" function, which allows to automatically segment cells from a thresholded image. By choosing the same threshold for all images, it would be ensured that all images are treated exactly the same way.

      We had initially opted for manual outlining of cells since automatic segmentation of irregularly-shaped HEK293a cells is imperfect. However, we agree with André that this opens the possibility of bias. We have therefore re-run the analysis with an automated segmentation and thresholding approach, as suggested. This is detailed in the__ second paragraph__ of the first results section on pp. 3-4: In parallel, we imaged cells with a low resolution 0.75 NA air objective to capture fluorescence from the cells’ entire volume, then quantified these images using an automatically determined threshold for GFP-positive cells (see Materials and Methods). This analysis showed that the R25C mutant had no substantial effect on pS473 levels, whereas wild-type PH-AKT greatly inhibited pS473 staining in EGF-stimulated cells as well as reducing basal levels in serum starved cells (Fig. 1F).

      Further detail is provided in the first paragraph of the “Image analysis” subsection of the methods on pp. 10-11: For immunofluorescence, we identified individual cells by auto thresholding the DAPI channel using the “Huang” method, followed by the Watershed function to segment bunched cells that appeared to touch. We then used the Voronoi function to generate boundary lines for the segmentation of the cells’ cytoplasm. To identify cytoplasm, auto thresholding of the CellMask channel using the “Huang” function was employed, with the images segmented by adding the nuclear Voronoi boundaries. The “analyze particles” function was then used to identify individual cellular ROIs that were greater than 10 µm2 and were not touching the image periphery. These ROIs were used to measure the raw 12-bit intensity of the EGFP and AKT-pS473 channels. A cutoff of EGFP > 100 was used to define EGFP-positive cells, since this value was greater than the mean ± 3 standard deviations of the untransfected cells’ EGFP intensity. Background intensity of AKT-pS473 was estimated from control cells subject to immunofluorescence with the AKT-pS473 antibody omitted; this value was subtracted from the measured values of all other conditions.

      #2.3d I am missing a statement in the methods section that all images were acquired using the same settings.

      This was indeed an important oversight on our part – thanks for spotting the omission of this crucial detail. This is now included at the end of the “Immunofluorescence” section of the Methods on pp. 9-10: Identical laser excitation power, scan speeds and photomultiplier gains were used across experiments to enable direct comparison.

      #2.3e I recommend that the authors include a single cell correlation plot of EGFP fluorescence intensity vs AktpS473 intensity in Figure 1 D-F. This should be rather informative & make the concentration dependence clear.

      We did not observe a strong correlation between PH-AKT1-EGFP intensity and pS473 staining, likely driven by both the imprecision of the cell segmentation and the fact that very low concentrations of PH domain effectively inhibit endogenous AKT1 (as we show in the later figures with the more precise, live cell AKT-NG2 recruitment experiments: see response to #2.5).

      #2.3f I further recommend that the authors look at alterations of baseline Akt activity in the presence of the biosensor. In the images it looks like there might be an effect, but this is then lost in the analysis due to the normalization.

      As covered in our response to #2.3b, there is indeed an inhibition of baseline pS473 in PH-AKT1-EGFP expressing cells, now explicitly quantified and documented in results.

      #2.3g Please include zoomed image insets in Fig. 1D-F, in the current magnification one needs to zoom in quite a bit to see the effect in the raw data. It is a clear effect, but having a zoomed version would make for much easier reading.

      We now include high-resolution confocal images instead of low power, low NA volumes as shown in the last version of the manuscript, which we believe addresses this point and also reviewer #1.2.

      2.3h Up to the authors: I wonder whether it is possible to extract an IC50 value for the competitive inhibition of Akt by the respective biosensors. The transient expression gives the authors access to a wide range of expression levels at the single cell level, which could be quantified by counterstaining with a EGFP-nanobody at a different color (since the EGFP fluorophore went through the fixation process, it is likely unsuitable for quantification) and microscope calibration. Activity could be quantified as the ratio of observed and expected Akt-pS473 fluorescence (derived from the mean FI per cell from the EGFP control). This is not strictly necessary, but would be a beautiful quantitative experiment, give an easy-to-understand number & make the paper much stronger.

      This is a great suggestion, but does not produce precise enough data to work out, as we detail in response to #2.3e. From our data in new figure 3F and figure 5, it seems we have not explored the appropriate expression range to see intermediate levels of inhibition necessary to estimate IC50. This would be a cool experiment though!

      __#2.4 __Specific comments: Figure 2. Overall, compelling data. However, 25 molecules/100 um^2 at maximal recruitment feels low. Assuming a total cell surface area of appr. 2000 um^2 per cell and taking a baseline of 5 molecules/100 um^2 into account, this would mean that only about 400 copies of Akt are recruited in response to a pretty robust stimulus. Is it possible that the association reaction of the split GFP is not complete under these conditions? I think that a direct measurement of intracellular endogenous Akt concentration is required to put these numbers into context.

      This is an excellent point that we had missed. We now specifically address this point in the third paragraph of the “Genomic tagging of AKT…” section on p. 4: __Accumulation of AKT-NG2 was ~25 molecules per 100 µm2, which assuming a surface area of ~1,500 µm2 per cell corresponds to ~375 molecules total. It should be noted that tagging likely only occurred at a single allele in each cell, and the population still exhibited expression of non-edited AKT1 (__Fig. 2B). Given that HEK293 are known to be pseudotriploid (Bylund et al., 2004), the true number of AKT1 molecules would be at least 1,125. However, given an estimated total copy number of 23,000 AKT1 in these cells (Cho et al., 2022), this is still only about 5%. However, we do not interpret these raw numbers due to uncertainties in the efficiency of NG2 complementation under these conditions, as well as potential for reduced expression from the edited allele.

      We also removed the specific comment on molecule density from the abstract.

      #2.5 Specific comments: Figure 3 I think that the classification by plasmid dose does not make a lot of sense, as the resulting expression levels are rather similar. I suggest to pool all traces and calculate mean curves by actual expression levels using a binning approach (e.g. 0-50 au, 50-100 au and so on in raw intensity from Figure 3b). If there is an effect in the realized concentration regime, this should pick it up.

      This is an excellent suggestion, and we have done just that: thank you! The data is now included as a new panel Fig. 3F. The result is described in the results section, “Sequestration of PIP3 by lipid biosensors”, end of the first paragraph on pp. 4-6: To observe the concentration-dependence of AKT1-PH-mCherry inhibition, we pooled the single cell data from these experiments and split transfected cells into cohorts based on raw expression level (excitation and gain were consistent between experiments, allowing direct comparison). This analysis showed profound inhibition of AKT1-NG2 recruitment at all expression levels, with a slightly reduced effect only visible in the lowest expressing cohort (Fig. 2F).

      #2.6 Specific comments: Figure 5 These are very interesting data, in particular with regard to the underlying PIP3 dynamics. I agree with the conclusion of the authors that shielding of PIP3 from degradation is the likely culprit. What I would like to see here is actual kinetic fits - and different terms. On- and off-rate imply biosensor binding, but these are likely rather fast and not on the minute-timescale. The detected processes are much more likely to reflect production and degradation of PIP3 and that should be reflected in the terminology. For the fit: I think that a simple rate law for subsequent reactions ([PIP3]=C(e^-k1t-e^k2t)) will give good results and yield effective rate constants for PIP3 generation and degradation. This implies the quasi-steady state assumption for biosensor binding and implies that [PIP3] is proportional to the biosensor bound [PIP3], but these are reasonable assumptions to make.

      The is an excellent suggestion, which we have added. Specifically, fits are now present on Figs. 5G and 5I; we describe these in the last paragraph of results on p. 8: Normalizing data from both expression modes to their maximum response (Fig. 5G) and fitting kinetic profiles for cooperative synthesis and degradation reactionsrevealed the rate of synthesis is remarkably similar: 1.09 min–1 (95% C.I. 1.02-1.17) for single molecule expression vs 1.02 min-1 (95% C.I. 0.98-1.06) for over-expression. On the other hand, degradation slowed with over expression from 0.34 min–1 (95% C.I. 0.24-0.58) to 0.13 min–1 (95% C.I. 0.12-0.15). This is expected, since synthesis of PIP3molecules would not be prevented by biosensor. On the other hand, PIP3 degradation could be slowed by the over-expressed biosensor competing with PTEN and 5-OH phosphatases that degrade PIP3. An even more exaggerated result is achieved with the cPHx1 PI(3,4)P2 biosensor; this shows an increase in fold-change over baseline of 600% for single molecule expression levels, compared to only 100% in over-expressed cells (Fig. 5H). Again, the degradation rate of the signal is substantially slowed by the over-expressed sensor, reducing from 0.27 min–1 (95% C.I. 0.22-0.39) to 0.16 min–1 (95% C.I. 0.14-0.19), whereas synthesis remains only minorly impacted, changing from 0.61 min–1 (95% C.I. 0.57-0.64) to 0.54 min–1 (95% C.I. 0.52-0.56) with over-expression (Fig. 5I). Collectively, these data show that single molecule based PI3K biosensors show improved dynamic range and kinetic fidelity compared to the same sensors over-expressed.

      Details of the fits are given in a new methods section on p. 11:

      Fitting of reaction kinetics

      Curve fitting was performed in Graphpad Prism 9 or later. For the data presented in Figs. 5G and 5I, both synthesis and degradation phases displayed clear “s” shaped profiles not well fit by simple first order kinetics. Since activation of the PI3K pathway involves many multiplicative interactions between adapters and allosteric activation of the enzymes themselves, we assumed cooperativity and fit reactions with the two phase reaction as follows:

      Where Ft denotes ∆Ft/∆FMAX, nsyn and ndeg are the Hill coefficients of the respective synthesis and degradation reactions, and the rate constants for the reactions are derived from ksyn = 1/τsyn and kdeg = 1/τdeg.

      André Nadler

      Reviewer #2 (Significance (Required)):

      This is an important paper, analyses the effects of over-expressed lipid biosensors on cell signalling in some detail and will be of significant interest to a broad readership.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This is essentially a methods paper in which the authors provide a detailed and highly quantitative analysis of the potentially deleterious effects of expressing phosphoinositide-binding domains as biosensors. Specifically, they study the effects on PIP3 signalling, using biosensors that are widely used in the field.

      They show that the most-commonly used method of expressing PIP3 biosensors using transient transfection with viral promotors has clear deleterious effects on downstream signalling due to out-competing the endogenous effectors. Importantly, they also describe a new approach to overcome this by developing new plasmids and methodology to express these reporters at low levels.

      Reviewer 3 Major comments:

      The work in this paper is thorough and very nicely done. I particularly appreciate the efforts to quantitate or estimate actual numbers and densities of molecules, which significantly strengthen their arguments. The data are excellent and strongly support all their conclusions. I would therefore be happy to see this work published in its current form.

      Reviewer 3 Minor comments:

      I only have some minor and optional suggestions for improvement.

      #3.1 In figure 1D-F they show that PH-Atk-EGFP expression can suppress downstream Akt activation by quantifying P-Akt signal my microscopy. In these panels they say tgey selectively measure this in GFP-expressing cells, but it is not clear how they define which cells are expressing GFP - was a threshold used? Also, it would be nice to also measure both PH-Akt-GFP and P-Akt staining by flow cytometry to look for a correlation. Is there a threshold of biosensor expression that blocks downstream signalling, or is there a linear relationship? This might help specifically measure how much biosensor is too much.

      This is an important comment, also raised by reviewer 2. We provide a detailed explanation and outline revisions that address this in our response to reviewer #2.3c; essentially, we replaced the analysis with an automated segmentation and quantification, estimating GFP-positive cells from a fraction of non transfected cells. We have not performed a FACS analysis, but as we note in our response to #2.3e __and #2.3h, the correlation between EGFP and pAKT staining is imprecise in these experiments. The new __Fig. 3C does address this point for AKT1-NG2 recruitment, as described in our response to #2.5.

      #3.2 Some of their microscopy images (e.g. Fig 1D-F, Fig 5) are very small and would benefit from a zoom box - especially when they are trying to demonstrate single molecule detection.

      This is a fair point raised by all of the reviewers in one form or another. We have added zoomed insets to all of the single molecule images in Figs 2-5, and added higher magnification, confocal section images to Fig. 1.

      Reviewer #3 (Significance (Required)):

      This is both a methods paper and cautionary tale for cell biologists working in this field. Whilst everyone who uses these probes should be aware of the potential risk of biosensors titrating our effectors, this is often not sufficiently acknowledged. This paper is a very nice and clear demonstration of these risks, exemplified with probably the most highly-used biosensor and key downstream signalling pathway.

      Whilst the concepts presented are not especially novel, this paper nonetheless makes an important contribution to the community and hopefully will make others more cautious in how they use these biosensors.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors present the use of previously identified biosensors in a single-molecule concentration regime to address lipid effector recruitment. Using controlled and careful single-cell based analysis, the study investigates how expression of the commonly used PIP3 sensor based on Akt-PH domain interferes with the native detection of PIP3. Predominantly live-cell fluorescence microscopy coupled to image analysis drives their studies.

      Conceptually, this manuscript carefully and quantitatively describes the influence of lipid biosensor overexpression and presents a means to overcome the inherent and long-recognized problems therein. This solution, namely employing low expression of the lipid biosensor, should be generally applicable. The work is of general interest to cell biologists focused on answering questions at membranes and organelles, including especially those interested in lipid-mediated signaling transductions.

      Major:

      1. The terminology "single molecule biosensor" is not really appropriate. A protein is not "single-molecule". An enzyme does not "single molecule". Better is biosensors at single-molecule expression levels. In most cases, this should be changed. Single-molecule vs single-cell vs. bulk measurements are often poorly defined in quantifications and conflating these does not help the case, which is already supported by generally clear data.
      2. Figure 1D-F, images not as clearly describing quantitation as one would hope. Untransfected cells in 1E should demonstrate more translocated Akt-pS473 than transfected, but it is difficult for this reviewer to find. Consider inset images in addition to the wider field. Consider also moving the "negative" data of Fig 1B-C to Supplement.
      3. The cell line being used is not clearly specified after the initial development of the NG1 followed by CRISPRed NG2 onto Akt. For example, for the Figure 3C experiments, the text states "complete ablation of endogenous AKT1-NG2" but this information is not apparent from the figure legend or figure. Throughout the cell line used and the aspects transfected need to be made explicitly clear.
      4. Fig. 5 shows single cells. It is therefore unclear if broken promoters have resulted in decreased expression. This point is important because the expression plasmids should be made publicly available, and for their use to be understood properly, this must be clarified. The details of the plasmids are unclear. Perhaps listed in the table? - unclear. This aspect would be important for the field to effectively use the reagents.
      5. This manuscript speculates several times that with more abundant PIs like PI45P2, the observed saturation effect is probably not happening. This should be removed. While the back of envelope calculations may reflect an ideal scenario, the heterogeneity of distribution and multiple key cellular structures involved would seem to corral increased PI45P2 levels in certain regions. These factors amid multivalency and electrostatic mechanisms of lipid effector recruitment (e.g. MARCKS) suggest that speculation may be too strong. Moreover, Maib et al JCB 2024 demonstrated PI4P probe overexpression could directly mask the ability to detect PI4P post-fixation - not fully, but partially. Repeating the titration experiments of this manuscript for multiple PIs is entirely beyond the scope of reasonable, and hence, such experiments are not requested, in favor of adopting more conscientious speculation.

      Minor:

      1. Schematics throughout need simplification, enabling their enlargement.
      2. Numerous spelling (Fig. 4 schemas) and capitalizations need fixing.
      3. Pg 1 Famous is not appropriate wording
      4. Fig. 1A statistical testing of microscopy quantifications absent (generally, throughout) and should be included.
      5. Fig.1. In a transient transfection, the protein expression is not uniform. Please explain how you normalized the quantification.
      6. Fig. 1D. EGFP expression levels increased with EGF stimulation. How is this possible?
      7. Fig. 1D. The images have pS473 whereas the y-axis label on box plots has p473. Can these box plots be labelled separately for consistency?
      8. Fig.1. T308 phosphorylation is mentioned in Figure 1, but only pS473 data is shown.
      9. Fig.1 legend. 'Over-expression of PH-AKT is hypothesised to outcompete the endogenous AKT's PH domain'. Why do you need to state a hypothesis in the legend?
      10. Fig.1E You stated that the PH-AKT R25C-EGFP is stimulated by EGF addition. However, the GFP signal looks the same in both unstimulated and stimulated. Could you please clarify? Are you sure that the stimulation worked?
      11. You mention...that the AKT enzyme is activated by PDK1 and TORC2, which phosphorylate at residues T308 and S473, respectively. Phosphorylation is also known to occur on T450 at c-tail. Does this phosphorylation also contribute to its activation?
      12. Fig. 1 scale bar in all images equivalent?
      13. Pg. 1 paragraph 1 "we have argued..." vs. paragraph 3"...consider that an..." feels like arguing with themselves.
      14. Pg. 1 para 3 what is RFC score - must explain
      15. Discussion of numbers of PIP3 vs. effectors etc may not be appropriate for the introduction, as the points made by these calculations are already made in the previous paragraphs. May fit better in pg 6 Mitigating PIP3 titration... with an accompanying schematic.
      16. Pg 2 "a neonGreen" not well defined, needs accurate description.
      17. Fig 2C should give a unstimulated trajectory of puncta/100 um2 to compare with the stimulated
      18. Fig 2C and F and G should be systematized for easier comparison. E.g. min vs seconds, 0 timepoint of EGF/rapa addition
      19. Pg 5 "...and calibrated them..." unclear what is being calibrated, as the text later states that the histograms are fit to monomer/dimer/multimer model resulting in 98.1% in monomer. Minor point.
      20. Explain why baselines in Fig2CFG are different
      21. Fig. 2 has quantification with images; Fig. 3 has it separate. Make consistent.
      22. Fig. 3B comes before images? Where are the images? Also, y-axis = Intensity (a.u.). Is intensity just full image field? Or per cell? All very unclear.
      23. Fig. 3C missing images
      24. Fig 3 C needs brightness/contrast adjusted as images are nearly entirely black (zero values).
      25. Fig 3C needs scale bar systemization
      26. Fig 4 needs 4 panels A-D
      27. Pg 6 5-OH phosphatases needs reference
      28. Fig 5B, make images bigger

      Referees cross-commenting

      I have read the other reviews and find them entirely reasonable. My impression is we landed on similar general content that needs work, none of which is out of line. The importance and care taken in the author's work is uniformly lauded.

      Significance

      This manuscript clearly and reasonably demonstrates that the commonly used PIP3 sensor can be titrated to low concentrations, at which it does not interfere with Akt translocation and activation. This work is a good technical reference for the field. Signal transduction and membrane biologists should be especially interested in the data. The reviewer/s have core expertise in phosphoinositides, protein biochemistry, cell biology, and membrane biophysics.

    1. Reviewer #2 (Public review):

      Summary:

      This study presents a valuable finding that the activation of TFEB by sulforaphane (SFN) could promote lysosomal exocytosis and biogenesis in NPC, suggesting a potential mechanism by SFN for the removal of cholesterol accumulation, which may contribute to the development of new therapeutic approaches for NPC treatment.

      Strengths:

      The cell-based assays are convincing, utilizing appropriate and validated methodologies to support the conclusion that SFN facilitates the removal of lysosomal cholesterol via TFEB activation.

      Weaknesses:

      (1) The in vivo experiments demonstrate the therapeutic potential of SFN for NPC. A clear dose-response analysis would further strengthen the proposed therapeutic mechanism of SFN. Additional data supporting the activation of TFEB by SFN for cholesterol clearance in vivo would strengthen the overall impact of the study

      (2) In Figure 4, the authors demonstrate increased lysosomal exocytosis and biogenesis by SFN in NPC cells. Including a TFEB-KO/KD in this assay would provide additional validation of whether these effects are TFEB-dependent.

      (3) For lysosomal pH measurement, the combination of pHrodo-dex and CF-dex enables ratiometric pH measurement. However, the pKa of pHrodo red-dex (according to Invitrogen) is ~6.8, while lysosomal pH is typically around 4.7. This discrepancy may account for the lack of observed lysosomal pH changes between WT and U18666A-treated cells. Notably, previous studies (PMID: 28742019) have reported an increase in lysosomal pH in U18666A-treated cells.

      (4) The authors are also encouraged to perform colocalization studies between CF-dex and a lysosomal marker, as some researchers may be concerned that NPC1 deficiency could reduce or block the trafficking of dextran along endocytosis.

      (5) In vivo data supporting the activation of TFEB by SFN for cholesterol clearance would significantly enhance the impact of the study. For example, measuring whole-animal or brain cholesterol levels would provide stronger evidence of SFN's therapeutic potential.

    2. Reviewer #3 (Public review):

      Summary:

      The authors demonstrate that activation of TFEB facilitates cholesterol clearance in cell models of Niemann-Pick type C (NPC). This is done through a variety of approaches including activation of TFEB by sulforaphane (SFN), a naturally occurring small-molecule TFEB agonist. SFN induces TFEB nuclear translocation and promotes lysosomal exocytosis. In an NPC mouse model, SFN dephosphorylates/activates TFEB in the brain and rescues the loss of Purkinje cells.

      Strengths:

      NPC is a severe disease and there is little in the way of treatment. The manuscript points towards some treatment options. However, the title, the title "Small-molecule activation of TFEB Alleviates Niemann-Pick Disease..." is far too strong and should be changed.

      Weaknesses:

      (1) The manuscript is extremely hard to read due to the writing; it needs careful editing for grammar and English.

      (2) There are a number of important technical issues that need to be addressed.

      (3) The TFEB influence on filipin staining in Figure 1A is somewhat subtle. In the mCherry alone panels there is a transfected cell with no filipin staining and the mCherry-TFEBS211A cells still show some filipin staining.

      (4) Figure 1C is impressive for the upregulation of filipin with U18666A treatment. However, SFN is used at 15 microM. This must be hitting multiple pathways. Vauzour et al (PMID: 20166144) use SFN at 10 nM to 1microM. Other manuscripts use it in the low microM range. The authors should repeat at least some key experiments using SFN at a range of concentrations from perhaps 100 nM to 5 microM. The use of 15 microM throughout is an overall concern.

    3. Author Response:

      Thank you for your interest in our paper. We would also like to thank the anonymous reviewers for their critical and constructive comments. Although the reviewers found our work interesting, they raised several important concerns about our study. To address these concerns, mostly we will perform new experiments as following.

      1. Examine whether antioxidant-NAC can block SFN-induced TFEB-nuclear translocation in NPC cells;

      2. Examine whether calcineurin inhibitor (FK506+CsA) or Ca 2+ inhibitor (Bapta-AM) can block SFN-induced TFEB-nuclear translocation in NPC cells.

      3. Investigate whether cholesterol was cleared by activation of TFEB by SFN in vivo tissues.

      4. Investigate whether SFN-evoked the lysosomal exocytosis is TFEB-dependent by using TFEB-KO cells.

      5. Examine the effect of NPC1 deficiency on dextran trafficking by studying the localization of CF- dex and Lamp1.

      6. Perform cytotoxicity experiments to examine whether SFN used in this study is cytotoxic in various cell lines

      In addition, according to the reviewers’ suggestions, we will make clarifications and corrections wherever appropriate in the manuscript. Below please find our point-by-point responses and plans to the reviewers’ comments.

      Reviewer #1 (Public review):

      Summary:

      The authors are trying to determine if SFN treatment results in dephosphorylation of TFEB, subsequent activation of autophagy-related genes, exocytosis of lysosomes, and reduction in lysosomal cholesterol levels in models of NPC disease.

      Strengths:

      (1) Clear evidence that SFN results in translocation of TFEB to the nucleus.

      (2) In vivo data demonstrating that SFN can rescue Purkinje neuron number and weight in NPC1-/- animals.

      Thank you for the support!

      Weaknesses:

      (1) Lack of molecular details regarding how SFN results in dephosphorylation of TFEB leading to activation of the aforementioned pathways. Currently, datasets represent correlations.

      Thank you for this constructive comment. The reviewer is right that in this manuscript the molecular mechanism of SFN-activated TFEB has not been discussed in details. Because previously we have shown that SFN induces TFEB nuclear translocation via a Ca 2+ - dependent but MTOR (mechanistic target of rapamycin kinase)-independent mechanism through a moderate increase in reactive oxygen species (ROS). And calcineurin-mediated TFEB dephosphorylation underlies SFN-induced TFEB activation. These data have been published in 2021 autophagy (Li, Shao et al. 2021) . Therefore, in this study we did not mention this part. We will add the molecular mechanism of TFEB activation by SFN in the discussion part. And to further confirm this mechanism in NPC cells, we will also perform experiments including: 1) examine whether antioxidant-NAC can block SFN-induced TFEB-nuclear translocation in NPC cells; 2) examine whether calcineurin inhibitor (FK506+CsA) can block SFN-induced TFEB-nuclear translocation in NPC cells.

      (2) Based on the manuscript narrative, discussion, and data it is unclear exactly how steady-state cholesterol would change in models of NPC disease following SFN treatment. Yes, there is good evidence that lysosomal flux to (and presumably across) the plasma membrane increases with SFN. However, lysosomal biogenesis genes also seem to be increasing. Given that NPC inhibition, NPC1 knockout, or NPC1 disease mutations are constitutively present and the cell models of NPC disease contain lysosomes (even with SFN) how could a simple increase in lysosomal flux decrease cholesterol levels? It would seem important to quantify the number of lysosomes per cell in each condition to begin to disentangle differences in steady state number of lysosomes, number of new lysosomes, and number of lysosomes being exocytosed.

      Thank you for the suggestion. It is important to define the three states 1) original number of lysosomes, 2) number of new lysosomes, and 3) number of lysosomes being exocytosis. However, we have checked literature, so far it seems that there is no good method that could clearly differentiate the three states of lysosomes.

      (3) Lack of evidence supporting the authors' premise that "SFN could be a good therapeutic candidate for neuropathology in NPC disease".

      Suggestion was taken! We will investigate whether cholesterol was reduced by activation of TFEB by SFN in vivo to strength the point that SFN could be a potential therapeutic compound for NPC treatment. And to avoid confusion, we have removed this sentence.

      Reviewer #2 (Public review):

      Summary:

      This study presents a valuable finding that the activation of TFEB by sulforaphane (SFN) could promote lysosomal exocytosis and biogenesis in NPC, suggesting a potential mechanism by SFN for the removal of cholesterol accumulation, which may contribute to the development of new therapeutic approaches for NPC treatment.

      Strengths:

      The cell-based assays are convincing, utilizing appropriate and validated methodologies to support the conclusion that SFN facilitates the removal of lysosomal cholesterol via TFEB activation.

      Weaknesses:

      (1) The in vivo experiments demonstrate the therapeutic potential of SFN for NPC. A clear dose-response analysis would further strengthen the proposed therapeutic mechanism of SFN. Additional data supporting the activation of TFEB by SFN for cholesterol clearance in vivo would strengthen the overall impact of the study

      We understand the reviewer’s point. We examined two doses of SFN-30 and 50mg/kg. As shown in Fig.6, SFN (50mg/kg), but not 30mg/kg prevents a degree of Purkinje cell loss in the lobule IV/V of cerebellum, suggesting a dose-correlated preventive effect of SFN. In vivo experiments with higher concentrations of SFN and optimized dosage form of SFN were planned in the future study, but will not be included in this study.

      We will investigate whether cholesterol was cleared by activation of TFEB by SFN in vivo.

      (2) In Figure 4, the authors demonstrate increased lysosomal exocytosis and biogenesis by SFN in NPC cells. Including a TFEB-KO/KD in this assay would provide additional validation of whether these effects are TFEB-dependent.

      Thank you for this valuable suggestion. We will investigate whether SFN-evoked the lysosomal exocytosis is TFEB-dependent by using TFEB-KO cells.

      (3) For lysosomal pH measurement, the combination of pHrodo-dex and CF-dex enables ratiometric pH measurement. However, the pKa of pHrodo red-dex (according to Invitrogen) is ~6.8, while lysosomal pH is typically around 4.7. This discrepancy may account for the lack of observed lysosomal pH changes between WT and U18666A-treated cells. Notably, previous studies (PMID: 28742019) have reported an increase in lysosomal pH in U18666A-treated cells.

      We understand the reviewer’s point. But we used pHrodo™ Green-Dextran (P35368, Invitrogen), but not pHrodo red-dex to measure the lysosomal luminal acidity. According to the product information from Invitrogen, pHrodo Green-dex conjugates are non-fluorescent at neural pH, but fluorescence bright green at acidic pH ranges 4-9, such as those in endosomes and lysosomes. Therefore, pHrodo Green-dex can be used to monitor the acidity of lysosome (Hu, Li et al. 2022) . We also used LysoTracker Red DND-99 (Thermo Scientific, L7528) to measure lysosomal pH (Fig. 4G, H), which is consistent with results of pHrodo Green/CF measurement. Overall, in our hands, we have not detected pH change of lysosomes in U18666A-treated NPC1 cell models.

      (4) The authors are also encouraged to perform colocalization studies between CF-dex and a lysosomal marker, as some researchers may be concerned that NPC1 deficiency could reduce or block the trafficking of dextran along endocytosis.

      Suggestion was taken! We will examine the effect of NPC1 deficiency on dextran trafficking by studying the localization of CF-dex and Lamp1.

      (5) In vivo data supporting the activation of TFEB by SFN for cholesterol clearance would significantly enhance the impact of the study. For example, measuring whole-animal or brain cholesterol levels would provide stronger evidence of SFN's therapeutic potential.

      We really appreciate the reviewer’s suggestions. We will investigate whether cholesterol was cleared by activation of TFEB by SFN in vivo.

      Reviewer #3 (Public review):

      Summary:

      The authors demonstrate that activation of TFEB facilitates cholesterol clearance in cell models of Niemann-Pick type C (NPC). This is done through a variety of approaches including activation of TFEB by sulforaphane (SFN), a naturally occurring small-molecule TFEB agonist. SFN induces TFEB nuclear translocation and promotes lysosomal exocytosis. In an NPC mouse model, SFN dephosphorylates/activates TFEB in the brain and rescues the loss of Purkinje cells.

      Strengths:

      NPC is a severe disease and there is little in the way of treatment. The manuscript points towards some treatment options. However, the title, the title "Small-molecule activation of TFEB Alleviates Niemann-Pick Disease..." is far too strong and should be changed.

      Weaknesses:

      (1) The manuscript is extremely hard to read due to the writing; it needs careful editing for grammar and English.

      We will thoroughly check grammar to improve the manuscript.

      (2) There are a number of important technical issues that need to be addressed.

      We will address the technical issues mentioned in the following.

      (3) The TFEB influence on filipin staining in Figure 1A is somewhat subtle. In the mCherry alone panels there is a transfected cell with no filipin staining and the mCherry-TFEBS211A cells still show some filipin staining.

      We understand the reviewer’s point. We will investigate whether cholesterol is cleared by activation of TFEB by SFN in vivo.

      (4) Figure 1C is impressive for the upregulation of filipin with U18666A treatment. However, SFN is used at 15 microM. This must be hitting multiple pathways. Vauzour et al (PMID: 20166144) use SFN at 10 nM to 1microM. Other manuscripts use it in the low microM range. The authors should repeat at least some key experiments using SFN at a range of concentrations from perhaps 100 nM to 5 microM. The use of 15 microM throughout is an overall concern.

      We understand the reviewer’s point. See RESPONSE #1, previously we have shown that SFN (10–15 μM, 2–9 h) induces robust TFEB nuclear translocation in a dose- and time-dependent manner in HeLa GFP-TFEB stable cells as well as in other human cell lines without cytotoxicity (Li, Shao et al. 2021) . According to previous results, in this study, we chose SFN (15 μM) to examine its effect on cholesterol clearance. We will add the information in the discussion part. In this study, we will perform dose-response TFEB nuclear translocation in NPC model cells as well as cytotoxicity experiments to examine whether the concentrations of SFN used in various cell lines are toxic.

      References:

      Hu, M. Q., P. Li, C. Wang, X. H. Feng, Q. Geng, W. Chen, M. Marthi, W. L. Zhang, C. L. Gao, W. Reid, J. Swanson, W. L. Du, R. Hume and H. X. Xu (2022). "Parkinson's disease-risk protein TMEM175 is a proton-activated proton channel in lysosomes.” Cell 185(13): 2292-+.

      Li, D., R. Shao, N. Wang, N. Zhou, K. Du, J. Shi, Y. Wang, Z. Zhao, X. Ye, X. Zhang and H. Xu (2021). “Sulforaphane Activates a lysosome-dependent transcriptional program to mitigate oxidative stress.” Autophagy 17(4): 872-887.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02546

      Corresponding author: Woo Jae, Kim

      1. General Statements

      This is the second version of revision.

      After thoroughly reviewing the comments provided by the EMBO Journal reviewers, we found their feedback to be highly constructive and valuable for enhancing our manuscript without the need for additional experiments. For example, Reviewer 1 acknowledged that our "data are intriguing and some of the experiments are quite convincing," but suggested that the manuscript contained excessive data that required simplification. This sentiment was echoed by Reviewer 2. In response, we have completely reformatted our manuscript to eliminate unnecessary imaging quantification data and CrzR-related screening data. The reviewers noted the density of our experimental data, which has led us to focus on the SIFa to Crz-CrzR circuit mechanisms related to heart function and interval timing in future projects.

      Reviewer 2's comments were generally more moderate, and we successfully addressed all five of their points with detailed explanations and modifications to our manuscript. They positively remarked that "Overall, this highly interesting study advances our knowledge about the behavioral roles of SIFamide and contributes to an understanding of how motivated behavior such as mating is orchestrated by modulatory peptides." Additionally, Reviewer 3 accepted our manuscript without any further comments.

      In summary, we believe we have effectively addressed all concerns raised by Reviewers 1 and 2, resulting in a clearer manuscript that is more accessible to a broader audience.

      2. Point-by-point description of the revisions

      Reviewer #1

      General Comments: In this revision of their manuscript, Zhang et al have attempted to address most of the points raised by the reviewers, however, they have not assuaged my most important concerns. The manuscript contains a ton of information, but at times this is to the detriment of the narrative flow. I had a lot of trouble following the rationale of each experiment, and the throughline from one experiment to the next is not always obvious. The data are intriguing, and some of the experiments are quite convincing, but other experiments are either superfluous or have methodological issues. I will summarize the most acute issues below.

      • *Answer: Thank you for your thoughtful feedback and for acknowledging our efforts to address your previous comments. We appreciate your recognition of the intriguing nature of our data and the convincing aspects of our experiments. In this second revision, we have taken your concerns regarding the narrative flow and data overload to heart. We have completely reshaped our manuscript, significantly reducing unnecessary data, including the NP5270 data and overlapping quantification results that did not contribute meaningfully to the storytelling. Our goal was to streamline the presentation of our findings to enhance clarity and coherence, ensuring that each experiment clearly supports the overarching narrative. We believe these revisions will not only improve the readability of our manuscript but also allow readers to follow the rationale behind each experiment more easily. We are confident that this refined approach will make our contributions clearer and more impactful. Thank you once again for your constructive insights, which have been invaluable in guiding us toward a more focused and compelling presentation of our work.

      Comment 1. *The authors argue that genetic controls are unnecessary because they have been conducted in previously published papers. I am concerned with this argument, as it is good practice to repeat controls with each experiment. However, I am overall convinced by the basic phenotype indicating that panneuronal SIFaR knockdown eliminates the changes in mating duration associated with previous experience. As for the more restricted 24F06-GAL4, the phenotype is odd-the flies do actually change their mating duration, just in the opposite direction of controls. Doesn't this imply that these flies are still capable of "interval timing", and of changing their mating strategy following exposure to rivals or following sexual experience? *

      • *

      __ Answer:__ We appreciate the reviewer's critical comments regarding genetic control and the intriguing phenotypes we observed in specific genetic combinations. We fully agree with the reviewer and have repeated all genetic control experiments for this revision, confirming that our genetic controls consistently demonstrate intact LMD and SMD behaviors, as previously reported. These genetic control experiments have been included in Supplementary Information 1-2. We are grateful to the reviewer for the opportunity to reaffirm that LMD and SMD represent stable behavioral phenotypes suitable for genetically studying interval timing, supported by reproducible data.

      • *

      We acknowledge the reviewer's insightful comments about the exciting phenotype observed when SIFaR is knockdown which shows both singly reared and sexually experienced male show lengthened mating duration in contrast to normal LMD and SMD behaviors. Actually, we have observed such phenotype when specific neural circuits are disrupted such as when sNPF peptidergic signaling is disrupted in restricted neuronal population [4]. We are now investigating such phenotype as hypothesis as disinhibition. We explained this phenotype and about disinhibition in main text as below.

      In the spatial, the targeted reduction of SIFaR expression in the GAL424F06 neuronal subset resulted in a notable alteration of mating behavior. Both singly reared and sexually experienced flies exhibited an extended mating duration relative to naïve flies, contrary to the expected reduction. This observation indicates a deficit in the neural mechanism responsible for modulating mating duration, suggesting a disinhibition-like effect within the neural circuitry governing mating behavior. We have also previously observed a similar phenotype when sNPF peptidergic signaling is inhibited in specific neuronal circuits [62]. Disinhibition, characterized by the alleviation of inhibitory constraints, permits the activation of neural circuits that are ordinarily repressed. This process is instrumental in sculpting behavioral patterns and facilitating the sequential progression of behaviors. Through the orchestrated promotion of select neuronal activation and concurrent inhibition of competing neural routes, disinhibition empowers the brain with the ability to dynamically ascertain and preserve the requisite behavioral state, concurrently smoothing the transition to ensuing behavioral phases [63]. It is known that Drosophila neural circuits also exhibit disinhibition phenotypes in light preference and ethanol sensitization [64,65]. Further investigation is needed to uncover the underlying mechanisms of this disinhibition-like phenotype observed in LMD and SMD behaviors.

      This reversed phenotype strongly suggests a disruption in interval timing, as one would expect that if interval timing were normal and intact, male flies would decrease their mating duration in response to appropriate environmental changes. For instance, research has shown that patients with Parkinson's disease exhibit heterogeneity in temporal processing, leading to disrupted interval timing phenotypes [5]. Therefore, if male flies subjected to social isolation or sexual experience do not show a reduction in mating duration compared to control conditions, it indicates a potential disruption in their interval timing mechanisms. We appreciate the reviewer's encouragement to further explore this intriguing disinhibition-like phenotype, and we plan to investigate this aspect in our future projects.

      Comment 2. *I am glad the see the addition of data assessing the extent of SIFaR and CrzR RNAi knockdown; however, this has not completely addressed my concerns about interpretation of behavioral phenotypes. In both cases, the knockdown was assessed by qPCR using the very strong tub-GAL4 driver. mRNA levels are decreased but not nearly eliminated. Thus, when in line 177-178 the authors assert: "Consequently, we infer that the knockdown of SIFaR using the HMS00299 line nearly completely diminishes the levels of the SIFaR protein," the statement is not supported by the data. The qPCR results showed a knockdown at the mRNA level of ~50%. No assays were conducted to measure protein levels. The conclusions should be tempered to align with the data. Furthermore, it is not clear that knockdown is as successful with other drivers, which means that negative behavioral data must be interpreted with caution. For example, the lack of phenotype with repo-GAL4 driving SIFaR RNAi or elav-GAL4 driving CrzR RNAi could be due to a lack of efficient knockdown. This should be acknowledged. *

         __Answer:__ We appreciate the reviewer's critical observation regarding the efficiency of SIFaR knockdown. We fully agree that it is essential to confirm both for ourselves and our readers that the SIFaR knockdown phenotype is robust and convincing. At the outset of this project, we tested all available SIFaR-RNAi strains following established protocols within the fly community to ensure consistency in our findings. When we employed strong drivers such as tub-GAL4 and nSyb-GAL4 for SIFaR-RNAi knockdown, we observed that the flies failed to eclose and exhibited a lethal phenotype during the larval stage, which closely resembles the homozygous lethal phenotype seen in SIFaR mutants. This suggests that, in most cases, the effects of SIFaR knockdown can effectively mimic those of SIFaR mutations. To share our methodology and reinforce our findings, we have added clarifying statements in the main text as follows:
      

      "Employment of broad drivers, including the tub-GAL4 and the strong neuronal driver nSyb-GAL4, with HMS00299 line consistently results in 100% embryonic lethality (data not shown). This phenotype mirrors the homozygous lethality observed in the SIFaRB322 mutant."

      • *

      Due to the significant lethality phenotype observed, we conducted PCR analyses using a combination of tub-GAL80ts and SIFaR-RNAi. As detailed in Fig. 1E, we reared the flies at 22{degree sign}C to suppress RNAi expression and then shifted the temperature to 29{degree sign}C for just three days prior to performing PCR. While our PCR results indicate a 50% reduction in SIFaR levels, we believe that experiments conducted without the tub-GAL80ts system would likely demonstrate an even greater reduction in SIFaR expression. To clarify this point and provide additional context, we have included the following description in the main text:

      "The silencing of SIFaR mRNA was achieved at approximately 50% using the HMS00299 knockdown line in combination with tub-GAL80ts, with RNAi induction lasting for three days (bottom diagram in Fig. 1E). Notably, the same tub-GAL4 driver, when used without the tub-GAL80ts combination, resulted in embryonic lethality while still reducing SIFaR mRNA levels by 50% after three days of RNAi induction. This finding suggests that SIFaR knockdown using the HMS00299 line with GAL4 drivers is likely sufficient to elicit the observed LMD and SMD behaviors. This rationale underscores the effectiveness of our experimental approach and its potential implications for understanding the role of SIFaR in mating behaviors."

      We also concur with the reviewer that the absence of a behavioral phenotype associated with CrzR-RNAi may be due to inefficient RNAi knockdown. Consequently, we have included a description of this issue in the main text as follows:

      • *

      "It is important to consider that the 50% knockdown of SIFaR and CrzR may be sufficient to disrupt LMD and/or SMD behavior. However, the lack of phenotype with repo-GAL4 or elav-GAL4 could be due to a less efficient knockdown. This possibility highlights the need for cautious interpretation of negative behavioral data."

      Comment 3. *Regarding the issue of outcrossing, I am confused by the authors' statement: "To reduce the variation from genetic background, all flies were backcrossed for at least 3 generations to CS strain. For the generation of outcrosses, all GAL4, UAS, and RNAi lines employed as the virgin female stock were backcrossed to the CS genetic background for a minimum of ten generations. Notably, the majority of these lines, which were utilized for LMD assays, have been maintained in a CS backcrossed state for long-term generations subsequent to the initial outcrossing process, exceeding ten backcrosses." It's not clear what this means. Perhaps the authors could definitively state how many times each line was outcrossed. The genetic background is important because of 1) the lack of all controls, and 2) the variability of the behavioral phenotype. Often, the presence or absence of LMD or SMD appears to depend on the behavior of the control flies. When these flies show low mating duration, there is typically not a reduction following sexual experience or group raising. Could these differences derive from genetic background or transgenic insertion effects? *

      Answer: We appreciate the reviewer's concern regarding the potential for confusion stemming from our descriptions of the genetic background. As the reviewer noted, we have published multiple papers on LMD and SMD behaviors, and we have conducted our experiments with careful attention to controlling the genetic background [1-3,6-8]. In response to the reviewer's comments about the importance of genetic control and background, we have completed all necessary genetic control experiments and confirmed that all our flies have been backcrossed for more than ten generations to the Canton-S (CS) strain. We believe that we have adequately addressed the reviewer's concerns regarding potential differences arising from genetic background or transgenic insertion effects. To provide readers with more detailed information about our genetic background, we have added a paragraph in the MATERIALS AND METHODS section as follows:

      "The CS background was selected as the experimental background due to its well-characterized and consistent LMD and SMD behaviors. To ensure that genetic variation did not confound our results, all GAL4, UAS, and RNAi lines employed in our assays were rigorously backcrossed into the CS strain, often exceeding ten generations of backcrossing. This approach was undertaken to isolate the effects of our genetic manipulations from those of genetic background. We assert that the extensive backcrossing to the CS background, in concert with the internal control in LMD and SMD, provides a stable platform for the accurate interpretation of the LMD and SMD phenotypes observed in our experiments."

      Comment 4. *I continue to have substantial concerns about the thresholding method used across many experiments to quantify overlap, and then to claim that this indicates that synaptic connections are being made between different neuronal populations. The degree of overlap will depend on factors including the settings during imaging (was care taken to prevent pixel saturation?). It is also not clear to me from the methods whether analysis was done on single confocal images or on projections. The images shown in the figures look like maximum projections of a confocal stack. Overlap would have to be assessed on individual confocal sections-it is possible that this is what was done for analysis but not clear from the description in the methods. Furthermore, a lot of figure space is dedicated to superfluous information. For example, in Figure 1F-J, there is a massive amount of space dedicated to assessing the agree of overlap between red stinger and CD4GFP, each driven from the same SIFaR2A driver, and further assessing what percentage of the CD4GFP signal overlaps with nc82, with the apparent goal of showing that a lot of the SIFaR signal is at active zones. This information does little to drive the narrative forward, and is quite confusing to read. Finally, the confocal images are generally too small to actually assess. *

         __Answer:__ We appreciate the reviewer's concerns regarding our imaging quantification methods. We recognize the importance of providing a clear and transparent methodology for both readers and the broader scientific community. Instead of using maximum projection of confocal images, we employed a projection method that incorporates the standard deviation function available in ImageJ. Based on our experience, this approach yields more reliable quantification results, allowing for a more accurate assessment of our data. To ensure clarity and reproducibility, we have detailed our methods in the MATERIALS AND METHODS section as follows:
      
      • *

      "The quantification of the overlap was performed using confocal images with projection by standard deviation function provided by ImageJ to ensure precise measurements and avoid pixel saturation artifacts."

      We appreciate the reviewer's suggestion regarding the inclusion of image quantification data for overlapping regions, which may not be essential to the logical flow of our narrative and could lead to confusion for readers. In response, we have removed nearly all of the quantification data related to overlapping regions, retaining only those that we consider critical for the paper. Currently, only Fig. S3B-E remains, as it is important for illustrating how SIFa neuronal arborization interacts with SIFaR neurons in the central nervous system.

      Additionally, we fully agree with the reviewer that the overall size of the confocal images was too small for effective assessment. To address this concern, we have enlarged all confocal images and increased the spacing in the figures. We believe these improvements will enhance the clarity of our manuscript and facilitate a better understanding of our findings.

      • *

      Comment 5. *In general, the figures are still very cluttered, with panels too close together, and the labels are hard to read. *

      Answer: We thank the reviewer for their valuable feedback regarding the clarity of our figures. In response to their concern, we have enlarged the figures to enhance readability and ensure that the panels are more distinct. We believe these adjustments will significantly improve the viewer's ability to interpret the data. We appreciate the reviewer's attention to detail, which has helped us to refine the presentation of our findings.

      Comment 6. *There are no methodological details on how the VFB was used. The authors have not addressed my concern that they are showing only the neuronal skeleton (rather than the actual site of synapses). They are simply identifying all locations where the neuronal skeleton overlaps an entire brain region, and suggesting that these represent synapses. Many papers use the VFB to denote the actual location of synapses, which should be done in Figures 3B and S4A. *

      Answer: We appreciate the reviewer's constructive comments regarding the methodological details of using VFB data. We fully agree that we cannot draw definitive conclusions about SIFa projections to specific regions based solely on neuronal skeleton data, which do not indicate the actual locations of synapses. To address this concern, we have made it clear to readers that the VFB skeleton data serves only as a preliminary indication of potential SIFa projections to GA, FB, and AL.

      To confirm the presence of actual synapses from SIFa neurons, we conducted a thorough analysis using FlyWire data, which validated our findings from VFB. By integrating insights from VFB with the detailed synaptic mapping provided by FlyWire, we can confidently assert the functional relevance of these connections within the context of SIFa neuronal activity. This comprehensive approach not only bolsters our conclusions but also enhances our understanding of how SIFa neurons interact within the broader neural circuitry. We believe this rationale highlights the significance of our work in elucidating the complex relationships among these neuronal populations. We have detailed our findings in the main text as follows:

      "We utilized the "Virtual Fly Brain (VFB)" platform, an interactive tool designed for exploring neuronal connectivity, to gain insights into the connectivity of SIFa neurons with four other neurons, specifically GA, FB, and AL (Fig. 3B and Fig. S4B) [74]. While VFB provides valuable information, it does not offer precise locations of synapses originating from SIFa neurons. To address this limitation, we incorporated data from the FlyWire connectome, which allowed us to confirm that SIFa projections indeed form actual synapses with GA, AL, FB, and SMP (Fig. S3F and S3G) [75]. This multi-faceted approach enhances the robustness of our findings by integrating different data sources to validate neuronal connections."

      • *

      Comment 7. *The changes in GRASP and CaLexA with experience are very interesting, and suggest a substantial rearrangement of synaptic connectivity associated with changes in mating duration following group rearing or female exposure. I am still concerned, however, that the nsyb and tGRASP images look so different. I wouldn't expect them to be identical, but it is puzzling that the nsyb-GRASP data show connections in a few discrete brain areas, while the tGRASP data show connections in a much larger overall brain area, but curiously not in the major regions seen with nsyb-GRASP (ie PI, FB and GA). Shouldn't the tGRASP signal appear in all the places that the nsyb-GRASP does? For CaLexA and GRASP data, the methods should indicate the timing of the dissections and staining relative to the group/sexual experience. *

      Answer: We appreciate the reviewer's constructive comments regarding our GRASP data, which indeed reveal an intriguing neural plasticity phenotype, as the reviewer noted. In our previous response, we suggested that the observed differences may be attributed to the distinct SIFa-GAL4 strains utilized, as described in another manuscript focused on SIFa inputs [9]. In that manuscript, we classified the four SIFa neurons into two groups: SIFaDA (dorsal-lateral) and SIFaVP (ventral-posterior). The SIFa2A-GAL4 specifically labels only the SIFaVP neurons, while the SIFa-PT driver labels all four neurons. We acknowledge that we did not clearly communicate this distinction to the reviewer or our readers, and we apologize for any confusion this may have caused. To rectify this oversight, we have added a detailed explanation of these differences in the main text as follows:

      "The subtle differences in GRASP signals observed in Fig. 3A may stem from the distinct expression patterns of the SIFa2A-lexA and GAL4SIFa.PT drivers. We would like to emphasize that the SIFa2A driver labels only a subset of SIFa neurons in other regions (Kim 2024)."

      We recognize that a clear and transparent methodology is essential for generating reproducible data. In response to the reviewer's suggestion, we have revised our MATERIALS AND METHODS section to include more detailed descriptions of the dissection conditions. This enhancement aims to provide readers with the necessary information to replicate our experiments effectively.

      "To ascertain calcium levels and synaptic intensity from microscopic images, we dissected and imaged five-day-old flies of various social conditions and genotypes under uniform conditions. For group reared (naïve) flies, the flies were reared in group condition and dissect right after 5 days of rearing without any further action. For single reared flies, the flies were reared in single condition and dissect at the same time as group reared flies right after 5 days of rearing without any further action. For sexual experienced flies, the flies were reared in group condition after 4 days of rearing and will be given virgins to give them sexual experience for one day, those flies will also be dissected at the same time as group and single reared flies after one day."

      • *

      Comment 8. *The calcium imaging data are odd. In most cases, the experimental flies don't actually show an increase in calcium levels but rather a lack of a decrease that is present in the ATR- controls. Also, in the cases where they argue for an excitatory affect of SIF neuron stimulation, the baseline signal intensity appears higher in ATR- controls compared to ATR+ experimental flies (eg Fig 5L, 6O), while it is significantly higher in ATR+ flies compared to ATR- controls when the activation results in decreased calcium signals. Perhaps more details on how these experiments were conducted and whether data were normalized in some way would help to clarify this. *

      Answer: Thank you for your valuable feedback. We appreciate your careful analysis of our calcium imaging data and have addressed your concerns below:

      In our experiments, we observed that ATR+ flies maintained relatively stable calcium levels, whereas ATR- controls exhibited a gradual decrease. Under confocal imaging, GFP signals typically decrease over time, which we observed in ATR- controls. However, ATR+ flies did not exhibit this decline. To better convey this observation, we have refined the language in the manuscript. Specifically, we now describe this as a tendency to sustain the activity of Crz neurons in the OL and AG regions (Fig. 6K-M, Fig. S6G-I). This is supported by the sustained intracellular calcium activity in ATR+ flies compared to the gradual decline to baseline levels observed in ATR- controls (Fig. 6K-M).

      Baseline signal intensity differences: You correctly noted that in some cases, the baseline signal intensity appears higher in ATR- controls compared to ATR+ flies. These differences are likely due to technical factors, such as variations in the distance between the imaged brain and the objective lens. Even minor positional shifts in the brain (forward or backward) can affect the observed signal intensity.

      Our analyses focus on relative changes in fluorescence intensity within the same sample, which we present as line graphs to highlight trends rather than absolute values. However, we acknowledge that showing the magnitude of relative values instead of absolute values may have caused some confusion. We have revised the images to better align with our conclusions, ensuring that the adjustments do not affect the observed relative changes.

      Normalization and experimental details: The calcium imaging data were normalized to ΔF/F to account for differences in baseline fluorescence intensity. However, we recognize that further clarification of the normalization process and experimental setup is essential. We have expanded the methods section to include detailed descriptions of data acquisition, normalization steps, and statistical analyses.

      As the reviewer correctly noted, calcium signals in ATR+ flies are generally higher than those in ATR- flies. However, it appears that the calcium levels exhibit a maintained response rather than a dramatic increase compared to the control ATR- condition, particularly in the case shown in Fig. 6K, which illustrates SIFa-to-Crz signaling. We believe this observation may reflect the actual physiological conditions under which SIFa influences SIFaR neurons to sustain activity during activation. We have included our interpretation of these findings in the main text as follows:

      "Upon optogenetic stimulation of SIFa neurons, we observed a tendency to maintain the activity of Crz neurons in OL and AG regions (Fig. 6K-M, Fig. S6H-J), evidenced by a sustained activity in intracellular Ca2+ levels that persisted in a high level compared to control ATR- condition which shows gradual declining to baseline levels (Fig. 6K-M). In contrast to the OL and AG regions, the cells in the upper region of the SIP consistently show a decrease in Ca2+ levels following stimulation of the SIFa neurons (Fig. 6N-P)."

      To enhance readers' understanding of our calcium imaging results, we have reformatted our GCaMP data for improved clarity and included additional details in the MATERIALS AND METHODS section regarding the quantification of GCaMP imaging methods. Furthermore, as the reviewer correctly noted, discrepancies in baseline activity were due to our error in presenting the baseline data. We have now corrected this oversight accordingly.

      • *

      Comment 9. *The models in Fig 4 J and T show data from Song et al, though I could not find a citation for this. I would omit this part of the model since these data are not discussed at all in the manuscript. *

      Answer: We appreciate the reviewer for correctly identifying our oversight in failing to properly cite Song et al.'s paper. This error occurred partly because the preprint was not available at the time we submitted our manuscript. We now have a preprint for Song et al.'s paper, which discusses the contributions of SIFa neurons to various energy balance behaviors, and we plan to submit this paper back-to-back with our current submission to PLOS Biology. We have briefly cited Song et al.'s work in the manuscript; however, we have removed references to it from Fig. 4J and T to avoid any potential confusion for readers.

      Comment 10. *The graphs for the SCOPE data (eg Figure 8I-L) are still too small to make sense of. *

      Answer: We enlarged the tSNE plot generated from the SCOPE data.

      • *

      Comment 11. The rationale behind including the data in Figure 9 is not well explained. I would omit this data to help streamline and focus the manuscript.

      Answer: We fully understand and agree with the reviewer's concerns, and we have removed all previous versions of Figure 9 from the manuscript to prevent any confusion regarding the storyline.

      • *

      Comment 12. *The single control group is still being duplicated in two different graphs but with different names in each graph. The authors updated figure caption hints at this but does not make it explicit. At the very least, these should be given the same name across all graphs, as is done, for example, in the CaLexA experiments in Figure 4B-C. *

      Answer: We concur with the reviewer and have changed the label for all "group" conditions to "naïve" in all figures.

      • *

      Comment 13. *Lines 640-641: Moreover, the pacemaker function is essential for the generation of interval timing capabilities (Meck et al, 2012; Matell, 2014; Buhusi & Meck, 2005), with the heart being recognized as the primary pacemaker organ within the animal body". This is an intriguing idea, however, I attempted to look at the cited references and don't see any claim about the heart being involved in interval timing. I could not find a paper matching the citation of Matell 2014. Meck et al 2012 is an introduction to a Frontiers in Integrative Neuroscience Research Topic and does not mention the heart, nor does the Buhusi and Meck 2005 paper. Perhaps there is a more suitable reference to make the assertion that the fly's interval timer would be affected by changes in heart rate. My suggestion would be to simplify the manuscript, focusing on the most robust findings-the behavioral effect of SIFaR knockdown, the GRASP and CaLexA data showing differences following group rearing or female exposure, and the effect of Crz knockdown in SIFaR neurons. Other details could be included but would have to be verified with more rigorous experiments. *

      __ Answer:__ We appreciate the reviewer's interest in our exploration of the role of heart function in interval timing. While we found that knocking down CrzR in the heart specifically disrupts LMD behavior, we agree that our manuscript needs to be streamlined for clarity. As a result, we have eliminated all CrzR-RNAi knockdown data except for the oenocyte, neuronal and glial knockdown data presented in Fig. S8C-H. This decision was made to ensure a more focused comparison with the SIFaR knockdown experiments shown in Fig. 1. We are dedicated to further investigating the role of Crz-CrzR in heart function and its influence on interval timing in a future project. This approach allows us to maintain clarity in our current manuscript while laying the groundwork for more comprehensive studies ahead.

      In line with the reviewer's suggestions, we have simplified our manuscript by eliminating unnecessary data, such as overlapping image quantification and CrzR-RNAi screening, allowing us to focus on SIFaR knockdown and GRASP, as well as CaLexA with GCaMP imaging. We are grateful to the reviewer for providing us with the opportunity to delineate the role of CrzR in heart function related to LMD as a significant future project. We believe that our manuscript has been greatly improved by the reviewer's constructive feedback.

      • *

      __ __


      Reviewer #2

      General Comments:* The authors investigate mating behavior in male fruit flies, Drosophila melanogaster, and test for a role of the SIFamide receptor (SIFaR) in this type of behavior, in particular mating duration in dependence of social isolation and prior mating experience. The anatomy of SIFamide-releasing neurons in comparison with SIFamide receptor-expressing neurons is characterized in a detail-rich manner. Isolating males or exposing them to mating experience modifies the anatomical organization of SIFamidergic axon termini projecting onto SIFamide receptor-expressing neurons. This structural synaptic plasticity is accompanied by changes in calcium influx. Lastly, it is reported that corazonin-releasing neurons are modulated by SIFamide releasing neurons and impact the duration of mating behavior.

      Overall, this highly interesting study advances our knowledge about the behavioral roles of SIFamide, and contributes to an understanding how motivated behavior such as mating is orchestrated by modulatory peptides. The manuscript has some points that are less convincing.*

      __ Answer:__ We appreciate the reviewer's positive feedback regarding our investigation into the role of the SIFamide receptor (SIFaR) in mating behavior in male Drosophila melanogaster. We are pleased that the detailed characterization of SIFamide-releasing neurons and their anatomical changes in response to social isolation and mating experience has been recognized as a valuable contribution to the understanding of synaptic plasticity and its impact on behavior. We are also grateful that the reviewer described our manuscript as a "highly interesting study" that advances knowledge about the behavioral roles of SIFamide and contributes to the understanding of how motivated behaviors, such as mating, are orchestrated by modulatory peptides. We sincerely thank the reviewer for these encouraging comments about our work.

      We acknowledge the reviewer's concerns about certain aspects of our manuscript that may be less convincing. We are committed to addressing these points thoroughly to strengthen our arguments and enhance the clarity of our findings. In response to the feedback, we have made several revisions throughout the manuscript, including clarifying our methodology, enhancing the presentation of our data, and providing additional context where needed. We believe these changes will improve the overall quality of the manuscript and make our conclusions more compelling. Thank you for your thoughtful review, and we look forward to your further insights.

      Comment 1. *It remains unclear why the authors link the differentially motivated duration of mating behavior with the psychological concept of interval timing. This distracts from the actually interesting neurobiology and is not necessary to make the study interesting. The study deals with the modulation of mating behavior by SIFamide. The abstraction that SIFamide plays a role in the neuronal calculation of time intervals for the perception of time sequenc es is not convincing in itself. *

      • Answer: We appreciate the reviewer's thoughtful comments regarding our conclusion that links SIFamide to interval timing in mating behavior. We recognize that our data primarily indicate that SIFamide is essential for normal mating duration and influences the motivation-dependent aspects of this behavior. We also acknowledge the need for more robust evidence to establish a clearer connection between these findings and interval timing. Recent research by Crickmore et al. has provided valuable insights into how mating duration in Drosophila *serves as an effective model for examining changes in motivation over time as behavioral goals are achieved. For example, around six minutes into mating, sperm transfer occurs, resulting in a significant shift in the male's nervous system, where he no longer prioritizes continuing the mating at the expense of his own survival. This pivotal change is mediated by four male-specific neurons that release the neuropeptide Corazonin (Crz). When these Crz neurons are inhibited, sperm transfer does not take place, and as a result, the male fails to reduce his motivation, leading to matings that can extend for hours instead of the typical duration of approximately 23 minutes [10].

      Recent research conducted by Crickmore et al. has secured NIH R01 funding (Mechanisms of Interval Timing, 1R01GM134222-01) to investigate mating duration and sperm transfer timing in Drosophila as a genetic model for understanding interval timing. Their study emphasizes how fluctuations in motivation over time can affect mating behavior, particularly noting that significant behavioral changes occur during mating. For instance, around six minutes into the mating process, sperm transfer takes place, which corresponds with a notable decrease in the male's motivation to continue mating [10]. These findings indicate that mating duration serves not only as an endpoint for behavior but may also reflect fundamental mechanisms associated with interval timing.

         We believe that by leveraging the robustness and experimental tractability of these findings, along with our own work on SIFamide's role in mating behavior, we can gain deeper insights into the molecular and circuit mechanisms underlying interval timing. We will revise our manuscript to clarify this relationship and emphasize how SIFamide may interact with other neuropeptides and neuronal circuits involved in motivation and timing.
      
         In addition to the efforts of Crickmore's group to connect mating duration with a straightforward genetic model for interval timing, we have previously published several papers demonstrating that LMD and SMD can serve as effective genetic models for interval timing within the fly research community. For instance, we have successfully connected SMD to an interval timing model in a recently published paper [3], as detailed below:
      

      "We hypothesize that SMD can serve as a straightforward genetic model system through which we can investigate "interval timing," the capacity of animals to distinguish between periods ranging from minutes to hours in duration.....

      In summary, we report a novel sensory pathway that controls mating investment related to sexual experiences in Drosophila. Since both LMD and SMD behaviors are involved in controlling male investment by varying the interval of mating, these two behavioral paradigms will provide a new avenue to study how the brain computes the 'interval timing' that allows an animal to subjectively experience the passage of physical time [11-16]."

         Lee, S. G., Sun, D., Miao, H., Wu, Z., Kang, C., Saad, B., ... & Kim, W. J. (2023). Taste and pheromonal inputs govern the regulation of time investment for mating by sexual experience in male Drosophila melanogaster. *PLoS Genetics*, *19*(5), e1010753.
      
         We have also successfully linked LMD behavior to an interval timing model and have published several papers on this topic recently [6-8].
      
         Sun, Y., Zhang, X., Wu, Z., Li, W., & Kim, W. J. (2024). Genetic Screening Reveals Cone Cell-Specific Factors as Common Genetic Targets Modulating Rival-Induced Prolonged Mating in male Drosophila melanogaster. *G3: Genes, Genomes, Genetics*, jkae255.
      
         Zhang, T., Zhang, X., Sun, D., & Kim, W. J. (2024). Exploring the Asymmetric Body's Influence on Interval Timing Behaviors of Drosophila melanogaster. *Behavior Genetics*, *54*(5), 416-425.
      
         Huang, Y., Kwan, A., & Kim, W. J. (2024). Y chromosome genes interplay with interval timing in regulating mating duration of male Drosophila melanogaster. *Gene Reports*, *36*, 101999.
      
         Finally, in this context, we have outlined in our INTRODUCTION section below how our LMD and SMD models are related to interval timing, aiming to persuade readers of their relevance. We hope that the reviewer and readers are convinced that mating duration and its associated motivational changes such as LMD and SMD provide a compelling model for studying the genetic basis of interval timing in *Drosophila*.
      

      "The dimension of time is the fundamental basis for an animal's survival. Being able to estimate and control the time between events is crucial for all everyday activities [25]. The perception of time in the seconds-to-hours range, referred to as 'interval timing', is involved in foraging, decision making, and learning via activation of cortico-striatal circuits in mammals [26]. Interval timing requires entirely different neural mechanisms from millisecond or circadian timing [27-29]. There is abundant psychological research on time perception because it is a universal cognitive dimension of experience and behavioral plasticity. Despite decades of research, the genetic and neural substrates of temporal information processing have not been well established except for the molecular bases of circadian timing [30,31]. Thus, a simple genetic model system to study interval timing is required. Considering that the mating duration in fruit flies, which averages approximately 20 minutes, is well within the range addressed by interval timing mechanisms, this behavioral parameter provides a relevant context for examining the neural circuits that modulate the Drosophila's perception of time intervals. Such an investigation necessitates an understanding of the extensive neural and behavioral plasticity underlying interval timing [32-37]."

      We would like to highlight that many researchers are currently working to bridge the gap between interval timing as a purely psychological concept and its neurobiological underpinnings, as illustrated in the following articles [15,17-20]. We appreciate the reviewer's concerns regarding the relationship between mating duration and interval timing. However, we believe that our LMD and SMD model can effectively bridge the gap between psychological concepts and neurobiological mechanisms using a straightforward genetic model organism. By employing Drosophila as our model, we aim to elucidate the underlying neural circuits that govern these behaviors, thereby contributing to a deeper understanding of how interval timing is represented in both psychological and biological contexts.

      Matell, M. S. Neurobiology of Interval Timing. Adv. Exp. Med. Biol. 209-234 (2014) doi:10.1007/978-1-4939-1782-2_12.

      Matell, M. S. & Meck, W. H. Cortico-striatal circuits and interval timing: coincidence detection of oscillatory processes. Cogn. Brain Res. 21, 139-170 (2004).

      Merchant, H. & Lafuente, V. de. Introduction to the neurobiology of interval timing. Adv Exp Med Biol 829, 1-13 (2014).

      Golombek, D. A., Bussi, I. L. & Agostino, P. V. Minutes, days and years: molecular interactions among different scales of biological timing. Philosophical Transactions Royal Soc B Biological Sci 369, 20120465 (2014).

      Balcı, F. & Toda, K. Editorial: Psychological and neurobiological mechanisms of time perception and temporal information processing: insight from novel technical approaches. Front. Behav. Neurosci. 17, 1208794 (2023).

      Comment 2. *For all behavioral experiments, genetic controls should always be conducted. That is, both the heterozygous Gal4-line as well as the heterozygous UAS-line should be used as controls. This is laborious, but important and common standard. The authors often report data only for offspring from genetc crosses in which UAS-lines and Gal4-lines are combined (e.g. figure S1). This is not sufficient. *

      • *Answer: We are grateful for the reviewer's constructive suggestions regarding the genetic control experiments. In response to similar concerns raised by another reviewer, we have conducted all necessary genetic control experiments and included the results in Supplementary Information 1-2. We hope that this thorough effort will demonstrate to both the reviewer and readers that the LMD and SMD behaviors represent stable and reproducible phenotypes for investigating the genetic components of interval timing.

      Comment 3. *There are quite a lot of citations of preprints, including preprints from the authors's own lab. It seems inappropriate to cite non-peer reviewed preprints in order to present the basic principles of the study (interval timing in flies) as recognized knowledge. In general, it is unclear whether the information presented in these multiple preprints will turn out to be credible and acceptable. *

      • *Answer: We concur with the reviewer and have removed most of the preprint material, retaining only one preprint that discusses SIFa function, which has been co-submitted with this manuscript.

      Comment 4. *Anatomical images are often very small and not informative. For example, figure S1 O, R, S and U shows small images of fly brains and ventral nerve chords that do not convincingly describe the expression of fluorescent proteins. The choice of a threshold to quantify fluorescence seems arbitrary. It is also not clear what the quantification "83% of brain and 71% of VNC SIFaR+ neurons" actually tells us. This quantification does not rely on counting neurons (such as 83% of neurons), but only shows how fluorescence in these neurons overlaps with an immunostaining of an ubiquitous active zone protein. The same is true for figure S2 or S3: overlapping brain areas do not inform you about numbers of cells, as stated in the text. *

      Answer: We appreciate the reviewer's concerns regarding our imaging quantification methods. In response to similar questions raised by another reviewer, we have thoroughly reformatted our methods section and eliminated much of the overlapping data that appeared unnecessary for this paper. We recognize the importance of providing a clear and transparent methodology for both readers and the broader scientific community. Instead of using maximum projection of confocal images, we employed a projection method that incorporates the standard deviation function available in ImageJ. Based on our experience, this approach yields more reliable quantification results, allowing for a more accurate assessment of our data. To ensure clarity and reproducibility, we have detailed our methods in the MATERIALS AND METHODS section as follows:

      • *

      "The quantification of the overlap was performed using confocal images with projection by standard deviation function provided by ImageJ to ensure precise measurements and avoid pixel saturation artifacts."

      We appreciate the reviewer's suggestion regarding the inclusion of image quantification data for overlapping regions, which may not be essential to the logical flow of our narrative and could lead to confusion for readers. In response, we have removed nearly all of the quantification data related to overlapping regions, retaining only those that we consider critical for the paper. Currently, only Fig. S3B-E remains, as it is important for illustrating how SIFa neuronal arborization interacts with SIFaR neurons in the central nervous system.

      Additionally, we fully agree with the reviewer that the overall size of the confocal images was too small for effective assessment. To address this concern, we have enlarged all confocal images and increased the spacing in the figures. We believe these improvements will enhance the clarity of our manuscript and facilitate a better understanding of our findings.

      Comment 5. *The authors have consistently confused the extensive overlap of neuronal processes (dendrites and presynaptic regions) across large brain areas with synaptic connections. One cannot infer functional synaptic connectivity from the overlap of these fluorescent signals. *

      Answer: We appreciate the reviewer's feedback and, in light of similar comments from another reviewer, we have removed most of the DenMark and syt.eGFP data, retaining only Fig. 3A. We are grateful for the constructive suggestions, which have significantly enhanced our manuscript. We believe that these revisions have clarified the narrative for readers, allowing for a more focused exploration of SIFaR's role in synaptic plasticity and neuronal orchestration.

      Reviewer #3

      General Comments: In this revised manuscript, the authors have fully and satisfactorily addressed my comments on the previous version. I recommend publication of this manuscript.

      __ Answer:__ We would like to extend our heartfelt thanks for the careful consideration and positive assessment of our revised manuscript. Your insightful feedback has been instrumental in shaping the final version of our work, and we are delighted to hear that our revisions have met your expectations.

      Your dedication to ensuring the quality and rigor of the scientific literature is truly commendable, and we are immensely grateful for the time and effort you have devoted to reviewing our paper. Your support for publication is a significant encouragement to us and validates the hard work we have put into addressing the issues you raised.

      Please accept our sincere appreciation for your professional and constructive approach throughout the review process. We look forward to the possibility of contributing to the scientific community through the dissemination of our research.

      REFERENCES

      1. Kim WJ, Jan LY, Jan YN. Contribution of visual and circadian neural circuits to memory for prolonged mating induced by rivals. Nat Neurosci. 2012;15: 876-883. doi:10.1038/nn.3104
      2. Kim WJ, Jan LY, Jan YN. A PDF/NPF Neuropeptide Signaling Circuitry of Male Drosophila melanogaster Controls Rival-Induced Prolonged Mating. Neuron. 2013;80: 1190-1205. doi:10.1016/j.neuron.2013.09.034
      3. Lee SG, Sun D, Miao H, Wu Z, Kang C, Saad B, et al. Taste and pheromonal inputs govern the regulation of time investment for mating by sexual experience in male Drosophila melanogaster. PLOS Genet. 2023;19: e1010753. doi:10.1371/journal.pgen.1010753
      4. Zhang X, Miao H, Kang D, Sun D, Kim WJ. Male-specific sNPF peptidergic circuits control energy balance for mating duration through neuron-glia interactions. bioRxiv. 2024; 2024.10.17.618859. doi:10.1101/2024.10.17.618859
      5. Merchant H, Luciana M, Hooper C, Majestic S, Tuite P. Interval timing and Parkinson's disease: heterogeneity in temporal performance. Exp Brain Res. 2008;184: 233-248. doi:10.1007/s00221-007-1097-7
      6. Sun Y, Zhang X, Wu Z, Li W, Kim WJ. Genetic Screening Reveals Cone Cell-Specific Factors as Common Genetic Targets Modulating Rival-Induced Prolonged Mating in male Drosophila melanogaster. G3: Genes, Genomes, Genet. 2024; jkae255. doi:10.1093/g3journal/jkae255
      7. Zhang T, Zhang X, Sun D, Kim WJ. Exploring the Asymmetric Body's Influence on Interval Timing Behaviors of Drosophila melanogaster. Behav Genet. 2024; 1-10. doi:10.1007/s10519-024-10193-y
      8. Huang Y, Kwan A, Kim WJ. Y chromosome genes interplay with interval timing in regulating mating duration of male Drosophila melanogaster. Gene Rep. 2024; 101999. doi:10.1016/j.genrep.2024.101999
      9. Kim WJ, Song Y, Zhang T, Zhang X, Ryu TH, Wong KC, et al. Peptidergic neurons with extensive branching orchestrate the internal states and energy balance of male Drosophila melanogaster. bioRxiv. 2024; 2024.06.04.597277. doi:10.1101/2024.06.04.597277
      10. Thornquist SC, Langer K, Zhang SX, Rogulja D, Crickmore MA. CaMKII Measures the Passage of Time to Coordinate Behavior and Motivational State. Neuron. 2020;105: 334-345.e9. doi:10.1016/j.neuron.2019.10.018
      11. Buhusi CV, Meck WH. What makes us tick? Functional and neural mechanisms of interval timing. Nat Rev Neurosci. 2005;6: 755-765. doi:10.1038/nrn1764
      12. Merchant H, Harrington DL, Meck WH. Neural Basis of the Perception and Estimation of Time. Annu Rev Neurosci. 2012;36: 313-336. doi:10.1146/annurev-neuro-062012-170349
      13. Allman MJ, Teki S, Griffiths TD, Meck WH. Properties of the Internal Clock: First- and Second-Order Principles of Subjective Time. Annu Rev Psychol. 2013;65: 743-771. doi:10.1146/annurev-psych-010213-115117
      14. Rammsayer TH, Troche SJ. Neurobiology of Interval Timing. Adv Exp Med Biol. 2014; 33-47. doi:10.1007/978-1-4939-1782-2_3
      15. Golombek DA, Bussi IL, Agostino PV. Minutes, days and years: molecular interactions among different scales of biological timing. Philosophical Transactions Royal Soc B Biological Sci. 2014;369: 20120465. doi:10.1098/rstb.2012.0465
      16. Jazayeri M, Shadlen MN. A Neural Mechanism for Sensing and Reproducing a Time Interval. Curr Biol. 2015;25: 2599-2609. doi:10.1016/j.cub.2015.08.038
      17. Balcı F, Toda K. Editorial: Psychological and neurobiological mechanisms of time perception and temporal information processing: insight from novel technical approaches. Front Behav Neurosci. 2023;17: 1208794. doi:10.3389/fnbeh.2023.1208794
      18. Gür E, Duyan YA, Arkan S, Karson A, Balcı F. Interval timing deficits and their neurobiological correlates in aging mice. Neurobiol Aging. 2020;90: 33-42. doi:10.1016/j.neurobiolaging.2020.02.021
      19. Merchant H, Lafuente V de. Introduction to the neurobiology of interval timing. Adv Exp Med Biol. 2014;829: 1-13. doi:10.1007/978-1-4939-1782-2_1
      20. Matell MS. Neurobiology of Interval Timing. Adv Exp Med Biol. 2014; 209-234. doi:10.1007/978-1-4939-1782-2_12
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Zhang et al., "Long-range neuropeptide relay as a central-peripheral communication mechanism for the context-dependent modulation of interval timing behaviors".

      The authors investigate mating behavior in male fruit flies, Drosophila melanogaster, and test for a role of the SIFamide receptor (SIFaR) in this type of behavior, in particular mating duration in dependence of social isolation and prior mating experience. The anatomy of SIFamide-releasing neurons in comparison with SIFamide receptor-expressing neurons is characterized in a detail-rich manner. Isolating males or exposing them to mating experience modifies the anatomical organization of SIFamidergic axon termini projecting onto SIFamide receptor-expressing neurons. This structural synaptic plasticity is accompanied by changes in calcium influx. Lastly, it is shown that corazonin-releasing neurons are modulated by SIFamide releasing neurons and impact the duration of mating behavior.

      Overall, this highly interesting study advances our knowledge about the behavioral roles of SIFamide, and contributes to an understanding how motivated behavior such as mating is orchestrated by modulatory peptides. The approach to take the entire organism, including peripheral tissue, into consideration, is very good and a rather unique point. The manuscript has only some points that are less convincing, and these should be addressed.

      Major concerns:

      • It is highly interesting that the duration of mating behavior is dependent on external and motivational factors. In fact, that provides an elegant way to study which neuronal mechanisms orchestrate these factors. However, it remains elusive why the authors link the differentially motivated durations of mating behavior to the psychological concept of interval timing. This distracts from the actually interesting neurobiology, and is not necessary to make the study interesting.
      • In figure 4 A and 4K, fluorescence microscopy images of brains and ventral nerve chords are shown, one illustrating GRASP experiments, and one showing CaLexA experiments. The extreme difference between the differentially treated flies (bright fluorescence versus almost no fluorescence) is - in its drastic form- surprising. Online access to the original confocal microscopy images (raw data) might help to convince the reader that these illustrations do not reflect the most drastic "representative" examples out of a series of brain stainings.
      • In particular for behavioral experiments, genetic controls should always be conducted. That is, both the heterozygous Gal4-line as well as the heterozygous UAS-line should be used as controls. This is laborious, but important.

      Minor comments:

      • Line 75: word missing ("...including FEEDING-RELATED BEHAVIOR, courtship, ...").
      • Line 120: word missing ("SIFaR expression in adult neurons BUT not glia...").
      • I find the figures often to be quite overloaded, and anatomical details often very small (e.g., figure 7A).

      Significance

      Overall, this highly interesting study advances our knowledge about the behavioral roles of SIFamide, and contributes to an understanding how motivated behavior is orchestrated by modulatory peptides. The approach to take the entire organism, including peripheral tissue, into consideration, is very good and a rather unique point.

      Since decades it has been investigated how sensory stimuli are processed and encoded by the brain, and how behavioral actions are executed. Likewise, principles underlying learning and memory, sleep, orentation, circadian rhythms, etc. are subject to intense investigation. However, how motivational factors (sleep pressure, hunger, sexual drive) are actually "encoded", signaled and finally used to orchstrate behavior and guide decision-making is, to a very large degree, unknown - in any species. The model use here (Drosophila and its peptidergic system wit SIFamide as a central hub) represents actually a ideal entry point to study just this question. In this sense, the manuscript is at the forefront of modern, state-of-the-art neurobiology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript from Zhang et al. primarily investigates the contribution of the SIFa neuropeptide receptor (SIFaR) to mating duration in male fruit flies. Through RNAi-mediated downregulation, they show that SIFaR receptor is necessary for previous experience to alter mating duration. Using cell-specific knockdown and rescue of the SIFaR receptor, they identify a population of ~400 neurons that could underlie this effect. This is still a large number of cells but is narrowed from the ~1,200 total SIFaR-expressing neurons. They then use the GRASP synaptic labeling technique to show that SIFa+ neurons form synapses onto the relevant SIFaR-expressing population, and that the area of synaptic contact is systematically altered depending on the fly's past mating history. Finally, they provide evidence to argue that SIFa neurons act through SIFaR neurons that release the neuropeptide corazonin to regulate mating duration. Overall, the authors have used an impressive array of techniques in their attempt to define the neural circuits and molecules involved in changing internal state to modify the duration of mating.

      Major Comments:

      1. The authors are to be commended for the sheer quantity of data they have generated, but I was often overwhelmed by the figures, which try to pack too much into the space provided. As a result, it is often unclear what components belong to each panel. Providing more space between each panel would really help.
      2. The use of three independent RNAi lines to knock down SIFaR expression is experimentally solid, as the common phenotype observed with all 3 lines supports the conclusion that the SIFaR is important for mating duration choice. However, the authors have not tested whether these lines effectively reduce SIFaR expression, nor whether the GAL80 constructs used to delimit knockdown are able to effectively do so. This makes it hard to make definitive conclusions with these manipulations, especially in the face of negative results. A lack of complete knockdown is suggested by the fact that the F24F06 driver rescues lethality when used to express SIFaR in the B322 mutant background, but does not itself produce lethality when used to express SIFaR RNAi. The authors should either conduct experiments to determine knockdown efficiency or explicitly acknowledge this limitation in drawing conclusions from their experiments. A similar concern relates to the CrzR knockdown experiments (eg Figure 7).
      3. Most of the behavioral experiments lack traditional controls, for example flies that contain either the GAL4 or UAS elements alone. The authors should explain their decision to omit these control experiments and provide an argument for why they are not necessary to correctly interpret the data. In this vein, the authors have stated in the methods that stocks were outcrossed at least 3x to Canton-S background, but 3 outcrosses is insufficient to fully control for genetic background.
      4. Throughout the manuscript, the authors appear to use a single control condition (sexually naïve flies raised in groups) to compare to both males raised singly and males with previous sexual experience. These control conditions are duplicated in two separate graphs, one for long mating duration and one for short mating duration, but they are given different names (group vs naïve) depending on the graph. If these are actually the same flies, then this should be made clear, and they should be given a consistent name across the different "experiments".
      5. The authors have consistently conflated overlap of neuronal processes with synaptic connections. Claims of synaptic connectivity deriving solely from overlap of processes should be tempered and qualified.
        • For example, they say (Lines 201-202) "These findings suggest that SIFa neurons and GAL424F06-positive neurons form more synapses in the VNC than in the brain." This is misleading. Overlap of24F06-LexA>CD8GFP and SIFa-GAL4>CD8RFP tells us nothing about synapse number, or even whether actual synapses are being formed.
        • Lines 210-211: "The overlap of DenMark and syt.EGFP signals was highly enriched in both SOG and ProNm regions, indicating that these regions are where GAL424F06 neurons form interconnected networks". This is misleading. Overlap of DenMark and syt.EGFP does not indicate synapses (especially since these molecules can be expressed outside the expected neuronal compartment if driven at high enough levels).
        • Lines 320-322: "Neurons expressing Crz exhibit robust synaptic connections with SIFaR24F06 neurons located in the PRW region of the SOG in the brain (panels of Brain and SOG in Fig. 5A)". This is again misleading. They are not actually measuring synapses here, but instead looking at area of overlap between neuronal processes of Crz and SIFaR cells.
        • In Figs 3B and S4A, they are claiming that all neuronal processes within a given delineated brain area are synapses. The virtual fly brain and hemibrain resource have a way to actually identify synapses. This should be used in addition to the neuron skeleton. Otherwise, it is misleading to label these as synapses.
        • Furthermore, measuring the area of GRASP signal is not the same as quantifying synapses. We don't know if synapse number changes (eg in lines 240-242).
      6. In general, the first part of the manuscript (implicating SIFaR in mating duration) is much stronger than the second part, which attempts to demonstrate that SIFa acts through Crz-expressing neurons to induce its effects. The proof that SIFa acts through Crz-expressing neurons to modify mating duration is tenuous. The most direct evidence of this, achieved via knockdown on Crz in SIFaR-expressing cells, is relegated to supplemental figures. The calcium response of the Crz neurons to SIFa neuron activation (Fig. 6) is more of a lack of a decrease that is observed in controls. Also, this is only done in the VNC. Why not look in the brain, because the authors previously stated a hypothesis that the "transmission of signals through SIFaR in Crz-expressing neurons is limited to the brain" (lines 381-382)?

      Furthermore, the authors suggest that Crz acts on cells in the heart to regulate mating duration. It would be useful to add a discussion/speculation as to possible mechanisms for heart cells to regulate mating decisions. Is there evidence of CrzR in the heart? The SCope data presented in Fig. 7I-L and S7G-H is hard to read. 7. In several cases, the effects of being raised single are opposite the effects of sexual experience. For example, in Fig. 4T, calcium activity is increased in the AG following sexual experience, but decreased in flies raised singly. Likewise, Crz-neurons in the OL have increased CaLexA signal in singly-raised flies but reduced signals in flies with previous sexual experience. In some cases, manipulations selectively affect LMD or SMD. It would be useful to discuss these differences and consider the mechanistic implications of these differential changes, when they all result in decreased mating duration. This could help to clarify the big picture of the manuscript.

      Minor Comments:

      1. For CaLexA experiments (eg Fig 7A-D), signal intensity should be quantified in addition to area covered. Increased intensity would indicate greater calcium activity within a particular set of neurons.
      2. In Figure 5K: quantification of cell overlap is missing. In the text they state that there are ~100 neurons that co-express SIFaR24F06 and Crz. How was this determined? Is there a graph or numerical summary of this assertion?
      3. In lines 709-711: "Our experience suggests that the relative mating duration differences between naïve and experienced condition and singly reared are always consistent; however, both absolute values and the magnitude of the difference in each strain can vary. So, we always include internal controls for each treatment as suggested by previous studies." I had trouble understanding this section of methods. What is done with the data from the internal controls?
      4. Could the authors comment on why the brain GRASP signal is so different in Figures 3A and 4A? I realize that different versions of GRASP were used in these experiments, but I would expect broad agreement between the different approaches.

      Significance

      This study will be most relevant to researchers interested in understanding neuronal control of behavior. The manuscript offers a conceptual advance in identifying cell types and molecules that influence mating duration decisions. The strength of the manuscript is the number of different assays used; however, there is a sense that this has occurred at the cost of providing a cohesive narrative. The first part of the manuscript (detailing the role of SIFaR in LMD and SMD) is relatively stronger and more conclusive.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We want to thank both reviewers for their thorough and constructive review of our manuscript. Below, we have re-iterated their comments followed by an explanation of how we have revised the manuscript to address this.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This manuscript presented by Segeren et al. applied an interesting HRASG12V inducible cell model to study the mechanism of cellular resistance to replication stress inducing agents. They also employed a novel reversible fixation technique which allows them to FAC sort cells according to their replication stress levels before applying single cell sequencing analysis to the same cell populations. By comparing cells with low levels of replication stress to cells with high levels of replication stress, they found that reduction in gene expression of FOXM1 target genes potentially protects cells against replication stress induced by CHK1i plus gemcitabine combination. Overall, this is a very interesting study. However, the following points should be addressed prior to publication:

      Major: 1. Figure 3E and 3F showed two lists of differentially expressed genes in γH2Ax low cells. However, instead of arbitrarily extracting the FOXM1 target genes and TP53 targeted genes, it would be appreciated if the author could perform an unbiased and unsupervised gene set enrichment analysis such as Enrichr.

      As recommended, we performed an enrichment analysis using Enrichr to identify transcriptional programs associated with the we used the genes that were downregulated in the γH2AX-low cells. FOXM1 appeared as a prominent hit in different databases (both experimental and computational). We have included the lists of differentially expressed genes as an additional supplemental table (Table S1) and have included the Enrichr results as Table S3 (i.e. CHEA and ENCODE). We have described our results in lines 198-200 of the revised manuscript.

      1. At the experiment design stage, the authors also included HRASG12V status as a test condition because they previously found that HRASG12V mutation induces basal level replication stress and they would like to include this condition to study the adaptation to replication stress (line 110). However, the difference in HRASG12V negative and HRASG12V positive cells was not followed up in the later part of the paper. Can they show lists of differentially expressed genes identified under HRASG12V negative conditions as well (in the same format of Figure 3E and 3F) and comment on the differences as well?

      In the original manuscript, we included heatmaps of differentially expressed genes in the control cells in Figure S2. For improved clarity, we have modified this figure so that the heatmaps are labeled "Control cells". In the revised manuscript, we have also included Table S2, which lists the differentially expressed genes between yH2AX low and yH2AX high control cells, and Table S3, which lists the Enrichr results obtained based on these gene lists.

      We observed FOXM1 target genes in both the control and HRASG12V cells. Thus, the mechanism we identify does not appear to be specific to oncogenic Ras expression. We discuss this in lines 221-225. Because there were no other notable differences between the gene sets, we do not focus on this in the manuscript.

      1. In line 194 and in Figure S2B, the authors claimed that ANLN, HMGB2, CENPE, MKI67, and UBE2C demonstrated co-expression, but other genes displaying similar correlation scores were not commented (such as F3, CYR61, CTGF, etc). To avoid being biased at the analysis stage, the authors should define clearly what the cut-off of correlation score is and why only co-expression of ANLN, HMGB2, CENPE, MKI67, and UBE2C were mentioned.

      As suggested, we explain now in the revised manuscript that we focused on gene clusters consisting of at least 3 genes, that had a correlation coefficient greater than or equal to 0.4 with at least one other gene within the clusters. This cutoff is typically defined as representing a "moderate to good" correlation in biological data (Overholser, Sowinski, 2008). To make clear which clusters correlating gene sets passed these criteria, we have also highlighted these genes in Figure S3B. This returned the cluster we had already identified as FOXM1 targets, and as well spotted by the reviewer, a larger cluster which included F3, CYR61, CTGF, SERPINE1, ANKRD1, KRTAP2-3, UGCG, and AMOTL. Our Enrichr analysis did not identify any putative transcription factors linking the genes in this larger cluster. We are still interested to identify the putative transcription regulation mechanism linking these genes in future studies, but this is beyond the scope of the current manuscript. We have described these observations in lines 211-218.

      1. In line 215, instead of validating CENPE, UBE2C, HMGB2, ANLN, and MKI67 individually, the authors decided to validate FOXM1 instead, because they believe all the aforementioned genes are targets of FOXM1, therefore, validating FOXM1 alone would suffice. Again, this makes the validation process also biased. CENPE, UBE2C, HMGB2, ANLN, and MKI67 should be validated individually because they might sensitize cells to replication stress via different mechanisms. Besides, if all these genes were identified together because they are FOXM1 target genes, why did the authors not identify FOXM1 itself as a differentially expressed gene from the single cell sequencing? The sequencing only analyzed the S/G2/M cells, expression of FOXM1 should be detected easily.

      We agree with the reviewer that the omission of individual FOXM1 target genes in the validation process makes a biased impression. Therefore we ordered siRNAs against CENPE, UBE2C, HMGB2, ANLN, and MKI67. Similar to the other DE genes in the original mini-screen we first knocked down these genes using the siRNA Smartpools (pools of 4 individual siRNAs against each genes). Here, we observed a decrease in γH2AX signal compared to drug-treated cells transfected with all 5 Smartpools compared to drug-treated cells transfected with control siRNA. We next moved on to the deconvolution step of the screen, where we transfected cells with 4 individual siRNA against each gene. Here, we observed inconsistent effects of ANLN, CENPE, and HMGB2 when comparing the individual siRNAs, which all produce efficient knockdown of their target genes. But interestingly, for both MKI67 and UBE2C, each of the 4 individual siRNAs similar decreased yH2AX signal, though it was not as strong as the decrease observed when FOXM1 is knocked out. Understanding the exact mechanism of how MKI67 and UBE2C reduce replication stress is beyond the scope of this paper, but we hypothesize that, as with FOXM1, it is likely linked to their role in promoting progression through the cell cycle. These results are shown in Figures S5, and we mention these remarkable findings in the revised abstract and discuss these in the light of the recent literature in the Discussion section (lines 275-286).

      Then, we also addressed the comment about FOXM1 not being changed in the single cell RNA-seq analysis. We could indeed readily detect FOXM1 expression our single-cell RNA sequencing data. The difference in expression did not change significantly in cells sorted according to γH2AX level (Figure 4C). Because FOXM1 is highly regulated post-translationally, we hypothesized that an increase in the (active) protein is correlated to increased replication stress rather than transcript levels. This was indeed the case and we further explain our experiment to test this hypothesis in response to Point #6 (results are displayed in Figure 4D and described in lines 201-209).

      1. As pointed out by the author in the Discussion, single cell sequencing is not good at differentiating the causes from the consequences. The author tried to validate many of the differentially expressed genes in γH2Ax low cells. However, the fact that only FOXM1 knockdown passed the validation and deconvolution pointed out that the great majority of the identified genes are not the cause of the sensitivity change to replication stress inducing agents but likely the consequences. Therefore, in Figure S2C and S2D, it would be better that the authors could just name the genes as 'downregulated genes' in Figure S2C and 'upregulated genes' in Figure S2D. Taking into consideration that the expression change in the great majority of these genes are just consequences of sensitivity change to replication stress, defining them as 'potentially sensitizing' genes and 'potentially conferring resistance' genes is rather misleading.

      We agree that the way we originally labeled these plots may have been misleading. We have renamed then to "Downregulated in yH2AXlow" and "Upregulated in yH2AXlow", as recommended by the reviewer.

      1. To better prove that FOXM1 is the leading cause of the sensitivity to CHK1i+Gemcitabine induced replication stress, can the authors show the FOXM1 expression status in the tolerant cell population identified in Figure 1B (lowest panel)? Alternatively, can they plot FOXM1 expression level in the same tSNE plots shown in Figure 3B to 3D to see whether some of the γH2Ax low populations also show reduced FOXM1 expression?

      FOXM1 expression levels were not increased with gH2AXhigh versus gH2AXlow HRASG12V cells in the single cell RNA-sequencing data (Figure 4C in revised manuscript). However, as mentioned in our answer to point #4 we performed an additional experiment, which showed a strong positive correlation between phospho-FOXM1 and γH2AX (as measured by flow cytometry) in S-phase cells (Figure 4D). This indicates that the active form of the FOXM1 indeed increases as yH2AX levels increase, consistent with the observed increase in FOXM1 target genes. These results are described in lines 201-209.

      1. Clonogenic survival assay in Figure 4D was not quantified properly in Figure 4E. To rule out the siFOXM1 mediated growth/survival defects and to only focus on the siFOXM1 mediated resistance to CHK1i+Gemcitabine, the survival rate (intensity percent in this case) of CHK1i+Gemcitabine treated condition should be normalized against the survival rate of the Vehicle condition. E.g., the intensity percent of the siSCRAMBLE after treatment should be divided by the intensity percent of the untreated siSCRAMBLE; the intensity percent of the si#1 after treatment should be divided by the intensity percent of the untreated si#1, and so on. If the authors would like to show siFOXM1 induced growth/survival defects, they can still present the left part of the Figure 4E (the Vehicle group).

      Originally, we chose to show the absolute IntensityPercent for all groups, without normalizing to the untreated group, because we wanted to also highlight the FOXM1-mediated changes in growth. We agree that normalizing the IntensityPercent of the drug-treated group to the vehicle group better highlights the siFOXM1-mediated resistance. We have therefore re-analyzed the data and presented it this way in Figure 5E (described in lines 293-295). We moved our original Figure 4E to a new supplemental figure (Figure S4B) to still point out the effects of siFOXM1 on cell growth in untreated cells.

      Minor:

      1. In line 176, the author claimed that 'Interestingly, rare cells treated with CHK1i + gemcitabine are located within the untreated cell cluster (Fig. 3C)'. However, it is not as obvious where these cells are in the plot, especially to people who are new to tSNE plots. It would be appreciated if the authors could label these cells by circling them with red lines and make the point stronger.

      Rather than circling these points (we thought this would make the plot too "busy"), we have created an inset that zooms in on the region where we see the untreated cells within the untreated cell cluster. Within the inset, we use arrows to point out the cells we are referring to. This can be seen in our updated Figure 3C.

      1. In Figure S2B, it will be ideal to label clearly which genes are upregulated genes and which are downregulate.

      On the x-axis of the heatmap, we have drawn lines to separate the downregulated and upregulated genes.

      1. In line 50, the word 'multifaced' needs to be corrected to 'multifaceted'.

      Thank you for catching this, we have fixed it.

      1. It is unclear what 'underly drug resistance' means in line 150.

      We have reworded this sentence so that is more clear. It is now written as follows: "we aimed to identify gene-expression programs that mediate the low level of RS in a subset of cells, which could potentially mediate drug resistance". This change is in lines 155.

      1. It is advised that the phrase 'cell cycle position' could be changed to 'cell cycle phase' or 'cell cycle stage'.

      We purposefully used the phrase "cell cycle position" because we wanted to emphasis gradient-like progress through the cell cycle rather than a discrete distinction from one-phase to the next. We have reworded the text slightly to now say "position within S-phase" (lines 163, 187, 191, 208), since all the cells we are interested in are in S phase, but some are further through S phase than others.

      1. In line 185, the word 'in' after 'within' can be removed.

      Thank you for catching this, we have fixed it.

      1. In line 194, 'Among genes downregulated in γH2AXlow cells, the expression of ANLN, HMGB2, CENPE, MKI67 and UBE2C correlated' is missing an 'are' in front of the word 'correlated'.

      Thank you for catching this, we have fixed it.

      1. In line 239, Fig.SC3 should be Fig. S3C.

      Thank you for catching this, we have fixed it.

      1. FOXM1 is known as a crucial gene for G2/M transition. Therefore, FOXM1 knockdown cells are expected to be mostly arrested at the G2/M interface. Therefore, in line 244, it is incorrect to say stronger FOXM1 knockdown induced a 'lower proportion of cells in G2 phase'. In fact, as shown in Figure 4C, cells are accumulating in G2 phase (peaking around 11M on the DAPI axis) and depleted from G1 phase (peaking around 7M).

      We have reworded this to say that there is "a higher proportion of cells in S-phase and a less distinct G2 peak" (lines 270-271). The DAPI profiles of the scrambled, siFOXM1 #1, and siFOXM1 #2 conditions all show an S-phase "valley" between a G1 and G2 peak (the valley sits at about 8M-9M). In the siFOXM1 #3 and siFOXM1 #4 conditions, we no longer see this valley, therefore we interpret this as cells still in S-phase. If they had progressed from S-phase into G2 phase, we expect that we would again see this "valley" to the left of a clear G2 peak. In the figure below, we overlayed DNA content histograms of the different FOXM1 targeting siRNAs with the scrambled siRNA to demonstrate this point more clearly.

      Reviewer #1 (Significance (Required)):

      Advance: The study reported a novel reversible fixation technique which can lead to potentially good citations. However, the findings from the single cell sequencing alone fell short in novelty to reach high impact because FOXM1 has been reported to impact on cellular sensitivity to CHK1 inhibition mediated replication stress (PMC7970065). Moreover, the study did not provide mechanistic explanation to the observed phenotype but only validated the finding from the sequencing, and the gene of focus (FOXM1) was not originally identified from the sequencing, slightly undermining the paper's foundation. To make it a better paper. the authors need to be less biased when it comes to data analysis and interpretation.

      Audience: People who are interested in basic research in cell cycle, DNA damage, cancer, chemotherapy would be interested.

      My expertise: Cancer, DNA damage, cell cycle

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Replication stress activates ATR and CHEK1 kinases as part of the inter S phase DNA damage response. CHEK1 kinase inhibitors (CHK1i) have been shown to induce an accumulation of unresolved replication stress and widespread DNA damage and cell death caused by replication catastrophe, and are therefore under clinical evaluation. At the same time, CHEK1 inhibition results in the activation of CDK1 and FOXM1 and premature expression of G2/M genes (Saldivar et al., 2018 Science). FOXM1-drivent premature mitosis has been shown to be required for the replication catastrophe and CHK1i sensitivity (Branigan et al., 2021 Cell Rep.). In this study, Segeren and colleagues set out to investigate the mechanisms of replication stress tolerance. They used CHK1i inhibitors in combination with the DNA-damaging chemotherapeutic agent Gemcitabine and oncogenic HRASG12V expression to increase replication stress. The authors utilized an intriguing setup of combined immunofluorescence staining followed by single cell RNA-seq analysis to overcome limitations of bulk cell analyses. In particular, the authors sought to identify genes that are differentially regulated in replication stress-tolerant cells compared to sensitive cells. However, even single cell analyses can be confounded by differences in cell cycle distribution. To mitigate this, the authors selected mid S-phase cells for their analysis. While this may not have completely eliminated minor differences in cell cycle progression, the authors identified FOXM1-regulated G2/M cell cycle genes, among others, that were down-regulated in the tolerant cells. When the authors followed up on the effect of these genes on replication stress tolerance, they identified FOXM1 knockdown as the only robust mediator of replication stress tolerance.

      Major comments:

      The authors observed that cell cycle distribution could be a major confounding factor in their single cell analysis and attempted to reduce this variation by selecting mid S-phase cells based on the DAPI signal. The authors then chose to compare gH2AXlow and gH2AXhigh subpopulations of RPE-HRASG12V cells because their "DAPI signal was comparable" (line 181-184). However, their data show that these subpopulations also show differences in their DAPI signal distribution, with gH2AXlow cells tending to have lower DAPI signals than gH2AXhigh cells (Supplementary Figure 2A). Thus, the major confounding factor that the authors sought to remove seems to have prevailed and it remains possible that the difference in cell cycle gene expression is merely due to differences in cell cycle progression of the individual cells. Given that DAPI information seem to be readily available for the individual cells, the authors should normalize their analysis to the DAPI signal to remove this potential confounding effect or clearly state this potential limitation.

      We agree that indeed it is very challenging to fully disentangle the influence of cell cycle distribution on our analysis. And indeed, the γH2AXlow HRASG12V cells have slightly reduced median DNA content compared to γH2AXmid and γH2AXhigh. However, this was not the case in the RPE control cells, and we still found that FOXM1 target genes were strongly enriched in the γH2AXhigh cells (Fig S2C and Table S4). Therefore, it is highly unlikely that bias in S-phase position distributions does not explain our results. Nevertheless, to be transparent about this write in the Results on lines 192-193 the following: "The other groups all showed similar DAPI intensities, although gH2AXlow RPE-HRASG12V cells showed a slight but statistically significant reduction compared to their gH2AXhigh counterparts (Fig. S2A)".

      In our subsequent experiments to assess the relationship between phospho-FOXM1 (representing the transcriptionally active protein) and γH2AX, we observed that though there was a strong correlation between pFOXM1 and γH2AX, there was no correlation between phospho-FOXM1 and DAPI (Figure 4D-E). We therefore would like to point out that although our readout for replication stress inevitably increases as cells progress through DNA replication, heterogeneity in phospho-FOXM1 levels cannot be explained by position in S-phase. These results are described in lines 203-209.

      Finally, we do not think it would be statistically appropriate to use the DAPI signal (generated by fluorescence intensity as measured by the flow cytometer) as a normalization factor for our gene expression data.

      Minor comments:

      The findings of Saldivar et al., 2018 Science and Branigan et al., 2021 Cell Rep. should be mentioned in the introduction.

      As recommended, we mentioned both these papers in the introduction. In line 62, we cite the Branigan paper as showing that modulation of cell cycle regulators is a strategy used by cancer cells to resist replication stress. In lines 63-65, we reference them as follows: "The RS response is tightly linked with cell cycle progression, as multiple intra S-phase checkpoint kinases play a role in curtailing proteins involved in the S-G2 transition (Branigan et al., 2021, Saldivar et al., 2018)."

      The authors conclude that "cell cycle position can be a major confounding factor when evaluating the transcriptomic response to RS." It should be noted that stochastic differences in the cell cycle distribution of bulk cells are perhaps the best-known confounder in single cell analyses (see, for example, Buettner et al., 2015 Nat. Biotechnol.).

      We chose to reference the Buettner paper to justify our decision to select only cycling cells in our scRNA seq approach. Our reference to the paper, and to the fact that cell cycle distribution is a major confounder in single cell analysis, is in lines 138-140.

      Supplementary Figure 2A: The median should be added to the violin plots.

      As suggested, we have added medians to the violin plots. In addition, we added details on statistical analysis.

      The statement "Differential expression analysis revealed 19 genes that were significantly downregulated in gH2AXlow RPE-HRASG12V cells, suggesting that elevated levels of these genes are correlated with sensitivity to RS-inducing drugs" refers to Figure 3E and Table S1. However, Table S1 lists the "key resources" and does not seem to be related to this statement. A table showing log2fold-changes and FDR values should be added and referenced here.

      We have generated tables with the fold change values of differentially expressed genes between the yH2AX low and yH2AX high cells. These are found in Table S1 (for HRAS G12V cells) and Table S2 (for Control cells) in the supplementary file of the revised manuscript. The "key resources" has been moved to Table S5.

      The statement "Remarkably, Braningan and co-workers observed no effect of full FOXM1 deletion on cell cycle progression" seems somewhat inconsistent with what has been stated and assessed in that study. The authors may want to replace "progression" with "distribution". A reduction in proliferation is commonly observed when FOXM1 levels are reduced.

      In addition, the authors may want to consider that their addition of HRASG12V and Gemcitabine may contribute to a more substantial S phase checkpoint response.

      We agree with the reviewer that a reduction in proliferation is commonly observed when FOXM1 levels are reduced (Barger et al., 2021, Cheng et al., 2022, Yang et al., 2015, Wu et al., 2010), but in Branigan et al., they see no decrease in proliferation with knockout of FOXM1. They state "There were no apparent differences in the growth rate of the LIN54 and FOXM1 KO versus EV cells over 10 days (Figure 1G)". Though they do not elaborate on why they see this unexpected response, we suspect a permanent full knockout of FOXM1 could cause compensatory adaptation in their cell lines. In our experiments, we perform transient knockdowns, so cells may not have the time to adapt to the loss of FOXM1 and obtain compensatory mechanisms that would allow them to continue cycling as rapidly as control cells treated with non-targeting siRNA.

      However, we decided to remove this from the Discussion section, as it seemed to interrupt the discussion about the potential mechanisms underlying protection against DNA damage by FOXM1 depletion.

      The statement that "the mechanism by which high FOXM1 activity is a prerequisite to accumulate DNA damage in S-phase during CHK1 inhibition remains to be uncovered" seems to neglect that premature mitosis has been suggested as a mechanistic cause (Branigan et al., 2021 Cell Rep.). It would be helpful if the authors could elaborate on this.

      In our discussion, we do already emphasize the described role of FOXM1 in promoting premature mitosis (lines 330-337), but we argue that in our experimental conditions we are observing another - previously undescribed- role for FOXM1 in promoting replication stress during S phase. We previously observed with live cell imaging that CHK1i + gemcitabine does not cause premature mitosis in RPE-HRASG12V cells (published in Segeren et al. Oncogene 2022, Figure 5). Instead, these cells typically showed a cell cycle exit from G2. This makes it highly unlikely that premature mitosis is the reason why these cells would accumulate excessive DNA damage. We realize now that it was an important omission not to elaborate on this and have added this clarification to the Discussion (lines 341-345 in revised manuscript). In addition, we have removed a few lines of less important text (about the lack of direct effect of FOXM1 KO in the Branigan paper; see answer to previous point) to improve clarity and readability.

      Reviewer #2 (Significance (Required)):

      General assessment: The strength of the study is the intriguing methodology of combined immunofluorescence followed by single cell RNA-seq. The limitations are that this methodology does not seem to fully solve the stated problems. In addition, the study is essentially limited to confirming previous findings.

      Advance: The study strengthens current knowledge but provides essentially no advance. The authors confirm existing knowledge with an additional approach. While this is not an advance in itself, it is important to the community.

      Audience: I felt that the study would appeal to a basic science audience. In particular, the CHK1i and intra S-phase checkpoint areas, with limited interest beyond that.

      My relevant expertise lies in transcriptomics, gene regulation and the cell cycle.

      Reference list

      Barger, C.J., Chee, L., Albahrani, M., Munoz-Trujillo, C., Boghean, L., Branick, C., Odunsi, K., Drapkin, R., Zou, L. & Karpf, A.R. 2021, "Co-regulation and function of FOXM1/RHNO1 bidirectional genes in cancer", eLife, vol. 10, pp. 10.7554/eLife.55070.

      Branigan, T.B., Kozono, D., Schade, A.E., Deraska, P., Rivas, H.G., Sambel, L., Reavis, H.D., Shapiro, G.I., D'Andrea, A.D. & DeCaprio, J.A. 2021, "MMB-FOXM1-driven premature mitosis is required for CHK1 inhibitor sensitivity", Cell reports, vol. 34, no. 9, pp. 108808.

      Cheng, Y., Sun, F., Thornton, K., Jing, X., Dong, J., Yun, G., Pisano, M., Zhan, F., Kim, S.H., Katzenellenbogen, J.A., Katzenellenbogen, B.S., Hari, P. & Janz, S. 2022, "FOXM1 regulates glycolysis and energy production in multiple myeloma", Oncogene, vol. 41, no. 32, pp. 3899-3911.

      Overholser, B.R. & Sowinski, K.M. 2008, "Biostatistics primer: part 2", Nutrition in clinical practice : official publication of the American Society for Parenteral and Enteral Nutrition, vol. 23, no. 1, pp. 76-84.

      Saldivar, J.C., Hamperl, S., Bocek, M.J., Chung, M., Bass, T.E., Cisneros-Soberanis, F., Samejima, K., Xie, L., Paulson, J.R., Earnshaw, W.C., Cortez, D., Meyer, T. & Cimprich, K.A. 2018, "An intrinsic S/G(2) checkpoint enforced by ATR", Science (New York, N.Y.), vol. 361, no. 6404, pp. 806-810.

      Segeren, H.A., van Liere, E.A., Riemers, F.M., de Bruin, A. & Westendorp, B. 2022, "Oncogenic RAS sensitizes cells to drug-induced replication stress via transcriptional silencing of P53", Oncogene, vol. 41, no. 19, pp. 2719-2733.

      Wu, Q., Liu, C., Tai, M., Liu, D., Lei, L., Wang, R., Tian, M. & Lu, Y. 2010, "Knockdown of FoxM1 by siRNA interference decreases cell proliferation, induces cell cycle arrest and inhibits cell invasion in MHCC-97H cells in vitro", Acta Pharmacologica Sinica, vol. 31, no. 3, pp. 361-366.

      Yang, K., Jiang, L., Hu, Y., Yu, J., Chen, H., Yao, Y. & Zhu, X. 2015, "Short hairpin RNA- mediated gene knockdown of FOXM1 inhibits the proliferation and metastasis of human colon cancer cells through reversal of epithelial-to-mesenchymal transformation", Journal of experimental & clinical cancer research : CR, vol. 34, no. 1, pp. 40-1.

      We want to thank both reviewers for their thorough and constructive review of our manuscript. Below, we have re-iterated their comments followed by an explanation of how we have revised the manuscript to address this.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript presented by Segeren et al. applied an interesting HRASG12V inducible cell model to study the mechanism of cellular resistance to replication stress inducing agents. They also employed a novel reversible fixation technique which allows them to FAC sort cells according to their replication stress levels before applying single cell sequencing analysis to the same cell populations. By comparing cells with low levels of replication stress to cells with high levels of replication stress, they found that reduction in gene expression of FOXM1 target genes potentially protects cells against replication stress induced by CHK1i plus gemcitabine combination.

      Overall, this is a very interesting study. However, the following points should be addressed prior to publication:

      Major:

      1. Figure 3E and 3F showed two lists of differentially expressed genes in γH2Ax low cells. However, instead of arbitrarily extracting the FOXM1 target genes and TP53 targeted genes, it would be appreciated if the author could perform an unbiased and unsupervised gene set enrichment analysis such as Enrichr.
      2. At the experiment design stage, the authors also included HRASG12V status as a test condition because they previously found that HRASG12V mutation induces basal level replication stress and they would like to include this condition to study the adaptation to replication stress (line 110). However, the difference in HRASG12V negative and HRASG12V positive cells was not followed up in the later part of the paper. Can they show lists of differentially expressed genes identified under HRASG12V negative conditions as well (in the same format of Figure 3E and 3F) and comment on the differences as well?
      3. In line 194 and in Figure S2B, the authors claimed that ANLN, HMGB2, CENPE, MKI67, and UBE2C demonstrated co-expression, but other genes displaying similar correlation scores were not commented (such as F3, CYR61, CTGF, etc). To avoid being biased at the analysis stage, the authors should define clearly what the cut-off of correlation score is and why only co-expression of ANLN, HMGB2, CENPE, MKI67, and UBE2C were mentioned.
      4. In line 215, instead of validating CENPE, UBE2C, HMGB2, ANLN, and MKI67 individually, the authors decided to validate FOXM1 instead, because they believe all the aforementioned genes are targets of FOXM1, therefore, validating FOXM1 alone would suffice. Again, this makes the validation process also biased. CENPE, UBE2C, HMGB2, ANLN, and MKI67 should be validated individually because they might sensitize cells to replication stress via different mechanisms. Besides, if all these genes were identified together because they are FOXM1 target genes, why did the authors not identify FOXM1 itself as a differentially expressed gene from the single cell sequencing? The sequencing only analyzed the S/G2/M cells, expression of FOXM1 should be detected easily.
      5. As pointed out by the author in the Discussion, single cell sequencing is not good at differentiating the causes from the consequences. The author tried to validate many of the differentially expressed genes in γH2Ax low cells. However, the fact that only FOXM1 knockdown passed the validation and deconvolution pointed out that the great majority of the identified genes are not the cause of the sensitivity change to replication stress inducing agents but likely the consequences. Therefore, in Figure S2C and S2D, it would be better that the authors could just name the genes as 'downregulated genes' in Figure S2C and 'upregulated genes' in Figure S2D. Taking into consideration that the expression change in the great majority of these genes are just consequences of sensitivity change to replication stress, defining them as 'potentially sensitizing' genes and 'potentially conferring resistance' genes is rather misleading.
      6. To better prove that FOXM1 is the leading cause of the sensitivity to CHK1i+Gemcitabine induced replication stress, can the authors show the FOXM1 expression status in the tolerant cell population identified in Figure 1B (lowest panel)? Alternatively, can they plot FOXM1 expression level in the same tSNE plots shown in Figure 3B to 3D to see whether some of the γH2Ax low populations also show reduced FOXM1 expression?
      7. clonogenic survival assay in Figure 4D was not quantified properly in Figure 4E. To rule out the siFOXM1 mediated growth/survival defects and to only focus on the siFOXM1 mediated resistance to CHK1i+Gemcitabine, the survival rate (intensity percent in this case) of CHK1i+Gemcitabine treated condition should be normalized against the survival rate of the Vehicle condition. E.g., the intensity percent of the siSCRAMBLE after treatment should be divided by the intensity percent of the untreated siSCRAMBLE; the intensity percent of the si#1 after treatment should be divided by the intensity percent of the untreated si#1, and so on. If the authors would like to show siFOXM1 induced growth/survival defects, they can still present the left part of the Figure 4E (the Vehicle group).

      Minor:

      1. In line 176, the author claimed that 'Interestingly, rare cells treated with CHK1i + gemcitabine are located within the untreated cell cluster (Fig. 3C)'. However, it is not as obvious where these cells are in the plot, especially to people who are new to tSNE plots. It would be appreciated if the authors could label these cells by circling them with red lines and make the point stronger.
      2. In Figure S2B, it will be ideal to label clearly which genes are upregulated genes and which are downregulate.
      3. In line 50, the word 'multifaced' needs to be corrected to 'multifaceted'.
      4. It is unclear what 'underly drug resistance' means in line 150.
      5. It is advised that the phrase 'cell cycle position' could be changed to 'cell cycle phase' or 'cell cycle stage'.
      6. In line 185, the word 'in' after 'within' can be removed.
      7. In line 194, 'Among genes downregulated in γH2AXlow cells, the expression of ANLN, HMGB2, CENPE, MKI67 and UBE2C correlated' is missing an 'are' in front of the word 'correlated'.
      8. In line 239, Fig.SC3 should be Fig. S3C.
      9. FOXM1 is known as a crucial gene for G2/M transition. Therefore, FOXM1 knockdown cells are expected to be mostly arrested at the G2/M interface. Therefore, in line 244, it is incorrect to say stronger FOXM1 knockdown induced a 'lower proportion of cells in G2 phase'. In fact, as shown in Figure 4C, cells are accumulating in G2 phase (peaking around 11M on the DAPI axis) and depleted from G1 phase (peaking around 7M).

      Significance

      Advance:

      The study reported a novel reversible fixation technique which can lead to potentially good citations. However, the findings from the single cell sequencing alone fell short in novelty to reach high impact because FOXM1 has been reported to impact on cellular sensitivity to CHK1 inhibition mediated replication stress (PMC7970065). Moreover, the study did not provide mechanistic explanation to the observed phenotype but only validated the finding from the sequencing, and the gene of focus (FOXM1) was not originally identified from the sequencing, slightly undermining the paper's foundation. To make it a better paper. the authors need to be less biased when it comes to data analysis and interpretation.

      Audience:

      People who are interested in basic research in cell cycle, DNA damage, cancer, chemotherapy would be interested.

      My expertise:

      Cancer, DNA damage, cell cycle

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      The work from Petazzi et al. aimed at identifying novel factors supporting the differentiation of human hematopoietic progenitors from induced pluripotent stem cells (iPSCs). The authors developed an inducible CRISPR-mediated activation strategy (iCRISPRa) to test the impact of newly identified candidate factors on the generation of hematopoietic progenitors in vitro. They first compared previously published transcriptomic data of iPSCderived hemato-endothelial populations with cells isolated ex vivo from the aorta-gonadmesonephros (AGM) region of the human embryo and they identified 9 transcription factors expressed in the aortic hemogenic endothelium that were poorly expressed in the in vitro differentiated cells. They then tested the activation of these candidate factors in an iPSCbased culture system supporting the differentiation of hematopoietic progenitors in vitro. They found that the IGF binding protein 2 (IGFBP2) was the most upregulated gene in arterial endothelium after activation and they demonstrated that IGFBP2 promotes the generation of functional hematopoietic progenitors in vitro.

      Strengths:

      The authors developed an extremely useful doxycycline-inducible system to activate the expression of specific candidate genes in human iPSC. This approach allows us to simultaneously test the impact of 9 different transcription factors on in vitro differentiation of hematopoietic cells, and the system appears to be very versatile and applicable to a broad variety of studies.

      The system was extensively validated for the expression of 1 transcription factor (RUNX1) in both HeLa cells and human iPSC, and a detailed characterization of this test experiment was provided.

      The authors exhaustively demonstrated the role of IGFBP2 in promoting the generation of functional hematopoietic progenitors in vitro from iPSCs. Even though the use of IGFBP2interacting proteins IGF1 and IGF2 have been previously reported in human iPSC-derived hematopoietic differentiation in vitro (Ditadi and Sturgeon, Methods 2016; Ng et al., Nature Biotechnology 2016), and IGFBP-2 itself has been shown to promote adult HSC expansion ex vivo (Zhang et al., Blood 2008), its role on supporting in vitro hematopoiesis was demonstrated here for the first time.

      Weaknesses:

      Although the authors performed a very thorough characterization of the system in proof-ofprinciple experiments activating a single transcription factor, the data provided when 9 independent factors were used is not sufficient to fully validate the experimental strategy. Indeed, in the current version of the manuscript, it is not clear whether the results presented in both the scRNAseq analysis and the functional assays are the consequence of the simultaneous activation of all 9 TF or just a subset of them. This is essential to establish whether all the proposed factors play a role during embryonic hematopoiesis, and a more complete analysis of the scRNAseq dataset could help clarify this aspect.

      Similarly, the data presented in the manuscript are not sufficient to clarify at what stage of the endothelial-to-hematopoietic transition (EHT) the TF activation has an impact. Indeed, even though the overall increase of functional hematopoietic progenitors is fully demonstrated, the assays proposed in the manuscript do not clarify whether this is due to a specific effect at the endothelial level or to an increased proliferation rate of the generated hematopoietic progenitors. Similar conclusions can be applied to the functional validation of IGFBP2 in vitro.

      The overall conclusions are sometimes vague and not always supported by the data. For instance, the authors state that the CRISPR activation strategy resulted in transcriptional remodeling and a steer in cell identity, but they do not specify which cell types are involved and at what level of the EHT process this is happening. In the discussion, the authors also claim that they provided evidence to support that RUNX1T1 could regulate IGFBP2 expression. However, this is exclusively based on the enrichment of RUNX1T1 gRNA in cells expressing higher levels of IGFBP2 and it does not demonstrate any direct or indirect association of the two factors.

      We thank the reviewer for the positive comments about the importance of our work and have now addressed the points raised as weaknesses by performing additional analysis and experiments, adding a new schematic of the mechanism, and rewording our claims.

      We have clarified the different effects mediated by the activation and the IGFBP2 addition in a summary section at the end of the results and added Figure 6, showing this in visual form. We have also clearly stated the limitations related to the correlation between RUNX1T1 and IGFBP2 in the discussion and toned down our claims regarding this throughout the entire paper. We have also reworded the text to clarify the specific cell types identified in the sequencing data that we refer to.

      Reviewer #2 (Public Review):

      To enable robust production of hematopoietic progenitors in-vitro, Petazzi et al examined the role of transcription factors in the arterial hemogenic endothelium. They use IGFBP2 as a candidate gene to increase the directed differentiation of iPSCs into hematopoietic progenitors. They have established a novel induced-CRISPR mediated activation strategy to drive the expression of multiple endogenous transcription factors and show enhanced production of hematopoietic progenitors through expansion of the arterial endothelial cells. Further, upregulation of IGFBP2 in the arterial cells facilitates the metabolic switch from glycolysis to oxidative phosphorylation, inducing hematopoietic differentiation. While the overall study and resources generated are good, assertions in the manuscript are not entirely supported by the experimental data and some claims need further experimental validation.

      We thank the reviewer for the positive comments, and we have provided new data and analysis to make sure that all our assertations are clearly supported and also reworded those where limitations were identified by the reviewers.

      Recommendations for the authors:

      Reviewing Editor (Recommendations For The Authors):

      The assessment could change from "incomplete" to "solid" if the authors: i) improve data analysis (for both scRNAseq and functional assays) by providing additional information that could strengthen their conclusions, as suggested in the specific comments by both reviewers; ii) either provide new functional evidence supporting their mechanistic conclusion or alternatively tone down the claims that are not fully supported by data and acknowledge the limitations raised by reviewers in the discussion; (iii) the issue of paracrine signaling to expand only hematopoietic progenitors needs to be addressed.

      We have now improved the data analysis and provided additional functional tests to strengthen our conclusions and toned down those that were identified by the reviewers as not supported enough and included a discussion on these limitations. We have also reworded the section about the paracrine signaling throughout the paper.

      Reviewer #1 (Recommendations For The Authors):

      Figure 1 contains exclusively published data. It might be more appropriate to use it as a supplementary figure or as part of a more exhaustive figure (maybe combining Figures 1 and 2 together?).

      Figure 1 contained novel bioinformatic analyses that represent the base of our research and it has a different content and focus to figure 2, which is already a large figure. We therefore believe it is better to keep it as a separate figure, containing a new panel now too. 

      It seems there is an issue with Figure S3 labelling:

      • In line 112, Figure S2A-B does not display genomic PCR and sequencing results;

      • In line 123, Figure S3D-E does not show viability and proliferation data;

      • In line 127, Figure S3G does not show mCherry expression in response to DOX;

      We apologies for the confusion with the numbers, we have now correctly labelled the figures.

      It would be more informative to include gates and frequency on flow cytometry plots in Figure S3, to be able to evaluate the extent of the reduction in mCherry expression.

      We have now included the gating and frequency of mCherry-expressing cells in Supplementary Figure 3D.

      It is not clear from the text and figures whether the SB treatment was maintained throughout the hematopoietic differentiation protocol (line 122):

      • If so, it would be important to confirm that HDAC treatment does not affect EHT cultures

      • If not, can the authors provide some evidence that transgene silencing is not occurring during hematopoietic differentiation?

      We have clarified that we decided to treat the cells with SB exclusively in maintenance condihons because HDACs have been shown to be essenhal for the EHT (lines 138-142). We have now also included addihonal data showing the high expression of the mCherry tag reporhng the iSAM expression on day 8 (Supplementary Figure 4F).

      Can the authors provide a simple diagram summarizing the experimental strategy for each differentiation experiment in the respective supplementary figure? For instance, at what stage of the protocol was DOX added in Figure 3? Or at what stage IGFBP2 was added in Figure 5? It would be a very useful addition to the interpretation of the results.

      We have now included three schemahcs for all the experiments in the manuscript in supplementary figure 4 A-C.

      In Figure 3, the authors should provide more detailed information about the data filtering of the scRNAseq experiment, and more specifically:

      • How many cells were included in the analysis for each library after QC and filtering?

      • How "cells in which the gRNAs expression was detected" were selected? Do they include only cells showing expression of gRNAs for all 9 TF?

      This informahon is now included in the method sechon lines 773-781; the detailed code is available on the GitHub link provided in the same sechon. We have filtered the cells expressing one gRNA for the non-targehng gRNA (iSAM_NT) control and more than one for the iSAM_AGM sample. 

      In Figure 3A, it is not clear whether the expression of the 9 factors is consistently detected in all cells or just a subset of them, and the heatmap in Figure 3A does not provide this information. It would be more accurate to provide expression on a per-cell basis, for instance, as a violin plot displaying single dots representing each cell. 

      We have now included this violin plot in Supplementary Figure 4G as requested. However, this visualisation is difficult to interpret because some of the target genes’ expression seems variable in both experimental and control conditions. We had envisaged that this could have been the case and so this is why we had included the three different controls.  For this reason we chose to show the normalised expression which takes all the different variables into account (Figure 3A). 

      In Figure 3B-C, it seems that clusters EHT1 and EHT2 do not express endothelial markers anymore. Are these fully differentiated hematopoietic cells rather than cells undergoing EHT? In general, it would be quite important to provide evidence of expressed marker genes characterizing each cluster (eg. heatmap summarizing top DEG in the supplementary figure?). 

      We have now provided a spreadsheet containing the clusters’ markers that we used in

      Supplementary Table 1) a heatmap in Figure 3E. Furthermor,e we have now edited Figure 3C to include Pan Endothelial markers (PECAM1 and CDH5). These data show that the EHT1 and EHT2 cluster both express endothelial markers but are progressively downregulated as expected during endothelial to hematopoietic transition. We have also included and discussed this in the manuscript lines 192-195 and a schematic for the mechanism in Figure 6.

      In Figure 3E, displaying the proportion of clusters within each sample/library would be a more accurate way of comparing the cell types present in each library (removing potential bias introduced by loading different numbers of cells in each sample).

      We have now included the requested data in Supplementary Figure 4I and it confirms again the expansion of arterial cells in the activated cells.    

      In Figure 3G, by plating 20,000 total CD34+, the assay does not account for potential differences in sample composition. It is then hard to discriminate between the increased number of progenitors in the input or an enhanced ability of HE to undergo EHT. This is an important aspect to consider to precisely identify at what level the activation of the 9 factors is acting. A proper quantification of flow cytometry data summarizing the % of progenitors, arterial cells, etc. would be useful to interpret these results.

      Lines 204-205 reworded. We are very much aware of the fact that the CD34+ cell population consists of a range of cells across the EHT process and this is precisely why we carried out this single cell sequencing analyses.  We purposely tested the effect of the observed changes in composition by colony assays

      In Figure 3G, it seems that NT cells w/o DOX have very little CFU potential (if any). Can the authors provide an explanation for this?

      We think that the limited CFU potential is due to the extensive genetic manipulation and selection that the cells underwent for the derivation of all the iSAM lines but this did not impede us from observing an effect of gene activation on CFU numbers. This is one of the primary reasons that we then validated our overall findings using the parental iPSC line in control condition and with the addition of IGFBP2. We show that the parental iPSC line gives rise to hematopoietic progenitor, both immunophenotypically (Figure 4D) and functionally, at expected levels (Figure 4B left column).

      Figure 4A shows an upregulation of IGFBP2 in arterial cells as a result of TF activation. However, from the data presented here, it is not possible to evaluate whether this is specific to the arterial cluster, or it is a common effect shared by all cell types regardless of their identity. 

      Data has now been included in Supplementary Figure 4H, which shows that all the cells show an increase in IGFBP2, but arterial cells show the highest increase. We have now edited the text to reflect this, in lines 228-230.

      In Figure 5A-B only a minority of arterial cells express RUNX1 in response to IGFBP2 treatment. Is this sufficient to explain the very significant increase in the generation of functional hematopoietic progenitors described in Figure 4? Quantification and statistical analysis of RUNX1 upregulation would strengthen this conclusion.

      We have now provided the statistical analysis showing significant upregulation of RUNX1 upon IGFBP2 addition. The p values are now provided in the figure 5 legend.

      In Figure 5 the authors conclude that IGFBP2 remodels the metabolic profile of endothelial cells. However, it is not clear which cell types and clusters were included in the analysis of Figure 5C-G. Is the switch from Glycolysis to Oxidative Phosphorylation specific to endothelial cells? Or it is a more general effect on the entire culture, including hematopoietic cells? 

      We based this conclusion on the fact that the single-cell RNAseq allows to verify that the metabolic differences are obtained in the endothelial cells. Given that we sorted the adherent cells, the majority of these are endothelial cells as shown in Figure 5A. The Seahorse pipeline includes a number of washing steps resulting in the analyses being performed on the adherent compartment which we know consists primarily of endothelial cells. We cannot exclude some contamination from non-endothelial cells but we highlight to this reviewer that the initial observation of the metabolic changes was identified in endothelial cells in the single cell sequencing data. Taken together, we believe that this implies that metabolic changes are specific to this population. We have clarified this in the line 317.

      In the discussion, the authors conclude that they "provide evidence to support the hypothesis that RUNX1T1 could regulate IGFBP2 expression". To further support this conclusion, the authors could provide a correlation analysis of the expression of the two genes in the cell type of interest. 

      Following the observation of the IGFBP2 high expression across clusters, we have now reworded this sentence in lines 382-385  We have tried to perform the correlation analysis but we believe this not to be appropriate due to the detection level of the gRNA, we have now included this as a limitation point in the discussion lines 416-427, and also toned down the conclusion we did draw about RUNX1T1 throughout the whole manuscript.

      As mentioned by the authors, IGFBP2 binds IGF1 and IGF2 modulating their function. Both IGF1 (http://dx.doi.org/10.1016/j.ymeth.2015.10.001) and IGF2 (doi:10.1038/nbt.3702) have been used in iPSC differentiation into definitive hematopoietic cells. It would be relevant to discuss/reference this in the discussion.

      We have now included the suggested reference in the section where we discuss the role of IGFBP2 in binding IGF1 and IGF2.

      Reviewer #2 (Recommendations For The Authors):

      (1) Figure 1 compares the transcriptome of human AGM and in-vitro derived hemogenic endothelial cells (HECs). It is not clear why only the genes downregulated in the latter were chosen. Are there any significantly upregulated genes, knockdown/knockout which could also serve a similar purpose? Single-cell transcriptome database analysis is very preliminary. A detailed panel with differences in cluster properties of HECs between the two systems should be provided. A heatmap of all differentially expressed genes between the two samples must be generated, along with a logical explanation for choosing the given set of genes. 

      We have now included another panel in figure 1 to better clarify the logic behind the strategy used to identify our target genes (Figure 1A).

      (2) Figure 2 - a panel describing the workflow of gRNA design and targeting for the 9 candidate genes, along with lentiviral packaging and transduction would make it easier to follow. 

      We have now included three schematics for all the experiments in the manuscript in supplementary figure 4 A-C. 

      (3) Figure 3- to assess the effect of arterial cell expansion on the emergence of hematopoietic progenitors, CD34+ Dll4+ cells should be sorted for OP9 co-culture assay.

      Using only CD34+ cells does not answer the question raised. Also, the CFU assay performed does not fully support the claim of enhanced hematopoietic differentiation since only CFU-E and CFU-GM colonies are increased in Dox-treated samples, with no effect on other colony types. OP9 co-culture assay with these cells would be required to strengthen this claim. 

      We wanted to clarify that the effect on the methylcellulose coming from the activated cells was not limited to CFU-E, as the reviewer reported; instead, it also affected CFU-GM and CFU-M. 

      We have now performed additional experiments where we sorted the CD34+ compartment into DLL4- and DLL4+ in Supplementary Figure 5D-E, which we discussed in lines 250-258. 

      (4) In Figure 3F, there appears to be a lot of variation in the DLL4% fold change values for

      DOX treated iSAM_AGM sample, which weakens the claim of increased arterial expansion.

      Can the authors explain the probable reason? It is suggested that the two other controls (iSAM_+DOX and iSAM_-DOX) should be included in this analysis. It is imperative to also show % populations rather than just fold change to gain confidence.

      We agree that there is a lot of variability. That is because differentiation happens in 3D in embryoid bodies, which contain many different cell types that differentiate in different proportions across independent experiments. We have now included the raw data in Supplementary Figure 4 D, with additional statistical analysis to show the expansion of arterial cells including also the suggested additional controls.

      (5) How does activation of these target genes cause increased arterialization? Is the emergence of non-HE populations suppressed? Or is it specific to the HE? The data on this should be clarified and also discussed. ANTO/Lesley text

      We have provided additional data clarifying the connection between increased arterialisation and hemogenic potential. We showed that the activation induces increased arterialisation and that IGFBP2 acts by supporting the acquisition of hemogenic potential. We have discussed this in lines 326-348 and provided a new figure to explain this in detail (figure 6)

      (6) Considering that IGFBP2 was chosen from the activated target gene(s) cluster, can the authors explain why the reduced CFU-M phenomenon observed in Figure 3G does not appear in the MethoCult assay for IGFBP2 treated cells (Figure 4B)?

      The difference could be explained by the fact that in Figure 3G, the cells underwent activation of multiple genes, while in Figure 4B, they were only exposed to IGFBP2. Our results show that IGFBP2 could at least partially explain the phenotype that we see with the activation, but we believe that during the activation experiments, there might be other signals available that might not be induced by IGFBP2 alone. We have also added a summary section and a figure to clarify the different mechanisms of action of the gene activation and IGFBP2.

      (7) Figure 4- while the experiments conducted support the role of IGFBP2 in increasing hematopoietic output, there is no experimental evidence to prove its function through paracrine signalling in HECs. The authors need to provide some evidence of how IGFBP2 supplementation specifically expands only the hematopoietic progenitors. Experimental strategies involving specifically targeting IGFBP2 in hemogenic/arterial endothelial cells are required to prove its cell type specific function. Additionally, assessing the in vivo functional potential of the hematopoietic cells generated in the presence of IGFBP2, by bone-marrow transplantation of CD34+ CD43+ cells, is essential. 

      The role of IGFBP2 in the context of HSC production and expansion was not the topic of our research, and we have not claimed that IGFBP2  affects the long-term repopulating capacity of HSPCs. Therefore, we believe that the requested experiments are not required to support the specific claims that we do make. We have now provided more experiments and bioinformatic analysis that support the role of IGFBP2 in inducing the progression of EHT from arterial cells to hemogenic endothelium, and to avoid misunderstandings, we have toned down our claims by editing the text regarding its paracrine effect s. 

      (8) Figure 4C-D -It is recommended to plot % populations along with fold change value. As this is a key finding, it is important to perform flow cytometry for additional hematopoietic markers- CD144, CD235a and CD41a to demonstrate whether this strategy can also expand erythroid-megakaryocyte progenitors. Telma

      Figure 4C already shows the percentage values; we have now added the percentage for Figure 4D in SF5C. We have also performed additional analysis as requested and added the data obtained to Supplementary Figure 5D.

      (9) In Figure 5, analysis showing the frequency of cells constituting different clusters, between untreated and IGFBP2-treated samples in the single-cell transcriptome analysis is essential. Additional experiments are required to validate the function of IGFBP2 through modulation of metabolic activity. Inhibition of oxidative phosphorylation in the IGFBP2treated cells should reduce the hematopoietic output. Authors should consider doing these experiments to provide a stronger mechanistic insight into IGFBP2-mediated regulation of hematopoietic emergence.

      We have now included the requested cluster composition in Supplementary Figure 5F. We decided not to include further tests on the metabolic profile of IGFBP2 as we already discussed in other papers that showed, using selective inhibitors, that the EHT coincides with a glycol to OxPhos switch. 

      (10) It is very striking to see that IGFBP2 supplementation changes the transcriptional profile of developing hematopoietic cells by increasing transcription of OXPHOS-related genes with concomitant reduction of glycolytic signatures, particularly at Day 13. However, the mitochondrial ATP rate measurements do not seem convincing. The bioenergetic profiles show that when mitochondrial inhibitors are added, both groups exhibit decreased OCR values and, on the other hand, higher ECAR. This indicates that both groups have the capability to utilize OXPHOS or glycolysis and may only differ in their basal respiration rates.

      Differences in proliferation rate can cause basal respiration to change. There is no information on how the bioenergetic profile was normalized (cell no./protein amount). Given that IGFBP2 has been shown to increase proliferation, it is very likely that the cells treated with IGFBP2 proliferated faster and therefore have higher OCR. The data needs to be normalized appropriately to negate this possibility.

      We have previously tested whether IGFBP2 causes an increase in proliferation by analysing the cell cycle of cells treated with it, as we initially thought this could be a mechanism of action. We have now provided the quantification of the cell cycle in the cells treated with IGFBP2, showing no effect was observed in cell cycle Supplementary Figure 4E. Following this analysis, we decided to plate the same number of cells and test their density under the microscope before running the experiment; each experiment was done in triplicate for each condition. We have now added this info to the method sections lines 806-813.  We did not comment on the basal difference, which we agree might be due to several factors, but we only compared the difference in response to the inhibitors, which isn’t affected by the basal level but exclusively by their D values. We have also included the formulas used to calculate the ATP production rate.

      Overall, it appears that IGFBP2 does not seem to primarily cause metabolic changes, but simply accelerates the metabolic dependency on OXPHOS. Hence, the term 'metabolic remodelling' must be avoided unless IGFBP2 depletion/loss of function analysis is shown.

      We thank the reviewer for suggesting how to interpret the data about the dependency on OXPHOS. We have now changed the conclusions and claims about the effect of IGFBP2. We have also included a cell cycle analysis of the hematopoietic cells derived upon IGFBP2 addition to show that they don’t show differences in proliferation that could cause the increase in colony formation we observed. Regarding the assay, we have plated the same number of cells for each group to make sure we were comparing the same number of cells, which we also assessed in the microscope before the test, and we eliminated the suspension cells during the washes that preceded the measurement. The review is correct in indicating that there is a basal difference in the value of OCR and ECAR where the IGFBP2 is lower at the start and not higher, which would not conceal higher proliferation. Finally, the ATP production rate is calculated on the variation of OCR and ECAR upon the addition of inhibitors, which normalizes for the basal differences.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Recommendations For The Authors):

      Summary:

      In this manuscript, the molecular mechanism of interaction of daptomycin (DAP) with bacterial membrane phospholipids has been explored by fluorescence and CD spectroscopy, mass spectrometry, and RP-HPLC. The mechanism of binding was found to be a two-step process. A fast reversible step of binding to the surface and a slow irreversible step of membrane insertion. Fluorescence-based titrations were performed and analysed to infer that daptomycin bound simultaneously two molecules of PG with nanomolar affinity in the presence of calcium. Conformational change but not membrane insertion was observed for DAP in the presence of cardiolipin and calcium.

      Strengths:

      The strength of the study is skillful execution of biophysical experiments, especially stoppedflow kinetics that capture the first surface binding event, and careful delineation of the stoichiometry.

      Weaknesses:

      The weakness of the study is that it does not add substantially to the previously known information and fails to provide additional molecular details. The current study provides incremental information on DAP-PG-calcium association but fails to capture the complex in mass spectrometry. The ITC and NMR studies with G3P are inconclusive. There are no structural models presented. Another aspect missing from the study is the reconciliation between PG in the monomer, micellar, and membrane forms.

      Besides the two-stage process, another important finding in the current work is the stable complex that plays a critical role in the drug uptake both in vitro and in B. subtilis. This complex has been shown to be a stable species in HPLC and its binding stoichiometry and affinity have been quantitatively characterized. The complex may not be stable enough in gas phase to be detected in the MS analysis, which was designed to detect the phospholipid and Dap components, not the complex itself. The structural model of this complex is clearly proposed and presented in Figure 6. 

      The NMR and ITC studies have a very clear conclusion that Dap has a weak interaction with the PG headgroup alone, which is unable to account for the Dap-PG interaction observed in the fluorescence studies. Thus, the whole PG molecule has to be involved in the interaction, leading to the discovery of the stable complex.  

      Reviewer #2 (Recommendations For The Authors):

      (1) I appreciate and agree with the comment that there are stages of daptomycin insertion, and these might involve the formation of different complexes with different binding partners (e.g. pre-insertion vs quaternary vs bactericidal). However, it seems like lipid II is an apparent participant in daptomycin membrane dynamics (Grein et al. Nature Communications 2020). It's not clear why this was excluded from analysis by the authors, or what basis there is for the discussion statement that the quaternary complex can shift into the bactericidal complex by exchanging 1 PG for lipid II. 

      We agree that lipid II and other isoprenyl lipids may be involved in the uptake and insertion of daptomycin into membrane according to the results of the Nat. Comm. paper. However, these isoprenyl lipids are very small components of the membrane in comparison to PG and their contribution to the drug uptake is thus expected to be much less significant. Nonetheless, we included farnesyl pyrophosphate (FPP) as an analog of bactoprenol pyrophosphate (C55PP), which was reported to have the same promoting effect as lipid II in the previous study, in our study but found no promoting effect in the fluorescence assay (Fig. 2B). In addition, no complex was formed when FPP replaced PG in our preparation and analysis of the drug-lipid complex. In consideration of these negative results and the expected small contribution, other isoprenyl lipids or their analogs were not included in the study.

      The statement of forming the proposed bactericidal complex from the identified complex is a speculation that is possible only when lipid II has a higher affinity for Dap than a PG ligand. To avoid confusion, we deleted the sentence’ in the revision. 

      (2) The detailed examination of daptomycin dynamics, particularly on the millisecond scale, in this paper is ideal for characterizing the effect of lipid II on daptomycin insertion. It would be helpful to either include lipid II in some analyses (micelle binding, fluorescence shifts, CD) or at least address why it was excluded from the scope of this work.

      As mentioned in the response to the first comment, we did not exclude isoprenyl lipids in our study but used some of their analogs in the fluorescence assay. Besides FPP mentioned above, we also tested geranyl pyrophosphate and geranyl monophosphate but obtained the same negative results. Lipid II was not directly used because it is one of the three isoprenyl lipids reported to have the same promoting effects in the Nat. Comm. paper and also because its preparation is not easy. Even if lipid II were different from other isoprenyl lipids in promoting membrane binding, its contribution is likely negligible at the reversible stage compared to the phospholipids because of its minuscule content in bacterial membrane. This is the main reason we did not use the isoprenyl lipids in the fast kinetic study (this stage only involves reversible binding, not insertion). 

      (3) Grein et al. 2020 saw that PG did not have a strong effect on daptomycin interaction with membranes. I believe this discrepancy is more likely due to the complex physical parameters of supported bilayers versus micelles/vesicles or some other methodological variable, but if the authors have more insight on this, it would be valuable commentary in the discussion.

      We totally agree that the discrepancy is likely due to the different conditions in the assays. It is hard to tell exactly what causes the difference. Thus, we did not attempt to comment on the cause of this difference in the discussion.

      (4) Isolation of the daptomycin complex from B. subtilis cells clearly had different traces from the in vitro complex; is it possible that lipid II is present in the B. subtilis complex? If not, a time-course extraction could be useful to support the model that different complexes have different activities. Isolates from early-stage incubation with daptomycin may lack lipid II but isolates from longer incubations may have lipid II present as the complex shifts from insertion to bactericidal.

      From the day we isolated the complex from B. subtilis, we have been looking for evidence for the previously proposed lipid complexes containing lipid II or other isoprenyl lipids but have not been successful. We did not see any sign of lipid II or other isoprenyl lipids in the MALDI or ESI mass spectroscopic data. The minute peaks in the HPLC traces are not the expected complexes in separate LC-MS analysis. However, this does not mean that such complexes are not present in the isolated PG-containing complex because: (1) the amount of such complexes may be too small to be detected due to the low content of the isoprenyl lipids; (2) the isoprenyl lipids, particularly lipid II, are not easily ionizable due to their size and unique structure for detection in mass spectrometry. 

      We don’t think the drug treatment time is the reason for the failure in detecting lipid II or other isoprenyl lipids. In our reported experiment, the cells were treated with a very high dose of Dap for 2 hours before extraction. In a separate experiment done recently, we treated B. subtilis at 1/3 of the used dose under the same condition and found all treated cells were dead after 1 hour in a titration assay, consistent with the results from reported time-killing assays in the literature. From this result, the proposed bactericidal lipid-containing complex should have been formed in the treated cells used in our extraction and isolated along with the PG-containing complex. It was not detected likely due to the reasons discussed above. To avoid the interference of the PG-containing complex, a large amount of bacterial cells might have to be treated at a low dose to isolate enough amount of the lipid II-containing complex for identification. However, isolation or identification of the lipid II-containing complex is outside the scope of the current investigation and is therefore not pursued. 

      (5) Part of the daptomycin mechanism of interacting with bacterial membranes involves the flipping of daptomycin from one leaflet to another. There was some mentioned work on the consistency of results between micelles and vesicles, but the dynamics or existence of a flipping complex in the bilayer system wasn't addressed at all in this paper.

      The current investigation makes no attempt to solve all problems in the daptomycin mode of action and is limited to the uptake of the drug, up to the point when Dap is inserted into the membrane. Within this scope, flipping of the complex is not yet involved and is thus irrelevant to the study. How the complex is flipped and used to kill the bacteria is what should be investigated next.  

      (6) The authors mention data with phosphatidylethanolamine in the text, but I could not find the data in the main or supplemental figures. I recommend including it in at least one of the figures.

      It is much appreciated that this error is identified. The POPE data was lost when the graphic (Fig. 2B) was assembled in Adobe to create Figure 2. We re-draw the graphic and reassemble the figure to solve this problem. Fig. 2B has also been modified to use micromolar for the concentration of the lipids.

      (7) Readability point: I'd suggest some consistency in the concentrations mentioned. Making the concentrations either all molar-based or all percentage-based would make comparison across figures easier.

      As suggested, we have changed the % into micromolar concentrations in Fig. 2B and also in Fig. 3A. 

      (8) The model figure is quite difficult to interpret, particularly the final stage of the tail unfolding. I recommend the authors use a zoomed-in inset for this stage, or at least simplify the diagram by removing the non-participating lipid structures. The figure legend for the model figure should also have a brief description of the events and what the arrows mean, particularly the POPS PG arrow in the final panel of the figure. I am assuming here the authors are implying that daptomycin can transiently interact with one lipid species and move to another, but the arrow here suggests that daptomycin is moving through the lipid headgroup space.

      We really appreciate the suggestions. As suggested, we put an inset to show the preinsertion complex more clearly. In addition, we have removed the green arrows originally intended to show the re-organization/movement of the phospholipids. Moreover, the legend is changed to ‘Proposed mechanism for the two-phased uptake of Dap into bacterial membrane. In the first phase, Dap reversibly binds to negative phospholipids with a hidden tail in the headgroup region, where it combines with two PG molecules to form a pre-insertion complex. In the second phase, the hidden tail unfolds and irreversibly inserts into the membrane. The inset shows the headgroup of the pre-insertion complex with the broad arrow showing the direction for the unfolding of the hidden tail. The red dots denote Ca2+.’  

      (9) The authors listed the Kd for daptomycin and 2 PG as 7.2 x 10-15 M2. Is this correct? This is an affinity in the femtomolar range.

      Please note that this Kd is for the simultaneous binding of two PG molecules, not for the binding of a single ligand that we usually refer to. Assuming that each PG contributes equally to this interaction, the binding affinity for each ligand is then the squared root of 7.2 x 10-15 M2, which equals to 8.5 x 10-8 M. This is equivalent to a nanomolar affinity for PG and is a reasonably high affinity.

      Reviewer #3 (Recommendations For The Authors):

      (1) The authors reported an increase in daptomycin intensity with the increasing amount of negatively charged DMPG. A similar observation has been reported for GUVs, however, the authors did not refer to this paper in their manuscript: E. Krok, M. Stephan, R. Dimova, L. Piatkowski, Tunable biomimetic bacterial membranes from binary and ternary lipid mixtures and their application in antimicrobial testing, Biochim. Biophys. Acta - Biomembr. 1865 (2023) [1]. This paper is also consistent with the authors' observation that there is negligible fluorescence detected for the membranes composed of PC lipids upon exposure to the Dap treatment.

      As suggested, this paper is cited as ref. 29 in the revision by adding the following sentence at the end of the section ‘Dependence of Dap uptake on phosphatidylglycerol.’: ‘PG-dependent increase of the steady-state fluorescence was also observed in giant unilamellar vesicles (GUVs).29’. The numbering is changed accordingly for the remaining references.  

      (2) Please include the plot of the steady-state Kyn fluorescence vs the content of POPA (Figure 2C shows traces for DMPG, CL, and POPS). Both POPA and POPS lipids are negatively charged, however, POPS seems to interact with Dap, while POPA does not. In my opinion, this observation is really interesting and might deserve a more thorough discussion. The authors might want to describe what could be the mechanism behind this lipid-specific mode of binding.

      As suggested, a plot is now added for POPA in Fig. 2C, which is basically a flat line without significant increase for the Kyn fluorescence. Indeed, the different effect of the negative phospholipids is very interesting, indicating that the reversible binding of Dap to the lipid surface is dependent not only on the Ca2+-mediated ionic interaction but also the structure of the headgroup. In other words, Dap recognizes the phospholipids at the surface binding stage. Considering this headgroup specificity, the last sentence in the second paragraph in “Discussion’ is changed from ‘In addition, due to the low lipid specificity, this reversible binding likely involves Ca2+-mediated ionic interaction between Dap and the phosphoryl moiety of the headgroups.’ to ‘In addition, due to the specificity for negative phospholipids (Fig. 2B and 2C), this reversible binding of Dap likely involves both a nonspecific Ca2+-mediated ionic interaction and a specific interaction with the remaining part of the headgroups.’

      (3) The authors write that they propose a novel mechanism for the Ca2+-dependent insertion of Dap to the bacterial membrane, however, they rather ignored the already published findings and hypotheses regarding this process. In fact the role of Ca2+, as well as the proposed conformational changes of Dap, which allow its deeper insertion into the membrane are well known:

      The role of Ca2+ ions in the mechanism of binding is actually three-fold: (i) neutralization of daptomycin charge [2], (iii) creating the connection between lipids and daptomycin and (iii) inducing two daptomycin conformational changes. It should be noted that the interactions between calcium ions and daptomycin are 2-3 orders of magnitude stronger than between daptomycin and PG lipids [3,4]. Thus, upon the addition of CaCl2 to the solution, the divalent cations of calcium bind preferentially to the daptomycin, rather than to the negatively charged PG lipids, which results in the decrease of daptomycin net negative charge but also leads to its first conformational change [4]. Upon binding between calcium ions and two aspartate residues, the area of the hydrophobic surface increases, which allows the daptomycin to interact with the negatively charged membrane. In the next step, Ca2+ acts as a bridge connecting daptomycin with the anionic lipids. This event leads to the second conformational change, which enables deeper insertion of daptomycin into the lipid membrane and enables its fluorescence [4]. The overall mechanism has a sequential character, where the binding of daptomycin-Ca2+ complex to the negatively charged PG (or CA) occurs at the end.

      The authors should focus on emphasizing the novelty of their manuscript, keeping in mind the already published paper.

      We agree with the comments on the three general roles of calcium ion in the Dap interaction with membrane. The current investigation does not ignore the previous findings, which involve many more works than mentioned above, but takes these findings as common knowledge. Actually, the role of calcium ion is not the focus of current work. Instead, the current work focuses on how the drug is taken up and inserted into the membrane in the presence of the ion and how its structure changes in this process. With the known roles of calcium ion in mind, we propose an uptake mechanism (Fig. 6) that shows no conflict with the common knowledge.

      We would like to point out that the ‘deeper insertion into the membrane’ in the comment is different from the membrane insertion referred to in our manuscript. This ‘deeper insertion’ still remains in the reversible stage of binding to the membrane surface because all negative phospholipids can do this (causing a conformational change and fluorescence increase, as quantified in Fig.2C) but now we know that only PG can enable irreversible membrane insertion because of our work. In addition, the comment that calcium binding to daptomycin causes first conformational change is not supported by our finding that no conformational change is found for Dap in the presence of calcium in a lipid-free environment (Fig. 3B). One important aspect of novelty and contribution of our work is to clear up some of these ambiguities in the literature. Another contribution of our work is to demonstrate the formation of a stable complex between Dap and PG with a defined stoichiometry and its crucial role in the drug uptake. 

      (4) One paragraph in the section "Ca2+- dependent interaction between Dap and DMPG" is devoted to a discussion of the formation of precipitate upon extraction of DMPG-containing micelles, exposed to Dap in the calcium-rich environment. Contrary, in the absence of Dap, no precipitate was detected. The authors did not provide any visual proof for their statement. Please include proper photographs in the supplementary information.

      The precipitate formed upon extraction of the DMPG-containing micelles was too little to be visually identifiable but could be collected by centrifugation and detected by fluorescence or HPLC after dissolving in DMSO. For visualization, we show below the precipitate formed using higher amount of Dap and DMPG. The Dap-DMPG-Ca2+ complex (left tube) was formed by mixing 1 mM Dap, 2 mM DMPG and 1 mM Ca2+ and the control (right tube) was a mixture of 2 mM DMPG and 1 mM Ca2+. This is now added as Fig. S7 in the supplementary information (the index is modified accordingly) and cited in the main text.

      (5) The authors wrote that it is not clear how many calcium ions are bound to Dap-2PG complex (page 11, Discussion section). There are already reports discussing this issue. I recommend citing the paper discussing that exactly two Ca2+ ions bind to a single Dap molecule: R. Taylor, K. Butt, B. Scott, T. Zhang, J.K. Muraih, E. Mintzer, S. Taylor, M. Palmer, Two successive calcium-dependent transitions mediate membrane binding and oligomerization of daptomycin and the related antibiotic A54145, Biochim. Biophys. Acta - Biomembr. 1858, (2016) 1999-2005 [5]

      We were aware of the cited work that shows binding of two Ca2+ but also noted that there are more works showing one Ca2+ in the binding, such as the paper in [Ho, S. W., Jung, D., Calhoun, J. R., Lear, J. D., Okon, M., Scott, W. R. P., Hancock, R. E. W., & Straus, S. K. (2008), Effect of divalent cations on the structure of the antibiotic daptomycin. European Biophysics Journal, 37(4), 421–433.]. That was the reason we said ‘it is not clear how many calcium ions are bound to Dap-2PG complex’. Now, both papers are cited (as Ref. #33, 34) to support this statement.

      (6) The authors wrote two contradictory statements:

      -  PG cannot be found in mammalian cell membranes:

      "Moreover, the complete dependence of the membrane insertion on PG also explains why Dap selectively attacks Gram-positive bacteria without affecting mammalian cells, because PG is present only in bacterial membrane but not in mammalian membrane. " (Page 10, Discussion section, last sentence of the first paragraph)

      "However, Dap absorbed on bacterial surface is continuously inserted into the acyl layer via formation of complex with PG in a time scale of minutes, whereas no irreversible insertion of Dap occurs on mammalian membrane due to the absence of PG while the bound Dap is continuously released to the circulation as the drug is depleted by the bacteria." (Page 13, Discussion section)

      -  PG in trace amounts is present in mammalian membranes:

      "The proposed requirement of the pre-insertion quaternary complex increases the threshold of PG content for the membrane insertion to happen and thus makes it impossible on the surface of mammalian cells even if their plasma membrane contains a trace amount of PG." (Page 13, Discussion section).

      In fact, phosphatidylglycerol comprises 1-2 mol% of the mammalian cell membranes. Please, correct this information, which in this form is misleading to the readers.

      We appreciate the comments about the PG content in mammalian cells. Changes are made as listed below:

      (1) p10, the sentence is changed to ‘Moreover, the complete dependence of the membrane insertion on PG also explains why Dap selectively attacks Gram-positive bacteria without affecting mammalian cells, because PG is a major phospholipid in bacterial membrane but is a minor component in mammalian membrane.’ 

      (2) p13, the sentence is changed to ‘However, Dap absorbed on bacterial surface is continuously inserted into the acyl layer via formation of complex with PG in a time scale of minutes, whereas little irreversible insertion of Dap occurs on mammalian membrane due to the low content of PG while the bound Dap is continuously released to the circulation as the drug is depleted by the bacteria.’

      (3) p13, another sentence is modified to ‘The proposed requirement of the pre-insertion quaternary complex increases the threshold of PG content for the membrane insertion to happen and thus makes it less likely on the surface of mammalian cells that contain PG at a low level in the membrane.’ 

      (7) Please include information that Dap is effective only against Gram-positive bacteria and does not show antimicrobial properties against Gram-negative strains. The authors focused on emphasizing that Dap does not affect mammalian membranes, most likely due to the low PG content, however even membranes of Gram-negative bacteria are not susceptible to the Dap, despite the relatively high content of negatively charged PG in the inner membrane (e.g. inner cell membrane of E. coli has ~20% PG).

      The requested information is already included in ‘Introduction’. In this part, Dap is introduced to be only active against Gram-positive bacteria, implicating that it is not active against Gram-negative bacteria. The reason Dap is inactive against E. coli or other Gramnegative bacteria is because the outer membrane prevents the antibiotic from accessing the PG in the inner membrane to cause any harm. When the outer membrane is removed, Dap will also attack the plasma membrane of Gram-negative bacteria. 

      Literature cited in the comments:

      (1) E. Krok, M. Stephan, R. Dimova, L. Piatkowski, Tunable biomimetic bacterial membranes from binary and ternary lipid mixtures and their application in antimicrobial testing, Biochim. Biophys. Acta - Biomembr. 1865 (2023). https://doi.org/10.1101/2023.02.12.528174.

      (2) S.W. Ho, D. Jung, J.R. Calhoun, J.D. Lear, M. Okon, W.R.P. Scott, R.E.W. Hancock, S.K. Straus, Effect of divalent cations on the structure of the antibiotic daptomycin, Eur. Biophys. J. 37 (2008) 421-433. https://doi.org/10.1007/S00249-007-0227-2/METRICS.

      (3) A. Pokorny, P.F. Almeida, The Antibiotic Peptide Daptomycin Functions by Reorganizing the Membrane, J. Membr. Biol. 254 (2021) 97-108. https://doi.org/10.1007/s00232-02100175-0.

      (4) L. Robbel, M.A. Marahiel, Daptomycin, a bacterial lipopeptide synthesized by a nonribosomal machinery, J. Biol. Chem. 285 (2010) 2750127508. https://doi.org/10.1074/JBC.R110.128181.

      (5) R. Taylor, K. Butt, B. Scott, T. Zhang, J.K. Muraih, E. Mintzer, S. Taylor, M. Palmer, Two successive calcium-dependent transitions mediate membrane binding and oligomerization of daptomycin and the related antibiotic A54145, Biochim. Biophys. Acta - Biomembr. 1858 (2016) 1999-2005. https://doi.org/10.1016/J.BBAMEM.2016.05.020.

    1. https://errors.edgesuite.net/18.aa2d3e17.1733239827.ea4e98df

      Explanation:

      The annotated text, which is a URL (https://errors.edgesuite.net/18.aa2d3e17.1733239827.ea4e98df), appears to be a reference link that leads to an error page. This indicates that the link provided in the original text is either broken or incorrect.

      Given the user question, which asks for an annotation of a Spanish bill with key provisions and related bills, the broken link is significant for several reasons:

      1. Access to Information: The broken link prevents access to the actual content of the Spanish bill that the user is interested in. This is crucial because without access to the original document, it is impossible to analyze or annotate the key provisions or related bills.

      2. Reliability of Source: The presence of a broken link raises questions about the reliability and accuracy of the source. It suggests that the provided reference might not have been verified, which is important in legal and academic contexts where accuracy is paramount.

      3. User Guidance: Highlighting the broken link is essential to inform the user that the provided reference is not functional. This helps in setting the correct expectations and guides the user to seek an alternative source or correct the link if possible.

      4. Implications for Research: For anyone conducting research or needing detailed information about the bill, the broken link is a significant hindrance. It implies that further steps need to be taken to locate the correct document, such as contacting the source, searching for the document through official legislative databases, or using other references that might be available.

      In summary, the annotated text underscores the importance of having a functional and accurate reference link when dealing with legal documents. It highlights the need for the user to obtain the correct URL to access the Spanish bill in question, which is crucial for providing a comprehensive annotation of its key provisions and related bills.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      This work used a comprehensive dataset to compare the effects of species diversity and genetic diversity within each trophic level and across three trophic levels. The results showed that species diversity had negative effects on ecosystem functions, while genetic diversity had positive effects. These effects were observed only within each trophic level and not across the three trophic levels studied. Although the effects of biodiversity, especially genetic diversity across multi-trophic levels, have been shown to be important, there are still very few empirical studies on this topic due to the complex relationships and difficulty in obtaining data. This study collected an excellent dataset to address this question, enhancing our understanding of genetic diversity effects in aquatic ecosystems.

      Strengths:

      The study collected an extensive dataset that includes species diversity of primary producers (riparian trees), primary consumers (macroinvertebrate shredders), and secondary consumers (fish). It also includes the genetic diversity of the dominant species at each trophic level, biomass production, decomposition rates, and environmental data.

      The conclusions of this paper are mostly well supported by the data and the writing is logical and easy to follow.

      Weaknesses:

      (1) While the dataset is impressive, the authors conducted analyses more akin to a "meta-analysis," leaving out important basic information about the raw data in the manuscript. Given the complexity of the relationships between different trophic levels and ecosystem functions, it would be beneficial for the authors to show the results of each SEM (structural equation model).

      We understand the point raised by the reviewer. We now provide individual SEMs (Figure 3), although we limit causal relationships to those for which the p-value was below 0.2 for the sake of graphical clarity. We also provide the percentage of explained variance for each ecosystem function. We detail the graph in the Results section (see l. 317-328) and discuss them (see l. 387-398). Note that we do not detail each function separately as this would (in our opinion) result in a long descriptive paragraph from which it might be difficult to get some key information. Rather, we summarize the percentage of explained variance for each function and discuss the strength of environmental vs biodiversity effects for some examples. In the Discussion, we explain why environmental effects (on functions and biodiversity) are relatively weak. We mainly attribute this to the sampling scheme that follows an East-West gradient (weak altitudinal range) rather than an upstream-downstream gradient as it is traditionally done in rivers. The reasoning behind this sampling scheme is explained in our companion paper (Fargeot et al. Oikos 2023) to which we now refer more explicitly in the MS. Briefly, using an upstream-downstream gradient would have certainly push up the effects of the environment, but this would have made extremely complex the inference of biodiversity effects due to strong collinearity among environmental and biodiversity parameters.

      (2) The main results presented in the manuscript are derived from a "metadata" analysis of effect sizes. However, the methods used to obtain these effect sizes are not sufficiently clarified. By analyzing the effect sizes of species diversity and genetic diversity on these ecosystem functions, the results showed that species diversity had negative effects, while genetic diversity had positive effects on ecosystem functions. The negative effects of species diversity contradict many studies conducted in biodiversity experiments. The authors argue that their study is more relevant because it is based on a natural system, which is closer to reality, but they also acknowledge that natural systems make it harder to detect underlying mechanisms. Providing more results based on the raw data and offering more explanations of the possible mechanisms in the introduction and discussion might help readers understand why and in what context species diversity could have negative effects.

      (We now provide more details. However, we are unfortunately not sure that this helped reaching some stronger explanation regarding underlying mechanisms. To be frank, we did not succeed in improving mechanistic inferences based on the outputs of the SEM models. We explored visually some additional relationships (e.g. relationships between the biomass of the focal species and that of other species in the assemblage) that we now discuss a bit more, but again, this did not really help in better understanding processes. We realize this is a limitation of our study and that this can be frustrating for readers. Nonetheless, as said in the Discussion, field-based study must be taken for what they are; observational studies forming the basis for future mechanistic studies. Although we failed to explain mechanisms, we still think that we provide important field-base evidence for the importance of biodiversity (as a whole) for ecosystem functions.

      3) Environmental variation was included in the analyses to test if the environment would modulate the effects of biodiversity on ecosystem functions. However, the main results and conclusions did not sufficiently address this aspect.

      This is now addressed, see our response to your first comment. We now explain (result section) and discuss environmental effects. As explained in the MS, environmental effects are similar in strength to those of biodiversity and are not that high, which is partly explained by the sampling scheme (see Fargeot et al. 2023). This is a choice we’ve made at the onset of the experiment, as we wanted to focus on biodiversity effects and avoid strong collinearity as it is generally the case in rivers (which impedes any proper and strong statistical inferences).

      Reviewer #2 (Public review):

      Summary:

      Fargeot et al. investigated the relative importance of genetic and species diversity on ecosystem function and examined whether this relationship varies within or between trophic-level responses. To do so, they conducted a well-designed field survey measuring species diversity at 3 trophic levels (primary producers [trees], primary consumers [macroinvertebrate shredders], and secondary consumers [fishes]), genetic diversity in a dominant species within each of these 3 trophic levels and 7 ecosystem functions across 52 riverine sites in southern France. They show that the effect of genetic and species diversity on ecosystem functions are similar in magnitude, but when examining within-trophic level responses, operate in different directions: genetic diversity having a positive effect and species diversity a negative one. This data adds to growing evidence from manipulated experiments that both species and genetic diversity can impact ecosystem function and builds upon this by showing these effects can be observed in nature.

      Strengths:

      The study design has resulted in a robust dataset to ask questions about the relative importance of genetic and species diversity of ecosystem function across and within trophic levels.

      Overall, their data supports their conclusions - at least within the system that they are studying - but as mentioned below, it is unclear from this study how general these conclusions would be.

      Weaknesses:

      (4) While a robust dataset, the authors only show the data output from the SEM (i.e., effect size for each individual diversity type per trophic level (6) on each ecosystem function (7)), instead of showing much of the individual data. Although the summary SEM results are interesting and informative, I find that a weakness of this approach is that it is unclear how environmental factors (which were included but not discussed in the results) nor levels of diversity were correlated across sites. As species and genetic diversity are often correlated but also can have reciprocal feedbacks on each other (e.g., Vellend 2005), there may be constraints that underpin why the authors observed positive effects of one type of diversity (genetic) when negative effects of the other (species). It may have also been informative to run SEM with links between levels of diversity. By focusing only on the summary of SEM data, the authors may be reducing the strength of their field dataset and ability to draw inferences from multiple questions and understand specific study-system responses.

      We have addressed this remark and we ask the reviewers and the readers to refer to our response to comment 1 from reviewer 1. Regarding co-variation among biodiversity estimates (SGDCs according to Vellend’s framework), we have addressed these issues in a companion paper that we now cite and expand further in the MS (Fargeot et al. Oikos, 2023). Given the size of the dataset and its complexity (and associated analyses), we have decided to focus on patterns of species and genetic biodiversity in a first paper (Oikos paper) and then on the link between biodiversity and functions (this paper). As it can be read in the Oikos’s paper, there are no co-variation in term of biodiversity estimates; species diversity is not correlated to genetic diversity, and within facet, there are not co-variation among species. In addition, environmental predictors are highly estimate-specific (i.e. environmental predictors sustaining species and genetic estimates are idiosyncratic). As a result (see the new Figure 3), environmental effects are relatively weak (the same intensity that those of biodiversity) and collinearity among parameters is relatively weak. The second point is important, as this permit to better infer parameters from models, and this allows to discuss direct relationships (as observed in Figure 3, indirect environmental effects are relatively rare). We provide in the Discussion a bit more explanation about the absence of co-variation among biodiversity estimates (see l. 433-440).

      (5) My understanding of SEM is it gives outputs of the strength/significance of each pathway/relationship and if so, it isn't clear why this wasn't used and instead, confidence intervals of Z scores to determine which individual BEFs were significant. In addition, an inclusion of the 7 SEM pathway outputs would have been useful to include in an appendix.

      We now provide p-values (Table S2) and the seven models (Figure 3).

      (6) I don't fully agree with the authors calling this a meta-analysis as it is this a single study of multiple sites within a single region and a specific time point, and not a collection of multiple studies or ecosystems conducted by multiple authors. Moreso, the authors are using meta-analysis summary metrics to evaluate their data. The authors tend to focus on these patterns as general trends, but as the data is all from this riverine system this study could have benefited from focusing on what was going on in this system to underpin these patterns. I'd argue more data is needed to know whether across sites and ecosystems, species diversity and genetic diversity have opposite effects on ecosystem function within trophic levels.

      We agree. “Meta-regression” would perhaps be more adequate than “meta-analyses”. We changed the formulation.

      Reviewer #3 (Public review):

      The manuscript by Fargeot and colleagues assesses the relative effects of species and genetic diversity on ecosystem functioning. This study is very well written and examines the interesting question of whether within-species or among-species diversity correlates with ecosystem functioning, and whether these effects are consistent across trophic levels. The main findings are that genetic diversity appears to have a stronger positive effect on function than species diversity (which appears negative). These results are interesting and have value.

      However, I do have some concerns that could influence the interpretation.

      (7) Scale: the different measures of diversity and function for the different trophic levels are measured over very different spatial scales, for example, trees along 200 m transects and 15 cm traps. It is not clear whether trees 200 m away are having an effect on small-scale function.

      Trees identification and invertebrate (and fish) sampling are done on the same scale. Trees are spread along the river so that their leaves fall directly in the river. Traps have been installed all along the same transect in various micro-habitats. Diversity have been measured at the exact same scale for all organisms. We have modified the MS to make this clear.

      (8) Size of diversity gradients: More information is needed on the actual diversity gradients. One of the issues with surveys of natural systems is that they are of species that have already gone through selection filters from a regional pool, and theoretically, if the environments are similar, you should get similar sets of species, without monocultures. So, if the species diversity gradients range from say, 6 to 8 species, but genetic diversity gradients span an order of magnitude more, you can explain much more variance with genetic diversity. Related to this, species diversity effects on function are often asymptotic at high diversity and so if you are only sampling at the high diversity range, we should expect a strong effect.

      Fish species number varies from 1 to 11, invertebrate family number varies from 15 to 42 and the tree species number varies from 7 to 20 (see Fargeot et al. 2023 for details). We have added this information in the M&M. The gradients are hence relatively large and do not cover a restricted set of values. There is a variance in species number among sites, even if sites are collected along a relatively weak altitudinal gradient. This is obviously complex to compare to SNP (genomic) diversity. Genetic and species effects are similar in effect sizes (percentage of explained variance), so it does not seem we have biased one of the two gradients of biodiversity.

      (9) Ecosystem functions: The functions are largely biomass estimates (expect decomposition), and I fail to see how the biomass of a single species can be construed as an ecosystem function. Aren't you just estimating a selection effect in this case?

      The biomass estimated for a certain area represents an estimate of productivity, whatever the number of species being considered. Obviously, productivity of a species can be due to environmental constraints; the biomass is expected to be lower at the niche margin (selection effect). But if these environmental effects are taken into account (which is the case in the SEMs), then the residual variation can be explained by biodiversity effects. We provide an explanation (l. 217-219).

      (10) Note that the article claims to be one of the only studies to look at function across trophic levels, but there are several others out there, for example:

      Thanks, we now cite some of these studies (Li et al 2020, Moi et al. 2021, Seibold et al. 2018).

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Introduction:

      The introduction of the manuscript is generally well-structured, and the scientific questions are clearly presented. However, in each paragraph where specific aspects are introduced, the authors do not focus sufficiently on the given points. The current introduction discusses the weaknesses of previous studies extensively but lacks detailed explanations of mechanisms and a clear anticipation of this study's contributions.

      For example:

      L72-77: The authors mention that "genetic diversity may functionally compensate for a species loss," but this point is not highly relevant to the main analyses of this study, which focus on comparing the relative effects of species diversity and genetic diversity.

      Yes true, we understand the point made by the reviewers. We deleted this part of the sentence.

      L87-95: As previously noted, "whether environmental variation decreases or enhances the relative influence of genetic and species diversity on ecosystem functions" was not addressed in this study. Additionally, the last sentence seems unnecessary here, as it does not relate to "environmental variation." The phrase "generate insightful knowledge for future mechanistic models" is vague. It would be helpful to specify what kind of knowledge and what types of future mechanistic models are being referred to.

      We modified these two sentences. We now posit the prediction that what has been observed under controlled conditions (that genetic and species have effects of similar magnitude) might not be the norm under fluctuating environments (because it has been shown that environmental variation modulates the strength of interspecific BEFS and create huge variance).

      L96-116: The use of "for instance" three times in this paragraph makes the structure seem scattered, as only examples are provided. Improving the transition words can help the text focus better on the main point.

      We have modified some parts of this section to better reflect predictions

      L115-116: Again, it would be beneficial to specify what kind of insightful information can be provided.

      We have modified this sentence by making more explicit some of the information that may be gained.

      L117-134: Stating clear expectations can help the introduction focus on the mechanisms and assist readers in following the results.

      We now provide some predictions. We were reluctant to make predictions in the first version of the MS as we have the feeling that predictions can go on very different direction depending on how we set the scene. We therefore stick to predictions that we think are the most logical (the simplest ones). This illustrates the lack of theoretical papers on these issues.

      Methods:

      L287-293: The method for estimating the standard effect size is unclear. I assume it was derived from the SEM models? This needs further clarification.

      Yes, it is derived from the standardized estimate from each pSEM. This is now explained in the MS.

      Results:

      As mentioned in the public review, it is very important to show the results of analyzing raw data.

      Done, see Figure 3 and Results section.

      Table 1: The font and format of the PCA table are different from other tables and appear vague, resembling a picture rather than a table.

      Changed.

      Table 2 (and supplementary table): "D.f." is not explained in the table legend. Is 1 the numerator df and 30 the denominator df? Is the denominator the residual? Additionally, the table legend mentions "magnitude and direction." ANOVA only tests if the biodiversity effects are significantly different between species or genetic diversity, but not the magnitude. For example, -0.5 and 0.5 are very different, but their effect magnitudes are the same.

      This is a mistake; sorry the format of the Table was from a previous version of the MS in which we used linear models rather that linear mixed models (both lead to the same results). The ANOVA used to test the significance of fixed terms in linear mixed model are based on Wald chi-sqare tests, and it should have been read “Chi-value” rather than “F-value” in both tables and the only degree of freedom in this test is the one at the numerator. This has been changed. We have changed the caption of the Table (“ANOVA table for the linear mixed model testing whether the relationships between biodiversity and ecosystem functions measured in a riverine trophic chain differ between the biodiversity facets (species or genetic diversity) and the types of BEF (within- or between-trophic levels)”)

      Minor:

      There should always be a space between a number and a unit. In the manuscript, spaces are inconsistently used between numbers and units.

      Corrected

      Reviewer #2 (Recommendations for the authors):

      (1) In the introduction, the authors could focus more and build out what they predicted/hypothesized as well as what has been found in the manipulated experiments that examined the role of species and genetic diversity. That would enhance the background information for a more general audience, and highlight expected results and why.

      We modified the Introduction according to comments made by reviewer 1 and clarified the predictions as best as we can.

      (2) Similarly, the discussion is fairly big picture, but this dataset focused exclusively on this 3-trophic interaction in a riverine system. It could be beneficial to dig into the ecology to find out why the opposite effects of species and genetic diversity are seen within trophic levels in this system.

      We have added some explanations based on the specific pSEM (see our responses to the public reviews for details). But as said in the responses to the public reviews, even with mode detailed models, it is hard to tease apart mechanisms. One important point is that genetic and species diversity do not correlate one to each other (they do not co-vary over space), which means the effect of one facet is independent from the other. However, apart from that, we can’t really tell more without more mechanistic approaches. We understand this is frustrating, but this is the nature of field-based data. This does not mean they are useless. On the contrary, they confirm and expand patterns found under controlled conditions (which for ecologists is quite important as nature is our playground), but they are limited in inferences of mechanisms.

      (3) It would also be informative if the authors specified what positive and negative Z scores mean. It seems counterintuitive in Figure 3. For example, in the upper left, it's denoted as a larger intraspecific effect - which I'd assume is higher genetic (within species) diversity - but is this not where species diversity effects are higher? In theory this figure could be similar to Figure 1 from Des Roches et al. 2018 - where showing the 1:1 line of where species and genetic diversity effects are similar and then how some are more impacted by SD or GD as that links to the overall question, right?

      For example: Figure 3 makes it seem that GD effects are stronger (more positive) for within trophic responses (which is reflected in the text), but in that quadrant, it states that the interspecific effect is larger?

      yes, you’re true Figure 3 (now Figure 4) is not ideal. We added an explicit explanation for interpreting Zr in the main text. In addition, we modified the text in the quadrat as this was not correct. Note that it cannot be directly be compared to that of DesRoches et al. In DesRoches et al., there is a single effect size (ES) per situation (which is roughly expressed as “ES = effect of species - effect of genotypes”). Here, there are two ES per situation, one for the species effect, the other for the genetic effect, which makes the biplot more complex (as species and genetic can be similar in magnitude, but opposite in direction, e.g., 0.5 and -0.5). We may have done as DesRoches et al. (“ES = effect of species - effect of genotypes”), but as we don’t have absolute ES (as in DesRoches) the resulting signs of the ES are non sensical…Not easy for us to find a clever solution (or said differently, we were not clever enough to find an easy solution).  Nonetheless, we tried another visualization by including “sub-quadrats” into the four main quadrats. We hope this will be clearer

      (4) It's unclear why authors included both a simplified linear mixed model with diversity type and biodiversity facet as fixed factors, and then a second linear model that included trophic level (with those other 2 factors and interactions), but only showed results of trophic level from that more complex model. It is unclear why they include two models when the more complex one would have evaluated all aspects of their research question and shown the same patterns.

      You’re true, the more complex model evaluates both aspects. Nonetheless, as the hypotheses were strictly separated, we thought it is simpler to associate one model to one hypothesis. We agree that this duplicates information, but we would like to keep the two models to make the text more gradual.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02545

      Corresponding author(s): Woo Jae, Kim

      1. General Statements

      We sincerely appreciate the positive and constructive feedback provided by all three reviewers. Their insightful comments have been invaluable in guiding our revisions. In response, we have made every effort to address their suggestions through additional experiments and by restructuring our manuscript to improve clarity and coherence.

      In this revision, we have streamlined the presentation of our data to enhance the narrative flow, ensuring that it is more accessible to a general readership. We believe that these changes not only strengthen our manuscript but also align with the reviewers' recommendations for improvement.

      We are hopeful that the revisions we have implemented meet the expectations of the reviewers and contribute to a clearer understanding of our findings. Thank you once again for your thoughtful critiques, which have greatly aided us in refining our work.


      2. Point-by-point description of the revisions

      Reviewer #1

      General comment: This manuscript by Song et al. investigates the molecular mechanisms underlying changes in mating duration in Drosophila induced by previous experience. As they have shown previously, they find that male flies reared in isolation have shorter mating duration than those reared in groups, and also that male flies with previous mating experience have shorter mating duration than sexually naïve males. They have conducted a myriad of experiments to demonstrate that the neuropeptide SIFa is required for these changes in mating duration. They have further provided evidence that SIFa-expressing neurons undergo changes in synaptic connectivity and neuronal firing as a result of previous mating experience. Finally, they argue that SIFa neurons form reciprocal connections with sNPF-expressing neurons, and that communication within the SIFa-sNPF circuit is required for experience-dependent changes in mating duration. These results are used to assert that SIFa neurons track the internal state of the flies to modulate behavioral choice.

         __Answer:__ We appreciate the reviewer's thoughtful comments and commendations regarding our manuscript. The recognition of our investigation into the molecular mechanisms influencing mating duration in *Drosophila* is greatly valued. In particular, we are grateful for the reviewer's positive remarks about our comprehensive experimental approach to demonstrate the role of the neuropeptide SIFa in these changes. The evidence we provided indicating that SIFa-expressing neurons undergo alterations in synaptic connectivity and neuronal firing due to previous social experiences is crucial for elucidating the underlying neural circuitry involved in experience-dependent behaviors. Finally, we are thankful for the recognition of our assertion that SIFa neurons form reciprocal connections with sNPF-expressing neurons, emphasizing the importance of this circuit in modulating behavioral choices based on internal states. To provide stronger evidence for the interactions between SIFa and sNPF, we conducted detailed GCaMP experiments, which revealed intriguing neural connections between these two neuropeptides. We have included this new data in our main figure. We believe these insights contribute significantly to the existing literature on neuropeptidergic signaling and its implications for understanding complex behaviors in *Drosophila*. We look forward to addressing any further comments and enhancing our manuscript based on your invaluable feedback. Thank you once again for your constructive critique and support.
      

      Major concerns:

      Comment 1. The authors are to be commended for the sheer quantity of data they have generated, but I was often overwhelmed by the figures, which try to pack too much into the space provided. As a result, it is often unclear what components belong to each panel. Providing more space between each panel would really help.

         __Answer:__ We sincerely appreciate the reviewer’s commendation regarding the extensive data we have generated in our study. It is gratifying to know that our efforts to provide a comprehensive analysis of the molecular mechanisms underlying changes in mating duration have been recognized. We understand the concern regarding the density of information presented in our figures. We aimed to convey a wealth of data to support our findings, but we acknowledge that this may have led to some confusion regarding the organization and clarity of the panels. We are grateful for your constructive feedback on this matter. In response, we have significantly reduced the density of the main figures and decreased the size of the graphs to improve clarity. We have also increased the spacing between panels to ensure that each component is more easily distinguishable. Further details will be provided in our responses to each comment below.
      
      • *

      Comment 2. This is a rare instance where I would recommend paring down the paper to focus on the more novel, clear and relevant results. For example, all of Figure 2 shows the projection pattern of SIFa+ neuron dendrites and axons, which have been reported by multiple previous papers. Figure 7G and J show trans-tango data and SIFaR-GAL4 expression patterns, which were previously reported by Dreyer et al., 2019. These parts could be removed to supplemental figures. Figure 5 details experiments that knock down expression of different neurotransmitter receptors within the SIFa-expressing cells. The results here are less definitive than the SIFa knockdown results, and the SCope data supporting the idea that these receptors are expressed in SIFa-expressing neurons is equivocal. I would recommend removing these data (perhaps they could serve as the basis for another manuscript) or focusing solely on the CCHa1R results, which is the only manipulation that affects both LMD and SMD.

         __Answer:__ We sincerely appreciate the reviewer’s positive feedback regarding the extensive data generated in our study. We also fully agree with the reviewer that the sheer volume of our data made it challenging to support our hypothesis that SIFa neurons serve as a hub for integrating multiple neuropeptide inputs and orchestrating various behaviors related to energy balance, as highlighted in our new Figure 5N.
      
         In response to the reviewer's suggestions, we have streamlined our manuscript by removing excessive and redundant data to enhance clarity and simplicity. First, we have moved Figure 2 to the supplementary materials as the reviewer noted that the branching patterns of SIFa neurons are well-documented in previous literature. Second, we relocated the trans-tango data from Figure 7G to Figure S7, since this information is also well-established. We retained this data in the supplementary section to illustrate the connection of SIFa to our recent findings regarding SIFaR24F06 neuron connections. Additionally, we have completely removed the neuropeptide receptor input screening data previously included in Figure 5, as well as Figure S8, which presented fly SCope tSNE data. As suggested by the reviewer, we plan to utilize these data for a future paper focused on investigating the underlying mechanisms of SIFa inputs that modulate SIFa activity. Thanks to the reviewer’s constructive suggestions, we believe our manuscript is now more convincing and clearer for readers.
      

      Comment 3. Finally, I would like the authors to spend more time explaining how they think the results tie together. For example, how do the authors think the changes in branching and activity in SIFa-expressing neurons tie to the change in mating duration provoked by previous experience? It would benefit the manuscript to simplify and clarify the message about what the authors think is happening at the mechanistic level. The various schematics (eg. Fig 7N) describe the results but the different parts feel like separate findings rather than a single narrative. (MECHANISMS diagram and explanation)

         __Answer:__ We appreciate the reviewer’s constructive comments, which have significantly improved our manuscript and conclusions for our readers. As the reviewer will see, we have made substantial revisions in line with the suggestions provided. We dedicated additional time to clarify the electrical activities and synaptic plasticity of SIFa neurons in relation to internal states that orchestrate various behaviors. We have summarized our hypothesis regarding the mechanistic role of SIFa neurons in Figure 5N. In brief, we propose that SIFa neurons function as a hub that receives diverse neuropeptidergic signals, which subsequently alters their electrical activity and synaptic branching. This, in turn, leads to different internal states. The internal states of SIFa neurons can then be interpreted by SIFaR-expressing cells, which help orchestrate various behaviors and physiological responses. We aim to address these aspects further in another manuscript that has been co-submitted alongside this one [1].
      

      Comment 4. Most of the experiments lack traditional controls. For example, in experiments in Fig 1C-K, one would typically include genetic controls that contain either the GAL4 or UAS elements alone. The authors should explain their decision to omit these control experiments and provide an argument for why they are not necessary to correctly interpret the data. In this vein, the authors have stated in the methods that stocks were outcrossed at least 3x to Canton-S background, but 3 outcrosses is insufficient to fully control for genetic background.

         __Answer:__ We sincerely thank the reviewer for insightful comments regarding the absence of traditional genetic controls in our study of LMD and SMD behaviors. We acknowledge the importance of such controls and wish to clarify our rationale for not including them in the current investigation. The primary reason for not incorporating all genetic control lines is that we have previously assessed the LMD and SMD behaviors of GAL4/+ and UAS/+ strains in our earlier studies. Our past experiences have consistently shown that 100% of the genetic control flies for both GAL4 and UAS exhibit normal LMD and SMD behaviors. Given these findings, we deemed the inclusion of additional genetic controls to be non-essential for the present study, particularly in the context of extensive screening efforts. We understand the value of providing a clear rationale for our methodology choices. To this end, we have added a detailed explanation in the "MATERIALS AND METHODS" section and the figure legends of Figure 1. This clarification aims to assist readers in understanding our decision to omit traditional controls, as outlined below.
      

      "Mating Duration Assays for Successful Copulation

      The mating duration assay in this study has been reported[33,73,93]. To enhance the efficiency of the mating duration assay, we utilized the Df (1)Exel6234 (DF here after) genetic modified fly line in this study, which harbors a deletion of a specific genomic region that includes the sex peptide receptor (SPR)[94,95]. Previous studies have demonstrated that virgin females of this line exhibit increased receptivity to males[95]. We conducted a comparative analysis between the virgin females of this line and the CS virgin females and found that both groups induced SMD. Consequently, we have elected to employ virgin females from this modified line in all subsequent studies. For naïve males, 40 males from the same strain were placed into a vial with food for 5 days. For single reared males, males of the same strain were collected individually and placed into vials with food for 5 days. For experienced males, 40 males from the same strain were placed into a vial with food for 4 days then 80 DF virgin females were introduced into vials for last 1 day before assay. 40 DF virgin females were collected from bottles and placed into a vial for 5 days. These females provide both sexually experienced partners and mating partners for mating duration assays. At the fifth day after eclosion, males of the appropriate strain and DF virgin females were mildly anaesthetized by CO2. After placing a single female in to the mating chamber, we inserted a transparent film then placed a single male to the other side of the film in each chamber. After allowing for 1 h of recovery in the mating chamber in 25℃ incubators, we removed the transparent film and recorded the mating activities. Only those males that succeeded to mate within 1 h were included for analyses. Initiation and completion of copulation were recorded with an accuracy of 10 sec, and total mating duration was calculated for each couple. All assays were performed from noon to 4pm. Genetic controls with GAL4/+ or UAS/+ lines were omitted from supplementary figures, as prior data confirm their consistent exhibition of normal LMD and SMD behaviors [33,73,93,96,97]. Hence, genetic controls for LMD and SMD behaviors were incorporated exclusively when assessing novel fly strains that had not previously been examined. In essence, internal controls were predominantly employed in the experiments, as LMD and SMD behaviors exhibit enhanced statistical significance when internally controlled. Within the LMD assay, both group and single conditions function reciprocally as internal controls. A significant distinction between the naïve and single conditions implies that the experimental manipulation does not affect LMD. Conversely, the lack of a significant discrepancy suggests that the manipulation does influence LMD. In the context of SMD experiments, the naïve condition (equivalent to the group condition in the LMD assay) and sexually experienced males act as mutual internal controls for one another. A statistically significant divergence between naïve and experienced males indicates that the experimental procedure does not alter SMD. Conversely, the absence of a statistically significant difference suggests that the manipulation does impact SMD. Hence, we incorporated supplementary genetic control experiments solely if they deemed indispensable for testing. All assays were performed from noon to 4 PM. We conducted blinded studies for every test[98,99] .

         While we have previously addressed this type of reviewer feedback in our published manuscript [2–7], we appreciate the reviewer’s suggestion to include traditional genetic control experiments. In response, we conducted all feasible combinations of genetic control experiments for LMD/SMD during the revision period. The results are presented in the supplementary figures and are described in the main text.
      
         We appreciate the reviewer's inquiry regarding the genetic background of our experimental lines. In response to the comments, we would like to clarify the following. All of our GAL4, UAS, or RNAi lines, which were utilized as the virgin female stock for outcrosses, have been backcrossed to the Canton-S (CS) genetic background for over ten generations. The majority of these lines, particularly those employed in LMD assays, have been maintained in a CS backcrossed status for several years, ensuring a consistent genetic background across multiple generations. Our experience has indicated that the genetic background, particularly that of the X chromosome inherited from the female parent, plays a pivotal role in the expression of certain behavioral traits. Therefore, we have consistently employed these fully outcrossed females as virgins for conducting experiments related to LMD and SMD behaviors. It is noteworthy that, in contrast to the significance of genetic background for LMD behaviors, we have previously established in our work [6] that the genetic background does not significantly influence SMD behaviors. This distinction is important for the interpretation of our findings. To provide a comprehensive understanding of our experimental design, we have detailed the genetic background considerations in the __"Materials and Methods"__ section, specifically in the subsection __"Fly Stocks and Husbandry"__ as outlined below.
      

      "To reduce the variation from genetic background, all flies were backcrossed for at least 3 generations to CS strain. For the generation of outcrosses, all GAL4, UAS, and RNAi lines employed as the virgin female stock were backcrossed to the CS genetic background for a minimum of ten generations. Notably, the majority of these lines, which were utilized for LMD assays, have been maintained in a CS backcrossed state for long-term generations subsequent to the initial outcrossing process, exceeding ten backcrosses. Based on our experimental observations, the genetic background of primary significance is that of the X chromosome inherited from the female parent. Consequently, we consistently utilized these fully outcrossed females as virgins for the execution of experiments pertaining to LMD and SMD behaviors. Contrary to the influence on LMD behaviors, we have previously demonstrated that the genetic background exerts negligible influence on SMD behaviors, as reported in our prior publication [6]. All mutants and transgenic lines used here have been described previously."

      Comment 5. Throughout the manuscript, the authors appear to use a single control condition (sexually naïve flies raised in groups) to compare to both males raised singly and males with previous sexual experience. These control conditions are duplicated in two separate graphs, one for long mating duration and one for short mating duration, but they are given different names (group vs naïve) depending on the graph. If these are actually the same flies, then this should be made clear, and they should be given a consistent name across the different "experiments".

         __Answer:__ We are grateful to the reviewer for highlighting the potential for confusion among readers regarding the visualization methods used in our figures. In response to this valuable feedback, we have now included a more detailed explanation of the graph visualization techniques in the legends of Figure 1, as detailed below. This additional information should enhance the clarity and understanding of the figure for all readers.
      

      In the mating duration (MD) assays, light grey data points denote males that were group-reared (or sexually naïve), whereas blue (or pink) data points signify males that were singly reared (or sexually experienced). The dot plots represent the MD of each male fly. The mean value and standard error are labeled within the dot plot (black lines). Asterisks represent significant differences, as revealed by the unpaired Student’s t test, and ns represents non-significant differences M.D represent mating duration. DBMs represent the 'difference between means' for the evaluation of estimation statistics (See MATERIALS AND METHODS). Asterisks represent significant differences, as revealed by the Student’s t test (* p

      Comment 6. The authors use SCope data to provide evidence for co-expression of SIFa and other neurotransmitters or neuropeptide receptors. The graphs they show are hard to read and it is not clear to what extent the gene expression is actually overlapping. It would be more definitive to show graphs that indicate which percentage of SIFa-expressing cells co-express other neurotransmitter components, and what the actual level of expression of the genes is. The authors should also provide more information on how they identified the SIFa+ cells in the fly atlas dataset. These are important pieces of information to be able to interpret the effects of manipulation of these other neurotransmitter systems within SIFa-expressing cells on mating duration.

      __ Answer: We appreciate the reviewer for pointing out the potential for confusion among readers regarding the visualization methods used in our figures, particularly concerning the tSNE plots of scRNA-seq data. As mentioned in our previous response, we have removed most of the tSNE plots related to co-expression data with SIFa and NPRs, which we believe will reduce any confusion for readers interpreting these plots. However, we have retained a few tSNE plots, specifically Figures 2N-O, to confirm the potential co-expression of the ple and Vglut genes in SIFa cells. We understand the reviewer’s concerns about the clarity of the presented data and the necessity for more detailed information regarding the extent of co-expression and the identification of SIFa-expressing cells. To address these concerns, we have included a comprehensive description of our methods in the __MATERIALS AND METHODS section below.

      "Single-nucleus RNA-sequencing analyses

      The snRNAseq dataset analyzed in this paper is published in [112] and available at the Nextflow pipelines (VSN, https://github.com/vib-singlecell-nf), the availability of raw and processed datasets for users to explore, and the development of a crowd-annotation platform with voting, comments, and references through SCope (https://flycellatlas.org/scope), linked to an online analysis platform in ASAP (https://asap.epfl.ch/fca). For the generation of the tSNE plots, we utilized the Fly SCope website (https://scope.aertslab.org/#/FlyCellAtlas/*/welcome). Within the session interface, we selected the appropriate tissues and configured the parameters as follows: 'Log transform' enabled, 'CPM normalize' enabled, 'Expression-based plotting' enabled, 'Show labels' enabled, 'Dissociate viewers' enabled, and both 'Point size' and 'Point alpha level' set to maximum. For all tissues, we referred to the individual tissue sessions within the '10X Cross-tissue' RNAseq dataset. Each tSNE visualization depicts the coexpression patterns of genes, with each color corresponding to the genes listed on the left, right, and bottom of the plot. The tissue name, as referenced on the Fly SCope website is indicated in the upper left corner of the tSNE plot. Dashed lines denote the significant overlap of cell populations annotated by the respective genes. Coexpression between genes or annotated tissues is visually represented by differentially colored cell populations. For instance, yellow cells indicate the coexpression of a gene (or annotated tissue) with red color and another gene (or annotated tissue) with green color. Cyan cells signify coexpression between green and blue, purple cells for red and blue, and white cells for the coexpression of all three colors (red, green, and blue). Consistency in the tSNE plot visualization is preserved across all figures.

      Single-cell RNA sequencing (scRNA-seq) data from the Drosophila melanogaster were obtained from the Fly Cell Atlas website (https://doi.org/10.1126/science.abk2432). Oenocytes gene expression analysis employed UMI (Unique Molecular Identifier) data extracted from the 10x VSN oenocyte (Stringent) loom and h5ad file, encompassing a total of 506,660 cells. The Seurat (v4.2.2) package (https://doi.org/10.1016/j.cell.2021.04.048) was utilized for data analysis. Violin plots were generated using the “Vlnplot” function, the cell types are split by FCA.

         We have also included detailed descriptions in the figure legends for the initial tSNE plot presented below to help readers clearly understand the significance of this visualization.
      

      "Each tSNE visualization depicts the coexpression patterns of genes, with each color corresponding to the genes listed on the left, right, and/or bottom of the plot. The tissue name, as referenced on the Fly SCope website is indicated in the upper left corner of the tSNE plot. Consistency in the tSNE plot visualization is preserved across all figures."

      Comment 7. I would like to see more information on how the thresholding and normalization was done for immunohistochemistry experiments. Was thresholding applied equally across all datasets? Furthermore, "overlap" of Denmark and Syt-eGFP is taken as evidence for synaptic connectivity, but the latter requires more than just overlap in the location of the axon terminal and dendrite regions of the neuron.

      __ Answer: Thank you for your continued engagement with our manuscript and for highlighting the need for further clarification on our methods. Your attention to the details of our immunohistochemistry experiments is commendable, and we agree that providing a clear explanation of our thresholding and normalization procedures is essential for the transparency and reproducibility of our results. We concur that the intensity of these signals is indeed correlated with the area measurements, which is a critical factor to consider. In response to the reviewer's valuable suggestion, we have revised our approach and now present our data based on intensity measurements. Additionally, we have updated the labeling of our Y-axis to "Norm. GFP Int.", which stands for "normalized GFP intensity". This change ensures clarity and consistency in the presentation of our data. We primarily adhered to the established methods outlined by Kayser et al. [8]. To address your first point, we have now included a more detailed description of our thresholding and normalization procedures in the __MATERIALS AND METHODS section as below.

      "Quantitative analysis of fluorescence intensity

      To ascertain calcium levels and synaptic intensity from microscopic images, we dissected and imaged five-day-old flies of various social conditions and genotypes under uniform conditions. The GFP signal in the brains and VNCs was amplified through immunostaining with chicken anti-GFP primary antibody. Image analysis was conducted using ImageJ software. For the quantification of fluorescence intensities, an investigator, blinded to the fly's genotype, thresholded the sum of all pixel intensities within a sub-stack to optimize the signal-to-noise ratio, following established methods [93]. The total fluorescent area or region of interest (ROI) was then quantified using ImageJ, as previously reported. For CaLexA or TRIC signal quantification, we adhered to protocols detailed by Kayser et al. [94], which involve measuring the ROI's GFP-labeled area by summing pixel values across the image stack. This method assumes that changes in the GFP-labeled area and intensity are indicative of alterations in the CaLexA and TRIC signal, reflecting synaptic activity. ROI intensities were background-corrected by measuring and subtracting the fluorescent intensity from a non-specific adjacent area, as per Kayser et al. [94]. For normalization, nc82 fluorescence is utilized for CaLexA, while RFP signal is employed for TRIC experiments, as the RFP signal from the TRIC reporter is independent of calcium signaling [76]. For the analysis of GRASP or tGRASP signals, a sub-stack encompassing all synaptic puncta was thresholded by a genotype-blinded investigator to achieve the optimal signal-to-noise ratio. The fluorescence area or ROI for each region was quantified using ImageJ, employing a similar approach to that used for CaLexA or TRIC quantification [93]. 'Norm. GFP Int.' refers to the normalized GFP intensity relative to the RFP signal."

      Comment 8. None of the RNAi experiments have been validated to demonstrate effective knockdown. In many cases, this would be difficult to do because of a lack of an antibody to quantify in a cell-specific manner; however, this fact should be acknowledged, especially in cases where there was found to be a lack of phenotype, which could result from lack of knockdown. The authors could also look for evidence in the literature of cases where RNAi lines they have used have been previously validated. For SIFa, knockdown can be easily confirmed with the SIFa antibody the authors have used elsewhere in the manuscript.

      __ Answer:__ We appreciate the reviewer’s constructive and critical comments regarding the validation of our RNAi experiments through effective knockdown. We understand the reviewer’s concerns about achieving effective knockdown with RNAi; however, we have demonstrated in our unpublished preprint that the neuronal knockdown using independent SIFa-RNAi lines aligns with the SIFa mutant phenotype, which is consistent with our current findings on SIFa knockdown (Wong 2019). In most cases involving RNAi experiments, we have utilized independent RNAi strains to confirm consistent phenotypes and have compared these results with those from mutant phenotypes [1,9]. Therefore, we are confident that our observed SIFa phenotype results from effective RNAi knockdown. Nevertheless, we respect the reviewer’s comments and have conducted additional SIFa knockdown experiments using various GAL4 drivers, followed by immunostaining with SIFa antibodies. As shown in Figure S1B, both neuronal GAL4 drivers and SIFa-GAL4 effectively reduced SIFa immunoreactivity. We believe this indicates that our SIFa knockdown efficiently phenocopies the SIFa mutant phenotype. We also described this result in manuscript as below:

      "Using the GAL4SIFa.PT driver and the elavc155 driver, we observed a significant decrease in SIFa immunoreactivity following SIFa-RNAi treatment, thereby confirming the effective knockdown of SIFa in these cells. In contrast, when SIFa-RNAi was expressed under the control of the repo-GAL4 driver, no significant change in SIFa immunoreactivity was detected (Fig. S1B). This control experiment highlights the specificity of the SIFa-RNAi effect and supports the conclusion that the behavioral changes observed in SMD and LMD are likely attributable to the targeted reduction of SIFa in the intended neuronal populations."

      Minor comments:

      Comment 1. There are quite a lot of citations to preprints, including preprints of the manuscripts under review. It seems inappropriate to cite a preprint of the manuscript you are submitting because it gives a false sense of strengthening the assertions being made in the manuscript.

         __Answer:__ We agree with the reviewer and have omitted all preprints that are currently under review, except for those that are deemed necessary, such as the Zhang et al. 2024 preprint, which is being submitted alongside this manuscript.
      

      Comment 2. It seems that labels are incorrect on a number of the immunohistochemistry figures. For example, in Fig 2N, it labels dendrites as green, but this is sytEGFP, which is the presynaptic terminal.

      __ Answer:__ We thoroughly reviewed and corrected the errors in the labels.

      Comment ____3. Fig 4N shows grasp between SIFa-LexA and sNPF-R-GAL4, but the authors have argued that these two components should both be expressed in SIFa-expressing cells. This would make grasp signal misleading, because it would appear in the SIFa-expressing cells even without synaptic contacts due to both split GFP molecules being expressed in these cells.

         __Answer:__ We appreciate the reviewer’s critical comments regarding the interpretation of our GRASP experiments. As the reviewer noted, we acknowledge that the GRASP results also indicate synaptic contacts between SIFa cells. We have elaborated on these results in the following sections.
      

      "This indicates that the synapses between SIFa cells expressing sNPF-R become stronger (S5K to S5M Fig)."

         However, we understand that readers may find the interpretation of this GRASP data confusing, so we have included additional explanations below to clarify.
      

      This indicates that the synapses between SIFa cells expressing sNPF-R become stronger (S5K to S5M Fig) since we have found that SIFa cells express sNPF-R (Fig 3M, S5E and S5G)

      Comment 4. For quantifying TRIC and CaLexA experiments (eg. Figure 6A-E), intensity of signal should be measured in addition to the area covered by the signal.

      __ Answer:__ We concur with the reviewer. Since all of our analyses indicated that the area covered by the signal correlates with the signal intensity, we opted to use normalized intensity rather than area coverage.

      Conclusive Comments: This study will be most relevant to researchers interested in understanding neuronal control of behavior. It has provided novel information about the mechanisms underlying mating duration in flies, which is used to delineate how internal state influences behavioral outcomes. This represents a conceptual advance, particularly in identifying a cell type and molecule that influences mating duration decisions. The strength of the manuscript is the number of different assays used to investigate the central question from a number of angles. The limitation is that there is a lack of a big picture tying the different components of the manuscript together. Too much data is presented without providing a framework to understand how the data points fit together.

      • Answer: We sincerely appreciate the reviewer’s positive feedback regarding our study and the recognition of its relevance to researchers interested in understanding the neuronal control of behavior. We are grateful for the acknowledgment of our novel insights into the mechanisms underlying mating duration in Drosophila*, particularly in how internal states influence behavioral outcomes. The identification of specific cell types and molecules that affect mating duration decisions indeed represents a significant conceptual advance. We also appreciate the reviewer’s commendation of the diverse array of assays employed in our investigation, which allowed us to approach our central question from multiple perspectives.

        In response to the reviewer’s constructive criticism regarding the lack of a cohesive framework tying the various components of our manuscript together, we have completely restructured our manuscript. We removed redundant data and incorporated additional convincing experiments, such as GCaMP analyses, to enhance clarity and coherence. Furthermore, we have provided a simplified yet comprehensive overview that describes the role of SIFa as a hub for neuropeptidergic signaling. This framework illustrates how SIFa orchestrates multiple behaviors related to energy balance through calcium signaling and synaptic plasticity via SIFaR-expressing cells.

        We believe these revisions address the reviewer’s concerns and provide a clearer understanding of how the different elements of our study fit together, ultimately strengthening the overall impact of our manuscript. Thank you for your valuable feedback, which has guided us in improving our work.

      Reviewer #2

      General Comments:* In the present study, the authors employ mating behavior in male fruit flies, Drosophila melanogaster, to investigate the behavioral roles of the neuropeptide SIFamide. The duration of mating behavior in these animals varies depending on context, previous experience, and internal metabolic state. The authors use this variability to explore the neuronal mechanisms that control these influences. In an abstraction step, they compare the different mating durations to concepts of neuronal interval timing.

      The behavioral functions of the neuropeptide SIFamide have been thoroughly characterized in several studies, particularly in the contexts of circadian rhythm and sleep, courtship behavior, and food uptake. This study adds new data, demonstrating that SIFamide is essential for the proper control of mating behavior, highlighting the interconnection of various state- and motivation-dependent behaviors at the neuronal level. However, the hypothesis that mating behavior is related to interval timing is not convincingly supported.

      Experimentally, the authors show that RNAi-mediated downregulation of SIFamide affects mating duration in male flies. They use combinations of RNAi lines under the control of various Gal4 lines to identify additional neurotransmitters, neuropeptides, and receptors involved in this process. This approach is complemented by neuroanatomical staining and single-cell RNA sequencing.*

      * Overall, the study advances our knowledge about the behavioral roles of SIFamide, which is certainly important, interesting, and worthy of being reported. However, the manuscript also raises several serious caveats and includes points that remain speculative, are less convincing, or are simply incorrect.*

      • Answer: We would like to thank the reviewer for their thoughtful and constructive comments regarding our study. We appreciate the recognition of our investigation into the behavioral roles of the neuropeptide SIFamide in male Drosophila melanogaster*, particularly how we explored the variability in mating duration influenced by context, previous experience, and internal metabolic state. We are grateful for the acknowledgment that our study adds valuable data demonstrating the essential role of SIFamide in regulating mating behavior, highlighting the interconnectedness of various state- and motivation-dependent behaviors at the neuronal level.

        We also appreciate the reviewer's recognition of our experimental approach, which includes RNAi-mediated downregulation of SIFamide, the use of various Gal4 lines to identify additional neurotransmitters, neuropeptides, and receptors involved in this process, as well as our incorporation of neuroanatomical staining and single-cell RNA sequencing.

        In response to the reviewer’s concerns regarding the hypothesis that mating behavior is related to interval timing, we acknowledge that this aspect requires further clarification and support. We have revisited this hypothesis in our manuscript to strengthen its foundation and address any speculative elements. We aim to provide more robust evidence and clearer connections between mating behavior and neuronal interval timing.

        Furthermore, we have taken care to address any points that may have been perceived as less convincing or incorrect. We are committed to refining our manuscript to ensure that all claims are well-supported by our data. Thank you once again for your valuable feedback. We believe that these revisions will enhance the clarity and impact of our study while addressing the concerns raised.

      Major concerns:

      Comment 1. The authors conclude from their mating experiments that SIFamide controls interval timing. This conclusion is not supported by the data, which only indicate that SIFamide is required for normal mating duration and modulates the motivation-dependent component of this behavior. There is no clear evidence linking this to interval timing.

      __ Answer: __We appreciate the reviewer’s insightful comments regarding our conclusion linking SIFamide to interval timing in mating behavior. We acknowledge that our data primarily demonstrate that SIFamide is required for normal mating duration and modulates the motivation-dependent aspects of this behavior, and we recognize the need for clearer evidence connecting these observations to interval timing. Current research by Crickmore et al. has shed light on how mating duration in Drosophila serves as a powerful model for exploring changes in motivation over time as behavioral goals are achieved. For instance, at approximately six minutes into mating, sperm transfer occurs, leading to a significant shift in the male's nervous system: he no longer prioritizes sustaining the mating at the expense of his own survival. This change is driven by the output of four male-specific neurons that produce the neuropeptide Corazonin (Crz). When these Crz neurons are inhibited, sperm transfer does not occur, and the male fails to downregulate his motivation, resulting in matings that can last for hours instead of the typical ~23 minutes [10].

         Recent research by Crickmore et al. has received NIH R01 funding (Mechanisms of Interval Timing, 1R01GM134222-01) to explore mating duration in *Drosophila* as a genetic model for interval timing. Their work highlights how changes in motivation over time can influence mating behavior, particularly noting that significant behavioral shifts occur during mating, such as the transfer of sperm at approximately six minutes, which correlates with a decrease in the male's motivation to continue mating [10]. These findings suggest that mating duration is not only a behavioral endpoint but may also reflect underlying mechanisms related to interval timing.
      
         We believe that by leveraging the robustness and experimental tractability of these findings, along with our own work on SIFamide's role in mating behavior, we can gain deeper insights into the molecular and circuit mechanisms underlying interval timing. We will revise our manuscript to clarify this relationship and emphasize how SIFamide may interact with other neuropeptides and neuronal circuits involved in motivation and timing.
      
         In addition to the efforts of Crickmore's group to connect mating duration with a straightforward genetic model for interval timing, we have previously published several papers demonstrating that LMD and SMD can serve as effective genetic models for interval timing within the fly research community. For instance, we have successfully connected SMD to an interval timing model in a recently published paper [6], as detailed below:
      

      "We hypothesize that SMD can serve as a straightforward genetic model system through which we can investigate "interval timing," the capacity of animals to distinguish between periods ranging from minutes to hours in duration.....

      In summary, we report a novel sensory pathway that controls mating investment related to sexual experiences in Drosophila. Since both LMD and SMD behaviors are involved in controlling male investment by varying the interval of mating, these two behavioral paradigms will provide a new avenue to study how the brain computes the ‘interval timing’ that allows an animal to subjectively experience the passage of physical time [11–16]."

         Lee, S. G., Sun, D., Miao, H., Wu, Z., Kang, C., Saad, B., ... & Kim, W. J. (2023). Taste and pheromonal inputs govern the regulation of time investment for mating by sexual experience in male Drosophila melanogaster. *PLoS Genetics*, *19*(5), e1010753.
      
         We have also successfully linked LMD behavior to an interval timing model and have published several papers on this topic recently [4,5,7].
      
         Sun, Y., Zhang, X., Wu, Z., Li, W., & Kim, W. J. (2024). Genetic Screening Reveals Cone Cell-Specific Factors as Common Genetic Targets Modulating Rival-Induced Prolonged Mating in male Drosophila melanogaster. *G3: Genes, Genomes, Genetics*, jkae255.
      
         Zhang, T., Zhang, X., Sun, D., & Kim, W. J. (2024). Exploring the Asymmetric Body’s Influence on Interval Timing Behaviors of Drosophila melanogaster. *Behavior Genetics*, *54*(5), 416-425.
      
         Huang, Y., Kwan, A., & Kim, W. J. (2024). Y chromosome genes interplay with interval timing in regulating mating duration of male Drosophila melanogaster. *Gene Reports*, *36*, 101999.
      
         Finally, in this context, we have outlined in our INTRODUCTION section below how our LMD and SMD models are related to interval timing, aiming to persuade readers of their relevance. We hope that the reviewer and readers are convinced that mating duration and its associated motivational changes such as LMD and SMD provide a compelling model for studying the genetic basis of interval timing in *Drosophila*.
      

      "The mating duration of male fruit flies is a suitable model for studying interval timing and it could change based on internal states and environmental context. Previous studies by our group[27–30] and others[31,32] have established several frameworks for investigating the mating duration using sophisticated genetic techniques that can analyze and uncover the neural circuits’ principles governing interval timing. In particular, males exhibit LMD behavior when they are exposed to an environment with rivals, which means they prolong their mating duration. Conversely, they display SMD behavior when they are in a sexually saturated condition, meaning they reduce their mating duration[33,34]."

      Comment 2. On line 160, the authors state, "The connection between the dendrites and axons of the SIFamide neuronal processes is unknown." This is not entirely correct. State-of-the-art connectome analyses can determine synaptic connectivities between SIFamidergic neurons and pre-/postsynaptic neurons. The authors also overlook the thorough connectivity analysis by Martelli et al. (2017), which includes functional analyses and detailed anatomical descriptions that the current study confirms.

      __ Answer:__ We appreciate the reviewer for acknowledging the efforts of Martelli et al. in elucidating the neuronal architecture of SIFa neurons. We recognize that it was an oversight on our part to state that "the connection between the dendrites and axons of SIFa neurons is unknown." This error arose because our manuscript has been in preparation for over ten years, predating the publication of Martelli et al.'s work. That statement likely reflects an outdated section of the manuscript.

      We fully acknowledge the findings from previous publications and have removed that sentence entirely from our manuscript. In its place, we have added the following statement:

      "The established connections and architecture of SIFa neurons has been described by Martelli et al., which enhances our understanding of their functional roles within the neuronal circuitry [51]. To identify the dendritic and axonal components of SIFa-neuronal processes, we employed a similar approach to that reported by Martelli [51]."

      Thank you for your valuable feedback, which has helped us improve the clarity and accuracy of our manuscript.

      Comment 3. The mating experiments are overall okay, with sufficiently high sample sizes and appropriate statistical tests. However, many experiments lack genetic controls for the heterozygous parental strains, such as Gal4-ines AND UAS-lines. This is of course of importance and common standard.

      __ Answer: __While we have previously addressed this type of reviewer feedback in our published manuscript [2–7] as well as this manuscript by Reviewer #1, we appreciate the reviewer’s suggestion to include traditional genetic control experiments. In response, we conducted all feasible combinations of genetic control experiments for LMD/SMD during the revision period. The results are presented in the supplementary figures and are described in the main text.

      Comment 4. *Using a battery of RNAi lines, the authors aim to uncover which neurotransmitters might be co-released from SIFamide neurons to influence mating behavior. However, a behavioral effect of an RNAi construct expressed in SIFamidergic neurons does not demonstrate that the respective transmitter is actually released from these neurons. Alternative methods are needed to show whether glutamate, dopamine, serotonin, octopamine, etc., are present and released from SIFamide neurons. It is particularly challenging to prove that a certain substance acts as a transmitter released by a specific neuron. For example, anti-Tdc2 staining does not actually cover SIFamide neurons, and dopamine has not been described as present in SIFamide neurons. *

      __ Answer:__ We appreciate the reviewer’s constructive comments regarding the need to demonstrate the presence of the responsible neurotransmitters in SIFa neurons. While many studies utilize neurotransmitter-synthesizing enzymes such as TH, VGlut, Gad1, and Trhn to assess neurotransmitter effects, we recognize the importance of conclusively establishing that glutamate and dopamine play significant roles in modulating energy balance within SIFa neurons.

         First, the enrichment of tyramine (TA), octopamine (OA), and dopamine (DA) in SIFa neurons was suggested in the study by Croset et al. (2018) [17]. Although we tested Tdc2-RNAi and observed interesting phenotypes, we chose not to publish these findings, as our data on glutamate and dopamine provide a more compelling explanation for how SIFa cotransmission with these neurotransmitters can independently influence various behaviors, including sleep and mating duration.
      
         To confirm the expression of DA in SIFa neurons, we employed a well-established genetic toolkit for dissecting dopamine circuit function in *Drosophila* [18]. Our findings indicate that TH-C-GAL4 specifically labels SIFa neurons, which have been confirmed as dopaminergic (S4M Fig). Our genetic intersection data, along with Xie et al.'s findings from 2018, confirm that a subset of SIFa neurons is indeed dopaminergic. We have described these new results in the main text as follows:
      

      To further verify the presence of DA neurons within the SIFa neuron population, we utilized a well-established genetic toolkit for dissecting DA circuits and confirmed part of SIFa neurons are dopaminergic (S4M Fig) [58].

          To confirm the glutamatergic characteristics of SIFa neurons, we conducted several experiments that established glutamate as the most critical neurotransmitter for generating interval timing in both SIFa and SIFaR neurons. First, to demonstrate the presence of glutamatergic synaptic vesicles in SIFa neurons, we utilized a conditional glutamatergic synaptic vesicle marker for *Drosophila*, developed by Certel et al. [19]. Our results confirmed that SIFa neurons exhibit strong expression of glutamatergic synaptic vesicles (Fig. 2P and Fig. S4N as a genetic control). We have described these new results in the main text as follows:
      

      “To further verify the presence of DA neurons within the SIFa neuron population, we utilized a well-established genetic toolkit for dissecting DA circuits and confirmed part of SIFa neurons are dopaminergic (S4M Fig) [58]. We also employed a conditional glutamatergic synaptic vesicle marker to confirm the presence of glutamatergic SIFa neurons (Fig 2P and Fig S4N) [59].”

         To further confirm that glutamate release from SIFa neurons influences the function of SIFaR neurons, we tested several RNAi strains targeting glutamate receptors. Our results showed that the knockdown of glutamate receptors in SIFaR-expressing neurons produced phenotypes similar to those observed with VGlut-RNAi knockdown in SIFa neurons (Fig. G-L). We believe that this series of experiments demonstrates that glutamate and dopamine work in conjunction with SIFa to modulate interval timing and other behaviors related to energy balance. We have described these new results in the main text as follows:
      

      "To further substantiate the role of glutamate in SIFa-mediated behaviors. we targeted knockdown of VGlut receptors in SIFaR-expressing neurons. Strikingly, the knockdown of VGlut receptors in these neurons also disrupted SMD behavior, mirroring the phenotype observed upon direct suppression of glutamatergic signaling in SIFa neurons (S4G to S4L Fig). This suggests that glutamate is an essential neurotransmitter for modulating interval timing in SIFa neurons.”

      Comment 5. Single-cell RNA sequencing data alone is insufficient to claim multiple transmitter co-release from SIFamide neurons. Figures illustrating single-cell RNA sequencing, such as Figure 3P-R, are not intuitively understandable, and the figure legends lack sufficient information to clarify these panels. As a side note, Tdc2 is not only present in octopaminergic neurons, but also in tyraminergic neurons.

      __ Answer:__ We agree with the reviewer that scRNA-seq data alone is insufficient to support claims of multiple transmitter co-release in SIFa neurons. We also appreciate the reviewer for highlighting the potential for confusion among readers regarding the visualization methods used in our figures, particularly the tSNE plots of the scRNA-seq data. As noted in our previous response to Reviewer #1, we have removed most of the tSNE plots related to co-expression data involving SIFa and NPRs, which we believe will help clarify the interpretation for readers. However, we have retained a few tSNE plots, specifically Figures 2N-O, to illustrate the potential co-expression of the ple and Vglut genes in SIFa cells.

         We understand the reviewer’s concerns regarding the clarity of the presented data and the need for more detailed information about the extent of co-expression and the identification of SIFa-expressing cells. To address these concerns, we have provided a comprehensive description of our methods in the __MATERIALS AND METHODS__ section below.
      

      "Single-nucleus RNA-sequencing analyses

      The snRNAseq dataset analyzed in this paper is published in [20]and available at the Nextflow pipelines (VSN, https://github.com/vib-singlecell-nf), the availability of raw and processed datasets for users to explore, and the development of a crowd-annotation platform with voting, comments, and references through SCope (https://flycellatlas.org/scope), linked to an online analysis platform in ASAP (https://asap.epfl.ch/fca). For the generation of the tSNE plots, we utilized the Fly SCope website (https://scope.aertslab.org/#/FlyCellAtlas/*/welcome). Within the session interface, we selected the appropriate tissues and configured the parameters as follows: 'Log transform' enabled, 'CPM normalize' enabled, 'Expression-based plotting' enabled, 'Show labels' enabled, 'Dissociate viewers' enabled, and both 'Point size' and 'Point alpha level' set to maximum. For all tissues, we referred to the individual tissue sessions within the '10X Cross-tissue' RNAseq dataset. Each tSNE visualization depicts the coexpression patterns of genes, with each color corresponding to the genes listed on the left, right, and bottom of the plot. The tissue name, as referenced on the Fly SCope website is indicated in the upper left corner of the tSNE plot. Dashed lines denote the significant overlap of cell populations annotated by the respective genes. Coexpression between genes or annotated tissues is visually represented by differentially colored cell populations. For instance, yellow cells indicate the coexpression of a gene (or annotated tissue) with red color and another gene (or annotated tissue) with green color. Cyan cells signify coexpression between green and blue, purple cells for red and blue, and white cells for the coexpression of all three colors (red, green, and blue). Consistency in the tSNE plot visualization is preserved across all figures.

      Single-cell RNA sequencing (scRNA-seq) data from the Drosophila melanogaster were obtained from the Fly Cell Atlas website (https://doi.org/10.1126/science.abk2432). Oenocytes gene expression analysis employed UMI (Unique Molecular Identifier) data extracted from the 10x VSN oenocyte (Stringent) loom and h5ad file, encompassing a total of 506,660 cells. The Seurat (v4.2.2) package (https://doi.org/10.1016/j.cell.2021.04.048) was utilized for data analysis. Violin plots were generated using the “Vlnplot” function, the cell types are split by FCA."

         We have also included detailed descriptions in the figure legends for the initial tSNE plot presented below to help readers clearly understand the significance of this visualization.
      

      "Each tSNE visualization depicts the coexpression patterns of genes, with each color corresponding to the genes listed on the left, right, and/or bottom of the plot. The tissue name, as referenced on the Fly SCope website is indicated in the upper left corner of the tSNE plot. Consistency in the tSNE plot visualization is preserved across all figures."

         We appreciate the reviewer for acknowledging that Tdc2 is present in both TA and OA neurons. As we mentioned earlier, we have completely removed the Tdc2-related results from this manuscript, as we believe that more detailed experiments are necessary to confirm the roles of TA and OA in SIFa neurons.
      

      Comment 6. The same argument applies to the expression of sNPF receptors in SIFamide neurons. The rather small anatomical stainings shown in figure 4M do not convincingly and unambiguously show that actually sNPF receptors are located on SIFamide neurons.

      __ Answer:__ We appreciate the reviewer for pointing out that the co-expression of sNPF-R and SIFa needs further verification, and we agree with this assessment. To confirm the co-expression of SIFa with sNPF-R, we conducted a mini-screen of various sNPF-R driver lines and found that the chemoconnectome (CCT) sNPF-R2A driver which represent the physiological expression patterns of sNPF-R, consistently labels SIFa neurons [21].

         To further establish the functional connection between the SIFa and sNPF systems, we performed GCaMP experiments using SIFa-driven GCaMP in conjunction with sNPF-R neurons expressing P2X2, which can be activated by ATP treatment. As shown in Figures 3N-P, we demonstrated that activation of sNPF-R neurons by ATP significantly increases calcium levels in SIFa neurons. Our results strongly suggest that the sNPF-sNPF-R/SIFa system is functionally present and plays a role in modulating interval timing behaviors.
      

      Comment 7. The authors use the GRASP technique (figure 4N) to determine whether synaptic connections are subject to modulation as a result from the animals' individual experience. The overall extremely bright fluorescence at the dorsal areas of both brain hemispheres (figure 4 N, middle panel) raises doubts whether this signal is actually a specific GRASP fluorescence between two small populations of neurons.

      Answer: We appreciate the reviewer for critically highlighting the inadequacies in our presentation of the GRASP data. We agree that one of our previous panels contained excessive background noise, making it difficult for reviewers and readers to discern the different neuronal connections. To address this issue, we have replaced it with a more representative image that clearly illustrates the strengthening of synaptic connections from SIF to sNPF-R in several neurons, including SIFa cells (Fig. S5J). We hope that this updated image will help convince both the reviewer and readers of the validity of our GRASP data.

      Comment 8. The authors cite Martelli et al. (2017) with the hypothesis that sNPF-releasing neurons provide input signals to SIFamide neurons to modulate feeding behavior. However, the cited manuscript does not contain such a hypothesis. The authors should review the reference in more detail.

      __ Answer:__ We appreciate reviewer to correctly point our misunderstanding of references. We agree with reviewer that Martelli et al.'s paper didn't mention about sNPF signaling transmits hunger and satiety information to SIFa neurons. We removed this sentence and replaced it as below correctly mentioning that sNPF signaling is related to feeding behavior however it's connection to SIFa neurons are not known. We greatly appreciate the reviewer for acknowledging our efforts to accurately cite previous articles that support our rationale and ideas.

      " Short neuropeptide F (sNPF) signaling plays a crucial role in regulating feeding behavior in Drosophila melanogaster, influencing food intake and body size [60,66,67]. However, there is currently no direct evidence reported linking sNPF signaling to SIFa neurons."

      Comment ____9. In lines 281 ff., the authors state that SIFamide neurons receive inputs from peptidergic neurons but simultaneously claim that "this speculation is based on morphological observations." This is incorrect. The functional co-activation/imaging analyses provided in Martelli et al. (2017) should not be ignored.

      * Answer: We fully agree with the reviewer that we misinterpreted Martelli et al.'s analysis. We have removed "this speculation is based on morphological observations." from* the following sentence and finalize as below:

      "The SIFa neurons receive inputs from many peptidergic pathways including Crz, dilp2, Dsk, sNPF, MIP, and hugin"

      Comment 10. Figure 6: A transcriptional calcium sensor (TRIC) was used to quantify the accumulation GFP induced by calcium influx in SIFamide neurons. However, I could not find any description of the method in the materials and methods section, nor any explanation how the data were acquired or analyzed. What is the RFP expression good for? How exactly are thresholds determined, and why are areas rather than fluorescence intensities quantified? Overall, this part of the manuscript is rather confusing and needs more explanation.

      __ Answer: Thank you for your continued engagement with our manuscript and for highlighting the need for further clarification on our methods. Your attention to the details of our immunohistochemistry experiments is commendable, and we agree that providing a clear explanation of our thresholding and normalization procedures is essential for the transparency and reproducibility of our results. We primarily adhered to the established methods outlined by Kayser et al. [8]. To address your first point, we have now included a more detailed description of our thresholding and normalization procedures in the __MATERIALS AND METHODS section as below.

      "Quantitative analysis of fluorescence intensity

      To ascertain calcium levels and synaptic intensity from microscopic images, we dissected and imaged five-day-old flies of various social conditions and genotypes under uniform conditions. The GFP signal in the brains and VNCs was amplified through immunostaining with chicken anti-GFP, rabbit anti-DsRed, and mouse anti-nc82 primary antibodies. Image analysis was conducted using ImageJ software. For the quantification of fluorescence intensities, an investigator, blinded to the fly's genotype, thresholded the sum of all pixel intensities within a sub-stack to optimize the signal-to-noise ratio, following established methods [100]. The total fluorescent area or region of interest (ROI) was then quantified using ImageJ, as previously reported. For CaLexA or TRIC signal quantification, we adhered to protocols detailed by Kayser et al. [101], which involve measuring the ROI's GFP-labeled area by summing pixel values across the image stack. This method assumes that changes in the GFP-labeled area and intensity are indicative of alterations in the CaLexA and TRIC signal, reflecting synaptic activity. ROI intensities were background-corrected by measuring and subtracting the fluorescent intensity from a non-specific adjacent area, as per Kayser et al. [101]. For normalization, nc82 fluorescence is utilized for CaLexA, while RFP signal is employed for TRIC experiments, as the RFP signal from the TRIC reporter is independent of calcium signaling [72] . For the analysis of GRASP or tGRASP signals, a sub-stack encompassing all synaptic puncta was thresholded by a genotype-blinded investigator to achieve the optimal signal-to-noise ratio. The fluorescence area or ROI for each region was quantified using ImageJ, employing a similar approach to that used for CaLexA or TRIC quantification [100]. 'Norm. GFP Int.' refers to the normalized GFP intensity relative to the RFP signal.

      • *

      __Comment 11. __Similarly, it remains unclear how exactly syteGFP fluorescence and DenMark fluorescence were quantified. Why are areas indicated and not fluorescence intensity values? In fact, it appears worrisome that isolation of males should lead to a drastic decline in synaptic terminals (as measure through a vesicle-associated protein) by ~ 30%, or, conversely, keeping animals in groups lead to an respective increase (figure 7D). The technical information how exactly this was quantified is not sufficient.

      __ Answer: __Thank you for your ongoing engagement with our manuscript and for emphasizing the need for clarification on our methods. We appreciate your attention to the details of our immunohistochemistry experiments and agree that a clear explanation of our thresholding and normalization procedures is vital for transparency and reproducibility. We acknowledge that signal intensity correlates with area measurements, which is an important consideration. In response to your valuable suggestion, we have revised our approach to present data based on intensity measurements and updated the Y-axis labeling to "Norm. GFP Int." (normalized GFP intensity) for clarity. We primarily followed the established methods from Kayser et al. (2014) [8]. Additionally, we have included a more detailed description of our thresholding and normalization procedures in the "Quantitative analysis of fluorescence intensity" in __MATERIALS AND METHODS __section as we quoted above.

      • *

      Minor concerns:

      Comment 1. Reference 29 and reference 33 are the same.

         __Answer:__ We removed reference 29.
      

      Comment 2. In figure legends, abbreviations should be explained when used first (e.g., figure 1 A "MD", is explained below for panel C-F), or "CS males". __ __

      __Answer: __We have ensured that abbreviations are explained only when they are first used in the figure legends.

      Comment 3. Indications for statistical significance must be shown in all figure legends at the end of each figure legend, not only in figure 1. __ __

      __ Answer:__ We appreciate the reviewer’s advice. However, we have published all our other manuscripts using the same format for mating duration, stating, "The same notations for statistical significance are used in other figures," in the first figure where we describe our statistical significances. We intend to continue with this approach initially and will then adhere to the journal's policy.

      Comment 4. The figures appear overloaded. For example why do you need two different axis designations (mating duration and differences between means)? __ __

      __ Answer: __We appreciate the reviewer's suggestion to refine our figures, and we have indeed reformatted them to provide clearer presentation and improved readability. Our decision is based on the fact that our analysis encompasses not only traditional t-tests but also incorporates estimation statistics, which have been demonstrated to be effective for biological data analysis [22]. The inclusion of DBMs is essential for the accurate interpretation of these estimation statistics, ensuring a comprehensive representation of our findings. This is the primary area where we present two different axis designations.

      Comment 5. Line: 1154: Typo: gluttaminergic should be glutamatergic.

         __Answer:__ We fixed all.
      

      Comment 6. The authors frequently write "system" when referring to transmitter types, e.g., "glutaminergic system", "octopaminergic system", etc. It I not clear what the term "system" actually refers to. If the authors claim that SIFamide neurons release these transmitters in addition to SIFamide, they should state that precisely and then add experiments to show that this is the case.

         __Answer:__ We agree with reviewer and removed the word 'system' after the name of neurotransmitter's name.
      

      Comment 7. Figure S6: It is not explained in the figure legend what fly strain "UAS-ctrl" actually is. Does "ctrl" mean control? And what genotype is hat control? __ __

      __Answer: __It was wild-type strain. We fixed it as "+".

      Comment 8. Figure legend S6, line 1371: The authors indicate experiments using UAS-OrkDeltaC. I could not find these data in the figure. __ __

      __Answer: __It's now in Fig.S6U-W.

      Comment 9. Line 470: "...reduced branching of SIFa axons at the postsynaptic level" should perhaps be "presynaptic level"?

      Answer: Reviewer is correct. We fixed it.

      Conclusive Comments:* Overall, the study advances our knowledge about the behavioral roles of SIFamide, which is certainly important, interesting, and worthy of being reported. However, the manuscript also raises several serious caveats and includes points that remain speculative and are less convincing.

      Overall, the neuronal basis of action selection based on motivational factors (metabolic state, mating experience, sleep/wake status, etc.) is not well understood. The analysis of SIFamide function in insects might provide a way to address the question how different motivational signals are integrated to orchestrate behavior.*

      • *Answer: Thank you for your thoughtful review and for recognizing the significance of our study in advancing knowledge about the behavioral roles of SIFamide. We appreciate your acknowledgment that our work is important, interesting, and worthy of publication.

      We understand your concerns regarding the caveats and speculative points raised in the manuscript. We agree that the neuronal basis of action selection influenced by motivational factors—such as metabolic state, mating experience, and sleep/wake status—remains poorly understood. We believe that our analysis of SIFamide function in insects offers valuable insights into how various motivational signals are integrated to orchestrate behavior.

      In response to your comments, we have made revisions to clarify our findings and address the concerns raised. We aim to strengthen the arguments presented in the manuscript and provide a more robust discussion of the implications of our results. Thank you once again for your constructive feedback, which has been instrumental in improving the clarity and impact of our work.

      • *

      * *

      Reviewer #3

      General Comments:* The Manuscript Peptidergic neurons with extensive branching orchestrate the internal states and energy balance of male Drosophila melanogaster by Yuton Song and colleagues addresses the question how SIFamidergic neurons coordinate behavioral responses in a context-dependent manner. In this context the authors investigate how SIFa neurons receive information about the physiological state of the animal and integrate this information into the processing of external stimuli. The authors show that SIFamidergic neurons and sNPPF expressing neurons form a feedback loop in the ventral nerve cord that modulate long mating (LMD) and shorter mating duration (SMD).

      The manuscript is well written and very detailed and provides an enormous amount of data corroborating the claims of the authors. However, before publication the authors may want to address some points of concern that warrant some deeper explanation.*

      • *__Answer: __Thank you for your positive feedback on our manuscript. We appreciate your recognition of the importance of our study in investigating how SIFa neurons integrate information about the physiological state of the animal with external stimuli, as well as your acknowledgment of the substantial data we provide to support our claims. We understand your concerns regarding certain points that require deeper explanation, and we are committed to addressing these issues to enhance the clarity and robustness of our findings. Your insights into the neuronal basis of action selection influenced by motivational factors are invaluable, and we believe that our exploration of SIFamide function in insects contributes significantly to understanding how various motivational signals orchestrate behavior. Thank you once again for your constructive comments, which will help us improve our manuscript before publication.

      Major concerns:

      Comment 1. On page 6 line 110 the authors describe that knocking-down SIFamide in glia cell does not change LMD or SMD and say that SIFa expression in glia does not contribute to interval timing behavior. However, the authors do not provide any information why they investigate the role of SIFa expression in glia. Is there any SIFa-expression in glia? The authors should somehow demonstrate using antibody labelling against SIFamide whether any glia specific expression of this peptide is to be expected. If they cannot provide this data - the take home message of the experiment cannot be that glia knockdown of SIFamide does not affect the behavior because you cannot knockdown anything that is not there.

      • *

      • In the latter case the experiment could be considered as a nice negative control for the elav-Gal4 pan-neuronal knockdown of SIFamide. The authors provide some Figure supplement where they use repo-Gal80 to partially answer this question. However, the authors should keep in mind that Gal4-drivers are not always complete in the expression pattern. Accordingly, the result should be corroborated with immune-labelling against SIFamide directly.*

      __ Answer: __We appreciate the reviewer's constructive and critical comments regarding the use of our glial cell drivers. As the reviewer rightly pointed out, we believe that glial control is not essential for our manuscript, given that the expression of SIFa is well established in only four neurons. Therefore, we have removed the data related to glial drivers from this manuscript.

      Comment 2. At this point I would like to directly comment on the figure quality. The figures are so crowded that the described anatomical details are hardly visible. In my opinion the manuscript would profit from less data in the main part and more stringent description of the core of the biological problem the authors want to address. The authors may want to reduce data from the main text and provide additional data that are not directly related to the main story as supplementary information.

      __ Answer: __We agree with the reviewer. As another reviewer also suggested that we streamline our figures and data, we have completely restructured our figures and their presentation. In response, we have significantly reduced the density of the main figures and decreased the size of the graphs to enhance clarity. Additionally, we have increased the spacing between panels to ensure that each component is more easily distinguishable. Further details will be provided in our responses to each comment below.

      • *

      Comment 3. On page 8 starting with line 140 the authors describe the architecture of SIFamidergic neurons using several anatomical markers e.g., Denmark and further state that they have discovered that the dendrites of SIFa neurons span just the central brain area. Seeing that these data have been published in Martelli et al., 2017 the authors should tune down the claim that this was discovered in their work but rather corroborated earlier results.

      __ Answer: __We acknowledge this error, as another reviewer also raised this issue. We have corrected our manuscript as follows:

      "The established connections and architecture of SIFa neurons has been described by Martelli et al., which enhances our understanding of their functional roles within the neuronal circuitry [51]. To identify the dendritic and axonal components of SIFa-neuronal processes, we employed a similar approach to that reported by Martelli [51]."

      Comment 4. In the next chapter, the authors aim at identifying the presynaptic inputs from SIFa positive neurons that may influence interval timing behavior and make a broad RNAi knock-down screen targeting a majority of neuromodulators. The authors claim that glutaminergic and dopaminergic signaling is necessary for interval timing behavior. I guess the authors mean "glutamatergic" instead of "glutaminergic" as glutamine is the precursor but not the neurotransmitter.

      __ Answer: __The reviewer is correct. We have corrected this error and changed all instances to "glutamatergic."

      Comment 5____. Furthermore, the authors show that the knock down of Tdc2 with RNAi has comparable effects on SMD than Glutamate and dopamine but appear to not further discuss this in the main text. To me it is not clear why the authors exclude Tdc2 from their resume. The authors should explain this in detail.

         __Answer:__ We appreciate the reviewer’s constructive comments regarding the need for a more detailed demonstration of the role of Tdc2 data. While we did test Tdc2-RNAi and observed interesting phenotypes, we decided not to include these findings in our publication, as our data on glutamate and dopamine offer a more compelling explanation for how SIFa cotransmission with these neurotransmitters can independently influence various behaviors, such as sleep and mating duration. Consequently, we have removed all data related to Tdc2. We believe that further evaluation is necessary to better understand the roles of the tyramine and octopamine systems in SIFa neurons.
      

      Comment 6. The authors base their assumptions that the tested neurotransmitters are expressed in SIFamidergic neurons on Scope database analysis. But a transcript does not necessarily mean that it will be translated too. To my knowledge there is no available data in the literature showing that tyrosine hydroxylase is expressed in SIFamidergic neurons (see e.g., Mao and Davis, 2010). To show that ple or Tdc2 are indeed expressed and translated into functional enzymes in SIFamidergic neurons the authors should provide the according antibody labelling corroborating the result from the transcriptome analysis.

      __ Answer:__ We appreciate the reviewer’s constructive comments regarding the role of neurotransmitters in conjunction with SIFa in modulating interval timing behaviors. To confirm the expression of dopamine (DA) in SIFa neurons, we utilized a well-established genetic toolkit for dissecting dopamine circuit function in Drosophila [18]. Our findings demonstrate that TH-C-GAL4 specifically labels SIFa neurons, which have been confirmed to be dopaminergic (Fig. S4M). This aligns with the genetic intersection data and the findings from Xie et al. (2018), confirming that a subset of SIFa neurons is indeed dopaminergic. We have included these new results in the main text as follows:

      " To further verify the presence of DA neurons within the SIFa neuron population, we utilized a well-established genetic toolkit for dissecting DA circuits and confirmed part of SIFa neurons are dopaminergic (S4M Fig) [58]."

         To confirm the glutamatergic characteristics of SIFa neurons, we conducted several experiments that established glutamate as the most critical neurotransmitter for generating interval timing in both SIFa and SIFaR neurons. First, to demonstrate the presence of glutamatergic synaptic vesicles in SIFa neurons, we utilized a conditional glutamatergic synaptic vesicle marker for *Drosophila*, developed by Certel et al. [19]. Our results confirmed that SIFa neurons exhibit strong expression of glutamatergic synaptic vesicles (Fig. 2P and Fig. S4N as a genetic control). We have described these new results in the main text as follows:
      

      "To further substantiate the role of glutamate in SIFa-mediated behaviors. we targeted the expression of VGlut receptor in neurons that carry the SIFaR. Strikingly, the knockdown of VGlut receptor in these neurons also disrupted SMD behavior, mirroring the phenotype observed upon direct suppression of glutamatergic signaling in SIFa neurons (S4O-L Fig)."

         To further confirm that glutamate release from SIFa neurons influences the function of SIFaR neurons, we tested several RNAi strains targeting glutamate receptors. Our results showed that the knockdown of glutamate receptors in SIFaR-expressing neurons produced phenotypes similar to those observed with VGlut-RNAi knockdown in SIFa neurons (Fig. S4I-N). We believe that this series of experiments demonstrates that glutamate and dopamine work in conjunction with SIFa to modulate interval timing and other behaviors related to energy balance. We have described these new results in the main text as follows:
      

      "We also further verified that the knockdown of glutamate receptors in SIFaR-expressing neurons produces phenotypes similar to those resulting from VGlut knockdown in SIFa neurons (S4G to S4L Fig). This suggests that glutamate is an essential neurotransmitter for modulating interval timing in SIFa neurons."

      Comment 7. The authors compare the LMD and SMD behavior of the animals with reduced expression with "heterozygous control animals" the authors should describe in detail what these are - are these controls the driver lines or the effector lines or a mix of both? The authors should provide the data for heterozygous driver line controls as well as heterozygous effector line controls to exclude any genetic background influence on the measured behavior. Accordingly, the authors should provide the data for the same controls for the sleep experiment in figure 3O and all the other behavioral experiments in the following parts of the manuscript.

      __ Answer: __We sincerely thank the reviewer for insightful comments regarding the absence of traditional genetic controls in our study of LMD and SMD behaviors. We acknowledge the importance of such controls and wish to clarify our rationale for not including them in the current investigation. The primary reason for not incorporating all genetic control lines is that we have previously assessed the LMD and SMD behaviors of GAL4/+ and UAS/+ strains in our earlier studies. Our past experiences have consistently shown that 100% of the genetic control flies for both GAL4 and UAS exhibit normal LMD and SMD behaviors. Given these findings, we deemed the inclusion of additional genetic controls to be non-essential for the present study, particularly in the context of extensive screening efforts. We understand the value of providing a clear rationale for our methodology choices. To this end, we have added a detailed explanation in the "MATERIALS AND METHODS" section and the figure legends of Figure 1. This clarification aims to assist readers in understanding our decision to omit traditional controls, as outlined below.

      "Mating Duration Assays for Successful Copulation

      The mating duration assay in this study has been reported [33,73,93]. To enhance the efficiency of the mating duration assay, we utilized the Df (1) Exel6234 (DF here after) genetic modified fly line in this study, which harbors a deletion of a specific genomic region that includes the sex peptide receptor (SPR)[94,95]. Previous studies have demonstrated that virgin females of this line exhibit increased receptivity to males [95]. We conducted a comparative analysis between the virgin females of this line and the CS virgin females and found that both groups induced SMD. Consequently, we have elected to employ virgin females from this modified line in all subsequent studies. For naïve males, 40 males from the same strain were placed into a vial with food for 5 days. For single reared males, males of the same strain were collected individually and placed into vials with food for 5 days. For experienced males, 40 males from the same strain were placed into a vial with food for 4 days then 80 DF virgin females were introduced into vials for last 1 day before assay. 40 DF virgin females were collected from bottles and placed into a vial for 5 days. These females provide both sexually experienced partners and mating partners for mating duration assays. At the fifth day after eclosion, males of the appropriate strain and DF virgin females were mildly anaesthetized by CO2. After placing a single female in to the mating chamber, we inserted a transparent film then placed a single male to the other side of the film in each chamber. After allowing for 1 h of recovery in the mating chamber in 25℃ incubators, we removed the transparent film and recorded the mating activities. Only those males that succeeded to mate within 1 h were included for analyses. Initiation and completion of copulation were recorded with an accuracy of 10 sec, and total mating duration was calculated for each couple. All assays were performed from noon to 4pm. Genetic controls with GAL4/+ or UAS/+ lines were omitted from supplementary figures, as prior data confirm their consistent exhibition of normal LMD and SMD behaviors [33,73,93,96,97]. Hence, genetic controls for LMD and SMD behaviors were incorporated exclusively when assessing novel fly strains that had not previously been examined. In essence, internal controls were predominantly employed in the experiments, as LMD and SMD behaviors exhibit enhanced statistical significance when internally controlled. Within the LMD assay, both group and single conditions function reciprocally as internal controls. A significant distinction between the naïve and single conditions implies that the experimental manipulation does not affect LMD. Conversely, the lack of a significant discrepancy suggests that the manipulation does influence LMD. In the context of SMD experiments, the naïve condition (equivalent to the group condition in the LMD assay) and sexually experienced males act as mutual internal controls for one another. A statistically significant divergence between naïve and experienced males indicates that the experimental procedure does not alter SMD. Conversely, the absence of a statistically significant difference suggests that the manipulation does impact SMD. Hence, we incorporated supplementary genetic control experiments solely if they deemed indispensable for testing. All assays were performed from noon to 4 PM. We conducted blinded studies for every test[98,99] .

         While we have previously addressed this type of reviewer feedback in our published manuscript [2–7], we appreciate the reviewer’s suggestion to include traditional genetic control experiments. In response, we conducted all feasible combinations of genetic control experiments for LMD/SMD during the revision period. The results are presented in the supplementary figures and are described in the main text.
      

      __Comment 8. __On page 11 line 231 to page 12 line 233 the authors claim that "sNPF signaling transmits hunger and satiety information to SIFa neurons in order to control food search and feeding" and cite Martelli et al., 2017. Could the authors explain more in detail how the Martelli paper somehow proposes this idea? I do not find the link between sNPF signaling hunger and SIFamide in this precise paper.

      __ Answer:__ We appreciate the reviewer for accurately pointing out our misunderstanding of the references. We agree that Martelli et al.'s paper does not mention that sNPF signaling transmits hunger and satiety information to SIFa neurons. Consequently, we have removed the relevant sentence and replaced it with a statement correctly indicating that while sNPF signaling is related to feeding behavior, its connection to SIFa neurons remains unknown. We are grateful to the reviewer for acknowledging our efforts to accurately cite previous articles that support our rationale and ideas.

      " Short neuropeptide F (sNPF) signaling plays a crucial role in regulating feeding behavior in Drosophila melanogaster, influencing food intake and body size [60,66,67] . However, there is currently no direct evidence reported linking sNPF signaling to SIFa neurons."


      Comment 9. On page 15 line 302 - 303 the authors write that "except for PK2-R2, all other genes coexpress with SIFa in SCope data, indicating that hugin inputs to SIFa may not be transmitted through peptidergic signaling" - if SIFamidergic neurons do not express hugin-receptors how do the authors explain the inverted effect of PK2-R2-RNAi on single housed male courtship index when compared to heterozygous SIFaPT Gal4 control that show a reduction under comparable conditions.

      __ Answer:__ We appreciate the reviewer’s constructive comments. In line with another reviewer’s suggestion, we have completely removed results of other neuropeptidergic inputs, focusing instead on how sNPF inputs modulate SIFa-mediated behavioral modulation using more advanced techniques such as GCaMP (Fig 3N). Consequently, the phenotypes resulting from various knockdowns of neuropeptide receptors are currently under investigation for a separate manuscript that we are preparing. We hope to successfully address how different neuropeptidergic inputs regulate SIFa neuron activity through various strategies.

      Comment 10. On page 17 line 350 - 351 the authors write that "Stimulation of SIFa neurons resulted in an elevation in food consumption. Further, the authors write that "deactivation of SIFa neurons leads to a decrease in food consumption in male flies". From the way this is formulated it is not visible that the role of SIFamide in feeding control was published by Martelli and colleagues before. As the authors do not discuss the finding further in their discussion but cite the concerned paper in other aspects it appears as the authors intentionally want to omit this information to the reader. The authors may add a note that this has been shown before for female flies by Martelli and colleagues.

      __ Answer:__ We appreciate reviewer's concern for properly mention previous Martelli et al.'s results about female feeding behavior modulated by SIFa neurons' activity. We agree with reviewer and added sentence as below in main text.

      "Nevertheless, the temporary deactivation of SIFa neurons leads to a decrease in food consumption in male flies (Fig 4N and S6F to S6H) as previously described by Martelli et al.'s report in female flies [43]."

      Comment 11. SIFamide receptor and GnIHR are discussed as descendants from a common ancestor and the authors nicely demonstrate that SIFamide does not only control homeostatic behavior as shown by Martelli and colleagues but also controls reproductive behavior. The evolution of such behavior control mechanisms may be integrated in the discussion too.

      Answer: We appreciate the reviewer’s constructive comments, which enhance the evolutionary significance of our study. We agree with the reviewer and have added the following paragraph to the DISCUSSION section:

      "The relationship between SIFamide receptors (SIFaR) and gonadotropin inhibitory hormone receptors (GnIHR) [89] highlights an intriguing evolutionary connection, as both are believed to have descended from a common ancestor [90,91]. This study expands on previous findings by Martelli et al., demonstrating that SIFamide not only regulates homeostatic behaviors but also plays a significant role in reproductive behavior [43]. GnIHR regulates food intake and reproductive behavior in opposing directions, thereby prioritizing feeding behavior over other behavioral tasks during times of metabolic need [92]. The evolution of these behavioral control mechanisms suggests a complex interplay between neuropeptides that modulate both physiological states and reproductive strategies. As SIFamide influences various behaviors, including feeding and sexual activity, it may be integral to understanding how organisms adapt their reproductive strategies in response to environmental and internal cues. This integration of behavioral modulation underscores the evolutionary significance of SIFamide signaling in coordinating essential life functions in Drosophila melanogaster and potentially other species, revealing pathways through which neuropeptides can shape behavior across different contexts."

      Conclusive Comments: The manuscript by Song and colleagues is very interesting and may attract a broad readership. However, the authors miss to make clear what was already known and published on the role of SIFamide in homeostatic behavior control before their own study. Seen that the receptors for SIFamide and GnRHI derive from a common ancestor and apparently both GnRHI and SIFamide share similar roles in behavioral control this might indeed suggests that the basic function of this SIFaR/GnIHR-signaling pathway is conserved. This more broad evolutionary aspect is missing in the discussion of the manuscript.

      • *Answer: We wholeheartedly agree with the reviewer regarding the evolutionary significance of SIFaR's function in relation to GnIHR, and we have expanded the DISCUSSION section to emphasize this important aspect.

      "The relationship between SIFamide receptors (SIFaR) and gonadotropin inhibitory hormone receptors (GnIHR) [89] highlights an intriguing evolutionary connection, as both are believed to have descended from a common ancestor [90,91]. This study expands on previous findings by Martelli et al., demonstrating that SIFamide not only regulates homeostatic behaviors but also plays a significant role in reproductive behavior [43]. GnIHR regulates food intake and reproductive behavior in opposing directions, thereby prioritizing feeding behavior over other behavioral tasks during times of metabolic need [92]. The evolution of these behavioral control mechanisms suggests a complex interplay between neuropeptides that modulate both physiological states and reproductive strategies. As SIFamide influences various behaviors, including feeding and sexual activity, it may be integral to understanding how organisms adapt their reproductive strategies in response to environmental and internal cues. This integration of behavioral modulation underscores the evolutionary significance of SIFamide signaling in coordinating essential life functions in Drosophila melanogaster and potentially other species, revealing pathways through which neuropeptides can shape behavior across different contexts."





      Reference

      1. Zhang T, Wu Z, Song Y, Li W, Sun Y, Zhang X, et al. Long-range neuropeptide relay as a central-peripheral communication mechanism for the context-dependent modulation of interval timing behaviors. bioRxiv. 2024; 2024.06.03.597273. doi:10.1101/2024.06.03.597273
      2. Kim WJ, Jan LY, Jan YN. A PDF/NPF Neuropeptide Signaling Circuitry of Male Drosophila melanogaster Controls Rival-Induced Prolonged Mating. Neuron. 2013;80: 1190–1205. doi:10.1016/j.neuron.2013.09.034
      3. Kim WJ, Jan LY, Jan YN. Contribution of visual and circadian neural circuits to memory for prolonged mating induced by rivals. Nat Neurosci. 2012;15: 876–883. doi:10.1038/nn.3104
      4. Zhang T, Zhang X, Sun D, Kim WJ. Exploring the Asymmetric Body’s Influence on Interval Timing Behaviors of Drosophila melanogaster. Behav Genet. 2024; 1–10. doi:10.1007/s10519-024-10193-y
      5. Sun Y, Zhang X, Wu Z, Li W, Kim WJ. Genetic Screening Reveals Cone Cell-Specific Factors as Common Genetic Targets Modulating Rival-Induced Prolonged Mating in male Drosophila melanogaster. G3: Genes, Genomes, Genet. 2024; jkae255. doi:10.1093/g3journal/jkae255
      6. Lee SG, Sun D, Miao H, Wu Z, Kang C, Saad B, et al. Taste and pheromonal inputs govern the regulation of time investment for mating by sexual experience in male Drosophila melanogaster. PLOS Genet. 2023;19: e1010753. doi:10.1371/journal.pgen.1010753
      7. Huang Y, Kwan A, Kim WJ. Y chromosome genes interplay with interval timing in regulating mating duration of male Drosophila melanogaster. Gene Rep. 2024; 101999. doi:10.1016/j.genrep.2024.101999
      8. Kayser MS, Yue Z, Sehgal A. A Critical Period of Sleep for Development of Courtship Circuitry and Behavior in Drosophila. Science. 2014;344: 269–274. doi:10.1126/science.1250553
      9. Wong K, Schweizer J, Nguyen K-NH, Atieh S, Kim WJ. Neuropeptide relay between SIFa signaling controls the experience-dependent mating duration of male Drosophila. Biorxiv. 2019; 819045. doi:10.1101/819045
      10. Thornquist SC, Langer K, Zhang SX, Rogulja D, Crickmore MA. CaMKII Measures the Passage of Time to Coordinate Behavior and Motivational State. Neuron. 2020;105: 334-345.e9. doi:10.1016/j.neuron.2019.10.018
      11. Buhusi CV, Meck WH. What makes us tick? Functional and neural mechanisms of interval timing. Nat Rev Neurosci. 2005;6: 755–765. doi:10.1038/nrn1764
      12. Merchant H, Harrington DL, Meck WH. Neural Basis of the Perception and Estimation of Time. Annu Rev Neurosci. 2012;36: 313–336. doi:10.1146/annurev-neuro-062012-170349
      13. Allman MJ, Teki S, Griffiths TD, Meck WH. Properties of the Internal Clock: First- and Second-Order Principles of Subjective Time. Annu Rev Psychol. 2013;65: 743–771. doi:10.1146/annurev-psych-010213-115117
      14. Rammsayer TH, Troche SJ. Neurobiology of Interval Timing. Adv Exp Med Biol. 2014; 33–47. doi:10.1007/978-1-4939-1782-2_3
      15. Golombek DA, Bussi IL, Agostino PV. Minutes, days and years: molecular interactions among different scales of biological timing. Philosophical Transactions Royal Soc B Biological Sci. 2014;369: 20120465. doi:10.1098/rstb.2012.0465
      16. Jazayeri M, Shadlen MN. A Neural Mechanism for Sensing and Reproducing a Time Interval. Curr Biol. 2015;25: 2599–2609. doi:10.1016/j.cub.2015.08.038
      17. Croset V, Treiber CD, Waddell S. Cellular diversity in the Drosophila midbrain revealed by single-cell transcriptomics. eLife. 2018;7: e34550. doi:10.7554/elife.34550
      18. Xie T, Ho MCW, Liu Q, Horiuchi W, Lin C-C, Task D, et al. A Genetic Toolkit for Dissecting Dopamine Circuit Function in Drosophila. Cell Reports. 2018;23: 652–665. doi:10.1016/j.celrep.2018.03.068
      19. Certel SJ, Ruchti E, McCabe BD, Stowers RS. A conditional glutamatergic synaptic vesicle marker for Drosophila. G3. 2022;12: jkab453. doi:10.1093/g3journal/jkab453
      20. Li H, Janssens J, Waegeneer MD, Kolluru SS, Davie K, Gardeux V, et al. Fly Cell Atlas: A single-nucleus transcriptomic atlas of the adult fruit fly. Science. 2022;375: eabk2432. doi:10.1126/science.abk2432
      21. Deng B, Li Q, Liu X, Cao Y, Li B, Qian Y, et al. Chemoconnectomics: Mapping Chemical Transmission in Drosophila. Neuron. 2019;101: 876-893.e4. doi:10.1016/j.neuron.2019.01.045
      22. Claridge-Chang A, Assam PN. Estimation statistics should replace significance testing. Nat Methods. 2016;13: 108–109. doi:10.1038/nmeth.3729

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the present study, the authors employ mating behavior in male fruit flies, Drosophila melanogaster, to investigate the behavioral roles of the neuropeptide SIFamide. The duration of mating behavior in these animals varies depending on context, previous experience, and internal metabolic state. The authors use this variability to explore the neuronal mechanisms that control these influences. In an abstraction step, they compare the different mating durations to concepts of neuronal interval timing.

      The behavioral functions of the neuropeptide SIFamide have been thoroughly characterized in several studies, particularly in the contexts of circadian rhythm and sleep, courtship behavior, and food uptake. This study adds new data, demonstrating that SIFamide is essential for the proper control of mating behavior, highlighting the interconnection of various state- and motivation-dependent behaviors at the neuronal level. However, the hypothesis that mating behavior is related to interval timing is not convincingly supported.

      Experimentally, the authors show that RNAi-mediated downregulation of SIFamide affects mating duration in male flies. They use combinations of RNAi lines under the control of various Gal4 lines to identify additional neurotransmitters, neuropeptides, and receptors involved in this process. This approach is complemented by neuroanatomical staining and single-cell RNA sequencing. Overall, the study advances our knowledge about the behavioral roles of SIFamide, which is certainly important, interesting, and worthy of being reported. However, the manuscript also raises several serious caveats and includes points that remain speculative, are less convincing, or are simply incorrect.

      Major concerns:

      • The authors conclude from their mating experiments that SIFamide controls interval timing. This conclusion is not supported by the data, which only indicate that SIFamide is required for normal mating duration and modulates the motivation-dependent component of this behavior. There is no clear evidence linking this to interval timing.
      • On line 160, the authors state, "The connection between the dendrites and axons of the SIFamide neuronal processes is unknown." This is not entirely correct. State-of-the-art connectome analyses can determine synaptic connectivities between SIFamidergic neurons and pre-/postsynaptic neurons. The authors also overlook the thorough connectivity analysis by Martelli et al. (2017), which includes functional analyses and detailed anatomical descriptions that the current study confirms.
      • The mating experiments are overall okay, with sufficiently high sample sizes and appropriate statistical tests. However, many experiments lack genetic controls for the heterozygous parental strains, such as Gal4-ines AND UAS-lines. This is of course of importance and common standard.
      • Using a battery of RNAi lines, the authors aim to uncover which neurotransmitters might be co-released from SIFamide neurons to influence mating behavior. However, a behavioral effect of an RNAi construct expressed in SIFamidergic neurons does not demonstrate that the respective transmitter is actually released from these neurons. Alternative methods are needed to show whether glutamate, dopamine, serotonin, octopamine, etc., are present and released from SIFamide neurons. It is particularly challenging to prove that a certain substance acts as a transmitter released by a specific neuron. For example, anti-Tdc2 staining does not actually cover SIFamide neurons, and dopamine has not been described as present in SIFamide neurons. Single-cell RNA sequencing data alone is insufficient to claim multiple transmitter co-release from SIFamide neurons. Figures illustrating single-cell RNA sequencing, such as Figure 3P-R, are not intuitively understandable, and the figure legends lack sufficient information to clarify these panels. As a side note, Tdc2 is not only present in octopaminergic neurons, but also in tyraminergic neurons.
      • The same argument applies to the expression of sNPF receptors in SIFamide neurons. The rather small anatomical stainings shown in figure 4M do not convincingly and unambiguously show that actually sNPF receptors are located on SIFamide neurons.
      • The authors use the GRASP technique (figure 4N) to determine whether synaptic connections are subject to modulation as a result from the animals' individual experience. The overall extremely bright fluorescence at the dorsal areas of both brain hemispheres (figure 4 N, middle panel) raises doubts whether this signal is actually a specific GRASP fluorescence between two small populations of neurons.
      • The authors cite Martelli et al. (2017) with the hypothesis that sNPF-releasing neurons provide input signals to SIFamide neurons to modulate feeding behavior. However, the cited manuscript does not contain such a hypothesis. The authors should review the reference in more detail.
      • In lines 281 ff., the authors state that SIFamide neurons receive inputs from peptidergic neurons but simultaneously claim that "this speculation is based on morphological observations." This is incorrect. The functional co-activation/imaging analyses provided in Martelli et al. (2017) should not be ignored.
      • Figure 6: A transcriptional calcium sensor (TRIC) was used to quantify the accumulation GFP induced by calcium influx in SIFamide neurons. However, I could not find any description of the method in the materials and methods section, nor any explanation how the data were acquired or analyzed. What is the RFP expression good for? How exactly are thresholds determined, and why are areas rather than fluorescence intensities quantified? Overall, this part of the manuscript is rather confusing and needs more explanation.
      • Similarly, it remains unclear how exactly syteGFP fluorescence and DenMark fluorescence were quantified. Why are areas indicated and not fluorescence intensity values? In fact, it appears worrisome that isolation of males should lead to a drastic decline in synaptic terminals (as measure through a vesicle-associated protein) by ~ 30%, or, conversely, keeping animals in groups lead to an respective increase (figure 7D). The technical information how exactly this was quantified is not sufficient.

      Minor comments:

      • Reference 29 and reference 33 are the same.
      • In figure legends, abbreviations should be explained when used first (e.g., figure 1 A "MD", is explained below for panel C-F), or "CS males".
      • Indications for statistical significance must be shown in all figure legends at the end of each figure legend, not only in figure 1.
      • The figures appear overloaded. For example why do you need two different axis designations (mating duration and differences between means)?
      • Line: 1154: Typo: gluttaminergic should be glutamatergic.
      • The authors frequently write "system" when referring to transmitter types, e.g., "glutaminergic system", "octopaminergic system", etc. It I not clear what the term "system" actually refers to. If the authors claim that SIFamide neurons release these transmitters in addition to SIFamide, they should state that precisely and then add experiments to show that this is the case.
      • Figure S6: It is not explained I the figure legend what fly strain "UAS-ctrl" actually is. Does "ctrl" mean control? And what genotype is hat control?
      • Figure legend S6, line 1371: The authors indicate experiments using UAS-OrkDeltaC. I could not find these data in the figure.
      • Line 470: "...reduced branching of SIFa axons at the postsynaptic level" should perhaps be "presynaptic level"?

      Significance

      Overall, the study advances our knowledge about the behavioral roles of SIFamide, which is certainly important, interesting, and worthy of being reported. However, the manuscript also raises several serious caveats and includes points that remain speculative and are less convincing.

      Overall, the neuronal basis of action selection based on motivational factors (metabolic state, mating experience, sleep/wake status, etc.) is not well understood. The analysis of SIFamide function in insects might provide a way to address the question how different motivational signals are integrated to orchestrate behavior.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript by Song et al. investigates the molecular mechanisms underlying changes in mating duration in Drosophila induced by previous experience. As they have shown previously, they find that male flies reared in isolation have shorter mating duration than those reared in groups, and also that male flies with previous mating experience have shorter mating duration than sexually naïve males. They have conducted a myriad of experiments to demonstrate that the neuropeptide SIFa is required for these changes in mating duration. They have further provided evidence that SIFa-expressing neurons undergo changes in synaptic connectivity and neuronal firing as a result of previous mating experience. Finally, they argue that SIFa neurons form reciprocal connections with sNPF-expressing neurons, and that communication within the SIFa-sNPF circuit is required for experience-dependent changes in mating duration. These results are used to assert that SIFa neurons track the internal state of the flies to modulate behavioral choice.

      Major Comments:

      1. The authors are to be commended for the sheer quantity of data they have generated, but I was often overwhelmed by the figures, which try to pack too much into the space provided. As a result, it is often unclear what components belong to each panel. Providing more space between each panel would really help.

      This is a rare instance where I would recommend paring down the paper to focus on the more novel, clear and relevant results. For example, all of Figure 2 shows the projection pattern of SIFa+ neuron dendrites and axons, which have been reported by multiple previous papers. Figure 7G and J show trans-tango data and SIFaR-GAL4 expression patterns, which were previously reported by Dreyer et al., 2019. These parts could be removed to supplemental figures. Figure 5 details experiments that knock down expression of different neurotransmitter receptors within the SIFa-expressing cells. The results here are less definitive than the SIFa knockdown results, and the SCope data supporting the idea that these receptors are expressed in SIFa-expressing neurons is equivocal. I would recommend removing these data (perhaps they could serve as the basis for another manuscript) or focusing solely on the CCHa1R results, which is the only manipulation that affects both LMD and SMD.

      Finally, I would like the authors to spend more time explaining how they think the results tie together. For example, how do the authors think the changes in branching and activity in SIFa-expressing neurons tie to the change in mating duration provoked by previous experience? It would benefit the manuscript to simplify and clarify the message about what the authors think is happening at the mechanistic level. The various schematics (eg Fig 7N) describe the results but the different parts feel like separate findings rather than a single narrative. 2. Most of the experiments lack traditional controls. For example, in experiments in Fig 1C-K, one would typically include genetic controls that contain either the GAL4 or UAS elements alone. The authors should explain their decision to omit these control experiments and provide an argument for why they are not necessary to correctly interpret the data. In this vein, the authors have stated in the methods that stocks were outcrossed at least 3x to Canton-S background, but 3 outcrosses is insufficient to fully control for genetic background. 3. Throughout the manuscript, the authors appear to use a single control condition (sexually naïve flies raised in groups) to compare to both males raised singly and males with previous sexual experience. These control conditions are duplicated in two separate graphs, one for long mating duration and one for short mating duration, but they are given different names (group vs naïve) depending on the graph. If these are actually the same flies, then this should be made clear, and they should be given a consistent name across the different "experiments". 4. The authors use SCope data to provide evidence for co-expression of SIFa and other neurotransmitters or neuropeptide receptors. The graphs they show are hard to read and it is not clear to what extent the gene expression is actually overlapping. It would be more definitive to show graphs that indicate which percentage of SIFa-expressing cells co-express other neurotransmitter components, and what the actual level of expression of the genes is. The authors should also provide more information on how they identified the SIFa+ cells in the fly atlas dataset. These are important pieces of information to be able to interpret the effects of manipulation of these other neurotransmitter systems within SIFa-expressing cells on mating duration. 5. I would like to see more information on how the thresholding and normalization was done for immunohistochemistry experiments. Was thresholding applied equally across all datasets? Furthermore, "overlap" of Denmark and Syt-eGFP is taken as evidence for synaptic connectivity, but the latter requires more than just overlap in the location of the axon terminal and dendrite regions of the neuron. 6. None of the RNAi experiments have been validated to demonstrate effective knockdown. In many cases, this would be difficult to do because of a lack of an antibody to quantify in a cell-specific manner; however, this fact should be acknowledged, especially in cases where there was found to be a lack of phenotype, which could result from lack of knockdown. The authors could also look for evidence in the literature of cases where RNAi lines they have used have been previously validated. For SIFa, knockdown can be easily confirmed with the SIFa antibody the authors have used elsewhere in the manuscript.

      Minor Comments:

      1. There are quite a lot of citations to preprints, including preprints of the manuscripts under review. It seems inappropriate to cite a preprint of the manuscript you are submitting because it gives a false sense of strengthening the assertions being made in the manuscript.
      2. It seems that labels are incorrect on a number of the immunohistochemistry figures. For example, in Fig 2N, it labels dendrites as green, but this is sytEGFP, which is the presynaptic terminal.
      3. Fig 4N shows grasp between SIFa-LexA and sNPF-R-GAL4, but the authors have argued that these two components should both be expressed in SIFa-expressing cells. This would make grasp signal misleading, because it would appear in the SIFa-expressing cells even without synaptic contacts due to both split GFP molecules being expressed in these cells.
      4. For quantifying TRIC and CaLexA experiments (eg Figure 6A-E), intensity of signal should be measured in addition to the area covered by the signal.

      Significance

      This study will be most relevant to researchers interested in understanding neuronal control of behavior. It has provided novel information about the mechanisms underlying mating duration in flies, which is used to delineate how internal state influences behavioral outcomes. This represents a conceptual advance, particularly in identifying a cell type and molecule that influences mating duration decisions. The strength of the manuscript is the number of different assays used to investigate the central question from a number of angles. The limitation is that there is a lack of a big picture tying the different components of the manuscript together. Too much data is presented without providing a framework to understand how the data points fit together.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer 1:

      - The manuscript needs comprehensive proofreading for language and formatting. In many instances, spaces are missing or not required.

      Thank you for your comments. The manuscript has been thoroughly proofread for errors in language and formatting.

      - Could the authors explore correlation network analyses to get additional insights into the structure of different clusters? 

      We have added a co-occurrence analysis (at species taxonomic level) based on SparCC to the manuscript (Figure 2).

      This is described on Page 9 line 141-148

      - The GitHub link is not correct. 

      The github repository has now been made public.

      - It is not possible to access the dataset on ENA. 

      We have changed the ENA study PRJEB57401 status to open.

      - Add the graphs obtained with decontam analysis as a supplementary figure. 

      We have added the outputs of decontam (.csv files with feature lists of ASVs that were filtered based on the prevalence and frequency tests) to the github repository.

      - There is nothing about the RPL group in the results section, while the authors discuss this issue in the introduction. What about the controls with proven fertility? 

      Thank you. We have amended the manuscript to compare characteristics between the RPL, unexplained subfertility and controls groups.

      Line 1279-130 page 8:  

      “The study group represented 85% of samples with high sperm DNA fragmentation, 85% of samples with elevated ROS and 79% of samples with oligospermia. Rates of abnormal seminal parameters including low sperm concentration, reduced progressive motility and ROS concentrations were found to be highest in the MFI group (Supplementary Figure 1). Baseline characteristics between the RPL, unexplained subfertility and controls groups were similar.

      Line 150-154 Page 9: 

      “Bacterial richness, diversity and load were similar between all patient groups examined in the study (Supplementary Figure 4).

      - While correctly stated in the title, the term microbiota should be used throughout the manuscript instead of "microbiome" 

      Thank you. This misnomer has been amended throughout the manuscript.

      Minor corrections:

      Line 25: provoke is not a good term here. 

      Thank you. The term ‘provoke’ has been removed

      Line 26: why does semen culture have a limited scope? 

      Thank you. Line 40-41 Page 3 has been amended:

      “It is therefore plausible that asymptomatic seminal infections may be associated with impaired reproductive function in some men. Since semen culture has a limited scope for studying the seminal microbiota due to its inability to identify all present microbiota next generation sequencing (NGS) approaches have been reported recently by a growing number of investigators (13, 14, 15, 16, 17, 18, 19)”.

      Line 68: write μl correctly

      Thank you. This has been corrected

      Line 131: several organisms at the genus level. 

      Thank you. This has been corrected

      Line 136: what are the relative abundances of these genera? Is this relevant? 

      The mean relative abundances for the key taxa mention in each cluster are all above 20%. This information has been added to the manuscript text on page 9, line 153.

      Line 173: Molina et al. 

      Thank you. This has been corrected

      Line 173: the contaminations are referred to the low biomass nature of testicular samples. If present, bacteria of accessory gland secretions are an integral part of the seminal microbiota itself. Please review these sentences. 

      Thank you. This had been reworked to highlight the important of urethral contamination, which you later allude to as a limitation of our study is the failure to provide paired urine and semen samples.

      Page 11 line 194-196

      “Molina et al report that 50%-70% of detected bacterial reads may be environmental contaminants in a sample from extracted testicular spermatozoa (35); with the addition of passage along the urethra it is likely that contamination of ejaculated semen would be much higher.”

      Table 1: remove results interpretation from table caption. 

      Thank you this has been acted upon.

      Table 1: why in some cases, like in DNA fragmentation index, the total is not equal to n=223? 

      This is due to missing data/ analysis not possible for some men due to the requirement of a minimum number of sperm in the ejaculate to perform DNA fragmentation testing.

      Table 1: "frag" is not defined. 

      Thank you, this has been amended

      Tables 2, 3 & 4: bacterial genera in italics. 

      Thank you, this has been amended

      Figure 1A: add the fertility status information above the cluster colors. 

      Thank you, this has been amended in Figure 1.

      Figure 1C: the color code is confusing. Use different colors for each cluster. 

      Figure 1 legend: bacterial genera in italics. 

      Figures 1 & 2: the authors should use similar chart formatting in the two tables. 

      Thank you, this has been amended

      Reviewer 2:

      (1) The patient groups have different diagnoses and should be handled as different groups, and not fused into one 'patient' group in analyses. <br /> Why are the data in tables presented as controls and cases? I would consider men from couples with recurrent pregnancy loss, unexplained infertility, and male factor infertility to have different seminal parameters (not to fuse them into one group). This means, that the statistical analyses should be performed considering each group separately, and not to fuse 3 different infertility diagnoses into one patient group. 

      We have conducted detailed analyses, requested by the reviewer, comparing seminal DNA, ROS and microbiota characteristics between each individual patient groups (Supplimental figures 1 and 4). No specific taxa (at either genera or species-level) were found to differ in relative abundance between the diagnostic groups. However, we expect associations between parameters such as reactive oxygen species, or DNA fragmentation, and relative abundance of bacterial species, to be general and not restricted to or specific to each diagnostic group. Therefore, we also conducted further analyses aggregating data from all patient groups to investigate relationships common to these different forms of male reproductive dysfunction.

      (2) Were any covariables included in the statistical analyses, e.g. age, BMI, smoking, time of sexual abstinence, etc? 

      Covariates were not included in the statistical analyses. This has been added in the manuscript to the limitations.

      Page 14 line 267-268

      “Additionally, we did not have other covariables such as smoking status with which to include in further analyses”.

      (3) Furthermore, it is known that 16S rRNA gene analysis does not provide sensitive enough detection of bacteria on the species level. How much do the authors trust their results on the species level? 

      The limitations of taxonomic assignment using 16S rRNA gene metataxonomics are well documented. However, the capacity to assign sequence amplicons at species level depends on the sequence variability of the 16S rRNA gene for each of the taxa reported and the specific gene region chosen. In this study, amplification of the V1-V2 region was performed using a mixed 28f primer set (see methods for details) that enables resolution and assignment of several bacterial species highly relevant to the reproductive tract including Lactobacillus spp., such as L. crispatus and L. iners, (e.g. https://doi.org/10.3389/fcell.2021.641921, https://doi.org/10.1128/msystems.01039-23, https://doi.org/10.1186/s12915-023-01702-2). In this study, we report the presence of L. iners, but not L. crispatus in semen samples, and we have also identified a specific association/co-occurrence between Gardnerella vaginalis and Lactobacillus iners, similar to that observed in vaginal bacterial communities.

      (4) Were the analyses of bacterial genera and species abundances with seminal quality parameters controlled for diagnosis and other confounders? 

      As stated in point 2, no adjustment was made for co-variates. No differences in microbiome composition were observed among the three diagnostic groups, so no adjustments were made to our analysis.

      (5) The authors stress that their study is the biggest on the microbiome in semen. However, when considering that the study consists of 4 groups (with n=46-63), it does not stand out from previous studies. 

      Our study is overall the largest investigating interactions between the seminal microbiome and male reproductive dysfunction. Other studies have included greater numbers of men with infertility.

      (6) Weaknesses: There is a lack of paired seminal/urinal samples. 

      Thank you. This limitation has been added.

      Page 14 line 266-267

      “A further limitation of this study, and others, is the lack of reciprocal genital tract microbiota testing of the female partners, or paired seminal and urinary samples from male participants”.

      Recommendation for authors to consider:

      Including previous classical reviews in the introduction: DOI:10.1097/MOU.0000000000000742 <br /> DOI: 10.1038/s41585-019-0250-y 

      Thank you. This has been added.

      Mentioning in the M&M section that there is a supplementary text with a more detailed M&M part. 

      Thank you. This has been added. Further methodological detail can be found in supplementary text.

      Revising the use of 'microbiota' and 'microbiome', they are not synonyms. When talking of 16S rRNA gene analysis, we consider 'microbiome' analysis. 

      Thank you. This misnomer has been amended throughout the manuscript.

      Revising the text, there are several erratas (e.g. verb missing, etc). 

      Thank you for your comments. The manuscript has been thoroughly proofread for errors in language and formatting.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews: 

      Reviewer #1 (Public Review):

      Summary: 

      In the manuscript entitled "Magnesium modulates phospholipid metabolism to promote bacterial phenotypic resistance to antibiotics", Li et al demonstrated the role of magnesium in promoting phenotypic resistance in V. alginolyticus. Using standard microbiological and metabolomic techniques, the authors have shown the significance of fatty acid biosynthesis pathway behind the resistance mechanism. This study is significant as it sheds light on the role of an exogenous factor in altering membrane composition, polarization, and fluidity which ultimately leads to antimicrobial resistance. 

      Strengths: 

      (1) The experiments were carried out methodically and logically. 

      (2) An adequate number of replicates were used for the experiments. 

      Weaknesses: 

      (1) The introduction section needs to be more informative and to the point.  

      Thank you so much for your suggestion. We have revised the introduction to make it more informative and to the point as following:

      “Non-inheritable antibiotic or phenotypic resistance represents a serious challenge for treating bacterial infections. Phenotypic resistance does not involve genetic mutations Phenotypic resistance does not involve genetic mutations and is transient, allowing bacteria to resume normal growth. Biofilm and bacterial persisters are two phenotypic resistance types that have been extensively studied (Brandis et al., 2023; Corona & Martinez, 2013). Biofilms have complex structures, containing elements that impede antibiotic diffusion, sequestering and inhibiting their activity (Ciofu et al., 2022). Biofilm-forming bacteria and persisters also have distinct metabolic states that significantly reduce their antibiotic susceptibility (Yan & Bassler, 2019). These two types of phenotypic resistance share the common feature in their retarded or even cease of growth in the presence of antibiotics (Corona & Martinez, 2013). However, specific factors that promote phenotypic resistance and allow bacteria to proliferate in the presence of antibiotics remain poorly defined.

      Metal ions have a diverse impact on the chemical, physical, and physiological processes of antibiotic resistance  (Booth et al, 2011; Lu et al, 2020; Poole, 2017). This includes genetic elements that confer resistance to metals and antibiotics (Poole, 2017) and metal cations that directly hinder (or enhance) the activity of specific antibiotic drugs (Zhang et al., 2014). The metabolic environment can also impact the sensitivity of bacteria to antibiotics (Jiang et al., 2023; Lee & Collins, 2012; Peng et al., 2015; Zhang et al., 2020; Zhao et al., 2021). Light metal ions, such as magnesium, sodium, and potassium, can behave as cofactors for different enzymes (Du et al., 2016) and influence drug efficacy. Heavy metal ions, including Cu2+ and Zn2+, confer resistance to antibiotics (Yazdankhah et al., 2014; Zhang et al., 2018). Recent reports suggest that sodium negatively regulates redox states to promote the antibiotic resistance of Vibrio alginolyticus (Yang et al., 2018), while actively growing Bacillus subtilis cope with ribosome-targeting antibiotics by modulating ion flux (Lee et al, 2019). In Gram-negative bacteria, by contrast, zinc enhances antibiotic efficacy by potentiating carbapenem, fluoroquinolone, and β-lactam-mediated killing (Isaei et al., 2016; Zhang et al., 2014). Magnesium influences bacterial structure, cell motility, enzyme function, cell signaling, and pathogenesis (Wang et al., 2019). This mineral also modulates microbiota to harvest energy from the diet (Garcia-Legorreta et al., 2020), allowing Bacillus subtilis to cope with ribosome-targeting antibiotics by modulating ion flux (Lee et al., 2019). However, the role of magnesium in promoting phenotypic resistance is less well understood.

      Vibrios inhabit seawater, estuaries, bays, and coastal waters, regions full of metal ions such as magnesium (Kumarage et al., 2022). Magnesium is the second most dissolved element in seawater after sodium. At a salinity of 3.5% seawater, the magnesium concentration is about 54 mM (Potis, 1968), and in deep seawater, can be as high as 2,500 mM (Wang et al., 2024). Vibrio parahaemolyticus and V. alginilyticus are two representative Vibrio pathogens that infect humans and aquatic animals, resulting in illness and economic loss, respectively (Grimes, 2020). (Fluoro)quinolones such as balofloxacin are used to treat Vibrio infection, however, resistance has emerged due to overuse (Suyamud et al., 2024). Indeed, (fluoro)quinolones are one of China's two primary residual chemicals associated with aquaculture (Liu et al., 2017). Vibrio can develop quinolone resistance through mutations in the DNA gyrase gene or through plasmid-mediated mechanisms (Dutta et al., 2021). Thus, the use of V. parahaemolyticus and V. alginilyticus as bacterial representatives, and balofloxacin as a quinolone-based antibacterial representative, can help to define novel magnesiumdependent phenotypic resistance mechanisms of pathogenic Vibrio species. 

      The current study evaluated whether magnesium induces phenotypic resistance in Vibrio species and defined the molecular/genetic basis for this resistance. Genetic approaches, GC-MS analysis of metabolite and membrane remodeling upon antibiotic exposure, membrane physiology, and extensive antimicrobial susceptibility testing were used for the evaluations.”

      (2) The weakest point of this paper is in the logistics through the results section. The way authors represented the figures and interpreted them in the results section (or the figure legends) does not match. The figures are difficult to interpret and are not at all self-explanatory. 

      Thank you so much for your suggestion. We have followed your suggestion to check the match between result and figures. They are now revised. 

      (3) There are too many mislabeling of the figure panels in the main text which makes it difficult to find out which figures the authors are explaining. There should be more explanation on why and how they did the experiments and how the results were interpreted. 

      Thank you so much for your suggestion. We have checked the figures and main text to ensure that we make every figure clearly stated.  

      Reviewer #2 (Public Review): 

      Summary: 

      In this study, the authors aimed to identify if and how magnesium affects the ability of two particular bacteria species to resist the action of antibiotics. In my view, the authors succeeded in their goals and presented a compelling study that will have important implications for the antibiotic resistance research community. Since metals like magnesium are present in all lab media compositions and are present in the host, the data presented in this study certainly will inspire additional research by the community. These could include research into whether other types of metals also induce multi-drug resistance, whether this phenomenon can be observed in other bacterial species, especially pathogenic species that cause clinical disease, and whether the underlying molecular determinants (i.e. enzymes) of metal-induced phenotypic resistance could be new antimicrobial drug targets themselves. 

      Strengths: 

      This study's strengths include that the authors used a variety of methodologies, all of which point to a clear effect of exogenous Mg2+ on drug resistance in the targeted species. I also commend the authors for carrying out a comprehensive study, spanning evaluation of whole cell phenotypes, metabolic pathways, genetic manipulation, to enzyme activity level evaluation. The fact that the authors uncovered a molecular mechanism underlying Mg2+-induced phenotypic resistance is particularly important as the key proteins should be studied further.

      Weaknesses: 

      I believe there are weaknesses in the manuscript, however. The authors take for granted that the reader is familiar with all the assays utilized, and do not properly explain some experiments, and thus I highly suggest that the authors add a brief statement in each situation describing the rationale for each selected methodology (more details are in the private review to the authors). The Results section is also quite long and bogs down at times, and I suggest that the authors reduce its length by 10 to 20%. In contrast, the Introduction is sparse and lacks key aspects, for example, there should be mention of the study's main purpose and approaches, plus an introduction to the authors' choice of species and their known drug resistance properties, as well as the drug of choice (balofloxacin). Another notable weakness is that the authors evaluated Mg2+-induced phenotypic resistance only against two closely related species, and thus the generalizability of this mechanism of drug resistance is not known. The paper would be strengthened if the authors could demonstrate this type of phenotypic resistance in at least one more Gram-negative species and at least one Gram-positive species (antimicrobial susceptibility evaluations would suffice), each of which should be pathogenic to humans. Demonstrating magnesium-induced phenotypic drug resistance in the WHO Priority Bacterial Pathogens would be particularly important. 

      In general, the conclusions drawn by the authors are justified by the data, except for the interpretation of some experiments. Importantly, this paper has discovered new antimicrobial resistance mechanisms and has also pointed to potential new targets for antimicrobials. 

      Thank you so much for your suggestion! We followed your idea the revise the manuscript as following:

      (1) We added a brief statement in the situation to explain the result and methodology according to your suggestion in the private review.

      (2) To make the streamline of the story more logic, we moved the whole second result to supplementary text and supplementary figure. 

      (3) We revised the introduction part by adding additional information to make it informative and to the point as following:

      “Non-inheritable antibiotic or phenotypic resistance represents a serious challenge for treating bacterial infections. Phenotypic resistance does not involve genetic mutations Phenotypic resistance does not involve genetic mutations and is transient, allowing bacteria to resume normal growth. Biofilm and bacterial persisters are two phenotypic resistance types that have been extensively studied (Brandis et al., 2023; Corona & Martinez, 2013). Biofilms have complex structures, containing elements that impede antibiotic diffusion, sequestering and inhibiting their activity (Ciofu et al., 2022). Biofilm-forming bacteria and persisters also have distinct metabolic states that significantly reduce their antibiotic susceptibility (Yan & Bassler, 2019). These two types of phenotypic resistance share the common feature in their retarded or even cease of growth in the presence of antibiotics (Corona & Martinez, 2013). However, specific factors that promote phenotypic resistance and allow bacteria to proliferate in the presence of antibiotics remain poorly defined.

      Metal ions have a diverse impact on the chemical, physical, and physiological processes of antibiotic resistance  (Booth et al, 2011; Lu et al, 2020; Poole, 2017). This includes genetic elements that confer resistance to metals and antibiotics (Poole, 2017) and metal cations that directly hinder (or enhance) the activity of specific antibiotic drugs (Zhang et al., 2014). The metabolic environment can also impact the sensitivity of bacteria to antibiotics (Jiang et al., 2023; Lee & Collins, 2012; Peng et al., 2015; Zhang et al., 2020; Zhao et al., 2021). Light metal ions, such as magnesium, sodium, and potassium, can behave as cofactors for different enzymes (Du et al., 2016) and influence drug efficacy. Heavy metal ions, including Cu2+ and Zn2+, confer resistance to antibiotics (Yazdankhah et al., 2014; Zhang et al., 2018). Recent reports suggest that sodium negatively regulates redox states to promote the antibiotic resistance of Vibrio alginolyticus (Yang et al., 2018), while actively growing Bacillus subtilis cope with ribosome-targeting antibiotics by modulating ion flux (Lee et al, 2019). In Gram-negative bacteria, by contrast, zinc enhances antibiotic efficacy by potentiating carbapenem, fluoroquinolone, and β-lactam-mediated killing (Isaei et al., 2016; Zhang et al., 2014). Magnesium influences bacterial structure, cell motility, enzyme function, cell signaling, and pathogenesis (Wang et al., 2019). This mineral also modulates microbiota to harvest energy from the diet (Garcia-Legorreta et al., 2020), allowing Bacillus subtilis to cope with ribosome-targeting antibiotics by modulating ion flux (Lee et al., 2019). However, the role of magnesium in promoting phenotypic resistance is less well understood.

      Vibrios inhabit seawater, estuaries, bays, and coastal waters, regions full of metal ions such as magnesium (Kumarage et al., 2022). Magnesium is the second most dissolved element in seawater after sodium. At a salinity of 3.5% seawater, the magnesium concentration is about 54 mM (Potis, 1968), and in deep seawater, can be as high as 2,500 mM (Wang et al., 2024). Vibrio parahaemolyticus and V. alginilyticus are two representative Vibrio pathogens that infect humans and aquatic animals, resulting in illness and economic loss, respectively (Grimes, 2020). (Fluoro)quinolones such as balofloxacin are used to treat Vibrio infection, however, resistance has emerged due to overuse (Suyamud et al., 2024). Indeed, (fluoro)quinolones are one of China's two primary residual chemicals associated with aquaculture (Liu et al., 2017). Vibrio can develop quinolone resistance through mutations in the DNA gyrase gene or through plasmid-mediated mechanisms (Dutta et al., 2021). Thus, the use of V. parahaemolyticus and V. alginilyticus as bacterial representatives, and balofloxacin as a quinolone-based antibacterial representative, can help to define novel magnesiumdependent phenotypic resistance mechanisms of pathogenic Vibrio species. 

      The current study evaluated whether magnesium induces phenotypic resistance in Vibrio species and defined the molecular/genetic basis for this resistance. Genetic approaches, GC-MS analysis of metabolite and membrane remodeling upon antibiotic exposure, membrane physiology, and extensive antimicrobial susceptibility testing were used for the evaluations.”

      (4) We examined the effect of magnesium in WHO listed priority strains, which confirmed the results as following:

      “Importantly, exogenous MgCl2 also increased MICs of clinic isolates, carbapenemresistant Escherichia coli, carbapenem-resistant Klebsiella pneumoniae, carbapenemresistant Pseudomonas aeruginosa and carbapenem-resistant Acinetobacter baumannii to balofloxacin (Fig 1G).”

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors): 

      (1) There are many grammatical mistakes to point out. The manuscript needs proofreading and editing.

      We appreciate this comment! The manuscript has been revised by a native speaker.

      (2) The introduction could be more informative. A little more description of magnesium - such as what it does to antibiotics and how it's known to affect the microbiome - might be helpful for the general readers. The question remains why out of all the metal ions that might affect antibiotic resistance (many of them are less explored), authors particularly decided to work on the effect of magnesium. The introduction should cover the rationale of their hypothesis. Also, the authors might want to briefly talk about the model organisms (V. algonolyticus and V. parahemolyticus) describing how threatening they are and how they are becoming resistant to antibiotics. 

      We appreciate this comment! We revise the introduction by providing additional information as following:

      “In Gram-negative bacteria, by contrast, zinc enhances antibiotic efficacy by potentiating carbapenem, fluoroquinolone, and β-lactam-mediated killing (Isaei et al., 2016; Zhang et al., 2014). Magnesium influences bacterial structure, cell motility, enzyme function, cell signaling, and pathogenesis (Wang et al., 2019). This mineral also modulates microbiota to harvest energy from the diet (Garcia-Legorreta et al., 2020), allowing Bacillus subtilis to cope with ribosome-targeting antibiotics by modulating ion flux (Lee et al., 2019). However, the role of magnesium in promoting phenotypic resistance is less well understood.

      Vibrios inhabit seawater, estuaries, bays, and coastal waters, regions full of metal ions such as magnesium (Kumarage et al., 2022). Magnesium is the second most dissolved element in seawater after sodium. At a salinity of 3.5% seawater, the magnesium concentration is about 54 mM (Potis, 1968), and in deep seawater, can be as high as 2,500 mM (Wang et al., 2024). Vibrio parahaemolyticus and V. alginilyticus are two representative Vibrio pathogens that infect humans and aquatic animals, resulting in illness and economic loss, respectively (Grimes, 2020). (Fluoro)quinolones such as balofloxacin are used to treat Vibrio infection, however, resistance has emerged due to overuse (Suyamud et al., 2024). Indeed, (fluoro)quinolones are one of China's two primary residual chemicals associated with aquaculture (Liu et al., 2017). Vibrio can develop quinolone resistance through mutations in the DNA gyrase gene or through plasmid-mediated mechanisms (Dutta et al., 2021). Thus, the use of V. parahaemolyticus and V. alginilyticus as bacterial representatives, and balofloxacin as a quinolone-based antibacterial representative, can help to define novel magnesiumdependent phenotypic resistance mechanisms of pathogenic Vibrio species. 

      The current study evaluated whether magnesium induces phenotypic resistance in Vibrio species and defined the molecular/genetic basis for this resistance. Genetic approaches, GC-MS analysis of metabolite and membrane remodeling upon antibiotic exposure, membrane physiology, and extensive antimicrobial susceptibility testing were used for the evaluations. ”

      (3) Figure 1C is mislabeled as 1B (line 100). Line 101: The sentence is not clear and very confusing. What is meant by 15.6mM - 62.4 mM? Are they talking about the concentration of BLFX (though in the figure the concentration was shown in µg)? Please rewrite the sentence in a simplified way. Also, the zone of inhibition was decreased with increasing MgCl2, not increased. 

      We appreciate this comment! These have been revised, including that Fig 1B is now corrected as Fig. 1C. Line 101, which is now Line 122. The sentence was revised as following:

      “At balofloxacin doses of 1.56, 3.125, 6.25, and 12.5 µg, the zone of inhibition decreased with increasing MgCl2 (Fig 1D)”

      (4) In the western blot images, it would be nice to indicate the MW of the protein bands shown. The loading control used for the experiments should be clearly mentioned in the figure legends. 

      We appreciate this comment! The MWs are indicated in the western-blot image throughout the manuscript. 

      The loading control is clearly stated in the figure legend as following:

      “Whole cell lysates resolved by SDS-PAGE gel was stained with Coomassie brilliant blue as loading control.”. 

      (5) Figures 2 B and C: the figure legend does not explain what the authors wanted to show. It's not clear how they plotted the inhibitory curve, or the binding efficacy. These panels need an explanation of how the analysis was done.

      We appreciate this comment! The figure 2 is now removed to Suppl. Fig 2, and the description of figure 2 is moved to Suppl. Text. We revise the description of the result as following, which is in Suppl. Text:

      “Prior studies suggest that the chelation of antibiotics by magnesium ions inhibits antibiotic uptake (Deitchman et al., 2018; Lunestad and Goksøyr, 1990). To investigate whether magnesium binds to balofloxacin, balofloxacin was pre-incubated with magnesium, and zone of inhibition (ZOI) analysis was conducted. Six different concentrations of balofloxacin were separately incubated with six different concentrations of MgCl2, and then spotted on filter paper so that a defined amount of balofloxacin could be used for ZOI. While lower concentrations of MgCl2, (0.78, 3.125, or 12.5 mM) did not alter the ZOI, higher concentrations, including 50 and 200 mM MgCl2, decreased the ZOI (Suppl. Fig 2A), suggesting that even high doses of magnesium had only a partial effect on balofloxacin through direct binding. For example, at 200 mM MgCl2 and 5 or 10 μg/mL balofloxacin, the balofloxacin ZOI was 53.2 and 70.3% of the ZOI at 0 mM MgCl2, suggesting that  50% of the antibiotics were still functional. Intracellular BLFX also decreased with increasing MgCl2 (Suppl. Fig 2B), while exogenous Mg2+ increased intracellular Mg2+ levels in a dose-dependent manner. For example, exogenous 50 and 200 mM MgCl2 increased intracellular Mg2+ levels to 1.21 and 1.31 mM, respectively (Suppl. Fig 2C). The relationship between TolC, an efflux pump that transports quinolones from bacterial cells, and Mg2+ was also assessed (Kobylka et al., 2020; Song et al., 2020). The expression of TolC/tolC was unaffected by Mg2+ (Suppl. Fig 2D). Magnesium is critical for LPS stability. LPS levels increased at 200 mM Mg2+ (Suppl. Fig 2E), however, the loss of waaF, lpxA, and lpxC, three key genes involved in LPS biosynthesis, did not influence balofloxacin sensitivity/resistance in the presence of Mg2+ (Suppl. Fig 2F). These findings suggest that magnesium-induced LPS biosynthesis does not contribute directly to BLFX resistance and demonstrate that Mg2+ influx is involved in balofloxacin resistance.”

      (6) For the metabolomics results, it will help immensely if the authors provide a volcano plot of the identified metabolites and plot the heat map according to the -log2 metabolite intensities. In Figure 3A, it's not clear what information is conveyed through Euclidean distance calculations of the heat map. In Figure 3 B, the authors mentioned that the OPLS-DA test was conducted, although the figure shows a PCA plot, so it's not clear how these two are connected. Figure 3 E: the figure legend says scattered plot, but the panel represents color-coded numerical values, not a scattered plot. Also, it's not clear how they got those values. 

      We appreciate this comment! We quite agree with you that if the differential metabolites could be shown as volcano plot. However, we didn’t adopt volcano plot in this study because this is a magnesium concentration-dependent metabolomes that includes 6 groups in parallel. Volcano plots may give a complex view of the comparison among different groups. We also tried to plot the heat map according to the -log2 metabolite intensities. Although this analysis cluster 200 mM and 50 mM groups better, the data of low magnesium concentrations was not consistent, which may be due to the minor metabolic change of low concentrations magnesium. Thank you for your understanding. 

      For Euclidean distance calculations, we explain in the figure legend as following:

      “Euclidean distance calculations were used to generate a heatmap that shows clustering of the biological and technical replicates of each treatment.” 

      In Figure 2B, which was Figure 3B in previous version, it has been replaced with OPLS-DA analysis in the revised version. 

      In Figure 2E, which was Figure 3E in previous version, it is revised as following:

      “E. Areas of the peaks of palmitic acid and stearic acid generated by GC-MS analysis.” 

      (7) In Figure 4, the figure legends (as well as the in the text) are not properly referred to. Please make sure to refer to the correct panel. 

      We appreciate this comment! The figure legends have been corrected to match the panel and text. 

      Figure 4F: how was the synergy analysis done? In the methods section, the authors described the antibiotic bactericidal assay protocol, but there was no clear indication of how they generated the isobologram. 

      We appreciate this comment! We provide additional information in the Figure 3F legend, which was Figure 4F in previous version,  as following: 

      “Synergy analysis for BFLX with palmitic acid for V. alginolyticus. Synergy was performed by comparing the dose needed for 50% inhibition of the synergistic agents (white) and non-synergistic (i.e., additive) agents (purple).”

      (8) Figure 5 A: the scatter plot is plotted according to the area along the Y axis: which "area" is represented here? There is absolutely no explanation, neither in the results nor in the figure legends. Using box plots might be a better option than using a scattered plot.

      We appreciate this comment! “Area” has been noted in the revised manuscript as following:

      “The area indicates the area of the peak of the metabolite in total ion chromatography of GC-MS.” 

      (9) In Figure 6 A, the heat map is plotted according to the column Z scores. What is meant by "column Z score"? The corresponding figure legend says, "heat map showing differential abundance of lipid". Z scores do not represent an abundance of a variable, so the conclusion might not be appropriate here. 

      We appreciate this comment! In Figure 5A, which was Figure 6A in previous version, column Z score shows the abundance of metabolites analyzed, which is automatically generated in the heat map analysis to give a sign of these metabolites tested. The legend has been revised as following: 

      “Heatmap showing changes in differential lipid levels at the indicated concentration of MgCl2.”  

      (10) Line 313-314: it should be Figure EV6C.  

      We appreciate this comment! The citation has been corrected.

      (11) The authors have shown that Mg+2 does not alter the LPS transport system, however, there was some significant increase in LPS expression at 200mM MgCl2. It would be interesting if the authors could also check if Mg+2 has any effect on the outer membrane protein (OMP) integrity (by checking OMP components BamA and LptD).  

      We appreciate this comment!  We have carefully examined the membrane permeability in Figure 7. We thus didn’t perform additional experiment here to see the change of BamA and LptD. Thank you very much for your understanding.

      (12) I wonder if the authors could check the effect of extracellular Mg+2 during the co-treatment of palmitic acid, linoleic acid, and balofloxacin. Will there still be the antagonistic effect or the presence of Mg+2 could change the phenotype? 

      We appreciate this comment! Additional experiments is performed as following:

      “Furthermore, magnesium had a minimal effect on the antagonistic effect of palmitic acid, linolenic acid, and balofloxacin (Fig 4G), suggesting that this mineral functions through lipid metabolism.” 

      Reviewer #2 (Recommendations For The Authors)

      (1) As mentioned in the Public Review, I strongly believe that the impact of this study will be more significant if magnesium-induced phenotypic drug resistance could be demonstrated in at least one other Gram-negative and one other Grampositive species, both of which should be human pathogens. The full suite of experiments would not be necessary for this suggestion; evaluation of the effect of Mg concentration in growth media on the drug resistance of other species, testing the different antibiotic types used in this study, would be sufficient. 

      We appreciate this comment! Additional experiments have performed to test this idea. Mg2+ has the similar effect on carbapenem-resistant Escherichia coli, carbapenem-resistant Klebsiella pneumoniae, carbapenem-resistant Pseudomonas aeruginosa and carbapenem-resistant Acinetobacter baumannii as the similar as on the Vibrio species in shown in Figure 1G. These have been described following as

      “Importantly, exogenous MgCl2 also increased MICs of clinic isolates, carbapenemresistant Escherichia coli, carbapenem-resistant Klebsiella pneumoniae, carbapenemresistant Pseudomonas aeruginosa and carbapenem-resistant Acinetobacter baumannii to balofloxacin (Fig 1G).”

      (2) I recommend that the Introduction section be expanded. I recommend one or two sentences introducing the two Vibrio species selected for study. I.e. why did the authors choose these two species? What is known about their phenotypic drug resistance in the literature? Why did the authors select balofloxacin for their studies, is it a common antimicrobial used vs Vibrios? As well, the end of the Introduction section ends abruptly with no transition to the present study itself. The end of the introduction should include one or two sentences introducing the main purpose of the study, its approach, and the techniques undertaken. For example, "In this study, we evaluated whether magnesium induces phenotypic resistance in Vibrio species and the molecular/genetic basis for such resistance. We used genetic approaches, GC-MS analysis of metabolite and membrane remodeling upon antibiotic exposure, membrane physiology, and extensive antimicrobial susceptibility evaluations." 

      We appreciate this comment! We revise the introduction by providing additional information as following:

      “In Gram-negative bacteria, by contrast, zinc enhances antibiotic efficacy by potentiating carbapenem, fluoroquinolone, and β-lactam-mediated killing (Isaei et al., 2016; Zhang et al., 2014). Magnesium influences bacterial structure, cell motility, enzyme function, cell signaling, and pathogenesis (Wang et al., 2019). This mineral also modulates microbiota to harvest energy from the diet (Garcia-Legorreta et al., 2020), allowing Bacillus subtilis to cope with ribosome-targeting antibiotics by modulating ion flux (Lee et al., 2019). However, the role of magnesium in promoting phenotypic resistance is less well understood.

      Vibrios inhabit seawater, estuaries, bays, and coastal waters, regions full of metal ions such as magnesium (Kumarage et al., 2022). Magnesium is the second most dissolved element in seawater after sodium. At a salinity of 3.5% seawater, the magnesium concentration is about 54 mM (Potis, 1968), and in deep seawater, can be as high as 2,500 mM (Wang et al., 2024). Vibrio parahaemolyticus and V. alginilyticus are two representative Vibrio pathogens that infect humans and aquatic animals, resulting in illness and economic loss, respectively (Grimes, 2020). (Fluoro)quinolones such as balofloxacin are used to treat Vibrio infection, however, resistance has emerged due to overuse (Suyamud et al., 2024). Indeed, (fluoro)quinolones are one of China's two primary residual chemicals associated with aquaculture (Liu et al., 2017). Vibrio can develop quinolone resistance through mutations in the DNA gyrase gene or through plasmid-mediated mechanisms (Dutta et al., 2021). Thus, the use of V. parahaemolyticus and V. alginilyticus as bacterial representatives, and balofloxacin as a quinolone-based antibacterial representative, can help to define novel magnesiumdependent phenotypic resistance mechanisms of pathogenic Vibrio species. 

      The current study evaluated whether magnesium induces phenotypic resistance in Vibrio species and defined the molecular/genetic basis for this resistance. Genetic approaches, GC-MS analysis of metabolite and membrane remodeling upon antibiotic exposure, membrane physiology, and extensive antimicrobial susceptibility testing were used for the evaluations. ”

      (3) The authors introduce the acronym AWST but never use it again in the paper, instead they use SWT. The authors should introduce SWT only for consistency. 

      We appreciate this comment! We have corrected all the “SWT” to “ASWT”

      (4) Line 76 is not clear: what is meant by "some of which could influence drug efficacy" - the enzymes that utilize light metal ions are co-factors? Or the metals directly?  

      We appreciate this comment! The information we wanted to deliver is that light metal ions can serve as cofactors to catalyze biochemical reaction. Such chemical reaction would alter the drug efficacy, e.g. the Fe-S cluster are metallocofactor for proteins which regulates redox chemistry including antibioticinduced redox change. However, this information is not appropriate for this manuscript, so we delete this sentence. 

      (5) Line 90: add a reference corroborating that this chemical composition is a mimic of marine water. The NaCl concentration used in particular looks quite low. 

      We appreciate this comment! It was a typo error. The NaCl concentration was 210 mM as shown in Suppl. Table 1. We also provide details of the chemical composition of the marine water as following:

      “Marine environments and agriculture, where antibiotics are commonly used, are rich in magnesium. To investigate whether this mineral impacts antibiotic activity, the minimal inhibitory concentration (MIC) of V. alginolyticus ATCC33787 and V. parahaemolyticus VP01, which we referred as ATCC33787 and VP01 afterwards, isolated from marine aquaculture, to balofloxacin (BLFX) in Luria-Bertani medium

      (LB medium) plus 3% NaCl as LBS medium and “artificial seawater” (ASWT) medium that included the major ion species in marine water (Wilson, 1975) (LB medium plus 210 mM NaCl, 35 mM Mg2SO4, 7 mM KCl, and 7 mM CaCl2) were assessed”

      (6) Line 98 and Figure 1B. M9 is indicated in the text but does not appear in the figure, the figure only shows SWT. This should be checked. Line 99: based on Figure 1C, the authors are adding MgCl2 to SWT, SWT should be mentioned in this line. Line 100: I believe this is referring to Figure 1C, which should be checked. 

      We appreciate this comment! 

      Line 98, which is now Line 118: We have corrected M9 to ASWT as following:

      “However, the MIC for BLFX was higher in ASWT medium supplemented with Mg2SO4 or MgCl2 than in LB medium (Fig 1B).”

      Line 99, which is now Line 133: the sentence is corrected as following:

      “The MIC for BLFX increased at higher concentrations of MgCl2 in ASWT”

      Line 100, which is now Line 135: we have corrected Fig 1B to Fig. 1C.

      (7) Line 101: text and Figure 1D are not consistent, as Figure 1D does not show this level of precision in added MgCl2 as indicated in the text (15.6 - 62.4 mM).  

      We appreciate this comment! The sentence has been corrected as following: “At balofloxacin doses of 1.56, 3.125, 6.25, and 12.5 µg, the zone of inhibition decreased with increasing MgCl2 (Fig 1D)””.  

      (8) MgCl2 clearly induces increasing levels of BLFX resistance, and to high levels, but not for every antibiotic. For example, the level of increased resistance to blactams is low (ceftriaxone) and plateaus (ceftazidime). As well, resistance to gentamicin plateaus at a lower level than the other aminoglycosides. These observations do not take away from the conclusion that Mg induces multi-drug resistance, but since the behaviour of the MICs for these drugs is different than the other drugs, they should be mentioned. Also, Figure 1F - tetracyclines (plural) is used for vertical axis label - does this refer to the tetracycline itself or the class itself, and if the class, which one was tested? 

      We appreciate this comment! We revise the description as following: “Notably, magnesium had a reduced effect on ceftriaxone and gentamicin than other antibiotics.”

      The tetracyclines is labeled as “Oxytetracycline” in the revised manuscript. 

      - The magnesium chelation experiments presented in Figure 2 are not clear. The authors should briefly mention how this was done around line 128, and what data underlies the values in Figure 2C. Figure 2B is also not clear to me at all. Similarly, how the authors measured intracellular balofloxacin and Mg2+ is not clear and should be mentioned briefly around lines 130-132. 

      We appreciate this comment! These have been rewritten following as  “To investigate whether magnesium binds to balofloxacin, balofloxacin was preincubated with magnesium, and zone of inhibition (ZOI) analysis was conducted. Six different concentrations of balofloxacin were separately incubated with six different concentrations of MgCl2, and then spotted on filter paper so that a defined amount of balofloxacin could be used for ZOI. While lower concentrations of MgCl2, (0.78, 3.125, or 12.5 mM) did not alter the ZOI, higher concentrations, including 50 and 200 mM MgCl2, decreased the ZOI (Suppl. Fig 2A), suggesting that even high doses of magnesium had only a partial effect on balofloxacin through direct binding. For example, at 200 mM MgCl2 and 5 or 10 μg/mL balofloxacin, the balofloxacin ZOI was 53.2 and 70.3% of the ZOI at 0 mM MgCl2, suggesting that  50% of the antibiotics were still functional. Intracellular BLFX also decreased with increasing MgCl2 (Suppl. Fig 2B), while exogenous Mg2+ increased intracellular Mg2+ levels in a dose-dependent manner. For example, exogenous 50 and 200 mM MgCl2 increased intracellular Mg2+ levels to 1.21 and 1.31 mM, respectively (Suppl. Fig 2C). The relationship between TolC, an efflux pump that transports quinolones from bacterial cells, and Mg2+ was also assessed (Kobylka et al., 2020; Song et al., 2020). The expression of TolC/tolC was unaffected by Mg2+ (Suppl. Fig 2D). Magnesium is critical for LPS stability. LPS levels increased at 200 mM Mg2+ (Suppl. Fig 2E), however, the loss of waaF, lpxA, and lpxC, three key genes involved in LPS biosynthesis, did not influence balofloxacin sensitivity/resistance in the presence of Mg2+ (Suppl. Fig 2F). These findings suggest that magnesium-induced LPS biosynthesis does not contribute directly to BLFX resistance and demonstrate that Mg2+ influx is involved in balofloxacin resistance.”

      - Line 135: LPS cannot be "expressed", as the authors word it here. This should be corrected. Also, the inspection of Figure 2G actually shows the levels of LPS increase with increased Mg2+. The authors should re-evaluate these results and change their description around this area of the Results. 

      We appreciate this comment! We have removed the whole Figure 2 to Supplementary Text and Supplementary Figure 2. We rewrite this part as following: “The relationship between TolC, an efflux pump that transports quinolones from bacterial cells, and Mg2+ was also assessed (Kobylka et al., 2020; Song et al., 2020). The expression of TolC/tolC was unaffected by Mg2+ (Suppl. Fig 2D). Magnesium is critical for LPS stability. LPS levels increased at 200 mM Mg2+ (Suppl. Fig 2E), however, the loss of waaF, lpxA, and lpxC, three key genes involved in LPS biosynthesis, did not influence balofloxacin sensitivity/resistance in the presence of Mg2+ (Suppl. Fig 2F). These findings suggest that magnesium-induced LPS biosynthesis does not contribute directly to BLFX resistance and demonstrate that Mg2+ influx is involved in balofloxacin resistance.”

      - Section: MgCl2 affects bacterial metabolism. Authors switched to M9 medium - why? This contrasts with other sections using SWT and should be explained. Also, I cannot evaluate whether the statistical analysis of the data here was performed correctly and was appropriate for this type of experiment. I advise the authors to move the details in lines 166-169 to the Materials and Methods and replace this section instead with a more accessible description of the statistical analysis that a non-expert would be able to appreciate. Furthermore, analysis of Figure 3A indicates that the levels of asparagine, 4-hydroxybutyric acid, uracil, cystathionine, fumaric acid, and aminoethanol have significantly changed at high MgCl2, but these are not mentioned in the text. I suggest the authors mention these if they are relevant to the 12 enriched pathways, especially the biosynthesis of fatty acids. 

      We appreciate this comment! 

      We indicate the reason we use M9 medium as following:

      “To better understand how magnesium affects bacterial metabolism” for explaining why the M9 medium was used.”

      The information lines 166-169 indicated has been removed to M &M. 

      We have carefully examined the abundance of the metabolites and the enriched pathway. Among the listed metabolites, only fumarate is within the enriched pathways. We mention this point in our revised manuscript as following:

      “The increase in fatty acid biosynthesis could be partially explained by an imbalanced pyruvate cycle/TCA cycle, in which fumarate levels increased at higher Mg2+ while succinate levels increased at lower Mg2+ (Suppl. Fig 5B). These findings indicated that glycolysis fluxes into fatty acid biosynthesis rather than the pyruvate cycle/TCA cycle. The relevance of fatty acids and BLFX was demonstrated by the observation that exogenous palmitic acid increased bacterial resistance to balofloxacin (Fig 2F). These results suggest that fatty acid metabolism may be critical to magnesium-based phenotypic resistance.”

      - Line 211 appears to refer to Figure 4F and should be checked. Similarly in line 216 - appears this should be Figure 4H, and line 218 should be Figure 4H. Line 226: add a reference to Fig 4I (after arcA was decreased). Line 227: what are genes N646_1004 and N646_1885? Based on Fig 4J these are crp - authors should add to line 227. Line 228 appears to refer to Figure 4J, not Figure 4I. Line 229 - should be Figure 4K, not Figure 4I. Line 231 - should be 4L, not 4K. Line 239 - should be 4M.

      We appreciate this comment! The text and figure is now matched. 

      - Line 312: the descriptions of "11 lipids, 32 lipids, and 53", and then "26 lipids, 52 lipids, and 107 lipids" are not clear at all and should be corrected. 

      We appreciate this comment! The sentence is revised as following:

      “The abundance of 11, 32, and 53 lipids was increased in 3.125, 50, and 200 mM MgCl2-treated bacteria, respectively, while the abundance of 26, 52, and 107 lipids was decreased in 3.125, 50, and 200 mM MgCl2-treated bacteria, respectively (Suppl. Fig 7C)”

      - Line 340. What is the assay the authors are using to measure the levels of the PGS and PSS enzymes? This is not mentioned or clear in this part of the Results.  

      We appreciate this comment!  We provide the information in the manuscript as following:

      “Levels of PGS and PSS were quantified by ELISA kits according to manufacture’s instruction (Shanghai Fusheng Industrial Co., Ltd., China)”

      - Line 372: What is the assay for measuring membrane depolarization? This is not mentioned and I suggest it should be. Line 374: Figure 7B does not show time dependence, only dose dependence, this should be corrected, it is assumed the authors are referring to Fig 7C for the time dependence data. 

      We appreciate this comment! We provide the information in the result as following:  

      “The voltage-sensitive dye, DiBAC4(3) showed that 12.5–200 mM MgCl2 promoted membrane depolarization in a dose-dependent manner (Fig 6A)”

      We also explain how DiBAC4(3) can be used to measure membrane depolarization in the Materials and Methods section as following:

      “DiBAC4(3) is a s voltage-sensitive probe that penetrates depolarized cells, binding intracellular proteins or membranes exhibiting enhanced fluorescence and red spectral shift.”

      To make it clear the specific figure, we revise the sentence as following:

      “Meanwhile, MgCl2 had a dose-dependent (Fig 6B) and time-dependent (Fig 6C) effect on proton motive force (PMF).”

      - Line 384: mention how FM5-95 measures membrane permeability. The authors should also clarify how this reagent is used to measure membrane fluidity, and it is not clear if the data for this is presented in Figure 7 - please clarify. Regarding SYTO9 dye experiment: the authors should briefly explain the experimental design - how SYTO9 dye operates and why FACS was chosen. What is labeled with FITC?  

      We appreciate this comment! We clarify the reason we use FM5-95 in the Methods and Materials section as following:

      “Measurement of fluidity by fluorescence microscopy

      Measurement of membrane fluidity is performed as previously described (Wen et al., 2022). Briefly, ATCC33787 were cultured in medium with indicated concentrations of MgCl2, collected and then adjusted to OD 0.6. Aliquot of 100 μL bacteria cells of each sample were diluted to 1 mL and 10 μL (10 mg/mL) FM5-95 (Thermo Fisher

      Scientific, USA) was added. FM5-95 is a lipophilic styryl dye that insert into the outer leaflet of bacterial membrane and become fluorescence. This dye preferentially bind to the microdomains with high membrane fluidity(Wen et al., 2022). After incubated for 20 min at 30 ℃ at vibration without light, the sample was centrifuged for 10 min at 12,000 rpm. The pellets were resuspended with 20 μL of 3% NaCI. Aliquot of 2 μL sample was dropped on the agarose slide, and take photos under the inverted fluorescence microscope.”

      This data is presented as micrographs in Fig. 6D, which shows the decreased FM5-95 staining with increasing concentrations of MgCl2. We make this description clear with the following revision:

      “FM5-95 staining decreased with increasing concentrations of Mg2+, and no staining was observed in the presence of 200 mM Mg2+ (Fig 6D).”

      We explain the reason why we use SYTO9 as following:

      “SYTO9, a green fluorescent dye that binds to nucleic acid, enters and stains bacteria cells when there is an increase in membrane permeability (Lehtinen et al., 2004; McGoverin et al., 2020). Staining decreased with increasing MgCl2, indicating that bacterial membrane permeability declined in an Mg2+ dose-dependent manner (Fig 6E).”

      We didn’t use FACS in this study, while we only analyze the fluorescence distribution with the equipment. To make it clear, we revise the sentence as following:

      “After incubated for 15 min at 30 ℃ at vibration without light, the mixtures were filtered and measured by flow cytometry (BD FACSCalibur, USA).”

      - Lines 391-397. The statement that palmitic acid shifts the peaks in Figure 7F is not supported by the data. There is essential no change in the major peak position within each MgCl2 concentration set with increasing palmitic acid. For the linolenic acid data, it is clear that linolenic acid increases permeability only at 50 mM MgCl2-this should be mentioned in the text. 

      We appreciate this comment! We revise the sentence as following:

      “Exogenous palmitic acid also shifted the fluorescence signal peaks to the left in an MgCl2-dependent manner while palmitic acid only slightly shifted the peaks (Fig 6F). In contrast, exogenous linolenic acid shifted the peak to the right in a dose-dependent manner at 50 mM MgCl2 (Fig 6G).” 

      - Line 404-405 - as mentioned earlier, the assay for the update of BLFX should be mentioned (if it is done so earlier in the text, then it does not need to be here).  

      We appreciate this comment! It has been mentioned in the introduction.  

      - Discussion: CpxA/R-OmprF pathway is mentioned here for the first time. Is this one of the pathways modified by MgCl2 as determined during the course of the study? If so, this should be reworded to mention that. If not, the relevance of this particular pathway as it relates to light metals and phenotypic resistance should be discussed.

      We appreciate this comment! Since it is not relevant to the discussion of Mg2+ and fatty acid biosynthesis, we delete this sentence in the revised manuscript.  

      -The following grammatical errors should be corrected:

      -line 55 change to: "genetic mutations; instead, this type of resistance is transient, and bacteria resume normal growth"

      -line 57: change to "resistance types are biofilm" 

      -line 61: change to "states that significantly" 

      -line 63: change to "resistance share the common feature in they retard or even cease in the presence" 

      -line 65: change to "resistance that allow bacteria to proliferate" 

      -line 81: change "But whether" to "Whether" 

      -line 178: change to "may be critical to the Mg-based phenotypic resistance"

      -line 86: change to "Marine environments and agriculture are rich in magnesium, where..." 

      -line 93: change in to vs

      -line 154: insert space after metabolism 

      -line 158: change 'identified" to "focused on the levels of" 

      -line 160: change "The levels of forty-one metabolites" 

      -line 198: change shared to share 

      -line 310: increased is duplicated, delete one 

      -line 451: add "the" before ratio 

      -line 453: gram should be capitalized 

      -line 462: "the regulation" should be reworded to "More importantly, the effect of exogenous MgCl targets the..." 

      -line 469: add dash between Mg2+ and limited

      -line 478: change "the crucial" to "a crucial" 

      -there are numerous locations in the manuscript where the word "magnetism" is used when clearly the word is supposed to be magnesium - this should be corrected

      We appreciate this comment! These have been corrected or revised. 

      Editors comments:

      Page 2 line 27; Page 25 line number 426; page 27 line number 481: In the abstract and discussion, only Vibrio alginolyticus was mentioned, even though two Vibrio species were used in the study. It would be helpful to understand the rationale behind the focus on this particular species.

      We appreciate this comment! We have revised the introduction to provide additional information as following:

      “Vibrios inhabit seawater, estuaries, bays, and coastal waters, regions full of metal ions such as magnesium (Kumarage et al., 2022). Magnesium is the second most dissolved element in seawater after sodium. At a salinity of 3.5% seawater, the magnesium concentration is about 54 mM (Potis, 1968), and in deep seawater, can be as high as 2,500 mM (Wang et al., 2024). Vibrio parahaemolyticus and V. alginilyticus are two representative Vibrio pathogens that infect humans and aquatic animals, resulting in illness and economic loss, respectively (Grimes, 2020). (Fluoro)quinolones such as balofloxacin are used to treat Vibrio infection, however, resistance has emerged due to overuse (Suyamud et al., 2024). Indeed, (fluoro)quinolones are one of China's two primary residual chemicals associated with aquaculture (Liu et al., 2017). Vibrio can develop quinolone resistance through mutations in the DNA gyrase gene or through plasmid-mediated mechanisms (Dutta et al., 2021). Thus, the use of V. parahaemolyticus and V. alginilyticus as bacterial representatives, and balofloxacin as a quinolone-based antibacterial representative, can help to define novel magnesium-dependent phenotypic resistance mechanisms of pathogenic Vibrio species.”

      On Page 2, line 34: The abstract contains some undefined abbreviations, such as 'PE' and 'PG', which should be explained. 

      We appreciate this comment! We explain the PE and PG in the revised abstract as following:

      “phosphatidylethanolamine (PE) biosynthesis is reduced and phosphatidylglycerol (PG)”

      On Page 2, line 31-32: For the statement "Exogenous supplementation of fatty acids confirm the role of fatty acids in antibiotic resistance…" it would be beneficial to specify whether the fatty acids were saturated or unsaturated. 

      Response, We appreciate this comment! We revise the sentence as following:

      “Exogenous supplementation of unsaturated and saturated fatty acids increased and decreased bacterial susceptibility to antibiotics, respectively, confirming the role of fatty acids in antibiotic resistance.”

      The potential effects of the specific ions (SO4 and Cl2) present in the Mg2SO4 and MgCl2 compounds used in the study were not discussed. It would be useful to understand if these ions had any influence on the observed outcomes.

      We appreciate this comment! We revise the sentence as following:

      “However, the MIC for BLFX was higher in ASWT medium supplemented with Mg2SO4 or MgCl2 than in LB medium (Fig 1B). And Mg2SO4 or MgCl2 had no

      difference on MIC, suggesting it is Mg2+ not other ions contribute to the MIC change.”

      On Page 8, line 141: The heading of Figure 2, "Mg2+ elevates intracellular Mg2+," seems redundant and could be revised for clarity or modified. 

      We appreciate this comment! Figure 2 is now moved to supplementary figure as Suppl. Fig 2. The title is revised as following:

      “Figure 2. Mg2+ decreases balofloxacin uptake.”

      On Page 4, line 91: some terms/abbreviations, such as 'LB' and 'M9,' require expansion or definition to ensure the reader's understanding.

      We appreciate this comment! We include the expansion for LB and M9 in the  revised manuscript as following:

      “Luria-Bertani medium (LB medium)” and “M9 minimal medium (M9 medium)”

      Page 4, line 92: The real seawater composition used in the experiments should be supported by a reference.

      We appreciate this comment! We provide the reference in the revised manuscript as following:

      ““artificial seawater” (ASWT) medium that included the major ion species in marine water (Wilson, 1975) (LB medium plus 210 mM NaCl, 35 mM Mg2SO4, 7 mM KCl, and 7 mM CaCl2)”

      Page 4 line, number 93: the he full names of the bacterial strains (e.g., ATCC33787 and VP01) should be provided instead of just the strain numbers.

      We appreciate this comment! We revised the sentence as following:

      “To investigate whether this mineral impacts antibiotic activity, the minimal inhibitory concentration (MIC) of V. alginolyticus ATCC33787 and V. parahaemolyticus VP01, which we referred as ATCC33787 and VP01 afterwards,”

      Finally, there appears to be a potential contradiction between the statements on page 12, lines 211-212 and 214-216, regarding the effects of Mg2+ on the synthesis of unsaturated fatty acids. Further explanation may be needed to reconcile these seemingly contradictory points.

      We appreciate this comment! For line 221-226, which was previously line 211-212, is about the gene expression for fatty acid biosynthesis. While, Line 228 and 233, which was previously line 214-216 is about the gene expression for fatty acid degradation. We agree that the previous description is a little bit confuse. We revise the sentence to emphasize that we focus on fatty acid degradation so that the readers can tell them apart. 

      In the text, we revised it as following:

      “In addition, we also quantified gene expression during fatty acid degradation to determine whether Mg2+ affects this process”  In the figure legend, we also indicate that 

      “H. qRT-PCR for the expression of genes encoding fatty acid degradation in the absence or presence of the indicated concentrations of MgCl2”

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02648

      Corresponding author(s): Kevin Berthenet (kevin.berthenet@lyon.unicancer.fr) and Gabriel Ichim (gabriel.ichim@lyon.unicancer.fr)

      1. General Statements

      We thank all the reviewers for their time and their constructive criticism, based on which we propose the revision plan detailed bellow. All our responses are indicated in italics font. When is the case, the figures for the reviewers are included just below the answer. Only where indicated they have been included in the manuscript. The line numbers indicated here refer to those in original manuscript.

      The two reviews are listed in full at the end of the document.

      2. Description of the planned revisions

      Reviewer #1

      In this manuscript, the authors report a non-apoptotic role for caspase 3 in promoting cell migration. RNA sequencing revealed a "gene signature" associated with caspase 3 knockdown in a melanoma cell line, although there is no investigation of the connection between caspase 3 expression and the regulation of gene expression. Mass spectrometry-based experiments (AP-MS and BioID) identified numerous interacting proteins, with coronin 1B being the most extensively characterized. Data provided indicates that there is a direct interaction between caspase 3 and coronin 1B, and that caspase 3 influences coronin 1B phosphorylation basally and following ligand stimulation. Both proteins are required for efficient cell migration in scratch wound assays. Data is provided indicating that the actions of caspase 3 are independent of proteolytic activity, although the pharmacological inhibition of caspase activity is not complete, nor is the knockdown of BAX/BAK, making these conclusions poorly substantiated. Evaluation of pathways regulating caspase 3 expression implicates the SP1 transcription factor.

      Response: We thank the reviewer for their supportive comment. Regarding specific pharmacological inhibition of caspase-3, work is under way to complement the results obtained with a pan-caspase inhibitor (qVD-OPh). We will use specific effector caspases inhibitors, complemented by several other approaches: complete KO of BAX and BAK proteins to prevent all eventual mitochondrial permeabilization and low-level effector caspase activation, overexpression (OE) of the anti-apoptotic protein BCL-xL to also prevent residual mitochondrial permeabilization, while also OE XIAP, a potent caspase inhibitor. The promising preliminary data using two effector caspases specific inhibitors (Ac-DEVD-CHO and Ac-DNLD-CHO) in two different melanoma cells, during wound healing migration, is shown below, with no effect on melanoma cell migration.

      Line 129 - The data in Sup. Fig. 1H-L are technical, but where are the mass spectrometry results from the BioID2 experiments? These technical figures are really only relevant if the BioID2 system has been used for protein pull-downs, not for the IF analysis in Fig. 2B.

      Response: We apologize for lack of precision in the article logical flow, we will now incorporate the MS data based on the BioID2 experiment earlier in the manuscript.

      Line 143 - Figure 2C - it is not entirely convincing that caspase 7 is not associated with the cytoskeleton, there is a visible band in lysates from both cell lines, in contrast with GAPDH which is convincingly cytoplasmic. This is particularly true in the WM852 cell lines, in which the Caspase 3 band is almost the same as Caspase 7. These results would also be more convincing if there was IF of Caspase 7 and actin to show whether it is or is not enriched in regions of higher F-actin levels.

      Response: Indeed, our data points towards an enrichment of caspase-3 at the cell cortex. Since generally caspase-7 protein levels are lower, we detected it less in the cytosolic fraction. As suggested, now we performed more sensitive IF colocalization confocal imaging between caspase-7 and F-actin and find it also partially localized to the cortical cytoskeleton (see below). However, this effector caspase is not involved in melanoma cell migration (see wound healing assay below, with two different siRNAs for CASP7 and the positive control of siRNA CASP3).

      Figure 2D - knockdowns with only a single siRNA are insufficient, this should be replicated with additional siRNAs. In addition to the effect on actin anisotropy, it appears as though cells are smaller, is this and any other morphological changes reproducible?

      Response: We plan to strengthen the data shown in Fig.2D with additional siRNAs, as shown below. In addition, high-content screening (HCS) microscopy will provide several other cell morphology descriptors.

      Figure 2D-E. Is it cytochalasin B or D used in these experiments? The text and figures don't agree with each other. 5. Figure 2F-G, same comments above for 2D-E (i.e. comments 3 & 4).

      Response: The experimental conditions will be better detailed in the revised manuscript.

      Figure 2F-G, it appears as though the fewer focal adhesions in the Caspase 3 knockdown cells are bigger per focal adhesion, is this a consistent result? If so, what is the explanation?

      Response: In addition to number, we also plan to quantify the size of focal adhesions.

      Figure 2H - it's not clear how this RNAseq data is relevant to the manuscript. There are some genes in the heat map, but it's not clear which ones are changed in their expression in the caspase 3 knockdown cells, nor is it clear how this is relevant to the proposed mechanisms of Caspase 3 interacting with and influencing the phosphorylation of coronin 1B. If there is no connection, then these data can be removed.* *

      Response: As suggested by the reviewer, the RNAseq data presented in Figure 2H will be removed from the revised manuscript since it is not very relevant.

      Supp. Figure 3 - given that there is data from multiple siRNAs for the incucyte migration data, it should be in the primary figures. And since there are multiple siRNAs and CRISPR/Cas9 KO cells, there should be nothing limiting the replication of the other data presented from only a single siRNA.

      Response: Several siRNA are now used for replicating key results as shown above.

      Figure 3A - how was cell adhesion measured? The methods section says "cell adhesion was determined through cell shape analysis and scoring" But this is very vague.

      Response: We thank the reviewer for spotting out this ambiguity, in the revised manuscript we will be more precise in Material and Methods section.

      Figure 3L - was the Casp7 knockdown experiments done with multiple siRNAs? Both melanoma cell lines? Why is this figure only shown out to 24 hours, whereas the other Incucyte experiment run out to 48 hours? Where is the western blot confirming the caspase 7 knockdown? This is important to establish a clear lack of effect.

      Response*: We apologize for lacking more details, we now provide several siRNA for caspase-7, all showing no or minimal effect of melanoma cell migration (see answer to point 2). *

      Line 190 - it is not true to say that in the presence of QVD there is no longer any caspase activity induced by actinomycin D/ABT263 in supplemental Figures 3J-K. The way that the Y axis has been broken diminishes the difference between untreated and treated cells. In fact, there is apparently over 3-4 times more caspase activity in the actinomycin D/ABT263 treated cells in the presence of QVD relative to basal caspase activity. As a result, it cannot be concluded that there is no residual caspase activity.

      Response: We were not precise enough in describing the data in S3J-K. In the revised manuscript we will clearly say that since treatment with a pan-caspase inhibitor does not have the effect of lowering any basal caspase activity (column 1 versus 2), we conclude that in melanoma cells (WM793 and WM852) there is no basal caspase activation that could drive cell motility. The ActD/ABT263 treatment was used as positive control for bona fide induction of effector caspase activation. These results will be complemented by BAX/BAK DKO and BCLxL OE.

      Line 192 - Does the knockout of BAX/BAK (which apparently reduced but did not eliminate BAX/BAK protein levels in Supp. Fig. 3L) actually "completely block" caspase activity via the mitochondrial pathway? This has not been demonstrated.

      Response: We now provide a fluorometric effector caspases assay showing abrogation of caspase activity in BAX/BAK DKO cells (see below, caspase activating treatment is ActinomycinD plus ABT263). In addition, we will improve the DKO efficacy.

      Line 217 - coronin 1B was a hit from which assays? IP-MS and/or BioID2? I see that this is shown in Figure 5A but not referenced in this sentence.

      Line 218 - the reference to Figure 5A should be in the previous sentence. Line 220 - Can it really be said that the interaction is specific since there is a coronin 1B band in the GFP "negative" control?__ __

      Response*: The revised manuscript will address these inadequacies. *

      Line 222 - it is a good control to show that siRNA-knockdown of Caspase 3 reduced the PLA signal in Figure 5C, but the reciprocal experiment of looking at what happens with Coronin 1B knockdown should be included. How does the PLA signal relate to phalloidin-stained F-actin?

      Response: The proximity ligation assay (PLA) is now complemented by KD of Coronin 1B (see below) and we will try to also add the phalloidin staining for F-actin, if compatible with the PLA protocol.

      Line 224 - looking at the line scans, is the lack of recruitment of coronin 1B to the F-actin at the edge of the protrusion in the Caspase 3 knockdown cells reproducible? Is the point that caspase 3 recruits Coronin 1B? There is an obvious difference in the F-actin at the cell edge, but if the F-actin were as dense in the Caspase 3 knockdown cells as they are for the control, would the same lack of coronin 1B be apparent?

      Response: This aspect will be better addressed/discussed in the revised manuscript.

      Line 227 - where is the western blot showing the effectiveness of the coronin 1B knockdown to accompany Figure 5F.

      Response: The efficacy of coronin 1B KD will be added in the revised manuscript.

      Figure 5G - the blots indicate that there is no change in phospho-PKCalpha in the caspase 3 knockdown cells, although phospho-coronin 1B does decrease. This has not been commented upon in the text. Is the implication that there is a non-PKCalpha mediated mechanism for coronin 1B phosphorylation that is dependent on caspase 3?

      Figure 5H - following from the previous point, there is no phospho-PKCalpha blot that would be a positive control for the effect of PDGF stimulation on PKC activation, in control and caspase 3 knockdown cells, to evaluate whether the effect on coronin 1B phosphorylation was upstream or downstream of PKCalpha. This is also true for Supp. Fig. 4H.

      Response*: Since there are several PKC isoforms that might be co-expressed in melanoma cells, it is possible that PKCalpha is not the one responsible for phosphorylating Coronin 1B. We will be more precise in our investigations by using a pan-phospho-PKC antibody. *

      Does phosphorylation of coronin 1B affect its interaction with caspase 3?

      Response: We will assess by Co-IP the interaction of caspase-3 with both non-phosphorylated and phosphorylated Coronin 1B.

      Figure 6 - as before, only a single siRNA to knockdown SP1 is insufficient to robustly support the conclusions.

      Response: As shown below, we addressed this helpful comment by using several siRNAs to assess the role of SP1 in melanoma cell motility, in two different melanoma cell lines.

      • *

      Reviewer #2

      In this manuscript, the authors provide substantial amounts of experimental evidence that caspase-3, more precisely pro-caspase-3, might be involved in promoting melanoma cell migration and invasion. As such, this function, which might stem from scaffolding roles independent of proteolytic activity (yet not shown entirely convincingly), could possibly be similar to those attributed to other caspases, yet the latter omitted experiments testing for the necessity of enzyme activity. The data are novel and interesting and obviously deserve publication. Yet, a number of criticisms need to be listed.

      Response*: We thank the reviewer for upholding the novelty of our study. As also rightfully pointed by R1, we will strive in a revised manuscript to definitely show that caspase-3 participate to melanoma cell motility independently of its pro-apoptotic protease role: we will use two effector caspases specific inhibitors (Ac-DEVD-CHO and Ac-DNLD-CHO, as shown above) complemented by several other approaches: complete KO of BAX and BAK protein to prevent all eventual mitochondrial permeabilization and low-level effector caspase activation, OE of the anti-apoptotic protein BCL-xL to also prevent residual mitochondrial permeabilization, while also OE XIAP, a potent caspase inhibitor. *

      • *

      • First and foremost, I don't seem to find ethical approval information on the animal experiments. While I do not work with zebrafish myself, I am also somewhat concerned by the size of tumours seen in some of the depicted fish. It is highly important that appropriate information in this direction, including possible endpoints, is provided. Response*: We completely agree with the reviewer, yet the ethical approval is already provided in the manuscript (line 588) and will be complemented by adding the endpoints. *

      The second major issue lies in figure 1. The figure as a whole seems to be very much forced to support or motivate later experimental findings. The authors lack sufficient clarity on some of the approaches and seem to judge on the data to a good bit as they see fit. (…)

      I´d suggest to largely take out Fig1 in its current form, spend time on properly describing any analysis of public data, carefully interpret these and move them probably to the end of the results. Currently, it just leaves the impression that the data were pushed as hard as possible to promote the good work that follows.

      Response*: We will carefully consider the reviewer’s comments and rework the bioinformatics analysis presented in Figure 1 (and associated supplementary figure), making sure we will present certain data as correlation (and not causality) and go into more details on the physio-pathological features of melanoma patients with low/high caspase-3 expression. *

      • *

      The text on line 129ff seems to have omitted any outcomes from the Suppl. Fig1H-L. What was found and what are we supposed to learn from this?

      Response: We apologize for lack of precision in the article logical flow, we will now incorporate the MS data based on the BioID2 experiment earlier in the manuscript.* *

      Lines 146/147 state similar effects upon CASP3 depletion and cytochalasin D. I cannot make that out from Fig.2D. Can you be more specific or visualize this better?

      Response: We will fix this by including zoomed and detailed images of individual cells.

      • Is it possible to state whether effects such as in Fig.3B are general rather than showing just 1 cell?

      Response: The defects in cell adhesion for caspase-3-depleted cells are quantified in Figure 3A. Moreover, we will add representative images.

      • *

      It is unclear how the genes in Fig.2H were defined and why would all of these differ (unless this was an inclusion criterion for the panel). Are these considered to be downstream of CASP3 somehow? I don't fully get the message here. Is this panel even required here?

      Response: As it brings little information, panel 2H will be excluded from the revised manuscript.

      To fully prove independence of caspase-3 activity, it would be appropriate to k/o caspase-3 to then reconstitute the cells with inactive caspase-3.

      • *

      Response: We will try our best of addressing this comment in the revised manuscript.

      Fig.4C and associated text: Statements on changes in tumor size cannot be made from data on tumor free survival.

      Response: We apologize for the misleading data interpretation; this will be tuned down in a revised manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, the authors report a non-apoptotic role for caspase 3 in promoting cell migration. RNA sequencing revealed a "gene signature" associated with caspase 3 knockdown in a melanoma cell line, although there is no investigation of the connection between caspase 3 expression and the regulation of gene expression. Mass spectrometry-based experiments (AP-MS and BioID) identified numerous interacting proteins, with coronin 1B being the most extensively characterized. Data provided indicates that there is a direct interaction between caspase 3 and coronin 1B, and that caspase 3 influences coronin 1B phosphorylation basally and following ligand stimulation. Both proteins are required for efficient cell migration in scratch wound assays. Data is provided indicating that the actions of caspase 3 are independent of proteolytic activity, although the pharmacological inhibition of caspase activity is not complete, nor is the knockdown of BAX/BAK, making these conclusions poorly substantiated. Evaluation of pathways regulating caspase 3 expression implicates the SP1 transcription factor.

      Major comments:

      1. Line 129 - The data in Sup. Fig. 1H-L are technical, but where are the mass spectrometry results from the BioID2 experiments? These technical figures are really only relevant if the BioID2 system has been used for protein pull-downs, not for the IF analysis in Fig. 2B.
      2. Line 143 - Figure 2C - it is not entirely convincing that caspase 7 is not associated with the cytoskeleton, there is a visible band in lysates from both cell lines, in contrast with GAPDH which is convincingly cytoplasmic. This is particularly true in the WM852 cell lines, in which the Caspase 3 band is almost the same as Caspase 7. These results would also be more convincing if there was IF of Caspase 7 and actin to show whether it is or is not enriched in regions of higher F-actin levels.
      3. Figure 2D - knockdowns with only a single siRNA are insufficient, this should be replicated with additional siRNAs. In addition to the effect on actin anisotropy, it appears as though cells are smaller, is this and any other morphological changes reproducible?
      4. Figure 2D-E. Is it cytochalasin B or D used in these experiments? The text and figures don't agree with each other.
      5. Figure 2F-G, same comments above for 2D-E (i.e. comments 3 & 4).
      6. Figure 2F-G, it appears as though the fewer focal adhesions in the Caspase 3 knockdown cells are bigger per focal adhesion, is this a consistent result? If so, what is the explanation?
      7. Figure 2H - it's not clear how this RNAseq data is relevant to the manuscript. There are some genes in the heat map, but it's not clear which ones are changed in their expression in the caspase 3 knockdown cells, nor is it clear how this is relevant to the proposed mechanisms of Caspase 3 interacting with and influencing the phosphorylation of coronin 1B. If there is no connection, then these data can be removed.
      8. Supp. Figure 3 - given that there is data from multiple siRNAs for the incucyte migration data, it should be in the primary figures. And since there are multiple siRNAs and CRISPR/Cas9 KO cells, there should be nothing limiting the replication of the other data presented from only a single siRNA.
      9. Figure 3A - how was cell adhesion measured? The methods section says "cell adhesion was determined through cell shape analysis and scoring" But this is very vague.
      10. Figure 3L - was the Casp7 knockdown experiments done with multiple siRNAs? Both melanoma cell lines? Why is this figure only shown out to 24 hours, whereas the other Incucyte experiment run out to 48 hours? Where is the western blot confirming the caspase 7 knockdown? This is important to establish a clear lack of effect.
      11. Line 190 - it is not true to say that in the presence of QVD there is no longer any caspase activity induced by actinomycin D/ABT263 in supplemental Figures 3J-K. The way that the Y axis has been broken diminishes the difference between untreated and treated cells. In fact, there is apparently over 3-4 times more caspase activity in the actinomycin D/ABT263 treated cells in the presence of QVD relative to basal caspase activity. As a result, it cannot be concluded that there is no residual caspase activity.
      12. Line 192 - Does the knockout of BAX/BAK (which apparently reduced but did not eliminate BAX/BAK protein levels in Supp. Fig. 3L) actually "completely block" caspase activity via the mitochondrial pathway? This has not been demonstrated.
      13. Line 217 - coronin 1B was a hit from which assays? IP-MS and/or BioID2? I see that this is shown in Figure 5A but not referenced in this sentence.
      14. Line 218 - the reference to Figure 5A should be in the previous sentence.
      15. Line 220 - Can it really be said that the interaction is specific since there is a coronin 1B band in the GFP "negative" control?
      16. Line 222 - it is a good control to show that siRNA-knockdown of Caspase 3 reduced the PLA signal in Figure 5C, but the reciprocal experiment of looking at what happens with Coronin 1B knockdown should be included. How does the PLA signal relate to phalloidin-stained F-actin?
      17. Line 224 - looking at the line scans, is the lack of recruitment of coronin 1B to the F-actin at the edge of the protrusion in the Caspase 3 knockdown cells reproducible? Is the point that caspase 3 recruits Coronin 1B? There is an obvious difference in the F-actin at the cell edge, but if the F-actin were as dense in the Caspase 3 knockdown cells as they are for the control, would the same lack of coronin 1B be apparent?
      18. Line 227 - where is the western blot showing the effectiveness of the coronin 1B knockdown to accompany Figure 5F?
      19. Figure 5G - the blots indicate that there is no change in phospho-PKCalpha in the caspase 3 knockdown cells, although phospho-coronin 1B does decrease. This has not been commented upon in the text. Is the implication that there is a non-PKCalpha mediated mechanism for coronin 1B phosphorylation that is dependent on caspase 3?
      20. Figure 5H - following from the previous point, there is no phospho-PKCalpha blot that would be a positive control for the effect of PDGF stimulation on PKC activation, in control and caspase 3 knockdown cells, to evaluate whether the effect on coronin 1B phosphorylation was upstream or downstream of PKCalpha. This is also true for Supp. Fig. 4H.
      21. Does phosphorylation of coronin 1B affect its interaction with caspase 3?
      22. Figure 6 - as before, only a single siRNA to knockdown SP1 is insufficient to robustly support the conclusions.

      Minor comments:

      1. Figure 2C - all caps for CASP7
      2. Figures 2D,F - Cytochalsin
      3. Figure 2H, the labelling of gene names is too small to read.
      4. Supplemental Fig 1A - why is A375 here? Why plot a graph and not just write a percentage protein remaining under the figure? There are no errors indicated, so presumably this is N = 1.
      5. Line 127 - smal

      Significance

      The manuscript is interesting and novel, making it relevant for a broad basic research audience. The role of caspase 3 in non-apoptotic biological processes is not extensively characterized, making this study an advance in the field. The methods are appropriate and well-executed. The statistical methods are mostly appropriate, although some assays (e.g. wound healing assays) do not have associated statistical analysis. Most of the conclusions are adequately substantiated by the results, but as indicated above and in the points below, this is not entirely consistent. There is an issue with only a single siRNA being used in several experiments that should be addressed.

    1. Résumé de la vidéo [00:00:00][^1^][1] - [01:27:29][^2^][2]:

      Cette vidéo présente l'Agence de Conseil Interne de l'État, ses missions, méthodes et projets. Elle met l'accent sur la transformation publique et l'amélioration des politiques prioritaires du gouvernement.

      Points forts : + [00:00:00][^3^][3] Introduction de l'agence * Lancement officiel le 26 mars * Objectifs : transformation publique et politiques prioritaires * Présentation des missions et méthodes + [00:01:16][^4^][4] Enjeux principaux * Améliorer l'efficacité des politiques publiques * Améliorer la qualité de service aux usagers * Améliorer la performance des organisations + [00:04:01][^5^][5] Méthodes et interventions * Approches sur mesure selon les besoins * Utilisation des compétences des consultants * Mobilisation des méthodes et outils d'analyse stratégique + [00:05:58][^6^][6] Exemples de projets * Réforme des lycées professionnels * Pack nouveau départ pour les femmes victimes de violence conjugale * Expérimentations et outils opérationnels + [00:47:45][^7^][7] Résultats et impacts * Simplification des processus * Amélioration de la qualité de service * Mesure des impacts et déploiement des méthodes

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      In this paper Homan et al used mouse models of Metabolic Dysfunction-Associated Steatotic Liver Disease and different specific target deletions in cells to rule out the role of Complement 3a Receptor 1 in the pathogenesis of disease. They provided limited evidence and only descriptive results that despite C3aR being relevant in different contexts of inflammation, however, these tenets did not hold true.

      Weaknesses:

      (1) The results are based on readouts showing that C3aR is not involved in the pathogenesis of liver metabolic disease.

      (2) The description of the mouse models they used to validate their findings is not clear. Lysm-cre mice - which are claimed to delete C3aR in (?) macrophages are not specific for these cells, and the genetic strategy to delete C3aR in Kupffer cells is not clear.

      (3) Taking this into account, it is very challenging to determine the validity of these data, also considering that they are merely descriptive and correlative.

      We generated 2 different cohorts of mice using LysM-Cre (Jackson Strain #004781) to drive deletion in all macrophages and Clec4f-Cre (Jackson Strain #033296) to specifically ablate C3ar1 in Kupffer cells. These experimental models have been clearly defined in the revised manuscript on pages 5 and 7 and in the methods section (page 10). The reviewer’s point is well taken that the LysM-Cre transgene can also be active in granulocytes and some dendritic cells. Even so, despite deletion of C3ar1 in macrophages and other granulocytes, we do not see a major effect on hepatic steatosis and fibrosis in this GAN diet induced model of MASLD/MASH. This was a somewhat surprising finding. We do not agree that our findings are correlative. We specifically ablated C3aR1 in macrophages or Kupffer cells and found no significant differences in the major readouts of steatosis and fibrosis for MASLD/MASH between control and knockout mice. It is possible that in other models of liver injury that we did not test (e.g., short-term treatment with a hepatotoxin such as carbon tetrachloride), there may be differences in liver injury in mice lacking C3ar1 in macrophages, but the GAN diet model has been shown to better parallel the gene expression changes in human MAFLD/MASH. This has been added to the discussion (page 9).

      Reviewer #2 (Public review):

      Summary:

      Homan et al. examined the effect of macrophage- or Kupffer cell-specific C3aR1 KO on MASLD/MASH-related metabolic or liver phenotypes.

      Strengths:

      Established macrophage- or Kupffer cell-specific C3aR1 KO mice.

      Weaknesses:

      Lack of in-depth study; flaws in comparisons between KC-specific C3aR1KO and WT in the context of MASLD/MASH, because MASLD/MASH WT mice likely have a low abundance of C3aR1 on KCs.

      Homan et al. reported a set of observation data from macrophage or Kupffer cell-specific C3aR1KO mice. Several questions and concerns as follows could challenge the conclusions of this study:

      (1) As C3aR1 is robustly repressed in MASLD or MASH liver, GAN feeding likely reduced C3aR1 abundance in the liver of WT mice. Thus, it is not surprising that there were no significant differences in liver phenotypes between WT vs. C3aR1KO mice after prolonged GAN diet feeding. It would give more significance to the study if restoring C3aR1 abundance in KCs in the context of MASLD/MASH.

      GAN diet feeding resulted in higher liver C3ar1 compared to regular diet (Figure 1H). This thus became an impetus for studying the effects of C3ar1 deletion in macrophages or Kupffer cells, which are responsible for the majority of liver C3ar1 expression, in MASLD/MASH (Figures 2B and 3H). This point has been added to the text on page 5.

      (2) Would C3aR1KO mice develop liver abnormalities after a short period of GAN diet feeding?

      We did not assess if short term GAN diet feeding resulted in significant differences in liver abnormalities in the C3ar1 macrophage or Kupffer cell knockout mice. Perhaps the reviewer’s point is that perhaps with shorter periods of GAN diet feeding there may be a phenotype in the KO mice. We agree that this is entirely possible, though with shorter feeding timeframes what is typically seen is hepatic steatosis without fibrosis. Nevertheless, the most important element in our opinion for a disease preventing or modifying model lies with the longer-term GAN diet feeding. With long term GAN diet feeding that has been previously shown to model human MASLD/MASH, we did not observe significant differences in liver abnormalities with the KO mice. This has been added to the discussion (page 8).

      (3) What would be the liver macrophage phenotypes in WT vs C3aR1KO mice after GAN feeding?

      Similar to the above point, given the lack of a major MASLD/MASH phenotype in hepatic steatosis and fibrosis, we did not further profile the liver macrophage profiles of the macrophage or Kupffer cell C3ar1 KO mice with GAN feeding.

      (4) In Fig 1D, >25wks GAN feeding had minimal effects on female body weight gain. These GAN-fed female mice also develop NASLD/MASH liver abnormalities?

      We thank the reviewer for this question. In general, female GAN-fed mice develop milder MASLD/MASH abnormalities. We have included additional data in the revised manuscript in Figure S4. These results show no to minimal development of a MASLD/MASH gene signature.

      (5) Would C3aR1KO result in differences in liver phenotypes, including macrophage population/activation, liver inflammation, lipogenesis, in lean mice?

      We have provided additional data further characterizing liver inflammation, lipogenesis and macrophages in macrophage C3ar1 KO mice under lean/regular diet conditions in Figure 2K. These results show a potential trend but no substantial development of a MASLD/MASH gene signature.

      (6) The authors should provide more information regarding the generation of KC-specific C3aR1KO. Which Cre mice were used to breed with C3aR1 flox mice?

      Clec4f-Cre transgenic mice were used to generate Kupffer cell specific KO of C3ar1. This has been clarified and explicitly stated in the revised manuscript on page 7 and in the methods section.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      These data should be repeated using a more established model of Kupffer cell target deletion via Clec4-F mice.

      Our data with Kupffer cell C3ar1 deletion is indeed done with Clec4f-Cre transgenic mice. This has been clarified in the revised manuscript on page 7 and in the methods section.

      Reviewer #2 (Recommendations for the authors):

      (1) Typo: "iver" in the abstract

      (2) Line 97, "GAN diet I" should be "GAN diet"?

      These points have been corrected in the revised manuscript.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review): 

      Summary: 

      Recent years have seen spectacular and controversial claims that loss of function of the RNA splicing factor Ptbp1 can efficiently reprogram astrocytes into functional neurons that can rescue motor defects seen in 6-hydroxydopamine (6-OHDA)-induced mouse models of Parkinson's disease (PD). This latest study is one of a series that fails to reproduce these observations, but remarkably also reports that neuronal-specific loss of function of Ptbp1 both induces expression of dopaminergic neuronal markers in striatal neurons and rescues motor defects seen in 6-OHDA-treated mice. The claims, if replicated, are remarkable and identify a straightforward and potentially translationally relevant mechanism for treating motor defects seen in PD models. However, while the reported behavioral effects are strong and were collected without sample exclusion, other claims made here are less convincing. In particular, no evidence that Ptbp1 loss of function actually occurs in striatal neurons is provided, and the immunostaining data used to claim that dopaminergic markers are induced in striatal neurons is not convincing. Furthermore, no characterization of the molecular identity of Ptbp1-deficient striatal neurons is provided using single-cell RNA-Seq or spatial transcriptomics, making it difficult to conclude that these cells are indeed adopting a dopaminergic phenotype. 

      Overall, while the claims of behavioral rescue of 6-OHDA-treated mice appear compelling, it is essential that these be independently replicated as soon as possible before further studies on this topic are carried out. Insights into the molecular mechanisms by which neuronalspecific loss of function of Ptbp1 induces behavioral rescue are lacking, however. Moreover, the claims of induction of neuronal identity in striatal neurons by Ptbp1 require considerable additional work to be convincing.

      We thank the reviewer for the detailed analysis of our study. Please find our answers to the points raised by the reviewer below in blue.

      Strengths of the study: 

      (1) The effect size of the behavioral rescue in the stepping and cylinder tests is strong and significant, essentially restoring 6-OHDA-lesioned mice to control levels.

      (2) Since the neurotoxic effects of 6-OHDA treatment are highly variable, the fact that all behavioral data was collected blinded and that no samples were excluded from analysis increases confidence in the accuracy of the results reported here. 

      We appreciate the reviewer’s feedback and acknowledgement of the strengths of our study. We undertook several optimization steps in the surgery, post-operative care, and handling of the animals for behavior experiments to ensure high reproducibility of our experiments.

      Weaknesses of the study:  

      (1) Neurons express relatively little Ptbp1. Indeed, cellular expression levels as measured by scRNA-Seq are substantially below those of astrocytes and other non-neuronal cell types, and Ptbp1 immunoreactivity has not been observed in either striatal or midbrain neurons (e.g. Hoang, et al. Nature 2023). This raises the question of whether any recovery of Th expression is indeed mediated by the loss of function of Ptbp1 rather than by off-target effects. AAVmediated rescue of Ptbp1 expression could help clarify this.

      In the original manuscript, we delivered control vectors that only express the ABE to 6-OHDAlesioned mice (labeled as AAV-ctrl) and did not detect TH positive cells in the midbrain or striatum of control mice or rescue of spontaneous motor skills. We can therefore exclude that the delivery procedure, AAV-PHP.eB capsid, or ABE expression caused adverse effects leading to induction of TH expression and functional rescue of spontaneous motor behaviors in PD mice. To further exclude that these effects were caused by off-target editing, we experimentally determined off-target binding sites of our sgRNA (sgRNA-ex3) using GUIDEseq and subsequently analyzed these sites in treated animals by NGS (Figure 3 – supplement 3). While two off-target sites were identified, it is unlikely that base editing at these sites caused the observed phenotypes. One off-target site was identified in the myopalladin (Mypn) gene, which encodes for a muscle-specific protein that plays a role in regulating the structure and growth of skeletal and cardiac muscle (Filomena et al., 2021, 2020).  The other site is not located in a coding region, but in an intron of the ankyrin-1 (Ank1) gene, encoding for an adaptor protein linking membrane proteins to the underlying cytoskeleton (Cunha and Mohler, 2009). Even though this gene is also expressed in neurons, base editing within this intronic region did not lead to changes in transcript levels (Figure 3 – supplement 3). Thus, the induction of TH expression upon adenine base editing with sgRNA-ex3 is likely a direct consequence of PTBP1 downregulation.

      Further supporting this conclusion, in the revised manuscript we additionally show PTBP1 downregulation at the RNA and protein level in the SNc and striatum after base editor treatment (Figure 2 – figure supplement 5; figure 3 – supplement 2).

      (2) It is not clear why dopaminergic neurons, which are not normally found in the striatum, are observed following Ptbp1 knockout. This is very similar to the now-debunked claims made in Zhou, et al. Cell 2020, but here performed using the hSyn rather than GFAP mini promoter to control AAV expression. While this is the most dramatic and potentially translationally relevant claim of the study, this claim is extremely surprising and lacks any clear mechanistic explanation for why it might happen in the first place.  

      We agree with the reviewer that our study does not provide mechanistic insights into how Ptbp1 downregulation in neurons leads to the induction of dopaminergic markers in the striatum. As we believe that this is not within the scope of a revision, we discuss potential follow-up experiments in the discussion section of the revised manuscript.

      This observation is even more surprising in light of reports that antisense oligonucleotidemediated knockdown of Ptbp1, which should have affected both neuronal and glial Ptbp1 expression, failed to induce expression of dopaminergic neuronal markers in the striatum (Chen, et al. eLife 2022). Selective loss of function of Ptbp1 in striatal and midbrain astrocytes likewise results in only modest changes in gene expression. 

      Using 6-OHDA lesioned Aldh1l1-CreERT2;Rpl22lsl-HA mice, the Chen et al. study (eLife 2022) assessed potential astrocyte to neuron conversion by quantifying the presence of HA-labeled neurons after ASO-mediated knockdown of Ptbp1. Even though they did not detect HApositive neurons in the SNc, suggesting absence of astrocyte to neuron conversion, the images in Figure 4D reveal TH positive cells in the lesioned hemisphere, similar to our observations in Figure 2B-D. While it cannot be excluded that these TH positive cells are remnants from an incomplete 6-OHDA lesion, they could also be endogenous neurons with induced expression of dopaminergic markers after ASO-mediated knockdown of Ptbp1. Furthermore, Chen et al. performed the apomorphine test to assess changes in motor skills, which did not reveal an improvement in our study either.

      It is critically important that this claim be independently replicated, and that additional data be provided to conclusively show that striatal neurons are indeed expressing dopaminergic markers.

      Our behavior and immunofluorescence experiments involving mice injected into the striatum were performed with two independently generated cohorts of 6-OHDA mice. In detail, the 6OHDA mice were generated by two independent surgeons from different labs (>6 months between experiments of these cohorts), leading to comparable behavioral outcomes before and after treatment. Subsequent behavior and immunofluorescence experiments with each cohort were performed and analyzed by two independent and blinded researchers, showing comparable results.

      (3) More generally, since multiple spectacular and irreproducible claims of single-step glial-toneuron reprogramming have appeared in high-profile journals in recent years, a consensus has emerged that it is essential to comprehensively characterize the identity of "transformed" cells using either single-cell RNA-Seq or spatial transcriptomics (e.g. Qian, et al. FEBS J 2021; Wang and Zhang, Dev Neurobiol 2022). These concerns apply equally to claims of neuronal subtype conversion such as those advanced here, and it is essential to provide these same datasets. 

      In the revised version, we have analyzed the expression of additional neuronal markers in TH positive cells of the striatum using 4i imaging. Briefly, our results showed that the vast majority of TH-expressing cells also expressed the markers DAT and NEUN, further corroborating the neuronal and dopaminergic identity of these cells. Additional analysis revealed that this TH/DAT/NEUN expressing cell population expressed markers of GABAergic neurons, either of medium spiny neurons (~50%) and various types of interneurons (~50%). While our 4i analysis has allowed us to broadly classify these TH-expressing populations, we agree that detailed transcriptional analysis at the single cell level is required to understand the molecular mechanisms underlying the generation of TH positive cells. These analyses are, however, not within the scope of a revision and would require a thorough dedicated study. We have added these results and discussion points to the revised manuscript.

      (4) Low-power images are generally lacking for immunohistochemical data shown in Figures 3 and 4, which makes interpretation difficult. DAPI images in Figure 3C do not appear nuclear. Immunostaining for Th, DAT, and Dcx in Figure 4 shows a high background and is difficult to interpret. 

      We thank the reviewer for closely evaluating these images and suggestions for improvement. In the revised manuscript, we provide low power images and higher magnification insets as requested to allow for easier interpretation.

      (5) Insights into the mechanism by which neuronal-specific loss of Ptbp1 function induces either functional recovery, or dopaminergic markers in striatal neurons, is lacking.

      In the revised manuscript, we provide a more detailed discussion of mechanisms that could potentially be involved in the functional recovery or expression of dopaminergic markers. However, deciphering the exact molecular mechanisms underlying these observations requires thorough transcriptional analysis at the single cell level, which is out of scope of this revision.

      Reviewer #2 (Public Review):

      Summary: 

      The manuscript by Bock and colleagues describes the generation of an AAV-delivered adenine base editing strategy to knockdown PTBP1 and the behavioral and neurorestorative effects of specifically knocking down striatal or nigral PTBP1 in astrocytes or neurons in a mouse model of Parkinson's disease. The authors found that knocking down PTBP1 in neurons, but not astrocytes, and in striatum, but not nigra, results in the phenotypic reorganization of neurons to TH+ cells sufficient to rescue motor phenotypes, though insufficient to normalize responses to dopaminomimetic drugs.

      Strengths: 

      The manuscript is generally well-written and adds to the growing literature challenging previous findings by Qian et al., 2020 and Zhou et al., 2020 indicating that astrocytic downregulation of PTBP1 can induce conversion to dopaminergic neurons in the midbrain and improve parkinsonian symptoms. The base editing approach is interesting and potentially more therapeutically relevant than previous approaches.

      Weaknesses: 

      The manuscript has several weaknesses in approach and interpretation. In terms of approach, the animal model utilized, the 6-OHDA model, though useful to examine dopaminergic cell loss, exhibits accelerated neurodegeneration and none of the typical pathological hallmarks (synucleinopathy, Lewy bodies, etc.) compared to the typical etiology of Parkinson's disease, limiting its translational interpretation. 

      We thank the reviewer for the detailed assessment of our study and pinpointing its current weaknesses. Please find our answers to all comments below in blue.

      We agree with the reviewer that the 6-OHDA model lacks the typical pathological hallmarks of PD. Nevertheless, we chose this model for two reasons:

      i) The 6-OHDA model was used by both Qian et al. (2020) and Zhou et al. (2020). To allow comparison of our results to these studies, it was crucial to use the same model. Notably, the 6-OHDA model was also used by Chen et al. (2022) and Hoang et al. (2023) for comparison to the two studies from 2020.

      ii) The 6-OHDA model is straightforward to generate and displays robust motor impairments for evaluation of potential therapeutic effects of neuroregeneration treatment approaches. We therefore believe that the model is well-suited to analyze the cellular and behavioral effects (specifically motor skills) of PTBP1 downregulation. 

      In future studies, it would be critical to include models that also display typical pathological hallmarks of the disease to further evaluate the therapeutic effect of this base editing approach. These experiments are, however, not within the scope of this study, which was aimed to focus on the cellular and behavioral effects of PTBP1 downregulation. 

      In addition, there is no confirmation of a neuronal or astrocytic knockdown of PTBP1 in vivo; all base editing validation experiments were completed in cell lines. 

      In the revised manuscript, we assess in vivo base editing efficiencies at the Ptbp1 target site in the SNc (AAV-hsyn, 15.6%) and striatum (AAV-hysn, 21.1%). Furthermore, we assessed in vivo Ptbp1 downregulation at the RNA and protein level to complement our in vitro data (Figure 2 – figure supplement 5; figure 3 – supplement 2).

      Finally, it is unclear why the base editing approach was used to induce loss-of-function rather than a cell-type specific knockout, if the goal is to assess the effects of PTBP1 loss in specific neurons. 

      We expressed base editors under cell-type specific promoter to induce a reliable loss-offunction mutation at the Ptbp1 exon-intron junction in neurons or astrocytes. Performing these mutations with Cas9 nucleases instead would have had potential limitations and risks, including i) indel mutations do not always lead to a frameshift and loss-of-function despite high indel formation at the targeted site, ii) nucleases induce DNA double strand breaks, which can have serious side effects (e.g. chromosomal rearrangements or translocations), and iii) ‘mosaicisms’ as edited cells contain different indel mutations, which may result in different effects and thus complicate analysis of the downstream effects. We discuss these points in the revised manuscript.  

      In terms of interpretation, the conclusion by the authors that PTBP1 knockdown has little likelihood to be therapeutically relevant seems overstated, particularly since they did observe a beneficial effect on motor behavior. We know that in PD, patients often display negligible symptoms until 50-70% of dopaminergic input to the striatum is lost, due to compensatory activity of remaining dopaminergic cells. Presumably, a small recovery of dopaminergic neurons would have an outsized effect on motor ability and may improve the efficacy of dopaminergic drugs, particularly levodopa, at lower doses, averting many problematic side effects. Since striatal dopamine was assessed by whole-tissue analysis, which is not necessarily reflective of synaptic dopamine availability, it is difficult to assess whether the ~10% increase in TH+ cells in the striatum was sufficient to improve dopamine function. However, the improvement in motor activity suggests that it was.

      As pointed out by the reviewer, it is difficult to estimate the therapeutic effect and importance of a ~10% increase in TH+ cells for PD patient. Guided by the reviewer’s suggestion, we have included a more in-depth discussion of our results and its potential therapeutic value as well as outstanding questions for future studies in the revised manuscript.

      Reviewer #3 (Public Review):

      This study explores the use of an adenine base editing strategy to knock down PTBP1 in astrocytes and neurons of a Parkinson's disease mouse model, as a potential AAV-BE therapy. The results indicate that editing Ptbp1 in neurons, but not astrocytes, leads to the formation of tyrosine hydroxylase (TH)+ cells, rescuing some motor symptoms.

      Several aspects of the manuscript stand out positively. Firstly, the clarity of the presentation. The authors communicate their ideas and findings in a clear and understandable manner, making it easier for readers to follow. 

      The Materials and methods section is well-elaborated, providing sufficient detail for reproducibility. 

      The logical flow of the manuscript makes sense, with each section building upon the previous one coherently.

      The ABE strategy employed by the authors appears sound, and the manuscript presents a coherent and well-supported argument.

      Positively, some of the data in this study effectively counteracts previous work in line with more recent publications, demonstrating the authors' ability to contribute to the ongoing conversation in the field.

      We thank the reviewer for appreciating the effort we have put into this study. Please find below a point-by-point reply to the weaknesses raised by the reviewer. 

      However, while the in vitro data yields promising results, it may have been overly optimistic to assume that the efficiencies observed in dividing cells will directly translate to in vivo conditions. This consideration is important given the added complexities of vector optimization, different cell types targeted in vitro versus in vivo, as well as unknown intrinsic limitations of the base editing technology. 

      We agree with the reviewer that in vitro base editing efficiencies might not directly translate to in vivo editing outcomes. We therefore assessed in vivo base editing efficiencies at the Ptbp1 locus and PTBP1 downregulation in the striatum and midbrain. Our data revealed that in vivo base editing activity was lower than in our in vitro setting (in vitro: Figure 1; figure 1 – figure supplement 2; in vivo: figure 2 – figure supplement 5; figure 3 – supplement 2). However, we believe that these rates are slightly underestimated since we sequenced DNA isolated from the whole tissue (striatum or SNc) and not from purified astrocytes or neurons. Moreover, we could demonstrate that editing led to a reduction of Ptbp1 transcript and PTBP1 protein level (Figure 2 – figure supplement 5; figure 3 – supplement 2).

      In addition, certain aspects of the manuscript would benefit from a more in-depth and comprehensive discussion rather than being only briefly touched upon. Such a discussion would enhance the relevance of the obtained results and provide the foundation for improvement when using similar approaches.

      Following the reviewer’s suggestion, we included a more in-depth discussion of our results in the revised manuscript.

      Recommendations for the authors:

      Reviewing Editor (Recommendations for the Authors):

      A summary of key recommendations that might improve the eLife assessment in a subsequent submission are provided below, as a guide to help the authors focus on changes that might enhance the strength of evidence (e.g., from "incomplete" to "solid").

      (1) Provide further explanation of the mechanistic relationship between the downregulation of Ptbp1 and TH+ dopaminergic neuron reprogramming. Additional discussion of this topic should also be included.

      (2) Demonstrate proof of editing in the intended targeted cells in vitro and/or in vivo.

      (3) Show evidence of successful Base Editor delivery in vivo.

      (4) Perform a deeper characterization of TH+ cells in vivo and provide a more thorough discussion of the identity of the targeted cells. This may include an exploration of whether TH+ cells detected are TH+ interneurons and/or establish their identity based on transcriptomics or a similar approach.

      (5) Provide better-quality representative images supporting the quantitative data.

      (6) Please include full statistical reporting including exact p-values wherever possible alongside the summary statistics (test statistic and df) and 95% confidence intervals. These should be reported for all key questions and not only when the p-value is less than 0.05 in the main manuscript.

      In the revised manuscript, we provided 1) suggestions of the mechanistic relationship between Ptbp1 knockdown, dopamine synthesis, and the functional rescue of spontaneous behaviors, 2) proof of in vivo base editing and successful base editor delivery, 3) deeper characterization of TH-expressing cells in vivo using 4i imaging, 4) better quality images, and 5) full statistical reporting.  

      Individual Reviewer recommendations for the authors are included below.

      Reviewer #1 (Recommendations For The Authors):

      Confirm loss of Ptbp1 function in infected striatal neurons. Single-cell RNA-Seq or spatial transcriptomic analysis must be performed to characterize the identity of the edited striatal neurons. The quality of the immunostaining in Figures 3 and 4 needs to be improved, and lowpower images provided. Were eLife a conventional journal, I would have insisted on all these being included prior to publication. Please also arrange for independent replication of the behavioral rescue and induction of dopaminergic marker gene expression in the striatum. 

      In the revised manuscript, we confirmed Ptbp1 downregulation at the tissue level in the SNc and striatum by RT-qPCR and western blot and included low-power images for easier interpretation. Additionally, we assessed expression of additional neuronal markers on striatal sections using 4i imaging and found that TH/DAT/NEUN positive populations either expressed markers of medium spiny neurons or interneurons. We have included these results in the revised manuscript.

      Our behavioral and imaging experiments involving mice injected into the striatum were in fact performed with two independently generated cohorts of 6-OHDA mice. In detail, the 6OHDA mice were generated by two independent surgeons from different labs (>6 months between experiments of these two cohorts), leading to comparable behavioral outcomes before and after treatment. The experiments with each cohort were performed and analyzed by two independent and blinded researchers, yielding comparable results. 

      Reviewer #2 (Recommendations For The Authors):

      (1) In the introduction, lines 43-45: This statement is inaccurate. Current treatment strategies do not focus on slowing or halting disease progression. There is currently no accepted therapy that does this. Dopaminergic therapies and deep brain stimulation can compensate for circuitry dysfunction as a result of dopamine cell loss but do not slow the disease. The referenced paper used is older and does not refer to new treatments for PD and is a summary article for a special issue of the Disease Models and Mechanisms journal. Please ensure that all references used are appropriate for the statement they are attached to.

      We thank the reviewer for pointing this out. We have rephrased this statement accordingly and provided an appropriate reference describing current treatment strategies.

      (2) The number of TH+ cells in the intact nigra seems low compared to published data. Suggest a stereological approach may be better than the Abercrombie method.

      Following the reviewer’s suggestion, we re-quantified the number of TH positive cells using a stereological approach (Nv:Vref method). We have included these results in the revised manuscript. 

      (3) Have the authors considered that the striatal TH+ cells could be TH+ striatal interneurons? 

      In the revised manuscript, we performed additional 4i imaging experiments to further analyze the identity of the TH positive cells in the striatum. Briefly, we found that TH/DAT/NEUN positive populations either expressed markers of GABAergic medium spiny neurons or interneurons. We have added these results to the revised manuscript (Figure 4). 

      (4) The Western blot shown in Figure 1 C for C8-D1A has some abnormalities and makes it difficult to judge the bands. Also, for 1B, the legends are difficult to see.

      In the revised manuscript, we have repeated the respective western blot to make interpretation of the bands easier, and adapted the legends in Figure 1B for better visibility.

      (5) Figure 2: Please show representative images for the GFAP-targeted editing.

      Representative images of the GFAP-targeted groups can be found in Figure 2 – figure supplement 3.

      (6) Figure 2, Supplement 3: Please include quantification.

      The quantifications for these images can be found in Figure 2D and 2F. 

      (7) Figure 1, Supplement 2: The gene name in A is misspelled.

      Thank you for point this out. In the revised manuscript, we added the correct gene name.

      (8) Line 267-276: As previously indicated, the statement here is overstated based on the data provided. In addition, the citation provided to justify this claim (Kannari et al., 2000) is an odd choice as the dosage of L-DOPA utilized was not therapeutically relevant (50 mg/kg). A better indication of efficacy would be the return to basal, unaffected levels rather than the fold increase in dopamine levels. A better comparison would be Lindgren et al., 2010 who showed that L-DOPA-treated animals with a physiologically relevant dose (6 mg/kg) that did not induce dyskinesia, showed a return to basal, non-lesioned dopamine levels in the striatum after LDOPA by microdialysis. To really support this claim, the authors would need to use an approach that could measure synaptic dopamine availability, rather than whole-tissue dopamine levels, such as microdialysis, fiber photometry, or an equivalent.

      Following the reviewer’s suggestions, we replaced this reference with Lindgren et al. (2010) and provide a more detailed interpretation of our results and remaining questions for future studies.  

      Reviewer #3 (Recommendations For The Authors):

      Major and minor issues are discussed below by section.

      INTRODUCTION and AIM - Lines 36-73

      - The authors effectively contextualize the aim of their study by providing comprehensive background information on previous research regarding cell 'reprogramming' into dopaminergic neurons in the SNc. However, the introduction lacks contextualization of TH+ cells and PD. For readers who may not be well-versed in the Parkinson's field, understanding the importance of TH (Tyrosine Hydroxylase) may be challenging, since the term "TH+ cells" is mentioned only once by the end of the introduction (line 71), to then become a key element in the entire study.

      - Providing a brief explanation of the role of Tyrosine Hydroxylase in the synthesis of L-DOPA would facilitate the reader's comprehension of why the presence of TH+ cells following Base Editing treatment is relevant.

      - Further elaboration on the relationship between the downregulation of the general RNA binding protein, PTBP1, and the specific dopaminergic-related readout, TH, would improve coherence and strengthen the linkage between the introductory section and the results.

      We thank the reviewer for the constructive suggestions. In the introduction of the revised manuscript, we describe the meaning and importance of TH in the context of dopamine synthesis and PD. Likewise, we briefly outlined the importance of the PTBP1/nPTBP regulatory loops during neuronal differentiation and maturation. 

      RESULTS 

      Result Section 1 - Line 75-109

      - Thorough screening of sgRNAs targeting splice junctions across the Ptbp1 gene in HEPA cells, shows the achievement of high levels of editing (80-90%) with sgRNA-ex3 and sgRNAex7. 

      - The data also indicates that editing translates into significant reductions in ptbp1 expression, along with an increase in the expression of genes repressed by PTBP1.

      - Despite obtaining lower percentages of editing events in N2a neuroblastoma cells and the C8-D1A astroglial cell line, the differential expression levels of ptbp1 and the readout genes remain significant. However, the gRNA screening assay is performed in immortalized, dividing cells. 

      - Providing proof that Adenosine Base Editing of Ptbp1 is successful in non-dividing cells (such as SNc and/or striatal primary neurons) would strengthen the case for the potential therapy in the intended cell type.

      Following the reviewer’s comment, we show in vivo base editing rates in the SNc and striatum of treated PD mice in the revised manuscript (Figure 2 – figure supplement 5; figure 3 – supplement 2).

      - Moreover, assessing the expression levels of tyrosine hydroxylase by qPCR after Ptbp1 base editing in vitro could help contextualize the use of TH+ detection as an in vivo readout and may help explain why the total number of TH+ cells is low after ABE treatment in vivo - as shown in following sections.

      In the revised manuscript, we now provide quantifications of in vivo base editing efficiencies in the SNc (~15%) and striatum (~20%). As expected from these lower in vivo base editing rates, downregulation of Ptbp1 at the transcript and protein level was less pronounced compared to our in vitro experiments. It seems likely that higher base editing efficiency and more pronounced downregulation of Ptbp1 could lead to a larger population of TH expressing cells. We have added these results and interpretations to the revised manuscript.

      - Furthermore, although ABEs are less prone to generating bystander and other nucleotide changes compared to CBEs, it is still possible. Figures 1 (line 811) and 1-supplement 2 (line 842) only show a brief window of the Sanger sequencing trace. Updating these figures to display a wider view of the sequencing trace would enhance transparency. If unwanted edits are detected, while they may not significantly alter the relevance, impact, or structure of the paper, they may become an important aspect of the discussion. 

      Indeed, ABEs can induce bystander edits and we also detected such edits at the Ptbp1 target site. However, since our base editing strategy was designed to yield a loss of Ptbp1 function, bystander editing at the splice site was not a primary focus in our analysis. Nevertheless, we included CRISPResso output images showing the specific editing outcomes in a wider analysis window in the revised manuscript (Figure 3 – figure supplement 2). 

      Result Section 2 - Lines 110-159

      A split intein system is used in vivo with sgRNA-ex3, after updating the promoter to make it cell-specific: hSyn to restrict expression to neurons and GFAP to restrict expression to astrocytes. 

      However, no other assay is performed to assess whether a) the promoter change and/or b) splitting Cas9 may affect the editing efficiency compared to their initial in vitro approach.

      In the revised manuscript, we assessed the performance of the in vivo AAV vectors encoding the split intein ABE with sgRNA-ex3 in vitro in N2a and C8-D1A cells. Our results show that all vectors are functional and result in base editing at the target locus.

      -  Addressing whether this is the case may explain the low number of TH+ cells observed in vivo. 

      - The authors could also consider staining for Cas9 to address whether the low number of TH+cells could be attributed to a poor Cas9 delivery.

      To confirm successful in vivo base editor delivery, we quantified in vivo base editing efficiencies in the SNc and striatum of PD mice. Our analysis revealed in vivo base editing efficiencies at both tissue sites, confirming that base editors were successfully delivered. Editing efficiencies were, however, substantially lower (Figure 2 – figure supplement 5; figure 3 – supplement 2).  than in our in vitro cell line setting (Figure 1; figure 1 – figure supplement 2). Even though tissue editing rates likely underestimate the cell type-specific editing rates in astrocytes or neurons, higher base editing rates would have likely resulted in a higher number of TH positive cells. We have added these results and their implications to the revised manuscript. 

      -  Moreover, despite the presence of TH, in Figure 2 E,F authors examine the striatal innervation from newly generated TH+ cells in the SNc by Fluorescence Intensity (FI) to conclude that the edited cells do not form projections towards the striatum. Considering the low levels of TH+ positive cells obtained, the accumulation of gross FI might not be the most accurate way to assess the presence or absence of cell projections.

      - Using another marker that stains the projections rather than the cell soma, and that is a marker of dopaminergic neurons, might be a better way to address this.

      To address the reviewer’s comment, we analyzed the presence of potential dopaminergic fibers in the mfb, where projections are more concentrated (around the injection coordinates of 6-OHDA), using the dopaminergic marker DAT. In line with our previous observations in the striatum, we did not detect an increase in DAT fluorescence intensity upon treatment on the lesioned hemisphere (Figure 2 – figure supplement 4).  

      Result Section 3 - Line 160-182

      Minor issue

      - The same dual split intein system is used in the striatum. However, in Figure 3 - Figure Supplement 1 - line 958 and in Figure 3 - Figure Supplement 4 - line 1000authors show the injection of 2x the viral genomes indicated along the manuscript. In previous experiments the SNc 2x108vg/animal was used whereas this figure shows 4x108vg/animal injected in the striatum. 

      - The authors should clarify if the vg injected in the striatum was different from what they previously indicated.

      Compared to injection in the SNc, the volume of vector injected in the striatum was doubled since the region is significantly larger. We clarified that the injected vector genomes were different between striatum and SNc in the revised manuscript.

      Result Section 4- Line 183-220

      In this section, the authors thoroughly examine the neuronal nature of TH+ cells through NeuN co-staining and iterative immunofluorescence imaging (4i). BrdU experiments are conducted to determine the origin of these cells, leading to the conclusion that TH+ cells derive from nondividing cells and express the neuronal marker DAT, characteristic of dopamine-producing neurons (DANs). Cell shape of the TH+ cells in the striatum and SNc is also evaluated measuring their Feret's diameter and their cell surface. Authors conclude there's heterogeneity in the TH+ cell population due to the presence of TH+/Neun- as well as differences in cell shape. 

      However, their explanation of this heterogeneity is solely attributed to differences in the microenvironment and lacks further elaboration. Similarly, their observation that almost half the number of TH+ striatal cells after treatment express CTIP2 (Line 213 and Figure 4B), a marker for GABAergic medium spiny neurons, which they state as "interesting" (line 213) is not developed further. Delving deeper into these topics could strengthen the discussion.

      In the revised manuscript, we provided a more in-depth discussion of the 4i imaging results and potential therapeutic implications. Additionally, we suggest follow-up experiments to analyze the identity, function, and molecular mechanisms underlying the expression of TH upon PTBP1 downregulation in future studies. 

      Result Section 5- Line 221-243

      Two drug-free and two drug-induced behavioral tests are conducted in control and treated animals to evaluate the restoration of motor functions following treatment. Consistent with their previous findings, only the treatment targeted to neurons resulted in the restoration of motor functions in drug-free behavioral tests. The rationale behind each test and its evaluation is clearly explained.

      DISCUSSION 

      - In the discussion section, the authors effectively re-examine their results contextualizing their data with previous studies in the field. However, it would be helpful at this point in the manuscript to reconsider the use of the term 'cell reprogramming,' as this study does not involve actual cell reprogramming. The concept "reprograming" entails the process of transforming adult cells into a stem cell-like state, to then differentiate them into a different cell type. As proven in section 4 by a BrdU proliferation assay, the targeted cells are differentiated neurons. Considering BrdU is administered 5 days after ABE treatment, if true cell reprogramming was taking place, there should be evidence of BrdU incorporation. Cell reprogramming or reprograming is mentioned 4 times in the manuscript (line 34, line 54, line 265, line 277). Therefore, using another terminology would be more accurate.

      Following the reviewer’s suggestion, we removed the term “cell reprograming” from the manuscript and rather describe it as induction of TH expression in endogenous neurons.

      - As noted in the comments of section 4, a more thorough discussion about the various possibilities for heterogeneity would enhance the manuscript's contribution to the PD field.

      In the revised manuscript, we provided a more in-depth discussion of the 4i imaging results and potential therapeutic implications. 

      - Despite observing low numbers of TH+ cells, no significant rescue of drug-induced behaviors, and low levels of released dopamine, the authors merely state that these results make the therapy non-viable, but there is no further exploration or discussion. Whether the limitations lie in the ABE strategy itself, such as its efficiency in targeting and editing of differentiated neurons; or if the issues lie on the injection and delivery, is never discussed. A deeper argumentation on the possible underlying reasons for these challenges would greatly enhance the manuscript and contribute to the advancement of ABE therapies in the brain.

      We believe that the efficacy of our base editing approach could be significantly enhanced by optimizing the delivery. Currently, we are using a dual AAV approach to deliver intein-split ABEs. Since this approach relies on the delivery of higher AAV doses to achieve cotransduction of a cell by two different AAVs, the efficiency could be significantly enhanced by using smaller Cas9 orthologues that can be delivered as a single AAV. Furthermore, in this study we performed a single injection into the dorsal striatum to deliver ABE-expressing AAVs. Performing multiple injections into the rostral, medial, and caudal regions of the striatum might allow us to transduce more cells and induce TH expression in a larger population of striatal neurons. We have included these points in the revised manuscript.

      - While drug-induced behaviors are not recovered, the data demonstrates a rescue of spontaneous behaviors. Further discussion on the potential differences in circuitry underlying these variations in behavioral rescue would also enrich the manuscript's discussion.

      In the revised manuscript, we provide suggestions for potential mechanisms involved in the rescue of spontaneous behavior vs. absence of rescue of drug-induced behaviors. 

      FIGURES AND FIGURE SUPPLEMENTS

      General minor issue - low magnification images in the following figures, make it difficult to visualize positive cells in tissue sections: Figure 2; Figure 2- supplement 1; Figure 2 - supplement 3, Figure 3- supplement 1. Adding a higher magnification imaging of positive cells in tissue sections of SNc and striatum might help with the visualization. 

      As suggested by the reviewer, we included higher magnification images in the corresponding figures to improve interpretation of our results.

    1. Reviewer #2 (Public review):

      Summary:

      The authors aimed to investigate the functionality of the GnRH (gonadotropin-releasing hormone) pulse generator in different mouse models to understand its role in reproductive physiology and its implications for conditions like polycystic ovary syndrome (PCOS). They compared the GnRH pulse generator activity in control mice, peripubertal androgen (PPA) treated mice, and prenatal androgen (PNA) exposed mice. The study sought to elucidate how androgen exposure affects the GnRH pulse generator and subsequent LH (luteinizing hormone) secretion, contributing to the pathophysiology of PCOS.

      Strengths:

      (1) Comprehensive Model Selection: The use of both PPA and PNA mouse models allows for a comparative analysis that can distinguish the effects of different timings of androgen exposure.

      (2) Detailed Methodology: The methods employed, such as photometry recordings and serial blood sampling, are robust and allow for precise measurement of GnRH pulse generator activity and LH secretion.

      (3) Clear Results Presentation: The experimental results are well-documented with appropriate statistical analyses, ensuring the findings are reliable and reproducible.

      (4) Relevance to PCOS: The study addresses a significant gap in understanding the neuroendocrine mechanisms underlying PCOS, making the findings relevant to both basic science and potentially clinical research.

      Weaknesses

      (1) Model Limitations: While the PNA mouse model is suggested as the most appropriate for studying PCOS, the authors acknowledge that it does not completely replicate the human condition, particularly the elevated LH response seen in women with PCOS.

      (2) Complex Data Interpretation: The reduced progesterone feedback and its effects on the GnRH pulse generator in PNA mice add complexity to data interpretation, making it challenging to draw straightforward conclusions.

      (3) Machine Learning (ML) Selection and Validation: While k-means clustering is a useful tool for pattern recognition, the manuscript lacks detailed justification for choosing this specific algorithm over other potential methods. The robustness of clustering results has not been validated.

      (4) Biological Interpretability: Although the machine learning approach identified cyclical patterns, the biological interpretation of these clusters in the context of PCOS is not thoroughly discussed. A deeper exploration of how these clusters correlate with physiological and pathological states could enhance the study's impact.

      (5) Sample Size: The study uses a relatively small number of animals (n=4-7 per group), which may limit the generalisability of the findings. Larger sample sizes could provide more robust and statistically significant results.

      (6) Scope of Application: The findings, while interesting, are primarily applicable to mouse models. The translation to human physiology requires cautious interpretation and further validation.

      Comments on revised version:

      I did not find the response to my main concerns regarding justification for the choice of the number of clusters (k) and providing evidence of cluster robustness satisfactory at all. It sounds contradictory to me to state that the authors have used unsupervised ML approach when at the same time had clear understanding of the data and the features they wanted to capture. Unsupervised approaches are meant to reveal features that are not apparent by eye... however in their response the authors state, "...our aim was to develop an unsupervised approach that would automatically detect the onset and existence of the key features of pulse generator cyclicity that were apparent by eye...". This sounds like a rather supervised ML approach to me.<br /> Furthermore, I am still unsure why did the authors choose k=5, i.e. assumed there are 5 clusters in the data, and did they explore other possible values for k?<br /> - If not why not? How does this fit with the claims that their ML approach is unsupervised, in other words purely data-driven without making any assumptions?<br /> - If yes did they compare the robustness of their clustering results obtained for different values of k?

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review): 

      In the presented manuscript, the authors investigate how neural networks can learn to replay presented sequences of activity. Their focus lies on the stochastic replay according to learned transition probabilities. They show that based on error-based excitatory and balance-based inhibitory plasticity networks can selforganize towards this goal. Finally, they demonstrate that these learning rules can recover experimental observations from song-bird song learning experiments. 

      Overall, the study appears well-executed and coherent, and the presentation is very clear and helpful. However, it remains somewhat vague regarding the novelty. The authors could elaborate on the experimental and theoretical impact of the study, and also discuss how their results relate to those of Kappel et al, and others (e.g., Kappel et al (doi.org/10.1371/journal.pcbi.1003511))). 

      We agree with the reviewer that our previous manuscript lacked comparison with previously published similar works. While Kappel et al. demonstrated that STDP in winner-take-all circuits can approximate online learning of hidden Markov models (HMMs), a key distinction from our model is that their neural representations acquire deterministic sequential activations, rather than exhibiting stochastic transitions governing Markovian dynamics. Specifically, in their model, the neural representation of state B would be different in the sequences ABC and CBA, resulting in distinct deterministic representations like ABC and C'B'A', where ‘A’ and ‘A'’ are represented by different neural states (e.g., activations of different cell assemblies). In contrast, our network learns to generate stochastically transitioning cell assemblies which replay Markovian trajectories of spontaneous activity obeying the learned transition probabilities between neural representations of states. For example, starting from reactivation from assembly ‘A’, there may be an 80% probability to transition to assembly ‘B’ and 20% to ‘C’. Although Kappel et al.'s model successfully solves HMMs, their neural representations do not themselves stochastically transition between states according to the learned model. Similar to the Kappel et al.'s model, while the models proposed in Barber (2002) and Barber and Agakov (2002) learn the Markovian statistics, these models learned a static spatiotemporal input patterns only and how assemblies of neurons show stochastic transition in spontaneous activity has been still unclear. In contrast with these models, our model captures the probabilistic neural state trajectories, allowing spontaneous replay of experienced sequences with stochastic dynamics matching the learned environmental statistics.

      We have included new sentences for explain these in ll. 509-533 in the revised manuscript.

      Overall, the work could benefit if there was either (A) a formal analysis or derivation of the plasticity rules involved and a formal justification of the usefulness of the resulting (learned) neural dynamics; 

      We have included a derivation of our plasticity rules in ll. 630-670 in the revised manuscript. Consistent with our claim that excitatory plasticity updates the excitatory synapse to predict output firing rates, we have shown that the corresponding cost function measures the discrepancy between the recurrent prediction and the output firing rate. Similarly, for inhibitory plasticity, we defined the cost function that evaluates the difference between the excitatory and inhibitory potential within each neuron. We showed that the resulting inhibitory plasticity rule updates the inhibitory synapses to maintain the excitation-inhibition balance.

      and/or (B) a clear connection of the employed plasticity rules to biological plasticity and clear testable experimental predictions. Thus, overall, this is a good work with some room for improvement. 

      Our proposed plasticity mechanism could be implemented through somatodendritic interactions. Analogous to previous computational works (Urbanczik and Senn., 2014; Asabuki and Fukai., 2020; Asabuki et al., 2022), our model suggests that somatic responses may encode the stimulus-evoked neural activity states, while dendrites encode predictions based on recurrent dynamics that aim to minimize the discrepancy between somatic and dendritic activity. To directly test this hypothesis, future experimental studies could simultaneously record from both somatic and dendritic compartments to investigate how they encode evoked responses and predictive signals during learning (Francioni et al., 2022).

      We have included new sentences for explain these in ll. 476-484 in the revised manuscript.

      Reviewer #2 (Public Review): 

      Summary: 

      This work proposes a synaptic plasticity rule that explains the generation of learned stochastic dynamics during spontaneous activity. The proposed plasticity rule assumes that excitatory synapses seek to minimize the difference between the internal predicted activity and stimulus-evoked activity, and inhibitory synapses try to maintain the E-I balance by matching the excitatory activity. By implementing this plasticity rule in a spiking recurrent neural network, the authors show that the state-transition statistics of spontaneous excitatory activity agree with that of the learned stimulus patterns, which are reflected in the learned excitatory synaptic weights. The authors further demonstrate that inhibitory connections contribute to well-defined state transitions matching the transition patterns evoked by the stimulus. Finally, they show that this mechanism can be expanded to more complex state-transition structures including songbird neural data. 

      Strengths: 

      This study makes an important contribution to computational neuroscience, by proposing a possible synaptic plasticity mechanism underlying spontaneous generations of learned stochastic state-switching dynamics that are experimentally observed in the visual cortex and hippocampus. This work is also very clearly presented and well-written, and the authors conducted comprehensive simulations testing multiple hypotheses. Overall, I believe this is a well-conducted study providing interesting and novel aspects of the capacity of recurrent spiking neural networks with local synaptic plasticity. 

      Weaknesses: 

      This study is very well-thought-out and theoretically valuable to the neuroscience community, and I think the main weaknesses are in regard to how much biological realism is taken into account. For example, the proposed model assumes that only synapses targeting excitatory neurons are plastic, and uses an equal number of excitatory and inhibitory neurons. 

      We agree with the reviewer. The network shown in the previous manuscript consists of an equal number of excitatory and inhibitory neurons, which seems to lack biological plausibility. Therefore, we first tested whether a biologically plausible scenario would affect learning performance by setting the ratio of excitatory to inhibitory neurons to 80% and 20% (Supplementary Figure 7a; left). Even in such a scenario, the network still showed structured spontaneous activity (Supplementary Figure 7a; center), with transition statistics of replayed events matching the true transition probabilities (Supplementary Figure 7a; right). We then asked whether the model with our plasticity rule applied to all synapses would reproduce the corresponding stochastic transitions. We found that the network can learn transition statistics but only under certain conditions. The network showed only weak replay and failed to reproduce the appropriate transition (Supplementary Fig. 7b) if the inhibitory neurons were no longer driven by the synaptic currents reflecting the stimulus, due to a tight balance of excitatory and inhibitory currents on the inhibitory neurons. We then tested whether the network with all synapses plastic can learn transition statistics if the external inputs project to the inhibitory neurons as well. We found that, when each stimulus pattern activates a non-overlapping subset of neurons, the network does not exhibit the correct stochastic transition of assembly reactivation (Supplementary Fig. 7c). Interestingly, when each neuron's activity is triggered by multiple stimuli and has mixed selectivity, the reactivation reproduced the appropriate stochastic transitions (Supplementary Fig. 7d).

      We have included these new results as new Supplementary Figure 7 and they are explained in ll.215-230 in the revised manuscript.

      The model also assumes Markovian state dynamics while biological systems can depend more on history. This limitation, however, is acknowledged in the Discussion. 

      We have included the following sentence to provide a possible solution to this limitation: “Therefore, to learn higher-order stochastic transitions, recurrent neural networks like ours may need to integrate higher-order inputs with longer time scales.” in ll.557-559 in the revised manuscript. 

      Finally, to simulate spontaneous activity, the authors use a constant input of 0.3 throughout the study. Different amplitudes of constant input may correspond to different internal states, so it will be more convincing if the authors test the model with varying amplitudes of constant inputs. 

      We thank the reviewer for pointing this out. In the revised manuscript, we have tested constant input with three different strengths. If the strength is moderate, the network showed accurate encoding of transition statistics in the spontaneous activity as we have seen in Fig.2. We have additionally shown that the weaker background input causes spontaneous activity with lower replay rate, which in turn leads to high variance of encoded transition, while stronger inputs make assembly replay transitions more uniform. We have included these new results as new Supplementary Figure 6 and they are explained in ll.211214 in the revised manuscript.

      Reviewer #3 (Public Review): 

      Summary: 

      Asabuki and Clopath study stochastic sequence learning in recurrent networks of Poisson spiking neurons that obey Dale's law. Inspired by previous modeling studies, they introduce two distinct learning rules, to adapt excitatory-to-excitatory and inhibitory-to-excitatory synaptic connections. Through a series of computer experiments, the authors demonstrate that their networks can learn to generate stochastic sequential patterns, where states correspond to non-overlapping sets of neurons (cell assemblies) and the state-transition conditional probabilities are first-order Markov, i.e., the transition to a given next state only depends on the current state. Finally, the authors use their model to reproduce certain experimental songbird data involving highly-predictable and highly-uncertain transitions between song syllables. 

      Strengths: 

      This is an easy-to-follow, well-written paper, whose results are likely easy to reproduce. The experiments are clear and well-explained. The study of songbird experimental data is a good feature of this paper; finches are classical model animals for understanding sequence learning in the brain. I also liked the study of rapid task-switching, it's a good-to-know type of result that is not very common in sequence learning papers. 

      Weaknesses: 

      While the general subject of this paper is very interesting, I missed a clear main result. The paper focuses on a simple family of sequence learning problems that are well-understood, namely first-order Markov sequences and fully visible (nohidden-neuron) networks, studied extensively in prior work, including with spiking neurons. Thus, because the main results can be roughly summarized as examples of success, it is not entirely clear what the main point of the authors is. 

      We apologize the reviewer that our main claim was not clear. While various computational studies have suggested possible plasticity mechanisms for embedding evoked activity patterns or their probability structures into spontaneous activity (Litwin-Kumar et al., Nat. Commun. 2014, Asabuki and Fukai., Biorxiv 2023), how transition statistics of the environment are learned in spontaneous activity is still elusive and poorly understood. Furthermore, while several network models have been proposed to learn Markovian dynamics via synaptic plasticity (Brea, et al. (2013); Pfister et al. (2004); Kappel et al. (2014)), they have been limited in a sense that the learned network does not show stochastic transition in a neural state space. For instance, while Kappel et al. demonstrated that STDP in winner-take-all circuits can approximate online learning of hidden Markov models (HMMs), a key distinction from our model is that their neural representations acquire deterministic sequential activations, rather than exhibiting stochastic transitions governing Markovian dynamics. Specifically, in their model, the neural representation of state B would be different in the sequences ABC and CBA, resulting in distinct deterministic representations like ABC and C'B'A', where ‘A’ and ‘A'’ are represented by different neural states (e.g., activations of different cell assemblies). In contrast, our network learns to generate stochastically transitioning cell assemblies that replay Markovian trajectories of spontaneous activity obeying the learned transition probabilities between neural representations of states. For example, starting from reactivation from assembly ‘A’, there may be an 80% probability to transition to assembly ‘B’ and 20% to ‘C’. Although Kappel et al.'s model successfully solves HMMs, their neural representations do not themselves stochastically transition between states according to the learned model. Similar to the Kappel et al.'s model, while the models proposed in Barber (2002) and Barber and Agakov (2002) learn the Markovian statistics, these models learned a static spatiotemporal input patterns only and how assemblies of neurons show stochastic transition in spontaneous activity has been still unclear. In contrast with these models, our model captures the probabilistic neural state trajectories, allowing spontaneous replay of experienced sequences with stochastic dynamics matching the learned environmental statistics.

      We have explained this point in ll.509-533 in the revised manuscript.

      Going into more detail, the first major weakness I see in this paper is the heuristic choice of learning rules. The paper studies Poisson spiking neurons (I return to this point below), for which learning rules can be derived from a statistical objective, typically maximum likelihood. For fully-visible networks, these rules take a simple form, similar in many ways to the E-to-E rule introduced by the authors. This more principled route provides quite a lot of additional understanding on what is to be expected from the learning process. 

      We thank the reviewer for pointing this out. To better demonstrate the function of our plasticity rules, we have included the derivation of the rules of synaptic plasticity in ll. 630-670 in the revised manuscript. Consistent with our claim that excitatory plasticity updates the excitatory synapse to predict output firing rates, we have shown that the corresponding cost function measures the discrepancy between the recurrent prediction and the output firing rate. Similarly, for inhibitory plasticity, we defined the cost function that evaluates the difference between the excitatory and inhibitory potential within each neuron. We showed that the resulting inhibitory plasticity rule updates the inhibitory synapses to maintain the excitation-inhibition balance.

      For instance, should maximum likelihood learning succeed, it is not surprising that the statistics of the training sequence distribution are reproduced. Moreover, given that the networks are fully visible, I think that the maximum likelihood objective is a convex function of the weights, which then gives hope that the learning rule does succeed. And so on. This sort of learning rule has been studied in a series of papers by David Barber and colleagues [refs. 1, 2 below], who applied them to essentially the same problem of reproducing sequence statistics in recurrent fully-visible nets. It seems to me that one key difference is that the authors consider separate E and I populations, and find the need to introduce a balancing I-to-E learning rule. 

      The reviewer’s understanding that inhibitory plasticity to maintain EI balance is one of a critical difference from previous works is correct. However, we believe that the most striking point of our study is that we have shown numerically that predictive plasticity rules enable recurrent networks to learn and replay the assembly activations whose transition statistics match those of the evoked activity. Please see our reply above.

      Because the rules here are heuristic, a number of questions come to mind. Why these rules and not others - especially, as the authors do not discuss in detail how they could be implemented through biophysical mechanisms? When does learning succeed or fail? What is the main point being conveyed, and what is the contribution on top of the work of e.g. Barber, Brea, et al. (2013), or Pfister et al. (2004)? 

      Our proposed plasticity mechanism could be implemented through somatodendritic interactions. Analogous to previous computational works (Senn, Asabuki), our model suggests that somatic responses may encode the stimulusevoked neural activity states, while dendrites encode predictions based on recurrent dynamics that aim to minimize the discrepancy between somatic and dendritic activity. To directly test this hypothesis, future experimental studies could simultaneously record from both somatic and dendritic compartments to investigate how they encode evoked responses and predictive signals during learning.

      To address the point of the reviewer, we conducted addionnal simulations to test where the model fails. We found that the model with our plasticity rule applied to all synapses only showed faint replays and failed to replay the appropriate transition (Supplementary Fig. 7b). This result is reasonable because the inhibitory neurons were no longer driven by the synaptic currents reflecting the stimulus, due to a tight balance of excitatory and inhibitory currents on the inhibitory neurons. Our model predicts that mixed selectivity in the inhibitory population is crucial to learn an appropriate transition statistics (Supplementary Fig. 7d). Future work should clarify the role of synaptic plasticity on inhibitory neurons, especially plasticity at I to I synapses. We have explained this result as new supplementary Figure7 in the revised manuscript.

      The use of a Poisson spiking neuron model is the second major weakness of the study. A chief challenge in much of the cited work is to generate stochastic transitions from recurrent networks of deterministic neurons. The task the authors set out to do is much easier with stochastic neurons; it is reasonable that the network succeeds in reproducing Markovian sequences, given an appropriate learning rule. I believe that the main point comes from mapping abstract Markov states to assemblies of neurons. If I am right, I missed more analyses on this point, for instance on the impact that varying cell assembly size would have on the findings reported by the authors.

      The reviewer’s understanding is correct. Our main point comes from mapping Markov statistics to replays of cell assemblies. In the revised manuscript, we performed additional simulations to ask whether varying the size of the cell assemblies would affect learning. We ran simulations with two different configurations in the task shown in Figure 2. The first configuration used three assemblies with a size ratio of 1:1.5:2. After training, these assemblies exhibited transition statistics that closely matched those of the evoked activity (Supplementary Fig.4a,b). In contrast, the second configuration, which used a size ratio of 1:2:3, showed worse performance compared to the 1:1.5:2 case (Supplementary Fig.4c,d). These results suggest that the model can learn appropriate transition statistics as long as the size ratio of the assemblies is not drastically varied.

      Finally, it was not entirely clear to me what the main fundamental point in the HVC data section was. Can the findings be roughly explained as follows: if we map syllables to cell assemblies, for high-uncertainty syllable-to-syllable transitions, it becomes harder to predict future neural activity? In other words, is the main point that the HVC encodes syllables by cell assemblies? 

      The reviewer's understanding is correct. We wanted to show that if the HVC learns transition statistics as a replay of cell assemblies, a high-uncertainty syllable-to-syllable transition would make predicting future reactivations more difficult, since trial-averaged activities (i.e., poststimulus activities; PSAs) marginalized all possible transitions in the transition diagram.

      (1) Learning in Spiking Neural Assemblies, David Barber, 2002. URL: https://proceedings.neurips.cc/paper/2002/file/619205da514e83f869515c782a328d3c-Paper.pdf  

      (2) Correlated sequence learning in a network of spiking neurons usingmaximum likelihood, David Barber, Felix Agakov, 2002. URL: http://web4.cs.ucl.ac.uk/staff/D.Barber/publications/barber-agakovTR0149.pdf  

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors): 

      In more detail: 

      A) Theoretical analysis 

      The plasticity rules in the study are introduced with a vague reference to previous theoretical studies of others. Doing this, one does not provide any formal insight as to why these plasticity rules should enable one to learn to solve the intended task, and whether they are optimal in some respect. This becomes noticeable, especially in the discussion of the importance of inhibitory balance, which does not go into any detail, but rather only states that its required, both in the results and discussion sections. Another unclarity appears when error-based learning is discussed and compared to Hebbian plasticity, which, as you state, "alone is insufficient to learn transition probabilities". It is not evident how this claim is warranted, nor why error-based plasticity in comparison should be able to perform this (other than referring to the simulation results). Please either clarify formally (or at least intuitively) how plasticity rules result in the mentioned behavior, or alternatively acknowledge explicitly the (current) lack of intuition. 

      The lack of formal discussion is a relevant shortcoming compared to previous research that showed very similar results with formally more rigorous and principled approaches. In particular, Kappel et al derived explicitly how neural networks can learn to sample from HMMs using STDP and winner-take-all dynamics. Even though this study has limitations, the relation with respect to that work should be made very clear; potentially the claims of novelty of some results (sampling) should be adjusted accordingly. See also Yanping Huang, Rajesh PN Rao (NIPS 2014), and possibly other publications. While it might be difficult to formally justify the learning rules post-hoc, it would be very helpful to the field if you very clearly related your work to that of others, where learning rules have been formally justified, and elaborate on the intuition of how the employed rules operate and interact (especially for inhibition). 

      Lastly, while the importance of sampling learned transition probabilities is discussed, the discussion again remains on a vague level, characterized by the lack of references in the relevant paragraphs. Ideally, there should be a proof of concept or a formal understanding of how the learned behaviour enables to solve a problem that is not solved by deterministic networks. Please incorporate also the relation to the literature on neural sampling/planning/RL etc. and substantiate the claims with citations. 

      We have included sentences in ll. 691-696 in the revised manuscript to explain that for Poisson spiking neurons, the derived learning rule is equivalent to the one that minimizes the Kullback-Leibler divergence between the distributions of output firing and the dendritic prediction, in our case, the recurrent prediction (Asabuki and Fukai; 2020). Thus, the rule suggests that the recurrent prediction learns the statistical model of the evoked activity, which in turn allows the network to reproduce the learned transition statistics.

      We have also added a paragraph to discuss the differences between previously published similar models (e.g., Kappel et al.). Please see our response above.

      B) Connection to biology 

      The plasticity rules in the study are introduced with a vague reference to previous theoretical studies of others. Please discuss in more detail if these rules (especially the error-based learning rule) could be implemented biologically and how this could be achieved. Are there connections to biologically observed plasticity? E.g. for error-based plasticity has been discussed in the original publication by Urbanzcik and Senn, or more recently by Mikulasch et al (TINS 2023). The biological plausibility of inhibitory balance has been discussed many times before, e.g. by Vogels and others, and a citation would acknowledge that earlier work. This also leaves the question of how neurons in the songbird experiment could adapt and if the model does capture this well (i.e., do they exhibit E-I balance? etc), which might be discussed as well. 

      Last, please provide some testable experimental predictions. By proposing an interesting experimental prediction, the model could become considerably more relevant to experimentalists. Also, are there potentially alternative models of stochastic sequence learning (e.g., Kappel et al)? How could they be distinguished? (especially, again, why not Hebbian/STDP learning?) 

      We have cited the Vogels paper to acknowledge the earlier work. We have also included additional paragraphs to discuss a possible biologically plausible implementation of our model and how our model differs from similar models proposed previously (e.g., Kappel et al.). Please see our response above.

      Other comments 

      As mentioned, a derivation of recurrent plasticity rules is missing, and parameters are chosen ad-hoc. This leaves the question of how much the results rely on the specific choice of parameters, and how robust they are to perturbations. As a robustness check, please clarify how the duration of the Markov states influences performance. It can be expected that this interacts with the timescale of recurrent connections, so having longer or shorter Markov states, as it would be in reality, should make a difference in learning that should be tested and discussed.

      We thank the reviewer for pointing this out. To address this point, we performed new simulations and asked to what extent the duration of Markov states affect performance. Interestingly, even when the network was trained with input states of half the duration, the distributions of the durations of assembly reactivations remain almost identical to those in the original case (Supplementary Figure 3a). Furthermore, the transition probabilities in the replay were still consistent with the true transition probabilities (Supplementary Figure 3b). We have also included the derivation of our plasticity rule in ll. 630-670 in the revised manuscript. 

      Similarly, inhibitory plasticity operates with the same plasticity timescale parameter as excitatory plasticity, but, as the authors discuss, lags behind excitatory plasticity in simulation as in experiment. Is this required or was the parameter chosen such that this behaviour emerges? Please clarify this in the methods section; moreover, it would be good to test if the same results appear with fast inhibitory plasticity. 

      We have performed a new simulation and showed that even when the learning rate of inhibitory plasticity was larger than that of excitatory plasticity, inhibitory plasticity still occurred on a slower timescale than excitatory plasticity. We have included this result in a new Supplementary Figure 2 in the revised manuscript.

      What is the justification (biologically and theoretically) for the memory trace h and its impact on neural spiking? Is it required for the results or can it be left away? Since this seems to be an important and unconventional component of the model, please discuss it in more detail. 

      In the model, it is assumed that each stimulus presentation drives a specific subset of network neurons with a fixed input strength, which avoids convergence to trivial solutions. Nevertheless, we choose to add this dynamic sigmoid function to facilitate stable replay by regulating neuron activity to prevent saturation. We have explained this point in ll.605-611 in the revised manuscript.

      Reviewer #2 (Recommendations For The Authors): 

      I noticed a couple of minor typos: 

      Page 3 "underly"->"underlie" 

      Page 7 "assemblies decreased settled"->"assemblies decreased and settled"

      We have modified the text. We thank the reviewer for their careful review.

      I think Figure 1C is rather confusing and not intuitive. 

      We apologize that the Figure 1C was confusing. In the revised figure, we have emphasized the flow of excitatory and inhibitory error for updating synapses.

      Reviewer #3 (Recommendations For The Authors): 

      One possible path to improve the paper would be to establish a relationship between the proposed learning rules and e.g. the ones derived by Barber. 

      When reading the paper, I was left with a number of more detailed questions I omitted from the public review: 

      (1) The authors introduce a dynamic sigmoidal function for excitatory neurons, Eq. 3. This point requires more discussion and analysis. How does this impact the results? 

      In the model, it is assumed that each stimulus presentation drives a specific subset of network neurons with a fixed input strength, which avoids convergence to trivial solutions. Nevertheless, we choose to add this dynamic sigmoid function to facilitate stable replay by regulating neuron activity to prevent saturation. We have explained this point in ll.605-611 in the revised manuscript.

      (2) For Poisson spiking neurons, it would be great to understand what cell assemblies bring (apart from biological realism, i.e., reproducing data where assemblies can be found), compared to self-connected single neurons. For example, how do the results shown in Figure 2 depend on assembly size? 

      We have changed the cell assembly size ratio and how it affects learning performance in a new Supplementary Figure 4. Please see our reply above.

      (3) The authors focus on modeling spontaneous transitions, corresponding to a highly stochastic generative model (with most transition probabilities far from 1). A complementary question is that of learning to produce a set of stereotypical sequences, with probabilities close to 1. I wondered whether the learning rules and architecture of the model (in particular under the I-to-E rule) would also work in such a scenario. 

      We thank the reviewer for pointing this out. In fact, we had the same question, so we considered a situation in which the setting in Figure 2 includes both cases where the transition matrix is very stochastic (prob=0.5) and near deterministic (prob=0.9).

      (4) An analysis of what controls the time so that the network stays in a certain state would be welcome. 

      We trained the network model in two cases, one with a fast speed of plasticity and one with a slow speed of plasticity. As a result, we found that the duration of assembly becomes longer in the slow learning case than in the fast case. We have included these results as Supplementary Figure 5 in the revised manuscript.

      Regarding the presentation, given that this is a computational modeling paper, I wonder whether *all* the formulas belong in the Methods section. I found myself skipping back and forth to understand what the main text meant, mainly because I missed a few key equations. I understand that this is a style issue that is very much community-dependent, but I think readability would improve drastically if the main model and learning rule equations could be introduced in the main text, as they start being discussed. 

      We thank the reviewer for the suggestion. To cater to a wider audience, we try to explain the principle of the paper without using mathematical formulas as much as possible in the main text.

    1. Reviewer #1 (Public review):

      Summary:

      The authors aimed to quantify feral pig interactions in eastern Australia to inform disease transmission networks. They used GPS tracking data from 146 feral pigs across multiple locations to construct proximity-based social networks and analyze contact rates within and between pig social units.

      Strengths:

      (1) Addresses a critical knowledge gap in feral pig social dynamics in Australia.

      (2) Uses robust methodology combining GPS tracking and network analysis.

      (3) Provides valuable insights into sex-based and seasonal variations in contact rates.

      (4) Effectively contextualizes findings for disease transmission modeling and management.

      (5) Includes comprehensive ethical approval for animal research.

      (6) Utilizes data from multiple locations across eastern Australia, enhancing generalizability.

      Weaknesses:

      (1) Limited discussion of potential biases from varying sample sizes across populations

      (2) Some key figures are in supplementary materials rather than the main text.

      (3) Economic impact figures are from the US rather than Australia-specific data.

      (4) Rationale for spatial and temporal thresholds for defining contacts could be clearer.

      (5) Limited discussion of ethical considerations beyond basic animal ethics approval.

      The authors largely achieved their aims, with the results supporting their conclusions about the importance of sex and seasonality in feral pig contact networks. This work is likely to have a significant impact on feral pig management and disease control strategies in Australia, providing crucial data for refining disease transmission models.

    2. Reviewer #2 (Public review):

      Summary:

      The paper attempts to elucidate how feral (wild) pigs cause distortion of the environment in over 54 countries of the world, particularly Australia.

      The paper displays proof that over $120 billion worth of facilities were destroyed annually in the United States of America.

      The authors have tried to infer that the findings of their work were important and possess a convincing strength of evidence.

      Strengths:

      (1) Clearly stating feral (wild) pigs as a problem in the environment.

      (2) Stating how 54 countries were affected by the feral pigs.

      (3) Mentioning how $120 billion was lost in the US, annually, as a result of the activities of the feral pigs.

      (4) Amplifying the fact that 14 species of animals were being driven into extinction by the feral pigs.

      (5) Feral pigs possessing zoonotic abilities.

      (6) Feral pigs acting as reservoirs for endemic diseases like brucellosis and leptospirosis.

      (7) Understanding disease patterns by the social dynamics of feral pig interactions.

      (8) The use of 146 GPS-monitored feral pigs to establish their social interaction among themselves.

      Weaknesses:

      (1) Unclear explanation of the association of either the female or male feral pigs with each other, seasonally.

      (2) The "abstract paragraph" was not justified.

      (3) Typographical errors in the abstract.

    3. Author response:

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      The authors aimed to quantify feral pig interactions in eastern Australia to inform disease transmission networks. They used GPS tracking data from 146 feral pigs across multiple locations to construct proximity-based social networks and analyze contact rates within and between pig social units.

      Strengths:

      (1) Addresses a critical knowledge gap in feral pig social dynamics in Australia.

      (2) Uses robust methodology combining GPS tracking and network analysis.

      (3) Provides valuable insights into sex-based and seasonal variations in contact rates.

      (4) Effectively contextualizes findings for disease transmission modeling and management.

      (5) Includes comprehensive ethical approval for animal research.

      (6) Utilizes data from multiple locations across eastern Australia, enhancing generalizability.

      Weaknesses:

      (1) Limited discussion of potential biases from varying sample sizes across populations

      This is a really good comment, and we will address this in the discussion as one of the limitations of the study.

      (2) Some key figures are in supplementary materials rather than the main text.

      We will move some of our supplementary material to the main text as suggested.

      (3) Economic impact figures are from the US rather than Australia-specific data.

      We included the impact figures that are available for Australia (for FDM), and we will include the estimated impact of ASF in Australia in the introduction.

      (4) Rationale for spatial and temporal thresholds for defining contacts could be clearer.

      We will improve the explanation of why we chose the spatial and temporal thresholds based on literature, the size of animals and GPS errors.

      (5) Limited discussion of ethical considerations beyond basic animal ethics approval.

      This research was conducted under an ethics committee's approval for collaring the feral pigs. This research is part of an ongoing pest management activity, and all the ethics approvals have been highlighted in the main manuscript.

      The authors largely achieved their aims, with the results supporting their conclusions about the importance of sex and seasonality in feral pig contact networks. This work is likely to have a significant impact on feral pig management and disease control strategies in Australia, providing crucial data for refining disease transmission models.

      Reviewer #2 (Public review):

      Summary:

      The paper attempts to elucidate how feral (wild) pigs cause distortion of the environment in over 54 countries of the world, particularly Australia.

      The paper displays proof that over $120 billion worth of facilities were destroyed annually in the United States of America.

      The authors have tried to infer that the findings of their work were important and possess a convincing strength of evidence.

      Strengths:

      (1) Clearly stating feral (wild) pigs as a problem in the environment.

      (2) Stating how 54 countries were affected by the feral pigs.

      (3) Mentioning how $120 billion was lost in the US, annually, as a result of the activities of the feral pigs.

      (4) Amplifying the fact that 14 species of animals were being driven into extinction by the feral pigs.

      (5) Feral pigs possessing zoonotic abilities.

      (6) Feral pigs acting as reservoirs for endemic diseases like brucellosis and leptospirosis.

      (7) Understanding disease patterns by the social dynamics of feral pig interactions.

      (8) The use of 146 GPS-monitored feral pigs to establish their social interaction among themselves.

      Weaknesses:

      (1) Unclear explanation of the association of either the female or male feral pigs with each other, seasonally.

      This will be better explain in the methods.

      (2) The "abstract paragraph" was not justified.

      We have justified the abstract paragraph as requested by the reviewer.

      (3) Typographical errors in the abstract.

      Typographical errors have been corrected in the Abstract.

      Reviewer #3 (Public review):

      Summary:

      The authors sought to understand social interactions both within and between groups of feral pigs, with the intent of applying their findings to models of disease transmission. The authors analyzed GPS tracking data from across various populations to determine patterns of contact that could support the transmission of a range of zoonotic and livestock diseases. The analysis then focused on the effects of sex, group dynamics, and seasonal changes on contact rates that could be used to base targeted disease control strategies that would prioritize the removal of adult males for reducing intergroup disease transmission.

      Strengths:

      It utilized GPS tracking data from 146 feral pigs over several years, effectively capturing seasonal and spatial variation in the social behaviors of interest. Using proximity-based social network analysis, this work provides a highly resolved snapshot of contact rates and interactions both within and between groups, substantially improving research in wildlife disease transmission. Results were highly useful and provided practical guidance for disease management, showing that control targeted at adult males could reduce intergroup disease transmission, hence providing an approach for the control of zoonotic and livestock diseases.

      Weaknesses:

      Despite their reliability, populations can be skewed by small sample sizes and limited generalizability due to specific environmental and demographic characteristics. Further validation is needed to account for additional environmental factors influencing social dynamics and contact rates

      This is a good point, and we thank the reviewer for pointing out this issue. We will discuss the potential biases due to sample size in our discussion. We agree that environmental factors need to be incorporated and tested for their influence on social dynamics, and this will be added to the discussion as we have plans to expand this research and conduct, the analysis to determine if environmental factors are influencing social dynamics.

    1. Reviewer #1 (Public review):

      The paper explored cross-species variance in albumin glycation and blood glucose levels in the function of various life-history traits. Their results show that<br /> (1) blood glucose levels predict albumin gylcation rates<br /> (2) larger species have lower blood glucose levels<br /> (3) lifespan positively correlates with blood glucose levels and<br /> (4) diet predicts albumin glycation rates.

      The data presented is interesting, especially due to the relevance of glycation to the ageing process and the interesting life-history and physiological traits of birds. Most importantly, the results suggest that some mechanisms might exist that limit the level of glycation in species with the highest blood glucose levels.

      While the questions raised are interesting and the amount of data the authors collected is impressive, I have some major concerns about this study:

      (1) The authors combine many databases and samples of various sources. This is understandable when access to data is limited, but I expected more caution when combining these. E.g. glucose is measured in all samples without any description of how handling stress was controlled for. E.g glucose levels can easily double in a few minutes in birds, potentially introducing variation in the data generated. The authors report no caution of this effect, or any statistical approaches aiming to check whether handling stress had an effect here, either on glucose or on glycation levels.

      (2) The database with the predictors is similarly problematic. There is information pulled from captivity and wild (e.g. on lifespan) without any confirmation that the different databases are comparable or not (and here I'm not just referring to the correlation between the databases, but also to a potential systematic bias (e.g. captivate-based sources likely consistently report longer lifespans). This is even more surprising, given that the authors raise the possibility of captivity effects in the discussion, and exploring this question would be extremely easy in their statistical models (a simple covariate in the MCMCglmms).

      (3) The authors state that the measurement of one of the primary response variables (glycation) was measured without any replicability test or reference to the replicability of the measurement technique.

      (4) The methods and results are very poorly presented. For instance, new model types and variables are popping up throughout the manuscript, already reporting results, before explaining what these are e.g. results are presented on "species average models" and "model with individuals", but it's not described what these are and why we need to see both. Variables, like "centered log body mass", or "mass-adjusted lifespan" are not explained. The results section is extremely long, describing general patterns that have little relevance to the questions raised in the introduction and would be much more efficiently communicated visually or in a table.

    2. Reviewer #2 (Public review):

      Summary

      In this extensive comparative study, Moreno-Borrallo and colleagues examine the relationships between plasma glucose levels, albumin glycation levels, diet, and life-history traits across birds. Their results confirmed the expected positive relationship between plasma blood glucose level and albumin glycation rate but also provided findings that are somewhat surprising or contradicting findings of some previous studies (relationships with lifespan, clutch mass, or diet). This is the first extensive comparative analysis of glycation rates and their relationships to plasma glucose levels and life history traits in birds that are based on data collected in a single study and measured using unified analytical methods.

      Strengths

      This is an emerging topic gaining momentum in evolutionary physiology, which makes this study a timely, novel, and very important contribution. The study is based on a novel data set collected by the authors from 88 bird species (67 in captivity, 21 in the wild) of 22 orders, which itself greatly contributes to the pool of available data on avian glycemia, as previous comparative studies either extracted data from various studies or a database of veterinary records of zoo animals (therefore potentially containing much more noise due to different methodologies or other unstandardised factors), or only collected data from a single order, namely Passeriformes. The data further represents the first comparative avian data set on albumin glycation obtained using a unified methodology. The authors used LC-MS to determine glycation levels, which does not have problems with specificity and sensitivity that may occur with assays used in previous studies. The data analysis is thorough, and the conclusions are mostly well-supported (but see my comments below). Overall, this is a very important study representing a substantial contribution to the emerging field of evolutionary physiology focused on the ecology and evolution of blood/plasma glucose levels and resistance to glycation.

      Weaknesses

      My main concern is about the interpretation of the coefficient of the relationship between glycation rate and plasma glucose, which reads as follows: "Given that plasma glucose is logarithm transformed and the estimated slope of their relationship is lower than one, this implies that birds with higher glucose levels have relatively lower albumin glycation rates for their glucose, fact that we would be referring as higher glycation resistance" (lines 318-321) and "the logarithmic nature of the relationship, suggests that species with higher plasma glucose levels exhibit relatively greater resistance to glycation" (lines 386-388). First, only plasma glucose (predictor) but not glycation level (response) is logarithm transformed, and this semi-logarithmic relationship assumed by the model means that an increase in glycation always slows down when blood glucose goes up, irrespective of the coefficient. The coefficient thus does not carry information that could be interpreted as higher (when <1) or lower (when >1) resistance to glycation (this only can be done in a log-log model, see below) because the semi-log relationship means that glycation increases by a constant amount (expressed by the coefficient of plasma glucose) for every tenfold increase in plasma glucose (for example, with glucose values 10 and 100, the model would predict glycation values 2 and 4 if the coefficient is 2, or 0.5 and 1 if the coefficient is 0.5). Second, the semi-logarithmic relationship could indeed be interpreted such that glycation rates are relatively lower in species with high plasma glucose levels. However, the semi-log relationship is assumed here a priori and forced to the model by log-transforming only glucose level, while not being tested against alternative models, such as: (i) a model with a simple linear relationship (glycation ~ glucose); or (ii) a log-log model (log(glycation) ~ log(glucose)) assuming power function relationship (glycation = a * glucose^b). The latter model would allow for the interpretation of the coefficient (b) as higher (when <1) or lower (when >1) resistance in glycation in species with high glucose levels as suggested by the authors.

      Besides, a clear explanation of why glucose is log-transformed when included as a predictor, but not when included as a response variable, is missing.

      The models in the study do not control for the sampling time (i.e., time latency between capture and blood sampling), which may be an important source of noise because blood glucose increases because of stress following the capture. Although the authors claim that "this change in glucose levels with stress is mostly driven by an increase in variation instead of an increase in average values" (ESM6, line 46), their analysis of Tomasek et al.'s (2022) data set in ESM1 using Kruskal-Wallis rank sum test shows that, compared to baseline glucose levels, stress-induced glucose levels have higher median values, not only higher variation.

      Although the authors calculated the variance inflation factor (VIF) for each model, it is not clear how these were interpreted and considered. In some models, GVIF^(1/(2*Df)) is higher than 1.6, which indicates potentially important collinearity; see for example https://www.bookdown.org/rwnahhas/RMPH/mlr-collinearity.html). This is often the case for body mass or clutch mass (e.g. models of glucose or glycation based on individual measurements).

      It seems that the differences between diet groups other than omnivores (the reference category in the models) were not tested and only inferred using the credible intervals from the models. However, these credible intervals relate to the comparison of each group with the reference group (Omnivore) and cannot be used for pairwise comparisons between other groups. Statistics for these contrasts should be provided instead. Based on the plot in Figure 4B, it seems possible that terrestrial carnivores differed in glycation level not only from omnivores but also from herbivores and frugivores/nectarivores.

      Given that blood glucose is related to maximum lifespan, it would be interesting to also see the results of the model from Table 2 while excluding blood glucose from the predictors. This would allow for assessing if the maximum lifespan is completely independent of glycation levels. Alternatively, there might be a positive correlation mediated by blood glucose levels (based on its positive correlations with both lifespan and glycation), which would be a very interesting finding suggesting that high glycation levels do not preclude the evolution of long lifespans.

    3. Author response:

      Reviewer #1:

      (1) This concern is addressed in the ESM6, and partly in the ESM1. Indeed, many of the concerns raised by the reviewer later are already addressed on the multiple supplementary materials provided, so we kindly ask the reviewer to read them before moving forward into the discussion.

      (2) This concern is reasonable, but its solution is not "extremely easy", as the reviewer states. The reviewer indicates the use of captive-based versus non-captive-based sources, remarking maximum lifespan, the main variable that is clearly expected to be systematically biased by the source of the data. Nevertheless, except for the ZIMS database, which includes only captive individuals, and some sources, as CNRS databases and EURING, which exclusively includes wild populations, the remaining databases, which are indeed where the vast majority of the data was collected from (i.e. Amniotes database, Birds of the World and AnAge) do not make any distinction. This means that they include just the maximum lifespan from the species as known by the authors of such databases' entries, regardless of provenance, which is also not usually made explicit by the database. Therefore, correcting for this would imply checking all the primary sources. Considering that these databases sometimes do not cite the primary source, but a secondary one, and that on several occasions such source is a specialized book that is not easily accessible, and still these referenced datasets may not indicate the source of the data, tracing all of this information becomes an arduous task, that would even render the usage of databases themselves useless. We will include some details about the concerns of database usage in the discussion to address this.

      Furthermore, it remains relevant to indicate that what we discuss later about the possible effects of captivity is about our usage of animals that come from both sources, not about the provenance of the literature-extracted data used (i.e. captive or wild maximum lifespan, for example), which is an independent matter. We can test for the first for next submission, but very difficultly could we test for the second (as the reviewer seems to be pointing to). In any case, as we do not have in any case the same species from both a captive and a wild source, it would be difficult to determine if the effect tested comes from captivity or from species-specific differences.

      (3) We will add data on the replicability of the glycation measurement in the next manuscript version. The CV for several individuals of different species measured repeated times is quite low (always below 2%).

      (4) The reviewer remarks reported here are already addressed on the supplementary material (ESM6), given the lack of space in the main manuscript. We therefore kindly ask the reviewer to read the supplementary material added to the submission. If the editors agree, all or a considerable part of this could be transferred to the main text for clarity, but this would severely extend the length of a text that the reviewer already considered very long.

      Reviewer #2:

      Thanks for spotting this issue with the coefficient, as it is actually a redaction mistake. It is a remnant of a previous version of the manuscript in which a log-log relation was performed instead. Previous reviewers raised concerns about the usage of log transformation for glycation, this variable being (theoretically) a proportion variable (to which we argue that it does not behave as such), which they considered not to be transformed with a logarithm. After this, we still finally took the decision of not to transform this variable. In this line, the transformations of variables were decided generally by preliminary data exploration. In this particular case, both approaches lead to the same conclusion of higher glycation resistance in the species with higher glucose. Nevertheless, we will consider exploring the comparison of different versions for the resubmission.

      About the issue related to handling time, this variable is not available, for the reasons already exposed in the answer to the other reviewer. Moreover, Kruskal-Wallis test, by its nature, does not determine differences in medians between groups per se, as the reviewer claims, but just differences in ranks-sums. It can be equivalently used for that purpose when the groups' distributions are similar, but not when they differ, as we see here with a difference in variance. What a significant outcome in a Kruskal-Wallis test tells us, thus, is just that the groups differ (in their ranks-sums), which here is plausibly caused by the higher variance in the stressed individuals. Even if we conclude that the average is higher in those groups, mere comparisons of averages for groups with very different variances render different interpretations than when homoscedasticity is met, particularly more so when the distribution of groups overlaps. For example, in a case like this, where the data is left censored (glucose levels cannot be lower than 0), most of this higher variance is related to many values in the stressed groups lying above all the baseline values. This, of course, would increase the average, but such a parameter would not mean the same as if the distributions did not overlap.

      Regarding the GVIFs, why the values are above 1.6 is not well known, but we do not consider this a major concern, as the values are never above 2.2, level usually considered more worrying. We will include a brief explanation of this in the results section. Also, we explicitly calculated life history variables adjusted for body mass, which should eliminate their otherwise strong correlation. There exist other biological and interpretational reasons justified in the ESM6 for using the residuals on the models, instead of the raw values, despite previously raised concerns.

      Given the asseveration by the reviewer that credible intervals are not to be used for the post hoc comparisons, as this is what the whiskers shown in Figure 4B represent, the affirmation of this graph suggesting any difference between groups remains doubtful. New comparisons have now been made with the function HPDinterval() applied to the differences between each diet category calculated from the posterior values of each group, confirming no significant differences exist.

      We do not understand the suggestion made in relation to the model shown in Table 2. Removing glucose from the model could have two results, as the reviewer indicates: 1. Maximum lifespan (ML) relates with glycation, potentially spuriously through the effect of glucose (in this case not included) on both; 2. ML does not relate to glycation, and therefore "high glycation levels do not preclude the evolution of long lifespans", which is what we are already showing with the current model, which also controls for glucose, in an attempt to determine if not just raw glycation values, but glycation resistance, relates to longevity. This is intended to asses if long-lived species may show mechanisms that avoid glycation, by showing levels lower than expected for a non-enzymatic reaction.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript explores the RNA binding activities of the fission yeast Swi6 (HP1) protein and proposes a new role for Swi6 in RNAi-mediated heterochromatin establishment. The authors claim that Swi6 has a specific and high affinity for short interfering RNAs (siRNAs) and recruits the Clr4 (Suv39h) H3K9 methyltransferases to siRNA-DNA hybrids to initiate heterochromatin formation. These claims are not in any way supported by the incomplete and preliminary RNA binding or the in vivo experiments that the authors present. The proposed model also lacks any mechanistic basis as it remains unclear (and unexplored) how Swi6 might bind to specific small RNA sequences or RNA-DNA hybrids. Work by several other groups in the field has led to a model in which siRNAs produced by the RNAi pathway load onto the Ago1-containing RITS complex, which then binds to nascent transcripts at pericentromeric DNA repeats and recruits Clr4 to initiate heterochromatin formation. Swi6 facilitates this process by promoting the recruitment of the RNA-dependent RNA polymerase leading to siRNA amplification.

      Weaknesses:

      (1) The claims that Swi6 binds to specific small RNAs or to RNA-DNA hybrids are not supported by the evidence that the authors present. Their experiments do not rule out non-specific charged-based interactions. Claims about different affinities of Swi6 for RNAs of different sizes are based on a comparison of KD values derived by the authors for a handful of S. pombe siRNAs with previous studies from the Buhler lab on Swi6 RNA binding. The authors need to compare binding affinities under identical conditions in their assays. The regions of Swi6 that bind to siRNAs need to be identified and evidence must be provided that Swi6 binds to RNAs of a specific length, 20-22 mers, to support the claim that Swi6 binds to siRNAs. This is critical for all the subsequent experiments and claims in the study.

      (2) The in vivo results do not validate Swi6 binding to specific RNAs, as stated by the authors. Swi6 pulldowns have been shown to be enriched for all heterochromatic proteins including the RITS complex. The sRNA binding observed by the authors is therefore likely to be mediated by Ago1/RITS.

      Most of the binding in Figure S8C seems to be non-specific.

      In Figure S8D, the authors' data shows that Swi6 deletion does not derepress the rev dh transcript while dcr1 delete cells do, which is consistent with previous reports but does not relate to the authors' conclusions.

      Previous results have shown that swi6 delete cells have 20-fold fewer dg and dh siRNAs than swi6+ cells due to decreased RNA-dependent RNA polymerase complex recruitment and reduced siRNA amplification.

      (3) The RIP-seq data are difficult to interpret as presented. The size distribution of bound small RNAs, and where they map along the genome should be shown as for example presented in previous Ago1 sRNA-seq experiments.

      It is also unclear whether the defects in sRNA binding observed by the authors represent direct sRNA binding to Swi6 or co-precipitation of Ago1-bound sRNAs.

      The authors should also sequence total sRNAs to test whether Swi6-3A affects sRNA synthesis, as is the case in swi6 delete cells.

      (4) The authors examine the effects of Swi6-3A mutant by overexpression from the strong nmt1 promoter. Heterochromatin formation is sensitive to the dosage of Swi6. These experiments should be performed by introducing the 3A mutations at the endogenous Swi6 locus and effects on Swi6 protein levels should be tested.

      (5) The authors' data indicate an impairment of silencing in Swi6-3A mutant cells but whether this is due to a general lower affinity for nucleosomes, DNA, RNA, or as claimed by the authors, siRNAs is unclear. These experiments are consistent with previous findings suggesting an important role for basic residues in the HP1 hinge region in gene silencing but do not reveal how the hinge region enhances silencing.

      (6) RNase H1 overexpression may affect Swi6 localization and silencing indirectly as it would lead to a general reduction in R loops and RNA-DNA hybrids across the genome. RNaseH1 OE may also release chromatin-bound RNAs that act as scaffolds for siRNA-Ag1/RITS complexes that recruit Clr4 and ultimately Swi6.

      (7) Examples of inaccurate presentation of the literature.<br /> a. The authors state that "RNA binding by the murine HP1 through its hinge domains is required for heterochromatin assembly (Muchardt et al, 2002). The cited reference provides no evidence that HP1 RNA binding is required for heterochromatin assembly. Only the hinge region of bacterially produced HP1 contributes to its localization to DAPI-stained heterochromatic regions in fixed NIH 3T3 cells.<br /> b. "... This scenario is consistent with the loss of heterochromatin recruitment of Swi6 as well as siRNA generation in rnai mutants (Volpe et al, 2002)." Volpe et al. did not examine changes in siRNA levels in swi6 mutant cells. In fact, no siRNA analysis of any kind was reported in Volpe et al., 2002.

    2. Author response:

      In this manuscript, we have addressed one of the possible modes of recruitment of Swi6 to the putative heterochromatin loci.

      Our investigation was guided by earlier work showing ability of HP1 a to bind to a class of RNAs and the role of this binding in recruitment of HP1a to heterochromatin loci in mouse cells (Muchardt et al). While there has been no clarity about the mechanism of Swi6 recruitment given the multiple pathways being involved, the issue is compounded by the overall lack of understanding as to how Swi6 recruitment occurs only at the repeat regions. At the same time, various observations suggested a causal role of RNAi in Swi6 recruitment.

      Thus, guided by the work of Muchardt et al we developed a heuristic approach to explore a possibly direct link between Swi6 and heterochromatin through RNAi pathway. Interestingly, we found that the lysine triplet found in the hinge domain in HP1, which influences its recruitment to heterochromatin in mouse cells, is also present in the hinge domain of Swi6, although we were cautious, keeping in mind the findings of Keller et al showing another role of Swi6 in binding to RNAs and channeling them to the exosome pathway. 

      Accordingly, we envisaged that a mode of recruitment of Swi6 through binding to siRNAs to cognate sites in the dg-dh repeats shared among mating type, centromere and telomere loci could explain specific recruitment as well as inheritance following DNA replication. In accordance we framed the main questions as follows: i) Whether Swi6 binds specifically and with high affinity to the siRNAs and the cognate siRNA-DNA hybrids and whether the Swi63K-3A mutant is defective in this binding, ii) whether this lack of binding of Swi63K-3A affects its localization to heterochromatin, iii) whether the this specificity is validated by binding of Swi6 but not Swi63K-3A  to siRNAs and siRNA-DNA hybrids in vivo and iv) whether the binding mode was qualitatively and quantitatively different from that of Cen100 RNA or random RNAs, like GFP RNA.

      We think that our data provides answers to these lines of inquiry to support a model wherein the Swi6-siRNA mediated recruitment can explain a cis-controlled nucleation of heterochromatin at the cognate sites in the genome. We have also partially addressed the points raised by the study by Keller et al by invoking a dynamic balance between different modes of binding of Swi6 to different classes of RNA to exercise heterochromatin formation by Swi6 under normal conditions and RNA degradation under other conditions.

      While we aver about our hypothesis, we do acknowledge the need for more detailed investigation both to buttress our hypothesis and address the dynamics of siRNA binding and recruitment of Swi6  and how Swi6 functions fit in the context of other components of heterochromatin assembly, like the HDACs and Clr4 on one hand and exosome pathway on the other. Our future studies will attempt to address these issues.

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      This manuscript explores the RNA binding activities of the fission yeast Swi6 (HP1) protein and proposes a new role for Swi6 in RNAi-mediated heterochromatin establishment. The authors claim that Swi6 has a specific and high affinity for short interfering RNAs (siRNAs) and recruits the Clr4 (Suv39h) H3K9 methyltransferases to siRNA-DNA hybrids to initiate heterochromatin formation. These claims are not in any way supported by the incomplete and preliminary RNA binding or the in vivo experiments that the authors present. The proposed model also lacks any mechanistic basis as it remains unclear (and unexplored) how Swi6 might bind to specific small RNA sequences or RNA-DNA hybrids. Work by several other groups in the field has led to a model in which siRNAs produced by the RNAi pathway load onto the Ago1-containing RITS complex, which then binds to nascent transcripts at pericentromeric DNA repeats and recruits Clr4 to initiate heterochromatin formation. Swi6 facilitates this process by promoting the recruitment of the RNA-dependent RNA polymerase leading to siRNA amplification.

      Weaknesses:

      (1) a) The claims that Swi6 binds to specific small RNAs or to RNA-DNA hybrids are not supported by the evidence that the authors present. Their experiments do not rule out non-specific charged-based interactions.

      We disagree. We have used synthetic siRNAs of 20-22 nt length to do EMSA assay, as mentioned in the manuscript. Further, we have sequenced the small RNAs obtained after RIP experiments to validate the enrichment of siRNA in Swi6 bound fraction as compared to the mutant Swi6-bound fraction. These results are internally consistent regardless of the mode of binding. In any case the binding occurs primarily through the chromodomain although it is influenced by the hinge domain (see below).

      Furthermore, we have carried out EMSA experiments using Swi6 mutants carrying all three possible double mutations of the K residues in the KKK triplet and found that there was no difference in the binding pattern as compared to the wt Swi6: only the triple mutant “3K-3A” showed the effect. These results suggest that that the bdining is not completely dependent on the basic residues. These results will be included in the revised version.

      We also have some preliminary data from SAXS study showing that the CD of wt Swi6 shows a change in its structure upon binding to the siRNA, while the “3K-3A” mutant of Swi6 has a compact, folded structure that occludes the binding site of Swi6 in the chromodomain.” We propose to mention this preliminary finding in the revised version as unpublished data.

      b) Claims about different affinities of Swi6 for RNAs of different sizes are based on a comparison of KD values derived by the authors for a handful of S. pombe siRNAs with previous studies from the Buhler lab on Swi6 RNA binding. The authors need to compare binding affinities under identical conditions in their assays.

      Thus, the EMSA data do suggest sequence specificity in binding of Swi6 to specific siRNA sequences (Figure S5) and implies specific residues in Swi6 being responsible for that. Thus, Identification of the residues in Swi6 involved in siRNA binding in the CD would definitely be interesting, as also the experimental confirmation of the consensus siRNA sequence. It may however be noted that as against the binding of Swi6 to siRNAs occurs through CD, that of Cen100 or GFP RNA was shown be through the hinge domain by Keller et al.

      The estimation of Kd by the Buhler group was based on NMR study, which we are not in a position to perform in the near future. Nonetheless, we did carry out EMSA study using the ‘Cen100’ RNA, same as the one used by the Keller et al study. Surprisingly, in contrast with the result of EMSA in agarose gel showing binding of Swi6 to “Cen100” RNA as reported by Keller et al, we fail to observe any binding in EMSA done in acrylamide gel. (The same is true of the RevCen 100). While this raises issues of why the Keller et al chose to do EMSA in agarose gel instead of the conventional approach of using acrylamide gel, it does lend support to our claim of stronger binding of Swi6 to siRNAs. Another relevant observation of binding of Swi6 to the “RevCen” RNA precursor RNAs but a detectable binding to siRNAs denoted as VI-IX (as measured by competition experiments, that are derived from RevCen RNA; Figure S4 and S7), which are derived by Dcr1 cleavage of the ‘’RevCen’’ RNA.

      We also disagree that we carried out EMSA with a small bunch of siRNAs. As indicated in Figure 1 and S1, we synthesized nearly 12 siRNAs representing the dg-dh repeats at Cen, mat and tel loci and measured their specificity of binding to Swi6 using EMSA assay by labeling the ones labelled “D”, “E” and “V” directly and those of the remaining ones by the latter’s ability to compete against the binding (Figure 1, S4). These results point to presence of a consensus sequence in siRNAs that shows highly specific and strong binding to Swi6 in the low micromolar range.

      Further, our claim of binding of Swi6 and not Swi63K>3A to siRNA in vivo is validated by RIP experiments, as shown in Fig 2 and S9.

      c) The regions of Swi6 that bind to siRNAs need to be identified and evidence must be provided that Swi6 binds to RNAs of a specific length, 20-22 mers, to support the claim that Swi6 binds to siRNAs. This is critical for all the subsequent experiments and claims in the study.

      We have provided both in vitro data, which is va;idiated in vivo by RIP experiments, as mentioned above. However, we agree that it wpuld be very interesting to identify the residues in Swi6 chromdomain responsible for binding to siRNA. However, such an investigation is beyond the scope of the present study.

      (2) a) The in vivo results do not validate Swi6 binding to specific RNAs, as stated by the authors. Swi6 pulldowns have been shown to be enriched for all heterochromatic proteins including the RITS complex. The sRNA binding observed by the authors is therefore likely to be mediated by Ago1/RITS.

      We disagree with the first comment. Our RIP experiments do validate the in vitro results (Fig 1, 2, S4 and S9), as argued above. The observation alluded to by the reviewer “Swi6 pulldowns have been shown to be enriched for all heterochromatic proteins including the RITS complex” is not inconsistent with our observation; it is possible that the siRNA may be released from the RITS complex and transferred to Swi6, possibly due to its higher affinity.

      Thus, we would like to suggest that the role of Swi6 is likely to be coincidental or subsequent to that of Ago1/RITS (see below). We think that the binding by Swi6 to the siRNA and siRNA-DNA hybrid and could be also carried out in cis at the level of siRNA-DNA hybrids.

      This point needs to be addressed in future studies.

      b) Most of the binding in Figure S8C seems to be non-specific.

      We would like to point out that the result in Figure S8C needs to be examined together with the Figure S8B, which shows RNA bound by Swi6 but not Swi63K-3A to hybridize with dg, dh and dh-k probes.

      c) In Figure S8D, the authors' data shows that Swi6 deletion does not derepress the rev dh transcript while dcr1 delete cells do, which is consistent with previous reports but does not relate to the authors' conclusions.

      The purpose of results shown in Figure S8D is just to compare the results of Swi6 with that of Swi63K-3A.

      d) Previous results have shown that swi6 delete cells have 20-fold fewer dg and dh siRNAs than swi6+ cells due to decreased RNA-dependent RNA polymerase complex recruitment and reduced siRNA amplification.

      This result is consistent with our results invoking a role of Swi6 in binding to, protecting and recruiting siRNAs to homologous sites.

      To find if the overall production of siRNA is compromised in swi6 3K->3A mutant, we i) calculated the RIP-Seq read counts for swi6 3K->3A , swi6+ and vector control in 200 bp genomic bins , ii) divided the Swi6 3K->3A and swi6+ signals by that of control, iii) removed the background using the criteria of signal value < 25% of max signal, and iv) counted the total reads (in excess to control) in all peak regions in both samples.  This revealed a total count of 10878 and 8994 respectively for Swi6 3K->3A  and swi6+ samples, possibly implying that the overall siRNA production is not compromised in the Swi6 3K->3A mutant.

      (3) a) The RIP-seq data are difficult to interpret as presented. The size distribution of bound small RNAs, and where they map along the genome should be shown as for example presented in previous Ago1 sRNA-seq experiments.

      Please see the response to 2(d).

      b) It is also unclear whether the defects in sRNA binding observed by the authors represent direct sRNA binding to Swi6 or co-precipitation of Ago1-bound sRNAs.

      The correspondence between our in vivo and in vitro results suggests that the binding to Swi6 would be direct. We do not observe a complete correspondence between the Swi6- and Ago-bound siRNAs. We think Swi6 binding may be coincident with or following RITS complex formation.

      This point will be discussed in the Revision.

      The authors should also sequence total sRNAs to test whether Swi6-3A affects sRNA synthesis, as is the case in swi6 delete cells.

      Please see response to 2(d) above.

      (4) The authors examine the effects of Swi6-3A mutant by overexpression from the strong nmt1 promoter. Heterochromatin formation is sensitive to the dosage of Swi6. These experiments should be performed by introducing the 3A mutations at the endogenous Swi6 locus and effects on Swi6 protein levels should be tested.

      Although we agree, we think that the heterochromatin formation is occurring in presence of nmt1-driven Swi6 but not Swi63K>3A, as indicated by the phenotype and Swi6 enrichment at otr1R::ade6, imr1::ura4 and his3-telo (Figure 3) and mating type (Fig. S10). Furthermore, the both GFP-Swi6 and GFPSwi63K>3A are expressed at similar level (Fig. S8A).

      (5) The authors' data indicate an impairment of silencing in Swi6-3A mutant cells but whether this is due to a general lower affinity for nucleosomes, DNA, RNA, or as claimed by the authors, siRNAs is unclear. These experiments are consistent with previous findings suggesting an important role for basic residues in the HP1 hinge region in gene silencing but do not reveal how the hinge region enhances silencing.

      Our study aims to correlate the binding of Swi6 but not Swi63K-3A to siRNA with its localization to heterochromatin. A similar difference in binding of Swi6 but not Swi63K-3A to siRNA-DNA hybrid, together with sensitivity of silencing and Swi6 localization to heterochromatin to RNaseH support the above correlations as being causally connected.

      In terms of mechanism of binding, we need to clarify that the primary mode of binding is through the CD and not the hinge domain, although the hinge domain does influence this binding. This result is different from those of Keller et al.

      We have some structural data based on preliminary SAXS experiment supporting binding of siRNA to the CD and influence of the hinge domain on this binding. However, this line of investigation need to be extended and will be subject of future investigations.

      (6) RNase H1 overexpression may affect Swi6 localization and silencing indirectly as it would lead to a general reduction in R loops and RNA-DNA hybrids across the genome. RNaseH1 OE may also release chromatin-bound RNAs that act as scaffolds for siRNA-Ag1/RITS complexes that recruit Clr4 and ultimately Swi6.

      These are formal possibilities. However, the correlation between swi6 binding to siRNA-DNA hybrid and delocalization upon RNase H1 treatment argues for a more direct link.

      (7) Examples of inaccurate presentation of the literature.

      a) The authors state that "RNA binding by the murine HP1 through its hinge domains is required for heterochromatin assembly (Muchardt et al, 2002). The cited reference provides no evidence that HP1 RNA binding is required for heterochromatin assembly. Only the hinge region of bacterially produced HP1 contributes to its localization to DAPI-stained heterochromatic regions in fixed NIH 3T3 cells.

      Noted. Statement will be corrected.

      b) "... This scenario is consistent with the loss of heterochromatin recruitment of Swi6 as well as siRNA generation in rnai mutants (Volpe et al, 2002)." Volpe et al. did not examine changes in siRNA levels in swi6 mutant cells. In fact, no siRNA analysis of any kind was reported in Volpe et al., 2002.

      Correct.  We only say that Swi6 recruitment is reduced in rnai mutants and correlate it with ability of SWi6 to bind to siRNA generated by RNAi and subsequently to siRNA-DNA hybrid.

      Reviewer #2 (Public review):

      The aim of this study is to investigate the role of Swi6 binding to RNA in heterochromatin assembly in fission yeast. Using in vitro protein-RNA binding assays (EMSA) they showed that Swi6/HP1 binds centromere-derived siRNA (identified by Reinhardt and Bartel in 2002) via the chromodomain and hinge domains. They demonstrate that this binding is regulated by a lysine triplet in the conserved region of the Swi6 hinge domain and that wild-type Swi6 favours binding to DNA-RNA hybrids and siRNA, which then facilitates, rather than competes with, binding to H3K9me2 and to a lesser extent H3K9me3.

      However, the majority of the experiments are carried out in swi6 null cells overexpressing wild-type Swi6 or Swi63K-3A mutant from a very strong promoter (nmt1). Both swi6 null cells and overexpression of Swi6 are well known to exhibit phenotypes, some of which interfere with heterochromatin assembly. This is not made clear in the text.

      We think that the argument is not valid as we show that swi6 but not Swi63K-3A could restore silencing at imr1::ura4, otr1::ade6 and his3-telo (Fig 3) and mating type (Fig. S10), when transformed into a swi6D strain.

      Whilst the RNA binding experiments show that Swi6 can indeed bind RNA and that binding is decreased by Swi63K-3A mutation in vitro (confusingly, they only much later in the text explained that these 3 bands represent differential binding and that II is likely an isotherm). The gels showing these data are of poor quality and it is unclear which bands are used to calculate the Kd.

      We disagree with the comment about the quality of EMSA data. We think it is of similar quality or better than that of Keller et al, except in some cases, like Fig 1D, a shorter exposure shown to distinguish the slowest shifted band has caused the remaining bands to look fainter.

      RNA-seq data shows that overall fewer siRNAs are produced from regions of heterochromatin in the Swi63K-3A mutant so it is unsurprising that analysis of siRNA-associated motifs also shows lower enrichment (or indeed that they share some similarities, given that they originate from repeat regions).

      Please see response to comment 2(d) of the first reviewer above.

      It is not clear which bands are being alluded to. However, we‘ll rectify any gaps in information in the revision.

      The experiments are seemingly linked yet fail to substantiate their overall conclusions. For instance, the authors show that the Swi63K-3A mutant displays reduced siRNA binding in vitro (Figure 1D) and that H3K9me2 levels at heterochromatin loci are reduced in vivo (Figure 3C-D). They conclude that Swi6 siRNA binding is important for Swi6 heterochromatin localization, whilst it remains entirely possible that heterochromatin integrity is impaired by the Swi63K-3A mutation and hence fewer siRNAs are produced and available to bind. Their interpretation of the data is really confusing.

      Our argument is that the lack of binding by Swi63K>3A to siRNA can explain the loss of recruitment to heterochromatin loci and thus affect the integrity of heterochroamtin; the recruitment of Swi6 can occur possibly by binding initially to siRNA and thereafter as siRNA-DNA hybrid. However, the overall level of siRNAs is not affected, as in 2(D) above. This interpretation is supported by results of ChIP assay and confocal experiments, as also by the effect of RNaseH1 in the recruitment of Swi6.

      The authors go on to show that Swi63K-3A cells have impaired silencing at all regions tested and the mutant protein itself has less association with regions of heterochromatin. They perform DNA-RNA hybrid IPs and show that Swi63K-3A cells which also overexpress RNAseH/rnh1 have reduced levels of dh DNA-RNA hybrids than wild-type Swi6 cells. They interpret this to mean that Swi6 binds and protects DNA-RNA hybrids, presumably to facilitate binding to H3K9me2. The final piece of data is an EMSA assay showing that "high-affinity binding of Swi6 to a dg-dh specific RNA/DNA hybrid facilitates the binding to Me2-K9-H3 rather than competing against it." This EMSA gel shown is of very poor quality, and this casts doubt on their overall conclusion.

      We do agree with the reviewer about the quality of EMSA (Fig. 5B). However, as may be noticed in the EMSA for siRNA-DNA hybrid binding  (Fig 4A), the bands of Swi6-bound siRNA-DNA hybrid are extremely retarded. Hence the EMSA for subsequent binding by H3-K9-Me peptides required a longer electrophoretic run, which led to reduction in the sharpness of the bands. Nevertheless, the data does indicate binding efficiency in the order H3K9-Me2> H3-K9-Me3 > H3-K9-Me0. Having said that, we plan to repeat the EMSA or address the question by other methods, like SPR.

      Unfortunately, the manuscript is generally poorly written and difficult to comprehend. The experimental setups and interpretations of the data are not fully explained, or, are explained in the wrong order leading to a lack of clarity. An example of this is the reasoning behind the use of the cid14 mutant which is not explained until the discussion of Figure 5C, but it is utilised at the outset in Figure 5A.

      We tend to agree somewhat and will attempt to submit a revised version with greater clarity, as also the explanation of experiment with cid14D strain.

      Another example of this lack of clarity/confusion is that the abstract states "Here we provide evidence in support of RNAi-independent recruitment of Swi6". Yet it then states "We show that...Swi6/HP1 displays a hierarchy of increasing binding affinity through its chromodomain to the siRNAs corresponding to specific dg-dh repeats, and even stronger binding to the cognate siRNA-DNA hybrids than to the siRNA precursors or general RNAs." RNAi is required to produce siRNAs, so their message is very unclear. Moreover, an entire section is titled "Heterochromatin recruitment of Swi6-HP1 depends on siRNA generation" so what is the author's message?

      The reviewer has correctly pointed out the error. Indeed, our results actually indicate an RNAi-dependent rather than independent mode of recruitment. Rather, we would like to suggest an H3-K9-Me2-indpendnet recruitment of Swi6. We will rectify this error in our revised manuscript.

      The data presented, whilst sound in some parts is generally overinterpreted and does not fully support the author's confusing conclusions. The authors essentially characterise an overexpressed Swi6 mutant protein with a few other experiments on the side, that do not entirely support their conclusions. They make the point several times that the KD for their binding experiments is far higher than that previously reported (Keller et al Mol Cell 2012) but unfortunately the data provided here are of an inferior quality and thus their conclusions are neither fully supported nor convincing.

      We have used the method of Heffler et al (2012) to compute the Kd from EMSA data.

    1. Résumé de la vidéo [00:00:00][^1^][1] - [00:23:53][^2^][2] : Ce webinaire, animé par Alice Pierre-François, se concentre sur l'animation d'un collectif SISM (Semaines d'Information sur la Santé Mentale) en France. Il aborde les stratégies pour engager les membres sur le long terme, les partenariats possibles, et les méthodes d'animation pour susciter la motivation. Des intervenants partagent leurs expériences en matière de coordination d'événements SISM et d'animation de collectifs locaux.

      Points saillants : + [00:00:00][^3^][3] Introduction et objectifs du webinaire * Présentation par Alice Pierre-François * Discussion sur l'engagement des membres et l'animation des collectifs * Conseils pour la gestion des collectifs SISM + [00:01:04][^4^][4] Intervenants et leurs expériences * Partage d'expériences par divers intervenants * Exemples de coordination et d'animation de collectifs * Importance de l'engagement et de la communication + [00:03:26][^5^][5] Règles d'échange et modération du webinaire * Modération par Léa Sonet, responsable communication du Psycom * Rappel des règles pour le bon déroulement du webinaire * Encouragement à l'interaction via le chat + [00:07:35][^6^][6] Historique et importance des SISM * Explication des SISM, un rendez-vous annuel sur la santé mentale * Objectifs et organisation des SISM * Rôle du collectif national et des collectifs locaux + [00:11:21][^7^][7] Présentation de Widad l Wafi sur les SISM à Vichy * Organisation des SISM par le collectif de Vichy communauté * Diversité des acteurs et événements organisés * Exemples d'actions menées lors des SISM 2023 + [00:22:15][^8^][8] Présentation de Mélissa sur les SISM dans le département de l'Ain * Contexte géographique et démographique de l'Ain * Adaptation des événements SISM aux spécificités du département * Importance de l'accès aux soins et de la communication

      Résumé de la vidéo [00:23:55][^1^][1] - [00:48:17][^2^][2]:

      Cette vidéo présente un webinaire sur l'animation d'un collectif SISM (Semaines d'Information sur la Santé Mentale) en juin 2024. Elle aborde l'évolution des SISM dans le département de l'Indre depuis leur création en 2013, leur intégration dans le projet territorial de santé mentale en 2020, et la coordination par le service de santé mentale de l'Indre depuis 2021. La vidéo met en lumière l'importance de la mutualisation des moyens, la participation des membres du collectif, et l'évaluation de la satisfaction des participants.

      Points forts: + [00:23:55][^3^][3] Historique et évolution des SISM * Création en 2013 par un petit groupe * Évolution et intégration dans le projet territorial de santé mentale en 2020 * Coordination par le service de santé mentale de l'Indre depuis 2021 + [00:26:01][^4^][4] Participation et organisation * Environ 48 partenaires en 2023 * Réalisation de 26 événements en 2023 * Types d'événements variés : ateliers, conférences, débats, etc. + [00:29:28][^5^][5] Le collectif EO et ses objectifs * Existence depuis 2016 * Objectifs de décloisonnement et de renforcement des liens entre acteurs * Organisation de manifestations variées en 2023 + [00:39:10][^6^][6] Rôles et partenariats au sein des collectifs * Importance de la clarté des rôles et des missions * Mutualisation des moyens et participation active des membres * Évaluation de la satisfaction et amélioration continue

      Résumé de la vidéo [00:48:20][^1^][1] - [01:11:41][^2^][2]:

      Cette vidéo présente un webinaire sur l'animation d'un collectif SISM (Semaines d'Information sur la Santé Mentale) en juin 2024. Les intervenants discutent des méthodes d'organisation, de la diversité des acteurs impliqués, et de l'importance de l'interconnaissance et du soutien mutuel pour le succès des initiatives.

      Points forts: + [00:48:20][^3^][3] Organisation et partenariats * Importance de l'offre et de la demande de ressources * Exemple d'un débat universitaire facilité par la disponibilité d'une salle * Émergence de beaux partenariats + [00:49:16][^4^][4] Rôle et diversité au sein du collectif * Composition variée du collectif inscrite dans la charte * Représentation des structures hospitalières, associations d'usagers, et autres * Deux sous-groupes : coordination et communication + [00:51:57][^5^][5] Interconnaissance et engagement * Interconnaissance préalable entre certains membres * Cultivation de liens à travers différents projets * Partage d'expériences et soutien dans les actions + [00:56:21][^6^][6] Importance de la présence politique * Impact de la présence politique sur la valorisation des actions * Objectif futur de renforcer le lien avec les élus + [00:59:32][^7^][7] Méthodes d'animation d'un collectif * Présentation d'outils d'animation pour faciliter l'engagement * Exemple d'un appel à participation pour élargir le collectif + [01:07:59][^8^][8] Animation et réunions plénières du collectif * Cinq réunions plénières annuelles pour l'organisation * Présentiel privilégié pour l'accueil et la convivialité * Partage d'expériences et création de partenariats lors des réunions

      Résumé de la vidéo [01:11:45][^1^][1] - [01:23:14][^2^][2]:

      Cette partie du webinaire se concentre sur l'animation d'un collectif SISM en juin 2024, mettant en lumière les stratégies de communication, les outils de coordination et les pratiques d'engagement des membres.

      Points forts: + [01:11:45][^3^][3] Communication et visibilité * Distribution de flyers et programmes communs * Utilisation de QR codes et cartes pour localiser les actions * Soutien logistique par les coordinateurs + [01:14:55][^4^][4] Facilitation et soutien aux membres * Simplification de la participation au collectif * Prise en charge interne de la production de matériel promotionnel * Financement de la convivialité et des réunions par la communauté + [01:17:01][^5^][5] Planification et organisation des réunions * Utilisation d'outils participatifs comme Doodle pour planifier * Rotation des lieux de réunion pour une meilleure connaissance mutuelle * Création d'un padlet pour partager les coordonnées et informations + [01:21:00][^6^][6] Conseils et recommandations pour l'animation * Importance de l'horizontalité, convivialité et partage d'expérience * Bienveillance, suppression des rapports de force et rappel des enjeux * Créativité dans l'animation du collectif pour renforcer l'identité

    1. Video summary [00:00:00][^1^][1] - [00:54:09][^2^][2]:

      Cette vidéo présente une discussion approfondie sur la zététique, l'esprit critique, et les croyances, avec Samuel Buisseret.

      Il aborde son parcours personnel, ses critiques du milieu sceptique, et son livre "Arrêter de croire n'importe quoi".

      Highlights: + [00:00:00][^3^][3] Introduction et présentation * Samuel Buisseret se présente * Discussion sur la zététique et l'esprit critique * Annonce de l'arrêt de sa chaîne YouTube + [00:02:26][^4^][4] Critique du milieu sceptique * Distinction entre outil et application * Sensibilité particulière de Samuel en tant qu'ancien complotiste * Importance de l'autocritique dans la zététique + [00:04:02][^5^][5] Genèse du livre de Samuel * Commande des éditions de bouc supérieures * Synthèse de huit années de pratique zététique * Révision et contextualisation de ses opinions + [00:23:32][^6^][6] Création de la chaîne YouTube * Motivation personnelle et événement déclencheur * Première vidéo et découverte de l'esprit critique * Importance de la prudence épistémique + [00:52:14][^7^][7] Résultats en parapsychologie * Expériences et résultats significatifs * Importance de la rigueur méthodologique * Contribution de Renaud Evrard et Jean-Michel Abrassart

    1. Résumé de la vidéo [00:00:00][^1^][1] - [01:54:14][^2^][2]:

      Cette vidéo présente une réunion de l'Institut Bertrand Schwartz, axée sur la participation des jeunes et l'implication des élus locaux dans les missions locales.

      Les intervenants discutent des bénéfices et des risques de la proximité avec les administrés, ainsi que des changements nécessaires dans les pratiques des élus pour favoriser la participation citoyenne.

      Points forts : + [00:00:00][^3^][3] Introduction et déroulé de la réunion * Présentation des intervenants * Objectifs de la réunion * Importance de la participation des jeunes + [00:02:00][^4^][4] Rappel de la démarche et des principes * Implication des élus locaux * Contribution des jeunes aux politiques * Importance de la décentralisation + [00:05:00][^5^][5] Recherche-action pour la participation des jeunes * Trois types d'acteurs : jeunes, professionnels, élus * Objectifs des webinaires * Changement de posture des élus + [00:10:00][^6^][6] Outil de mesure de la participation * Types de participation : consultants, collaborateurs, pilotes * Importance de la non-participation assumée * Risques de fausse participation + [00:31:20][^7^][7] Discussion sur l'implication des élus * Importance des échanges directs avec les jeunes * Changement de contexte avec la garantie jeune * Rôle des missions locales comme médiateurs

    1. Résumé vidéo [00:00:05][^1^][1] - [00:21:55][^2^][2]:

      Cette vidéo présente une recherche sur les prises de position des élèves sur le bien-être animal en élevage, une question socialement vive.

      Elle montre comment les élèves articulent des connaissances, des émotions et des valeurs pour construire leurs arguments, et comment ils utilisent parfois des stratégies de défense pour justifier leurs pratiques.

      Elle propose des leviers pédagogiques pour accompagner les élèves à avoir un regard réflexif et à réduire les dissonances.

      Points forts: + [00:00:05][^3^][3] La présentation de la chercheuse et de son domaine de recherche * Enseignante chercheuse à l'ENSFEA * Spécialiste de la didactique des Questions Socialement Vives * Auteure d'un chapitre d'ouvrage sur le sujet + [00:01:27][^4^][4] La définition et les caractéristiques des Questions Socialement Vives * Questions à enjeux de société controversées dans les champs de référence, la société et la classe * Exemple du changement climatique * Articulation entre domaine cognitif, émotionnel et axiologique + [00:03:30][^5^][5] Le cas du bien-être animal en élevage et le scénario pédagogique mis en œuvre * Problème des interventions douloureuses sur les animaux * Séances de cours, de TP et de retours d'expérience * Recueil des discours des élèves lors du dilemme éthique et professionnel + [00:06:12][^6^][6] Deux exemples d'élèves illustrant des situations contrastées * Aymeric, fils d'éleveur, qui change de position selon les contextes * Rudy, élève peu impliqué, qui propose des pratiques douloureuses * Analyse des connaissances, des émotions et des valeurs mobilisées + [00:18:43][^7^][7] Les leviers pour mieux comprendre les prises de position des élèves * Nécessité des connaissances mais pas suffisantes * Accompagnement du regard réflexif, de la verbalisation des émotions et des valeurs * Diversification du vécu et confrontation d'alternatives * Réduction des dissonances et ouverture de nouveaux possibles + [00:21:24][^8^][8] La référence du chapitre d'ouvrage et l'invitation à échanger en mars * Ouvrage Educagri : "L'éthique dans l'enseignement agricole" * Chapitre : "Points de vue des élèves sur leur bien-être en classe" * Auteure : Amélie Lipp

    1. Temps Forts de la Vidéo "De l’indocilité des jeunesses populaires. Apprenti.e.s et élèves de lycées professionnels"

      Voici les principaux temps forts de la vidéo, accompagnés d'une description des sujets abordés et des timestamps :

      1. Introduction et présentation de l'ouvrage (0:00-2:00):

      • Prisca Kergoat, professeure des universités en sociologie et directrice du laboratoire CERTOP, présente son ouvrage "De l’indocilité des jeunesses populaires. Apprenti.e.s et élèves de lycées professionnels" (2022).

      2. Point de départ de la recherche (2:00-3:40):

      • La recherche part d'une discussion avec la littérature scientifique existante sur les élèves orientés vers des métiers d'exécution.
      • Cette littérature met en évidence la force des rapports de domination, mais déduit souvent que cette domination annihile la capacité d'agir des jeunes.
      • L'objectif de l'ouvrage est de démontrer que ces jeunes ont une autonomie de pensée et une capacité à déconstruire leurs conditions, justifiant l'utilisation du concept d' "indocilité".

      3. Objectifs et méthodologie de la recherche (3:40-7:20):

      • Objectifs:
        • Caractériser les contraintes exercées sur les élèves à travers leur distribution dans les espaces de formation professionnelle.
        • Mettre à jour l'intensification du sentiment d'injustice et l'émergence de pratiques et de pensées indociles.
      • Méthodologie:
        • Deux enquêtes collectives :
          • Conditions de vie et d'études des élèves de lycées professionnels (financée par le Ministère de l'Éducation Nationale).
          • Mesure des discriminations dans l'accès à l'apprentissage (financée par le Ministère de la Jeunesse).
        • Protocole articulant :
          • Étude extensive par questionnaire (environ 3000 questionnaires) auprès d'élèves et d'apprentis de différentes spécialités (féminisées, masculinisées et mixtes).
          • Entretiens semi-directifs (43 entretiens) auprès d'enseignants et d'élèves retraçant l'expérience de l'orientation, la recherche d'une place en entreprise et l'entrée en formation.

      4. L'orientation scolaire et professionnelle (7:20-17:00):

      • Contexte de l'orientation depuis les années 90:
        • Unification progressive du système éducatif et concurrence avec l'enseignement général et technologique.
        • Évolution des caractéristiques de la population des élèves de l'enseignement professionnel, marquée par l'association de l'origine populaire et des difficultés scolaires.
      • Réformes de l'orientation (loi de 1989 et 2018) :
        • Promotion d'un individu rationnel, entrepreneur de lui-même, libre de ses choix et capable de s'émanciper.
      • Arrêt des politiques de redoublement :
        • Élèves plus jeunes au moment de l'orientation (14-15 ans pour les lycées professionnels).
      • Typologie des rapports à l'orientation:
        • L'ouvrage présente une typologie des rapports à l'orientation, mettant en lumière la diversité des expériences.
      • L'expérience partagée de l'humiliation:
        • Sentiment d'humiliation (mépris de classe et honte de soi) ressenti même par les élèves ayant une vocation pour le métier choisi.
        • L'humiliation comme une expérience ordinaire, particulièrement pour les élèves en difficulté scolaire.
        • La décision d'orientation, présentée comme légitime et réglementaire, s'appuie sur les notes et un jugement collectif, la rendant d'autant plus humiliante.
        • La décision d'orientation comme un jugement de classe fondé sur une représentation de la culture légitime.
        • Verbatim d'une élève illustrant ce sentiment d'humiliation.
      • Conséquences des réformes de démocratisation scolaire:
        • Responsabilisation individuelle des réussites et des échecs.
        • Stigmatisation accrue des jeunes orientés.
      • Résistance des élèves:
        • Les élèves n'intériorisent pas passivement les verdicts scolaires.

      5. La recherche d'une place en entreprise (17:00-27:00):

      • Contexte de la recherche d'une entreprise :
        • Inversion hiérarchique initiée dans les années 80 : la préférence pour la formation en école est remplacée par une prédilection pour l'apprentissage en entreprise.
        • Valorisation de l'apprentissage au détriment du lycée professionnel.
        • Apprentissage de plus en plus sélectif.
        • Graphique illustrant la préférence des jeunes pour l'apprentissage.
      • Sélection accentuée :
        • 30% des élèves interrogés en lycée professionnel n'ont pas réussi à accéder à l'apprentissage.
      • Typologie des pratiques de recherche:
        • Classe 1 (31%): Accès rapide à l'apprentissage grâce à un capital d'autochtonie (réseau familial et parental).
          • Principalement des garçons issus de la fraction stable des classes populaires, avec une sur-représentation des indépendants, petits commerçants et artisans.
        • Classe 2 (56%): Souhait d'entrer en apprentissage resté au stade de la simple velléité.
          • Jeunes, issus des fractions paupérisées des classes populaires, avec une sur-représentation des étrangers ou issus de l'immigration.
          • Anticipation des obstacles et lucidité sociale conduisant à considérer le lycée professionnel comme plus protecteur.
        • Classe 3: Forte mobilisation dans la recherche d'une place en apprentissage (recherches durant jusqu'à 3 mois, contactant jusqu'à 100 entreprises).
          • Certains ont accédé à l'apprentissage, d'autres non.
          • Jeunes issus des classes paupérisées, avec une sur-représentation des filles.
      • Conséquences de la sélectivité de l'apprentissage :
        • Éviction d'une partie de la population de l'accès à l'apprentissage.
        • La performance de l'apprentissage en matière d'insertion professionnelle est en partie due à l'éviction des jeunes les plus fragilisés (milieux précarisés, filles, jeunes issus de l'immigration).
        • Concurrence mécanique de l'apprentissage avec le taux d'insertion des jeunes issus des lycées professionnels.

      6. L'intensification du sentiment d'injustice (27:00-32:00):

      • Sentiment d'injustice exprimé par près de la moitié des jeunes interrogés, bien qu'ils ne soient que 15% à déclarer avoir été discriminés.
      • Verbatims illustrant le sentiment d'injustice:
        • Capacité des élèves à discuter les jugements professoraux et à repérer le double discours valorisant l'enseignement professionnel tout en encourageant les meilleurs élèves à s'orienter vers l'enseignement général.
        • Insistance sur les conseils d'orientation enfermant les jeunes dans une offre de formation genrée, plus restreinte pour les filles.
        • Difficulté de faire un choix d'orientation à l'adolescence et sentiment d'être obligés de "vieillir" prématurément.
        • Nécessité d'adopter le modèle de féminité des classes intermédiaires occidentales pour trouver et garder une place en entreprise.
        • Injonctions contradictoires : considérés comme des enfants en établissement, mais devant se comporter comme des adultes autonomes et responsables en entreprise.

      7. Conclusion (32:00-33:45):

      • Les jeunes de l'enseignement professionnel savent débusquer et mettre à jour les rapports de domination.
      • Ils déstabilisent les évidences du sens commun.
      • Leurs expériences et pratiques sociales développent des pratiques indociles.
    1. Résumé de la vidéo [00:01:48][^1^][1] - [00:28:45][^2^][2]:

      Cette vidéo présente une conférence de Patrick Obertelli sur la confiance, organisée par l'ISF (Ingénieur et Scientifique de France).

      Elle aborde l'importance de la confiance dans la société, les causes de la défiance actuelle, et comment la confiance peut être reconstruite à travers les institutions et les interactions individuelles.

      Points forts:

      • [00:01:48][^3^][3] Introduction à l'ISF
        • Présentation de l'organisation et de ses actions
        • Importance du répertoire des ingénieurs et du label d'ingénieur diplômé
      • [00:06:00][^4^][4] Présentateurs et invités
        • Introduction des intervenants, Patrick Obertelli et Marc Ridel
        • Leurs contributions et parcours professionnels
      • [00:10:05][^5^][5] La confiance dans la société
        • Statistiques sur la confiance en France
        • Discussion sur la confiance envers les institutions et les entreprises
      • [00:15:09][^6^][6] Causes de la défiance
        • Analyse historique de la confiance et de la défiance
        • Impact des crises économiques et sociales sur la confiance
      • [00:19:01][^7^][7] Gestion de l'incertitude
        • Approches autoritaires vs démocratiques dans la gestion de l'incertitude
        • L'importance de la confiance face à l'incertitude
      • [00:23:02][^8^][8] Discussion ouverte
        • Échanges avec l'audience sur la confiance et la technologie
        • Comment la confiance influence les relations humaines et l'avenir Résumé de la vidéo [00:28:46][^1^][1] - [00:52:56][^2^][2]:

      Cette partie de la vidéo aborde la notion de confiance dans divers contextes, notamment la foi chrétienne, les idées politiques, et les institutions.

      Patrick Obertelli discute de la perte de confiance dans les élites, l'intolérance envers les inégalités, et les défis de comprendre un monde complexe.

      Il souligne l'importance de l'intelligence artificielle et de la technologie dans la gestion des organisations et la nécessité de construire la confiance à travers l'efficacité, l'information honnête, et la cohérence entre les paroles et les actes.

      Points forts: + [00:28:46][^3^][3] La confiance dans la foi et la politique * La confiance chrétienne dans un avenir meilleur * La confiance dans les institutions politiques et leurs défis * La remise en question des idéologies et des systèmes établis + [00:32:01][^4^][4] La perte de confiance dans les élites * La centralisation de l'État et les inégalités dans la santé * L'intolérance croissante envers les injustices * La complexité des processus décisionnels et la technologie + [00:38:00][^5^][5] La société de l'immédiateté et le rôle de l'IA * La difficulté à comprendre le monde actuel * L'impact de l'intelligence artificielle sur les décisions * La nécessité de prendre le temps pour construire la confiance + [00:45:00][^6^][6] Construire la confiance dans les organisations * La cohérence entre les paroles, les actes et les sentiments * L'importance de l'efficacité et de l'information honnête * La préservation des biens communs et l'anticipation des problèmes à long terme Résumé de la vidéo [00:28:46][^1^][1] - [00:52:56][^2^][2]:

      Cette partie de la vidéo aborde la notion de confiance dans divers contextes, notamment la foi chrétienne, les idées politiques, et les institutions. Patrick Obertelli discute de la perte de confiance dans les élites, l'intolérance envers les inégalités, et les défis de comprendre un monde complexe.

      Il souligne l'importance de l'intelligence artificielle et de la technologie dans la gestion des organisations et la nécessité de construire la confiance à travers l'efficacité, l'information honnête, et la cohérence entre les paroles et les actes.

      Points forts: + [00:28:46][^3^][3] La confiance dans la foi et la politique * La confiance chrétienne dans un avenir meilleur * La confiance dans les institutions politiques et leurs défis * La remise en question des idéologies et des systèmes établis + [00:32:01][^4^][4] La perte de confiance dans les élites * La centralisation de l'État et les inégalités dans la santé * L'intolérance croissante envers les injustices * La complexité des processus décisionnels et la technologie + [00:38:00][^5^][5] La société de l'immédiateté et le rôle de l'IA * La difficulté à comprendre le monde actuel * L'impact de l'intelligence artificielle sur les décisions * La nécessité de prendre le temps pour construire la confiance + [00:45:00][^6^][6] Construire la confiance dans les organisations * La cohérence entre les paroles, les actes et les sentiments * L'importance de l'efficacité et de l'information honnête * La préservation des biens communs et l'anticipation des problèmes à long terme Résumé de la vidéo [00:28:46][^1^][1] - [00:52:56][^2^][2]:

      Points forts: + [00:28:46][^3^][3] La confiance dans la foi et la politique * La confiance chrétienne dans un avenir meilleur * La confiance dans les institutions politiques et leurs défis * La remise en question des idéologies et des systèmes établis + [00:32:01][^4^][4] La perte de confiance dans les élites * La centralisation de l'État et les inégalités dans la santé * L'intolérance croissante envers les injustices * La complexité des processus décisionnels et la technologie + [00:38:00][^5^][5] La société de l'immédiateté et le rôle de l'IA * La difficulté à comprendre le monde actuel * L'impact de l'intelligence artificielle sur les décisions * La nécessité de prendre le temps pour construire la confiance + [00:45:00][^6^][6] Construire la confiance dans les organisations * La cohérence entre les paroles, les actes et les sentiments * L'importance de l'efficacité et de l'information honnête * La préservation des biens communs et l'anticipation des problèmes à long terme

      sociologie

    2. Résumé de la vidéo [00:01:48][^1^][1] - [00:28:45][^2^][2]:

      Cette vidéo présente une conférence de Patrick Obertelli sur la confiance, organisée par l'ISF (Ingénieur et Scientifique de France). Elle aborde l'importance de la confiance dans la société, les causes de la défiance actuelle, et comment la confiance peut être reconstruite à travers les institutions et les interactions individuelles.

      Points forts: + [00:01:48][^3^][3] Introduction à l'ISF * Présentation de l'organisation et de ses actions * Importance du répertoire des ingénieurs et du label d'ingénieur diplômé + [00:06:00][^4^][4] Présentateurs et invités * Introduction des intervenants, Patrick Obertelli et Marc Ridel * Leurs contributions et parcours professionnels + [00:10:05][^5^][5] La confiance dans la société * Statistiques sur la confiance en France * Discussion sur la confiance envers les institutions et les entreprises + [00:15:09][^6^][6] Causes de la défiance * Analyse historique de la confiance et de la défiance * Impact des crises économiques et sociales sur la confiance + [00:19:01][^7^][7] Gestion de l'incertitude * Approches autoritaires vs démocratiques dans la gestion de l'incertitude * L'importance de la confiance face à l'incertitude + [00:23:02][^8^][8] Discussion ouverte * Échanges avec l'audience sur la confiance et la technologie * Comment la confiance influence les relations humaines et l'avenir Résumé de la vidéo [00:28:46][^1^][1] - [00:52:56][^2^][2]:

      Cette partie de la vidéo aborde la notion de confiance dans divers contextes, notamment la foi chrétienne, les idées politiques, et les institutions. Patrick Obertelli discute de la perte de confiance dans les élites, l'intolérance envers les inégalités, et les défis de comprendre un monde complexe. Il souligne l'importance de l'intelligence artificielle et de la technologie dans la gestion des organisations et la nécessité de construire la confiance à travers l'efficacité, l'information honnête, et la cohérence entre les paroles et les actes.

      Points forts: + [00:28:46][^3^][3] La confiance dans la foi et la politique * La confiance chrétienne dans un avenir meilleur * La confiance dans les institutions politiques et leurs défis * La remise en question des idéologies et des systèmes établis + [00:32:01][^4^][4] La perte de confiance dans les élites * La centralisation de l'État et les inégalités dans la santé * L'intolérance croissante envers les injustices * La complexité des processus décisionnels et la technologie + [00:38:00][^5^][5] La société de l'immédiateté et le rôle de l'IA * La difficulté à comprendre le monde actuel * L'impact de l'intelligence artificielle sur les décisions * La nécessité de prendre le temps pour construire la confiance + [00:45:00][^6^][6] Construire la confiance dans les organisations * La cohérence entre les paroles, les actes et les sentiments * L'importance de l'efficacité et de l'information honnête * La préservation des biens communs et l'anticipation des problèmes à long terme Résumé de la vidéo [00:28:46][^1^][1] - [00:52:56][^2^][2]:

      Cette partie de la vidéo aborde la notion de confiance dans divers contextes, notamment la foi chrétienne, les idées politiques, et les institutions. Patrick Obertelli discute de la perte de confiance dans les élites, l'intolérance envers les inégalités, et les défis de comprendre un monde complexe. Il souligne l'importance de l'intelligence artificielle et de la technologie dans la gestion des organisations et la nécessité de construire la confiance à travers l'efficacité, l'information honnête, et la cohérence entre les paroles et les actes.

      Points forts: + [00:28:46][^3^][3] La confiance dans la foi et la politique * La confiance chrétienne dans un avenir meilleur * La confiance dans les institutions politiques et leurs défis * La remise en question des idéologies et des systèmes établis + [00:32:01][^4^][4] La perte de confiance dans les élites * La centralisation de l'État et les inégalités dans la santé * L'intolérance croissante envers les injustices * La complexité des processus décisionnels et la technologie + [00:38:00][^5^][5] La société de l'immédiateté et le rôle de l'IA * La difficulté à comprendre le monde actuel * L'impact de l'intelligence artificielle sur les décisions * La nécessité de prendre le temps pour construire la confiance + [00:45:00][^6^][6] Construire la confiance dans les organisations * La cohérence entre les paroles, les actes et les sentiments * L'importance de l'efficacité et de l'information honnête * La préservation des biens communs et l'anticipation des problèmes à long terme Résumé de la vidéo [00:28:46][^1^][1] - [00:52:56][^2^][2]:

      Cette partie de la vidéo aborde la notion de confiance dans divers contextes, notamment la foi chrétienne, les idées politiques, et les institutions. Patrick Obertelli discute de la perte de confiance dans les élites, l'intolérance envers les inégalités, et les défis de comprendre un monde complexe. Il souligne l'importance de l'intelligence artificielle et de la technologie dans la gestion des organisations et la nécessité de construire la confiance à travers l'efficacité, l'information honnête, et la cohérence entre les paroles et les actes.

      Points forts: + [00:28:46][^3^][3] La confiance dans la foi et la politique * La confiance chrétienne dans un avenir meilleur * La confiance dans les institutions politiques et leurs défis * La remise en question des idéologies et des systèmes établis + [00:32:01][^4^][4] La perte de confiance dans les élites * La centralisation de l'État et les inégalités dans la santé * L'intolérance croissante envers les injustices * La complexité des processus décisionnels et la technologie + [00:38:00][^5^][5] La société de l'immédiateté et le rôle de l'IA * La difficulté à comprendre le monde actuel * L'impact de l'intelligence artificielle sur les décisions * La nécessité de prendre le temps pour construire la confiance + [00:45:00][^6^][6] Construire la confiance dans les organisations * La cohérence entre les paroles, les actes et les sentiments * L'importance de l'efficacité et de l'information honnête * La préservation des biens communs et l'anticipation des problèmes à long terme

    1. Reviewer #2 (Public review):

      Summary:

      In this manuscript, Wang et al analyze ~17,000 transcriptomes from 35 human tissues from the GTEx database and address transcriptomic variations due to age and sex. They identified both gene expression changes as well as alternative splicing events that differ among sexes. Using breakpoint analysis, the authors find sex dimorphic shifts begin with declining sex hormone levels with males being affected more than females. This is an important pan-tissue transcriptomic study exploring age and sex-dependent changes although not the first one.

      Strengths:

      (1) The authors use sophisticated modeling and statistics for differential, correlational, and predictive analysis.

      (2) The authors consider important variables such as genetic background, ethnicity, sampling bias, sample sizes, detected genes, etc.

      (3) This is likely the first study to evaluate alternative splicing changes with age and sex at a pan-tissue scale.

      (4) Sex dimorphism with age is an important topic and is thoroughly analyzed in this study.

      Weaknesses:

      (1) The findings have not been independently validated in a separate cohort or through experiments. Only selective splicing factor regulation has been verified in other studies.

      (2) It seems the authors have not considered PMI or manner of death as a variable in their analysis.

      (3) The manuscript is very dense and sometimes difficult to follow due to many different types of analyses and correlations.

      (4) Short-read data can detect and quantify alternative splicing events with only moderate confidence and therefore the generalizability of these findings remains to be experimentally validated.

    2. Reviewer #3 (Public review):

      Summary:

      In this study, Wang et al utilized the available GTEx data to compile a comprehensive analysis that attempt to reveal aging-related sex-dimorphic gene expression as well as alternative splicing changes in humans.

      The key conclusions based on their analysis are that

      (1) extensive sex-dimorphisms during aging with distinct patterns of change in gene expression and alternative splicing (AS), and

      (2) the male-biased age-associated AS events have a stronger association with Alzheimer's disease, and

      (3) the female-biased events are often regulated by several sex-biased splicing factors that may be controlled by estrogen receptors. They further performed break-point analysis and revealed that in males there are two main breakpoints around ages 35 and 50, while in females, there is only one breakpoint at 45.

      Strengths:

      This study sets an ambitious goal, leveraging the extensive GTEx dataset to investigate aging-related, sex-dimorphic gene expression and alternative splicing changes in humans. The research addresses a significant question, as our understanding of sex-dimorphic gene expression in the context of human aging is still in its early stages. Advancing our knowledge of these molecular changes is vital for identifying therapeutic targets for age-related diseases and extending the human health span. The study is highly comprehensive, and the authors are commendable for their attempted thorough analysis of both gene expression and alternative splicing - an area often overlooked in similar studies.

      Weaknesses:

      Due to the inherent noise within the GTEx dataset - which includes numerous variables beyond aging and sex - there are significant technical concerns surrounding this study. Additionally, the lack of cross-validation with independent, existing data raises questions about whether the observed gene expression changes genuinely reflect those associated with human aging. For instance, the break-point analysis in this study identifies two major breakpoints in males around ages 35 and 50, and one breakpoint in females at age 45; however, these findings contradict a recent multi-omics longitudinal study involving 108 participants aged 25 to 75 years, where breakpoint at 44 and 60 years was observed in both male and females (Shen et al, 2024). These issues cast doubt on the robustness of the study's conclusions. Specific concerns are outlined below:

      (1) The primary method used in this study is linear regression, incorporating age, sex, and age-by-sex interactions as covariates, alongside other confounding factors (such as ethnicity) as unknown variables. However, the analysis overlooks two critical known variables in the GTEx dataset: time of death (TOD) and postmortem interval (PMI). Both TOD and PMI are recorded for each sample and account for substantial variance in gene expression profiles. A recent study by Wucher et al.(Wucher et al, 2023) demonstrated the powerful impact of TOD on gene expression by using it to reconstruct human circadian and even circannual datasets. Similarly, Ferreira et al. (Ferreira et al, 2018) highlighted PMI's influence on gene expression patterns. Without properly adjusting for these two variables, confidence in the study's conclusions remains limited at best.

      (2) To demonstrate that their analysis is robust and that the covariates TOD and PMI are otherwise negligible - the authors should cross-validate their findings with independent datasets to confirm that the identified gene expression changes are reproducible for some tissues. For instance, the recent study by Shen et al. (Shen et al., 2024) in Nature Aging offers an excellent dataset for cross-validation, particularly for blood samples. Comparing the GTEx-derived results with this longitudinal transcriptome dataset would enable verification of gene expression changes at both the individual gene and pathway levels. Without such validation, confidence in the study's conclusions remains limited.

      (3) As a demonstration of the lack of such validation, in the Shen et al. study (Shen et al., 2024), breakpoints at 44 and 60 years were observed in both males and females, while this study identifies two major breakpoints in males around ages 35 and 50, and one breakpoint in females at age 45. What caused this discrepancy?

      (4) Although the alternative splicing analysis is intriguing, the authors did not differentiate between splicing events that alter the protein-coding sequence and those that do not. Many splicing changes occurring in the 5' UTR and 3' UTR regions do not impact protein coding, so it is essential to filter these out and focus specifically on alternative splicing events that can modify protein-coding sequences.

      (5) One of the study's main conclusions - that "male-biased age-associated AS events have a stronger association with Alzheimer's disease" - is not supported by the data presented in Figure 4A, which shows an association with "regulation of amyloid precursor formation" only in female, not male, alternative splicing genes. Additionally, the gene ontology term "Alzheimer's disease" is absent from the unbiased GO analysis in Figure S6. These discrepancies suggest that the focus on Alzheimer's disease may reflect selective data interpretation rather than results driven by an unbiased analysis.

      (6) The experimental data presented in Figures 5E - I merely demonstrate that estrogen receptor regulates the expression of two splicing factors, SRSF1 and SRSF7, in an estradiol-dependent manner. However, this finding does not support the notion that this regulation actually contributes to sex-dimorphic alternative splicing changes during human aging. Notably, the authors do not provide evidence that SRSF1 and SRSF7 expression changes actually occur in a sex-dependent manner with human aging (in a manner similar to TIA1). As such, this experimental dataset is disconnected from the main focus of the study and does not substantiate the conclusions on sex-dimorphic splicing during human aging. The authors performed RNA-seq in wild-type and ER mutant cells, and they should perform a comprehensive analysis of ER-dependent alternative splicing and compare the results with the GTEx data. It should be straightforward.

      References:

      Ferreira PG, Muñoz-Aguirre M, Reverter F, Sá Godinho CP, Sousa A, Amadoz A, Sodaei R, Hidalgo MR, Pervouchine D, Carbonell-Caballero J et al (2018) The effects of death and post-mortem cold ischemia on human tissue transcriptomes. Nature Communications 9: 490.

      Shen X, Wang C, Zhou X, Zhou W, Hornburg D, Wu S, Snyder MP (2024) Nonlinear dynamics of multi-omics profiles during human aging. Nature Aging.

      Wucher V, Sodaei R, Amador R, Irimia M, Guigó R (2023) Day-night and seasonal variation of human gene expression across tissues. PLOS Biology 21: e3001986.

    1. Skin was swabbed using a microfiber cosmetic applicator and approximately 40µL of surfactant. Swabbing was performed by rubbing the surfactant-coated applicator tip in a back-and-forth motion for 45 seconds, applying light pressure, over a 2mm-by-1mm rectangular area on lesional psoriasis or eczema skin.

      What an interesting study, especially the findings relating to patient biomarkers and predicted therapeutic responsiveness.

      A few questions arose for me: 1) for the swabbing procedure, how were the affected areas chosen? Were they on similar body parts/skin types? Did this differ for psoriatic or eczematic skin? This information in the Methods section could be informative for readers. 2) Did you test for intra-patient proteomic consistency by sampling more than once per patient? I'm interested in how intra- and inter-patient heterogeneity might differ from each other, affect clustering, or inform overall conclusions. 3) Similarly, could you compare to swabs/proteomics from non-affected skin areas from the same patients? 4) How do you think disease progression dynamics might also introduce noise in proteomics, i.e. could some of the differences in results be explained by sampling from different timepoints?

    1. Total 71/100

      Original Content 30/30

      Technical 21/30:

      • Use of Required Tools (4/6): Uses ml5.js for image classification and integrates video input. Missing ml5.js script in the HTML file (added in to test) and mechanisms for interactivity are incomplete, such as transitioning from the start page (startPage always equals false/no timer or interaction implemented).
      • Integration of Technology (3/5): The classifier and video feed are set up correctly, and the model is functional. However, the logic for transitioning between states (e.g., from the start page to the content) is missing, which halts the progression of the project.
      • Functionality of Classifier Models (5/5): The classifier seems to work as intended, recognizing objects and labels.
      • Interactivity and Feedback (3/5): The feedback includes playful and narrative-driven text tied to specific labels, which aligns well with the project goals.
      • Complexity of Decision Tree or Flowchart (2/4): The decision logic is linear and lacks branching complexity.
      • Creative Application of Rules (4/5): The narrative is imaginative and cohesive, involving a playful journey to find a notebook.

      Timely Submission 20/40

    1. 5. We cannot truly call on God, the Father of all, if we refuse to treat in a brotherly way any man, created as he is in the image of God. Man's relation to God the Father and his relation to men his brothers are so linked together that Scripture says: "He who does not love does not know God" (1 John 4:8). No foundation therefore remains for any theory or practice that leads to discrimination between man and man or people and people, so far as their human dignity and the rights flowing from it are concerned. The Church reproves, as foreign to the mind of Christ, any discrimination against men or harassment of them because of their race, color, condition of life, or religion. On the contrary, following in the footsteps of the holy Apostles Peter and Paul, this sacred synod ardently implores the Christian faithful to "maintain good fellowship among the nations" (1 Peter 2:12), and, if possible, to live for their part in peace with all men,(14) so that they may truly be sons of the Father who is in heaven.(15)

      Absolutely valid. We are called to treat all men as brothers and sisters, irrespective of any outward trait (and in almost all other cases, even irrespective of whether they treat us with contempt or enmity!)

    1. Briefing Doc : La Santé des Enfants dans les Territoires Franciliens

      Source : La santé des enfants dans les territoires franciliens : Décryptage des indicateurs en Île-de-France. ORS Île-de-France, 2024.

      Thèmes principaux :

      • Lien entre la pauvreté et la santé des enfants en Île-de-France.
      • Disparités territoriales de santé infantile à l'échelle communale et intercommunale.
      • Analyse des indicateurs clés : natalité, mortalité, morbidité et recours aux soins.
      • Offre de soins en santé infantile : médecins généralistes, pédiatres, PMI, santé mentale.

      Idées et faits importants :

      Pauvreté et inégalités :

      • Un enfant sur cinq de moins de 11 ans vit en situation de pauvreté en France. (Introduction)
      • La pauvreté a un impact direct sur les déterminants de santé tels que les conditions de vie, l'habitat et les habitudes alimentaires. (Introduction)
      • Fortes disparités interdépartementales du taux de pauvreté des enfants en Île-de-France (35,9 % en Seine-Saint-Denis contre 14,2 % dans les Yvelines). (Chapitre 2)

      Données démographiques :

      • Recul de la population des moins de 5 ans en Île-de-France (-10% en dix ans). (Chapitre 3)
      • Natalité plus élevée dans les communes les plus pauvres. (Chapitre 3)
      • "Plus une commune a un bas niveau de revenu, plus important est son taux de natalité." (Chapitre 3)

      Indicateurs de mortalité :

      • Augmentation progressive de la mortalité infantile en Île-de-France depuis 2014. (Chapitre 4)
      • Disparités territoriales frappantes : la mortalité infantile est plus importante en Seine-Saint-Denis. (Chapitre 4)
      • "Le risque de surmortalité infantile est multiplié par quatre dans les intercommunalités les plus pauvres." (Chapitre 4)
      • La mort inattendue du nourrisson (MIN) reste un enjeu de santé publique majeur. (Chapitre 4)

      Morbidité et recours aux soins :

      • Prématurité plus fréquente dans les communes pauvres. (Chapitre 5)
      • Prévalence des troubles psychiatriques et de l'autisme en augmentation. (Chapitre 7)
      • "Moins de recours aux soins de santé mentale dans les communes moins favorisées." (Chapitre 7)
      • Offre de soins en médecine de ville plus importante à Paris et sa proche couronne, mais souvent avec dépassements d'honoraires. (Chapitre 6)
      • Recul de l'activité en service de protection maternelle et infantile (PMI). (Chapitre 6)
      • "On se demande si, trop sollicités pour des actions de suivi de pathologies chroniques, voire pour des actes curatifs, non assurés par la médecine de ville, ou pour des missions de protection de l’enfance, certains professionnels de PMI ne sont pas obligés de relayer les actes préventifs et de promotion de la santé au second plan." (Chapitre 8)

      Recommandations :

      • Renforcer les actions de prévention et de promotion de la santé dès le plus jeune âge, en ciblant les territoires les plus défavorisés.
      • Améliorer l'accès aux soins de santé mentale pour les enfants, en particulier dans les zones où l'offre est insuffisante.
      • Soutenir les services de PMI et leur permettre de se recentrer sur leurs missions de prévention.
      • Mettre en place un système de suivi plus précis de la santé des enfants à une échelle territoriale fine.

      Conclusion :

      Ce rapport met en lumière les inégalités de santé qui touchent les enfants en Île-de-France.

      Il souligne l'importance de prendre en compte les déterminants sociaux de la santé et de renforcer les actions de prévention et d'accès aux soins dans les territoires les plus défavorisés.

      La mise en place d'un système de suivi plus précis et l'amélioration de la disponibilité des données à une échelle fine sont nécessaires pour mieux orienter les politiques publiques en faveur de la santé des enfants.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Drawbacks: -While the population-specific approach is a strength, it also limits the direct applicability of findings to other populations.

      We thank the Reviewer for highlighting this important question. While we acknowledge the mentioned limitation, we would like to emphasize the benefits of adopting a population-specific approach, especially given that human gut microbiome diversity remains underexplored in many populations worldwide. Researching the Estonian population microbiome, we contribute to the broader global collection of gut microbial species, helping to address this gap.

      Moreover, new microbial species and strains identified in the Estonian population may be relevant for populations with similar environmental and lifestyle factors, such as the Finnish, Baltic, and Nordic populations. These findings can enhance understanding of regionally relevant microbiome characteristics and may serve as a useful reference for studies in these related populations. As more population-based microbiome research is published, it will build a valuable resource for cross-population comparative studies, shedding light on global microbiome diversity and its implications for health.

      Lastly, as part of the Estonian Biobank, our primary objective is to advance personalized medicine for the Estonian population. This requires a highly accurate reference for our specific population. We believe our approach not only benefits Estonian healthcare but also provides insights and methodologies that other population biobanks may find valuable as they embark on similar paths toward personalized medicine.

      -The study primarily focuses on taxonomic composition at the genus or species level, but a more in-depth functional analysis of the novel species could provide additional insights.

      We thank the Reviewer for this valuable addition. Functional analysis plays a crucial role in understanding the mechanisms that link the microbiome to human health, making it an essential. This becomes even more critical when studying newly discovered species. However, before embarking on functional analysis, we believe it is important to emphasize that, while high-quality metagenome-assembled genomes (MAGs) provide valuable insights, they do not fully represent the genomic completeness and accuracy of genomes reconstructed from pure bacterial cultures. Acknowledging this distinction was one of the reasons we decided not to include functional analysis in the original article. With these considerations in mind, we research a strain structure of four known species of Butyricimonas genus. While the primary interest lies in species associated with diseases, this particular species lacks a substantial number of high-quality MAGs. To gain deeper insights, we prioritized including a new species within the analyzed genus to perform a comparative analysis between the new species and a well-defined strain of a known species, creating a more comprehensive understanding. Among the 758 different genera present in our MAG collection, we selected the Butyricimonas genus for the following reasons: (1) it is a well-described genus of gut bacteria, represented by 300 high-quality MAGs in our dataset (2) it contains four known species along with two newly identified species clusters, and (3) the newly discovered species were shown to be prevalent in the human gut microbiome, being detected in more than 50% of samples through mapping.

      The following section was integrated in the new paragraph “Genome level analysis of species of interest” on page 6 in the revised version of the manuscript:

      “Species-level association studies can help identify candidates for genome-level analysis by exploring strain structure and functional differences. However, such analyses require a large number of high-quality MAGs from the same species, which is only feasible within large cohorts with deep sequencing data. While we currently need more samples to obtain sufficient MAGs for the new disease-associated species, we perform an analysis with the Butyricimonas genus species as an example. We show that the assembled MAGs of Butyricimonas species such as B. faeciominis, B. virosa, B. paravirosa and B. faecalis make up different strains (Figure 4a, Figure 4b, Supplementary results, Supplementary Table S5). After selecting a strain representative, we conducted a pan-genome analysis of species and strain-representative MAGs, including the two new species. The analysis revealed unique gene clusters consistently present in the new species but absent in all other analyzed species and strains (Figure 4c, Supplementary results, Supplementary Table S6).

      Figure 4. Strain-level structure of the Butyricimonas genus and comparative functional analysis of new species and known species strain. a. The strain structure of known Butyricimonas species assembled in the Estonian population - B. paravirosa, B. faecalis, B. virosa, and B. faecihominis (based on ANI index comparison). __b. __Butyricimonas genus structure. Comparisons include all known species from Butyricimonas genus (species assembled in Estonian population and publically available species) and all 4 newly assembled MAGs belonged to a new species. Publicly available Butyricimonas species - B. synergistica, "Candidatus B. faecavium", "Candidatus B. hominis", "Candidatus B. phoceensis", and "Candidatus B. vaginalis"—are each represented by a single genome of the type strain (the strain defining the species according to ISCP). Species assembled from our data are represented by both the type strain and all strain-representative MAGs. ANI values less than 95% (represent that MAGs belonged to different species) are not coloured, 95–100% ANI colored in different colors with 1% step. c. Pan-genome analysis of Butyricimonas genus. The analysis included the same genomes and MAGs as the analysis of the Butyricimonas genus structure and showed a core gene, as well as specific gene, set for the species. The two new species clusters (highlighted in green) also exhibit unique species-specific gene sets.

      We have also added Supplementary Results to our paper, providing a more detailed description of the strain structure analysis of Butyricimonas species and the functional analysis of both known and new species. We chose not to include this in the main text to avoid shifting the focus of the paper.

      Supplementary results

      Butyricimonas genus species strain-level and functional analysis

      Beyond taxonomic characterisation, it is crucial to understand the functional differences of newly detected species, as this insight is key to fully understanding the mechanisms that link the microbiome to human health. Reconstructing MAGs from a large cohort provides multiple genomes of the same species, particularly for prevalent species. During our study, we assembled MAGs from 758 different genera, including 358 genera with more than 10 extracted MAGs. Conducting a detailed in-depth strain-level and functional analysis of all these genera requires substantial effort. Therefore, we conduct an in-depth strain-level and functional analysis using the genus Butyricimonas as an example, because. The genus Butyricimonas was chosen for the following reasons: (1) it is a well-characterized genus of gut bacteria, represented by 300 high-quality MAGs in our dataset (2) it included four known species and two newly identified species clusters, and (3) the new discovered species have been shown to be prevalent in the human gut microbiome.

      *Known Butyricimonas species exhibit a clear strain-level structure based on pairwise ANI comparisons (ANI > 99.0), as calculated using ANIclustermap19 (Figure 4a). From a total of 300 high-quality MAGs selected for strain and functional analysis within the Butyricimonas genus, the species Butyricimonas paravirosa is represented by 23 MAGs and forms 5 distinct strain clusters. While one big cluster (cluster_id: B30) includes 7 highly similar genomes with ANI values close to 100%, other clusters (B31, B32, B34) exhibit more genomic diversity, with genomes showing ANI values greater between 99.0% and 99.6%. The final cluster (B33) contains a single MAG, suggesting unique genomic variation. Butyricimonas faecihominis is represented by 65 MAGs and forms 8 distinct strain clusters, exhibiting high genome similarity within each cluster. Butyricimonas virosa is represented by 67 MAGs and forms 14 distinct strain clusters. These strain clusters can be divided into two strain cluster groups, with low similarity between the groups (ANI values between strain cluster groups ranging from 95.0% to 96% and approaching the species boundary). Within each group, the strain clusters also exhibit genomic diversity, indicating a substantial level of variation even within closely related strains. Finally, Butyricimonas faecalis has the highest number of MAGs within its species 141 MAGs and shows a clean picture of 5 strain clusters with high similarity within the strain cluster (Figure SR1). *

      Figure SR1. The strain structure of known Butyricimonas species assembled in the Estonian population - B. paravirosa, B. faecalis, B. virosa, and B. faecihominis (ANI index comparison histogram).

      In addition to the four known species, we assembled two new species within the Butyricimonas genus. The first new species cluster (id: Bn1) is represented by a single MAG (H0366_Butyricimonas_undS), which serves as the representative genome for this species. The second new species cluster (id: Bn2) comprises three MAGs, with H1068_Butyricimonas_undS designated as the representative genome, selected using dRep. To determine the placement of these new species within the genus, we conducted genome pairwise comparisons based on the Average Nucleotide Identity (ANI) index between the MAGs of the new species and other species within the Butyricimonas genus. For the known species identified in our population, we selected representative genomes for each strain. These comparisons were made between the all new species MAGs, strain-level representative MAGs of four known species, and type strain genomes (the strain that defines the species according to ISCP) from other species of the Butyricimonas genus that were not present in our cohort,, such as Butyricimonas synergistica, "Candidatus Butyricimonas faecavium", "Candidatus Butyricimonas hominis", "Candidatus Butyricimonas phoceensis", and "Candidatus Butyricimonas vaginalis" (Figure 4b). The MAGs from the second new species cluster (Bn2) form a distinct and cohesive group, showing a closer relationship to Butyricimonas paravirosa and Butyricimonas faecihominis. In contrast, the first new species (Bn1), represented by a single MAG, is positioned closer to Butyricimonas virosa. Interestingly, while the ANI index between the type strain of Butyricimonas virosa and the Bn1 MAG is less than 95%, certain strains of B. virosa (e.g., strains 3, 6, 7, 9, 10, and 12) show ANI values slightly above 95%, which technically classifies them as the same species.

      To explore functional differences between new species clusters and other known species we perform pangenomic analysis using the analysis and visualization platform for ‘omics data (Anvi’o) workflow for microbial pangenomics20__. As the first new species cluster (id:Bn1) is represented by a single MAG, despite it containing unique genes not found in any other analyzed genomes, it is challenging to draw definitive conclusions. Another new species cluster (id:Bn2) consisting of three MAGs provides clearer insights. All three MAGs within this new species cluster share 183 unique genes that are consistently present across the species cluster but absent in all other analyzed species and strains. (Figure 4c). The majority of these genes (142 genes, 73.96%) have unknown functions. Among the genes with defined functions, the functions are distributed across various COG categories (__Suppl. Table S5,____Suppl. Figure SR2), with the top three categories being “Cell wall/membrane/envelope biogenesis”, “General function prediction only”, and “Posttranslational modification, protein turnover, and chaperones”.

      Figure SR2. COG categories for 183 unique genes that are consistently present across the new species MAGs from Butyricimonas genus (cluster id:Bn2) but absent in all other analyzed species and strains.

      Undoubtedly, further research is needed to understand the role of newly identified species in the human microbiome and to determine whether strain-level differences influence bacterial interactions with the gut and their overall impact. However, our current analysis has already significantly expanded our knowledge of the diversity within this genus. It has added two new species to the ten previously described and revealed the strain structure of known species within the Estonian population.

      -Is it possible for this large dataset to distill information and have plots for strain diversity of abundant and prevalent species, including low abundance species per donor or between donors? Can authors add such a plot or discuss this?

      We thank the Reviewer for this insightful question. Strain-level analysis holds significant potential and is one of the key reasons to use the genome assembly approach, rather than relying on microbiome community profiling using existing human gut species databases. To demonstrate how this can be applied in large datasets like ours, we focused on the same Butyricimonas genus selected for functional analysis. We believe that combining both strain-level and functional analyses provides a more comprehensive understanding when used together.

      The following section has been incorporated into a new paragraph, “Genome-Level Analysis of Species of Interest,” on page 6 of the revised manuscript, and in-depth analysis has been included in the Supplementary Results. As this section has already been cited in a previous response (due to its logical connection with the functional analysis of the new species), we will not cite it again here. Please refer to the previous answer for further details.

      -While associations between microbes and diseases were found, the study design cannot establish causal relationships. Are the authors planning to test some of the associations experimentally and see whether these observations work in vitro or in vivo?

      We agree that elaboration of causal relationships is crucial. However, this was beyond the scope of the current study, which is intended as a foundational step for future investigations. However, the samples are stored in the Estonian Biobank in a way that allows culturomic studies and follow-up experiments as done by Krigul et al [1].

      Krigul KL, Feeney RH, Wongkuna S, Aasmets O, Holmberg SM, Andreson R, Puértolas-Balint F, Pantiukh K, Sootak L, Org T, Tenson T, Org E, Schroeder BO. A history of repeated antibiotic usage leads to microbiota-dependent mucus defects. Gut Microbes. 2024 Jan-Dec;16(1):2377570. doi: 10.1080/19490976.2024.2377570.

      Minor comments:

      • The authors could provide more context on how their findings compare to similar studies in other populations. What are the differences and similarities, and how does this work at the next level and set new directions?

      We thank the Reviewer for this suggestion. We provided a summary of other population cohorts in the Introduction (Lines 79–90). Since MAG recovery from large cohorts is a relatively new approach, there are limited opportunities for direct comparison. However, we did note a decreasing number of newly recovered species in our study compared to previous studies (Lines 274–290).

      • Figures' quality and readability can be improved easily; all of them are low resolution, and the axes are hardly visible, particularly Figure 2, which could benefit from additional labeling or explanations in the legend to improve clarity.

      We apologize for the quality issues with the figures. We completely revised Figure 2 to improve clarity and placed a new higher-resolution version of Figure 2 to improve readability, ensuring that axes and details are clearly visible.

      Summary of performed changes: (1) we introduced a new Figure 2a to showcase the phylogenetic diversity of the recovered species and highlight the position of the newly assembled species identified for the first time in this study (2) We have updated Figure 2b. In the initial figure, a single line was presented. However, to enhance the visualization and emphasize the trend, five lines were subsequently plotted by altering the order of the samples. Since the order of the samples is not significant, this modification allows for a clearer representation of the overall trend of accumulation of the new species (3) we added new Figure 2c, to address the question about the range of diversity of detected species (4) we moved Figure 2a and 2d to Supplementary Figures to enhance clarity and relevance (Figure S4 and Figure S6 respectively).

      “Figure 2. Overview of species from the EstMB MAG collection a. Phylogenetic tree of the Estonian species representative MAGs. The inner circle displays a phylogenetic tree of species cluster representative MAGs, with branches colored according to their assigned phylum in the Genome Taxonomy Database (GTDB) (see color text). The surrounding ring highlights MAGs that represent novel species assembled in the current study, using the same colors as in the inner circle to indicate the phylum to which each new species belongs (see color text). b. The relationship between the number of samples analyzed and the cumulative number of new species identified c. Distribution of number of species detected by mapping per sample “species hits” (yellow color violinplot) and number of recovered MAGs per sample (blue color violinplot) from Estonian representative MAGs number. d. Number of recovered species (blue color dots) and species detected by mapping the reads against the EstMB MAG collection (yellow color dots) for each sample. Samples are sorted from those with the highest to the lowest number of recovered MAGs e. __The prevalence and number of recovered MAGs per species. The top 10 species with the highest number of recovered MAGs are shown. Blue bars represent the number of samples where MAG of the species were recovered, while gray bars show the species prevalence in EstMB __f. The prevalence and number of recovered MAGs per new species. The top 10 new species with the highest number of recovered MAGs are shown. Green bars represent the number of samples where MAG of the new species were recovered, while gray bars show the new species prevalence.”

      -A brief discussion on the potential clinical implications of the new species-disease associations would enhance the relevance. Why discovering new species are in testing and relevant for the microbiome field? Can authors add this somewhere, discussion?

      We thank the Reviewer for this suggestion. As such, the following section was integrated in the Discussion on page 8 in the revised version of the manuscript:

      “Reconstruction of a new species and new strain is critical for many aspects of personal medicine. We can identify three primary applications of the microbiome in personalized medicine: disease risk assessment and prevention, disease diagnosis, and disease treatment. The latter includes approaches such as microbial supplementation, suppression, or metabolite modulation [Karina Ratiner, 2024]. Both disease prevention and diagnosis rely on identifying bacterial biomarkers associated with prevalent or incident disease cases. In our study, an average of 4% of reads belonged to the newly identified species, with a maximum of 34.76%, demonstrating that excluding this species would lead to a significant loss of community diversity. This omission could potentially exclude biomarkers critical for disease prediction and diagnosis. Notably, one-third of the associations between bacterial species and diseases in our analysis involved the newly identified species, further emphasizing its potential importance as a biomarker. For disease treatment, it is crucial to understand the complete microbial diversity to distinguish between beneficial and harmful species. Equally important is knowing the genomic structure of species and strains to develop effective strategies for microbiome modulation. Without genome assembly, we are limited to assumptions based on previously described genomes of related bacteria. However, given the substantial genomic diversity within species, such assumptions may be highly inaccurate, underscoring the importance of genome assembly in advancing microbiome-based interventions.”

      • In lines 265-266, the authors discuss detected species per sample, on average, 389 species. Can the authors guide which plot is linked to it and whether it is possible to show the disturbing median number of species per sample to get an overall idea about the range of diversity this type of analysis can capture now? Maybe this will improve in the future; it is worth mentioning here.

      We thank the Reviewer for highlighting the need for the clarification. Original Figure 2c displayed the number of species detected through mapping (species hits) and the number of assembled MAGs for each individual sample. To provide a broader characterization of the distribution, we calculated the minimum, mean, median, and maximum values across all samples. As such, the __new Figure 2c __and the following section was integrated in the paragraph “Estimation of species prevalence using population-specific reference” on page 5 in the revised version of the manuscript:

      “Distribution of the number of species detected by mapping per sample exhibits a wide range of values, with a maximum of 842 and a minimum of 7, while the mean and median are 399 and 405, respectively. The distribution of numbers of recovered MAGs per sample shows a narrower range, with a maximum of 155 and a minimum of 1, alongside a mean of 45 and a median of 41 (Figure 2c).”

      Figure 2c.* Distribution of number of species detected by mapping per sample “species hits” (yellow color violinplot) and number of recovered MAGs per sample (blue color violinplot). *

      Other comments:

      -The key conclusions are generally convincing. The authors have successfully assembled a large number of MAGs from the Estonian population, identified potentially novel species, and established associations between microbial abundance and diseases.

      We appreciate the Reviewer's positive feedback on our findings. We are pleased that the significance of our MAG assembly, novel species identification, and disease associations is well-received.

      -The data presented appear to support the claims well. However, the authors should emphasize and clarify that the disease associations are correlational, not causal, and further validation is required.

      We agree that this is an important point to emphasize. We revised the manuscript to clarify that the disease associations are correlational and emphasize the need for further validation by adding the following section in Discussion on page 8 in the revised version of the manuscript:

      “While association does not imply causation, analyzing the association between bacterial species and diseases is a crucial first step in identifying potential biomarkers. This can be followed by meta-analyses across different cohorts and laboratory experiments to validate and confirm the observed effects.”

      -Even though I am not an expert in metagenomics analysis, the current experimental design and analysis are sound to support the main claims.

      We thank the Reviewer for recognizing the robustness of our experimental design and analysis.

      -The methods section can be improved by providing more details about how samples were collected and stored and how long after storage gDNA was extracted and processed for sequencing, allowing for reproducibility. The authors provide information on the bioinformatics pipelines, including software versions and parameters, but this can again be improved by adding details about the steps between sample processing and raw data processing.

      We thank the Reviewer for this suggestion and we agree that this is important information. All these details were thoroughly described in our previous paper, which focuses on our cohort description (Aasmets, O., Krigul, K.L., Lüll, K., Metspalu, A., and Org, E. (2022). Gut metagenome associations with extensive digital health data in a volunteer-based Estonian microbiome cohort. Nat. Commun. 13, 869.

      https://doi.org/10.1038/s41467-022-28464-9).

      However, to improve accessibility of this information, the following paragraph was integrated in the Methods on page 17 in the revised version of the manuscript:

      “Microbiome sample collection and DNA extraction

      The participants collected a fresh stool sample immediately after defecation with a sterile Pasteur pipette and placed it inside a polypropylene conical 15 mL tube. The participants were instructed to time their sample collection as close as possible to the visiting time in the study centre The samples were stored at −80 °C until DNA extraction. The median time between sampling and arrival at the freezer in the core facility was 3 h 25 min (mean 4 h 34 min) and the transport time wasn’t significantly associated with alpha (Spearman correlation, p-value 0.949 for observed richness and 0.464 for Shannon index) nor beta diversity (p-value 0.061, R-squared 0.0005). Microbial DNA extraction was performed after all samples were collected using a QIAamp DNA Stool Mini Kit (Qiagen, Germany). For the extraction, approximately 200 mg of stool was used as a starting material for the DNA extraction kit, according to the manufacturer’s instructions. DNA was quantified from all samples using a Qubit 2.0 Fluorometer with a dsDNA Assay Kit (Thermo Fisher Scientific).”

      -The study includes a large cohort (1,878 samples), which provides statistical power. The statistical analyses, including linear regression models adjusted for BMI, gender, and age, seem appropriate for the type of data presented. I suggest adding a separate paragraph about how the data is processed and statistically analyzed.

      Authors should include:

      • Appropriateness of the statistical tests used for the data types and experimental designs

      • Adequate description and justification of the statistical models and test and assumptions

      • Proper handling of replicates, controls, and data normalization

      • Reporting of effect sizes, sample size, confidence intervals, and statistical power

      • Data processing and analysis workflows.

      We thank the Reviewer for this recommendation. To highlight the statistical analysis carried out, we have made a separate paragraph for statistical analysis under the Methods section (lines 617-628). We note that we have previously described data processing and normalization. This study has an exploratory nature. Hence, the power calculations are not applicable, but this study can be an input for the power calculations of future studies testing statistical hypotheses. However, we agree that the sample sizes for each phenotype and beta estimation would support our results. We have now added them to __Table 1_. _ __

      Reviewer #1 (Significance (Required)):


      -This study represents an advance in the context of population-specific studies. Creating a comprehensive Estonian population-specific MAG reference and identifying new species contribute to our understanding of microbiome diversity.

      -The work builds upon previous large-scale microbiome projects, such as those that established the Unified Human Gastrointestinal Genome (UHGG) collection but focuses on a specific population.

      -The associations between microbial species (including novel ones) and common diseases provide potential avenues for future research into microbiome-based diagnostics or therapeutics.

      -The findings would interest microbiome researchers, bioinformaticians, and clinicians interested in the role of the gut microbiome in health and disease.

      We thank the Reviewer for the thoughtful feedback and recognition of our study's contributions to microbiome research. By creating an Estonian population-specific MAG reference and identifying new species, we advance population-specific studies and enhance global microbiome diversity. Building on projects like UHGG, we integrate local data into the global context and highlight potential applications in microbiome-based diagnostics and therapeutics. To address your suggestions, we expanded the results section with an example from the Butyricimonas genus. We hope our publicly available data will support future research and further advance understanding of the gut microbiome in health and disease.

      __ Reviewer #2 (Evidence, reproducibility and clarity (Required)):__


      The manuscript by Pantiukh et al. presents the collection of MAGs assembled from the Estonian Biobank, with a specific focus on the novel species clusters the authors defined and found associations with some of the diseases as collected among the samples available in their biobank. The manuscript is well organized. However, it lacks a bit in terms of novelty and also some statements that can mislead the readers to overinterpret some parts.

      Majors

      • The last paragraph of the introduction (lines 91-98) anticipates some results but lacks some methodological details. Please consider whether to move it to the results section or add very brief specifications, like (1) "sequence with deep coverage" is vague, how deep is deep? (2) "84,762 MAGs representing 2,257 species" are the 84k MAGs already quality-controlled? (3) "353 MAGs (15,6%) of the EstMB MAGs collection to represent potentially novel species." 353 are MAGs or species? As species clusters are defined later at 95% ANI, are all these 353 defining their own species clusters?

      We thank the Reviewer for insightful questions and suggestions. To address these points, we have added the following clarifications to the text:

      We specified the depth of coverage for sequences, providing an average reads number per sample - 56 mln reads. (Lines 92). We clarified that among 84,762 assembled MAGs, 42,049 MAGs (49.60 %) were high-quality (HQ) MAGs. (Lines 93-94). We revised the statement about the 353 MAGs, explicitly noting that they represent potentially novel species. Additionally, we clarified that all 2,257 representative MAGs, including these 353 new species MAGs represent separate species clusters based on the 95% ANI threshold mentioned later in the text. (Lines 94-98).

      In the paper, we included only the figure showing the quality group distribution for species cluster representative MAGs to avoid potential confusion between two similar figures: one for all assembled MAGs (n=84,762) and another for cluster representative MAGs (n=2,257). However, in response to this query, we have added a new __Supplementary Figure S1__that illustrates the quality group distribution for all assembled MAGs to provide a more comprehensive view.

      Figure S1. Quality estimation for the assembled MAGs (n=84,762). High-quality MAGs (HQ) – 42,049; Medium-quality MAGs (MQ) – 26,806; Low-quality MAGs (LQ) – 15,907.

      • lines 109 and 265, "11.73 +/- 3.9 Gb data per sample and 56.13 +/- 19.37 million reads per sample", numbers don't match... 11.73 Gbp is about 78M reads at 150nt read length, plus later the average depth is not 56.13 but 53.04, please double check these numbers

      We apologize for any misunderstanding. The numbers mentioned in the paper refer to the number of reads and the file size of each compressed *.fasta.gz file. This file size does not directly represent the total base pairs (Gb) for the current metagenome. Instead, it reflects the disk space occupied by the compressed sequencing data, including additional information such as sequence headers. We selected this parameter to provide an easy point of comparison with file sizes from other metagenome sequencing datasets, as *.fasta.gz is a commonly used format for storing sequence data. To clarify further, here is an example of the relationship between these parameters for one sample:

      Sample XX

      Value

      Meaning

      Program

      Compressed file size

      4.2 GB

      Represents disk space occupied by the compressed sequencing data. This applies to forward reads only; for a rough estimation of the disk space for both forward and reverse reads, it should be multiplied by 2 or calculated separately for both files.

      du -sh V00HXZ.fq1.gz

      The total number of reads

      41,062,933 reads

      (avg. read len = 147.7 bp)

      Represents number of forward reads. This applies to forward reads only; for a rough estimation of both forward and reverse reads, it should be multiplied by 2 or calculated separately for both files.

      seqkit stats V00HXZ.fq1.gz -a -T

      Total base pairs (Gb)

      6,066,493,002 bp (6.07 Gb)

      Represents total base pairs (Gb) for the current sample. This applies to forward reads only; for a rough estimation of both forward and reverse reads, it should be multiplied by 2 or calculated separately for both files.

      seqkit stats V00HXZ.fq1.gz -a -T

      We now realize this may have caused confusion. To address this, we have calculated the total base pairs (Gb) parameter for both forward and reverse reads and exchanged the __Compressed file size __number to __Total base pairs__with following section in the paragraph “Cohort overview and study design” on page 3 in the revised version of the manuscript:

      “The EstMB-deep samples were resequenced at deep coverage, generating an average of 16.49 ± 6.2 Gb of total base pairs per sample, or 56.13 ± 19.37 million paired reads per sample, with an average forward read length of 146.85 bp and an average reverse read length of 147.01 bp.”

      • line 118, "completeness > 90% and contamination We thank Reviewer for this comment, we use CheckM v2 for evaluation MAG completeness and contamination. We have incorporated the requested information into the manuscript. (Lines 128).

      • line 120, "84,762 MAGs were clustered at the species level with an average nucleotide identity (ANI) threshold of 95%.", as for my previous comment, either specify the Methods or quickly mention the tool used for the ANI analysis.

      We use dRep with default parameters for clustering. We have incorporated the requested information into the manuscript. (Lines 130).

      • lines 135-138, "The bacterial species most represented in our MAGs collection were Odoribacter splanchnicus (MAG recovered from 70.93% samples), Barnesiella intestinihominis (62.83%), Parabacteroides distasonis (60,38%), Alistipes putredinis (54,53%) and Agathobacter rectalis (51.92%) (Figure S2, Table S2).", it will be interesting to compare (some of) these speceis with other populations, to see if these species are globally prevalent in the human gut microbiome or specific to the Estonian population.

      We thank the Reviewer for this question. As highlighted in Figures 4e and 2d, the number of MAGs recovered for a given species often differs significantly from its prevalence in the population. Due to the complexities of MAG assembly, species prevalence is generally much higher, and these values do not correlate linearly, as shown in Supplementary Figure S5. Keeping in mind that species with the higher number of assembled MAGs are not the same as species with the higher prevalence, we compared our top assembled species with the most comprehensive up to date USGG collection of gut bacteria and integrated the following section in the paragraph “Population-specific Metagenome-Assembled Genomes (MAGs) reference” on page 4 in the revised version of the manuscript:

      “... All these species are also well-represented in other cohorts. For example, Parabacteroides distasonis, Alistipes putredinis, and Agathobacter rectalis rank among the top 6 species in the USGG by the number of genomes. Additionally, Barnesiella intestinihominis and Odoribacter splanchnicus rank among the top 40 species out of a total of 4,644 species in the USGG database.”

      • lines 143-144, "MAGs, 353 MAGs (15,64%) represent a new species according to the GTDB criteria.", these 353 MAGs might define fewer species clusters, I think the 'species' word in this sentence is misleading and can lead to an overinterpretation of the diversity, it will be more correct to report how many species clusters these MAGs defined.

      We apologize for not providing sufficient clarification. In our case each cluster represented a new distinct species. We added clarification in lines 152-153.

      • lines 163-168, the paragraph could be an overinterpretation, as it is unlikely that there is 'infinite' diversity, so it could be that by doubling the samples, there is already a plateau in terms of novel species clusters identified. I think this paragraph should be reconsidered.

      We thank the Reviewer for this question. We have updated Figure 2b. Instead of presenting a single version of the cumulative sum of new species discoveries, we reordered the samples five times to provide a more accurate approximation of new species accumulation as the number of samples increases. Additionally, we integrated the following section in the paragraph “Novel species and comparison of the population-specific reference with global reference UHGG” on page 4 in the revised version of the manuscript:

      “Our analysis so far shows a clear linear trend without indication of a plateau (although we can not exclude that plateau had been reached exactly at current sample size, which may not yet be evident).”

      __Figure 4b. __The relationship between the number of samples analyzed and the cumulative number of new species identified.

      • lines 182-184, "Even species which have been recovered from a large number of samples can be found in significantly more samples after mapping (Figure 2e, Table S2).", this is not novel as assembly requires higher coverage than calling a species present via mapping, please, rephrase this part.

      We thank the Reviewer for this thoughtful suggestion. We included this point in the article not because of its novelty but to emphasize that even a small number of recovered MAGs per sample can still hold significant value. This is because despite a small number of assembled genomes, the same species prevalence, as detected through mapping, can still be substantial which makes it possible to use them for, for example, association study. We added this perspective based on our personal experience of initial disappointment with the small number of MAGs recovered for many new species clusters. Our intention is to prevent similar discouragement among other researchers who may begin recovering MAGs from their large population cohorts.

      • lines 185-188, "which are usually extracted from a small number of samples, 185 show a prevalence exceeding 80% for some species. For example, Bacteroides faecalis has a prevalence of 97.23%, although only 1 MAG was assembled, and Bacteroides intestinigallinarum has a prevalence of 95.85% although only 2 MAGs were assembled.", this should be much better contextualized and discussed in terms of relative abundance and not only on the ability to reconstruct (which is highly impacted by coverage, which is a proxy for abundance) with its prevalence, it is known in the field that there are very highly prevalent species at very low abundance values, which are not that often reconstructed via metagenomic assembly.

      We agree that understanding the causes of assembly complications is important in the field, with abundance playing a key role. Moreover, other factors such as the presence of closely related species with similar genomes or multiple strains of the same species within a sample can significantly impact assembly, even for species with high abundance. However, since this paper focuses on the potential applications of MAG assembly in large population cohorts rather than the technical aspects of assembly, our main goal was to emphasize that MAGs assembled from the samples should not be used to estimate species prevalence.

      • Data availability, it appears that the provided accession number does not exist, please double-check this.

      We apologies about that issue, data now available with provided accession number PRJEB76860:

      Minors

      • line 106, "includes 1,308 women (69.64 %) and 570 men (30.35 %)", these sums up to 99.99%, the ratio for women is 1308/1878=0.69648, so can be rounded up to 69.65%.

      We thank the Reviewer for this correction. We correct numbers from 69.64% to 69.65% (Lines 114).

      • line 293, "ones[Philip Hugenholtz, 2008].", citation to fix.

      Thank you for the correction. We corrected the links. (Lines 414).

      • Fig. 1g, why completeness is up to 25%, from the text it seemed the MAGs were screened for completeness We apologize for not providing sufficient clarification. Indeed, as noted in Lines 124-126, *"We successfully reconstructed 84,762 metagenome-assembled genomes (MAGs), an average of 45 MAGs per sample. Among these, 42,048 according to CheckM, MAGs (49.6%) have completeness > 90% and contamination 90% and contamination 50% and contamination (Lines 131-132).

      • Fig. 2f says "Blue bars represent", but I believe it should be green instead of blue.

      Thank you for the correction. We corrected the color.

      (Lines 520).

    1. Author response:

      The following is the authors’ response to the original reviews.

      Joint Public Review:

      (1) This work investigates numerically the propagation of subthreshold waves in a model neural network that is derived from the C. elegans connectome. Using a scattering formalism and tight-binding description of the network -- approximations which are commonplace in condensed matter physics -- this work attempts to show the relevance of interference phenomena, such as wavenumber-dependent propagation, for the dynamics of subthreshold waves propagating in a network of electrical synapses.

      (2) The primary strength of the work is in trying to use theoretical tools from a far-away corner of fundamental physics to shed light on the properties of a real neural system. While a system composed of neurons and synapses is classical in nature, there are occasions in which interference or localization effects are useful for understanding wave propagation in complex media [review, van Rossum & Nieuwenhuizen, 1999]. However, it is expected that localization effects only have an impact in some parameter regimes and with low phase dissipation. The authors should have addressed the existence of this validity regime in detail prior to assuming that interference effects are important.

      The theoretical concept and tool used in this study are not situated in a far-away corner of fundamental physics but hold one of the central positions in condensed matter physics and statistical physics. In fact, the non-scientific statement about where the theoretical concept and tool employed by the researchers are positioned within the realm of fundamental physics is irrelevant. The fundamental physics governs the foundations of all natural phenomena, and thus it provides indispensable principles for interpreting not only neural systems but also all life phenomena. One such principle explored in our study is the interference and localization of waves.

      Specifically, in the third paragraph of the Introduction, we introduced that the interference effect of subthreshold oscillating waves, beyond being a theoretical possibility, is a phenomenon actually observed in neural tissue (Chiang and Durand, 2023; Gupta et al., 2016). Moreover, according to Devor and Yarom (2002), the propagation of subthreshold oscillations observed in the inferior olivary nucleus extended beyond a distance of 0.2 mm. Therefore, considering the propagation of subthreshold waves and the resulting interference in the connectome of C. elegans, which has a total body length of less than 1 mm, a diameter of about 0.08 mm, and most neurons distributed in the ring structure near its neck, provides sufficient validity for the initiation of theoretical and computational studies.

      The primary objective of our study is to investigate which regimes of signal transmission/localization and interference phenomena are valid within the network of electrical synapses in C. elegans, the only system for which the neural connectome structure is perfectly known. As the Reviewer rightly pointed out in the question, this is exactly the issue that the Reviewer is curious about. Therefore, the existence of this validity regime cannot be addressed prior to conducting the study but can only be identified as a result of performing the research. And we have conducted such a study.

      (3) An additional approximation that was made without adequate justification is the use of a tight-binding Hamiltonian. This can be a reasonable approximation, even for classical waves, in particular in the presence of high-quality-factor resonators, where most of the wave amplitude is concentrated on the nodes of the network, and nodes are coupled evanescently with each other. Neither of these conditions were verified for this study.

      The tight-binding Anderson Hamiltonian we used in this study originally consisted of the on-site energy at each node and the hopping matrix between nodes. When the on-site energy is relatively much more stable (i.e., has a large negative value) compared to the hopping matrix, most of the wave amplitude becomes concentrated on the nodes as the Reviewer mentioned. However, as is well-known from reference papers (Anderson, 1958; Chang et al., 1995; Meir et al., 1989; Shapir et al., 1982; Thomas and Nakanishi, 2016), in this study, we also removed the on-site energy to prevent the waves from being concentrated on the nodes. Therefore, the tight-binding Hamiltonian we used in this study ensures that waves propagate through edges in the network where the values of the hopping matrix exist.

      To assist the Reviewer in better understanding the model used in this study, we provide additional explanations as follows. In the manuscript, we have already provided detailed descriptions of the setup using the tight-binding Anderson Hamiltonian in the Method section under “Construction of our circuit model” and the explanation of Figure 1. In the model we used, the edges represented by solid lines are perfect conductors, while the dotted lines representing gap junctions act as potential barriers (Fig. 1B). Therefore, when electric signals propagate, we are dealing with the phenomenon where signals transmitted through the edges encounter potential barriers, causing scattering or attenuation. The model described by the Reviewer is indeed a commonly used model in condensed matter physics, but we did not use the exact model mentioned by the Reviewer. Instead, as is common in well-known reference papers, we modified it to suit our purposes. We hope this explanation helps the Reviewer gain a better understanding.

      (4) The motivation for this work is to understand the basic mechanisms underlying subthreshold intrinsic oscillations in the inferior olive, but detailed connectivity patterns in this brain area are not available. The connectome is known for C elegans, but sub-threshold oscillations have not been observed there, and the implications of this work for C elegans neuroscience remain unclear. The authors should also give more evidence for the claim that their study may give a mechanism for synchronized rhythmic activity in the mammalian inferior olive nucleus, or refrain from making this conclusion.

      We agree with the Reviewer's point. In this study, we do not provide additional analysis on the mammalian inferior olive nucleus beyond what is already known from previous research. What we intended to discuss in the Discussion section was to suggest that within our model, there is a “possibility” that a group of cells exchanging wave signals of a specific wavenumber with high transmittance may show synchronized rhythmic activity. Therefore, to avoid any misunderstanding for the reader, we have revised the corresponding sentence in the Discussion as follows.

      In the Discussion, “The plausible possibility according to our model study is that the constructive interference of subthreshold membrane potential waves with a specific wavenumber may generate the synchronized rhythmic activation.

      (5) In the same vein, since the work emphasizes the dependence on the wavenumber for the propagation of subthreshold oscillations, they should make an attempt at estimating the wavenumber of subthreshold oscillations in C elegans if they were to exist and be observed. Next, the presence of two "mobility edges" in the transmission coefficient calculated in this work is unmistakably due to the discrete nature of the system, coming from the tight-binding approximation, and it is unclear if this approximation is justified in the current system.

      In this study, we modeled the propagation of subthreshold waves on the electrical synapse network of C. elegans, but we did not explain the generation of subthreshold oscillations themselves. Here, we simply injected wave signals with various wavenumber values into the network using a hypothetical device called an "Injector." As the Reviewer pointed out, estimating the wavenumbers of subthreshold oscillations that may exist or be observed in C. elegans would require a comprehensive investigation of the membrane potential dynamics occurring in the membranes of individual neurons. However, this is beyond the scope of this study and would require considerable effort to accomplish.

      As for the use of the tight-binding Hamiltonian, we have addressed that in our response to the third paragraph in the Joint Public Review above.

      (6) Similarly, it is possible that the wavenumber-dependent transmission observed depends strongly on the addition of a large number of virtual nodes (VNs) in the network, which the authors give little to no motivation for. As these nodes are not present in the C elegans connectome, the authors should explain the motivation for their inclusion in the model and should discuss their consequences on the transmission properties of the network.

      As mentioned in our response to the third paragraph in the Joint Public Review above, in our model, a node is simply a pathway for waves to pass through. Therefore, inserting virtual nodes between two neurons that are connected in the C. elegans connectome does not alter the actual connection structure. In other words, virtual nodes do not create new connections between cells that didn’t exist in the connectome. The virtual nodes we introduced are merely a way to divide the sections—axon, gap junction, dendrite—through which the wave passes when it is transmitted between two neurons. As we have already explained in Fig. 1B, the edge connected by two virtual nodes, represented by a dotted line, is motivated to depict the gap junction acting as a potential barrier. We hope this explanation helps the Reviewer better understand the model used in this study.

      (7) As it stands, the work would only have a very limited impact on the understanding of subthreshold oscillations in the rat or in C elegans. Indeed, the preprint falls short of relating its numerical results to any phenomena which could be observed in the lab.

      In this study, we proposed a minimalistic model built using the currently available but limited C. elegans connectome information. Specifically, our model is not a phenomenological one that adjusts parameters to accurately predict experimental measurements, but rather an attempt at a novel conceptual approach to theoretically possible scenarios. While the model may not be satisfactory enough to explain experimental phenomena at present, it is a theoretical/computational study that someone needs to undertake. We believe this is the path of scientific progress. Therefore, as the Reviewer has expressed concern, it is entirely understandable that reproducing the numerical results measured in actual experiments is difficult in this study. Nevertheless, we believe that this study makes a basic contribution to the conceptual understanding of subthreshold signal propagation in C. elegans’ electric synapses.

      Rather than offering a stretched opinion, we maintain a positive hope that future researchers in this field will improve the model by incorporating more detailed and extensive biological data through follow-up studies, allowing us to get closer to describing real phenomena.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      The word "Sensory" was misspelled in Figures 2, 4 and 5.

      We appreciate the feedback from Reviewer #1. We have corrected the mentioned typos in Figures 2, 4, and 5 of the revised manuscript.

    1. Reviewer #2 (Public review):

      Summary:

      The authors did not find an increased representation of CS+ throughout reinforcement learning in the tuft dendrites of Rbp4-positive neurons from layer 5B of the barrel cortex, as previously reported for soma from layer 2/3 of the visual cortex.<br /> Alternatively, the authors observed an increased selectivity to both stimuli (CS+ and CS-) during reinforcement learning. This feature 1) was not present in repeated exposures (without reinforcement), 2) was not explained by animal's behaviour (choice, licking and whisking) and 3) was long-lasting, being present even when the mice disengaged from the task.<br /> Importantly, increased selectivity was correlated with learning (% correct choices), and neural discriminability between stimuli increased with learning.

      In conclusion, the authors show that tuft dendrites from layer 5B of the barrel cortex increase the representation of conditioned (CS+) and unconditioned stimuli (CS-) applied to the whiskers, during reinforcement learning.

      Strengths:<br /> The results presented are very consistent throughout the entire study, and therefore very convincing:

      (1) The results observed are very similar using two different imaging techniques (using 2-photon -planar imaging- and SCAPE - volumetric imaging). Fig. 3 and Fig.4 respectively.<br /> (2) The results are similar using "different groups" of tuft dendrites for the analysis (e.g. initially unresponsive and responsive pre- and post-learning). Fig. 5.<br /> (3) The results are similar from a specific set of trials (with the same sensory input, but different choices). Fig.7.<br /> (4) Additionally, the selectivity of tuft dendrites from layer 5B of the barrel cortex was higher in the mice that exclusively used the whisker to respond to the stimuli (CS+ and CS-).

      The results presented are controlled against a group of mice that received the same stimuli presentation, except the reinforcement (reward).

      Additionally, the behaviour outputs, such as choice, whisking and licking could not account for the results observed.

      Although there are no causal experiments, the correlation between selectivity and learning (% of correct choices), as well as the increased neural discriminability with learning, but not in repeated exposure, are very convincing.

      Weaknesses:

      The biggest weakness is the absence of causality experiments. Although inhibiting specifically tuft dendritic activity in layer 1 from layer 5 pyramidal neurons is very challenging, tuft dendritic activity in layer 1 could be silenced through optogenetic experiments as in Abs et al. 2018. By manipulating NDNF-positive neurons the authors could specifically modify tuft dendritic activity in the barrel cortex during CS presentations, and test if silencing tuft dendritic activity in layer 1 would lead to the lack of selectivity and an impairment of reinforcement learning. Additionally, this experiment will test if the selectivity observed during reinforcement learning is due to changes in the local network, namely changes in local synaptic connectivity, or solely due to changes in the long-range inputs.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      What neurophysiological changes support the learning of new sensorimotor transformations is a key question in neuroscience. Many studies have attempted to answer this question at the neuronal population level - with varying degrees of success - but few, if any, have studied the change in activity of the apical dendrites of layer 5 cortical neurons. Neurons in layer 5 of the sensory cortex appear to play a key role in sensorimotor transformations, showing important decision and reward-related signals, and being the main source of cortical and subcortical projections from the cortex. In particular, pyramidal track (PT) neurons project directly to subcortical regions related to motor activity, such as the striatum and brainstem, and could initiate rapid motor action in response to given sensory inputs. Additionally, layer 5 cortical neurons have large apical dendrites that extend to layer 1 where different neuromodulatory and long-range inputs converge, providing motor and contextual information that could be used to modulate layer 5 neurons output and/or to establish the synaptic plasticity required for learning a new association. 

      In this study, the authors aimed to test whether the learning of a new sensorimotor transformation could be supported by a change in the evoked response of the apical dendrites of layer 5 neurons in the mouse whisker primary somatosensory cortex. To do this, they performed longitudinal functional calcium imaging of the apical dendrites of layer 5 neurons while mice learned to discriminate between two multi-whisker stimuli. The authors used a simple conditioning task in which one whisker stimulus (upward or backward air pu , CS+) is associated with a reward after a short delay, while the other whisker stimulus (CS-) is not. They found that task learning (measured by the probability of anticipatory licking just after the CS+) was not associated with a significant change in the average population response evoked by the CS+ or the CS-, nor a change in the average population selectivity. However, when considering individual dendritic tufts, they found interesting changes in selectivity, with approximately equal numbers of dendrites becoming more selective for CS+ and dendrites becoming more selective for CS-. 

      One of the major challenges when assessing changes in neural representation during the learning of such Go/NoGo tasks is that the movements and rewards themselves may elicit strong neural responses that may be a confounding factor, that is, inexperienced mice do not lick in response to the CS+, while trained mice do. In this study, the authors addressed this issue in three ways: first, they carefully monitored the orofacial movements of mice and showed that task learning is not associated with changes in evoked whisker movements. Second, they show that whisking or licking evokes very little activity in the dendritic tufts compared to whisker stimuli (CS+ and CS-). Finally, the authors introduced into the design of their task a post-conditioning session after the last conditioning session during which the CS+ and the CS- are presented but no reward is delivered. During this post-session, the mice gradually stopped licking in response to the CS+. A better design might have been to perform the pre-conditioning and post-conditioning sessions in nonwater-restricted, unmotivated mice to completely exclude any lick response, but the fact that the change in selectivity persists after the mice stopped licking in the last blocks of the post-conditioning session (in mice relying only on their whiskers to perform the task) is convincing. 

      The clever task design and careful data analysis provide compelling evidence that learning this whisker discrimination task does not result in a massive change in sensory representation in the apical dendritic tufts of layer 5 neurons in the primary somatosensory cortex on average. Nevertheless, individual dendritic tufts do increase their selectivity for one or the other sensory stimulus, likely enhancing the ability of S1 neurons to accurately discriminate the two stimuli and trigger the appropriate motor response (to lick or not to lick). 

      One limitation of the present study is the lack of evidence for the necessity of the primary somatosensory cortex in the learning and execution of the task. As the authors have strongly emphasized in their previous publications, the primary somatosensory cortex may not be necessary for the learning and execution of simple whisker detection tasks, especially when the stimulus is very salient. Although this new task requires the discrimination between two whisker stimuli, the simplicity and salience of the whisker stimuli used could make this task cortex-independent. Especially when considering that some mice seem to not rely entirely on their whiskers to execute the task. 

      Nevertheless, this is an important result that shows for the first time changes in the selectivity to sensory stimuli at the level of individual apical dendritic tufts in correlation with the learning of a discrimination task. This study sheds new light on the cortical cellular substrates of reward-based learning and opens interesting perspectives for future research in this area. In future studies, it will be important to determine whether the change in selectivity of dendritic calcium spikes is causally involved in the learning of the task or whether it simply correlates with learning, as a consequence of changes in synaptic inputs caused by reward. The dendritic calcium spikes may be involved in the establishment of synaptic plasticity required for learning and impact the output of layer 5 pyramidal neurons to trigger the appropriate motor response. It would be important also to study the changes in selectivity in the apical dendrite of the identified projection neurons.  

      Reviewer #2 (Public Review):

      Summary: 

      The authors did not find an increased representation of CS+ throughout reinforcement learning in the tuft dendrites of Rbp4-positive neurons from layer 5B of the barrel cortex, as previously reported for soma from layer 2/3 of the visual cortex. 

      Alternatively, the authors observed an increased selectivity to both stimuli (CS+ and CS-) during reinforcement learning. This feature: 

      (1) was not present in repeated exposures (without reinforcement), 

      (2) was not explained by the animal's behaviour (choice, licking, and whisking), and 

      (3) was long-lasting, being present even when the mice disengaged from the task. 

      Importantly, increased selectivity was correlated with learning (% correct choices), and neural discriminability between stimuli increased with learning. 

      In conclusion, the authors show that tuft dendrites from layer 5B of the barrel cortex increase the representation of conditioned (CS+) and unconditioned stimuli (CS-) applied to the whiskers, during reinforcement learning. 

      Strengths: 

      The results presented are very consistent throughout the entire study, and therefore very convincing: 

      (1) The results observed are very similar using two different imaging techniques (2-photon planar imaging- and SCAPE-volumetric imaging). Figure 3 and Figure 4 respectively. 

      (2) The results are similar using "different groups" of tuft dendrites for the analysis (e.g.

      initially unresponsive and responsive pre- and post-learning). Figure 5. 

      (3) The results are similar from a specific set of trials (with the same sensory input, but di erent choices). Figure 7. 

      (4) Additionally, the selectivity of tuft dendrites from layer 5B of the barrel cortex was higher in the mice that exclusively used the whisker to respond to the stimuli (CS+ and CS-).  The results presented are controlled against a group of mice that received the same stimuli presentation, except for the reinforcement (reward). 

      Additionally, the behaviour outputs, such as choice, whisking, and licking could not account for the results observed. 

      Although there are no causal experiments, the correlation between selectivity and learning (percentage of correct choices), as well as the increased neural discriminability with learning, but not in repeated exposure, are very convincing. 

      Weaknesses: 

      The biggest weakness is the absence of causality experiments. Although inhibiting specifically tuft dendritic activity in layer 1 from layer 5 pyramidal neurons is very challenging, tuft dendritic activity in layer 1 could be silenced through optogenetic experiments as in Abs et al. 2018. By manipulating NDNF-positive neurons the authors could specifically modify tuft dendritic activity in the barrel cortex during CS presentations, and test if silencing tuft dendritic activity in layer 1 would lead to the lack of selectivity and an impairment of reinforcement learning. Additionally, this experiment will test if the selectivity observed during reinforcement learning is due to changes in the local network, namely changes in local synaptic connectivity, or solely due to changes in the long-range inputs.    

      We agree that such causal manipulations are a logical next step. Such manipulations are unfortunately not specific to layer 5 apicals, so the results would be difficult to interpret. We now discuss the challenge of such manipulations in the Discussion section.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors): 

      Overall, the study is solid and the article is well and clearly written. I have no suggestion for other experiments that would fall within the scope of this article. I would like only to suggest some additional analyses and clarifications in the writing. 

      Additional analyses: 

      Obviously, the main confounding factor in this type of data comes from the acquired motor response which follows - with a short latency - the sensory stimulus. This is particularly problematic for functional calcium imaging which has very low temporal resolution. The authors have addressed this question to some extent by showing that motor-evoked activity does not account for the change in selectivity acquired with learning and through the use of a post-conditioning session during which no reward was delivered. Figures 8C-D show that mice gradually stop licking in response to CS+ in this session and that the distribution of the selectivity index remains similar in these last blocks. Perhaps a more convincing analysis would be to simply select Miss and Correct rejection trials in which mice did not lick in response to the CS+ and CS-, respectively. Ideally, if the number of trials is sufficient, one could even select trials devoid of any evoked movement (no licking and no whisking).  

      We agree it would be interesting to compare Miss and Correct rejection trials to further rule out effects of a motor response, but there were never enough Miss trials to conduct such an analysis. Even in very early learning, there are few Miss trials (see Figure 1, session 2). We found that in early learning, animals would lick in most trials. Then, over the course of conditioning, they would learn to withhold licks during CS- presentation. Thus, we were able to examine Hits, Correct rejections, and False alarms (Figure 7), but not Miss trials. We have added text suggesting a future experiment in which the stimulus strengths are substantially reduced to drastically increase the error rates.

      The fact that changes in selectivity occur in both directions overall is really interesting. However, in the way the data are presented currently, one may wonder about mice/field of view vs single cell effect. i.e., do di erent dendritic tufts in the same field of view show opposite changes in selectivity? If we were to replot Figure 3A for a single mouse, would we obtain the same picture?  

      We appreciate this very good suggestion and have added scatter plots and selectivity index histograms for individual conditioned animals in Supplementary figure 2. These data demonstrate that different dendritic tufts in the same field of view exhibit opposite changes in selectivity.

      The authors point out that they observed no change in the mean response or selectivity during learning, but did find changes in selectivity at the level of individual dendritic tufts. This suggests that, at the population level, the ability to discriminate between the two stimuli should improve. A possible complementary analysis would be to show that the ability to decode stimulus identity from dendritic tuft population activity increases with learning.  

      Given the substantial change in individual tuft selectivity and that the tuft events occur are not rare, the population result is guaranteed. If individual tufts increase selectivity, the population will also increase its selectivity on a trial-by-trial basis. We have nevertheless included a new supplementary figure with a population analysis using SVMs to demonstrate this.

      Clarification: 

      The authors should make it clear from the beginning that mice are still water-restricted during the post-conditioning session and actually do keep licking for many CS+ trials. Therefore, this session is not devoid of motor response. 

      We have clarified this in the text.

      Did mice in the repeated exposure condition receive any reward during the recording sessions? If so when were rewards delivered? 

      We previously described in the Methods that these mice received water in their home cage, but we now additionally clarify this in the Results section.

      Minor: 

      Figure 2Aii, the labels of the Alpha and Betta barrels should be swapped. 

      Fixed

      Line 218: I believe this sentence should read "Using SCAPE microscopy, ...". 

      Corrected.

      Line 665: 'Reconstruction from 50' does that refer to the single cell reconstruction on the left panel? 

      Yes – Clarified in legend

      Reviewer #2 (Recommendations For The Authors): 

      Minor suggestions: 

      The 'summary' should mention from which brain area the results were acquired. Otherwise, it is misleading, giving the idea that the results described a generic feature, which is still unknown.  

      Added to the text.

      Please correct sentence 219: "SCAPE microscopy, we image tuft activity of additional mice..." 

      Added to the text.

      In the same sentence (219) it would be good to provide the number of additional mice imaged (2). 

      Added to the text.

      Regarding Supplementary Figure 1, it would be interesting to correlate the second peak after reward and learning rate, to provide further support to the sentences 109 to 113. 

      We agree this would be interesting to examine, but only four animals exhibited this second peak, which is too small of a sample to observe a meaningful correlation. We now clarify this in the text.

      In Figure 3, why not present the correlation between 'neural discriminability' and % of correct choices? 

      We appreciate the suggestion and have added this plot to Figure 3.

      The 'results' section will benefit tremendously if the authors consistently indicate the figures to which the results are being described, or 'data not shown' if it is the case. To give a few examples: 

      Sentence 108 - "averaged 28% ΔF/F" - From which figure is this result coming from?  Sentence 123 - "(p = 0.62, 0.64, respectively)" - comparison not shown, but see Figures 2E and D respectively? 

      Sentence 125 - "(CS+ responsive (...) across all sessions)" - From which figure is this result coming from? 

      Sentence 130 - "during pre-conditioning (p=0.66) or post-conditioning sessions (p=0.44) - From which figure? 

      Sentence 154 - "(Pre: p=0.20; last rewarded: p=0.43; Post: p=0.64, sign-rank test)" - From which figure? 

      Sentence 175 - "(-0.049, -0.001, and 0.003" - From which figure? Please show the graph that shows that the mean SI is not different. It can be supplementary. The distribution of SI will be strengthened by it.  

      We added this plot to supplementary figure 2.

      Sentence 244 - "(conditioned: 458/603; repeated exposure: 334/457) - From Figure 5E. 

      Sentence 256 - "(p=0.04, 2-sample t-test comparison mice) - From Figure 5B.  Sentence 258 - "(p=0.03, paired t-test) - from Figure 5B  Sentences 370 to 378 - No reference to the figure. 

      The 'discussion' section (sentences 459 to 494) refers to the differences between the current and previous studies (references 1,3,5), namely soma vs. dendrites and layer 2/3 vs. layer 5. However, it should also mention the difference between the nature of the stimuli and the brain area recorded (visual cortex vs. barrel cortex).

      We have addressed these issues in the text.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer 1:

      Authors reject the substance of Reviewer 1’s feedback primarily due to clear lack of understanding of typical parameterization practices used to avoid overfitting. To ensure the Spearman-rank correlation accuracy, 70% of all data was withheld from the optimization process and used solely for testing to yield figure 6. Data was withheld prior to model parameterization and therefore avoids Reviewer 1’s charge of “artificially forcing the correlation”. Authors did appreciate the request for clarification of additional definitions and minor reorganization suggestions. Below we provide specific responses to each numbered point (note: multiple responses are provided for some of the reviewer points).

      Point 1: Clarify Metrics Definition and Evaluation

      Authors clarified the description of biodiversity metrics. The metrics associated with manual methods are detailed in the third paragraph of the Materials and Methods: Data Analysis section, while the sensor-based metric is described in the second paragraph, and summarized in its last sentence.

      Text Additions:

      Authors added clarification to the introduction’s first paragraph defining biodiversity metrics, including species richness.

      Authors added detailed definitions of community metrics and their significance in community ecology in the Materials and Methods section (3rd paragraph of “Data Analysis” section). The discussion was updated to include a reference to community ecology and the benefits of big data, specifically highlighting the potential of autonomous optical sensors in entomology.

      Methods Reorganization

      We have reorganized the Methods section for clarity. Updated section clarifies metrics studied, location, dates, a description and methods around optical sensors, Malaise traps, and sweep netting.

      Text Additions:

      An overview paragraph was added to “Data analysis” (3rd paragraph) detailing key metrics used, specifying metrics such as abundance, richness, Shannon index, and Simpson index.

      Visualization methods for sensor data to deliver analogous metrics of abundance, richness, and diversity indices was added to “Data analysis” section.

      Supplementary Table 1 and the first paragraph of the Materials and Methods section cover location, dates, and other general information.

      Detailed descriptions and methods for optical sensors, Malaise traps, and sweeping are provided.

      Integration of Metrics

      Authors integrated two paragraphs explaining the fundamental differences between conventional methods in the 3rd paragraph of the discussion and the presented method of biodiversity measurement.

      Point 2: Body-to-Wing Ratio Calculation

      The backscattered optical cross-section is now clearly defined as the value measured at the maximum point of the event. Specifically, we have added the word ‘maximum’ to our methods section for clarity.

      Point 3: Ecosystem Services Paragraph

      We have shortened and edited this paragraph for clarity. The revised text is now more straightforward and comprehensible.

      Point 4: Results Section Structure

      We believe restructuring the results section around each metric would result in redundancy. The value of our analysis is in the comparison of different methods; therefore, instead of talking about methods in isolation, we provide an integrated discussion and comparison of all three methods across all metrics. Instead, we have maintained our current structure but ensured that the metrics are consistently described and analyzed.

      Point 5: Abundance Correlation

      We agree that the lack of a correlation between methods for abundance remains an open question. However, we maintain that fitting a linear model would be inappropriate and potentially misleading in the absence of significant correlation. We have clarified this in our manuscript.

      Point 6: Richness and Diversity Evaluations

      The authors disagree with Reviewer 1's feedback, citing a clear misunderstanding of standard parameterization practices used to prevent overfitting. Specifically, authors implemented a 30/70 Training/Testing split. Therefore only 30% of the data was used to fit the model and 70% of the dataset was reserved for testing to ensure the validity and reliability of our clustering results. By validating with a 70% testing dataset, we ensure that the clustering model can accurately group new data points and is robust against overfitting. This process helps verify that the identified clusters are meaningful and consistent across different subsets of the data.  Spearman's rho converts the data values into ranks and does not assume a linear relationship between the variables or require the data to follow a normal distribution. Spearman's rank correlation offers robustness against non-linearity and outliers by focusing on ranks. This approach is explained in the 4th paragraph of the “Data Analysis” section.

      Point 7: Clustering Method Credibility

      Authors acknowledge the variability in optical sensor features. However, the Law of Large Numbers supports increased insect measurement accuracy and stability occurs from optical insect sensors due to the increased number of observations made by the optical sensors compared to conventional methods. The manuscript now includes a detailed discussion of these aspects in the 3rd paragraph of discussion, emphasizing the correlation observed despite variability.

      Reviewer 2:

      Authors appreciate Reviewer 2’s feedback especially regarding contextualization. While authors disagree with the need for more specific experimental questions in a methods paper and the suggested need for more complex analysis, we agree with the essence of the review and added additional text regarding potential questions, method applications, and ecosystem processes for contextualization.

      Point 1: Larger Question Framing

      We present this article as a methodological paper rather than asking a specific experimental question. This approach is justified by the generalizable nature of methods papers, akin to those describing ImageJ or mass spectrometers. The method is widely applicable to a range of scientific questions. 

      We provided a discussion on how this technology could be applied in community ecology, conservation, and managed ecological systems like agriculture.

      In the Conclusion section we provided elaboration on the potential research questions and applications.

      Point 2: Complex Analyses

      While complex analyses like NMDS are useful for specific questions, this paper aims to establish the method. Once established, this method can be applied to various research questions in future studies. Therefore, as we are not directly asking an experimental question, more complex analysis is unnecessary.

      Point 3: Ecosystem Process (Granivory) Assay

      We have improved the contextualization and explanation of the ecosystem process assay throughout the manuscript, ensuring it is well-integrated and clear to readers.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      Summary:

      This paper explores how diverse forms of inhibition impact firing rates in models for cortical circuits. In particular, the paper studies how the network operating point affects the balance of direct inhibition from SOM inhibitory neurons to pyramidal cells, and disinhibition from SOM inhibitory input to PV inhibitory neurons. This is an important issue as these two inhibitory pathways have largely been studies in isolation. Support for the main conclusions is generally solid, but could be strengthened by additional analyses.

      Strengths:

      A major strength of the paper is the systematic exploration of how circuit architecture effects the impact of inhibition. This includes scans across parameter space to determine how firing rates and stability depend on effective connectivity. This is done through linearization of the circuit about an effective operating point, and then the study of how perturbations in input effect this linear approximation.

      Weaknesses:

      The linearization approach means that the conclusions of the paper are valid only on the linear regime of network behavior. The paper would be substantially strengthened with a test of whether the conclusions from the linearized circuit hold over a large range of network activity. Is it possible to simulate the full network and do some targeted tests of the conclusions from linearization? Those tests could be guided by the linearization to focus on specific parameter ranges of interest.

      We agree with the reviewer that it would be interesting to test if our results hold in a nonlinear regime of network behaviour (i.e. the chaotic regime, see also comment 1 by reviewer 2). As mentioned above, this requires a different type of model (either rate-based or spiking model with multiple neurons instead of modelling the mean population rate dynamics) which, in our opinion, exceeds the scope of this manuscript. Furthermore, the core measures of our study, network gain, and stability require linearization. In a chaotic regime where the linearization approach is impossible, we would need to consider/define new measures to characterize network response/activity. Therefore, while certainly being an interesting question to study, the broad scope of the studying networks in a nonlinear regime is better tackled in a separate study. We now acknowledge in the discussion of our manuscript that the linearization approach is a limitation in our study and that it would be an interesting future direction to investigate chaotic dynamics.

      The results illustrated in the figures are generally well described but there is very little intuition provided for them. Are there simplified examples or explanations that could be given to help the results make sense? Here are some places such intuition would be particularly helpful:

      page 6, paragraph starting ”In sum ...”

      Page 8, last paragraph

      Page 10, paragraph starting ”In summary ...”

      Page 11, sentence starting ”In sum ...”

      We agree with the reviewer that we didn’t provide enough intuition to our results. We now extended the paragraphs listed by the reviewer with additional information, providing a more intuitive understanding of the results presented in the respective chapter.

      Reviewer #2 (Public Review):

      Summary:

      Bos and colleagues address the important question of how two major inhibitory interneuron classes in the neocortex differentially affect cortical dynamics. They address this question by studying Wilson-Cowan-type mathematical models. Using a linearized fixed point approach, they provide convincing evidence that the existence of multiple interneuron classes can explain the counterintuitive finding that inhibitory modulation can increase the gain of the excitatory cell population while also increasing the stability of the circuit’s state to minor perturbations. This effect depends on the connection strengths within their circuit model, providing valuable guidance as to when and why it arises.

      Overall, I find this study to have substantial merit. I have some suggestions on how to improve the clarity and completeness of the paper.

      Strengths:

      (1) The thorough investigation of how changes in the connectivity structure affect the gain-stability relationship is a major strength of this work. It provides an opportunity to understand when and why gain and stability will or will not both increase together. It also provides a nice bridge to the experimental literature, where different gain-stability relationships are reported from different studies.

      (2) The simplified and abstracted mathematical model has the benefit of facilitating our understanding of this puzzling phenomenon. (I have some suggestions for how the authors could push this understanding further.) It is not easy to find the right balance between biologically detailed models vs simple but mathematically tractable ones, and I think the authors struck an excellent balance in this study.

      Weaknesses:

      (1) The fixed-point analysis has potentially substantial limitations for understanding cortical computations away from the steady-state. I think the authors should have emphasized this limitation more strongly and possibly included some additional analyses to show that their conclusions extend to the chaotic dynamical regimes in which cortical circuits often live.

      We agree with the reviewer that it would be interesting to test if our results hold in a chaotic regime of network behaviour (see also comment by reviewer 1). As mentioned above, this requires a different type of model (either rate-based or spiking model with multiple neurons instead of modelling the mean population rate dynamics) which, in our opinion, exceeds the scope of this manuscript. Furthermore, the core measures of our study, network gain, and stability require linearization. In a chaotic regime where the linearization approach is impossible, we would need to consider/define new measures to characterize network response/activity. Therefore, while certainly being an interesting question to study, the broad scope of the studying networks in a nonlinear regime is better tackled in a separate study. We now acknowledge in the discussion of our manuscript that the linearization approach is a limitation in our study and that it would be an interesting future direction to investigate chaotic dynamics.

      (2) The authors could have discussed – even somewhat speculatively – how SST interneurons fit into this picture. Their absence from this modelling framework stands out as a missed opportunity.

      We believe that the reviewer wanted us to speculate about VIP interneurons (and not SST interneurons, which we already do extensively in the manuscript). Previous models have included VIP neurons in the circuit (e.g. del Molino et al., 2017; Palmigiano et al., 2023; Waitzmann et al., 2024). While we do not model VIP cells explicitly, we implicitly assume that a possible source of modulation of SOM neurons comes from VIP cells. We have now added a short discussion on VIP cells in the last paragraph in our discussion section.

      (3) The analysis is limited to paths within this simple E,PV,SOM circuit. This misses more extended paths (like thalamocortical loops) that involve interactions between multiple brain areas. Including those paths in the expansion in Eqs. 11-14 (Fig. 1C) may be an important consideration.

      We agree with the reviewer that our framework can be extended to study many other different paths, like thalamocortical loops, cortical layer-specific connectivity motifs, or circuits with VIP or L1 inhibitory neurons. Studying these questions, however, are beyond the scope of our work. In our discussion, we now mention the possibility of using our framework to study those questions.

      Reviewer #3 (Public Review):

      Summary:

      Bos et al study a computational model of cortical circuits with excitatory (E) and two subtypes of inhibition parvalbumin (PV) and somatostatin (SOM) expressing interneurons. They perform stability and gain analysis of simplified models with nonlinear transfer functions when SOM neurons are perturbed. Their analysis suggests that in a specific setup of connectivity, instability and gain can be untangled, such that SOM modulation leads to both increases in stability and gain. This is in contrast with the typical direction in neuronal networks where increased gain results in decreased stability.

      Strengths:

      - Analysis of the canonical circuit in response to SOM perturbations. Through numerical simulations and mathematical analysis, the authors have provided a rather comprehensive picture of how SOM modulation may affect response changes.

      - Shedding light on two opposing circuit motifs involved in the canonical E-PV-SOM circuitry - namely, direct inhibition (SOM → E) vs disinhibition (SOM → PV → E). These two pathways can lead to opposing effects, and it is often difficult to predict which one results from modulating SOM neurons. In simplified circuits, the authors show how these two motifs can emerge and depend on parameters like connection weights.

      - Suggesting potentially interesting consequences for cortical computation. The authors suggest that certain regimes of connectivity may lead to untangling of stability and gain, such that increases in network gain are not compromised by decreasing stability. They also link SOM modulation in different connectivity regimes to versatile computations in visual processing in simple models.

      Weaknesses:

      The computational analysis is not novel per se, and the link to biology is not direct/clear.

      Computationally, the analysis is solid, but it’s very similar to previous studies (del Molino et al, 2017). Many studies in the past few years have done the perturbation analysis of a similar circuitry with or without nonlinear transfer functions (some of them listed in the references). This study applies the same framework to SOM perturbations, which is a useful and interesting computational exercise, in view of the complexity of the high-dimensional parameter space. But the mathematical framework is not novel per se, undermining the claim of providing a new framework (or ”circuit theory”).

      In the introduction we acknowledge that our analysis method is not novel but is rather based on previous studies (del Molino et al., 2017; Kuchibhotla et al., 2017; Kumar et al., 2023, Litwin-Kumar et al., 2016; Mahrach et al., 2020; Palmigiano et al., 2023; Veit et al., 2023; Waitzmann et al., 2024). We now rewrote parts of the introduction to make sure that it does not sound like the computational analysis has been developed by us, but that we rather use those previously developed frameworks to dissect stability and gain via SOM modulation.

      Link to biology: the most interesting result of the paper with regard to biology is the suggestion of a regime in which gain and stability can be modulated in an unconventional way - however, it is difficult to link the results to biological networks: - A general weakness of the paper is a lack of direct comparison to biological parameters or experiments. How different experiments can be reconciled by the results obtained here, and what new circuit mechanisms can be revealed? In its current form, the paper reads as a general suggestion that different combinations of gain modulation and stability can be achieved in a circuit model equipped with many parameters (12 parameters). This is potentially interesting but not surprising, given the high dimensional space of possible dynamical properties. A more interesting result would have been to relate this to biology, by providing reasoning why it might be relevant to certain circuits (and not others), or to provide some predictions or postdictions, which are currently missing in the manuscript.

      - For instance, a nice motivation for the paper at the beginning of the Results section is the different results of SOM modulation in different experiments - especially between L23 (inhibition) and L4 (disinhibition). But no further explanation is provided for why such a difference should exist, in view of their results and the insights obtained from their suggested circuit mechanisms. How the parameters identified for the two regimes correspond to different properties of different layers?

      As pointed out by the reviewer, the main goal of our manuscript is to provide a general understanding of how gain and stability depend on different circuit motifs (ie different connectivity parameters), and how circuit modulations via SOM neurons affect those measures. However, we agree with the reviewer that it would be useful to provide some concrete predictions or postdictions following from our study.

      An interesting example of a postdiction of our model is that the firing rate change of excitatory neurons in response to a change in the stimulus (which we define as network gain, Eq. 2) depends on firing rates of the excitatory, PV, and SOM neurons at the moment of stimulus presentation (Fig. 3ii; Fig. 4Aii,Bii,Cii; Fig. 5Aii, Bii, Cii). Hence any change in input to the circuit can affect the response gain to a stimulus presentation, in line with experimental evidence which suggests that changes in inhibitory firing rates and changes in the behavioral state of the animal lead to gain modifications (Ferguson and Cardin 2020).

      Another recent concrete example is the study of Tobin et al., 2023, in which the authors show that optogenetically activating SOM cells in the mouse primary auditory cortex (A1) decreases the excitatory responses to auditory stimuli. In our framework, this corresponds to the case of decreases in network gain (gE) for positive SOM modulation, as seen in the circuit with PV to SOM feedback connectivity (Suppl. Fig. S1).

      Another example is the study by Phillips and Hasenstaub 2016, in which the authors study the effect of optogenetic perturbations of SOM (and PV) cells on tuning curves of pyramidal cells in mouse A1. While they find large heterogeneity in additive/subtractive or multiplicative/divisive tuning curve changes following SOM inactivation, most cells have a purely multiplicative or purely additive component (and none of the cells have a divisive component). In our study, we see that large multiplicative responses of the excitatory population follow from circuits with strong E to SOM feedback connectivity.

      We note that in future computational studies, it would be useful to apply our framework with a focus on a specific brain region and add all relevant cell types (at a minimum E, PV, SOM, and VIP) plus a dendritic compartment, in order to formulate much more precise experimental predictions.

      We have now added additional information to the discussion section.

      - Another caveat is the range of parameters needed to obtain the unintuitive untangling as a result of SOM modulation. From Figure 4, it appears that the ”interesting” regime (with increases in both gain and stability) is only feasible for a very narrow range of SOM firing rates (before 3 Hz). This can be a problem for the computational models if the sweet spot is a very narrow region (this analysis is by the way missing, so making it difficult to know how robust the result is in terms of parameter regions). In terms of biology, it is difficult to reconcile this with the realistic firing rates in the cortex: in the mouse cortex, for instance, we know that SOM neurons can be quite active (comparable to E neurons), especially in response to stimuli. It is therefore not clear if we should expect this mechanism to be a relevant one for cortical activity regimes.

      We agree with the reviewer that it’s important to test the robustness of our results. As suggested by the reviewer, we now include a new supplementary figure (Suppl. Fig. S2) which measures the percentage of data points in the respective quadrant Q1-Q4 when changing the SOM firing rates (as done in Fig. 5). We see that the quadrants in which the network gain and stability change in the same direction (Q2 and Q3) remain high in the case for E to SOM feedback (Suppl. Fig. S2A) over SOM rates ranging over 0-10 Hz (and likely beyond).

      - One of the key assumptions of the model is nonlinear transfer functions for all neuron types. In terms of modelling and computational analysis, a thorough analysis of how and when this is necessary is missing (an analysis similar to what has been attempted at in Figure 6 for synaptic weights, but for cellular gains). In terms of biology, the nonlinear transfer function has experimentally been reported for excitatory neurons, so it’s not clear to what extent this may hold for different inhibitory subtypes. A discussion of this, along with the former analysis to know which nonlinearities would be necessary for the results, is needed, but currently missing from the study. The nonlinearity is assumed for all subtypes because it seems to be needed to obtain the results, but it’s not clear how the model would behave in the presence or absence of them, and whether they are relevant to biological networks with inhibitory transfer functions.

      It is true that the nonlinear transfer function is a key component in our model. We chose identical transfer functions for E, PV, and SOM (; Eq. 4) to simplify our analysis. If the transfer function of one of the neuron types would be linear (β \= 1), then the corresponding b terms (the slope of the nonlinearity at the steady state; b \= dfX/dqX; Fig. 1B; Eq. 4) would be equal to α. Therefore, if neurons had a linear transfer function in our model, there would not be a dependence of network gain on E and PV firing rate as studied in Fig. 3-5. This is because the relationship between PV rates and their gain would be constant (bP \= α) in Fig. 1B (bottom).

      If all the transfer functions were linear, changes in firing rates would not have an impact on network gain or stability. Changing the nonlinear transfer function by changing the α or β terms in Eq. 4 would only scale the way a change in the rates affects the b terms and hence the results presented in Fig. 3-5. More interesting would be to study how different types of nonlinearities, like sigmoidal functions or sublinear nonlinearities (i.e. saturating nonlinearities), would change our results. However, we think that such an investigation is out of scope for this study. We now added a comment to the Methods section.

      Experimentally, F-I curves have been measured also for PV and SOM neurons. For example, Romero-Sosa et al., 2021 measure the F-I curve of pyramidal, PV and SOM neurons in mouse cortical slices. They find that similar to pyramidal neurons, PV and SOM neurons show a nonlinear F-I curve. We now added the citation of Romero-Sosa et al., 2021 to our manuscript.

      - Tuning curves are simulated for an individual orientation (same for all), not considering the heterogeneity of neuronal networks with multiple orientation selectivity (and other visual features) - making the model too simplistic.

      The reviewer is correct that we only study changes in tuning curves in a simplistic model. In our model, the excitatory and PV populations are tuned to a single orientation (in the case of Fig. 7 to θ \= 90). While this is certainly an oversimplification, it allows us to understand how additive/subtractive and multiplicative/divisive changes in the tuning curves come about in networks with different connectivity motifs. To model heterogeneity of tuning responses within a network, it requires more complex models. A natural choice would be to extend a classical ring attractor model (Rubin et al., 2015) by splitting the inhibitory population into PV and SOM neurons, or study the tuning curve heterogeneity that occurs in balanced networks (Hansel and van Vreeswijk 2012). However, this model has many more parameters, like the spatial connectivity profiles from and onto PV and SOM neurons. While highly valuable, we believe that studying such models exceeds the scope of our current manuscript. We now added a paragraph in the discussion section, mentioning this as an interesting future direction.

      Reviewer #1 (Recommendations For The Authors):

      The last sentence of the abstract is hard to interpret before reading the rest of the paper - suggest replacing or rephrasing.

      We rephrased the sentence to make more clear what we mean.

      Page 3, last full paragraph: I think this assumes that phi is positive. What is the justification for that assumption? More generally, I think you could say a bit more about phi in the main text since it is a fairly complicated term.

      The reviewer is correct, for a stable system phi is always positive. We now clarify this and explain phi in more detail in the main text.

      Fig 1D: It would be helpful to identify when the stimulus comes on and be clearer about what the stimulus is. I assume it’s a step increase in S input at 0.05 s or so - but that should be immediately apparent looking at the figure.

      We agree with the reviewer and we added a dashed line at the time of stimulus onset in Fig. 1D.

      Page 5: ”To motivate our analysis we compare ... (Fig. 2A)” - Figure 2A does not show responses without modulation, so this sentence is confusing.

      The dashed lines in Fig. 2A (and Fig. 2C) actually represents the rate change without modulation.

      Page 6: sentence “The central goal of our study ...” seems out of place since this is pretty far into the results, and that goal should already be clear.

      We agree with the reviewer, hence we updated the sentence.

      Page 10, top: the green curve in panel Aii always has a negative slope - so I am confused by the statement that increasing wSE decreases both gain and stability.

      We thank the reviewer for pointing out this mistake. We now fixed it in the text.

      Figure 6: in general it is hard to see what is going on in this figure (the green and blue in particular are hard to distinguish). Some additional labels would be helpful, but I would also see if the color scheme can be improved.

      We added a zoom-in to the panels which were hard to distinguish.

      Reviewer #2 (Recommendations For The Authors):

      Major recommendations:

      (1) The authors should explain early on in the results section what the key factor(s) is that differentiates SOM from PV cells in their model. E.g., in Fig. 1A, the only obvious difference is that SOM cells don’t inhibit themselves. However, later on in the paper, the difference in external stimulus drive to these interneuron classes is more heavily emphasized. Given the importance of that difference (in external stim drive), I think this should be highlighted early on.

      We now mention the key factors that differentiate PV and SOM neurons already when describing Fig. 1A.

      (2) The result in Figs. 5,6 demonstrate that recurrent SOM connectivity is important for achieving increases in both gain and stability. This observation could benefit from some intuitive explanation. Perhaps the authors could find this explanation by looking at their series expansion (Eqs. 11-14, Fig. 1C) and determining which term(s) are most important for this effect. The corresponding paths through the circuit – the most important ones – could then be highlighted for the reader.

      We agree with the reviewer that our results benefit from more intuitive explanations. This has also been pointed out by reviewer 1 in their public review. We now extended the concluding paragraphs in the context of Fig. 4-6 with additional information, providing a more intuitive understanding of the results presented in the respective chapter. While it is possible to gain an intuitive understanding of how the network gain depends on rate and weight parameters (Eq. 2), this understanding is unfortunately missing in the case of stability. The maximum eigenvalue of the system have a complex relationship with all the parameters, and often have nonlinear dependencies on changes of a parameter (e.g. as we show in Fig. 3iv or one can see in Fig. 6). We now discuss this difficulty at the end of the section “Influence of weight strength on network gain vs stability”.

      (3) I think the authors should consider including some analyses that do not rely on the system being at or near a fixed point. I admit that such analysis could be difficult, and this could of course be done in a future study. Nevertheless, I want to reiterate that this addition could add a lot of value to this body of work.

      As outlined above, we decided to not include additional analysis on network behaviour in nonlinear regimes but we now acknowledge in the discussion of our manuscript that the linearization approach is a limitation in our study and that it would be an interesting future direction to investigate chaotic dynamics.

      Minor recommendations:

      (1) At the top of P. 6, when the authors first discuss the stability criterion involving eigenvalues, they should address the question ”eigenvalues of what?”. I suggest introducing the idea of the Jacobian matrix, and explaining that the largest eigenvalue of that matrix determines how rapidly the system will return to the fixed point after a small perturbation.

      We included an additional sentence in the respective paragraph explaining the link between stability and negative eigenvalues, and we also added a sentence in the Methods section stating the the largest real eigenvalue dominates the behavior of the dynamical system.

      (2) The panel labelling in Fig. 3 is unnecessarily confusing. It would be simpler (and thus better) to simply label the panels A,B,C,D, or i,ii,iii,iv, instead of the current labelling: Ai, Aii, Aiii, Aiv. (There are currently no panels ”B” in Fig. 3).

      We updated the figure accordingly.

      Reviewer #3 (Recommendations For The Authors):

      • Suggestions for improved or additional experiments, data or analyses.

      Analysis of the effect of different nonlinear transfer functions is necessary.

      Please see our detailed answer to the reviewer’s comment in the public review above.

      Analysis of gain modulation in models with more realistic tuning properties.

      Please see our detailed answer to the reviewer’s comment in the public review above.

      Mathematical analysis of the conditions to obtain ”untangled” gain and stability:

      One of the promises of the paper is that it is offering a computational framework or circuit theory for understanding the effect of SOM perturbation. However, the main result, namely the untangling of gain and stability, has only been reported in numerical simulations (e.g. Fig. 6). Different parameters have been changed and the results of simulations have been reported for different conditions. Given the simplified model, which allows for rigorous mathematical analysis, isn’t it possible to treat this phenomenon more analytically? What would be the conditions for the emergence of the untangled regime? This is currently missing from the analyses and results.

      We agree with the reviewer that our results benefit from more intuitive explanations. This has also been pointed out by reviewer 1 in their public review. We now extended the concluding paragraphs in the context of Fig. 4-6 with additional information, providing a more intuitive understanding of the results presented in the respective chapter. While it is possible understand analytically of how the network gain depends on rate and weight parameters (Eq. 2), this understanding is unfortunately missing in the case of stability. The maximum eigenvalue of the system have a complex relationship with all the parameters, and often have nonlinear dependencies on changes of a parameter (e.g. as we show in Fig. 3iv or one can see in Fig. 6). This doesn’t allow for a a deep analytical understanding of the entangling of gain and stability. We now discuss this difficulty at the end of the section “Influence of weight strength on network gain vs stability”.

      • Recommendations for improving the writing and presentation. The Results section is well written overall, but other parts, especially the Introduction and Discussion, would benefit from proof reading - there are many typos and problems with sentence structures and wording (some mentioned below).

      We have gone through the manuscript again and improved the writing.

      The presentation of the dependence on weight in Figure 6 can be improved. For instance, the authors talk about the optimal range of PV connectivity, but this is difficult to appreciate in the current illustration and with the current colour scheme.

      We added a zoom in to the panels which were hard to distinguish.

      • Minor corrections to the text and figures. Text:

      We thank the reviewer for their thorough reading of our manuscript. We fixed all the issues from below in the manuscript.

      Some examples of bad structure or wording:

      From the Abstract:

      ”We show when E - PV networks recurrently connect with SOM neurons then an SOM mediated modulation that leads to increased neuronal gain can also yield increased network stability.” From Introduction:

      Sentence starting with ”This new circuit reality ...”

      ”Inhibition is been long identified as a physiological or circuit basis for how cortical activity changes depending upon processing or cognitive needs ...”

      Sentence starting with ”Cortical models with both ...”

      ”... allowing SOM neurons the freedom to ..”

      From Results:

      ”... affects of SOM neurons on E ..”

      ”seem in opposition to one another, with SOM neuron activity providing either a source or a relief of E neuron suppression”. The sentence after is also difficult to read and needs to be simplified.

      P. 7: ”We first remark that ...”

      Difficult to read/understand - long and badly structured sentence.

      P. 8: ”adding a recurrent connection onto SOM neurons from the E-PV subcircuit” It’s from E (and not PV) to be more precise (Fig. 5).

      Discussion:

      ”Firstly, E neurons and PV neurons experience very similar synaptic environments.” What does it mean?

      ”Fortunately, PV neurons target both the cell bodies and proximal dendrites” Fortunately for whom or what? ”in line with arge heterogeneity”

      Methods:

      Matrix B is never defined - the diagonal matrix of b (power law exponents) I assume.

      Some of the other notations too, e.g. bs, etc (it’s implicit, but should be explained).

      Structure of sentence:

      ”Network gain is defined as ...” (p. 17)

      Figure:

      The schematics in Figure 4 can be tweaked to highlight the effect of input (rather than other components of the network, which are the same and repetitive), to highlight the main difference for the reader.

    1. Reviewer #1 (Public review):

      Summary:

      The aim of the experiment reported in this paper is to examine the nature of the representation of a template of an upcoming target. To this end, participants were presented with compound gratings (consisting of tilted to the right and tilted to the left lines) and were cued to a particular orientation - red left tilt or blue right tilt (counterbalanced across participants). There are two directly compared conditions: (i) no ping: where there was a cue, that was followed by a 5.5-7.5s delay, then followed by a target grating in which the cued orientation deviated from the standard 45 degrees; and (ii) ping condition in which all aspects were the same with the only difference that a ping (visual impulse presented for 100ms) was presented after the 2.5 seconds following the cue. There was also a perception task in which only the 45 degrees to the right or to the left lines were presented. It was observed that during the delay, only in the ping condition, were the authors able to decode the orientation of the to-be-reported target using the cross-task generalization. Attention decoding, on the other hand, was decoded in both ping and non-ping conditions. It is concluded that the visual system has two different functional states associated with a template during preparation: a predominantly non-sensory representation for guidance and a latent sensory-like for prospective stimulus processing.

      Strengths:

      There is so much to be impressed with in this report. The writing of the manuscript is incredibly clear. The experimental design is clever and innovative. The analysis is sophisticated and also innovative - the cross-task decoding, the use of Mahalanobis distance as a function of representational similarity, the fact that the question is theoretically interesting, and the excellent figures.

      Weaknesses:

      While I think that this is an interesting study that addresses an important theoretical question, I have several concerns about the experimental paradigm and the subsequent conclusions that can be drawn.

      (1) Why was V1 separated from the rest of the visual cortex, and why the rest of the areas were simply lumped into an EVC ROI? It would be helpful to understand the separation into ROIs.

      (2) It would have been helpful to have a behavioral measure of the "attended" orientation to show that participants in fact attended to a particular orientation and were faster in the cued condition. The cue here was 100% valid, so no such behavioral measure of attention is available here.

      (3) As I was reading the manuscript I kept thinking that the word attention in this manuscript can be easily replaced with visual working memory. Have the authors considered what it is about their task or cognitive demand that makes this investigation about attention or working memory?

      (4) If I understand correctly, the only ROI that showed a significant difference for the cross-task generalization is V1. Was it predicted that only V1 would have two functional states? It should also be made clear that the only difference where the two states differ is V1.

      (5) My primary concern about the interpretation of the finding is that the result, differences in cross-task decoding within V1 between the ping and no-ping condition might simply be explained by the fact that the ping condition refocuses attention during the long delay thus "resharpening" the template. In the no-ping condition during the 5.5 to 7.5 seconds long delay, attention for orientation might start getting less "crisp." In the ping condition, however, the ping itself might simply serve to refocus attention. So, the result is not showing the difference between the latent and non-latent stages, rather it is the difference between a decaying template representation and a representation during the refocused attentional state. It is important to address this point. Would a simple tone during the delay do the same? If so, the interpretation of the results will be different.

      (6) The neural pattern distances measured using Mahalanobis values are really great! Have the authors tried to use all of the data, rather than the high AMI and low AMI to possibly show a linear relationship between response times and AMI?

      (7) After reading the whole manuscript I still don't understand what the authors think the ping is actually doing, mechanistically. I would have liked a more thorough discussion, rather than referencing previous papers (all by the co-author).

    2. Reviewer #3 (Public review):

      This paper discusses how non-sensory and latent, sensory-like attentional templates are represented during attentional preparation. Using multivariate pattern analysis, they found that visual impulses can enhance the decoding generalization from perception to attention tasks in the preparatory stage in the visual cortex. Furthermore, the emergence of the sensory-like template coincided with enhanced information connectivity between V1 and frontoparietal areas and was associated with improved behavioral performance. It is an interesting paper with supporting evidence for the latent, sensory-like attentional template, but several problems still need to be solved.

      (1) The title is "Dual-format Attentional Template," yet the supporting evidence for the non-sensory format and its guiding function is quite weak. The author could consider conducting further generalization analysis from stimulus selection to preparation stages to explore whether additional information emerges.

      (2) In Figure 2, the author did not find any decodable sensory-like coding in IPS and PFC, even during the impulse-driven session, indicating that these regions do not represent sensory-like information. However, in the final section, the author claimed that the impulse-driven sensory-like template strengthens informational connectivity between sensory and frontoparietal areas. This raises a question: how can we reconcile the lack of decodable coding in these frontoparietal regions with the reported enhancement in network communication? It would be helpful if the author provided a clearer explanation or additional evidence to bridge this gap.

      (3) Given that the impulse-driven sensory-like template facilitated behavior, the author proposed that it might also enhance network communication. Indeed, they observed changes in informational connectivity. However, it remains unclear whether these changes in network communication have a direct and robust relationship with behavioral improvements.

      (4) I'm uncertain about the definition of the sensory-like template in this paper. Is it referring to the Ping impulse-driven condition or the decodable performance in the early visual cortex? If it is the former, even in working memory, whether pinging identifies an activity-silent mechanism is currently debated. If it's the latter, the authors should consider whether a causal relationship - such as "activating the sensory-like template strengthens the informational connectivity between sensory and frontoparietal areas" - is reasonable.

    1. Programme en PDF

      Lutter Contre les Inégalités Sociales de l'École à l'Enseignement Supérieur : Déterminants et Solutions Source 1 : Excerpts from "CONFÉRENCE INTERNATIONALE : Agir sur les inégalités sociales de l’école à l’enseignement supérieur"

      [01:11:10][^3^][3] Introduction par Stanislas Dehan

      I. Introduction par Gabriel Atal, Ministre de l'Éducation Nationale 01:11:50

      Souligne l'importance de l'orientation scolaire pour la justice sociale et la nécessité de lutter contre le déterminisme social.

      Annonce la généralisation des cours d'empathie pour lutter contre le harcèlement scolaire.

      II. Présentation du Conseil Scientifique par Stanislas Dehaene [01:21:5

      Décrit la composition, les missions et les actions du Conseil Scientifique de l'Éducation Nationale.

      Met en avant l'importance de la recherche scientifique et de la comparaison internationale pour lutter contre les inégalités.

      1:31:14 Ouverture par Elise Huilleriy

      Problème n°1 du système éducatif français

      1:47:10 Session 1 Agir sur les compétences sociales et comportementales Introduction de Julien Grenet

      III. Présentation des Disparités Sociales dès la CP par Carlo Barone 01:48:19

      Démontre, à travers des données scientifiques, les disparités sociales dès la CP en matière de langage, de mathématiques et de compétences comportementales.

      Analyse l'influence du milieu social sur les pratiques éducatives parentales.

      2:14:16 Yann Algan Impact des compétences socio-comportementales sur la réussite des élèves

      IV. Présentation d'une Intervention pour Promouvoir l'Esprit de Croissance par Élise Huillery

      Explique le concept d'esprit de croissance et son importance pour la réussite scolaire.

      Détaille une intervention utilisant des SMS pour encourager l'esprit de croissance chez les parents et les élèves.

      V. 2:14:16 Yann Algan Impact des compétences socio-comportementales sur la réussite des élèves

      Développe l'importance des compétences comportementales, telles que la persévérance et le locus de contrôle, pour la réussite scolaire.

      Compare la France à d'autres pays de l'OCDE et souligne les faiblesses françaises en matière de compétences comportementales.

      VI. Présentation d'une Expérimentation sur le Développement des Compétences Sociales par un Chercheur Invité

      Partage une expérimentation menée à Montréal utilisant des jeux de rôle pour développer les compétences sociales et la gestion des conflits.

      Souligne l'impact positif de ces interventions sur le climat scolaire.

      VII. Discussions et Questions (2:36:29)

      2:36:52 Question coéducation et la présence des espaces parents

      2:38:14 Question compétences psychosociales et la formation des professionnels

      2:40:10 Question sur l'efficacité concernant les public les plus défavorisés

      2:42:39 Question Lien entre les déficits en CPS et les troubles neurodéveloppementaux

      2:43:56 question sur la surprise sur le progressive parenting

      Pas plus d'ouverture éducative des parents favorisés en France Moins de motivation intrinsèques en France

      Échange entre les participants sur les résultats des études présentées.

      2:46:09 Session 2

      2:46:31 intervention Julien Grenet Mixité sociale

      3:08:22 Intervention Nina Guyon Fermeture de collège

      3:28:33 Questions rôle du rang et l'estime de soi

      3:29:10 Question sur l'exemple Finlandais (peu de privé)

      3:31:53 Question déségrégation par et entre les classes (évoque les CHA)

      3:35:01 Question

      3:37:58 Pause

      Discussions sur les pistes d'action pour réduire les inégalités sociales à l'école, notamment l'importance de la mixité sociale et de l'information sur les filières d'orientation.

      4:01:37 Session 3 Orientation et inégalité scolaire

      VIII. Présentation de l'Orientation en Fin de 3ème par Pascal Bressoux 04:01:51

      Analyse du processus d'orientation en fin de 3ème et met en lumière l'influence des notes, de l'origine sociale et des recommandations du collège.

      Démontre l'existence d'un effet d'auto-sélection sociale et l'impact des recommandations du collège sur les choix d'orientation.

      IX. 04:19:14 Coralie Chevallier Comportement d'attente et choix dans l'enseignement supérieur

      Explique le lien entre le statut socio-économique et les comportements d'attente, qui influencent les décisions d'orientation.

      Analyse les comportements d'attente des élèves de terminale face aux choix d'orientation post-bac.

      X. Discussions et Questions 04:34:59

      Échange entre les participants sur les disparités d'orientation selon l'origine sociale et les pistes d'action pour les réduire.

      4:35:25 Discussion sur le rôle de l'autocensure, des contraintes économiques et de l'information disponible dans les choix d'orientation.

      4:39:35 Question quel est l'échelle utilisée dans les recherches présentées

      4:43:47 Question Parcoursup

      4:45:21 Intervention Michela Carlana Trajectoire des élèves issus de l'immigration

      5:15:48 Question sur l'idée d'étutier d'autre pays que l'Italie

      XI. Table Ronde sur les Politiques Publiques pour Réduire les Inégalités (05:20:54)

      Débat sur l'efficacité et les limites des politiques publiques existantes, comme l'éducation prioritaire et les internats d'excellence.

      Réflexion sur la nécessité de s'attaquer aux inégalités sociales au-delà de l'école pour réduire les disparités dans l'éducation.

      05:23:02 Marc Burgand sur les classes de niveau et redoublement

      05:28:42 intervenante 2 sur les CPS et réussite scolaire

      5:32:56 Intervenante 3 sur parcoursup

      5:40:29 Jean-Paul Delahaye

      XII. Discussions et Questions (05:47:07)

      05:48:04 M Burgand dilution de l'éducation prioritaire - l'internat de Sourdun 77

      05:53:08 intervenant 2 les inégalité au-delà de l'école

      05:57:16 intervenant enseignants solitaires et gestion RH

      06:00:23 Delahaye inégalité est une question politique

      Échange entre les participants et les intervenants sur les solutions possibles pour lutter contre les inégalités sociales à l'école.

      Discussion sur l'importance de la formation des enseignants, du soutien aux élèves en difficulté, de la mixité sociale et de la valorisation de tous les métiers.

      06:05:40 Question bilan besoin de lunettes - détection et gradient social

      06:09:47 question de l'universalisme, coopération est confiance en soi notamment par les pairs

      06:14:42 question l'argent dans les lycée pro a t'elle une face cachée

      Résumé de la vidéo [00:11:10][^1^][1] - [00:31:52][^2^][2]:

      Cette vidéo traite des inégalités sociales dans le système éducatif français, de l'école à l'enseignement supérieur. Elle met en lumière les défis actuels et les initiatives pour améliorer l'égalité des chances.

      Moments forts:

      • [01:11:10][^3^][3] Introduction par Stanislas Dehan
        • Présentation du thème
        • Remerciements aux organisateurs
        • Message du ministre Gabriel Attal
      • 01:11:50 Message du ministre Gabriel Attal
        • Importance de la recherche et de l'expérimentation
        • Efforts budgétaires pour l'éducation prioritaire
        • Initiatives pour réduire les inégalités
      • [01:21:52][^5^][5] Axes prioritaires pour l'éducation
        • Maîtrise des savoirs fondamentaux
        • Adaptation des conditions d'apprentissage
        • Investissement dans les petites classes
      • Orientation et culture
        • Importance de l'orientation scolaire
        • Accès à la culture pour tous les élèves
        • Développement des compétences psychosociales
      • Rôle du Conseil Scientifique
        • Création et missions du Conseil
        • Publications et recommandations
        • Importance de la recherche appliquée en éducation

      Résumé de la vidéo [01:31:55][^1^][1] - [01:03:50][^2^][2]:

      Cette partie de la conférence aborde les inégalités sociales dans le système éducatif français, en mettant en lumière les défis et les politiques mises en place pour les réduire.

      Moments forts:

      • Introduction et contexte
        • Importance du thème des inégalités sociales
        • Mesures internationales révélant la gravité du problème en France
        • Objectifs de la journée de conférence
      • Politiques éducatives passées
        • Éducation prioritaire et ses évaluations
        • Programmes comme "Coup de Pouce Clé" et leurs impacts
        • Plans de réussite éducative et leurs résultats
      • Réformes récentes
        • Loi de refondation de l'école (2013)
        • Initiatives pour la mixité sociale (2016)
        • Dédoublement des classes de CP et CE1
      • Interventions parentales
        • Importance du soutien parental
        • Exemples d'interventions réussies
        • Barrières à l'implication des parents
      • Motivation et activités des enfants
        • Types de motivation des élèves
        • Activités ludiques à la maison
        • Impact des activités sur le développement des enfants 1:47:16 Session 1 Agir pour les compétences socio comportementales introduction par Julien Grenet Résumé de la vidéo [01:03:51][^1^][1] - [01:31:01][^2^][2]:

      Cette partie de la conférence aborde les inégalités sociales dans l'éducation, en se concentrant sur les pratiques parentales et les compétences socio-comportementales des élèves.

      Moments forts: + [01:03:51][^3^][3] Pratiques parentales et discipline * Participation des parents aux réunions scolaires * Explication des règles à la maison * Gestion des écrans et surveillance des devoirs + [01:06:02][^4^][4] Impact du milieu social * Lien entre milieu social et pratiques disciplinaires * Capacité des enfants à se concentrer et à se contrôler * Réactions des parents face aux difficultés scolaires + [01:15:02][^5^][5] Compétences socio-comportementales * Importance de l'estime de soi et de l'optimisme * Développement de l'esprit de croissance * Locus de contrôle et perception de la réussite + [01:22:03][^6^][6] Compétences sociales * Confiance et empathie envers les autres * Capacité à comprendre les intentions des autres * Importance de la coopération et du travail en groupe + [01:24:00][^7^][7] Interventions éducatives * Programmes pour développer l'esprit de croissance * Importance des compétences sociales dès le jeune âge * Exemples d'interventions réussies à Montréal et ailleurs

      Résumé de la vidéo [01:31:03][^1^][1] - [01:55:37][^2^][2]:

      Cette vidéo traite des inégalités sociales dans l'éducation, de l'école à l'enseignement supérieur, et des interventions pour les réduire.

      Moments forts: + [01:31:03][^3^][3] Impact des interventions éducatives * Jeux de coopération pour les enfants * Augmentation de la confiance et des résultats scolaires * Réduction des taux de criminalité et augmentation des revenus + [01:34:01][^4^][4] Formation des enseignants * Importance de changer la posture pédagogique * Programme "Motiveaction" pour former les enseignants * Expérimentation en cours avec des résultats prometteurs + [01:37:01][^5^][5] Questions des participants * Importance de la coéducation avec les parents * Formation spécifique pour les enseignants sur les compétences psychosociales * Retours des parents sur les interventions + [01:46:02][^6^][6] Mixité sociale et performances éducatives * Initiatives pour réduire la ségrégation sociale dans les collèges * Effets sur les résultats scolaires et la cohésion sociale * Importance de la mixité sociale pour l'équité éducative

      Résumé de la vidéo [01:55:40][^1^][1] - [02:16:17][^2^][2]:

      Cette partie de la conférence traite des dispositifs et des expérimentations visant à réduire les inégalités sociales dans les établissements scolaires en France. Elle aborde les méthodes d'évaluation et les résultats obtenus.

      Moments forts: + [01:55:40][^3^][3] Dispositifs pour attirer les parents favorisés * Sections internationales et classes AR * Actions combinées avec des réformes structurelles * Exemple parisien avec secteurs bicollège + [01:57:15][^4^][4] Objectifs et méthodologie d'évaluation * Évaluer les effets sur la mixité sociale * Mesurer les performances scolaires et compétences sociales * Comparaison entre sites expérimentaux et sites témoins + [02:00:18][^5^][5] Effets sur la mixité sociale * Augmentation de la mixité dans les collèges * Effets limités par l'absence de potentiel de mixité dans certains sites * Concentration sur les sites avec potentiel de mixité élevé + [02:03:00][^6^][6] Effets sur les réseaux d'amitié et la vie scolaire * Progression de la mixité sociale dans les classes et amitiés * Pas d'effet significatif sur les absences et sanctions * Amélioration de l'estime de soi chez les élèves favorisés + [02:05:40][^7^][7] Compétences sociales et bien-être des élèves défavorisés * Meilleure qualité des relations amicales * Sentiment de sécurité accru * Amélioration des compétences de coopération et de travail en groupe

      Résumé de la vidéo [02:16:19][^1^][1] - [02:37:22][^2^][2]:

      Cette partie de la conférence aborde les effets des fermetures de collèges sur les élèves, en particulier ceux issus de zones d'éducation prioritaire (ZUS). Les intervenants discutent des impacts sur la violence scolaire, la ségrégation sociale, et les performances académiques.

      Points forts : + [02:16:19][^3^][3] Impact des fermetures de collèges * Effets sur les cohortes d'élèves * Normes sociales et violence scolaire * Différences entre les cohortes + [02:18:00][^4^][4] Effets sur le décrochage scolaire * Graphiques sur le décrochage après la troisième * Comparaison entre différentes cohortes * Effets significatifs observés + [02:22:00][^5^][5] Mécanismes bénéfiques et pénalisants * Amélioration du climat scolaire * Expérience des enseignants * Effets de rang et temps de trajet + [02:26:00][^6^][6] Déségrégation et ségrégation sociale * Effets sur la déségrégation dans les nouveaux collèges * Ségrégation interne entre les classes * Impact sur les élèves défavorisés et favorisés + [02:30:00][^7^][7] Questions et discussions * Exemple finlandais et enseignement privé * Effets de la ségrégation parmi les classes * Estime de soi et réussite scolaire

      Résumé de la vidéo [02:37:24][^1^][1] - [03:27:05][^2^][2]:

      Cette vidéo traite des inégalités sociales dans l'éducation, de l'école à l'enseignement supérieur. Elle explore comment les enseignants s'adaptent aux élèves de divers milieux et les impacts des choix d'orientation scolaire.

      Moments forts: + [02:37:24][^3^][3] Adaptation des enseignants * Comprendre les pratiques face à des classes hétérogènes * Importance des évaluations * Évolution des pratiques pédagogiques + [03:01:38][^4^][4] Orientation scolaire * Processus en quatre étapes * Impact des choix familiaux et des élèves * Autosélection sociale et ses effets + [03:09:00][^5^][5] Étude empirique * Analyse des choix d'orientation en fin de 3e * Impact des notes et de l'indice social * Différences entre enfants de professeurs et d'ouvriers + [03:16:00][^6^][6] Recommandations des collèges * Influence des notes et de l'indice social * Effets de compensation ou d'accentuation * Impact sur les décisions finales des élèves + [03:19:01][^7^][7] Comportements d'attente et choix * Différences selon le milieu socio-économique * Impact des ressources et de l'incertitude * Stratégies d'adaptation et de planification

      Résumé de la vidéo [03:27:08][^1^][1] - [03:51:46][^2^][2]:

      Cette vidéo traite des inégalités sociales dans le système éducatif, en se concentrant sur les choix des élèves et les différences entre boursiers et non-boursiers.

      Points forts : + [03:27:08][^3^][3] Choix des élèves * Classement des choix par les élèves * Importance des préférences subjectives * Mesure des choix d'attente + [03:29:02][^4^][4] Résultats des comportements d'attente * Attendre améliore le rang des vœux * 38% des élèves obtiennent une meilleure formation * Attendre est bénéfique + [03:31:12][^5^][5] Différences entre boursiers et non-boursiers * Les boursiers attendent moins * Différences de coûts à attendre * Importance de prendre en compte ces différences + [03:35:01][^6^][6] Questions et discussions * Autocensure et raisons économiques * Effets des pratiques d'orientation * Importance de l'accompagnement des élèves

      Résumé de la vidéo [03:51:49][^1^][1] - [04:24:43][^2^][2]:

      Cette vidéo traite des inégalités sociales dans l'éducation, en se concentrant sur les étudiants immigrants et les différences de genre dans les choix de filières scolaires et les taux d'échec.

      Moments forts: + [03:51:49][^3^][3] Différences de genre et d'origine * Les garçons immigrants ont moins de chances de choisir des filières prestigieuses * Les filles immigrants performantes choisissent des filières similaires aux natives * Les garçons de milieux défavorisés souffrent le plus + [03:55:00][^4^][4] Programme en Italie * Collaboration avec le ministère de l'Éducation * Ciblage des étudiants performants de milieux défavorisés * Alignement des choix de filières avec le potentiel académique + [04:00:00][^5^][5] Résultats du programme * Augmentation des choix de filières prestigieuses pour les garçons * Réduction des taux d'échec scolaire pour les garçons * Pas d'effet significatif pour les filles + [04:05:00][^6^][6] Impact des recommandations des enseignants * Les enseignants recommandent moins souvent les filières prestigieuses aux étudiants immigrants * Le programme a réduit cet écart * Importance de la sensibilisation des enseignants aux biais implicites + [04:10:00][^7^][7] Expansion et sensibilisation * Plan d'expansion du programme à d'autres pays * Importance de la prise en compte des dynamiques raciales * Besoin de plus de diversité parmi les enseignants

      Résumé de la vidéo [04:24:46][^1^][1] - [04:46:28][^2^][2]:

      Cette partie de la conférence aborde les inégalités sociales dans le système éducatif français, en mettant l'accent sur l'inefficacité des classes de niveau et du redoublement, ainsi que sur l'importance des compétences psychosociales et de la mixité sociale.

      Moments forts: + [04:24:46][^3^][3] Classes de niveau * Inefficaces selon la littérature scientifique * Renforcent les inégalités * Alternative : groupes de besoins spécifiques + [04:28:02][^4^][4] Redoublement * Considéré comme inefficace et coûteux * Effets stigmatisants sur les élèves * Importance de développer les compétences psychosociales + [04:32:05][^5^][5] Diplôme scolaire * Impact sur la position sociale * Discrimination basée sur le niveau d'éducation * Importance de l'éducation pour la réussite sociale + [04:36:00][^6^][6] Orientation scolaire * Différences entre lycées favorisés et défavorisés * Influence des moyens financiers et du réseau * Importance de l'information et de l'accompagnement + [04:40:00][^7^][7] Mixité sociale * Défis liés à la mixité sociale et scolaire * Importance de l'égalité dans l'offre de formation * Rôle de l'enseignement privé et des politiques publiques

      Résumé de la vidéo [04:46:30][^1^][1] - [05:09:04][^2^][2]:

      Cette vidéo traite des inégalités sociales dans l'éducation, de l'école à l'enseignement supérieur. Les intervenants discutent des politiques publiques nécessaires pour réduire ces inégalités et des défis rencontrés.

      Points forts : + [04:46:30][^3^][3] Introduction et contexte * Importance de l'éducation prioritaire * Changement de paradigme dans les politiques éducatives * Impact des internats d'excellence + [04:49:00][^4^][4] Effets des internats d'excellence * Augmentation du taux de réussite au bac * Accès accru à l'enseignement supérieur * Impact significatif sur les élèves issus de l'immigration + [04:52:00][^5^][5] Inégalités sociales et éducation * Pression sociale et matérielle * Importance du diplôme pour le statut social * Mobilité sociale ascendante et descendante + [04:56:00][^6^][6] Rôle des enseignants * Solitude professionnelle des enseignants français * Impact sur les compétences sociales des élèves * Défis liés au recrutement et à la gestion des enseignants + [05:01:00][^7^][7] Politiques publiques et inégalités * Importance de l'éthique collective en éducation * Influence des milieux favorisés sur les réformes * Exemples de rythmes scolaires et de leur impact

      Résumé de la vidéo [05:09:05][^1^][1] - [05:22:24][^2^][2]:

      Cette vidéo aborde les inégalités sociales dans l'éducation, de l'école à l'enseignement supérieur, en mettant l'accent sur les diagnostics et les soins, les neurosciences, et les réformes éducatives.

      Moments forts: + [05:09:05][^3^][3] Diagnostics et soins * Les élèves de milieux favorisés sont plus souvent diagnostiqués * L'accès aux soins varie selon le milieu social * Importance de la santé des élèves + [05:10:01][^4^][4] Neurosciences et apprentissage * Tous les enfants ont le même potentiel d'apprentissage * Importance de l'apprentissage quotidien * Problèmes de santé publique affectant tous les enfants + [05:12:00][^5^][5] Estime de soi et coopération * Importance de l'estime de soi et de la coopération entre élèves * Déficit de coopération dans le système scolaire français * Formation des professeurs et tutorat en classe + [05:15:00][^6^][6] Réforme de la voie professionnelle * Gratification des PFMP pour attirer les élèves défavorisés * Importance de l'argent pour les milieux populaires * Investissement nécessaire pour entrer dans un lycée professionnel + [05:17:00][^7^][7] Modèle pédagogique et didactique * Confusion entre égalité et uniformité * Modèle pédagogique fondé sur le cours magistral * Importance de développer les compétences comportementales et sociales

    1. Reviewer #1 (Public review):

      Summary:

      The authors have created a new model of KCNC1-related DEE in which a pathogenic patient variant (A421V) is knocked into a mouse in order to better understand the mechanisms through which KCNC1 variants lead to DEE.

      Strengths:

      (1) The creation of a new DEE model of KCNC1 dysfunction.

      (2) InVivo phenotyping demonstrates key features of the model such as early lethality and several types of electrographic seizures.

      (3) The ex vivo cellular electrophysiology is very strong and comprehensive including isolated patches to accurately measure K+ currents, paired recording to measure evoked synaptic transmission, and the measurement of membrane excitability at different time points and in two cell types.

      Weaknesses:

      (1) The assertion that membrane trafficking is impaired by this variant could be bolstered by additional data.

      (2) In some experiments details such as the age of the mice or cortical layer are emphasized, but in others, these details are omitted.

      (3) The impairments in PV neuron AP firing are quite large. This could be expected to lead to changes in PV neuron activity outside of the hypersynchronous discharges that could be detected in the 2-photon imaging experiments, however, a lack of an effect on PV neuron activity is only loosely alluded to in the text. A more formal analysis is lacking. An important question in trying to understand mechanisms underlying channelopathies like KCNC1 is how changes in membrane excitability recorded at the whole cell level manifest during ongoing activity in vivo. Thus, the significance of this work would be greatly improved if it could address this question.

      (4) Myoclonic jerks and other types of more subtle epileptiform activity have been observed in control mice, but there is no mention of littermate control analyzed by EEG.

    2. Author response:

      Reviewer #1 (Public review):

      Weaknesses:

      (1) The assertion that membrane trafficking is impaired by this variant could be bolstered by additional data.

      We agree with this comment and will perform additional analysis and experiments to support the assertion that membrane trafficking is impaired. As noted by the Reviewers, standard biochemical approaches to obtain such data may be challenging due to the fact that Kv3.1 is expressed in only a subset of cells and that we do not have a Kv3.1-A421V specific antibody.

      (2) In some experiments details such as the age of the mice or cortical layer are emphasized, but in others, these details are omitted.

      We appreciate that the Reviewer has noted this omission. We will include such details in the resubmission.

      (3) The impairments in PV neuron AP firing are quite large. This could be expected to lead to changes in PV neuron activity outside of the hypersynchronous discharges that could be detected in the 2-photon imaging experiments, however, a lack of an effect on PV neuron activity is only loosely alluded to in the text. A more formal analysis is lacking. An important question in trying to understand mechanisms underlying channelopathies like KCNC1 is how changes in membrane excitability recorded at the whole cell level manifest during ongoing activity in vivo. Thus, the significance of this work would be greatly improved if it could address this question.

      Yes, the impairments in neocortical PV-IN excitability are more marked than any other PV interneuronopathy that we have studied. We will include a more extensive analysis of the 2-photon imaging data in the resubmission. However, there are limitations to the inferences that can be made as to firing patterns based on 2-photon calcium imaging data, particularly for interneurons.

      (4) Myoclonic jerks and other types of more subtle epileptiform activity have been observed in control mice, but there is no mention of littermate control analyzed by EEG.

      We did not observe myoclonic jerks in control mice. This data will be included in the resubmission.

      Reviewer #2 (Public review):

      Weaknesses:

      In some experiments, the age of the animal in each experiment is not clearly stated. For example, the experiments in Figure 2 demonstrate impaired K+ conductance and membrane localization, but it is not clear whether they correlated with the excitability and synaptic defects shown in subsequent figures. Similarly, it is unclear how old mice the authors conducted EEG recordings, and whether non-epileptic mice are younger than those with seizures.

      We will include explicit information as to the age of the animals used for each experiment in the resubmission.

      The trafficking defect of mutant Kv3.1 proposed in this study is based only on the fluorescence density analysis which showed a minor change in membrane/cytosol ratio. It is not very clear how the membrane component was determined (any control staining?). In addition to fluorescence imaging, an addition of biochemical analysis will make the conclusion more convincing (while it might be challenging if the Kv3.1 is expressed only in PV+ cells).

      We will include additional information in the Methods section as to how the membrane component was determined in a revised version of the manuscript. We agree with Reviewer #2 regarding the limitations in the ability to further evaluate this.

      While the study focused on the superficial layer because Kv3.1 is the major channel subunit, the PV+ cells in the deeper cortical layer also express Kv3.1 (Chow et al., 1999) and they may also contribute to the hyperexcitable phenotype via negative effect on Kv3.2; the mutant Kv3.1 may also block membrane trafficking of Kv3.1/Kv3.2 heteromers in the deeper layer PV cells and reduce their excitability. Such an additional effect on Kv3.2, if present, may explain why the heterozygous A421V KI mouse shows a more severe phenotype than the Kv3.1 KO mouse (and why they are more similar to Kv3.2 KO). Analyzing the membrane excitability differences in the deep-layer PV cells may address this possibility.

      We will include recordings from PV-INs in deeper layers of the neocortex in the revised version of the manuscript, as requested.

      In Table 1, the A421V PV+ cells show a depolarized resting membrane potential than WT by ~5 mV which seems a robust change and would influence the circuit excitability. The authors measured firing frequency after adjusting the membrane voltage to -65mV, but are the excitability differences less significant if the resting potential is not adjusted? It is also interesting that such a membrane potential difference is not detected in young adult mice (Table 2). This loss of potential compensation may be important for developmental changes in the circuit excitability. These issues can be more explicitly discussed.

      We will include a more thorough discussion of this finding in the revised version of the manuscript. However, we do not completely understand this finding. It could be compensatory, as suggested by the Reviewer; however, it is transient and seems to be an isolated finding (i.e., there does not appear to be parallel “compensation” in other properties). Alternatively, it could be that impaired excitability of the Kcnc1-A421V/+ PV-INs may reflect impaired/delayed development, which itself is known to be activity-dependent.

      Reviewer #3 (Public review):

      Weaknesses:

      The manuscript identifies a partial mechanism of disease that leaves several aspects unresolved including the possible role of the observed impairments in thalamic neurons in the seizure mechanism. Similarly, while the authors identify a reduction in potassium currents and a reduction in PV cell surface expression of Kv3.1 it is not clear why these impairments would lead to a more severe disease phenotype than other loss-of-function mutations which have been characterized previously. Lastly, additional analysis of video-EEG data would be helpful for interpreting the extent of the seizure burden and the nature of the seizure types caused by the mutation.

      We agree with this comment. We studied neurons in the reticular thalamus as these cells are known to express Kv3.1 and are linked to epilepty pathogenesis. Yet, we focused on neocortical PV-INs over other Kv3.1-expressing neurons such as neurons of the reticular thalamus because we evaluated the impairments of intrinsic excitability to be more profound in neocortical PV-INs. Cross of Kcnc1-Flox(A421V)/+ mice to a cerebral cortex interneuron-specific driver that would avoid recombination in thalamus – such as Ppp1r2-Cre (RRID:IMSR_JAX:012686) – could assist in determining the relative contribution of thalamic reticular nucleus dysfunction to the overall phenotype, as performed by Makinson et al (2017) to address a similar question. There are of course other Kv3.1-expressing neurons in the brain, including in GABAergic interneurons in hippocampus and amygdala. We will include additional discussion in a revised version of the manuscript as to why we think there is more severe impairment in our Kcnc1-Flox(A421V)/+ mice relative to Kv3.1 and Kv3.2 knockout mice. We will include additional data on the epilepsy phenotype in the revised version of the manuscript, as requested.

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript the authors follow up on their published observation that providing a lower glucose parental nutrition (PN) reduces sepsis from a common pathogen [Staphylococcus epidermitis (SE)] in preterm piglets. Here they found that a slightly higher dose of glucose could thread the needle and get the protective effects of low glucose without incurring significant hypoglycemia. They then investigate whether change in low glucose PN impacts metabolism to confer this benefit. The finding that lower glucose reduces sepsis is important as sepsis is a major cause of morbidity and mortality in preterm infants, and adjusting PN composition is a feasible intervention.

      Strengths:

      (1) They address a highly significant problem of neonatal sepsis in preterm infants using a preterm piglet model.<br /> (2) They have compelling data in this paper (and in a previous publication, ref 27) that low glucose PN confers a survival advantage. A downside of the low glucose PN is hypoglycemia which they mitigate in this paper by using a slightly high amount of glucose in the PN.<br /> (3) The experiment where they change PN from high to low glucose after infection is very important to determine if this approach might be used clinically. Unfortunately, this did not show an ability to reduce sepsis risk with this approach.<br /> (4) They produce an impressive multiomics data set from this model of preterm piglet sepsis which is likely to provide additional insights into the pathogenesis of preterm neonatal sepsis.

      Weaknesses:

      (1) Piglets on the low glucose PN had consistently lower density of SE (~1 log) across all timepoints. This may be due to changes in immune response leading to better clearance or it could be due to slower growth in lower glucose environment. These possibilities are not fully disentangled in this study.

      (2) Many differences in the different omics (transcriptomics, metabolomics, proteomics) were identified in the SE-LOW vs SE-HIGH comparison. Since the bacterial load is very different between these conditions, could the changes be due to bacterial load rather than metabolic reprograming from the low glucose PN? The authors argue in supplementary figure 1F that density of SE in blood does not correlate with sepsis implying that bacterial load is not the driver of outcome. The authors recently published some additional analysis that may be helpful to reference in this manuscript.

      (3) Further, expanding upon a model to better understand the complex relationship between differences in supplemental glucose infusion, blood glucose levels, bacterial load, host responses and how they impact the development of sepsis would be helpful. These complex relationships are difficult to fully disentangle, but one could consider infusing the same quantity of heat-killed bacteria under different glucose conditions to see if the glucose levels drive outcomes independently of bacterial burden.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      In this manuscript, the authors follow up on their published observation that providing a lower glucose parental nutrition (PN) reduces sepsis from a common pathogen [Staphylococcus epidermitis (SE)] in preterm piglets. Here they found that a higher dose of glucose could thread the needle and get the protective effects of low glucose without incurring significant hypoglycemia. They then investigate whether the change in low glucose PN impacts metabolism to confer this benefit. The finding that lower glucose reduces sepsis is important as sepsis is a major cause of morbidity and mortality in preterm infants, and adjusting PN composition is a feasible intervention.

      Strengths:

      (1) They address a highly significant problem of neonatal sepsis in preterm infants using a preterm piglet model.

      (2) They have compelling data in this paper (and in a previous publication, ref 27) that low glucose PN confers a survival advantage. A downside of the low glucose PN is hypoglycemia which they mitigate in this paper by using a slightly high amount of glucose in the PN.

      (3) The experiment where they change PN from high to low glucose after infection is very important to determine if this approach might be used clinically. Unfortunately, this did not show an ability to reduce sepsis risk with this approach. Perhaps this is due to the much lower mortality in the high glucose group (~20% vs 87% in the first figure).

      (4) They produce an impressive multiomics data set from this model of preterm piglet sepsis which is likely to provide additional insights into the pathogenesis of preterm neonatal sepsis.

      Weaknesses:

      (1) The high glucose control gives very high blood glucose levels (Figure 1C). Is this the best control for typical PN and glucose control in preterm neonates? Is the finding that low glucose is protective or high glucose is a risk factor for sepsis?

      This work is a follow-up from our previous work where we explored different PN glucose regimens. Taken together our experiments heavily imply that glucose provision is associated to severity in a seemingly linear manner. In the clinical setting, there is no fixed glucose provision, but guidelines specify ranges that are acceptable. However, these guidelines do not take possible infections into account and are designed to optimize growth outcomes. Increased provision of glucose to preterm neonates may therefore increase their infection risk, but parenteral glucose cannot be entirely avoided as it would lead to hypoglycaemia and associated brain damage. In the present paper the reduced glucose PN reflects the lowest end of the recommended PN glucose intake. More work is needed to figure out the best glucose provision to infected preterm newborns, balancing positive and negative factors.

      (2) In Figure 1B, preterm piglets provided the high glucose PN have 13% survival while preterm piglets on the same nutrition in Figure 6B have ~80% survival. Were the conditions indeed the same? If so, this indicates a large amount of variation in the outcome of this model from experiment to experiment.

      In the follow-up experiment outlined in Figure 6 we reduced the follow-up time to 12 hours in an effort to minimize the suffering of the animals. We did this because we could detect relevant differences in the immune response between High and low glucose infected pigs as 12 hours. If we had extended the follow-up experiment to 22 hours we would likely have seen a much increased mortality.

      (3) Piglets on the low glucose PN had consistently lower density of SE (~1 log) across all time points. This may be due to changes in immune response leading to better clearance or it could be due to slower growth in a lower glucose environment.

      We agree with this assessment and have adjusted our result section to reflect this.

      (4) Many differences in the different omics (transcriptomics, metabolomics, proteomics) were identified in the SE-LOW vs SE-HIGH comparison. Since the bacterial load is very different between these conditions, could the changes be due to bacterial load rather than metabolic reprogramming from the low glucose PN?

      We analyzed the relationship between bacterial burdens and mortality and found that it did not correlate within each of the treatment groups. We have now added this data to the results section as supplemental and report this fact in the section called “Reduced glucose supply increases hepatic OXPHOS and gluconeogenesis and attenuates inflammatory pathways”. This finding inspired us to further explore the relationship between bacterial burdens and infection responses in our model which has resulted in our recent preprint: Wu et at. Regulation of host metabolism and defense strategies to survive neonatal infection. BioRxiv 2024.02.23.581534; doi: https://doi.org/10.1101/2024.02.23.581534

      Reviewer #2 (Public Review):

      Summary:

      The authors demonstrate that a low parenteral glucose regimen can lead to improved bacterial clearance and survival from Staph epi sepsis in newborn pigs without inducing hypoglycemia, as compared to a high glucose regimen. Using RNA-seq, metabolomic, and proteomic data, the authors conclude that this is primarily mediated by altered hepatic metabolism.

      Strengths:

      Well-defined controls for every time point, with multiple time points and biological replicates. The authors used different experimental strategies to arrive at the same conclusion, which lends credibility to their findings. The authors have published the negative findings associated with their study, including the inability to reverse sepsis-related mortality after switching from SE-high to SE-low at 3h or 6h and after administration of hIAIP.

      Weaknesses:

      (1) The authors mention, and it is well-known, that Staph epi is primarily involved in late-onset sepsis. The model of S. epi sepsis used in this study clearly replicates early-onset sepsis, but S. epi is extremely rare in this time period. How do the authors justify the clinical relevance of this model?

      The distinction between early and late onset sepsis makes sense clinically because they are likely to be caused by different organisms and therefore require different empirical antibiotic regimes. Early onset sepsis is caused by organisms transferred perinatally often following chorioamnionitis or uro-gential maternal infections (Strep. agalacticae/E. coli) whereas Late onset sepsis is likely caused by organisms from indwelling catheters or mucosal surfaces, most often coagulase negative staphylococci. Timing of an infection after birth of course plays a role, but the virulence factors of the pathogen probably plays a large role in shaping the immune response. Therefore, even though the infection in our model is initiated on the first day after birth, the organism that we use, Staph epidermidids, makes it a better model for pathogenesis of late onset sepsis. However, it is also important to acknowledge that the pathophysiology of “sepsis” may be similar despite timing and pathogen and depends on the degree of immune activation and downstream effects on organs.

      (2) The authors find that the neutrophil subset of the leukocyte population is diminished significantly in the SE-low and SE-high populations. However, they conclude on page 10 that "modulations of hepatic, but not circulating immune cell metabolism, by reduced glucose supply..." and this is possible because the authors have looked at the entire leukocyte transcriptome. I am curious about why the authors did not sequence the neutrophil-specific transcriptome.

      We collected the whole blood transcript during the experiments, which reflect the transcription profile of all the circulating leucocytes. Since we did not do single cell RNA sequencing during the experiment there is no possibility of isolating the neutrophil transcriptome at this time. Your point however is valid and we will reconsider incorporating single cell transcriptomics in future experiments.

      (3) The authors use high (30g/k/d) and low (7.2g/k/d) glucose regimens. These translate into a GIR of 21 and 5 mg/k/min respectively. A normal GIR for a preterm infant is usually 5-8, and sometimes up to 10. Do the authors have a "safe GIR" or a threshold they think we cannot cross? Maybe a point where the metabolism switch takes place? They do not comment on this, especially as GIR and glucose levels are continuous variables and not categorical.

      Our reduced glucose PN was chosen as it corresponded with the low end of recommended guidelines for PN glucose intake. There likely is not a “safe GIR” as the clinical responses to glucose intake during infections do not seem binary but increase with glucose intake. It is also important to remember that the reduced glucose intervention still resulted in significant morbidity and a 25% mortality within 22 hours. There is therefore still vast room for improvement, but even though further reduction in PN glucose would probably provide further protection it would entail dangerous hypoglycaemia (as described in our previous paper). The findings in this current paper has prompted us to explore several strategies to replace parenteral glucose with alternative macronutrients. Thus, the optimal PN for infected newborns would probably differ from standard PN in all macronutrients and will require much more pre- and clinical research.

      (4) In Figures 2B and C the authors show that SE-high and SE-low animals have differences in the oxphos, TCA, and glycolytic pathways. The authors themselves comment in the Supplementary Table S1B, E-F that these same metabolic pathways are also different in the Con-Low and Con-high animals, it is just the inflammatory pathways that are not different in the non-infected animals. How can they then justify that it is these metabolic pathways specifically which lead to altered inflammatory pathways, and not just the presence of infection along with some other unfound mechanism?

      It is to be expected that the inflammatory pathways do not differ between the Con-Low and Con-High groups as there is no infection to induce these pathways. The identified metabolic pathways that differ between SE-High and SE-Low animals seem to us the best explanation of the differences in clinical phenotype.

      (5) The authors mention in Figure 1F that SE-low animals had lower bacterial burdens than SE-high animals, but then go on to infer that the inflammatory cytokine differences are attributed to a rewiring of the immune response. However, they have not normalized the cytokine levels to the bacterial loads, as the differences in the cytokines might be attributed purely to a difference in bacterial proliferation/clearing.

      Please see our response to reviewer #1

      (6) The authors mention that switching from SE-high to SE-low at 3 or 6 h time points does not reduce mortality. Have the authors considered the reverse? Does hyperglycemia after euglycemia initially, worsen mortality? That would really conclude that there is some metabolic reprogramming happening at the very onset of sepsis and it is a lost battle after that.

      A very good point that we have not explored yet, we have added this consideration to the discussion and slightly amended our conclusions of this follow-up experiment.

      Reviewer #3 (Public Review):

      Summary:

      Baek and colleagues present important follow-up work on the role of serum glucose in the management of neonatal sepsis. The authors previously showed high glucose administration exacerbated neonatal sepsis, while strict glucose control improved outcomes but caused hypoglycemia. In the current report they examined the effect of a more tailored glucose management approach on outcomes and examined hepatic gene expression, plasma metabolome/proteome, blood transcriptome, as well as the the therapeutic impact of hIAIP. The authors leverage multiple powerful approaches to provide robust descriptive accounts of the physiologic changes that occur with this model of sepsis in these various conditions. Strengths:

      (1) Use of preterm piglet model.

      (2) Robust, multi-pronged approach to address both hepatic and systemic implications of sepsis and glucose management.

      (3) Trial of therapeutic intervention - glucose management (Figure 6), hIAIP (Figure 7).

      Weaknesses:

      (1) The translational role of the model is in question. CONS is rarely if ever a cause of EOS in preterm neonates. The model. uses preterm pigs exposed at 2 hours of age. This model most likely replicates EOS.

      Please see our response to Reviewer #2

      (2) Throughout the manuscript it is difficult to tell from which animals the data are derived. Given the ~90% mortality in the experimental CONS group, and 25% mortality in the intervention group, how are the data from animals "at euthanasia" considered? Meaning - are data from survivors and those euthanized grouped together? This should be clarified as biologically these may be very different populations (ie, natural survivor vs death).

      This is a very valid point. For all endpoints that are analyzed “at euthanasia” the age of the animal will vary. Some will have been euthanized early due to clinical deterioration and some will have survived all the way to the end of the experiment. This needs to be kept in mind when interpreting the results. We have further highlighted this point in the discussion and made it clear to the reader at what time-point each analysis was performed.

      (3) With limited time points (at euthanasia ) for hepatic transcriptomics (Figure 2), plasma metabolite (Figure 3) blood transcriptome (Figure 4), and plasma proteome (Figure 5) it is difficult to make conclusions regarding mechanisms preceding euthanasia. Per methods, animals were euthanized with acidosis or clinical decompensation. Are the reported findings demonstrative of end-organ failure and deterioration leading to death, or reflective of events prior?

      Yes, all organ specific endpoints are snapshots of the state of the animals at the time of euthanasia, pooling together animals that succumbed to sepsis and those that survived to 22 hours post infection. These results therefore reflect the end-state of the infection we cannot be sure when the differences between groups manifested themselves. However, given the stark differences in plasma lactate at 12 hours post infection it is likely that changes to metabolism occurred before most of animals succumbed to sepsis.

      We agree this is a weakness in our model, but we have since published a pre-print where we have further explored how metabolic adaptations shape the fate of similarly infected preterm pigs: BioRxiv 2024.02.23.581534; doi: https://doi.org/10.1101/2024.02.23.581534

      (4) Data are descriptive without corresponding "omics" from interventions (glucose management and/or hIAIP) or at least targeted assessment of key differences.

      We only did in-depth analysis of the glucose intervention as this showed the most promising clinical effects that warranted further in-depth investigation. It is possible that further insights could be gained from in-depth analysis of the other interventions but given that there were no obvious clinical befits we refrained from that.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      I am intrigued that mortality was not correlated to bacterial burden. Please provide the "data not shown" as this would help the reader understand better whether the difference in bacterial burden is driving the phenotypes and findings of the low glucose group.

      We have added this data to supplementary figure 1.  

      Reviewer #2 (Recommendations For The Authors):

      (1) I would urge the authors to consider a neutrophil-specific transcriptomic analysis. I understand that this would add significantly to the resubmission process. If the authors wish to include that as a future direction instead, they need to specifically mention the limitations of whole blood transcriptomics and how different immune cell types react differently to bacterial antigens.

      We agree with your considerations but we cannot include that data using the whole blood method applied in the experiment. We have added your consideration to the discussions.

      (2) I urge the authors to remove any impression that this is a model of late-onset sepsis, which is implied from the introduction, lines 3 and 4.

      Our intention was not to directly suggest that our model is a perfect reflection of late-onset sepsis but rather to highlight the relevance of using a pathogen commonly associated with LOS. We believe our model primarily captures the effects of intense pro-inflammatory immune activation, which may have parallels with various forms of sepsis, including LOS.

      Reviewer #3 (Recommendations For The Authors):

      Drawing on the robust nature of your "omics", identify key measures and test whether they are altered earlier in the development of clinical sepsis. Test whether these are altered by the intervention.

      A very valid point, at the moment it is not possible for us to explore this within the confines of these experiments. But, building upon these findings and the ones in our recent preprint we are confident that shifts in hepatic ratio of Oxidative phosphorylation and gluconeogenesis vs glycolysis shape the immune response to infections in neonates. In our upcoming experiments we are planning to incorporate plasma metabolomics at earlier timepoints to monitor when shifts in metabolism occur. However, given the heterogeneity of pigs, as opposed to inbred rodent models, sacrificing animals at fixed timepoints to gauge their organ function will be hard to interpret as it is impossible to know what the end state of the particular animal would have been. Therefore longitudinal sampling of liver tissue, during the course of infection would be challenging.

    1. Reviewer #2 (Public review):

      This study uses all-atom MD simulation to explore the mechanics of channel opening for the NOMPC mechanosensitive channel. Previously the authors used MD to show that external forces directed along the long axis of the protein (normal to the membrane) result in AR domain compression and channel opening. This force causes two changes to the key TRP domains adjacent to the channel gate: 1) a compressive force pushes the TRP domain along the membrane normal, while 2) a twisting torque induces a clock-wise rotation on the TRP domain helix when viewing the bottom of the channel from the cytoplasm. Here, the authors wanted to understand which of those two changes is responsible for increasing the inner pore radius, and they show that it is the torque. The simulations in Figure 2 probe this question with different forces, and we can see the pore open with parallel forces in the membrane, but not with the membrane-normal forces. I believe this result as it is reproducible, the timescales are reaching 1 microsecond, and the gate is clearly increasing diameter to about 4 Å. This seems to be the most important finding in the paper, but the impact is limited since the authors already show how forces lead to channel opening, and this is further teasing apart the forces and motions that are actually the ones that cause the opening.

    1. Reviewer #1 (Public review):

      Summary:

      In "Drift in Individual Behavioral Phenotype as a Strategy for Unpredictable Worlds," Maloney et al. (2024) investigate changes in individual responses over time, referred to as behavioral drift within the lifespan of an animal. Drift, as defined in the paper, complements stable behavioral variation (animal individuality/personality within a lifetime) over shorter timeframes, which the authors associate with an underlying bet-hedging strategy. The third timeframe of behavioral variability that the authors discuss occurs within seasons (across several generations of some insects), termed "adaptive tracking." This division of "adaptive" behavioral variability over different timeframes is intuitively logical and adds valuable depth to the theoretical framework concerning the ecological role of individual behavioral differences in animals.

      Strengths:

      While the theoretical foundations of the study are strong, the connection between the experimental data (Figure 1) and the modeling work (Figure 2-4) is less convincing.

      Weaknesses:

      In the experimental data (Figure 1), the authors describe the changes in behavioral preferences over time. While generally plausible, I identify three significant issues with the experiments:

      (1) All of the subsequent theoretical/simulation data is based on changing environments, yet all the experiments are conducted in unchanging environments. While this may suffice to demonstrate the phenomenon of behavioral instability (drift) over time, it does not properly link to the theory-driven work in changing environments. An experiment conducted in a changing environment and its effects on behavioral drift would improve the manuscript's internal consistency and clarify some points related to (3) below.

      (2) The temporal aspect of behavioral instability. While the analysis demonstrates behavioral instability, the temporal dynamics remain unclear. It would be helpful for the authors to clarify (based on graphs and text) whether the behavioral changes occur randomly over time or follow a pattern (e.g., initially more right turns, then more left turns). A proper temporal analysis and clearer explanations are currently missing from the manuscript.

      (3) The temporal dimension leads directly into the third issue: distinguishing between drift and learning (e.g., line 56). In the neutral stimuli used in the experimental data, changes should either occur randomly (drift) or purposefully, as in a neutral environment, previous strategies do not yield a favorable outcome. For instance, the animal might initially employ strategy A, but if no improvement in the food situation occurs, it later adopts strategy B (learning). In changing environments, this distinction between drift and learning should be even more pronounced (e.g., if bananas are available, I prefer bananas; once they are gone, I either change my preference or face negative consequences). Alternatively, is my random choice of grapes the substrate for the learning process towards grapes in a changing environment? Further clarification is needed to resolve these potential conflicts.

    2. Author response:

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      In "Drift in Individual Behavioral Phenotype as a Strategy for Unpredictable Worlds," Maloy et al. (2024) investigate changes in individual responses over time, referred to as behavioral drift within the lifespan of an animal. Drift, as defined in the paper, complements stable behavioral variation (animal individuality/personality within a lifetime) over shorter timeframes, which the authors associate with an underlying bet-hedging strategy. The third timeframe of behavioral variability that the authors discuss occurs within seasons (across several generations of some insects), termed "adaptive tracking." This division of "adaptive" behavioral variability over different timeframes is intuitively logical and adds valuable depth to the theoretical framework concerning the ecological role of individual behavioral differences in animals.

      Strengths:

      While the theoretical foundations of the study are strong, the connection between the experimental data (Figure 1) and the modeling work (Figure 2-4) is less convincing.

      Weaknesses:

      In the experimental data (Figure 1), the authors describe the changes in behavioral preferences over time. While generally plausible, I identify three significant issues with the experiments:

      (1) All of the subsequent theoretical/simulation data is based on changing environments, yet all the experiments are conducted in unchanging environments. While this may suffice to demonstrate the phenomenon of behavioral instability (drift) over time, it does not properly link to the theory-driven work in changing environments. An experiment conducted in a changing environment and its effects on behavioral drift would improve the manuscript's internal consistency and clarify some points related to (3) below.

      In our framework, we posit that the amount of drift has been shaped by evolution to maximize fitness in the environments that the population has experienced, and this drift is observed independent of environment. While we agree that exploring the role of changing environments on the measure of drift would be interesting, we would anticipate the effects may be nuanced and beyond the scope of the current paper (and the scope of our theoretical work, which assumes that the individual phenotype is unaffected by change of environment except as mediated by death due to fitness effects). For example, it would be difficult to differentiate drift from idiosyncratic differences in learning (Smith et al., 2022), and non-adaptive plasticity to unrelated cues has been posited as a method of producing diverse phenotypes (Maxwell and Magwene, 2017), so “learning” to uncorrelated stimuli could conceivably be a mechanism for drift. Given the scope of the current study, we prioritized eliminating potential confounds for measuring drift, but remain interested in the interaction between learning and drift.

      (2) The temporal aspect of behavioral instability. While the analysis demonstrates behavioral instability, the temporal dynamics remain unclear. It would be helpful for the authors to clarify (based on graphs and text) whether the behavioral changes occur randomly over time or follow a pattern (e.g., initially more right turns, then more left turns). A proper temporal analysis and clearer explanations are currently missing from the manuscript.

      We agree it would be helpful to have more description of the dynamics over time aside from the power spectrum and autoregressive model fits. We hope to address this in more detail to provide more description of the changes over time in a revision.

      (3) The temporal dimension leads directly into the third issue: distinguishing between drift and learning (e.g., line 56). In the neutral stimuli used in the experimental data, changes should either occur randomly (drift) or purposefully, as in a neutral environment, previous strategies do not yield a favorable outcome. For instance, the animal might initially employ strategy A, but if no improvement in the food situation occurs, it later adopts strategy B (learning). In changing environments, this distinction between drift and learning should be even more pronounced (e.g., if bananas are available, I prefer bananas; once they are gone, I either change my preference or face negative consequences). Alternatively, is my random choice of grapes the substrate for the learning process towards grapes in a changing environment? Further clarification is needed to resolve these potential conflicts.

      As in our response to point 1, we believe this is a crucial distinction, and we intend to further highlight it in the discussion in the revision and further expand our discussion of how the two strategies may interact.

      Reviewer #2 (Public review):

      Summary:

      This is an inspired study that merges the concept of individuality with evolutionary processes to uncover a new strategy that diversifies individual behavior that is also potentially evolutionarily adaptive.

      The authors use a time-resolved measurement of spontaneous, innate behavior, namely handedness or turn bias in individual, isogenic flies, across several genetic backgrounds.

      They find that an individual's behavior changes over time, or drifts. This has been observed before, but what is interesting here is that by looking at multiple genotypes, the authors find the amount of drift is consistent within genotype i.e., genetically regulated, and thus not entirely stochastic. This is not in line with what is known about innate, spontaneous behaviors. Normally, fluctuations in behavior would be ascribed to a response to environmental noise. However, here, the authors go on to find what is the pattern or rule that determines the rate of change of the behavior over time within individuals. Using modeling of behavior and environment in the context of evolutionarily important timeframes such as lifespan or reproductive age, they could show when drift is favored over bet-hedging and that there is an evolutionary purpose to behavioral drift. Namely, drift diversifies behaviors across individuals of the same genotype within the timescale of lifespan, so that the genotype's chance for expressing beneficial behavior is optimally matched with potential variation of environment experienced prior to reproduction. This ultimately increases the fitness of the genotype. Because they find that behavioral drift is genetically variable, they argue it can also evolve.

      Strengths:

      Unlike most studies of individuality, in this study, the authors consider the impact of individuality on evolution. This is enabled by the use of multiple natural genetic backgrounds and an appropriately large number of individuals to come to the conclusions presented in the study. I thought it was really creative to study how individual behavior evolves over multiple timescales. And indeed this approach yielded interesting and important insight into individuality. Unlike most studies so far, this one highlights that behavioral individuality is not a static property of an individual, but it dynamically changes. Also, placing these findings in the evolutionary context was beneficial. The conclusion that individual drift and bet-hedging are differently favored over different timescales is, I think, a significant and exciting finding.

      Overall, I think this study highlights how little we know about the fundamental, general concepts behind individuality and why behavioral individuality is an important trait. They also show that with simple but elegant behavioral experiments and appropriate modeling, we could uncover fundamental rules underlying the emergence of individual behavior. These rules may not at all be apparent using classical approaches to studying individuality, using individual variation within a single genotype or within a single timeframe.

      Weaknesses:

      I am unconvinced by the claim that serotonin neuron circuits regulate behavioral drift, especially because of its bidirectional effect and lack of relative results for other neuromodulators. Without testing other neuromodulators, it will remain unclear if serotonin intervention increases behavioral noise within individuals, or if any other pharmacological or genetic intervention would do the same. Another issue is that the amount of drugs that the individuals ingested was not tracked. Variable amounts can result in variable changes in behavior that are more consistent with the interpretation of environmental plasticity, rather than behavioral drift. With the current evidence presented, individual behavior may change upon serotonin perturbation, but this does not necessarily mean that it changes or regulates drift.

      However, I think for the scope of this study, finding out whether serotonin regulates drift or not is less important. I understand that today there is a strong push to find molecular and circuit mechanisms of any behavior, and other peers may have asked for such experiments, perhaps even simply out of habit. Fortunately, the main conclusions derived from behavioral data across multiple genetic backgrounds and the modeling are anyway novel, interesting, and in fact more fundamental than showing if it is serotonin that does it or not.

      We agree that our data do not support a strong conclusion that serotonin plays a privileged role in regulating drift. Based on previous literature (e.g. Kain et al., 2014, where identical pharmacological manipulations had an effect on variability while dopaminergic and octopaminergic manipulations did not), we think it likely that large global perturbations in serotonin that we observe are likely to influence plasticity that might be involved in drift (and thus find the results we observe not particularly surprising). Nonetheless, we agree that the mechanism by which serotonin may affect drift could be indirect, and it is similarly plausible that many global perturbations could lead to some shift in the amount of drift. We intend to further discuss these issues in the revision.

      To this point, one thing that was unclear from the methods section is whether genotypes that were tested were raised in replicate vials and how was replication accounted for in the analyses. This is a crucial point - the conclusion that genotypes have different amounts of behavioral drift cannot be drawn without showing that the difference in behavioral drift does not stem from differences in developmental environment.

      While a cursory inspection suggests that batch effects between different replicates was small, we intend to clarify this and more explicitly address the effects of replicates in revision.

      Reviewer #3 (Public review):

      Summary:

      The paper begins by analyzing the drift in individual behavior over time. Specifically, it quantifies the circling direction of freely walking flies in an arena. The main takeaway from this dataset is that while flies exhibit an individual turning bias (when averaged over time), their preferences fluctuate over slow timescales.

      To understand whether genetic or neuromodulatory mechanisms influence the drift in individual preference, the authors test different fly strains concluding that both genetic background and the neuromodulator serotonin contribute to the degree of drift.

      Finally, the authors use theoretical approaches to identify the range of environmental conditions under which drift in individual bias supports population growth.

      Strengths:

      The model provides a clear prediction of the environmental fluctuations under which a drift in bias should be beneficial for population growth.

      The approach attempts to identify genetic and neurophysiological mechanisms underlying drift in bias.

      Weaknesses:

      Different behavioral assays are used and are differently analysed, with little discussion on how these behaviors and analyses compare to each other.

      We intend to address this in a revision of the discussion.

      Some of the model assumptions should be made more explicit to better understand which aspects of the behaviors are covered.

      We will further clarify the assumptions of the model in revision.

    1. Reviewer #1 (Public review):

      Zhu and colleagues used high-density Neuropixel probes to perform laminar recordings in V1 while presenting either small stimuli that stimulated the classical receptive field (CRF) or large stimuli whose border straddled the RF to provide nonclassical RF (nCRF) stimulation. Their main question was to understand the relative contribution of feedforward (FF), feedback (FB), and horizontal circuits to border ownership (Bown), which they addressed by measuring cross-correlation across layers. They found differences in cross-correlation between feedback/horizontal (FH) and input layers during CRF and nCRF stimulation.

      Although the data looks high quality and analyses look mostly fine, I had a lot of difficulty understanding the logic in many places. Examples of my concerns are written below.

      (1) What is the main question? The authors refer to nCRF stimulation emerging from either feedback from higher areas or horizontal connections from within the same area (e.g. lines 136 to 138 and again lines 223-232). I initially thought that the study would aim to distinguish between the two. However, the way the authors have clubbed the layers in 3D, the main question seems to be whether Bown is FF or FH (i.e., feedback and horizontal are clubbed). Is this correct? If so, I don't see the logic, since I can't imagine Bown to be purely FF. Thus, just showing differences between CRF stimulation (which is mainly expected to be FF) and nCRF stimulation is not surprising to me.

      (2) Choice of layers for cross-correlation analysis: In the Introduction, and also in Figure 3C, it is mentioned that FF inputs arrive in 4C and 6, while FB/Horizontal inputs arrive at "superficial" and "deep", which I take as layer 2/3 and 5. So it is not clear to me why (i) layer 4A/B is chosen for analysis for Figure 3D (I would have thought layer 6 should have been chosen instead) and (ii) why Layers 5 and 6 are clubbed.

      (3) Addressing the main question using cross-correlation analysis: I think the nice peaks observed in Figure 3B for some pairs show how spiking in one neuron affects the spiking in another one, with the delay in cross-correlation function arising from the conduction delay. This is shown nicely during CRF stimulation in Figure 3D between 4C -> 2/3, for example. However, the delay (positive or negative) is constrained by anatomical connectivity. For example, unless there are projections from 2/3 back to 4C which causes firing in a 2/3 layer neuron to cause a spike in a layer 4 neuron, we cannot expect to get a negative delay no matter what kind of stimulation (CRF versus nCRF) is used.

    2. Reviewer #2 (Public review):

      Summary:

      The authors present a study of how modulatory activity from outside the classical receptive field (cRF) differs from cRF stimulation. They study neural activity across the different layers of V1 in two anesthetized monkeys using Neuropixels probes. The monkeys are presented with drifting gratings and border-ownership tuning stimuli. They find that border-ownership tuning is organized into columns within V1, which is unexpected and exciting, and that the flow of activity from cell-to-cell (as judged by cross-correlograms between single units) is influenced by the type of visual stimulus: border-ownership tuning stimuli vs. drifting-grating stimuli.

      Strengths:

      The questions addressed by the study are of high interest, and the use of Neuropixels probes yields extremely high numbers of single-units and cross-correlation histograms (CCHs) which makes the results robust. The study is well-described.

      Weaknesses:

      The weaknesses of the study are (a) the use of anesthetized animals, which raises questions about the nature of the modulatory signal being measured and the underlying logic of why a change in visual stimulus would produce a reversal in information flow through the cortical microcircuit and (b) the choice of visual stimuli, which do not uniquely isolate feedforward from feedback influences.

      (1) The modulation latency seems quite short in Figure 2C. Have the authors measured the latency of the effect in the manuscript and how it compares to the onset of the visually driven response? It would be surprising if the latency was much shorter than 70ms given previous measurements of BO and figure-ground modulation latency in V2 and V1. On the same note, it might be revealing to make laminar profiles of the modulation (i.e. preferred - non-preferred border orientation) as it develops over time. Does the modulation start in feedback recipient layers?

      (2) Can the authors show the average time course of the response elicited by preferred and non-preferred border ownership stimuli across all significant neurons?

      (3) The logic of assuming that cRF stimulation should produce the opposite signal flow to border-ownership tuning stimuli is worth discussing. I suspect the key difference between stimuli is that they used drifting gratings as the cRF stimulus, the movement of the stimulus continually refreshes the retinal image, leading to continuous feedforward dominance of the signals in V1. Had they used a static grating, the spiking during the sustained portion of the response might also show more influence of feedback/horizontal connections. Do the initial spikes fired in response to the border-ownership tuning stimuli show the feedforward pattern of responses? The authors state that they did not look at cross-correlations during the initial response, but if they do, do they see the feedforward-dominated pattern? The jitter CCH analysis might suffice in correcting for the response transient.

      (4) The term "nCRF stimulation" is not appropriate because the CRF is stimulated by the light/dark edge.

    1. Reviewer #1 (Public review):

      The paper by Fournier et al. investigates the sensitivity of neural circuits to changes in intrinsic and synaptic conductances. The authors use models of the stomatogastric ganglion (STG) to compare how perturbations to intrinsic and synaptic parameters impact network robustness. Their main finding is that changes to intrinsic conductances tend to have a larger impact on network function than changes to synaptic conductances, suggesting that intrinsic parameters are more critical for maintaining circuit function.

      The paper is well-written and the results are compelling, but I have several concerns that need to be addressed to strengthen the manuscript. Specifically, I have two main concerns:<br /> (1) It is not clear from the paper what the mechanism is that leads to the importance of intrinsic parameters over synaptic parameters.<br /> (2) It is not clear how general the result is, both within the framework of the STG network and its function, and across other functions and networks. This is crucial, as the title of the paper appears very general.

      I believe these two elements are missing in the current manuscript, and addressing them would significantly strengthen the conclusions. Without a clear understanding of the mechanism, it is difficult to determine whether the results are merely anecdotal or if they depend on specific details such as how the network is trained, the particular function being studied, or the circuit itself. Additionally, understanding how general the findings are is vital, especially since the authors claim in the title that "Circuit function is more robust to changes in synaptic than intrinsic conductances," which suggests a broad applicability.

      I do not wish to discourage the authors from their interesting result, but the more we understand the mechanism and the generality of the findings, the more insightful the result will be for the neuroscience community.

      Major comments

      (1) Mechanism<br /> While the authors did a nice job of describing their results, they did not provide any mechanism for why synaptic parameters are more resilient to changes than intrinsic parameters. For example, from Figure 5, it seems that there is mainly a shift in the sensitivity curves. What is the source of this shift? Can something be changed in the network, the training, or the function to control it? This is just one possible way to investigate the mechanism, which is lacking in the paper.

      (2) Generality of the results within the framework of the STG circuit<br /> (a) The authors did show that their results extend to multiple networks with different parameters (the 100 networks). However, I am still concerned about the generality of the results with respect to the way the models were trained. Could it be that something in the training procedure makes the synaptic parameters more robust than intrinsic parameters? For example, the fact that duty cycle error is weighted as it is in the cost function (large beta) could potentially affect the parameters that are more important for yielding low error on the duty cycle.<br /> (b) Related to (a), I can think of a training scheme that could potentially improve the resilience of the network to perturbations in the intrinsic parameters rather than the synaptic parameters. For example, in machine learning, methods like dropout can be used to make the network find solutions that are robust to changes in parameters. Thus, in principle, the results could change if the training procedure for fitting the models were different, or by using a different optimization algorithm. It would be helpful to at least mention this limitation in the discussion.

      (3) Generality of the function<br /> The authors test their hypothesis based on the specific function of the STG. It would be valuable to see if their results generalize to other functions as well. For example, the authors could generate non-oscillatory activity in the STG circuit, or choose a different, artificial function, maybe with different duty cycles or network cycles. It could be that this is beyond the scope of this paper, but it would be very interesting to characterize which functions are more resilient to changes in synapses, rather than intrinsic parameters. In other words, the authors might consider testing their hypothesis on at least another 'function' and also discussing the generality of their results to other functions in the discussion.

      (4) Generality of the circuit<br /> The authors have studied the STG for many years and are pioneers in their approach, demonstrating that there is redundancy even in this simple circuit. This approach is insightful, but it is important to show that similar conclusions also hold for more general network architectures, and if not, why. In other words, it is not clear if their claim generalizes to other network architectures, particularly larger networks. For example, one might expect that the number of parameters (synaptic vs intrinsic) might play a role in how resilient the function is with respect to changes in the two sets of parameters. In larger models, the number of synaptic parameters grows as the square of the number of neurons, while the number of intrinsic parameters increases only linearly with the number of neurons. Could that affect the authors' conclusions when we examine larger models?

      In addition, how do the authors' conclusions depend on the "complexity" of the non-linear equations governing the intrinsic parameters? Would the same conclusions hold if the intrinsic parameters only consisted of fewer intrinsic parameters or simplified ion channels? All of these are interesting questions that the authors should at least address in the discussion.

    1. Finally, I will say, "Fear not, because we have options". here are four counter  measures that we can all take today.

      Proposed countermeasures

      1. Report bots
      2. Avoid anonymity online...be a real person
      3. Build a strong social network presence
      4. Choose resilient social media
    1. ips for a clear and informative notice There is no one, right way to give notice, and different situations may require more or less complicated notices and marking. The following tips may be helpful, however, in designing a clear and informative notice. 1. Define the work Substantively define the work to which you are applying the license. A notice that "this work" is offered under a CC license tells a user much less than one that gives the work's title or defines the work. If you intend to license a song, name the song. If you intend to license part of a work, describe that part. For instance, an author of a novel could offer one chapter under a CC license and use the following notice: "Chapter X of Novel Y by Author Z is offered under [license version].” Licensors who define the licensed work make it easier for users to understand which works and parts of works are licensed and available for use. 2. Identify any parts of the work to which the license does not apply Here the licensor should describe all of the elements of the work (as defined in the first step) to which the license does not apply, and thus are not available for use under the license terms. You may list reserved elements in the general notice, or you can describe and implement a marking procedure such as watermarking or text notices that you will use to denote those elements of the work that are not licensed. Licensors should inform users about any portions of the work to which the license cannot apply because the licensor does not have the necessary rights, and any places where the licensor has opted not to apply the license as a strategic matter. 3. Identify the rights licensed that you have in the work (as far as you know) and any rights that are not licensed because you do not have them. When licensors are the owners of, or are authorized to exercise, all rights related to their creative works, including copyright and publicity/privacy rights, marking a work is not problematic. However, some licensors do not own all rights related to their works. Copyright is a bundle of rights, such as the right to copy and the right to distribute, which are divisible and may be held by different parties. A licensor without all the rights should list those they have. For instance, a licensor who holds the performance rights to a recording of a song, but not the rights in the composition, should say so. Licensors should attempt to alert users of any rights held by others that may impact their ability to reuse the work. 4. Grant any additional permissions Licensors can use notices to grant additional permissions beyond the license grant. For instance, a licensor who chooses a NoDerivatives or NonCommercial license can grant users permission to create derivatives or make commercial uses under specific conditions. Note that licensors can use notices to broaden the license grant and give additional permissions, but notices cannot restrict any permissions already granted by the CC license. 5. Convey any supplementary requests or information Licensors should use notices to inform us

      Some useful tips

  3. Nov 2024
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02535

      Corresponding author(s): Modica, Maria Vittoria

      1. General Statements [optional]

      We are grateful to the reviewers for their detailed evaluation and insightful comments on our manuscript, which has led us to introduce several clarifications, expand a few issues initially underscored, and amend some incongruencies.

      We have been able to incorporate changes to reflect most of the suggestions provided by the reviewers, as highlighted in the main text. Most of the additional analyses proposed by the reviewers were carried out, in some cases providing interesting insights that were included in the manuscript, while in others revealed not conclusive, as detailed below.

      We believe that the congruence and readability of the manuscript has been overall improved, and we are confident that our responses align with the level of detail required by the reviewers

      • *

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      * Summary: The manuscript by Modica et al reports characterisation of the venom system in the white sea fan Eunicella singularis, a species of an octocorallian coral. E. singularis is common in the north-western Mediterranean sea. The authors used a proteo-transcriptomic approach followed by extensive bioinformatics analysis. Specifically, they generated a new E. singularis *transcriptome and characterised extracts from nematocysyts (venom-bearing structures) and whole body using tandem mass spectrometry. Toxins were identified by HMMER using Tox-prot and VenomZone databases as queries as well as ClanTox web server.

      Major comments:

      As far as I am aware, venom production by ectodermal gland cells has been reported only in sea anemones (Moran et al, 2011), therefore it is unclear whether it is the case in the octocorallian sea fan as well. Additionally, cnidarian toxin-like proteins might be produced by neurons (Sachkova et al, 2020) or involved in development (Surm et al 2024). Thus, it is probable that in E. singularis not all the toxin-like proteins found in the whole body proteome and missing from the nematocyst proteome are venom components. Thus, additional experiments would be required to localise those proteins to ectodermal gland cells. I suggest to mention this limitation and refer to such proteins as "toxin-like" or "putative toxins".

      • *

      We thank the Reviewer for this observation, which is indeed correct. We have modified the text according to this suggestion and we have added a cautionary statement to the analysis section.

      In addition to submitting proteomics data to PRIDE, it would be helpful for readers/reviewers to provide a supplementary excel file with all the peptides and proteins identified by PEAKS Studio. I could not access the data on PRIDE as I think they still have not been assigned a PXD dataset identifier.

      Excel files with both proteomes have now been provided as supplementary material (Suppl tab. 2 and 3).

      * *Minor comments:

      It would be helpful for readers to split the Results and Discussions into smaller subsections with headings, perhaps according to the identified toxin families. It would be also helpful to provide a summary figure with all the toxins identified and perhaps toxin expression levels. Especially showing cysteine patterns for new toxins would be very useful.

      Wherever possible, Results and Discussions were split into subsections according to toxin families, following reviewer’s suggestion.

      Figure 2.C summarizes the identified toxin families along with the number of validated sequences for each of them. We provided an excel file with the sequences and expression levels of the identified toxins as supplementary table 2. We have now added a column with cysteine patterns to better define and characterize these toxins

      It is unclear why the Toxin annotation pipeline is hidden in the supplementary material. It would be also helpful to show it as a schematic pipeline in the main text.

      We have prepared a figure describing the annotation pipeline that is now provided as Fig.1 in the main text.

      The identification of proteolytic cleavage sites is not really described. It would be also helpful to mark them at the Figure 2.

      We have adjusted the Methods section in the Supplementary Material to give a clearer explanation of the methods applied to identify putative cleavage sites. The figure (now Fig. 3) has been adjusted to include the protease recognition site.

      "Other peptides present in E. singularis nematocysts and displaying protease inhibitory domains, but likely lacking a toxin function (Kazal-type, cystatines, antistasins, and macins)..." - why do they likely lack a toxin function? what is the rational behind this statement?

      • *While we were referring to a strictly neurotoxic function, the statement is indeed misleading and was removed from the amended text and modified as follows “Other peptides present in E. singularis nematocysts displaying protease inhibitory domains (Kazal-type, cystatines, antistasins, and macins) were detected but did not present novelty elements. Their sequences are described in supplementary data.”

      "cell- or tissue-specific differential maturation patterns" - I think the differential maturation needs to be confirmed by additional experiments to exclude a possibility of being an artifact due to low mass spectrometry sensitivity.

      This is indeed true. Nonetheless, our proteomic analyses provided quite convincing evidence of this phenomenon. Figure 3 in the manuscript summarizes the output of our PEAKS studio analyses, but for clarity we reported as Suppl. Fig. 1 the original output for the identification of U-GRTX-Esi2a/b.In the figure, each blue line below the precursor sequence denotes a peptide that was confidently identified by LC-MS/MS. As visible, several peptides were identified for this protein in either proteome, but there is a clear pattern pointing toward the complete absence of the first domain in the NEM-P. The Reviewers have rightfully raised concerns that, given the ethanol extraction protocol employed, our NEM-P may be partial and/or contaminated by other extracted proteins. This is true, and in fact we have added cautionary statements throughout the text. It is reasonable to assume, though, that proteins with similar sequence and physicochemical features, like U-GRTX-ESI-2a and 2b, will respond similarly to the ethanol extraction procedure. If present, we believe the first domain (U-GRTX-ESI-2a) should have produced some detectable peptide also in the NEM-P. This seems even more reasonable if we consider that the WB-P contained a much higher number of proteins, which could have led to the loss of detection of some peptides due to instrument settings. With the due caution, we believe it is reasonable to leave our claim in the manuscript, supporting it by adding the Suppl. Fig.1.

      "three consecutive ShK domains with peculiar characteristics (Suppl. Fig. 2)" - what are these characteristics?* *

      This has been better clarified in the text which now reads “Only the C-terminal domain has the typical ShKT cysteine pattern, whereas the first two domains present an unusual shift of the C-terminal cysteine. None of the domains of U-GRTX-Esi4 presents the key Lys residue necessary for binding KV1.2 and KV1.3, while the subsequent Tyr residue, also important for binding KV1, is extremely conserved”. The reference figure is now Suppl. Fig. 3.

      Fig. S1 legend: "Octocorallia (cyano bar) and Hexacorallia (blue bar)" - the bars look pink and cyan.* *

      *The figure (now Suppl. Fig. 2) was modified in order to fix this issue. *

      * *Referee cross-commenting

      I agree with both reviewers that additional validation of the ethanol extraction method would be required to confirm its specificity and efficiency. Since ethanol is widely used for tissue fixation, I would guess that it is improbable that it leads to disruption of other coral cell types in addition to discharging nematocytes. However, to be 100% sure that would need to be confirmed experimentally. I think the suggestion to use Xenia single cell dataset to validate the nematocyst proteome reported in this paper is really worth trying. However, toxin-like genes in cnidarians might be recruited to non-venom cell types (Sachkova et al, 2020; Surm et al 2024) therefore if a gene is nematocyte-specific in one species it does not mean it would the same in another one, especially if they are distantly related. Thus, the best would be to run some additional experiments in Eunicella singularis, if the tissue is available.

      We have received this concern and addressed it by rephrasing the text. We have also performed the requested check with Xenia nematocysts single cell data set. In detail, we recovered 243 high-confidence single-copy orthologs conserved between Xenia and E. singularis, which were described as belonging to cluster 11, associated to nematocytes by Hu and colleagues in their 2020 Nature article. We comparatively evaluated the abundance of the peptide fragments that could be mapped to the corresponding de novo assembled contigs in E. singularis whole-body and nematocyst proteomes, finding very little overlap, both with the whole-body, and with the nematocyst proteome. In detail, we found none of the sequences shared with Xenia cluster 11 in the NEM-P, while 16 sequences were retrieved in the WB-P. None of the latter corresponded to toxins, but rather possessed PFAM domains indicative of housekeeping functions.

      We believe that these observations are not surprising, due to the following reasons:

      (i) as we show in Figure 6, Xenia appears to display a highly divergent venom arsenal not just from Eunicella singularis, but also from all other Octocorallia. Consequently, we can hardly expect any of the main molecular components of the venom to display a 1:1 orthology between the two species. In addition, Xenia is a zooxanthellate species, obtaining most of its energy autotrophically and complementing with the absorption of particulated organic matter. Due to its trophic ecology, we do not expect this species to produce predatory venom.

      (ii) although Xenia cluster 11 includes genes specifically expressed in the nematocysts, these do not necessarily encode venom components but also other cellular components from the nematocytes. In contrast, if successful, our approach would yield a fraction enriched in secretory products while other intracellular or membrane-bound proteins that are specifically expressed by nematocytes, are not expected to be particularly enriched in the NEM-P.

      In addition, due to the remarkable divergence between these two species, not all Xenia nematocyte-specific transcripts are expected to retain the same specificity also in Eunicella.

      Reviewer #1 (Significance (Required)):

      This study reports venom composition of an octocoral for the first time. These data are very important for understanding biology and ecology of these animals as they rely on venom for feeding and deterring predators. This study is a significant advancement of the cnidarian venomics as most of the literature is limited to sea anemone and jellyfish venoms. This study will be interesting to the broad audience: venomics and coral ecology communities, evolutionary biologists and marine scientists. The main strength of this work is that it provides a comprehensive overview of the venom system in a widespread octocoral species with important ecological roles. The limitations of this study is that the toxicity and biological function of the identified venom components have not been confirmed experimentally. However, the localisation of the proteins to nematocysts is a very strong indication of being a venom component. My expertise: cnidarian venom (biochemistry, ecology and evolution).

      *

      *Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors of this work explore the venom repertoire of octocoral, a group of cnidarians whose venom has largely been ignored in the literature. As a first step into characterizing the venom of octocorals, the authors use a proteo-transcriptomic approach for Eunicella singularis, Specifically, they generated the transcriptome and proteome from whole-body as well as a more specific proteome of the nematocyst, a specialized sub-cellular structure found only in cnidarians and used to inject venom. The nematocyst proteome is a crucial dataset of the manuscript as it allows the authors to discriminate what is most likely a bona fide toxin compared to general physiological proteins.

      * Major: However, I have some skepticism regarding the legitimacy of this nematocyst proteome. Specifically, the proteins from this are nematocyst-specific. The authors used an approach to soak the animal in ethanol, which theoretically should cause the nematocyst to fire, releasing the venom housed inside. This is a technique previously used in box jellyfish where they show that indeed the nematocyst have fired using histological approaches. However, this was not validated for Eunicella singularis*. I am hesitant to fully accept that the data from the nematocyst-proteome is specific. Other approaches, such as isolating nematocyst using a percoll gradient, will likely generate a more specific nematocyst proteome. This percoll gradient approach has been used to isolate nematocysts from different species of cnidarians ranging from hydra to sea anemones, however, I recognize that although this approach is robust for different cnidarians, acquiring enough material is challenging and maybe beyond the capacity for this octocoral. I would argue this would be the best approach, but if not feasible I can understand. However, other potential validation could be used to help improve the confidence that this is, at least mostly, nematocyst-specific. Furthermore, one could argue that this ethanol approach used in box jellyfish also specifically used tentacle, a tissue significantly enriched in nematocyst likely greatly improving the specificity in isolating nematocyst-specific proteins. whereas in this study they use a collection of whole polyps, therefore, anything that is extracted from the ethanol would precipitate. This is a much more complex collection of tissues which I would assume could interfere with isolating nematocyst-specific proteins

      We thank the Reviewer for these comments. It is indeed true that there are cleaner procedures to extract venom from nematocysts. Preliminary attempts with electrical stimulation of colonies to milk the venom were also performed, but did not yield satisfactory peptide amounts for further analysis. We then decided to attempt ethanol extraction. As also noted by Reviewer #1, ethanol is routinely used for tissue fixation, and we think that it could have only limited effect on other cell types, therefore we assumed that most proteins in this extract had to come from nematocysts firing. While we cannot be sure that we fired all kind of nematocysts from E. singularis, the enrichment of the NEM-P in proteins with typical toxin features (i.e. signal peptide, small size, elaborate cysteines patterns), represented an indirect proof of this hypothesis. We believe this NEM-P may represent a good snapshot of venom components from E. singularis. On the other hand, it is true that the ethanol procedure may introduce some contamination. Indeed, we adopted a conservative approach and discussed in detail only the proteins with toxin-like features. At any rate, we have clearly stated the methodological limitations of our approach in the text and added cautionary statements through the manuscript.

      * *A computational approach, that I think is essential, is to use the Xenia single-cell atlas. Xenia is also an octocoral with a nice single-cell atlas in which the cnidocytes form a distinct cluster. The authors can perform a reciprocal best-blast hit with the xenia genome and Eunicella singularis transcriptome and then see if gene-encoding proteins found in Eunicella nematocyst proteome have orthologs with genes found in the Xenia cnidocyte cluster. A statistical test could then be performed to show that there is a significant overlap between the nematocyst proteins from Eunicella and their orthologs in the Xenia cnidocyte cluster. This is still quite indirect but can give some insights. A better approach would be to perform proteomics from Xenia using the ethanol approach and mapping to see where the proteins captured are found in the atlas. This would massively elevate this work and provide proof that indeed this approach using ethanol is capable of precipitating nematocyst-specific proteins. I would strongly recommend trying to provide some evidence that this is indeed a nematocyst-specific protein, or at the least, is significantly enriched. Because this is unknown, many of the interpretations presented downstream are not well supported.

      As previously stated in response to Reviewer #1, we have performed the requested check on Xenia nematocyte single cell data set. In detail, we followed the advice provided by the reviewer, extracting the protein sequences of the 432 Xenia genes included in cluster 11 from the work by Hu and colleagues, and recovered the nucleotide sequence of the assembled transcripts of 243 high-confidence 1:1 orthologs from E. singularis. In this process, we paid particular attention to excluding ambiguous matches, such as genes subjected to lineage-specific duplications, and therefore we exploited the availability of the annotated genome of the congeneric species E. verrucosa for the first step of orthology detection (performed through a reciprocal BLASTp approach). In the second step of the analysis, the corresponding assembled transcripts from E. singularis were identified with tBLASTn, assuming an inter-specific divergence This subset of putative nematocyst-specific sequences was subjected to an in-depth analysis, which comparatively evaluated the relative abundance of mapped peptide fragments in the whole-body and nematocyst proteomes. This process led to the identification of very little overlap between Xenia and E. singularis. We believe that these observations are not surprising, due to the following reasons:

      (i) as we show in Figure 6, Xenia appears to display a highly divergent venom arsenal not just from Eunicella singularis, but also from all other Octocorallia. Consequently, we can hardly expect any of the main molecular components of the venom to display a 1:1 orthology between the two species. In addition, Xenia is a zooxanthellate species, obtaining most of its energy autotrophically and complementing with the absorption of particulated organic matter. Due to its trophic ecology, we do not expect this species to produce predatory venom.

      (ii) although Xenia cluster 11 includes genes specifically expressed in the nematocysts, these do not necessarily encode venom components but also other cellular components from the nematocytes. In contrast, if successful, our approach would yield a fraction enriched in secretory products while other intracellular or membrane-bound proteins that are specifically expressed by nematocytes, are not expected to be particularly enriched in the NEM-P.

      In addition, due to the remarkable divergence between these two species, not all Xenia nematocyte-specific transcripts are expected to retain the same specificity also in Eunicella.

      Another major issue with the manuscript is the section referring to SCRiPs. First, the authors do not cite Jouiaei, Sunagar et al. (2015) which was the first publication to functionally characterize SCRiPs as toxins. Additionally, the majority of SCRiPs identified in this study and those found in Eunicella have a different cysteine framework. The authors acknowledge this online 245 but claim that, given the alphafold structure is similar, they are from the same gene family. First, I think this is very weak support as typically sharing a conserved cysteine framework is the bare minimum to categorize these toxins in a gene family. Although some cysteine frameworks are somewhat hard to resolve as the space between the cysteines can be variable, in this case, SCRiPs have a very distinct triple repeat of cysteines near the C terminal that is missing in these octocoral SCRiPs. These make me suspicious that these are indeed from the same gene family. Then relying on alphafold to predict the structure and claiming it's similar to Tau-AnmTx Ueq 12-1 from Urticina eques is also fairly weak support. Although I am not an expert in protein structures, I cannot tell from the images comparing the 2 structures in the supplementary figure s1 that these are similar. Perhaps you could align or overlap them, or give some readout of the similarity of these structures. Currently, I am skeptical of any of the SCRiPs described in this manuscript. Additionally, if the authors can show that indeed these are SCRiPs, again I would strongly advise the authors to check the Xenia scRNA-seq to see if these Xenia SCRiP-like sequences are expressed in cnidocytes.

      Given the concerns raised by the Reviewer, throughout the text we now referred to octocoral SCRiPs as SCRIP-like proteins or octo-SCRiPs. Reference to Jouiaei, Sunagar et al. (2015) was added. However, we would like to point out that we do not associate them to hexacoral SCRiPs based on their predicted structure similarity: the Suppl. Fig. 2 presents the alignment of the sequences of these proteins with representative sequences from Hexacorallia, highlighting a sequence similarity up to 68%. Considering the high level of sequence divergence generally recognized within toxin families, this high similarity value contributes to support our claims. Despite the relevance of the cys framework in defining toxin families, a single amino acid shift is not necessarily indicative of a new structural family.

      Concerning the structural comparison between SCRiPs and octo-SCRiPs, Suppl. Figure 2.B has been replaced with a superposition of the structure of AnmTx Ueq 12-1 with the model of U-GRTX-Esi1a. The structures were aligned with TM-align, resulting in a Cα RMSD for the aligned region of 1.86 Å, which confirms the strict similarity of the two proteins.

      Unfortunately, we need to rely on available genome annotations for the evaluation of the Xenia scRNA-seq data. The only currently annotated Xenia gene showing significant homology with the SCRiP-like of E. singularis (Xe_002907) has a highly different organization, as it shows five consecutive cysteine-rich domains, and is therefore not orthologous to any of the three sequences we report in the present work. In the paper by Hu and colleagues, Xe_002907 is associated to cluster 2, which was unrelated with nematocysts.

      * Minor:

      *The ShK protein, U-GRTX-Esi4, strikes me as similar to NEP3 gene family identified in Nematostella, which also has 3 ShK domains (Columbus-Shenkar et al. 2018).

      We have added reference to the NEP3 family in the text and discussed the similarities of U-GRTX-Esi4 with its members, highlighting that while in NEP3 the mature toxin corresponds only to the first ShK domain, U-GRTX-Esi4 is supported as a multidomain protein by our proteomic analyses.

      Interestingly U-GRTX-Esi20 and 21 were found to be structurally similar to acrorhagin 1a but do not share a conserved cysteine framework ( 6 cysteines vs 8). One thing that the authors should be careful of, and perhaps point out that this is indeed not nematocyst-specific, is that an ortholog acrorhagin 1a was found to be expressed in the neurons in Nematostella (Sachkova et al. 2020). Perhaps ancestral acrorhagin 1 was found in the last common ancestor of Anthozoa but was a neuropeptide that got recruited to the venom in Actinia.

      Because of the methodology employed, we expected the NEM-P to be a toxin-enriched subset of the WB-P. Indeed, some of the toxin-like proteins detected in the NEM-P were not observed in the WB-P, where they might have been below the LOD during proteomic analysis. On the other hand, being a whole-body proteome, we expect the WB-P to contain ALSO nematocyst specific proteins. At present, the detection of U-GRTX-Esi20 and 21 in the WB-P does not rule out that these may be nematocyst specific, whereas their presence in the NEM-P, in our view, confirms their occurrence in the venom. At any rate, given the current level of evidence, this Reviewer is right in considering all possibilities, such as their neuropeptide nature. These considerations have been added to the text.

      * Also in general the authors refer to a lot of phylogenetics that I cannot see in the paper. For example, on line 339: "Our genomic survey indicates that these two toxins belong to two distinct monophyletic orthogroups within a very large superfamily of cysteine-rich peptides, encoded by ancestrally duplicated paralogous genes with intronless structures, that also include other members in E. singularis, not detected in the NEM-P." *What genomic survey are you referring to (where is this data)? What do you mean by "belong to two distinct monophyletic orthogroups".

      In the attempt to keep the manuscript more concise, we concentrated comparative genomic analyses in the supplementary material. We now provide in the main text a detailed phylogenetic tree that displays the complex evolutionary relationships between U-GRTX-Esi20 and 21 and a number of other related sequences sharing significant sequence homology and predicted structural organization (Figure 6). In detail, the two Eunicella toxins belong to two groups of sequences, labeled as “type I” and “type VI” which are highly supported by robust bootstrap values (94 and 95, respectively) as monophyletic within Malacalcyonacea. Notably, we could identify four additional monophyletic groups, characterized by similar support values, that included sequences from both Eunicella and other Malacalcyonacea species (type II, III, IV and V). Nevertheless, these sequences were not identified as venom components by our proteomic analyses. Related proteins were also identified in species belonging to Scleralcyonacea, even though their precise relationships with those of Malacalcyonacea were often unclear.

      Also, there is no visualization of the results when the authors refer to the genomic surveys, especially when referring to intron-exon boundaries. Please include which genomes include which sequences and their given intron-exon boundaries for a given gene family. I do not understand how the authors resolved figure 4. How do you know there was a loss not a gain of f exon 2 in the gene encoding for U-GRTX-Esi17. Providing the genomic loci for the toxin gene families would help. Maybe something like figure 5 from Koludarov et al. (2024) would be useful, but ideally including intron-exon boundaries.

      The scenario we propose is far more parsimonious than the alternative hypothesis involving an intron gain, since this would have required an extremely complex combination of far less likely events, i.e. the independent acquisition of two partial colipase-like arrays in positions compatible with the generation of a complete colipase-like cysteine array. Despite being theoretically possible, we believe this scenario to be highly unlikely, also considering the well-established differences between the rates of intron gain and intron loss in eukaryotes, with the latter exceeding the former by several orders of magnitude (see Roy and Gilbert, 2005, https://doi.org/10.1073/pnas.0500383102).

      We present a supplementary figure which schematically displays the architecture of the genes encoding novel putative venom components described in this manuscript. We need to remark the fact that, as mentioned in the main text, no genome assembly is presently available for E. singularis, and therefore such gene architectures have been inferred from the congeneric species E. verrucosa. Despite being certainly interesting, the approach proposed by the reviewer referring to figure 5 from Koludarov et al., which would basically involve a microsynteny analysis for all loci, would go far beyond the aims and scopes of the present work and require an unreasonable workload, with a very marginal increase in the quality of the data we report. First and foremost, no genome assembly is available for our target species. Moreover, just a very few genomes of Octocorallia are associated with publicly available gene annotations (in detail, no gene annotation tracks are available for R. reniformis, P. caledonicum, V. gustaviana, P. papillata, Chrysogorgia sp., H. coerulea, P. subtilis, Trachytela sp. and M. muricata). The lack of existing annotations does de facto prevent the possibility of retrieving flanking genes and providing evolutionary insights at the level requested by the reviewer. We believe that the manual annotation of the target genes of interest in all analyzed species fully meets the objectives of this study.

      In the methods the author's mention:

      "Whenever needed (i.e., U-GRTX-Esi20 and 21), a fine-scale classification of orthologous sequences was aided by Maximum Likelihood phylogenetic inference analyses, carried out with IQ-Tree [49] with 1000 ultrafast bootstrap replicates based on the best-fitting model of molecular evolution detected by ModelFinder [50]."

      So please include this data as supplementary figures. The authors did plenty of analysis they refer to but do not include this in the paper. This lack of data makes it very hard to follow many of the phylogenetic and genomic insights from this manuscript.

      The phylogenetic tree which concerns U-GRTX-Esi20 and 21 has been added in the main text as Figure 6. In pretty much all other cases where we referred to comparative genomics analyses, our inferences were simply based on the detection (or lack thereof) of orthologous genes. Considering the narrow taxonomic distribution of most target sequences, which prevents the possibility of identifying suitable outgroups for tree rooting purposes, and their usual presence as single-copy genes in E. singularis, we don’t think that adding phylogenetic trees would add useful information to the manuscript. Nevertheless, we have added the multiple sequence alignments of all relevant groups of orthologous sequences as supplementary figures.

      • *Reviewer #2 (Significance (Required)):

      * *This work is very can be very useful in extending our knowledge of venom in cnidarians and can help build better resolution of the evolutionary history of the ecologically essential proteins

      * *Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      *

      *SECTION A - Evidence, reproducibility and clarity

      * =================================================

      Summary: *Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      * This manuscript describes the proteotranscriptomic analysis of samples from the coral Eunicella singularis. A number of putative venom toxins are identified. In silico structural analyses are performed for select putative toxins and inferred activity/function is discussed. In my opinion the subject of the study is important. However, I have some important questions about the methodology (regarding "venom collection" and assignment of "venom components"), and given the preliminary nature of the study I found some of the conclusions (regarding activity) somewhat overstated. *Major comments:

      • Are the key conclusions convincing?

      * While some conclusions were justified, I felt unconvinced by others. Some of my pessimism stems from the technique used to extract the venom i.e. ethanol immersion. I'm not familiar with the use of this technique, however it strikes me as likely to be associated with some limitations. For example, while the nematocysts may indeed discharge their contents I would expect some contents e.g. larger proteins to be insoluble. Was this considered? This would have some major impacts on the conclusions drawn e.g. *(L418: "absence, in the NEM-P of E. singularis, of the common cnidarian cytolytic proteins." AND (L492): "conventional pore forming toxins (PFTs) of Cnidaria, including the aerolysin-like Δ-GRTX-Esi29 and the two actinoporins Δ-GRTX-Esi30 and 31 were not retrieved in the nematocysts' proteome."

      Because of this observation, the authors concluded that these were not venom components in this species and speculated on other functions. However, I can't help wondering if these were simply excluded from analysis as a result of the ethanol extraction i.e. a false negative.

      As anticipated in our response to Reviewer #1, we opted for ethanol extraction due to sample limitation and unsuccessful attempts with other venom collection protocols. The procedure we employed was first described by Jouiaei et al., 2015, to extract venom from the tentacles of Chironex fleckeri. Proteins and peptides extracted from the nematocysts were indeed precipitated from ethanol and subsequently resuspended for proteomic analysis. The original protocol by Jouiaei et al. used precipitation at -80°C to recover the proteins from ethanol. Albeit denaturing, this protocol should not imply sample losses. Large proteins that did precipitate were still resuspended and analyzed. We have introduced an evaporation/lyophilization step, which should not alter the outcome. In fact, we did detect higher molecular weight proteins in the NEM-P (mostly structural and enzymes). While denaturation and precipitation may functionally inactivate these proteins, these should all be detected by proteomics. The authors of the original paper presented a comparison between the venom obtained from ethanol extracted tentacles and the proteome of pressure disrupted purified nematocysts. In both cases, additional “non venom” and “structural” proteins were also detected (e.g. histones, filamin, ribosomal proteins, myosin, actin, collagen…). Given the prevalence of toxins or toxin-like proteins in our extract, we were reasonably convinced of the success of the extraction protocol. For sure, the method may present limitations: as also observed by Reviewer #1 and #3, contamination with non-nematocyst proteins is possible. This has also been considered. In fact, we adopted a conservative approach, choosing to discuss in detail only proteins with structural similarities with known toxins and/or typical toxin-like features. On the other hand, as noted by this Reviewer, our results may be partial, but, in our opinion, this would be most likely due to incomplete nematocysts firing rather than to sample loss. All these possibilities have now been better discussed and addressed in the text. At any rate, we are convinced that the protein diversification detected in the NEM-P is indicative of the presence of several venom components and provides a first indication of the existence of novel, octocoral-specific, venom protein families.

      Comparisons were made to other tissue samples (whole bodies). Were these samples prepared in the same way i.e. ethanol extraction? If not, the power of any comparisons would be limited.

      Following the described experimental approach, we expected the NEM-P to be a subset of the WB-P, for which no purification/enrichment of sort was performed. In fact, we reported both proteomes to confirm the enrichment of the NEM-P in venom components, highlighting the presence of putative toxins that might have been below the instrumental limit of detection in the more crowded whole body protein extract. At any rate, we have now modified the text, adding cautionary statements that may also explain our results.

      • *It was unclear to me exactly how "venom components" (Fig. 1A) were defined. Why are "enzymes" , "structural" and "unknow" NOT considered venom components when they were identified in the "venom" extract?

      The “structural” and “enzymes” categories were used to analyze the hits in the NEM-P. We decided to discuss only putative neurotoxins or cytolytic toxins based on the limited selectivity of the extraction protocol employed and on the lack of histological control. As structural components and enzymes, in the absence of a crude venom extract, may derive from other tissues, we preferred not to discuss them. We hope this is clearer in the amended version of the manuscript.

      Furthermore, a large proportion of proteins detected are "structural" - doesn't this suggest that the "venom" extract included a large proportion of false positives i.e. non-toxin proteins? Is it possible that some of the proteins which are considered as "venom components" are also false positives?

      • *As also noted by Reviewer #1, aside from contamination from other tissues, some of the toxin-like proteins we identified may have different functions (e.g, neuronal, developmental) and their toxin function is presumed on the basis of structural features. This issue is clearly addressed in the manuscript. Nonetheless, putative toxins are definitely enriched in the NEM-P compared to the WB-P, which leads us to believe that the NEM-P is a fraction enriched in nematocysts content. This is now more evident also in the PEAKS output files, provided as Supplementary Tables 2 and 3.

      The nematocyst ethanol extract is referred to throughout the manuscript as "venom". Similarly, what I would consider putative toxins are referred to throughout the manuscript as "toxins". Given the preliminary nature of the study I suggest the authors consider rewording these.

      This has been changed throughout the text.

      In short, the evidence presented left me unconvinced that the nematocyst ethanol extract that was analysed represented the genuine "venom" of this species and that the "toxins" identified represent the genuine toxin repertoire. The authors should at least discuss potential limitations, defend my claims in this context and adjust conclusions accordingly.

      We hope that the additional clarifications provided in the Results and Discussion section, and the amendments we made throughout the manuscript made our statements more convincing

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? See comment above regarding venom collection and conclusions drawn.

      We have introduced cautionary statements throughout the text.

      * *Also, despite the absence of any experimental activity/functional data, there was a lot of inference about activity and function.

      A few examples: L299 - "might have acquired peculiar biological activity."

      L301 - "support their relevance for the predatory and/or defensive strategies…"

      L326 - "abundance of this protein suggests a strong functional relevance…"

      L358 - "the structure presented a SCRiP-like W-shaped fold, indicative of a potential neurotoxic function."

      L427 - "suggestive of a peculiar chemical selectivity towards different lipids"

      L506 - "the cytolytic activity seems to be ascribable mostly to the six saposins"

      * *I suggest some removal or rewording throughout the Results/Discussion section to reflect the fact that most of this is purely speculative.

      This has been modified according to the reviewer’s suggestions.

      Regarding the following statement on L300 - "Notably, the transcripts for all these toxins had exceptionally high TPM values (1806, 569, 826 and 429, respectively for the U-GRTX-Esi14 to 17/18), which support their relevance for the predatory and/or defensive strategies of Eunicella singularis." These TPM values don't seem high to me e.g. 1806 TPM = 0.0018% of transcripts. How do these numbers compare to other "non-venom" components of the transcriptome? A graph illustrating this would be helpful.

      We thank the Reviewer for this suggestion. The expression values we report in this work were calculated based on an RNA-seq library generated from a whole body sample. Consequently, considering the low relative abundance of nematocysts to total body weight, we expect that the contribution of this cell type to the total extracted RNA to be rather low. We exploited the available information from a previously published single-cell RNA-seq dataset obtained from another octocoral species (i.e. Xenia, see Hu et al., 2020, Nature) to identify the most likely candidate nematocyst-specific mRNAs venom components having a 1:1 orthology relationship with E. singularis. In detail, we were able to detect high-confidence 1:1 orthologs for 242 out of the 432 Xenia genes included in cluster 11 in the study by Hu and colleagues (i.e. the cluster associated with nematocysts). This allowed us to assess the expression of the orthologous sequences, expected to share a similar cell-specificity, in E. singularis. The 242 putative nematocyst-specific mRNAs displayed an average expression level of 16.65 TPM (median = 4.85 TPM) in the whole body sample, and just 8 out of these (i.e. about 3% of the total) had an expression level higher than 100 TPM. Based on these observations, we believe that our statement that “all these toxins had exceptionally high TPM values” holds true. Supplementary table 2 reports the sequences of the toxins identified in the NEM-P together with the TPM of the corresponding transcripts.

      Regarding the following statement on L463 - "Our investigation unequivocally demonstrated that Octocorallia do produce venom" Was it not already known that Octocorallia have nematocysts and therefore are venomous (in which case this should be cited)? If this wasn't known, I don't think this study was really designed to test this hypothesis. Regardless, I don't think this is a meaningful claim to make here.

      This observation is correct. We have rephrased the text accordingly.

      Table S2: on what basis are the sequences highlighted in red considered "proteomics validated" e.g. confidence, coverage? Could a protein abundance column be included in this table (for NEM and WB tissues)?* *

      Residues highlighted in red in Table S2 (now Suppl tab. 4) correspond to the tryptic peptides identified with good confidence by the LC-MS analysis. We have added supplementary files, as per request of Reviewer #1, with the summary of the PEAKS Studio outputs for the two proteomes, highlighting the confidence and coverage scores. In Suppl. Tab. 4, coverage has been recalculated considering the sequence of the predicted mature peptide (not the precursor identified by PEAKS Studio). Finally, as PEAKS Studio does not provide a quantitative measure of the identified peptides (i.e., counts), we have calculated and added to said tables the exponentially modified Protein Abundance Index (emPAI), which provides an approximate label free measure of each protein’s abundance. We have also added the relative emPAI, which normalizes each protein's emPAI value relative to the total emPAI of all proteins in the sample, providing a percentage abundance. It is noteworthy that all the proteins that have been identified as putative toxins have higher relative emPAI values in the NEM-P, thus providing yet an additional indirect proof of the validity of the ethanol extraction protocol (see Suppl. Tab. 2 and 3).

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. *Additional experiments e.g. synthesis and activity assays would go a long way towards bolstering some of the conclusions. However, if some of the conclusions can be toned down a little (see comments above), I don't consider these to be essential.

      In my opinion, the study would benefit from some additional analyses (described in the comments above).

      See our answers to the specific comments above.

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      N/A* * Are the data and the methods presented in such a way that they can be reproduced?

      Yes. * Are the experiments adequately replicated and statistical analysis adequate? *No - I may be wrong, but as far as I can tell from the text, replicates were not collected. Three cDNA libraries were generated but were these replicates (please clarify this in the Methods)? It could be reasonably argued (and I would mostly agree) that replicates are not necessary for a general analysis of the composition of the samples. However in a couple of instances conclusions are drawn based on "differential expression". I suggest that in the absence of expression level replicates these conclusions should be withdrawn.

      The statements about differential expression (more correctly differential maturation) are based on proteomics results and not on DEG analysis in the transcriptome (see also reply to reviewer #1). All the claims have been rephrased and the supplementary figure 1 has been added to support our statements.

      Concerning the cDNA libraries, however, they were prepared as technical replicates to account for variations in venom expression among samples, and the resulting assemblies were pooled before assembly, as explained in the Methods section.

      • *"Abundance" of proteins or toxins was mentioned on occasion, but no data on quantification or abundance of proteins is mentioned anywhere (although this is something that could be done with the LC-MS/MS data). In my opinion these data would be very useful and should be included, especially if mentioned in the text.
      • *As previously discussed, we have calculated and added to the PEAKS output file the emPAI and the relative emPAI values. These data are now provided in the supplementary Tables 2 and 3.

      Minor comments:

      * *Specific experimental issues that are easily addressable.

      Are there limitations to the ethanol extraction procedure (please add a paragraph in the Discussion)? Are there any previous studies using this procedure?

      This has been done: the potential drawbacks of the ethanol extraction procedure are now addressed in the Results and Discussion section.

      * *Are prior studies referenced appropriately?

      Yes, for the most part (but see comment above).

      * *Are the text and figures clear and accurate?

      In general yes, although I found myself looking for actual data. Most of the current figures are summaries or cartoons. I would have liked to have seen pictures of the species in question (including a picture/diagram of the tissue from which the cDNA libraries and proteomes were derived); a picture of the nematocysts; the total ion chromatogram of the "venom"; Some type of figure to place the "toxin" expression level in the context of all transcripts; some more of the actual sequences identified including alignments (in the main text rather than the SI);

      Various figures in the manuscript have been modified in accordance to the Reviewers’ suggestions. We have included a workflow of the extraction with a picture of E. singularis and modified Fig1 (now Fig 2) to include the TIC of the NEM-P.

      Figure 4: could the motifs and termini for each be labelled please.

      This has been done.

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions? See comments above. In my opinion, the work done was quite preliminary (i.e. analysis of a single species and does not include any activity/functional data) but still significant and useful to the field. I felt that some of the conclusions were unnecessarily over-reaching and could be toned down without detracting from the importance of the manuscript.

      Several instances of hyperbole could be toned down e.g. use of the words: remarkable (L27); rich (L28); intricate (L38); significant (L189); peculiar (L299, 427); only (L191); exceptionally (L300); extremely (L316); strong (L326). Similarly, some wording is subjective e.g. "worthy of" (L33); "interestingly" (L220, 382, 426, 492, 535). Please amend.

      We have toned down our statements through the manuscript.

      "Homology" is used throughout when referring to similarity. Please change.

      This has been done

      Minor typos and similar:

      2.5 cm (L97) - use 25 mm (cm is not a standard scientific measure).

      30" (L97) - 30 min?

      ml (L97) - mL is technically correct although some journals use ml, regardless should be consistent throughout. Reverse-phase (L127) – reversed-phase

      30,000 (L141) – units?

      Typos were corrected.

      *

      *Reviewer #3 (Significance (Required)):

      *

      *SECTION B – Significance

      * ========================

      *- Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      * *Cnidarian venoms and toxins have been the subject of extensive study over the past several decades. However there has been very little work performed on corals. In this respect, this subject of this manuscript is significant.

      * *- Place the work in the context of the existing literature (provide references, where appropriate).

      * *The subject of this manuscript i.e. the characterisation of the venom composition of a coral is an interesting topic. The work is rather preliminary, but still represents an important addition to the literature (without requiring overinterpretation of the results-see comments above).

      * *- State what audience might be interested in and influenced by the reported findings.

      * *I would expect the manuscript to be of interest to others working in the toxinology field, particularly those working on Cnidarian venoms or toxins.

      * *- Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      * *Venom; Toxins; Pep

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors of this work explore the venom repertoire of octocoral, a group of cnidarians whose venom has largely been ignored in the literature. As a first step into characterizing the venom of octocorals, the authors use a proteo-transcriptomic approach for Eunicella singularis, Specifically, they generated the transcriptome and proteome from whole-body as well as a more specific proteome of the nematocyst, a specialized sub-cellular structure found only in cnidarians and used to inject venom. The nematocyst proteome is a crucial dataset of the manuscript as it allows the authors to discriminate what is most likely a bona fide toxin compared to general physiological proteins.

      Major:

      However, I have some skepticism regarding the legitimacy of this nematocyst proteome. Specifically, the proteins from this are nematocyst-specific. The authors used an approach to soak the animal in ethanol, which theoretically should cause the nematocyst to fire, releasing the venom housed inside. This is a technique previously used in box jellyfish where they show that indeed the nematocyst have fired using histological approaches. However, this was not validated for Eunicella singularis. I am hesitant to fully accept that the data from the nematocyst-proteome is specific. Other approaches, such as isolating nematocyst using a percoll gradient, will likely generate a more specific nematocyst proteome. This percoll gradient approach has been used to isolate nematocysts from different species of cnidarians ranging from hydra to sea anemones, however, I recognize that although this approach is robust for different cnidarians, acquiring enough material is challenging and maybe beyond the capacity for this octocoral. I would argue this would be the best approach, but if not feasible I can understand. However, other potential validation could be used to help improve the confidence that this is, at least mostly, nematocyst-specific. Furthermore, one could argue that this ethanol approach used in box jellyfish also specifically used tentacle, a tissue significantly enriched in nematocyst likely greatly improving the specificity in isolating nematocyst-specific proteins. whereas in this study they use a collection of whole polyps, therefore, anything that is extracted from the ethanol would precipitate. This is a much more complex collection of tissues which I would assume could interfere with isolating nematocyst-specific proteins

      A computational approach, that I think is essential, is to use the Xenia single-cell atlas. Xenia is also an octocoral with a nice single-cell atlas in which the cnidocytes form a distinct cluster. The authors can perform a reciprocal best-blast hit with the xenia genome and Eunicella singularis transcriptome and then see if gene-encoding proteins found in Eunicella nematocyst proteome have orthologs with genes found in the Xenia cnidocyte cluster. A statistical test could then be performed to show that there is a significant overlap between the nematocyst proteins from Eunicella and their orthologs in the Xenia cnidocyte cluster. This is still quite indirect but can give some insights. A better approach would be to perform proteomics from Xenia using the ethanol approach and mapping to see where the proteins captured are found in the atlas. This would massively elevate this work and provide proof that indeed this approach using ethanol is capable of precipitating nematocyst-specific proteins. I would strongly recommend trying to provide some evidence that this is indeed a nematocyst-specific protein, or at the least, is significantly enriched. Because this is unknown, many of the interpretations presented downstream are not well supported.

      Another major issue with the manuscript is the section referring to SCRiPs. First, the authors do not cite Jouiaei, Sunagar et al. (2015) which was the first publication to functionally characterize SCRiPs as toxins. Additionally, the majority of SCRiPs identified in this study and those found in Eunicella have a different cysteine framework. The authors acknowledge this online 245 but claim that, given the alphafold structure is similar, they are from the same gene family. First, I think this is very weak support as typically sharing a conserved cysteine framework is the bare minimum to categorize these toxins in a gene family. Although some cysteine frameworks are somewhat hard to resolve as the space between the cysteines can be variable, in this case, SCRiPs have a very distinct triple repeat of cysteines near the C terminal that is missing in these octocoral SCRiPs. These make me suspicious that these are indeed from the same gene family. Then relying on alphafold to predict the structure and claiming it's similar to Tau-AnmTx Ueq 12-1 from Urticina eques is also fairly weak support. Although I am not an expert in protein structures, I cannot tell from the images comparing the 2 structures in the supplementary figure s1 that these are similar. Perhaps you could align or overlap them, or give some readout of the similarity of these structures. Currently, I am skeptical of any of the SCRiPs described in this manuscript. Additionally, if the authors can show that indeed these are SCRiPs, again I would strongly advise the authors to check the Xenia scRNA-seq to see if these Xenia SCRiP-like sequences are expressed in cnidocytes.

      Minor:

      The ShK protein, U-GRTX-Esi4, strikes me as similar to NEP3 gene family identified in Nematostella, which also has 3 ShK domains (Columbus-Shenkar et al. 2018).

      Interestingly U-GRTX-Esi20 and 21 were found to be structurally similar to acrorhagin 1a but do not share a conserved cysteine framework ( 6 cysteines vs 8). One thing that the authors should be careful of, and perhaps point out that this is indeed not nematocyst-specific, is that an ortholog acrorhagin 1a was found to be expressed in the neurons in Nematostella (Sachkova et al. 2020). Perhaps ancestral acrorhagin 1 was found in the last common ancestor of Anthozoa but was a neuropeptide that got recruited to the venom in Actinia.

      Also in general the authors refer to a lot of phylogenetics that I cannot see in the paper. For example, on line 339:

      "Our genomic survey indicates that these two toxins belong to two distinct monophyletic orthogroups within a very large superfamily of cysteine-rich peptides, encoded by ancestrally duplicated paralogous genes with intronless structures, that also include other members in E. singularis, not detected in the NEM-P."

      What genomic survey are you referring to (where is this data)? What do you mean by "belong to two distinct monophyletic orthogroups".

      Also, there is no visualization of the results when the authors refer to the genomic surveys, especially when referring to intron-exon boundaries. Please include which genomes include which sequences and their given intron-exon boundaries for a given gene family. I do not understand how the authors resolved figure 4. How do you know there was a loss not a gain of f exon 2 in the gene encoding for U-GRTX-Esi17. Providing the genomic loci for the toxin gene families would help. Maybe something like figure 5 from Koludarov et al. (2024) would be useful, but ideally including intron-exon boundaries.

      In the methods the author's mention:

      "Whenever needed (i.e., U-GRTX-Esi20 and 21), a fine-scale classification of orthologous sequences was aided by Maximum Likelihood phylogenetic inference analyses, carried out with IQ-Tree [49] with 1000 ultrafast bootstrap replicates based on the best-fitting model of molecular evolution detected by ModelFinder [50]."

      So please include this data as supplementary figures. The authors did plenty of analysis they refer to but do not include this in the paper. This lack of data makes it very hard to follow many of the phylogenetic and genomic insights from this manuscript

      Significance

      This work is very can be very useful in extending our knowledge of venom in cnidarians and can help build better resolution of the evolutionary history of the ecologically essential proteins

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Modica et al reports characterisation of the venom system in the white sea fan Eunicella singularis, a species of an octocorallian coral. E. singularis is common in the north-western Mediterranean sea. The authors used a proteo-transcriptomic approach followed by extensive bioinformatics analysis. Specifically, they generated a new E. singularis transcriptome and characterised extracts from nematocysyts (venom-bearing structures) and whole body using tandem mass spectrometry. Toxins were identified by HMMER using Tox-prot and VenomZone databases as queries as well as ClanTox web server.

      Major comments:

      1. As far as I am aware, venom production by ectodermal gland cells has been reported only in sea anemones (Moran et al, 2011), therefore it is unclear whether it is the case in the octocorallian sea fan as well. Additionally, cnidarian toxin-like proteins might be produced by neurons (Sachkova et al, 2020) or involved in development (Surm et al 2024). Thus, it is probable that in E. singularis not all the toxin-like proteins found in the whole body proteome and missing from the nematocyst proteome are venom components. Thus, additional experiments would be required to localise those proteins to ectodermal gland cells. I suggest to mention this limitation and refer to such proteins as "toxin-like" or "putative toxins".
      2. In addition to submitting proteomics data to PRIDE, it would be helpful for readers/reviewers to provide a supplementary excel file with all the peptides and proteins identified by PEAKS Studio. I could not access the data on PRIDE as I think they still have not been assigned a PXD dataset identifier.

      Minor comments:

      1. It would be helpful for readers to split the Results and Discussions into smaller subsections with headings, perhaps according to the identified toxin families. It would be also helpful to provide a summary figure with all the toxins identified and perhaps toxin expression levels. Especially showing cysteine patterns for new toxins would be very useful.
      2. It is unclear why the Toxin annotation pipeline is hidden in the supplementary material. It would be also helpful to show it as a schematic pipeline in the main text.
      3. The identification of proteolytic cleavage sites is not really described. It would be also helpful to mark them at the Figure 2.
      4. "Other peptides present in E. singularis nematocysts and displaying protease inhibitory domains, but likely lacking a toxin function (Kazal-type, cystatines, antistasins, and macins)..." - why do they likely lack a toxin function? what is the rational behind this statement?
      5. "cell- or tissue-specific differential maturation patterns" - I think the differential maturation needs to be confirmed by additional experiments to exclude a possibility of being an artifact due to low mass spectrometry sensitivity.
      6. "three consecutive ShK domains with peculiar characteristics (Suppl. Fig. 2)" - what are these characteristics?
      7. Fig. S1 legend: "Octocorallia (cyano bar) and Hexacorallia (blue bar)" - the bars look pink and cyan.

      Referee cross-commenting

      I agree with both reviewers that additional validation of the ethanol extraction method would be required to confirm its specificity and efficiency. Since ethanol is widely used for tissue fixation, I would guess that it is improbable that it leads to disruption of other coral cell types in addition to discharging nematocytes. However, to be 100% sure that would need to be confirmed experimentally. I think the suggestion to use Xenia single cell dataset to validate the nematocyst proteome reported in this paper is really worth trying. However, toxin-like genes in cnidarians might be recruited to non-venom cell types (Sachkova et al, 2020; Surm et al 2024) therefore if a gene is nematocyte-specific in one species it does not mean it would the same in another one, especially if they are distantly related. Thus, the best would be to run some additional experiments in Eunicella singularis, if the tissue is available.

      Significance

      This study reports venom composition of an octocoral for the first time. These data are very important for understanding biology and ecology of these animals as they rely on venom for feeding and deterring predators. This study is a significant advancement of the cnidarian venomics as most of the literature is limited to sea anemone and jellyfish venoms. This study will be interesting to the broad audience: venomics and coral ecology communities, evolutionary biologists and marine scientists. The main strength of this work is that it provides a comprehensive overview of the venom system in a widespread octocoral species with important ecological roles. The limitations of this study is that the toxicity and biological function of the identified venom components have not been confirmed experimentally. However, the localisation of the proteins to nematocysts is a very strong indication of being a venom component.

      My expertise: cnidarian venom (biochemistry, ecology and evolution).

    1. The good news comes from Joe Kelly, who investigated the large “ultrafast” response ofEarth’s energy imbalance (EEI) to a 2×CO2 forcing in the GISS (2020) GCM. Specifically,we reported in Pipeline that the initial 4 W/m2 imbalance dropped by about 30% (to 2.7W/m2) in year 1 after the forcing was imposed.

      for - more research on - Joe Kelly - climate crisis - cloud behavior - Jim Hansen

    1. match state-of-the-art constraints using significantly less data

      The annotated text "match state-of-the-art constraints using significantly less data" is highlighted to emphasize a key finding of the research paper. This statement underscores the efficiency and effectiveness of the new machine learning-based approach developed by the authors. Here’s a comprehensive explanation of its significance:

      Explanation

      The highlighted text is transformational because it showcases a breakthrough in the methodology for constraining warm dark matter (WDM) models. Traditional methods, which rely on Markov Chain Monte Carlo techniques to analyze the Lyman-α forest power spectrum, require a substantial amount of observational data to achieve similar constraints on WDM particle mass. In contrast, the new machine learning approach described in the paper achieves comparable results with up to 40 times less observational data.

      This efficiency gain is significant for several reasons:

      1. Data Efficiency: The ability to obtain accurate constraints with significantly less data reduces the observational resources needed, making the research more cost-effective and accessible.
      2. Speed and Scalability: Machine learning models, once trained, can process and analyze data more quickly than traditional methods, enabling faster scientific discoveries.
      3. Broad Applicability: The approach can be applied to various datasets with different noise levels and resolutions, demonstrating its robustness and versatility.
      4. Future Research: This method opens new avenues for exploring other cosmological parameters and models with reduced data requirements, potentially accelerating advancements in the field of astrophysics.

      In summary, the highlighted text points to the novel and efficient nature of the machine learning approach, which represents a significant advancement in the study of warm dark matter and cosmological density fields.

    2. statistical inference pipeline to recover WDM masses from the reconstructed ∆τ skewers

      The annotated text, "statistical inference pipeline to recover WDM masses from the reconstructed ∆τ skewers," is highlighted due to its significance in the context of the paper's novel methodology and findings. Here’s a comprehensive explanation focusing on its importance:

      Significance and Implications

      1. Novelty and Innovation:
      2. The highlighted text points to a key methodological advancement presented in the paper. The development of a "statistical inference pipeline" represents a new approach to analyzing astrophysical data, specifically targeting the reconstruction and interpretation of warm dark matter (WDM) properties from observed Lyman-α skewers.

      3. Machine Learning Integration:

      4. This pipeline leverages machine learning, particularly a Bayesian neural network, to reconstruct the optical depth-weighted density fields (∆τ). This is significant because it demonstrates the application of advanced computational techniques to enhance the precision and efficiency of cosmological studies.

      5. Efficiency and Data Utilization:

      6. The method described allows for the extraction of meaningful constraints on WDM particle masses using significantly less observational data compared to traditional techniques. This efficiency is transformational because it reduces the resource intensity of such studies, making it more feasible to conduct large-scale analyses.

      7. Implications for Dark Matter Research:

      8. By enabling the direct inference of WDM masses, this approach addresses a fundamental question in cosmology: the nature and properties of dark matter. Accurate constraints on WDM masses help refine our understanding of the universe's structure and evolution, potentially leading to breakthroughs in both cosmology and particle physics.

      Contextual Importance

      • Broader Research Context:
      • The paper situates this advancement within the broader context of ongoing efforts to understand dark matter. Traditional models like cold dark matter (CDM) have been extensively studied, but WDM remains less understood. This research contributes to filling that gap.

      • Comparison to Existing Methods:

      • The text highlights a comparison with existing state-of-the-art methods, emphasizing that the new pipeline can achieve comparable or better constraints with much less data. This comparison underscores the practical benefits and potential for widespread adoption of the new method.

      Conclusion

      The annotation highlights a crucial aspect of the paper's contribution to the field of astrophysics. The "statistical inference pipeline to recover WDM masses from the reconstructed ∆τ skewers" encapsulates the core innovation and its transformational potential in dark matter research, offering a more efficient, accurate, and data-sparing approach to a longstanding cosmological challenge.

    3. Bayesian neural network architecture that can do regression and predict an optical depth-weighted density field

      The highlighted text, "Bayesian neural network architecture that can do regression and predict an optical depth-weighted density field," is significant for several reasons:

      1. Introduction of a Novel Method: This phrase introduces a new application of Bayesian neural networks specifically tailored for a complex astrophysical problem. The use of Bayesian neural networks for regression tasks in this context is innovative, as it allows for the prediction of the optical depth-weighted density field, which is crucial for understanding the distribution of warm dark matter (WDM) in the universe.

      2. Enhanced Predictive Capability: The ability to predict the optical depth-weighted density field represents a significant advancement over traditional methods. Traditional approaches often rely on summary statistics of the Lyman-α forest flux, such as the power spectrum, to infer properties of the intergalactic medium (IGM) and dark matter. By contrast, this machine learning approach directly reconstructs the density field on a pixel-by-pixel basis, potentially yielding more precise and detailed insights.

      3. Quantification of Uncertainty: The use of a Bayesian framework is particularly noteworthy because it allows the model to not only make predictions but also to quantify the uncertainty associated with those predictions. This is crucial in scientific research, where understanding the confidence in one's results can significantly impact the interpretation and subsequent conclusions drawn from the data.

      4. Reduction in Data Requirements: As highlighted in the paper, this method allows for achieving comparable constraints on WDM particle mass using significantly less observational data than traditional methods. This efficiency can lead to more cost-effective research and faster advancements in the field.

      5. Implications for Cosmology and Astrophysics: The development and implementation of this Bayesian neural network architecture have broader implications for the field of cosmology and astrophysics. It showcases the potential of machine learning techniques to revolutionize the way complex astrophysical phenomena are studied and understood, paving the way for more advanced models and analyses in the future.

      In summary, the highlighted text signifies a transformational approach in astrophysical research, leveraging advanced machine learning techniques to enhance predictive accuracy, quantify uncertainties, and reduce data requirements, thereby pushing the boundaries of current methodologies in the study of dark matter and the intergalactic medium.

    4. match current state-of-the-art WDM particle mass constraints using up to ∼40 times less observational data

      The annotated text "match current state-of-the-art WDM particle mass constraints using up to ∼40 times less observational data" is highlighted to emphasize a significant and transformational finding in the paper. Here's a comprehensive explanation of why this finding is important:

      Significance and Implications

      1. Efficiency in Data Usage:
      2. The highlighted text underscores a major breakthrough in the efficiency of data usage. Traditional methods to constrain warm dark matter (WDM) particle mass typically require vast amounts of observational data. The new machine learning approach presented in this paper achieves comparable constraints with up to 40 times less data. This is a substantial improvement and can drastically reduce the time, effort, and resources needed for such studies.

      3. Advancement in Methodology:

      4. The use of machine learning, specifically a Bayesian neural network, represents a novel methodological advancement. This approach enables a pixel-by-pixel reconstruction of the intergalactic medium (IGM) density field from Lyman-α forest observations, which is more direct and potentially more accurate than previous methods that relied on summary statistics like the power spectrum.

      5. Potential for Broader Applications:

      6. This finding has broader implications for the field of cosmology and astrophysics. The ability to make accurate inferences with less data can open up new possibilities for studying the universe, particularly in areas where data is scarce or difficult to obtain.

      7. Validation and Robustness:

      8. The paper demonstrates that the machine learning model is robust and generalizes well to different datasets, including those generated by alternative hydrodynamical codes not included in the training data. This robustness adds credibility to the method and suggests it can be reliably applied to various observational datasets.

      Contextual Importance

      • Current State-of-the-Art:
      • The paper compares its findings to existing state-of-the-art constraints on WDM particle mass, highlighting that the new method does not compromise on accuracy despite the reduced data requirement. This comparison is crucial for validating the effectiveness of the new approach.

      • Relevance to Broader Research Goals:

      • The ability to constrain WDM particle masses more efficiently aligns with broader research goals in understanding the nature of dark matter, a fundamental question in both cosmology and particle physics. Efficient data usage can accelerate progress in this area and potentially lead to new discoveries.

      In summary, the annotated text is highlighted to showcase a key finding of the paper that represents a significant advancement in the methodology for constraining WDM particle mass. By achieving state-of-the-art results with substantially less observational data, the research opens up new avenues for efficient and accurate cosmological studies.

    5. accurately recover within a 1σ error 85% of the density fields

      The highlighted text "accurately recover within a 1σ error 85% of the density fields" is significant for several reasons:

      1. Validation of Methodology: This statement underscores the effectiveness and reliability of the Bayesian neural network model used in the study. Achieving an 85% accuracy within a 1σ error margin indicates that the model is robust and performs well in reconstructing the intergalactic medium (IGM) density fields from the Lyman-α forest data.

      2. Confidence in Predictions: The 1σ error margin is a standard measure of uncertainty in statistical analysis, typically representing a 68% confidence interval. By demonstrating that the model can recover 85% of the density fields within this margin, the authors provide strong evidence that their machine learning approach yields precise and reliable predictions.

      3. Implications for Warm Dark Matter (WDM) Constraints: The ability to accurately reconstruct the density fields is crucial for constraining the mass of potential warm dark matter particles. The high accuracy of the model means that the derived constraints on the WDM particle mass are likely to be robust and credible, which is a significant advancement in the field of cosmology and astrophysics.

      4. Comparison to Traditional Methods: The highlighted text also implies a potential advantage over traditional methods such as Markov Chain Monte Carlo techniques, which require more observational data to achieve similar levels of accuracy. This efficiency in data usage could make the new machine learning approach more practical and accessible for future research.

      5. Broader Impact: The success of this model in accurately recovering density fields can inspire further applications of machine learning in astrophysics, potentially leading to more breakthroughs in understanding the universe's structure and the nature of dark matter.

      In summary, the highlighted text emphasizes the high accuracy and reliability of the Bayesian neural network model in reconstructing IGM density fields, which is crucial for advancing the study of warm dark matter and demonstrates the potential of machine learning in astrophysical research.

    6. Bayesian neural network on the supervised regression task

      The annotated text, "Bayesian neural network on the supervised regression task," is highlighted to emphasize a key methodological innovation in the research presented in the paper. Here’s a comprehensive explanation of its significance:

      Explanation

      The highlighted phrase "Bayesian neural network on the supervised regression task" is significant because it underscores a novel approach in the study of cosmological density fields and warm dark matter (WDM) models. By leveraging a Bayesian neural network, the authors introduce a machine learning technique that can perform a supervised regression task to predict the optical depth-weighted density field (Δτ) from the Lyman-α forest flux field. This method offers several transformational benefits:

      1. Enhanced Predictive Accuracy: The Bayesian neural network is trained to predict not only the density field but also the uncertainty in its reconstruction. This dual prediction capability allows for more robust and reliable inferences about the underlying cosmological structures.

      2. Efficient Data Utilization: The machine learning approach enables the extraction of meaningful constraints on WDM particle masses using significantly less observational data compared to traditional methods like Markov Chain Monte Carlo (MCMC) techniques. This efficiency is crucial for advancing our understanding of dark matter with limited data resources.

      3. Innovative Statistical Inference: The implementation of a Bayesian framework allows the researchers to quantify and propagate uncertainties through the statistical analysis pipeline. This results in more precise and credible constraints on the properties of dark matter particles.

      4. Validation on Diverse Datasets: The neural network's performance is validated against both the Sherwood-Relics simulation suite and the Nyx simulation code, demonstrating its generalization capability across different hydrodynamical solvers and physical prescriptions.

      Overall, the use of a Bayesian neural network for supervised regression in this context represents a significant methodological advancement in astrophysics and cosmology. It enables more accurate reconstructions of the intergalactic medium density field and provides tighter constraints on warm dark matter models, thus contributing to our broader understanding of the universe's structure and composition.

    7. constrain the mass of a potential warm dark matter particle

      The annotated text "constrain the mass of a potential warm dark matter particle" is highlighted to emphasize a significant objective of the research paper. This objective is transformative for several reasons:

      1. Novel Application of Machine Learning: The paper introduces a machine-learning approach, specifically a Bayesian neural network, to reconstruct the intergalactic medium (IGM) density field from Lyman-α forest data. This innovative method allows for a more precise and detailed analysis of the density field compared to traditional methods.

      2. Direct Constraints on Warm Dark Matter (WDM): By using the reconstructed density fields, the researchers aim to directly constrain the mass of warm dark matter particles. This is a significant advancement because WDM particles have different properties compared to cold dark matter (CDM) particles, and their mass is a crucial parameter for understanding the nature of dark matter and its role in the formation and evolution of cosmic structures.

      3. Efficiency and Data Utilization: The approach described in the paper claims to achieve comparable constraints on WDM particle mass using significantly less observational data (up to 40 times less) than traditional methods like Markov Chain Monte Carlo techniques based on the Lyman-α forest power spectrum. This efficiency could revolutionize the field by reducing the time and resources needed to obtain meaningful results.

      4. Broader Implications for Cosmology and Particle Physics: Constraining the mass of WDM particles has broader implications for both cosmology and particle physics. It helps refine models of the universe's large-scale structure and provides insights into the properties of dark matter, which remains one of the most significant unsolved problems in modern physics.

      In summary, the highlighted text underscores a key aim of the research that leverages advanced machine learning techniques to make substantial progress in the field of cosmology by providing more efficient and direct constraints on the properties of warm dark matter.

    8. machine-learning approach that allows for a pixel-by-pixel reconstruction

      The annotated text "machine-learning approach that allows for a pixel-by-pixel reconstruction" is highlighted because it introduces a novel and significant methodological advancement in the paper. Here’s a comprehensive explanation of why this is transformational:

      1. Significance of Pixel-by-Pixel Reconstruction:
      2. Precision and Detail: The approach allows for an unprecedented level of detail in reconstructing the intergalactic medium (IGM) density field. Instead of averaging over larger regions, the method can analyze and reconstruct the density at each individual pixel, leading to more precise and granular insights.

      3. Implications for Warm Dark Matter (WDM) Research:

      4. Enhanced Constraints: By reconstructing the density field at such a detailed level, the method improves the ability to constrain the mass of potential warm dark matter particles. This is a critical advancement because it allows researchers to derive constraints directly from the density field, potentially leading to more accurate and robust results than previous methods.

      5. Use of Machine Learning:

      6. Efficiency and Scalability: The use of a machine-learning model, specifically a Bayesian neural network, allows for efficient processing of large datasets. This is transformational because it can handle the complexity and volume of data from Lyman-α forests more effectively than traditional statistical methods.
      7. Uncertainty Quantification: The Bayesian aspect of the neural network provides a way to quantify uncertainties in the reconstruction, which is crucial for scientific rigor and confidence in the results.

      8. Context within the Paper:

      9. Innovative Methodology: The annotated text is part of a broader effort in the paper to leverage modern computational techniques to address longstanding questions in cosmology and astrophysics. This methodological innovation represents a significant step forward in the field.
      10. Comparison to Traditional Methods: The paper contrasts this new approach with traditional methods, such as Markov Chain Monte Carlo techniques, highlighting that the new method can achieve comparable or better constraints with significantly less observational data (up to ~40 times less).

      In summary, the annotated text is highlighted because it encapsulates the core innovation of the paper: a detailed, efficient, and precise machine-learning approach to reconstructing the IGM density field, which has significant implications for constraining warm dark matter models. This advancement is transformational in its potential to enhance the precision and efficiency of cosmological research.

    Annotators

    1. findbar

      The annotated text "findbar" appears in a context where the user is interacting with a PDF document, specifically within the user interface elements of a PDF reader application. Here is a comprehensive explanation of why this specific portion might have been highlighted:

      Explanation

      The term "findbar" is highlighted within a list of PDF reader functionalities. This annotation likely points to the "findbar" feature, which is a tool used for searching text within the document. This feature is significant for several reasons:

      1. Utility for Users: The "findbar" is a critical tool for users who need to quickly locate specific information within a PDF, especially in lengthy documents. It enhances the user experience by providing an efficient way to navigate through text.

      2. Context of User Query: Given the user’s question about annotating new and novel findings in a paper, the "findbar" tool is particularly relevant. It allows users to search for specific terms or phrases related to novel findings, making it easier to identify and annotate significant portions of the text.

      3. Implications for Document Interaction: Highlighting the "findbar" underscores the importance of search functionalities in modern document management. It reflects the shift towards more interactive and user-friendly digital reading experiences, where users can engage with content more dynamically.

      4. Broader Technological Context: The presence of the "findbar" is indicative of broader trends in software development aimed at improving accessibility and usability. It represents how digital tools are evolving to meet the needs of users who require quick access to specific information in large datasets or documents.

      Contextual Interpretation

      In the broader context of the user’s query about annotating new findings, the "findbar" is a transformative tool. It allows for the precise identification of novel findings within a document, making the process of annotation more efficient and effective. This capability is transformational because it leverages technology to enhance the way users interact with and analyze text, ultimately leading to a more streamlined and productive workflow.

      By focusing on the "findbar," we see a microcosm of the larger technological advancements in document management and user interface design. It highlights the ongoing efforts to make information retrieval more intuitive and accessible, which is crucial for researchers, students, and professionals who rely on quick and accurate data access.

      Conclusion

      The highlighting of "findbar" is significant as it points to an essential tool within the PDF reader that greatly aids in the process of finding and annotating specific text. This feature is transformational in the context of document interaction, enhancing user efficiency and reflecting broader technological trends aimed at improving accessibility and usability in digital environments.

    2. Highlight all

      Explanation:

      The annotated text "Highlight all" has been selected from a larger context that appears to be a user interface or a set of instructions related to navigating and interacting with a PDF document. The phrase "Highlight all" is significant because it represents a command or function within the PDF viewer that allows users to highlight all instances of a searched term or phrase within the document.

      Significance and Implications:

      1. Functionality Insight: The "Highlight all" feature is crucial for users who need to quickly identify and review all occurrences of specific terms or phrases within a document. This can be particularly useful for researchers, students, or professionals who are analyzing large texts and need to find relevant information efficiently.

      2. User Experience: Highlighting all instances of a term improves the user experience by saving time and reducing the effort required to manually search through the document. It enhances readability and accessibility, making it easier to locate and focus on important content.

      3. Contextual Relevance: In the context of the user's question, which asks for an annotation of novel findings in a paper, the ability to "Highlight all" could be transformational. It allows the reader to pinpoint every mention of new and novel findings, ensuring that none are overlooked. This feature supports thorough analysis and comprehension of the document’s content.

      4. Broader Implications: The "Highlight all" function also has broader implications for digital literacy and information management. It exemplifies how digital tools can enhance traditional reading and research methods, offering new ways to interact with and analyze text.

      Contextual Interpretation:

      Given that the user is asking for an annotation of new and novel findings in a paper, the highlighted "Highlight all" command underscores a tool that can facilitate this process. It suggests that the user can leverage this feature to efficiently locate and annotate significant findings throughout the document, thus transforming their approach to document analysis.

      In summary, the "Highlight all" function is highlighted because it represents a pivotal tool for efficient document navigation and analysis. Its significance lies in its ability to enhance user experience, support thorough research, and exemplify the advantages of digital tools in managing and interacting with text.

    3. Next

      The provided text to analyze is a technical interface description of a PDF viewer, detailing various functionalities such as navigating through pages, zooming in and out, selecting text, and more. The annotated text is the single word "Next," which appears to be a part of the navigation instructions within the PDF viewer.

      Explanation:

      The annotation "Next" is highlighted to emphasize its role in the user interface of the PDF viewer. This term is significant because it represents a common action that users frequently perform when interacting with digital documents—moving to the subsequent page or item. Here's a comprehensive analysis of its importance:

      1. User Experience: "Next" is a crucial part of the navigation system in any PDF viewer. It allows users to move forward through the document, which is essential for reading or reviewing content sequentially. Highlighting this term underscores the importance of intuitive navigation in enhancing user experience.

      2. Functionality: The presence of a "Next" button or option indicates that the PDF viewer is designed to handle multi-page documents efficiently. This functionality is fundamental for users who need to browse through extensive documents, such as research papers, e-books, or reports.

      3. Contextual Relevance: In the broader context of the interface description, "Next" is part of a series of commands that facilitate document interaction. Highlighting it draws attention to the navigational tools available to the user, which are critical for effective document management.

      4. Implications for Usability: The ease of navigation directly impacts the usability of the PDF viewer. A well-placed and easily identifiable "Next" button ensures that users can move through the document without frustration, thereby improving overall satisfaction and productivity.

      5. Technical Documentation: The detailed description of the PDF viewer's functionalities, including the "Next" command, serves as a technical guide for users. Highlighting this term can indicate its importance in the documentation, ensuring that users are aware of how to utilize this feature effectively.

      In summary, the annotated text "Next" is highlighted to emphasize its critical role in the navigation and usability of the PDF viewer. It is a fundamental feature that enhances user experience by enabling straightforward document traversal, which is essential for efficient reading and interaction with digital documents.

    4. Document Outline

      The provided annotated text is a snippet from a PDF document interface, specifically highlighting the phrase "Document Outline." To understand and explain why this phrase is significant, let's follow the outlined steps:

      1. Understand the User Question: The user is asking for an annotation of novel findings in a paper and an explanation of their transformational nature. The annotation should focus on identifying and explaining key discoveries or insights.

      2. Read the Full Text: The full text provided appears to be a list of interface elements from a PDF viewer, not the content of a research paper itself. This context is crucial as it indicates the user might be referring to the structure or navigational aspects of a document rather than its substantive content.

      3. Focus on the Annotated Text: The specific highlighted phrase is "Document Outline."

      4. Analyze the Annotation: The annotation "Document Outline" likely points to a feature within the PDF viewer that helps users navigate the document. This feature is essential for understanding the structure and organization of a lengthy or complex document.

      5. Contextual Interpretation: In the context of a research paper, the "Document Outline" is a tool that provides an overview of the paper's structure, including sections, headings, and subheadings. This is crucial for readers to quickly locate specific sections of interest and understand the flow of the document.

      6. Formulate Explanation:

      Explanation: The annotation "Document Outline" is significant because it highlights a feature within the PDF viewer that aids in the navigation and comprehension of the document. For a research paper, the document outline serves as a roadmap, allowing readers to efficiently access and understand the structure of the paper. This is especially transformational in lengthy or complex documents, where quickly finding and referencing specific sections can significantly enhance the reader's ability to engage with and comprehend the material.

      By focusing on the "Document Outline," the annotation underscores the importance of document navigation tools in facilitating a more effective and user-friendly reading experience. This feature can be particularly transformational in academic and professional settings, where the ability to quickly access relevant information can save time and improve the overall utility of the document.

    1. Reviewer #3 (Public review):

      Summary:

      The paper by Li et al explored the role of Estrogen receptor 1 (Esr1) expressing neurons in the pontine micturition center (PMC), a brainstem region also known as Barrington's nucleus (Hou et al 2016, Keller et al 2018). First, the author conducted bulk Ca2+ imaging/unit recording from PMCESR1 to investigate the correlations of PMCESR1 neural activity to voiding behavior in conscious mice and bladder pressure/external urethral muscle activity in urethane anesthetized mice. Next, the authors conducted optogenetics inactivation/activation of PMCESR1 to confirm the contribution to the voiding behavior also conducted peripheral nerve transection together with optogenetics activation to confirm the independent control of bladder pressure and urethral sphincter muscle.

      Weaknesses:

      (1) The study demonstrates that pelvic nerve transection reduces urinary volume triggered by PMCESR1+ cell photoactivation in freely moving mice. Could the role of pudendal nerve transection also be examined in awake mice to provide a more comprehensive understanding of neural involvement?

      (2) While the paper primarily focuses on PMCESR1+ cells in bladder-sphincter coordination, the analysis of PMCESR1+-DGC/SPN neural circuits - given their distinct anatomical projections in the sacral spinal cord - feels underexplored. How do these circuits influence bladder and sphincter function when activated or inhibited? Also, do you have any tracing data to confirm whether bladder-sphincter innervation comes from distinct spinal nuclei?

      (3) Although the paper successfully identifies the physiological role of PMCESR1+ cells in bladder-sphincter coordination, the study falls short in examining the electrophysiological properties of PMCESR1+-DGC/SPN cells. A deeper investigation here would strengthen the findings.

      (4) The parameters for photoactivation (blue light pulses delivered at 25 Hz for 15 ms, every 30 s) and photoinhibition (pulses at 50 Hz for 20 ms) vary. What drove the selection of these specific parameters? Moreover, for photoactivation experiments, the change in pressure (ΔP = P5 sec - P0 sec) is calculated differently from photoinhibition (Δpressure = Ppeak - Pmin). Can you clarify the reasoning behind these differing approaches?

      (5) The discussion could further emphasize how PMCESR1+ cells coordinate bladder contraction and sphincter relaxation to control urination, highlighting their central role in the initiation and suspension of this process.

      (6) In Figure 8, The authors analyze the temporal sequence of bladder pressure and EUS bursting during natural voiding and PMC activation-induced voiding. It would be acceptable to consider the existence of a lower spinal reflex circuit, however, the interpretation of the data contains speculation. Bladder pressure measurement is hard to say reflecting efferent pelvic nerve activity in real time. (As a biological system, bladder contraction is mediated by smooth muscle, and does not reflect real-time efferent pelvic nerve activity. As an experimental set-up, bladder pressure measurement has some delays to reflect bladder pressure because of tubing, but EUS bursting has no delay.) Especially for the inactivation experiment, these factors would contribute to the interpretation of data. This reviewer recommends a rewrite of the section considering these limitations. Most of the section is suitable for the results.

    2. Author response:

      Reviewer #1 (Public review):

      Summary:

      Urination requires precise coordination between the bladder and external urethral sphincter (EUS), while the neural substrates controlling this coordination remain poorly understood. In this study, Li et al. identify estrogen receptor 1-expressing neurons (ESR1+) in Barrington's nucleus as key regulators that faithfully initiate or suspend urination. Results from peripheral nerve lesions suggest that BarEsr1 neurons play independent roles in controlling bladder contraction and relaxation of the EUS. Finally, the authors performed region-specific retrograde tracing, claiming that distinct populations of BarEsr1 neurons target specific spinal nuclei involved in regulating the bladder and EUS, respectively.

      Strength:

      Overall, the work is of high quality. The authors integrate several cutting-edge technologies and sophisticated, thorough analyses, including opto-tagged single unit recordings, combined optogenetics, and urodynamics, particularly those following distinct peripheral nerve lesions.

      Weakness:

      (1) My major concern is the novelty of this study. Keller et al. 2018 have shown that BarEsr1 neurons are active during urination and play an essential role in relaxing the external urethral sphincter (EUS). Minimally, substantial content that merely confirms previous findings (e.g. Figures 1A-E; Figures 3A-E) should be move to the supplementary datasets.

      Indeed, we are aware of and have carefully studied the literature of Keller et al. Our manuscript here presents novel experiments beyond the scopes of that paper. Thanks to this comment, we will substantially revise our manuscript to enhance the visibility of novel data while keeping the agreeing data in the supplementary.

      (2) I also have concerns regarding the results showing that the inactivation of BarEsr1 neurons led to the cessation of EUS muscle firing (Figures 2G and S5C). As shown in the cartoon illustration of Figure 8, spinal projections of BarEsr1 neurons contact interneurons (presumably inhibitory) that innervate motor neurons, which in turn excite the EUS. I would therefore expect that the inactivation of BarEsr1 should shift the EUS firing pattern from phasic (as relaxation) to tonic (removal of relaxation), rather than stopping their firing entirely. Could the authors comment on this and provide potential reasons or mechanisms for this finding?

      We agree with this point. We meant that the EUS’ phasic bursting pattern was rapidly stopped upon BarEsr1 photoinhibition, but not all the firing stopped instantaneously. According to the previous studies (Chang et al., 2007, de Groat, 2009, de Groat and Yoshimura, 2015, Kadekawa et al., 2016), the voiding physiology of rodents is probably different from that of humans, such that for rodents the urine is step-wise pumped out in the gap time between multiple consecutive EUS phasic bursting epochs, and for humans the urine is continuously pumped out once the EUS firing is almost fully inhibition during a period of time. Namely, for mice, the EUS display sustained tonic activity following phasic bursting, while, in contrast, for humans the EUS keeps tonic firing until the moment of voiding onset (complete inhibition, muscle relaxed). Despite the prominent differences in the basic physiological properties, our assumption is that the logic of circuits from the brainstem to the urethra in this pathway is evolutionally conserved for both species; thus the logic of brainstem coordination of voiding could also be the same for both species, which is the main interest of our study (of using an animal model to address concerns of human health). Thus, to interpret our data for a broader audience we made a simplified and inaccurate expression. We apologize for the inaccuracy and we will correct our previous inaccurate description in the revised manuscript.

      (3) Current evidence is insufficient to support the claim that the majority of BarEsr1 neurons innervate the SPN but not DGC. The current spinal images are uninformative, as the fluorescence reflects the distribution of Esr1- or Crh-expressing neurons in the spinal cord, along with descending BarEsr1 or BarCrh axons. Given the close anatomical proximity of these two nuclei, a more thorough histological analysis is required to demonstrate that the spinal injections were accurately confined to either the SPN or the DGC.

      We agree that current evidence is insufficient to support the current claim. To address this concern and strengthen our claim, we will repeat the retrograde viral tracing experiments, combined with CTB647 injections to label the injection site, to validate specific targeting of SPN or DGC populations. We will also add higher-magnification imaging to distinguish BarESR1 axonal projections targeting SPN versus DGC. Results from these ongoing experiments will be incorporated into the revised manuscript.

      Reviewer #2 (Public review):

      Summary:

      The authors have performed a rigorous study to assess the role of ESR1+ neurons in the PMC to control the coordination of bladder and sphincter muscles during urination. This is an important extension of previous work defining the role of these brainstem neurons, and convincingly adds to the understanding of their role as master regulators of urination. This is a thorough, well-done study that clarifies how the Pontine micturition center coordinates different muscle groups for efficient urination, but there are some questions and considerations that remain.

      Strengths:

      These data are thorough and convincing in showing that ESR1+PMC neurons exert coordinated control over both the bladder and sphincter activity, which is essential for efficient urination. The anatomical distinctions in pelvic versus pudendal control are clear, and it's an advance to understand how this coordination occurs. This work offers a clearer picture of how micturition is driven.

      Weaknesses:

      The dynamics of how this population of ESR1+ neurons is engaged in natural urination events remains unclear. Not all ESR1+neurons are always engaged, and it is not measured whether this is simply variation in population activity, or if more neurons are engaged during more intense starting bladder pressures, for instance. In particular, the response dynamics of single and doubly-projecting neurons are not defined. Additionally, the model for how these neurons coordinate with CRH+ neuron activity in the PMC is not addressed, although these cell types seem to be engaged at the same time. Lastly, it would be interesting to know how sensory input can likely modulate the activity of these neurons, but this is perhaps a future direction.

      In response to the reviewer’s comments, we will attempt perform the following revisions for this round:

      (1) Engagement of ESR1+ neurons in natural urination events:

      We agree that probably not all ESR1+ neurons are consistently engaged during urination. To address this, we will perform a detailed analysis of the opto-tagged single unit recordings data.

      (2) Response dynamics of single- and doubly-projecting neurons:

      (a) We will use retrograde labelling combined with Ca2+ photometry recordings to differentiate the response dynamics of SPN- and DGC-projecting neurons during urination.

      (b) We will perform functional validations to assess the specific roles of single- and doubly-projecting neurons in coordinating bladder and EUS activity.

      (3) Coordination with CRH+ neurons in the PMC:<br /> We appreciate the suggestion to include CRH+ neurons in our model. We will expand our model to incorporate CRH+ neurons and their potential interactions with ESR1+ neurons.

      (4) Sensory modulation of ESR1+ neurons:<br /> The reviewer raises an excellent point regarding sensory input modulation of ESR1+ neuron activity. Although this is beyond the scope of our current study, we recognize its importance and propose to include this as a future direction.

      Reviewer #3 (Public review):

      Summary:

      The paper by Li et al explored the role of Estrogen receptor 1 (Esr1) expressing neurons in the pontine micturition center (PMC), a brainstem region also known as Barrington's nucleus (Hou et al 2016, Keller et al 2018). First, the author conducted bulk Ca2+ imaging/unit recording from PMCESR1 to investigate the correlations of PMCESR1 neural activity to voiding behavior in conscious mice and bladder pressure/external urethral muscle activity in urethane anesthetized mice. Next, the authors conducted optogenetics inactivation/activation of PMCESR1 to confirm the contribution to the voiding behavior also conducted peripheral nerve transection together with optogenetics activation to confirm the independent control of bladder pressure and urethral sphincter muscle.

      Weaknesses:

      (1) The study demonstrates that pelvic nerve transection reduces urinary volume triggered by PMCESR1+ cell photoactivation in freely moving mice. Could the role of pudendal nerve transection also be examined in awake mice to provide a more comprehensive understanding of neural involvement?

      Thank you for the suggestion, the pudendal nerve transection in awake mice is indeed a challenging experiment that has been missed. We will try it for the revision.

      (2) While the paper primarily focuses on PMCESR1+ cells in bladder-sphincter coordination, the analysis of PMCESR1+-DGC/SPN neural circuits - given their distinct anatomical projections in the sacral spinal cord - feels underexplored. How do these circuits influence bladder and sphincter function when activated or inhibited? Also, do you have any tracing data to confirm whether bladder-sphincter innervation comes from distinct spinal nuclei?

      Thank you for this great comment. The projection-specific neuronal function analysis is, as also suggested by Reviewer 2 in a similar comment (#8), missing in our first submission. These are so challenging experiments that we have missed in the first round of tests, but we decide to pursuit this goal again. Namely, we will perform photometry recordings of PMC neurons projecting to the DGC/SPN during measuring bladder pressure and urethral sphincter EMG activity. Additionally, while our study does not include direct tracing data to confirm distinct spinal nuclei for bladder and sphincter innervation, this has been well-documented in classic literature (Yao et al., 2018, Karnup and De Groat, 2020, Karnup, 2021). Specifically, anatomical studies have shown that SPN primarily innervates the bladder, while the DGC is associated with the innervation of the urethral sphincter. We will cite these references to provide context and support for our interpretations.

      (3) Although the paper successfully identifies the physiological role of PMCESR1+ cells in bladder-sphincter coordination, the study falls short in examining the electrophysiological properties of PMCESR1+-DGC/SPN cells. A deeper investigation here would strengthen the findings.

      While our study primarily focuses on the functional role of PMCESR1+ neurons in bladder-sphincter coordination, we acknowledge that understanding their intrinsic electrophysiological characteristics could further strengthen our findings. However, this aspect falls beyond the scope of the current study. Nevertheless, we recognize the significance of this direction and are excited to pursue it in future research. We appreciate the reviewer’s suggestion, as it highlights an important avenue for expanding upon our current findings.

      (4) The parameters for photoactivation (blue light pulses delivered at 25 Hz for 15 ms, every 30 s) and photoinhibition (pulses at 50 Hz for 20 ms) vary. What drove the selection of these specific parameters? Moreover, for photoactivation experiments, the change in pressure (ΔP = P5 sec - P0 sec) is calculated differently from photoinhibition (Δpressure = Ppeak - Pmin). Can you clarify the reasoning behind these differing approaches?

      We sincerely thank the reviewer for raising these important points and for the opportunity to clarify our experimental design and data analysis methods.

      Photoactivation versus photoinhibition parameters: The differences in photoactivation (25 Hz, 15 ms pulses) and photoinhibition (50 Hz, 20 ms pulses) protocols are based on the distinct physiological and technical requirements for activating versus inhibiting PMCESR1+ neurons. For photoactivation, 25 Hz stimulation aligns with the natural firing patterns of central neurons, allowing for intermittent activation without exceeding the neuronal refractory period. The shorter pulse duration (15 ms) minimizes phototoxicity and avoids overstimulation, as performed in previous studies (Keller et al., 2018). In contrast, photoinhibition requires sustained suppression of neuronal activity, achieved through higher frequencies (50 Hz) and longer pulses (20 ms) to ensure continuous coverage of neuronal activity.

      Calculation of pressure changes (ΔP) for photoactivation and photoinhibition: The differing methods for calculating pressure changes reflect the distinct physiological effects we aimed to capture. In photoactivation experiments (ΔP = P5 sec - P0 sec), the pressures before (P0 sec) and 5 seconds after (P5 sec) light delivery were compared to capture the immediate effect of light activation on bladder pressure, focusing on the onset and early dynamics of activation. In contrast, photoinhibition experiments assessed the immediate impact of light-induced suppression on bladder pressure during an ongoing voiding event. Here, Δpressure was calculated as Ppeak – Pmin to measure the rapid drop in pressure directly attributable to neuronal inhibition.

      We will expand these details in the methods section of the revised manuscript to provide greater transparency.

      (5) The discussion could further emphasize how PMCESR1+ cells coordinate bladder contraction and sphincter relaxation to control urination, highlighting their central role in the initiation and suspension of this process.

      We fully agree with this point. Additionally, in response to your and other reviewers’ suggestions, we are preparing a new round of experiments with projection-specific recording, and thus our discussion and conclusion will also be updated according to the newly obtained data.

      (6) In Figure 8, The authors analyze the temporal sequence of bladder pressure and EUS bursting during natural voiding and PMC activation-induced voiding. It would be acceptable to consider the existence of a lower spinal reflex circuit, however, the interpretation of the data contains speculation. Bladder pressure measurement is hard to say reflecting efferent pelvic nerve activity in real time. (As a biological system, bladder contraction is mediated by smooth muscle, and does not reflect real-time efferent pelvic nerve activity. As an experimental set-up, bladder pressure measurement has some delays to reflect bladder pressure because of tubing, but EUS bursting has no delay.) Especially for the inactivation experiment, these factors would contribute to the interpretation of data. This reviewer recommends a rewrite of the section considering these limitations. Most of the section is suitable for the results.

      Thank you for mentioning the possibility of bladder pressure measurement delay. We would prefer to perform a physical control test to quantify how much delay this measurement is under our experimental conditions. We will use a small ballon to mimic the bladder and use two identical pressure sensors, one with a very short tube inserted into the ballon and one with an extended tube same as in our animal experiments. We will then mimic both contraction initiation and halting, and quantify the delay between the two sensors.

      References

      • Chang HY, Cheng CL, Chen JJJ, de Groat WC. 2007. Serotonergic drugs and spinal cord transections indicate that different spinal circuits are involved in external urethral sphincter activity in rats. American Journal of Physiology-Renal Physiology 292: F1044-F1053. DOI: 10.1152/ajprenal.00175.2006

      • de Groat WC. 2009. Integrative control of the lower urinary tract: preclinical perspective. British Journal of Pharmacology 147. DOI: 10.1038/sj.bjp.0706604

      • de Groat WC, Yoshimura N. 2015. Anatomy and physiology of the lower urinary tract. Handb Clin Neurol 130: 61-108. DOI: 10.1016/B978-0-444-63247-0.00005-5

      • Kadekawa K, Yoshimura N, Majima T, Wada N, Shimizu T, Birder LA, Kanai AJ, de Groat WC, Sugaya K, Yoshiyama M. 2016. Characterization of bladder and external urethral activity in mice with or without spinal cord injury—a comparison study with rats. American Journal of Physiology-Regulatory, Integrative and Comparative Physiology 310: R752-R758. DOI: 10.1152/ajpregu.00450.2015

      • Karnup S. 2021. Spinal interneurons of the lower urinary tract circuits. Autonomic Neuroscience 235. DOI: 10.1016/j.autneu.2021.102861

      • Karnup SV, De Groat WC. 2020. Mapping of spinal interneurons involved in regulation of the lower urinary tract in juvenile male rats. IBRO Rep 9: 115-131. DOI: 10.1016/j.ibror.2020.07.002

      • Keller JA, Chen J, Simpson S, Wang EH-J, Lilascharoen V, George O, Lim BK, Stowers L. 2018. Voluntary urination control by brainstem neurons that relax the urethral sphincter. Nature Neuroscience 21: 1229-1238. DOI: 10.1038/s41593-018-0204-3             

      • Yao J, Zhang Q, Liao X, Li Q, Liang S, Li X, Zhang Y, Li X, Wang H, Qin H, Wang M, Li J, Zhang J, He W, Zhang W, Li T, Xu F, Gong H, Jia H, Xu X, Yan J, Chen X. 2018. A corticopontine circuit for initiation of urination. Nature Neuroscience 21: 1541-1550. DOI: 10.1038/s41593-018-0256-4

    1. Author response:

      The following is the authors’ response to the original reviews.

      Author Response

      Reviewer #1 (Public Review): 

      Weaknesses: 

      - Having demonstrated that NK cell IFNgamma is important for recruiting and activating DCs and T cells in their model, one is left to wonder whether it is important for the therapeutic effect, which was not tested. 

      We conducted a preliminary study to compare the pro-survival effect of WT NK and Ifng-/- NK cell therapies. We found that, in the 95-500 mg day-21 tumor group, the overall survival (OS) of mice receiving Ifng-/- NK cell therapy significantly decreased (p = 0.045) compared to mice receiving WT NK cell therapy up to 60 days after tumor inoculation, but there was no difference in OS beyond 65 days after tumor inoculation. Therefore, we have added the following sentences at the end of the second paragraph in our Discussion (Page 32):

      “However, although Ifng-/- NK cells induced less cDC activation compared to WT NK cells, the levels of CD86 on cDCs of mice that received Ifng-/- NK cells were higher than those of mice not subjected to NK cell transfer (Figure 4B). This outcome indicates the presence of IFN-g-independent or/and compensatory mechanism(s) for cDC activation by the transferred NK cells, which is in line with our preliminary result that Ifng-/- NK cell therapy does not significantly diminish the pro-survival effect in comparison to WT NK cell therapy beyond 60 days after tumor cell inoculation (data not shown).”

      - It was somewhat difficult to gauge the clinical trial results because the trial was early stage and therefore not controlled. Evaluation of the results therefore relies on historical comparisons. To evaluate how encouraging the results are, it would be valuable for the authors to provide some context on the prognoses and likely disease progression of these patients at the time of treatment. 

      We had already indicated in our Results that all six patients had an ECOG performance status of 0 (Page 25 and Table). We have now added in the Results that they had “a predicted survival of >3 months” (Page 25).

      Reviewer #1 (Recommendations For The Authors):

      Minor points: 

      (1) It would be helpful if the authors provided a rationale for why they derived their NK cell product from bone marrow cells instead of the more common source, spleen cells. 

      We now clarify that: “We used BM cells instead of splenocytes for NK cell culture because removal of T cells from BM cells before culturing is not necessary” (Page 35) to the section Ex vivo expansion of murine and human NK cells in our Materials and Methods.

      (2) It would have been helpful to provide summary results from replicates of the cytokine production data shown in Figure 1F. 

      We have now added a graphical panel on the relative ΔMFI of two independent experiments to Figure 1F and revised the figure legend accordingly (Page 7—8).

      (3) The role of conventional CD4+ T cells is a little unclear. The authors state in the discussion that they contribute to the antitumor response, which is consistent with their finding that depleting both CD4 T cells and CD8 T cells has a greater effect than depleting CD8 T cells. Depleting CD4 T cells alone trended towards improving the response, however. Probably Tregs are the culprit in the latter effect but a sentence or two would be helpful if the claim for a protective role for CD4 T cells is to remain.  

      We have now re-analyzed the data of Figure 3D by separating mice into two groups according to day 21 tumor weight, i.e., 95-600 mg and >600 mg (Page 13—14). We have revised our explanation of the Figure 3D data in the Results (Page 11—12) as follows:

      “Accordingly, we examined the role of T cells in NK cell therapy by depleting T cell subsets with antiCD4 or/and anti-CD8 antibodies two days before primary tumor resection (Figure 3D Schema and Figure 3-figure supplement 1). In the 95-600 mg tumor group, depletion of CD8+ cells alone or both CD4+ and CD8+ cells diminished the effect of NK cell therapy, whereas depletion of CD4+ cells alone did not affect OS (Figure 3D). This result indicates that CD8+ T cells are essential for the effect of NK cell therapy. In contrast, the >600 mg tumor group displayed a limited NK-cell treatment effect as expected, but did exhibit improved OS upon depleting CD4+ cells alone (Figure 3D). As the proportion of lung Foxp3+CD4+ T cells in CD45+ cells positively correlated with day 21 tumor weight (data not shown), depletion of Foxp3+CD4+ T cells by anti-CD4 antibody likely has a stronger effect in augmenting the immune response for the >600 mg tumor group than the 95-600 mg tumor group. Moreover, both tumor groups showed diminished OS upon depletion of both CD4+ and CD8+ cells than was the case for depletion of CD8+ cells alone, indicating a CD8+ T cell-independent anti-tumor effect of CD4+ T cells (Figure 3D).”

      (4) The schema in Figure 3E states that mice were inoculated with either EO771 tumor cells or B16F10 tumor cells, but it appears that the data only show EO771 tumor challenges. This should be corrected. 

      Corrected according to the reviewer’s comment.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews: 

      Reviewer #1 (Public Review): 

      Summary: 

      Previous work has shown that the evolutionarily-conserved division-orienting protein LGN/Pins (vertebrates/flies) participates in division orientation across a variety of cell types, perhaps most importantly those that undergo asymmetric divisions. Micromere formation in echinoids relies on asymmetric cell division at the 16-cell stage, and these authors previously demonstrated a role for the LGN/Pins homolog AGS in that ACD process. Here they extend that work by investigating and exploiting the question of why echinoids but not other echinoderms form micromeres. Starting with a phylogenetics approach, they determine that much of the difference in ACD and micromere formation in echinoids can be attributed to differences in the AGS Cterminus, in particular a GoLoco domain (GL1) that is missing in most other echinoderms.

      Thank you for the summary.

      Strengths: 

      There is a lot to like about this paper. It represents a superlative match of the problem with the model system and the findings it reports are a valuable addition to the literature. It is also an impressively thorough study; the authors should be commended for using a combination of experimental approaches (and consequently generating a mountain of data). 

      Thank you.

      Weaknesses: 

      There is an intriguing finding described in Figure 1. AGS in sea cucumbers looks identical to AGS in the pencil urchin, at least at the C terminus (including the GL1 domain). Nevertheless, there are no micromeres in sea cucumbers. Therefore another mechanism besides GL motif organization has arisen to support micromere formation. It is a consequential finding and an important consideration in interpreting the data, but I could not find any mention of it in the text. That is a missed opportunity and should be remedied, ideally not only through discussion but also experimentation. Specifically: does sea cucumber AGS (SbAGS) ever localize to the vegetal cortex in sea cucumbers? Can it do so in echinoids? Will that support micromere formation? 

      Thank you for pointing this out. 

      To respond to the Reviewer’s request, we synthesized sea cucumber (Sb) AGS based on the sequence available in the database and tested it in the sea urchin (Sp) embryos, which is enclosed in Fig. S3. We performed this experiment to confirm that SbAGS localizes less at the vegetal cortex than SpAGS as a proof of principle. However, we hesitate to conduct further studies using the synthetic sequence in this study. Sea cucumbers are an emerging yet understudied model. This species is not readily available or established as a model system for embryology. Even for the two species (A. japonicus in Japan and P. parvimensis in the USA) that were previously used for embryonic studies, their gametes are typically available only for 12 months in a year. Since some echinoderm researchers are aiming to establish sea cucumbers as a model system in the near future (see 2024 review: PMID: 38368336), we hope to be able to have better access to their embryos in the future. Yet, it may require a few more years to reach that condition.

      In this revised manuscript, we explained the above details and further added the discussion described below. All of the experimental models used in this study are wild animals obtained from the ocean, raising the standard for reproducibility. However, handling wild animals could come with challenges. We hope that the reviewer understands the unique benefits and challenges of this study.

      Discussion:

      Previous studies (PMIDs: 17726110; 21855794) suggest that GL1 is not involved in intramolecular interaction with TPR domains. This allows GL1 to interact independently with Gαi for cortical recruitment yet without influencing other GLs for AGS activation. To ensure GL1's independence, GL1 is typically located distantly from other GLs in Pins (flies), LGN (humans), and AGS (sea urchins). Based on this prior knowledge, we speculate three scenarios for sea cucumber (Sb) AGS not being able to localize or function during asymmetric cell division (ACD): 1) GL1 and GL2 are located too close to each other, compromising GL1's independence for recruitment. 2) A lack of GL4 loosens the autoinhibition state. 3) The GL1 sequence of SbAGS is quite different from that of echinoids’ AGS (Figure S2), compromising its recruiting efficacy. 

      For 1), we tested this possibility by making the SpAGS-GL1GL2 mutant that has GL1 and GL2 next to each other (Fig. 4G). This mutant indeed compromised its cortical localization and function in ACD. For 2), we showed that the lack of GL4 partially compromised ACD in SpAGS (Fig. 3F), suggesting that GL4 supports ACD. For 3), The results in Figure 4 indicate that the position but not the sequence of GL1 is critical for ACD. Based on these observations, we speculate a combination of 1) and 2) compromised SbAGS's ACD function. However, it is still possible that a significant difference in the GL1 sequence diminished its function as GL entirely. Future studies should address these remaining questions directly in the sea cucumber embryos once they are established as a model system in the near future (PMID: 38368336)

      The authors point out that AGS-PmGL demonstrates enrichment at the vegetal cortex (arrow in 5G, quantifications in 5H), unlike PmAGS. AGS-PmGL does not however support ACD. They interpret this result to indicate "that other elements of SpAGS outside of its C-terminus can drive its vegetal cortical localization but not function." This is a critical finding and deserves more attention. Put succinctly: Vegetal cortical localization of AGS is insufficient to promote ACD, even in echinoids. Why should this be?  

      Thank you for the suggestion. We revised our wording to be more succinct. Of note, as we noted in the text, AGS-PmGL has only two GL domains, which will likely not provide the full force to control ACD and result in insufficient ACD function.

      The authors did perform experiments to address this problem, hypothesizing that the difference might be explained by the linker region, which includes a conserved phosphorylation site that mediates binding to Dlg. They write "To test if this serine is essential for SpAGS localization, we mutated it to alanine (AGS-S389A in Fig. S3A). Compared to the Full AGS control, the mutant AGS-S389A showed reduced vegetal cortical localization (Fig. S3B-C) and function (Fig. S3D-E). Furthermore, we replaced the linker region of PmAGS with that of SpAGS (PmAGSSpLinker in Fig. S4A-B). However, this mutant did not show any cortical localization nor proper function in ACD (Fig. S4C-F). Therefore, the SpAGS C-terminus is the primary element that drives ACD, while the linker region serves as the secondary element to help cortical localization of AGS." 

      The experiments performed only make sense if the AGS-PmGL chimeric protein used in Figure 5 starts the PmGL sequence only after the Sp linker, or at least after the Sp phosphorylation site. I can't tell from the paper (Figure S3 indicates that it does, whereas S5 suggests otherwise), but it's a critical piece of information for the argument. 

      Thank you for the pointer, and we apologize for the confusion. AGS-PmGL contains the SpAGS linker domain. To clarify this point, we added the amino acid position at the junction of each chimeric construct diagram in Figs. 5 and S4. To clarify, Figure S5 is about the GL domain mutations (not about the Linker).  

      Another piece of missing information is whether the PmAGS can be phosphorylated at its own conserved phosphorylation site. The authors don't test this, which they could at least try using a phosphosite prediction algorithm, but they do show that the candidate phosphorylation site has a slightly different sequence in Pm than in Et and Sp (Fig. S4A). With impressive rigor, the authors go on to mutate the PmAGS phosphorylation site to make it identical to Sp. Nothing happens. Vegetal cortical localization does not increase over AGS-PmGL alone. Micromere formation is unrescued. 

      There is therefore a logic problem in the text, or at least in the way the text is written. The paragraph begins "Additionally, AGS-PmGL unexpectedly showed cortical localization (Figure 5G), while PmAGS showed no cortical localization (Figure 5B)." We want to understand why this is true, but the explanation provided in the remainder of the paragraph doesn't match the question: according to quite a bit of their own data, the phosphorylation site in the linker does not explain the difference. It might explain why AGS-PmGL fails to promote micromere formation, but only if the AGS-PmGL chimeric protein uses the Pm linker domain (see above).

      Thank you for the insightful suggestion. As suggested, we performed the phosphosite predictions using GPS 6.0 (PMID: 37158278) and enclosed the results in Fig. S4A (replacing the old Fig. S3A). The software predicts SpAGS and EtAGS have a predicted AuroraA phosphorylation site (RRRSMEN in Supplemental figure S4A) in their linker domain, while PmAGS does not. Sp and Et AGS also have the additional 5-7 predicted phosphorylation sites, while PmAGS has only three sites with low scores. Therefore, the linker domain is not conserved in PmAGS. 

      The PmAGS+SpLinker mutant does restore the predicted AuroraA phosphorylation site on the software, yet it does not restore the cortical localization or ACD function in the embryo. Therefore, other sites in the Linker region might also be necessary for cortical localization and ACD function of AGS. In this study, we did not perform further manipulations in the Linker domain. As the reviewer rightfully pointed out, even if we identify the Linker regions essential for AGS localization and function, it will be difficult to interpret the result unless we know what proteins interact with the Linker domain of AGS. Therefore, this is beyond the scope of the current manuscript. We discussed these remaining matters in the discussion section. 

      Another concern that is potentially related is the measurement of cortical signal. For example, in the control panel of Figure 5C, there is certainly a substantial amount of "non-cortical" signal that I believe is nuclear. I did not see a discussion of this signal or its implications. My impression of the pictures generally is that the nuclear signal and cortical signal are inversely correlated, which makes sense if they are derived from the same pool of total protein at different points of the cell cycle. If that's the case (and it might not be) I would expect some quantifications to be impacted. For example, the authors show in Figure S3B that AGS-S389A mutant does not localize to the cortex. However, this mutant shows a radically different localization pattern to the accompanying control picture (AGS), namely strong enrichment in what I assume to be the nucleus. Is the S389 mutant preventing AGS from making it to the cortex? Or are these pictures instead temporally distinct, meaning that AGS hasn't yet made it out of the nucleus? Notably, the work of Johnston et al. (Cell 2009), cited in the text, does not show or claim that the linker domain impacts Pins localization. Their model is rather that Pins is anchored at the cortex by Gαi, not Dlg, and that is the same model described in this manuscript.

      In agreement with that model and the results of Johnston et al., a later study (Neville et al. EMBO Reports 2023) failed to find a role for Dlg or the conserved phosphorylation site in Pins localization. 

      In the sea urchin embryo, the dye or GFP often appears in the nucleus randomly on top of the cytoplasm (for example, see Fig. S2b of PMID: 35444184). Further, embryos tend to incorporate exogenous genomic fragments more efficiently during early embryogenesis (PMID: 3165895). It is proposed that early embryos may have a loosened or incomplete nuclear envelope compared to adult cells as they divide rapidly (every 40 minutes). Therefore, any excess protein with no specific localization signal may randomly appear in the nucleus as it serves as an available space in the cell. As the Reviewer rightfully pointed out, we consider that the nuclear AGS signal is due to the lack of a specific destination since this signal pattern is not consistent across embryos. In contrast, the proteins that have nuclear localization (e.g., transcription factors) usually show a consistent nuclear signal across cells and embryos with less cytoplasmic signal. To avoid confusion, we replaced the S389A image in Fig. S3B (which is now Fig. S4C) as well as any other images that may create similar confusion.

      Reviewer #2 (Public Review): 

      This study from Dr. Emura and colleagues addresses the relevance of AGS3 mutations in the execution of asymmetric cell divisions promoting the formation of the micromere during seasearching development. To this aim, the authors use quantitative imaging approaches to evaluate the localisation of AGS3 mutants truncated at the N-terminal region or at the Cterminal region, and correlate these distributions with the formation of micromere and correct development of embryos to the pluteus stage. The authors also analyse the capacity of these mutated proteins to rescue developmental defects observed upon AGS3 depletion by morpholino antisense nucleotides (MO). Collectively these experiments revealed that the Cterminus of AGS3, coding for four GoLoco motifs binding to cortical Gaphai proteins, is the molecular determinant for cortical localisation of AGS3 at the micromeres and correct pluteus development. Further genetic dissections and expression of chimeric AGS3 mutants carrying shuffled copies of the GoLoco motifs or four copies of the same motifs revealed that the position of GoLoco1 is essential for AGS3 functioning. To understand whether the AGS3-GoLoco1 evolved specifically to promote asymmetric cell divisions, the authors analyse chimeric AGS3 variants in which they replaced the sea urchin GoLoco region with orthologs from other echinoids that do not form micromeres, or from Drosophila Pins or human LGN. These analyses corroborate the notion that the GoLoco1 position is crucial for asymmetric AGS3 functions. In the last part of the manuscript, the authors explore whether SpAGS3 interacts with the molecular machinery described to promote asymmetric cell division in eukaryotes, including Insc, NuMA, Par3, and Galphai, and show that all these proteins colocalize at the nascent micromere, together with the fate determinant Vasa. Collectively this evidence highlighted how evolutionarily selected AGS3 modifications are essential to sustain asymmetric divisions and specific developmental programs associated with them. 

      Thank you for the useful summary.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors): 

      The quantifications of "vegetal cortical localization" are somewhat incomplete. As measured, "vegetal cortical localization" does not demonstrate particular enrichment at the vegetal cortex, only that some signal appears there. In other words, we can't tell for sure that there is any more signal at the vegetal cortex than anywhere else along the cortex, and in fact that's plainly true and even described for the ACS1111 and AGS2222 constructs. One solution would be to measure signal strength around the cell perimeter and see where it is strongest. 

      As suggested by the Reviewer, we added new measurements, focusing and comparing the signals on the animal versus vegetal cortices (Figs. 2C, 3D, 4C, 5C, &H, 9D & F, S3D, S4D &I). 

      A related issue is that the strength of cortical enrichment is indicated in this paper by the ratio of cortical to "non-cortical" signal, but "non-cortical" is not defined. Does it include the nuclear signal? 

      As described above, we replaced all measurements using the above animal vs. vegetal cortices to avoid confusion. The nuclear signal is thus not measured in these analyses.

      I'm enthusiastic about the results in Figure 7, but I can't really see them very well. Could you please consider changing the color scheme? For single-color figures, it would be helpful to view them as black on white rather than (for example) blue on black. That change is easily achieved with Fiji. 

      We revised the Figure as suggested.

      Page 3 Results section: "At the time of ACD, Insc recruits Pins/LGN to the cortex through Gαi": I understand this sentence to mean that Gαi is an intermediary protein that Insc uses to recruit Pins/LGN. I think the point should be made more clear. As shown in Figure 1, Insc binds to Pins/LGN directly and interacts with cortical polarity proteins directly. Recruitment therefore doesn't appear to require Gαi, but stable association with the membrane (a subsequent step) probably does. That model is shown and described in Figure 6A.

      Thank you for the pointer. We clarified our explanations as suggested.

      Reviewer #2 (Recommendations For The Authors): 

      The manuscript addresses an interesting question, and uses elegant genetic approaches associated with imaging analyses to elucidate the molecular mechanisms whereby AGS3 and spindle orientation proteins promote asymmetric divisions and specific developmental programs. This considered, it might be worth clarifying a few aspects of the reported findings. 

      (1) In some experimental settings, the presence of AGS3 mutants exacerbates the AGS3 deletion by MO (Figure 4F). Can the author speculate on what can be the molecular explanation? 

      Thank you for pointing this out. We speculate that AGS1111 and AGS2222 are unable to keep the auto-inhibited forms since they lack GL3 and GL4 as modeled in Figure 6. AGS-MO reduces the endogenous AGS, which compromises the vegetal polarity. In this embryo, constitutive active AGS likely further randomizes the polarity, as evidenced by AGS-OE results in Fig. S7, resulting in an even worse outcome. We elaborated on this part in the text.

      (2) Imaging analyses of Figure 4B-C suggest that the mutant AGS1111 does not localise at the vegetal cortex while AGS2222 does (Fig. 4C). However these mutants induce similar developmental defects (Figure 4F). What could be the reason? 

      We apologize for the confusion in Fig. 4C. The majority of embryos from both AGS1111 and 2222 groups failed to form micromeres and showed AGS localization across the cortex. Among the dozens we examined, 0 embryos from 1111 and 8 embryos from 2222 developed micromeres. Those 8 embryos still showed vegetal cortical localization, so the proportion appears high in Fig. 4B, yet it reflects the minority in the group. In contrast, Development was scored for all embryos (including those that failed to form micromeres), so the graph demonstrates the majority of embryos. To avoid this confusion, we replaced the old Fig. 4C with a new graph that analyzes the cortical signal levels at the vegetal versus animal cortices.

      (3) Figure 7 shows the crosstalk between AGS3 and other asymmetry players including NuMA. Vertebrate and Drosophila NuMA are ubiquitously present in tissues and localise to the spindle poles in mitosis. However, in Figures 7A and 7E NuMA seems expressed only in a subset of sea urchin embryonic cells. Is this the case? 

      As the Reviewer rightfully pointed out, Sea urchin NuMA is also present in all cells and localizes to the spindle (please see Fig. 2 of our previous paper PMID: 31439829). AGS is also slightly localized on the spindles of all cells. However, the PLA signal of AGS and NuMA mostly showed up in the vegetal cortex in this study, suggesting that major crosstalk may occur in the vegetal cortex. This does not rule out the possibility that minor interactions may also occur on the spindle or elsewhere in the cell, which was not quantifiable in this study. We clarified this point in the text.

    1. Reviewer #1 (Public review):

      Summary:

      The authors aimed to classify hepatocellular carcinoma (HCC) patients into distinct subtypes using a comprehensive multi-omics approach. They employed an innovative consensus clustering method that integrates multiple omics data types, including mRNA, lncRNA, miRNA, DNA methylation, and somatic mutations. The study further sought to validate these subtypes by developing prognostic models using machine learning algorithms and extending the findings through single-cell RNA sequencing (scRNA-seq) to explore the cellular mechanisms driving subtype-specific prognostic differences.

      Strengths:

      (1) Comprehensive Data Integration: The study's integration of various omics data provides a well-rounded view of the molecular characteristics underlying HCC. This multi-omics approach is a significant strength, as it allows for more accurate and detailed classification of cancer subtypes.

      (2) Innovative Methodology: The use of a consensus clustering approach that combines results from 10 different clustering algorithms is a notable methodological advancement. This approach reduces the bias that can result from relying on a single clustering method, enhancing the robustness of the findings.

      (3) Machine Learning-Based Prognostic Modeling: The authors rigorously apply a wide array of machine learning algorithms to develop and validate prognostic models, testing 101 different algorithm combinations. This comprehensive approach underscores the study's commitment to identifying the most predictive models, which is a considerable strength.

      (4) Validation Across Multiple Cohorts: The external validation of findings in independent cohorts is a critical strength, as it increases the generalizability and reliability of the results. This step is essential for demonstrating the clinical relevance of the proposed subtypes and prognostic models.

      Weaknesses:

      (1) Inconsistent Storyline:<br /> Despite the extensive data mining and rigorous methodologies, the manuscript suffers from a lack of a coherent and consistent narrative. The transition between different sections, particularly from multi-omics data integration to single-cell validation, feels disjointed. A clearer articulation of how each analysis ties into the overall research question would improve the manuscript.

      (2) Questionable Relevance of Immune Cell Activity Analysis:<br /> The evaluation of immune cell activities within the cancer cell model raises concerns about its meaningfulness. The methods used to assess immune function in the tumor microenvironment may not be fully appropriate, potentially limiting the insights gained from this part of the study.

      (3) Incomplete Single-Cell RNA-Seq Validation:<br /> The validation of the findings using single-cell RNA-seq data appears insufficient to fully support the study's claims. While the authors make an effort to extend their findings to the single-cell level, the analysis lacks depth. A more comprehensive validation is necessary to substantiate the robustness of the identified subtypes.

      (4) Figures and Visualizations:<br /> Several figures in the manuscript are missing necessary information, which affects the clarity of the results. For instance, the pathways in Figure 3A could be clustered to enhance interpretability, the blue bar in Figure 4A is unexplained, and Figure 4B is not discussed in the text. Additionally, the figure legend in Figure 7C lacks detail, and many figure descriptions merely repeat the captions without providing deeper insights.

      (5) Appraisal of the Study's Aims and Results:<br /> The authors have set out to achieve an ambitious goal of classifying HCC patients into distinct prognostic subtypes and validating these findings through both bulk and single-cell analyses. While the methodologies employed are innovative and the data integration comprehensive, the study falls short of fully achieving its aims due to inconsistencies in the narrative and incomplete validation. The results partially support the conclusions, but the lack of coherence and depth in certain areas limits the overall impact of the study.

      (6) Impact on the Field:<br /> If the identified weaknesses are addressed, this study has the potential to significantly impact the field of HCC research. The multi-omics approach combined with machine learning is a powerful framework that could set a new standard for cancer subtype classification. However, the current state of the manuscript leaves some uncertainty regarding the practical applicability of the findings, particularly in clinical settings.

      (6) Additional Context<br /> For readers and researchers, this study offers a valuable look into the potential of integrating multi-omics data with machine learning to improve cancer classification and prognostication. However, readers should be aware of the noted weaknesses, particularly the need for more consistent narrative development and comprehensive validation of the methods. Addressing these issues could greatly enhance the study's utility and relevance to the community.

    2. Reviewer #2 (Public review):

      Summary:

      Overall, this is a well-executed and insightful study. With some refinement to the presentation and a deeper exploration of the implications, the manuscript will make a significant contribution to the field of cancer genomics and personalized medicine.

      Strengths:

      The manuscript integrates multi-omics data with machine learning to address the significant heterogeneity of hepatocellular carcinoma (HCC). The use of multiple clustering algorithms and a consensus method strengthens the robustness of the findings. The study successfully develops a prognostic model with excellent predictive accuracy, validated across independent datasets. This adds considerable value to the field, particularly in providing individualized treatment strategies. The identification of two distinct liver cancer subtypes with different biological and metabolic characteristics is well-supported by the data, offering a promising direction for personalized medicine.

      Weaknesses:

      (1) Consider streamlining the presentation of methods, especially regarding the clustering algorithms and machine learning models. Readers may find it difficult to follow the exact process unless more clearly outlined.

      (2) Some figures, such as the signaling pathways and heatmaps, are critical to understanding the study's findings. Ensure that all figures are high quality, easy to interpret, and adequately labeled. You may also want to highlight the key findings within the figure captions more explicitly.

      (3) While the manuscript does compare its prognostic model to those previously published, the novelty of the findings could be emphasized more clearly. Discussing the potential limitations of the study (e.g., the reliance on computational models and small sample sizes for scRNA-seq) could strengthen the manuscript.

      (4) The manuscript mentions that the data was split into training and validation datasets in a 1:1 ratio. How was the performance verified? Is there an independent test set?

      (5) The role of the MIF signaling pathway in subtype differentiation is intriguing, but further mechanistic insights into how this pathway drives the differences between CS1 and CS2 could be discussed in more detail. If experimental evidence for this pathway exists in the literature, it should be mentioned.

      (6) Some sentences are quite long and complex, which can affect readability. Breaking them down into shorter, clearer sentences would improve the flow.

    3. Author response:

      Reviewer #1 (Recommendations for the authors):

      (1) Storyline and Narrative Flow:

      Consider revising the manuscript to create a more coherent and consistent narrative. Clarify how each section of the study-particularly the transition from multi-omics data integration to single-cell RNA-seq validation-contributes to the overall research question. This will help readers better understand the logical flow of the study.

      In the upcoming revisions, we will optimize the logical connections between sections of the manuscript to clarify the role each part plays in the overall research question, making it easier for readers to follow.

      (2) Immune Cell Activity Analysis:

      Reevaluate the methods used to assess immune cell activities within the context of the tumor microenvironment. Consider providing additional justification for the relevance of using the cancer cell model for this analysis. If necessary, explore alternative methods or models that might offer more meaningful insights into immune-tumor interactions.

      We fully recognize the importance of using tumor models to analyze and validate immune activity results, and we are considering experimental research in this area in future projects.

      (3) Single-Cell RNA-Seq Validation:

      Expand the validation of your findings using single-cell RNA-seq data. This could include more in-depth analyses that explore the heterogeneity within the subtypes and confirm the robustness of your classification method at the single-cell level. This would strengthen the support for your claims about the relevance of the identified subtypes.

      In the current study, we have applied the obtained multi-omics profiling features to single-cell sequencing data to classify malignant cells. We analyzed the metabolic and cell communication differences between different subtypes of malignant cells and explored potential reasons for these differences. Next, we plan to conduct further analysis of the differences between malignant cell subtypes to identify additional clues and mechanisms underlying these variations.

      (4) Methodological Justification:

      Provide a more detailed rationale for the selection of machine learning algorithms and integration strategies used in the study. Explain why the chosen methods are particularly well-suited for this research, and discuss any potential limitations they might have.

      In the revised manuscript, we will include descriptions of the principles of these analytical methods, as well as examples of their application in other studies, to discuss the rationale and limitations of applying these methods in this research.

      (5) Figures and Visualizations:

      Improve the clarity of your figures by addressing the following:

      a) Figure 3A: Cluster the pathways to make the comparisons clearer and more meaningful.

      b) Figure 4A: Clearly explain the significance of the blue bar.

      c) Figure 4B: Ensure this figure is discussed in the main text to justify its inclusion.

      d) Figure 7C: Enhance the figure legend to provide more informative details.

      Additionally, ensure that figure descriptions go beyond the captions and provide detailed explanations that help the reader understand the significance of each figure.

      We fully agree with the reviewer’s suggestions regarding these figures, and we will make the necessary revisions in the revised manuscript.

      (6) Supplementary Materials:

      Consider including more detailed supplementary materials that provide additional validation data, extended methodological descriptions, and any other information that would support the robustness of your findings.

      When we submission the revised manuscript, we will include supplementary materials such as figures or tables that may enhance the presentation of the manuscript's completeness.

      (7) Recent Literature:

      a) Incorporate more recent studies in your discussion, especially those related to HCC subtypes and the application of machine learning in oncology. This will provide a more current context for your work and help position your findings within the broader field.

      We appreciate the reviewer's suggestion. We will incorporate more recent studies into the discussion section and optimize its content.

      (8) Data and Code Availability:

      Ensure that all data, code, and materials used in your study are made available in line with eLife's policies. Provide clear links to repositories where readers can access the data and code used in your analyses.

      We have indicated the sources of the data and tools used in the analysis process within the text, and these data and tools can be accessed through the websites or literature we have cited.

      Reviewer #2 (Recommendations for the authors):

      (1) While the computational findings are robust, further experimental validation of the two subtypes, particularly the role of the MIF signaling pathway, would strengthen the biological relevance of the findings. In vitro or in vivo validation could confirm the proposed mechanisms and their influence on patient prognosis.

      We fully recognize the importance of using tumor models to analyze and validate immune activity results, and we are considering experimental research in this area in future projects.

      (2) Consider testing the model on additional independent cohorts beyond the TCGA and ICGC datasets to further demonstrate its generalizability and applicability across different patient populations.

      We are considering looking for independent external datasets in the GEO database or other databases to validate our model.

      (3) Review the manuscript for long or complex sentences, which can be broken down into shorter, more readable parts.

      In the revised manuscript, we will address any grammatical issues present in the manuscript and modify long and complex sentences that may hinder reader comprehension.

    1. Reviewer #2 (Public review):

      Summary:

      The authors seek to use single-cell sequencing approaches to identify TCRs specific for the SARS CoV2 spike protein, select a candidate TCR for cloning and use it to construct a TCR transgenic mouse. The argument is that this process is less cumbersome than the classical approach, which involves the identification of antigen-reactive T cells in vitro and the construction of T cell hybridomas prior to TCR cloning. TCRs identified by single-cell sequencing that is already paired to transcriptomic data would more rapidly identify TCRs that are likely to contribute to a functional response. The authors successfully identify TCRs that have expanded in response to SARS CoV2 spike protein immunization, bind to MHC tetramers and express genes associated with functional response. They then select a TCR for cloning and construction of a transgenic mouse in order to test the response of resulting T cells in vivo following immunization with spike protein of coronavirus infection.

      Strengths:

      (1) The study provides proof of principle for the identification and characterization of TCRs based on single-cell sequencing data.

      (2) The authors employ a recently developed software tool (DALI) that assists in linking transcriptomic data to individual clones.

      (3) The authors successfully generate a TCR transgenic animal derived from the most promising T cell clone (CORSET8) using the TCR sequencing approach.

      (4) The authors provide initial evidence that CORSET8 T cells undergo activation and proliferation in vivo in response to immunization or infection.

      (5) Procedures are well-described and readily reproducible.

      Weaknesses:

      (1) The purpose of presenting a failed attempt to generate TCR transgenic mice using a traditional TCR hybridoma method is unclear. The reasons for the failure are uncertain, and the inclusion of this data does not really provide information on the likely success rate of the hybridoma vs single cell approach for TCR identification, as only a single example is provided for either.

      (2) There is little information provided regarding the functional differentiation of the CORSET8 T cells following challenge in vivo, including expression of molecules associated with effector function, cytokine production, killing activity and formation of memory. The study would be strengthened by some evidence that CORSET8 T cells are successfully recapitulating the functional features of the endogenous immune response (beyond simply proliferating and expressing CD44). This information is important to evaluate whether the presented sequencing-based identification and selection of TCRs is likely to result in T-cell responses that replicate the criteria for selecting the TCR in the first place.

      (3) While I find the argument reasonable that the approach presented here has a lot of likely advantages over traditional approaches for generating TCR transgenic animals, the use of TCR sequencing data to identify TCRs for study in variety of areas, including cancer immunotherapy and autoimmunity, is in broad use. While much of this work opts for alternative methods of TCR expression in primary T cells (i.e. CRISPR or retroviral approaches), the process of generating a TCR transgenic mouse from a cloned TCR is not in itself novel. It would be helpful if the authors could provide a more extensive discussion explaining the novelty of their approach for TCR identification in comparison to other more modern approaches, rather than only hybridoma generation.

      Comments on revisions:

      The authors have provided additional clarification on the comparisons between the presented method for TCR transgenic generation and the hybridoma method that is more commonly used and added additional verification of the functional response in vivo of T cells expressing the selected TCR. Overall, these additions enhance the evidence that the proposed methods are likely to identify TCRs with a strong immune activation profile and are a reasonable response to the first round of review.