5,256 Matching Annotations
  1. Oct 2022
    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The article by M. Grzonka and H. Bazzi entitled: Mouse Sas-6 is required for centriole formation in embryos and integrity in embryonic stem cells, describes new findings in novel mouse models of Sas-6 knockouts (KO). This is an interesting study that reports two different mouse Sas6 KO models and the depletion of Sas6 from mouse embryonic stem cells (mESCs). This type of analysis has never been done before and so it reveals and describes a role for Sas-6 in centriole biogenesis in mouse.

      The authors compare their analysis with Sas-4 KO and overall found similar results when compared to previous work from H. Bazzi, when Sas-4 was depleted in mouse embryos. Due to the mitotic stopwatch pathway, Sas6KO embryos die during development at extremely early stages and this can be rescued by depletion in p53 and other members of the pathway. Perhaps, not so surprisingly, these embryos do not contain centrioles, showing that in vivo, Sas6 is absolutely required for centriole duplication. More surprisingly, however, in cultures of mESCs, established and propagated in vitro, Sas-6 crispr induced KO, does not result in lack of centrioles. Instead, abnormal structures that show aberrant morphologies, length, and incapacity to assemble cilia were detected. In principle, this means that centrioles can be assembled independently of Sas-6, even if not in the correct manner.

      The authors interpret these differences as possible differences in the pathways involved in centriole assembly and propose different requirements in different cell types, within the same species. I have problems with this interpretation. To me is very difficult to understand, how the "protein" absolutely required for cartwheel assembly at the early stages of centriole biogenesis, can be essential and dispensable at the same time. Although, I may be wrong, I think the authors have not envisage other possibilities to interpret their data, which should be taken into consideration.

      1) I do not know anything about ESC and ESC cultures. So maybe this is a stupid suggestion. But can't they be derived exactly from the same genetic background of SAS-6KO embryos? Because the way the two (or even 3 as there are 2 mouse KO lines) are generated is different. Why is that?

      2) Still on mESCs, are the authors sure that there are no WT Sas-6mRNAs still present in their ESC cells? Because tiny amounts are maybe sufficient to allow the initial cartwheel structure. In FigS2B, I can see a really faint band, very faint but it is there.

      3) This last point goes also with the western-blot of Figure S2C- there is still a band, very tiny between the two very tick bands (marked with *). Maybe separating proteins better will help visualizing the real Sas-6 band? Have they used the Sas6 ab in other WBs from the KO embryos, for example? Can they use the Sas6 ab in immunostaining to show if the assembled abnormal centrioles completely lack Sas6. This will allow to distinguish between the hypothesis of having some (even if not much) sas6 left?

      4) Then a more theoretical point? Have the authors considered that the difference is more in the stability of the abnormal structures. Let's say, without a cartwheel and maybe enough PLK4 activity and high level of other centriolar components, the centrioles are abnormally assembled- they have no cartwheel, but they are disassembled very fast in the embryo but not in ESCs?

      5) Even if there is a real difference and without Sas-6 ESCs can make centrioles that are abnormal in structure and function (at least at the cilia assemble level), the choice of words "strictly required", I am not sure it is correct. Because, since Sas-6 is described by many studies as the factor required for cartwheel assembly, which occurs very early in the pathway, this means that in mESCs centrioles can assembled without forming a cartwheel. And so that the cartwheel is actually not required for the initial building block, but more as a structure that maintains the whole centriole in an intact manner?

      6) The authors mentioned that in flies, abnormal Sas-6 structures have been described in certain cell types. Are these mutants, null mutants? In other words, do these structures assembled in a context of no Sas6 or abnormal Sas-6 protein or even low levels of Sas-6?

      Other points:

      • I think the 1st sentence of the abstract appears disconnected from the rest. The same goes for the 1st sentence of the introduction. And also, what is the evidence that pluripotent stem cells rely primarily on the proper assembly of a mitotic spindle? They relly on many other things, not sure this is the first one.
      • The authors mention that centrioles are lost in Sas6-/- after "differentiation" of mESCs. The term differentiation is not appropriate, and confusing here. Differentiation normally refer to cells that stopped proliferating and exited the cell cycle, which is not the case here, as NPCs are progenitor cells that keep cycling.
      • Figure S1: Percent of cells with centrosomes was assessed by a co-staining of tubulin and Cep164, which mark the mother centrioles. As Cep164 may be absent from centrosomes after lack of centriole maturation in sas6-/- embryos, another combination of staining should be performed to evaluate the percent of cells without centrosomes. tubulin staining can be seen in Sas6 em5/em5 embryos, while the quantification claims total absence of centrosomes. The authors use the CENT2-eGFP transgenic line to count the number of centrioles in Figure 3, they should do the same in Figure S1.

      Significance

      This study shows with a novel mouse model the requirement of centrioles during mouse development. It will be relevnat to centrosome labs, the novel mouse lines will be useful to many labs working on centrioles, cilia and centrosomes. My expertise: centrosome biology

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the Reviewers for their useful feedback on our manuscript. We have addressed the Reviewers’ comments and revised our manuscript accordingly. A point-by-point response is provided below.

      Reviewer comments:

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Gopalan et al use quantitative, comprehensive lipid mass spectrometry of mouse brain tissue isolated at various time points in embryonic and postnatal development. They then go on to use the same quantitative analysis of mouse and human stem cells differentiated in vitro into neurons to define the lipid composition of these cultures.

      Major Comments:

      1. As mentioned above, it is difficult to assess whether the discrepancy in the lipotype acquisition between in vivo mouse brain development and stem cell differentiation is due to metabolic differences in the in vitro differentiation as the authors state or is due to a lack of the stem cells to actually acquire a neuronal phenotype. Perhaps showing more clearly that the protocols for neuronal differentiation work efficiently and/or how they compare to brains dissected would be helpful in stating that the lipotype is different. The protocol referenced here (Bogetofte et al) only gives ~30% TH+ positive DA neurons in their manuscript. What cell type the other 70% of the cells are is something that could be discussed as means of "diluting" out the lipotype seen in these cultures. Perhaps the 30% TH+ DA neurons do attain the "correct" lipotype, but the lipidomic analysis can not detect this due to the contaminating effects of the non-differentiated cells. In this work, it would be nice to see what percentage of the cells differentiate into the expected cell type rather than referencing previous manuscripts. As differentiation protocols and originating cell sources are highly variable and error-prone, it's difficult to know what the lipotype results are actually reporting on. Furthermore, discussion about these differentiation techniques and how well they represent functional neurons is warranted. The papers referenced here don't show 100% differentiation into the phenotypes that are described in this work such that the lipotype finding is not the only suggestion of a "general failure of in vitro neuronal differentiation models". Maybe a discussion of how the lack of ability to attain the neuronal lipotype due to the metabolic deficiencies discussed here could be causative to the inability to full recapitulate the neuronal phenotype is useful for the reader.

      We thank the reviewer for this question and suggested experiments. Following the advice, we have now show immunofluorescence data of pan-neuronal markers (i.e., b-tub III or MAP2) in mESC and iPSCs. In agreement with previously published datasets from the Noh and Meyer labs (Gehre et al., 2020; Bogetofte et al.,2019), we show that the protocols we use generate a very high percentage of neurons. We have now included these images and quantifications in our manuscript as Figs. 2B and S6A,B.

      From the discussion and work here is unclear why the stearate feeding of the stem cells did not result in an increase in the 18:0-containing sphingolipids. The authors state that the appropriate metabolic pathways are not fully established and go on to look at the CerS expression levels across the differentiation timeline. It appears that the results presented in Fig. S7 counter the authors' interpretation of the lipotype and more discussion here would be nice to clarify this discrepancy.

      We thank the reviewer for highlighting this seemingly counterintuitive observation. We have now included a quantification of CerS mRNA from commercially available mouse tissues analysed in Sladitschek and Neveu, 2019 and compared this to the data from Gehre et al, 2020. In the mouse brain tissue, CerS1 expression is upregulated dramatically, while CerS5 and 6 are downregulated (see new panel A in Fig. S8). In contrast, during in vitro differentiation of mESCs, CerS5 is not downregulated and CerS6 is upregulated (Fig. S8B). Accordingly, we have expanded our discussion in the revised manuscript as follows:

      “On the other hand, supplementing the cells with stearic acid (18:0) does not result in high levels of 18:0-sphingolipids. It is known that the Cer synthase CerS1 is specific for stearoyl-CoA (Venkataraman et al., 2002), which results in the production of 18:0-sphingolipids, while the synthases CerS5 and CerS6 are responsible for 16:0-sphingolipid production. During brain development, one observes a 35-fold increase in the expression of CerS1 and a downregulation of CerS5 and CerS6 compared to embryonic tissue (Sladitschek & Neveu, 2019) (Fig. S8A). In contrast, during in vitro neuronal differentiation, between day 8 and 12, CerS1 expression increases only by 5-fold and, contrary to expectation, CerS6 expression is upregulated and CerS5 expression is unchanged (Gehre et al., 2020) (Fig. S8B). This could underpin the observation that 16:0-sphingolipids remain elevated whereas brain-specific 18:0-sphingolipids only increase marginally, despite supplementation with stearic acid. Overall, this suggests that appropriate programming of the sphingolipid metabolic machinery is not fully established in stem cell-derived neurons.”

      Minor comments:

      1. I find the data presentation of the LENA analysis to be difficult to follow (Fig. 1E). In my opinion, the p-value is not the most important bit of information in this graph, though having it on the y-axis with other pertinent information encoded by colors or arrows being disguised. I would rather see the data on the x-axis that is above a certain p-value (denoted in the figure legend) plotted with the direction and magnitude of change shown.

      We thank the reviewer for this suggestion. In the revised manuscript, we now plot log2(odds ratio) on the y-axis instead of the p-value. Moreover, we have dimensioned the size and color intensity of each point as function of the p-value (Fig. 1E and 2E, shown below).

      In the PCA in Fig 1, what are the loadings that define the variable PC1 and PC2? What is predominantly changing the P21 samples that lead to such a large shift if most of the data shown in the subsequent panels are not changing much between P2 and P21.

      In the revised manuscript, we now include a plot of the PCA loadings of the lipids majorly influencing principal components 1 and 2 as supplemental Fig. S3.

      Reviewer #1 (Significance (Required)):

      This work provides a nice reference for the complex lipidomes in embryonic and postnatal murine brain development. The details of the lipotype changes during development are well laid out and will of no doubt be of great use across a variety of scientific fields. While I found the in vivo data to be compelling, interesting, and useful, the lack of controls for the in vitro stem cell differentiation work makes this particular data set and comparison less useful. Further work to identify the limitations of the stem cell differentiation protocols as a valid comparison to in vivo brain development need to be done and/or the discussion of the direct comparisons between the two toned down.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The study used a quantitative lipidomics approach which I am very familiar with. The results should be highly reproducible.

      Reviewer #2 (Significance (Required)):

      The manuscript submitted by Gopalan et al. reported a quantitative and comparative lipidomics study between mouse brain samples from early embryonic to postnatal stages, and rodent and human stem cell-derived neurons. The authors found a couple of very unique characters only existing in brain samples, but not in stem cell-derived neurons, including 22:6-containing glycerophospholipids and 18:0-containing sphingolipids. The authors further found the brain-like lipotypes can only be partially established in stem cell-derived neurons after supplementing brain lipid precursors. These findings clearly suggest that stem cell-derived neurons might not be appropriately used to mechanistically study lipid biochemistry, membrane biology, and biophysics in brains. The study was well designed. and the manuscript was very informative and resourceful. I would suggest to accept the manuscript for publication.

      We thank the Reviewer for the positive assessment of our work.

      References

      Gehre, M., Bunina, D., Sidoli, S., Lübke, M. J., Diaz, N., Trovato, M., Garcia, B. A., Zaugg, J. B., & Noh, K. M. (2020). Lysine 4 of histone H3.3 is required for embryonic stem cell differentiation, histone enrichment at regulatory regions and transcription accuracy. Nature Genetics, 52(3), 273–282. https://doi.org/10.1038/s41588-020-0586-5

      Levy, M., & Futerman, A. H. (2010). Mammalian ceramide synthases. IUBMB Life, 62(5), 347–356. https://doi.org/10.1002/iub.319

      Sladitschek, H. L., & Neveu, P. A. (2019). A gene regulatory network controls the balance between mesendoderm and ectoderm at pluripotency exit. Molecular Systems Biology, 15(12), 1–13. https://doi.org/10.15252/msb.20199043

      Venkataraman, K., Riebeling, C., Bodennec, J., Riezman, H., Allegood, J. C., Cameron Sullards, M., Merrill, A. H., & Futerman, A. H. (2002). Upstream of growth and differentiation factor 1 (uog1), a mammalian homolog of the yeast longevity assurance gene 1 (LAG1), regulates N-stearoyl-sphinganine (C18-(dihydro)ceramide) synthesis in a fumonisin B1-independent manner in mammalian cells. Journal of Biological Chemistry, 277(38), 35642–35649. https://doi.org/10.1074/jbc.M205211200

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The study used a quantitative lipidomics approach which I am very familiar with. The results should be highly reproducible.

      Significance

      The manuscript submitted by Gopalan et al. reported a quantitative and comparative lipidomics study between mouse brain samples from early embryonic to postnatal stages, and rodent and human stem cell-derived neurons. The authors found a couple of very unique characters only existing in brain samples, but not in stem cell-derived neurons, including 22:6-containing glycerophospholipids and 18:0-containing sphingolipids. The authors further found the brain-like lipotypes can only be partially established in stem cell-derived neurons after supplementing brain lipid precursors. These findings clearly suggest that stem cell-derived neurons might not be appropriately used to mechanistically study lipid biochemistry, membrane biology, and biophysics in brains. The study was well designed. and the manuscript was very informative and resourceful. I would suggest to accept the manuscript for publication.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Gopalan et al use quantitative, comprehensive lipid mass spectrometry of mouse brain tissue isolated at various time points in embryonic and postnatal development. They then go on to use the same quantitative analysis of mouse and human stem cells differentiated in vitro into neurons to define the lipid composition of these cultures.

      Major Comments:

      1. As mentioned above, it is difficult to assess whether the discrepancy in the lipotype acquisition between in vivo mouse brain development and stem cell differentiation is due to metabolic differences in the in vitro differentiation as the authors state or is due to a lack of the stem cells to actually acquire a neuronal phenotype. Perhaps showing more clearly that the protocols for neuronal differentiation work efficiently and/or how they compare to brains dissected would be helpful in stating that the lipotype is different. The protocol referenced here (Bogetofte et al) only gives ~30% TH+ positive DA neurons in their manuscript. What cell type the other 70% of the cells are is something that could be discussed as means of "diluting" out the lipotype seen in these cultures. Perhaps the 30% TH+ DA neurons do attain the "correct" lipotype, but the lipidomic analysis can not detect this due to the contaminating effects of the non-differentiated cells. In this work, it would be nice to see what percentage of the cells differentiate into the expected cell type rather than referencing previous manuscripts. As differentiation protocols and originating cell sources are highly variable and error-prone, it's difficult to know what the lipotype results are actually reporting on.

      Furthermore, discussion about these differentiation techniques and how well they represent functional neurons is warranted. The papers referenced here don't show 100% differentiation into the phenotypes that are described in this work such that the lipotype finding is not the only suggestion of a "general failure of in vitro neuronal differentiation models". Maybe a discussion of how the lack of ability to attain the neuronal lipotype due to the metabolic deficiencies discussed here could be causative to the inability to full recapitulate the neuronal phenotype is useful for the reader. 2. From the discussion and work here is unclear why the stearate feeding of the stem cells did not result in an increase in the 18:0-containing sphingolipids. The authors state that the appropriate metabolic pathways are not fully established and go on to look at the CerS expression levels across the differentiation timeline. It appears that the results presented in Fig S7 counter the authors' interpretation of the lipotype and more discussion here would be nice to clarify this discrepancy.

      Minor comments:

      1. I find the data presentation of the LENA analysis to be difficult to follow (Fig 1E). In my opinion, the p-value is not the most important bit of information in this graph, though having it on the y-axis with other pertinent information encoded by colors or arrows being disguised. I would rather see the data on the x-axis that is above a certain p-value (denoted in the figure legend) plotted with the direction and magnitude of change shown.
      2. In the PCA in Fig 1, what are the loadings that define the variable PC1 and PC2? What is predominantly changing the P21 samples that lead to such a large shift if most of the data shown in the subsequent panels are not changing much between P2 and P21.

      Significance

      This work provides a nice reference for the complex lipidomes in embryonic and postnatal murine brain development. The details of the lipotype changes during development are well laid out and will of no doubt be of great use across a variety of scientific fields. While I found the in vivo data to be compelling, interesting, and useful, the lack of controls for the in vitro stem cell differentiation work makes this particular data set and comparison less useful. Further work to identify the limitations of the stem cell differentiation protocols as a valid comparison to in vivo brain development need to be done and/or the discussion of the direct comparisons between the two toned down.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their constructive comments, which greatly helped us in steering the manuscript in the right direction. Below is our point to point response to their concerns:

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The Abeliovich lab has discovered differential degradation rates for mitochondrial proteins by mitophagy, a selective form of autophagy of mitochondria. In previous publications, they identified two protein kinases Pkp1/2 and the protein phosphatase Aup1(Pct6), which regulate phosphorylation patterns of mitochondrial proteins and affect their degradation rates by mitophagy. In the current manuscript, the authors now analyze the role of the pyruvate dehydrogenase complex (PDC) in mitochondria in the regulation of these regulatory link between Pkp1/2 and Aup1 and their substrate proteins. They find that the deletion of pda1, a core subunit of PDC, inhibits the degradation of previously identified mitophagy substrates Mdh1, Aco2, Qcr2, Aco1, and Idp1, while the turnover of mtDHFR-GFP, a model substrate diffusely localized in the mitochondrial matrix, is not affected. These data indicate that tested mitochondrial proteins are excluded from mitochondrial turnover by mitophagy in the absence of Pda1 by as yet unknown sorting mechanisms. The authors next show that the mutation of a known Pkp1/2 and Aup1 phosphorylation site within Pda1 (S313A) drastically reduces mitophagic turnover of Mdh1, which is completely blocked by the additional deletion of AUP1. A mutation blocking Pda1 activity (R322C) also almost completely blocked Mdh1 degradation. However, PDC activity does not generally affect mitophagy, since an inactivating mutation in Lat1 (K75R) promotes mitophagy of Mdh1. Published work has shown that Pkp1/2 and Aup1, which affect the phosphorylation and degradation of Mdh1, physically interact with the PDC. Thus, the authors tested whether the absence of Pda1 affected the phosphorylation state of Mdh1 and thus its turnover. Indeed, Mdh1 showed globally reduced phosphorylation in the absence of Pda1, which was rescued by the overexpression of Pkp1/2 promoting degradation of Mdh1. Consistent with the effects on mitophagic turnover, Mdh1 was phosphorylated at higher level in the presence of Lat1-K75R. The authors analyzed the mitochondrial distribution of Mdh1 in dependence of Pda1. They provide cytological data suggesting that Mdh1 might segregate from a diffuse matrix localized mtRFP reporter, which is suppressed upon Pkp1/2 overexpression. Finally, the authors perform a mitochondrial phospho-proteome analysis and found that the absence of Pda1 affected phosphorylation sites in 8 mitochondrial matrix proteins. From these data the authors propose a model in which the PDC complex controls the activity of the associated protein kinase Pkp1/2 and the phosphates Aup1 by allosteric changes, which in turn regulates the phosphorylation and differential turnover of mitochondrial proteins by mitophagy. They speculate that the PDC and associated factors could resemble large protein kinase complexes as TORC.

      (1) The authors postulate a structural role for the PDC in regulating Pkp1/2 and Aup1 for controllingmitophagy turnover of certain substrates. While the physical association of Pkp1/2 and Aup1 with PDC has been shown previously, the authors need to critically test their model and assess (a) whether this physicalinteraction occurs under their experimental conditions and (b) whether the different mutants that affectmitophagy of Mdh1 also affect the physical interaction of Pkp1/2 and Aup1 with PDC. Otherwise, their model, although consistent, remains purely speculative.

      Response: We fully agree with the reviewer that this is an important point and we are currently trying to verify these interactions under our working conditions. Naturally, we will then test whether the documented Pdh1-Aup1 and the Pdh1-Pkp1/2 interactions are affected by mutants such as the lat1K75R mutation and the lat1Δ mutation, as insightfully suggested by the reviewer.

      (2) It is important to test every mutant that affects Mdh1 turnover for effects on mitophagy in general using the mtDHFR-GFP reporter to be able to conclude specific effects on Mdh1 mitophagy. For example, the deletion of Lat1 has been shown to induce mitophagy of a mitochondrial matrix reporter during nitrogen starvation (Bockler and Westermann, 2013). While the authors observe a complete block of Mdh1 turnover in lat1 deletion cells, they do observe increased Mdh1 degradation in Lat1-K75R cells. In line with this reasoning, the authors have identified a number of Mdh1 variants in a previous publication (Kolitsida et al. 2019) that can be used to further explore the observed phenomenon. For example, the Mdh1- T199A could be used to test whether the expression of Pkp1/2 in pda1 deficient cells has specific or general effects on mitophagy. Furthermore, can Mdh1-T199D, which is turned over independent of Pkp1/2, be degraded in the absence of Pda1?*

      Response: We completely agree with the reviewer, and we have tested the effect of the lat1Δ mutations on mtDHFR-GFP (Supplemental Figure 2). It is important to note, however, that while the Bockler and Westermann paper does report an increased red/green fluorescence ratio in lat1Δ cells using mtRosella as a mitophagy reporter, this is part of a general screen, and was never further investigated or validated with additional, more rigorous methods (that paper focused on the role of the HERMES complex in mitophagy). In vivo fluorescence can be affected by a multitude of physical intracellular and intra-organellar factors including local pH, ionic strength, and molecular crowding. Therefore, the result from the screen conducted by Bockler and Westermann - although intriguing- is somewhat preliminary and requires more scrutiny (e.g. validation with a GFP release assay, complementation, etc) before we can make conclusions regarding the effect of the lat1Δ mutation on mitophagy, under their experimental conditions.

      Furthermore, can Mdh1-T199D, which is turned over independent of Pkp1/2, be degraded in the absence of Pda1?*

      This is now addressed in Figure 3E and 3F. The results are consistent with our hypothesis, namely showing complete or near complete suppression of the phenotype when tested using the phosphomimetic reporter.

      (3) The cytological data are not convincing. The imaging quality is low and it is very difficult for the reader to appreciate the suggested segregation of Mdh1-GFP and mtRFP. Thus, it is important to provide cortical sections in order to visualize mitochondrial tubules/networks and Mdh1 distribution. This is particularly important, because the authors mention effects of Pda1 on mitochondrial morphology, which appears to be rescued by Pkp1/2 overexpression.

      We are sorry to hear that the reviewer is not convinced by the microscopy data. We agree with the comment that it is not trivial to observe the segregation by eye. We do not intend to say that the segregation is absolute, and the differences we observe in the distributions of the two signals are relative. By that, we mean that a prominent green dot in the cell, which is brighter than other green dots in the same cell, has a red channel counterpart which is not brighter than the other red dots in the cell, or may even be dimmer than the other red dots in the same cell. However, we do provide illustrative examples of segregation with specific arrows pointing to loci of segregation between the two channels. This was not pointed out in the original figure legend and this oversight has now been corrected. More importantly, we performed a mathematical analysis of the overlap between the two signals, and demonstrate a statistically (very) significant difference between the endogenous mitochondrial protein and the artificial mitochondrially-targeted GFP chimera. We would also like to stress that under our conditions (gluconeogenic medium, stationary phase) yeast mitochondria (at least in our genetic background) are not normally found in tubules (this pattern is specifically observed in cells growing in glucose-based medium). Nonetheless, we will attempt to carry out a 3-d reconstruction using our available z-sections, as per the reviewer’s request.

      (3 continued) In this context it would be very informative to follow the localization of PDC in mitochondria. Previous work has shown that Pda1 forms punctate structures in mitochondria in proximity to ER-mitochondria contact sites marked by ERMES (Cohen et al. 2014). Is Pda1 itself degraded by mitophagy or is it excluded? Could the physical interaction of Mdh1 with Pda1 explain mitophagy phenotypes if Pda1 is excluded from mitophagy? In other words, could the PDC form a physical unit to segregate and prevent proteins from mitophagy turnover?

      Response: It was shown by others that Pda1-GFP localizes to mitochondrial puncta, and we also observe this in our system. We have also briefly looked at the mitophagic efficiency of Pda1-GFP. While it is inefficiently delivered to the vacuole (5% free GFP after 5 days, vs 20% for Mdh1), it is not excluded from mitophagic targeting. We are not sure what the reviewer is referring to as a “physical interaction between Pdh1 and Mdh1”. We do not demonstrate such an interaction, and to our knowledge such an interaction has not been reported and validated in any publication. Even if such an interaction existed, it cannot explain the rescue of the pda1Δ phenotype by the co-overexpression of Pkp1 and 2, nor the effect of the pda1Δ mutation on Mdh1 phosphorylation, or the effect of the pda1Δ mutation on other mitophagy reporters which we used, such as Aco1, Qcr2, Idp1, and Aco2. We agree with the reviewer that there is a possibility that an organizing center, be it the PDC or another complex, may regulate intra-matrix segregation and mitophagic trafficking. However, addressing this question would require significant additional time and resources, and we would beg the referee to defer this question to future investigations, as it is not central to the claims made in the current manuscript.

      (4) The conclusions that can be drawn from the analysis of the mitochondrial phospho-proteome independence of Pda1 are rather limited. First, the experimental setup does not distinguish between directeffects of PDC on Pkp1/2 or Aup1 activity and the effects of simply PDC dysfunction on mitochondrial proteins. Along those lines, it is unclear whether the few identified proteins with altered phosphorylation states are indeed targets of Pkp1/2 or Aup1. Thus, to be able to support their conclusion, the authors need to include a number of additional controls/strains.

      Response: We agree with the reviewer that the phosphoproteomic analysis is not comprehensive. However, it was the best that we were able to do at the time, and it is unlikely that we could distinguish direct from indirect effects using this approach. The purpose of the experiment was to test whether we could identify more global effects of Pda1 on mitochondrial protein phosphorylation. This was in no way intended to imply a direct effect. Rather, it provides an unbiased map for potentially identifying the signaling network(s) involved. This may also include downstream events outside the mitochondrial matrix and even in the cytoplasm. We previously showed a connection of the Aup1-dependent signaling pathway to the RTG retrograde signaling pathway (Journo et al, 2009), and such a global analysis may allow a future understanding of how intra-matrix events can signal to the cytoplasm. However, we do not make any specific claims regarding which effects are direct and which are not. To address the reviewer’s concern, we have now modified the text to clarify these points (Page 9 line 28-Page 10 line 1).In an effort to improve the evidence for PDC- dependent mitochondrial matrix phosphorylation, we will carry out a phosphoproteomic analysis comparing WT cells to cells expressing the lat1K75R mutation. Since this mutation is expected to increase kinase activity, we expect to obtain a clearer picture that will complement the results obtained with the pda1Δ deletion mutant.

      Reviewer #1 (Significance (Required)):

      The authors continue to explore the mechanisms underlying their initial observation of differential turnover rates for mitochondrial proteins by mitophagy. This current work specifically builds on the previous publication identifying Pkp1/2 and Aup1 as regulators of the phosphorylation state of specific substrate proteins (Kolitsida et al. 2019). Here the authors now explore how these regulators might be organized and controlled by PDC. Thus, the study adds another layer of regulation. However, the molecular mechanisms of how PDC might regulate Pkp1/2 and Aup1 is not directly addressed. In conclusion, the current study opens up new lines of research that need to be explored to critically test the proposed model. A key question to me is of course how mitochondrial proteins can be excluded from mitophagy and whether the proteins in this study may play a direct role in these mechanisms.

      To further address the mechanism by which the PDC affects mitophagy via Aup1, Pkp1, and Pkp2, we will carry out the following experiments: 1) We will verify the Pda1-Pkp1/2 interaction and test for effects of PDC mutants on this interaction, with an emphasis on the lat1K75R mutation 3) We will analyze the effect of the lat1K75R mutation on mitochondrial phosphoproteomics 4) we will analyze the effect of the pda1Δ mutation on the phosphorylation state of additional proteins which exhibit defective mitophagy in the pda1Δ background (Figure 1B). Our basic hypothesis regarding the reviewers’ question, namely the molecular mechanism behind the observed regulation is based on mitochondrial heterogeneity. We suggest that mitochondrial heterogeneity underlies mitophagic selectivity and that factors which affect heterogeneity also affect mitophagic selectivity.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Kolitsida et al. unravels an interesting link between mitochondrial pyruvatedehydrogenase complex (PDC) and protein sorting for mitophagy-mediated clearance. Using yeast genetics combined with immunoblotting-based mitophagy protein sorting reporter assay, authors show that targeting components of PDC and associated regulatory factors (kinases and phosphatases) interferes with phosphorylation status of at least some mitochondrial matrix proteins and prevents their degradation by autophagic machinery. Experimental design is sound, and statistic evaluation seems appropriate. Although I find this work important and overall valuable, some of the conclusions made by the authors are still preliminary and could require additional experimental testing.

      We thank the reviewer for these supportive comments

      *In this regard, some of my major suggestions include: *

      1. Authors propose that the interaction of dedicated kinases and phosphatases with pyruvate dehydrogenase complex changes their affinity/activity toward the non-PDC substrates (possibly by allosteric regulation). Yet, alternatively, it can also be considered that the PDC complex acts as a docking station that stabilizes the associated regulatory factors inside the mitochondria and allows their additional functions. Indeed, simultaneous overexpression of Pkp1/2 increases the free GFP signal and Mdh1 phosphorylation even in the absence of core E1 PDC component (pda1, figure 3 C, D and figure 4 A, B), suggesting that the interaction of phosphatases and kinases with pda1 is not essential to mediate the phosphorylation of other mitoproteins. Are the protein levels of regulatory PDC factors (Pkp1, Pkp2, Aup1) somewhat altered by depletion of their target E1a subunit (pda1)? Could depletion of pda1 trigger destabilization of these factors and decrease their half-life? This can be tested by western-blotting and combined with cycloheximide chase assay.

      We agree with the reviewer that, theoretically, the PDC could be necessary for the stability of Pkp1, Pkp2 and Aup1. However, have now tested this, and found that deletion of PDA1 has no effect on the expression levels of Aup1, Pkp1 and Pkp2 under our conditions (see Supplementary Figure 2).

      2. Allosteric regulation of specificity of PDC kinases and phosphatases is an intriguing yet still slightly unbacked hypothesis. To provide further experimental evidence for this explanation, authors could develop the classical in vitro kinase assay for Pkp1/2 on their model substrate (Mdh1) in the absence or presence of the Pda1 subunit. Even more excitingly, authors could isolate the Pkp1/2 from wild-type and Pda1- compromised cells and see whether they are able to phosphorylate their model substrate in vitro. Additionally, does specific inhibition of pyruvate dehydrogenase kinases (e.g., with dichloroacetate) affect the Mdh1 phosphorylation levels?

      We present the hypothesis that the PDC allosterically regulates Aup1 and Pkp1/2 , as an explanation for the data. We agree with the reviewer that further experimental work will be necessary to verify this hypothesis. While the Pkp1/2 kinase assays could be useful in this regard, several issues temper our enthusiasm for this experiment:

      i) This assay is far from “classical” with respect to this study. Neither of the two yeast mitochondrial kinases (Pkp1 and Pkp2) was previously characterized using in vitro kinase assays. While the mammalian assay conditions may work here, this is not guaranteed.

      ii) A negative result will have no value

      iii) A positive result (e.g. phosphorylation of recombinant Mdh1 by recombinant Pkp1/2) could arise due to low stringency conditions in the assay

      iv) The nature of the kinase activity is unclear. We do not know if it is Pkp1, Pkp2 or a heterodimer of Pkp1 and Pkp2, or whether currently unknown ancillary factors are necessary for activity. This greatly complicates the analysis.

      In lieu of the direct in vitro kinase assay, we have carried out an experiment demonstrating that an Mdh1-GFP variant with a phosphomimetic threonine to aspartate mutation at position 199, which we previously demonstrated to suppress both the aup1Δ and Pkp2Δ phenotypes (Kolitsida et al, 2019), can also suppress the pda1Δ phenotype (New Figure 3E, F). We suggest that this result provides sufficient evidence of a phosphorylation cascade linking the PDC, Pkp1/2 and Mdh1 mitophagic trafficking.

      In addition, we will also attempt to further support the allosteric regulation hypothesis, by testing whether mutations in LAT1 such as lat1K75R modulate the known Aup1-Pda1 and Pkp1/2-Pda1 interactions (also see response to referee #1).

      We greatly appreciate the suggestion to test the effect of DCA on our assays, and this experiment will be carried out in the near future. However DCA has not been shown to affect Pda1 phosphorylation in yeast or to modulate kinase activity of Pkp1/2, and it is not clear whether this approach will work. Many reagents and inhibitors which act in mammalian cells, are unable to cross the yeast cell wall.

      3. Are the Pkp1/Pkp2 changing their interactome upon PDC alternation (e.g., Pda1 depletion)? Do their interactome alters upon stimulation of mitophagy? Co-immunoprecipitation or bioID assay could shed light on how the partitioning of the Pkp1/Pkp2/Aup1 functions between the metabolic regulation and protein quality control is maintained.

      We agree with the referee that this is an excellent question. Our current hypothesis posits that the established physical interaction between Pkp1/2 and Pda1 is crucial for activity of the kinases towards "third party" clients. To test this we will assay whether the catalytically inactive lat1K75R mutation affects the Pkp1/2 - Pda1 interaction. If we cannot find evidence for such a direct mechanism using these straightforward hypothesis-driven experiments, then we will also analyze effects the pda1Δ mutation on the general interactomes of Pkp1 and Pkp2.

      Minor points:

      1. The paper would benefit from a brief explanation of the principles of the mitophagy reporter assay used in this study.

      Thank you for the suggestion. We have now expanded our explanation of the GFP release assay (see Methods section, Page 13 lines 8-16).

      2. MS phosphoproteomics is indeed a great approach to tackle many questions in this study. Surprisingly,the authors did not discuss these data extensively. Although changes in the phosphorylation status of some proposed targets are apparent (as QCR2), many others were not detected (including Aco1/2, Idp1, and model substrate - Mdh1; Figure 1).

      In our experiment, the coverage of the mitochondrial phosphoproteome was not complete. Some proteins, such as Mdh1, were observed only in a subset of the replicates, and therefore are not included in the figure. In the revision, we plan to add an analysis of the mitochondrial phosphoproteome in the lat1K75R mutation, and hopefully this will provide further insights.

      Furthermore, many significant changes occur in the proteins that do not belong to the mitochondrial matrix compartment (as TOMM20 or YAT1), questioning the direct involvement of Pkp1/Pkp2/Aup1. How do authors interpret these data?

      We previously showed that the Aup1 signaling pathway converges with the RTG retrograde signaling pathway, which signals from the mitochondrial matrix to the nucleus (Journo et al, 2009). We would like to suggest that these effects on non-matrix proteins may indicate a possible role in transducing this signal. We have now added this comment to the discussion on Page 9 lines 28- Page 10 line 1.

      3. Minor spelling mistakes should be reviewed (e.g., "mutophagic" pg. 7).

      Thank you. Fixed.

      *Reviewer #2 (Significance (Required)): *

      *This work further expands on the previous observation by the authors (Kolitsida et al., 2019) and provides a novel and interesting hypothesis that may potentially impact our understanding of mitophagy selectivity toward particular mito proteome content. Furthermore, it unravels the additional function of PDC kinases and phosphatases behind the well-established regulation of energy metabolism. Therefore, it could interest the broad field of researchers interested in mitochondrial quality control and its interplay with cellular metabolism. *

      We thank the reviewer for these encouraging comments.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this elegant and creative study, Dr. Abeliovich and his collaborators describe a novel pathway for the selective elimination of specific proteins. The flow of the experiments is logical and the writing is clear. The topic is very timely as the subject of mitochondrial quality control is of great interest. The main strength of this study is the creative ideas and the very out-of-the-box concept of selective elimination of specific proteins. The main shortcoming of this study is the incomplete direct evidence for a mechanism and conclusions that exceed the results. I believe these are fixable and I am sure this study will be well received after these issues are addressed.

      We thank the reviewer for this positive assessment of the manuscript.

      Specific comments:

      Major comments:

      1. It is not clear what is the mechanism for the recovery of pkp1 and pkp2 shown in figure 4. Supposedly, it is downstream of mitophagy, by how? Is there any data on that?

      We presume the referee is referring to Figure 4A and B, where we demonstrate suppression of the pda1Δ phosphorylation phenotype by co-overexpression of Pkp1 and Pkp2. As indicated in the manuscript, we interpret these results as indicating that Pkp1 and Pkp2 function downstream of the PDC in the regulation of mitophagic trafficking. To further bolster this conclusion we now also show that a phosphomimetic mutation in Mdh1 that suppresses the pkp2Δ mutation as well as the aup1Δ mutation (Kolitsida et al, 2019), is also able to suppress the pda1Δ mutation. In addition, we will use co-IPs to test whether mutations in Lat1, and specifically the lat1K75R mutation which increases mitophagy and Mdh1 phosphorylation, can affect the known Pkp1/2-Pda1 interaction.

      2. Pertaining to the experiment shown in figure 4, it is not clear if this was conducted in the presence of low glucose and if this was required for the effect.

      As pointed out in the legend to Figure 4, the cells were grown in SL medium. As per the “Methods” section, SL contains, in addition to 2% lactate, 0.1% glucose. This small amount of glucose is always added to gluconeogenic media because initial growth in the total absence of glucose is very sluggish. 0.1% glucose does not induce the crabtree effect (does not inhibit respiration) and is consumed during the initial growth phase.

      3. It is not clear if the effect is directly mediated by pct 5&7 acting on MDH1 or if this is an indirect effect. Can this be stated and then addressed experimentally? Are pct5 and pct7 the only phosphatases in the matrix?

      The purpose of this experiment was to untangle the apparent paradox wherein Aup1/Ptc6 is the opposing phosphatase countering Pkp1/2 on Pda1, but this is clearly not the case for the signaling pathway uncovered in this study. This finding further differentiates our signaling pathway from the conventional PDC regulation cascade, and provides a previously unidentified function for Ptc5. We do not claim that these effects are direct. There are 3 documented phosphatases in yeast mitochondria: Ptc5, Ptc7 and Ptc6/Aup1. Since we have previously shown that the aup1Δ phenotype is very similar to that of pkp2Δ, it disqualifies Aup1 from being a candidate for the opposing phosphatase acting on Mdh1-GFP. The experiment shown in Figure 4D and E indicates that Ptc5 is the prime candidate for a phosphatase that targets Mdh1-GFP. We now clarify this point in the discussion (Page 10, lines 14-24), and we suggest that a more definitive identification will be made in future studies.

      4. Figure 6 uses deletion of PDA1 as a loss of function experiment. However in this case, this is a rather crude approach since the point the authors are trying to make is that it is the regulation rather than the expression levels that is critical. If possible, the authors should try and affect the regulatory site by mutation rather than delete the gene.

      We fully agree with this suggestion. We will now test the effect of the lat1K75R mutation on the mitochondrial phosphoproteome, in order to address this deficiency.

      Minor comments:

      1. The title should include that this was done in yeast. Similarly, yeast should be mentioned in the abstract too.

      We have now added this information to the abstract. We do not wish to do the same for the title, as most journals have strict limits on the number of characters in the title.

      2. In the introduction, it is stated that mitophagy is a process degrading dysfunctional mitochondria. It will be more accurate to say that it is degrading dysfunctional mitochondria as well as removing mitochondria in cells that shrink or rebuild their mitochondrial proteome.

      Thank you. We have now made these changes (Page 3, lines 8-11)

      3.Are pkp1 and pkp2 found only in the mitochondria?

      As per the literature, Pkp1 and 2 are exclusively mitochondrial

      4. In figure 1A/B only the blots of MDH1 are shown. It will be informative to show the blots of the other

      proteins (currently in the supplementary)

      We have added these blots to Figure 1

      5. It is not clear how the reporter in fig 1C is different from the reporter in 1A.

      This has now been clarified in the text (Page 4 line 28- Page 5 line 2)

      Figure 4C. Please add an image of the colonies.

      Thank you. This will be added to Figure 4.

      6. It is not clear what the author defines as "increased segregation of Mdh1-GFP relative to generic mtRFP". Please clarify "generic mtRFP" and explain how the relativeness was deducted.

      Thank you. We have now dropped the qualification “generic” and explain the difference. We now also explain the protocol for quantifying the overlap between the two signals (Page 8 lines 9-12, Page 16 lines 19-21 and 27-31).

      Reviewer #3 (Significance (Required)):

      This is a very hot topic and this lab is at the front of it It is for broad cell biology and biochemistry audience

      We thank the referee for his/her generous and supportive comments

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this elegant and creative study, Dr. Abeliovich and his collaborators describe a novel pathway for the selective elimination of specific proteins. The flow of the experiments is logical and the writing is clear. The topic is very timely as the subject of mitochondrial quality control is of great interest. The main strength of this study is the creative ideas and the very out-of-the-box concept of selective elimination of specific proteins. The main shortcoming of this study is the incomplete direct evidence for a mechanism and conclusions that exceed the results. I believe these are fixable and I am sure this study will be well received after these issues are addressed.

      Specific comments:

      Major comments:

      1. It is not clear what is the mechanism for the recovery of pkp1 and pkp2 shown in figure 4. Supposedly, it is downstream of mitophagy, by how? Is there any data on that?
      2. Pertaining to the experiment shown in figure 4, it is not clear if this was conducted in the presence of low glucose and if this was required for the effect.
      3. It is not clear if the effect is directly mediated by pct 5&7 acting on MDH1 or if this is an indirect effect. Can this be stated and then addressed experimentally? Are pct5 and pct7 the only phosphatases in the matrix?
      4. Figure 6 uses deletion of PDA1 as a loss of function experiment. However in this case, this is a rather crude approach since the point the authors are trying to make is that it is the regulation rather than the expression levels that is critical. If possible, the authors should try and affect the regulatory site by mutation rather than delete the gene.

      Minor comments:

      1. The title should include that this was done in yeast. Similarly, yeast should be mentioned in the abstract too.
      2. In the introduction, it is stated that mitophagy is a process degrading dysfunctional mitochondria. It will be more accurate to say that it is degrading dysfunctional mitochondria as well as removing mitochondria in cells that shrink or rebuild their mitochondrial proteome. 3.Are pkp1 and pkp2 found only in the mitochondria?
      3. In figure 1A/B only the blots of MDH1 are shown. It will be informative to show the blots of the other proteins (currently in the supplementary)
      4. It is not clear how the reporter in fig 1C is different from the reporter in 1A. Figure 4C. Please add an image of the colonies.
      5. It is not clear what the author defines as "increased segregation of Mdh1-GFP relative to generic mtRFP". Please clarify "generic mtRFP" and explain how the relativeness was deducted.

      Significance

      This is a very hot topic and this lab is at the front of it It is for broad cell biology and biochemistry audience

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by Kolitsida et al. unravels an interesting link between mitochondrial pyruvate dehydrogenase complex (PDC) and protein sorting for mitophagy-mediated clearance. Using yeast genetics combined with immunoblotting-based mitophagy protein sorting reporter assay, authors show that targeting components of PDC and associated regulatory factors (kinases and phosphatases) interferes with phosphorylation status of at least some mitochondrial matrix proteins and prevents their degradation by autophagic machinery. Experimental design is sound, and statistic evaluation seems appropriate. Although I find this work important and overall valuable, some of the conclusions made by the authors are still preliminary and could require additional experimental testing.

      In this regard, some of my major suggestions include:

      1. Authors propose that the interaction of dedicated kinases and phosphatases with pyruvate dehydrogenase complex changes their affinity/activity toward the non-PDC substrates (possibly by allosteric regulation). Yet, alternatively, it can also be considered that the PDC complex acts as a docking station that stabilizes the associated regulatory factors inside the mitochondria and allows their additional functions. Indeed, simultaneous overexpression of Pkp1/2 increases the free GFP signal and Mdh1 phosphorylation even in the absence of core E1 PDC component (pda1, figure 3 C, D and figure 4 A, B), suggesting that the interaction of phosphatases and kinases with pda1 is not essential to mediate the phosphorylation of other mitoproteins. Are the protein levels of regulatory PDC factors (Pkp1, Pkp2, Aup1) somewhat altered by depletion of their target E1a subunit (pda1)? Could depletion of pda1 trigger destabilization of these factors and decrease their half-life? This can be tested by western-blotting and combined with cycloheximide chase assay.
      2. Allosteric regulation of specificity of PDC kinases and phosphatases is an intriguing yet still slightly unbacked hypothesis. To provide further experimental evidence for this explanation, authors could develop the classical in vitro kinase assay for Pkp1/2 on their model substrate (Mdh1) in the absence or presence of the Pda1 subunit. Even more excitingly, authors could isolate the Pkp1/2 from wild-type and Pda1-compromised cells and see whether they are able to phosphorylate their model substrate in vitro. Additionally, does specific inhibition of pyruvate dehydrogenase kinases (e.g., with dichloroacetate) affect the Mdh1 phosphorylation levels?
      3. Are the Pkp1/Pkp2 changing their interactome upon PDC alternation (e.g., Pda1 depletion)? Do their interactome alters upon stimulation of mitophagy? Co-immunoprecipitation or bioID assay could shed light on how the partitioning of the Pkp1/Pkp2/Aup1 functions between the metabolic regulation and protein quality control is maintained.

      Minor points:

      1. The paper would benefit from a brief explanation of the principles of the mitophagy reporter assay used in this study.
      2. MS phosphoproteomics is indeed a great approach to tackle many questions in this study. Surprisingly, the authors did not discuss these data extensively. Although changes in the phosphorylation status of some proposed targets are apparent (as QCR2), many others were not detected (including Aco1/2, Idp1, and model substrate - Mdh1; Figure 1). Furthermore, many significant changes occur in the proteins that do not belong to the mitochondrial matrix compartment (as TOMM20 or YAT1), questioning the direct involvement of Pkp1/Pkp2/Aup1. How do authors interpret these data?
      3. Minor spelling mistakes should be reviewed (e.g., "mutophagic" pg. 7).

      Significance

      This work further expands on the previous observation by the authors (Kolitsida et al., 2019) and provides a novel and interesting hypothesis that may potentially impact our understanding of mitophagy selectivity toward particular mito proteome content. Furthermore, it unravels the additional function of PDC kinases and phosphatases behind the well-established regulation of energy metabolism. Therefore, it could interest the broad field of researchers interested in mitochondrial quality control and its interplay with cellular metabolism.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The Abeliovich lab has discovered differential degradation rates for mitochondrial proteins by mitophagy, a selective form of autophagy of mitochondria. In previous publications, they identified two protein kinases Pkp1/2 and the protein phosphatase Aup1(Pct6), which regulate phosphorylation patterns of mitochondrial proteins and affect their degradation rates by mitophagy. In the current manuscript, the authors now analyze the role of the pyruvate dehydrogenase complex (PDC) in mitochondria in the regulation of these regulatory link between Pkp1/2 and Aup1 and their substrate proteins. They find that the deletion of pda1, a core subunit of PDC, inhibits the degradation of previously identified mitophagy substrates Mdh1, Aco2, Qcr2, Aco1, and Idp1, while the turnover of mtDHFR-GFP, a model substrate diffusely localized in the mitochondrial matrix, is not affected. These data indicate that tested mitochondrial proteins are excluded from mitochondrial turnover by mitophagy in the absence of Pda1 by as yet unknown sorting mechanisms.

      The authors next show that the mutation of a known Pkp1/2 and Aup1 phosphorylation site within Pda1 (S313A) drastically reduces mitophagic turnover of Mdh1, which is completely blocked by the additional deletion of AUP1. A mutation blocking Pda1 activity (R322C) also almost completely blocked Mdh1 degradation. However, PDC activity does not generally affect mitophagy, since an inactivating mutation in Lat1 (K75R) promotes mitophagy of Mdh1. Published work has shown that Pkp1/2 and Aup1, which affect the phosphorylation and degradation of Mdh1, physically interact with the PDC. Thus, the authors tested whether the absence of Pda1 affected the phosphorylation state of Mdh1 and thus its turnover. Indeed, Mdh1 showed globally reduced phosphorylation in the absence of Pda1, which was rescued by the overexpression of Pkp1/2 promoting degradation of Mdh1. Consistent with the effects on mitophagic turnover, Mdh1 was phosphorylated at higher level in the presence of Lat1-K75R. The authors analyzed the mitochondrial distribution of Mdh1 in dependence of Pda1. They provide cytological data suggesting that Mdh1 might segregate from a diffuse matrix localized mtRFP reporter, which is suppressed upon Pkp1/2 overexpression.

      Finally, the authors perform a mitochondrial phospho-proteome analysis and found that the absence of Pda1 affected phosphorylation sites in 8 mitochondrial matrix proteins.

      From these data the authors propose a model in which the PDC complex controls the activity of the associated protein kinase Pkp1/2 and the phosphates Aup1 by allosteric changes, which in turn regulates the phosphorylation and differential turnover of mitochondrial proteins by mitophagy. They speculate that the PDC and associated factors could resemble large protein kinase complexes as TORC.

      Critical points:

      1. The authors postulate a structural role for the PDC in regulating Pkp1/2 and Aup1 for controlling mitophagy turnover of certain substrates. While the physical association of Pkp1/2 and Aup1 with PDC has been shown previously, the authors need to critically test their model and assess (a) whether this physical interaction occurs under their experimental conditions and (b) whether the different mutants that affect mitophagy of Mdh1 also affect the physical interaction of Pkp1/2 and Aup1 with PDC. Otherwise, their model, although consistent, remains purely speculative.
      2. It is important to test every mutant that affects Mdh1 turnover for effects on mitophagy in general using the mtDHFR-GFP reporter to be able to conclude specific effects on Mdh1 mitophagy. For example, the deletion of Lat1 has been shown to induce mitophagy of a mitochondrial matrix reporter during nitrogen starvation (Bockler and Westermann, 2013). While the authors observe a complete block of Mdh1 turnover in lat1 deletion cells, they do observe increased Mdh1 degradation in Lat1-K75R cells. In line with this reasoning, the authors have identified a number of Mdh1 variants in a previous publication (Kolitsida et al. 2019) that can be used to further explore the observed phenomenon. For example, the Mdh1-T199A could be used to test whether the expression of Pkp1/2 in pda1 deficient cells has specific or general effects on mitophagy. Furthermore, can Mdh1-T199D, which is turned over independent of Pkp1/2, be degraded in the absence of Pda1?
      3. The cytological data are not convincing. The imaging quality is low and it is very difficult for the reader to appreciate the suggested segregation of Mdh1-GFP and mtRFP. Thus, it is important to provide cortical sections in order to visualize mitochondrial tubules/networks and Mdh1 distribution. This is particularly important, because the authors mention effects of Pda1 on mitochondrial morphology, which appears to be rescued by Pkp1/2 overexpression. In this context it would be very informative to follow the localization of PDC in mitochondria. Previous work has shown that Pda1 forms punctate structures in mitochondria in proximity to ER-mitochondria contact sites marked by ERMES (Cohen et al. 2014). Is Pda1 itself degraded by mitophagy or is it excluded? Could the physical interaction of Mdh1 with Pda1 explain mitophagy phenotypes if Pda1 is excluded from mitophagy? In other words, could the PDC form a physical unit to segregate and prevent proteins from mitophagy turnover?
      4. The conclusions that can be drawn from the analysis of the mitochondrial phospho-proteome in dependence of Pda1 are rather limited. First, the experimental setup does not distinguish between direct effects of PDC on Pkp1/2 or Aup1 activity and the effects of simply PDC dysfunction on mitochondrial proteins. Along those lines, it is unclear whether the few identified proteins with altered phosphorylation states are indeed targets of Pkp1/2 or Aup1. Thus, to be able to support their conclusion, the authors need to include a number of additional controls/strains.

      Significance

      The authors continue to explore the mechanisms underlying their initial observation of differential turnover rates for mitochondrial proteins by mitophagy. This current work specifically builds on the previous publication identifying Pkp1/2 and Aup1 as regulators of the phosphorylation state of specific substrate proteins (Kolitsida et al. 2019). Here the authors now explore how these regulators might be organized and controlled by PDC. Thus, the study adds another layer of regulation. However, the molecular mechanisms of how PDC might regulate Pkp1/2 and Aup1 is not directly addressed. In conclusion, the current study opens up new lines of research that need to be explored to critically test the proposed model. A key question to me is of course how mitochondrial proteins can be excluded from mitophagy and whether the proteins in this study may play a direct role in these mechanisms.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      SUMMARY

      The manuscript by Smoak et al., provides an analysis of the Hyr/Iff-like (Hil) genes in Candida species with a strong focus on C. auris. The authors demonstrate a repeated expansion of these genes in unique lineages of fungal species, many of which are associated with stronger clinical disease. There is evidence of selection operating on the gene family in the primary domain used for identification. These genes include a repeat just downstream of that core domain that changes frequently in copy number and composition. The location of these genes tends to cluster at chromosome ends, which may explain some aspects of their expansion. The study is entirely in silico in nature and does not include experimental data.

      MAJOR POINTS

      Altogether, many of the general findings could be convincing but there are some aspects of the analysis that need further explanation to ensure they were performed correctly. To start, a single Hil protein from C. auris was used as bait in the query to find all Hil proteins in yeast pathogens. Would you get the same outcome if you started with a different Hil protein? What is the basis for using Hil1 as the starting point? It also doesn't make sense to me to remove species just because there are already related species in the list. This may exclude certain evolutionary trends. Furthermore, it would be helpful to know how using domain presence and the conservation of position changes the abundance of the gene family across species? (beginning of results).

      We appreciate the reviewer’s criticisms on our strategy for identifying Hil proteins. In response, we have significantly revised our pipeline. In particular, we now combine the search results from three queries: in addition to C. auris Hil1’s Hyphal_reg_CWP domain (XP_028889033), we added the Hyphal_reg_CWP sequences from C. albicans Hyr1 and C. glabrata Hyr1. They were chosen as representatives in the two phylogenetic groups distinct from the one containing C. auris in order to avoid the bias due to the query’s phylogenetic position. Using the same criteria as we did for the original search, we identified three additional hits compared with the original 104 homologs list. In response to the criticism of the arbitrary exclusion of some species, we now include any species from the BLASTP search results as long as it is part of the 332 yeast species studied by Shen et al. 2018 (PMID: 30415838). The reason for this criterion is so that we can use the high-quality species phylogeny generated by Shen et al. 2018 to properly study the gene family evolution by reconciling the gene tree with the species tree. We additionally include the species in the MDR clade closely related to C. auris and used Muñoz et al. 2018 (PMID: 30559369) as the basis for the species phylogeny in the clade. Lastly, we no longer require the particular domain organization in classifying Hil family members. All BLASTP hits satisfying the E-value cutoff of 1x10-5 and query coverage > 50% are included.

      A major challenge in the analysis like this one is in dealing with repetitive sequences present in amplified gene families. For example, testing modes of selection on non-conserved sites is fraught. It's not clear if all sites used for these tests are positionally conserved and this should be clarified. Alignments at repeat edges will need to maintain this conservation and relatively good alignments as stated in lines 241-242 are concerning that this includes sequence that does not retain this structure necessary for making predictions of selection.

      We appreciate the reviewer’s comment. In the original manuscript, we performed two different types of analyses, one on the conserved and well-aligned Hyphal_reg_CWP domain and another on the rapidly evolving repeat region. For the former, we performed phylogenetic dN/dS analyses using maximum-likelihood, for which a reliable alignment is crucial and is the case here. The Hyphal_reg_CWP domain alignment for C. auris Hil1-Hil8 is shown below and also included as Fig. S7 in the revised manuscript: (figure in the response file)

      In the text, we added this sentence to emphasize this point: “We chose to focus on the Hyphal_reg_CWP domain because of its potential importance in mediating adhesion and also because the high-quality alignment in this domain allowed us to confidently infer the evolutionary rates (Fig. S7).”

      For the repeat domain, what we did in the original version was to calculate the pairwise dN/dS between individual repeat units found in Hil1 and Hil2. This didn’t require aligning the entire repeat regions in the two proteins, but instead relied on the alignment of the individual ~44 aa repeat units, which were highly conserved (see below). In the revised manuscript, however, we decided to focus our analyses on the Hyphal_reg_CWP domain because of a different concern, namely gene conversions between paralogs could distort the evolutionary history of the repeats (the same concern was addressed for the effector domain using an additional step of detecting recombination breakpoints, but the same analysis would be challenging for the repeat region due to alignment issues).

      (figure in the response file)

      It's also unclear to me why Figure S12 is here. The parameters for this analysis should be tested ahead of building models so only one set of parameters should be necessary to run the test. The evolutionary tests within single genes and across strains is really nice!

      We appreciate the reviewer’s suggestion. Based on the reviewer’s suggestion, we removed Fig. S12 and describe the model set up in the Materials and Methods section. We were not sure if the last point was a comment or a suggestion. We didn’t perform a population level selective sweep scan in C. auris. Such an analysis has in fact been attempted by Muñoz et al. 2021, who identified several members of the Hil family as the top candidates for positive selection (PMID: 33769478). We cited this in our Discussion:

      “Lastly, scans for selective sweep in C. auris identified Hil and Als family members as being among the top 5% of all genes, suggesting that adhesins are targets of natural selection in the recent evolutionary history of this newly emerged pathogen (Muñoz et al. 2021).”

      A major challenge for expanded gene families is rooting based on the inability to identify a strong similarity match for the full length sequence. The full alignment mentioned would certainly include significant gaps. If those gaps are removed and conserved sites only are used, does it produce the same tree? Inclusion of unalignable sequences would be expected to significantly alter the outcomes of those analysis and may produce some spurious relationships in reconciling with the species trees. Whether or not there are similar issues in the alignment of PF11765 need to be addressed as well. There's nothing in the methods that clarifies site selection.

      We appreciate reviewer’s comment and agree with the concern about alignment quality affecting phylogenetic reconstruction. To clarify, all phylogenetic analyses in this work are based on the alignment of the Hyphal_reg_CWP domain, which is well aligned (shown above for the subset of eight homologs in C. auris). Alignment of all 215 homologs is provided for readers to review (shorturl.at/kDEJ3). To clarify this choice, we now include the following in Results:

      “To further characterize the evolutionary history of the Hil family, including among closely related Candida lineages, we reconstructed a species tree-aware maximum likelihood phylogeny for the Hil family based on the Hyphal_reg_CWP domain alignment (Fig. 1C, Fig. S2).”

      We also included detailed steps for reconstructing the gene tree in Materials and Methods.

      To test the effect of gaps in the alignment on phylogenetic tree inference, we used two trimming programs, ClipKit (PMID: 33264284) and BMGE (PMID: 20626897), with author-recommended modes. They resulted in consistent gene tree results. We present the tree based on the ClipKit trimmed alignment in the main results. The root of the gene tree was inferred by jointly maximizing the likelihood scores for the gene tree based on the alignment and the evolution of the gene family within the species tree, using GeneRax (Morel et al. 2020, PMID: 32502238).

      Figure 1A: the placement of evolved pathogenesis is a little arbitrary. It's just as feasible that a single event increased pathogenesis in the LCA of C. albicans and C. parapsilosis that was subsequently lost in L. elongisporus. These should be justified or I'd suggest removing. The assignment of Candida species here also seems incomplete. The Butler paper notes both D. hansineii and C. lusitaniae as Candida species whereas they are excluded here.

      We removed Figure 1 entirely based on this and another reviewer’s comment. We note that there is broad consensus that opportunistic yeast pathogens have independently arisen multiple times, such as C. auris, C. albicans and C. glabrata. Whether Candida pathogens that are more closely related evolved separately or not are subjects of ongoing research (PMID: 24034898).

      It is tricky to include scaffolds in analysis of chromosomal location of the HIL genes. The break in the scaffold may be due to the assc repeats of these proteins alone or other, nearby repeats. Any statistics would be best done to include only known chromosomes or those that are strongly inferred by Munoz, 2021. This will change the display of Figure 7, but is unlikely to change the take home message.

      We agree with the reviewer’s concern. In the revised manuscript and with more species included, we now only analyze genomes assembled to a chromosomal level, with the exception of C. auris B8441, which is supported by Muñoz et al. 2021 as having chromosome-length sequences. The revised Figure 7 now only includes these results. We also removed the accompanying supplementary figure that showed results based on scaffold-level assemblies.

      MINOR POINTS

      Line 18: "spp." Should be "spps."

      Addressed throughout the revised manuscript.

      Line 41: I might rephrase this as "how pathogenesis arose in yeast..."

      Accepted (line 43 in revised manuscript).

      I might use a yeast-centric example around line 40 for duplication and divergence. This could include genes for metabolism of different carbon sources in S. cerevisiae.

      Accepted (lines 47-48)

      The Butler paper referenced on line 51 compared seven Candida species and 9 Saccharomyces species

      Changed (line 48)

      The autors state no other evolutionary analysis of adhesins has been performed but do not acknowledge this study: https://academic.oup.com/mbe/article/28/11/3127/1047032

      We appreciate the reviewer pointing this important reference to us. We now cite it in the introduction (line 64) and discussion (line 340)

      The first paragraph of the Results could be condensed

      Addressed.

      How was the species tree in Figure 1A obtained?

      The previous figure 1 is now removed. The species tree used throughout the manuscript is based on Shen et al. 2018 with MDR clade species added, based on Muñoz et al. 2018.

      Figure 2: In panel A, "DH" and "SS" are not defined. I'd be careful with use of "non-albicans Candida" in Figure 2B. This usually includes C. tropicalis and C. dubliniensis and may confuse the reader.

      We removed the DH and SS labels. Instead, we highlighted three clades, which were defined in previous studies. These are the Candida/Lodderomyces clade (based on NCBI taxonomy database), the MDR clade (e.g., Muñoz et al. 2018, PMID: 30559369) and the glabrata clade (e.g., Gabaldón et al. 2013, PMID: 24034898).

      How was the binding domain defined to extract those sequences are produce a phylogeny? In building a ML model, how were parameters chosen?

      We now provide the following details in the Materials and Methods section:

      “To infer the evolutionary history of the Hil family, we reconstructed a maximum-likelihood tree based on the alignment of the conserved Hyphal_reg_CWP domain. First, we used hmmscan (HmmerWeb version 2.41.2) to identify the location of the Hyphal_reg_CWP domain in each Hil homolog. We used the “envelope boundaries” to define the domain in each sequence, and then aligned their amino acid sequences using Clustal Omega with the parameter {--iter=5}. We then trimmed the alignment using ClipKit with its default smart-gap trimming mode (Steenwyk et al. 2020). RAxML-NG v1.1.0 was compiled and run on the University of Iowa ARGON server with the following parameters on the alignment: raxml-ng-mpi --all --msa $align --model LG+G --seed 123 --bs-trees autoMRE.”

      The parameters for the ML tree reconstruction is listed on the last line above. The main parameter was the evolutionary model (LG+G), which accounts for rate variations using a gamma distribution. Other protein evolution models, e.g., VT+I+G, were tested and resulted in nearly identical tree topologies.

      Figure 3C/D could be just one panel.

      The structure predictions are now reorganized and presented on their own in the new Figure 3.

      Can you relate more the fungal hit to the Hil proteins conveyed in lines 152-154?

      We appreciate the reviewer’s comment, which referred to CgAwp1 and CgAwp3, whose effector domain structures were reported in a recent study (Reithofer et al. 2021, PMID: 34962966). We now discuss them in relation to the predicted Hyphal_reg_CWP structure, by showing them in Figure 3 and describing them in the Results (lines 181) “crystal structures for the effector domains of two Adhesin-like Wall Proteins (Awp1 and Awp3b) in C. glabrata, which are distantly related to those in the Hil family were recently reported, and the predicted structure of one of C. glabrata’s Hil family members (Awp2) was found to be highly similar to the two solved structures (Reithofer et al. 2021)”

      Line 168: Should read "Hence, ..."

      The original sentence was removed, but this grammatical error was checked for and corrected.

      Label proteins along the top of Figure 4 too.

      Accepted (in new Figure 4).

      Figure 5: for tests of selection, were sites conserved across the group? What does the black number at each node indicate? Dn and Ds are given as decimals. This is based on what attribute? For panel B, it is unclear what each tip denotes i.e., Hil1_tr6. Hil1 is the gene but what is "tr6"?

      In the revised manuscript, we provide the multiple sequence alignment for the Hyphal_reg_CWP domain used for the selection analysis as Fig. S7 to illustrate the level of conservation. The black numbers at the internal nodes are numeric indices used to refer to those nodes. In the revised manuscript, we use some of them to refer to the internal branches, e.g., 12…14 in the legend. In the new Figure 5, we do not list the numeric values of Dn and Ds (aka Ka, Ks). Instead, we use a color gradient to represent the estimated dN/dS ratios. The raw estimates are available in the project github repository. Panel B in the original Figure 5 and other panels related to the evolution of the repeats are now removed.

      It's unclear why comparison of the PF11765 domain includes the MRD proteins when those aren't included in the comparison to the repeats alone. Could that skew the comparison due to unequal sample numbers or changed variation frequencies in MDR relative to the other two groups?

      These results pertaining to the evolution of the repeats are now removed.

      Table 2 doesn't add much. This section could probably be reduced to a few sentences since it's highly speculative (intraspecies variation).

      Table 2 is now Table S5. We also simplified the result section in the revised version. While the functional implications of the intraspecific variable number of tandem repeats (VNTR) is speculative, it is founded on two bases: 1) the identification of the VNTR is credible, as the copy number variation is consistent within clades but differ between clades, which is not expected if they are caused by assembly errors; 2) experimental studies in S. cerevisiae for the Flo family strongly supported a direct impact of adhesin length on the adhesive phenotype of the cells (PMID: 16086015).

      Table 3 is not needed.

      Table 3 is now removed.

      Figure 6 - color coding in 6A needs to be explained. I'm assuming this is a taxonomical coding.

      In the revised Figure 6A, the coloring scheme is consistent with what we used in Figure 1 based on the three clades, and a legend is provided.

      Figure 1B is unnecessary. A Model of the protein indicating domains is sufficient here. Figure 1C needs labels for all termini, not just the pathogenic red branches. The figure doesn't provide clear association between adhesin families and the associated species. This could be omitted, especially since Flo is often associated with Saccharomyces species. Figure 1D is unnecessary.

      We have removed the original Figure 1.

      SIGNIFICANCE

      The work here is sorely needed in expanded gene families and in fungi specifically. No analysis at this level has been performed, to the best of my knowledge, in any fungal associated gene family and certainly not in relationship to pathogenic potential. The authors do a good job in citing the foundational literature upon which their study builds in most cases (one exception is noted above). It would be of general interest to those interested in the evolution of virulence, but the analysis is tricky. This is the biggest drawback I currently have as some of the information to assess the results is missing.

      We really appreciate the reviewer's positive comments. We agree and plan to explore the relationship between the adhesin family evolution and virulence phenotypes.

      Expertise: gene families, evolution dynamics, human fungal pathogens

      Reviewer #2

      SUMMARY

      Gene duplication and divergence of adhesin proteins are hypothesized to be linked with the emergence of pathogenic yeasts during evolution; however, evidence supporting this hypothesis is limited. Smoak et al. study the evolutionary history of Hil genes and show that expansion of this gene family is restricted to C. auris and other pathogenic yeasts. They identified eight paralogous Hil proteins in C. auris. All these proteins share characteristic domains of adhesin, and the structural prediction supports that their tertiary structures are adhesin-like. Evolutionary analysis of protein domains finds weak evidence of positive selection in the ligand-binding domain, and the central domain showed rapid changes in repeat copy number. However, performed tests cannot unambiguously distinguish between positive selection and relaxed selection of paralogs after gene duplication. Some alternative tests are suggested that may be able to provide more unambiguous evidence. Together with these additional tests, the detailed phylogenetic analyses of Hil genes in C. auris might be able to better support the hypothesis that the expansion and diversification of adhesin proteins could contribute to the evolution of pathogenicity in yeasts.

      We appreciate the reviewer’s comments and will address specific points below.

      MAJOR COMMENTS

      The authors present extensive analyses on the evolution of Hil genes in C. auris. There is significant merit in these analyses. However, the analyses conducted so far are incomplete, lacking proper consideration of other confounding factors. Detailed explanations of our major comments are listed below.

      1. First, the authors restricted genes in the Hil family to those only containing the Hyphal_reg_CWP domain. Yet, previous work included genes containing the ligand-binding domain or the repeat domain as Hil genes. More justification is needed whether the author's choice represents the natural evolutionary history of Hil genes appropriately. For instance, are the genes only containing the ligand-binding domain monophyletic or polyphyletic? We recommend including the phylogeny of all the Hil candidate genes, to discern whether evolutionary histories of the repeat domain and ligand-binding domain are congruent. Authors can use this phylogeny as justification to focus only on the ligand-binding domain containing genes.

      Butler et al. 2009 (PMID: 19465905) defined the Als family and the Hyr/Iff family as having either the N-terminal effector domain or the intragenic tandem repeats (ITR). Their rationale for the latter was that the ITS sequences were often conserved across species. Upon close inspection (Fig. S19,20 in that paper), however, we found that the ITS tend to be conserved in closely related species, but diverged among more distantly related species. Moreover, proteins in those figures that only contain the ITS and not the ligand-binding domains are all missing either the signal peptide, the GPI-anchor or both. This raises questions as to whether these proteins sharing the ITS sequence alone act as adhesins.

      More generally, defining the evolutionary history of proteins with multiple domains is complicated by recombination, which causes different parts (e.g., domains) of the protein to have distinct evolutionary histories. In fact, our study and others show that there exist “chimeras” that combine the effector domain from one adhesin family and the repeat sequence found in another (Zhao et al. 2011, PMID: 21208290, Oh et al. 2019, PMID: 31105652). In these cases, one phylogenetic tree is insufficient to describe the evolutionary history of the whole protein. We chose to define the Hil family by the Hyphal_reg_CWP domain and thus focus on the evolutionary history of this region because 1) while tandem repeat regions also contribute to adhesion in yeasts (Rauceo et al. 2006, PMID: 16936142), the effector domain likely plays a more important role in ligand binding and specificity. Therefore, we believe using the effector domain to define a protein family is more likely to group proteins with similar functional properties than if the repeat sequences were used. Also, while putative fungal adhesins lacking a recognizable ligand-binding domain exist, they are rare (Lipke 2018, PMID: 29772751); 2) The repeat region evolved much more rapidly than the effector domain, as we illustrate in Figures 2, 4 and 6 in our revised manuscript. While some repeat units are highly conserved, e.g., the ~44 aa unit found in Hil1-4 in C. auris and close relatives in the MDR clade, many others are short and degenerate, making it difficult to reliably identify homologs sharing the repeat. Besides, since each protein could contain many distinct repeats, it is not clear how one defines two sequences as belonging to the same family if they share one out of six types of repeats. We acknowledge that this definition leaves out the evolutionary history defined by the tandem repeats, which may reveal intriguing evolutionary dynamics, with functional implications. A recent review for the Als family discussed similar definition challenges and partly supported our choice (Hoyer and Cota, 2016, PMID: 27014205).

      In the analysis of positive selection, the authors do not adequately control for the effect of recombination on the evolutionary histories of protein sequences, especially given that Hil genes are rich in repetitive sequences. To account for recombination, GARD, an algorithm detecting recombination, should be performed to detect any recombination breakpoints within a protein domain. If recombination did occur within a protein domain, the authors should treat the unrecombined part as a single unit and use the phylogenetic information of this part to proceed with PAML analysis, instead of using the phylogeny of the entire protein domain. The authors should consider doing GARD analysis for the ligand-binding and repeat domains. For the repeat domain, low BS values in Fig. 5C indicate recombination between repeat units. Thus, the authors should analyze each repeat unit with GARD and re-analyze dN/dS.

      We deeply appreciate the reviewers’ criticism here. In the revised manuscript, we removed the analysis of the repeat units and followed the reviewers’ suggestion to carry out GARD analysis on the effector domain, which we now show reveals evidence of intra-domain recombination. Using the inferred breakpoints (Fig. S8), we identified two putatively non-recombining partitions and performed all downstream phylogenetic analyses on them separately. The results are presented in Fig. 5 and Table S6. Compared with the previous result based on the entire Hyphal_reg_CWP domain alignment, the new results reveal clearer patterns, including significantly elevated dN/dS on a subset of the branches. Newly added branch-site test results support a role of positive selection on the effector domain during the expansion of the Hil family in C. auris, suggesting functional diversification following gene duplications.

      The authors concluded positive selection in the ligand-binding domain based on the branch-wise model of PAML. Yet, w values were not higher than one, and it's unclear whether the difference in selective pressures the authors claimed here is biologically significant. Overall, what the authors present so far seems to be weak evidence of positive selection but is much more consistent with variation in the degree of purifying selection or evolutionary constraint. Using the site-wise model (m7 vs. m8) in PAML would allow the authors to detect which residues of the ligand-binding domain underwent recurrent positive selection. Combining the evolutionary information of protein residues and the predicted 3D structure will provide molecular insights into the biological impact of rapidly evolving residues. This would be a significant addition and raise the significance of the study, besides providing potentially stronger evidence of positive selection.

      We appreciate the reviewers’ criticism and suggestions. In the revised manuscript, we performed site tests comparing models M2a vs M1a, M8 vs M7 and M8a vs M8. For partition 1 (P1-414), all three tests were insignificant. For partition 2 (P697-987), the M2a vs M1a test was insignificant (P > 0.05) but M8 vs M7 and M8a vs M7a were both significant at a 0.01 level, and the omega estimate for the positively selected category was estimated to be ~15. The site tests require all branches to evolve under the same selection regime. To relax this constraint, we performed additional branch-site tests by designating the branches with an estimated dN/dS > 10 as the foreground (based on the free-ratio model estimates). This test was significant for both branches at a 0.01 level and the Bayes Empirical Bayes (BEB) procedure identified a total of 5 residues as having been under positive selection. Although three of the five residues, located in the C-terminus of the Hyphal_reg_CWP domain, are part of the α-crystallin domain, we refrain from drawing any functional conclusions because 1) the BEB procedure is known to be lacking power in identifying positively selected residues and 2) we still lack structure-function relationship for the α-crystallin domain. But we agree and believe that this line of analysis is promising in yielding functional insight into the evolution of the effector domain in the family.

      MINOR COMMENTS

      1. In Fig 1c, the figure legend should include more specific details: which adhesin proteins are shown here? Please specify species names on the species tree

      Figure 1 is removed in the revised manuscript

      In Fig 3E, secondary structures are labeled with the wrong colors. Sheet: purple, helix: yellow

      In the revised manuscript, the structures of SRRP-BR (original 3E) is now colored in a single color.

      What's the ligand-binding activity of the b-solenoid fold? How structurally similar are C. auris PF 11765 domains compared to C. glabrata Awp domains? This information will support the role of adhesin for the ligand-binding domain of Hil genes.

      We discuss the ligand-binding activity of the β-solenoid as follows in Discussion:

      “The elongated shape and rigid structure of the β-helix are consistent with the functional requirements of adhesins, including the need to protrude from the cell surface and the capacity for multiple binding sites along its length that facilitate adhesion. In some bacterial adhesins, such as the serine rich repeat protein (SRRP) from the Gram-positive bacterium, L. reuterii, a protruding, flexible loop in the β-helix was proposed to serve as a binding pocket for its ligand (Sequeira et al. 2018). Such a feature is not apparent in the predicted structure of the Hyphal_reg_CWP domain. Further studies are needed to elucidate the potential substrate for this domain and its mechanism of adhesion.”

      We also compare the structures of the C. auris Hil1/Hil7 Hyphal_reg_CWP domain and the CgAwp1/3 in Figure 3, with this in the legend “(C) Crystal structure of the C. glabrata Awp1 effector domain, which is highly similar to C. auris Hil1 and Hil7, but with the disulfide bond in a different location.”

      We added a section in the Discussion to comment on the structure-function relationship based on known β-helix (aka β-solenoid) structures. The main insight comes from similar structures identified through DALI searches, many of which are bacterial and viral surface proteins mediating adhesion. The ligand binding pocket and specificity would require additional structural studies to elucidate.

      In lines 248-249, the authors should also consider the influence of evolutionary history. For instance, repeats within the same Hil protein appeared later in evolution, compared to Hil gene duplication, and therefore these repeats experience less time for sequence divergence.

      In the revised manuscript, we removed the analyses pertaining to the evolution of the repeat regions due to multiple challenges including alignment, potential of gene conversion and recombination. This is an important and intriguing aspect of adhesin family evolution that we plan to follow up in future work.

      Although the bioinformatic evidence of C. auris Hil genes acting as adhesins is strong, it is still worthwhile to discuss the experiments of confirming the function of adhesins.

      We agree with the reviewer and acknowledge in the revised manuscript the limitation of our work:

      “Future experimental tests of these hypotheses will be important biologically for improving our understanding of the fungal adhesin repertoire, important biotechnologically for inspiring additional nanomaterials, and important biomedically for advancing the development of C. auris-directed therapeutics.”

      SIGNIFICANCE

      Overall, this study is interesting to investigate the evolutionary history of a crucial virulent gene in C. auris. Such evolutionary understanding will help us identify critical molecular changes associated with the pathogenicity of an organism during evolution, providing insights into the emergence of pathogens and novel strategies to cure fungal infections. The research question is important; however, the current analyses on the positive selection are incomplete, so the conclusion is modest so far. We recommend that the authors re-do the PAML analysis with the above considerations. This work will bring more significance to the mycology field if the functional impact of rapid evolution in protein domains can be supported or inferred.

      This manuscript is well-written, and the authors also did a great job specifying all the necessary details in the M&M.

      We appreciate the reviewers’ positive comments.

      Reviewer #3

      Summary:

      The manuscript by Smoak et al. provides considerable information gleaned from analysis of HYR/IFF genes in 19 fungal species. A specific focus is on Candida auris. The main conclusion is that this gene family repeatedly expanded in divergent pathogenic Candida lineages including C. auris. Analyses focus on the sequences encoding the protein's N-terminal domain and tracts of repeated sequences that follow. The authors conclude with the hypothesis that expansion and diversification of adhesin gene families underpin fungal pathogen evolution and that the variation among adhesin-encoding genes affects adhesion and virulence within and between species. The paper is easy to read, includes clear and attractive graphics, as well as a considerable number of supplementary data files that provide thorough documentation of the sources of information and their analysis.

      We appreciate the positive comment.

      MAJOR COMMENTS:

      • Are the key conclusions convincing?

      Overall, the authors' conclusions are supported by the information they present. However, the overall conclusion is stated as a hypothesis and that hypothesis is not particularly novel. The idea that expansion of gene families associated with pathogenesis occurs in the pathogenic species dates back at least to Butler et al. 2009, who first presented the genome sequences for many of the species considered here.

      We appreciate the reviewer’s comment. Our main conclusions are 1) the Hil family is strongly enriched in distinct clades of pathogenic yeasts after accounting for phylogenetic relatedness. This enrichment results from independent duplications, which is ongoing between closely related species; 2) the protein sequence of the Hil family homologs diverged rapidly following gene duplication, driven largely by the evolution of the tandem repeat content, generating large variation in protein length and β-aggregation potentials; 3) there is strong evidence for varying levels of selective constraint and moderate evidence for positive selection acting on the N-terminal effector domain during the expansion of the family in C. auris as our focal species. Based on these observations, we propose that expansion of adhesin gene families is a key preliminary step towards the emergence of fungal pathogenesis.

      Indeed, some version of this hypothesis has been proposed by several groups before us. We fully acknowledged this in our previous as well as the revised manuscript, by citing Butler et al. 2009 (PMID: 19465905), Gabaldón et al. 2013, 2016 (PMID: 24034898, 27493146). Our study built on these earlier efforts and extended them by addressing several limitations. First, we performed phylogenetic regression to test for the association between gene family size and the life history trait (pathogen or not) in order to properly account for the phylogenetic relatedness. This was not done in previous studies. Second, most earlier studies didn’t construct a family-wide gene tree to fully investigate the evolutionary history of the family. Gabaldón et al. 2013 did a phylogenetic analysis for the Epa family and a few others within the Nakaseomycetes, revealing highly dynamic expansions. In the present study, we expanded this effort by comprehensively identifying homologs within the Hil family in all yeasts and beyond. Third and perhaps the most important novelty in our study is our detailed analysis of sequence divergence and role of natural selection during the evolution of the family post duplication. This allowed us to present a complete picture of the family’s evolution, not just in its increase in copy number but also its diversification after the duplications, which is a key part of how gene duplications contribute to the evolution of novel traits. As such, we believe our study provides strong support for the above hypothesis.

      One key issue with a manuscript of this type is whether genome sequence data are accurate. The authors are not the first research group to take draft genome sequence data at face value and attempt to draw major conclusions from it. The accuracy of public genome data continues to improve, especially with the emergence of PacBio sequencing. Because the IFF/HYR genes contain long tracts of repeated sequences, genome assemblies from short-read data are frequently inaccurate. For example, is it reasonable to have confidence that the number of copies of a tandemly repeated sequence in a specific ORF is exactly 21 (an example taken from Table 2) when each repeat is 40+ amino acids long and highly conserved? Table S6 would benefit from inclusion of the type of sequence data used to construct each draft genome sequence. It is also reasonable to question whether the genome of the type strain is used as a template to construct the draft genomes of the other strains. If that was standard practice, conservation of the repeat copy number among strains might be an artefact. Conservation of repeat sequences to the degree shown is not a feature of the ALS family, a point of contrast between gene families that could be explored in the Discussion.

      We appreciate the reviewer’s comment and agree strongly that a key limitation in gene family evolution studies like ours is the quality of the genome assembly. In the original manuscript, we took several steps to ensure the completeness and accuracy of the Hil family homologs, primarily by basing our results on the high quality RefSeq collection of assemblies, and supplementing it with two fungi-specific databases. In the revised manuscript, we performed further quality analyses to assess and correct for inaccuracy in the BLASTP hits. Because RefSeq aims to provide a stable reference, it is often slow in replacing older assemblies with newer ones based on improved technologies. We thus compared the RefSeq hits for species in which a newer, long-read based assembly had become available. The results are documented in Text S1 and in summary, while we did find examples of missing homologs and inconsistent sequences, the problems were isolated to specific species and the inconsistency pertains only to the tandem repeat regions. Regarding the specific example of within-species variable number of tandem repeats (VNTR) in C. auris Hil1-Hil4, we are confident of both the copy number and the sequence variation for two reasons. First, all C. auris strain genomes analyzed in this study were assembled de novo rather than based on a reference genome, and all were long-read based (PacBio) (Table S4). Second, empirically, we found the VNTR identified in Hil1-Hil4 agree among strains within one of the four clades of C. auris while differing between clades (Table S5). Since assembly errors are not expected to produce clade-specific patterns, we believe this is strong evidence for the VNTR identified being real.

      We also appreciate the reviewer’s suggestion on discussing the conservation of the repeats as an interesting trait for a group of Hil proteins in comparison to the Als family. We now added a section in Discussion focusing on the special properties of this group of Hil proteins.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Due to the nature of my comments, this review will not be anonymous. I will include some of the data from my laboratory to further illustrate the point about the quality of draft genome sequences, especially for gene families that contain repeated sequences. My laboratory group has spent the past several years looking at the families of cell wall genes in these species and know that the C. tropicalis genome sequence used in the current analysis is highly flawed. There is even a manuscript from several years ago that documents problems in the assembly (doi: 10.1534/g3.115.017566). There is a new PacBio sequence available that has considerably improved data for this group of genes, but still is not perfect. We designed primers and amplified the various coding regions to verify whether the IFF/HYR were correct in the draft genome sequences. For C. tropicalis, we know that 7 of the genes listed in this paper are broken (i.e. prematurely terminated) giving a false impression of their construction. The current study did not verify any gene sequences, so broken/incomplete genes are a stumbling block for developing conclusions.

      We deeply appreciate the reviewer pointing out the flaws in the C. tropicalis genome. Using the PacBio sequence-based new assembly, we were able to confirm the reviewer’s comment on the sequence and annotation error in the RefSeq assembly for C. tropicalis. We listed the comparisons between the two assemblies in Table S8. Because the differences reside outside of the Hyphal_reg_CWP domain, they don’t impact our phylogenetic analyses, which are based on the effector domain alignment. To determine if this is a widespread issue affecting all genome assemblies based on older technologies, and in response to the reviewer’s criticism, we systematically checked the sequences of BLASTP hits based on the RefSeq assemblies against newer, long-read based ones when available. As detailed in Text S1 in the revised manuscript, the problems seen in C. tropicalis were not observed in four other species. While the sample size is small, we believe the issues with C. tropicalis are likely due to a combination of specific issues with the original assembly and special properties of the genome.

      Similarly, the recent work from Cormack's lab features a PacBio C. glabrata sequence (doi: 10.1111/mmi.14707). The paper details how the authors focused on accurate assembly of the types of genes studied here. Sequences from the current project should be compared to the PacBio assembly to determine if they provide the same results.

      We compared the sequences of the three C. glabrata Hil homologs identified in the RefSeq assembly (GCF_000002545.3) to the best BLAST hits in one of the new Cormack lab assemblies for (BG2 strain, GCA_014217725.1). Two of the three proteins showed identical sequences between the assemblies. One of them is longer in the new assembly than in the RefSeq (1861 vs 1771 aa, XP_447567.2, QHS67215.1). The main difference, however, was the number of hits recovered. Performing BLASTP searches in the new assembly recovered 13 hits vs 3 from the RefSeq assembly, of which 12 were in the subtelomeric region. For this reason, we used the new assembly as the basis for the Hil homologs in our subsequent analyses. To determine if we missed homologs in other genomes due to incomplete subtelomeric regions in the RefSeq assemblies, we repeated the BLASTP search in four other genomes (Text S1). In one of the four species, C. nivariensis, we recovered one more homolog than in the RefSeq. In all other three, we identified the same number (S. cerevisiae: 0, K. lactis: 1, C. albicans: 12), suggesting that the issues seen in C. glabrata is likely specific to this species and its RefSeq assembly.

      Another part of the study that deserves additional attention or perhaps altered presentation is the idea that the Iff/Hyr N-terminal domain binds ligands. The literature on the Iff/Hyr proteins is limited. In my opinion, though, the authors of this paper could more completely present the information that is known. The paper by Uppuluri et al. is cited (doi: 10.1371/journal.ppat.1007056), but I did not see any information about their data regarding interaction of C. albicans Hyr1 with bacterial proteins mentioned in the manuscript under review. It is formally possible that the N-terminal domain of Iff/Hyr proteins does not bind a ligand. The current manuscript includes a great deal of speculation on that point, suiting it better to a Hypothesis and Theory format rather than other types of publications.

      We appreciate the reviewer’s criticism and suggestion. We made two revisions based on the comments. First, we no longer refer to the Hyphal_reg_CWP domain as ligand-binding. Instead, we refer to it as the effector domain, following existing practices in the field (Lipke 2018, PMID: 29772751, de Groot et al. 2013, PMID: 23397570). Second, during the description of the predicted structure for the domain, we mentioned that it lacks an apparent binding pocket as suggested/identified in other β-solenoid proteins with carbohydrate binding abilities. Therefore, we suggest that the potential substrate and mechanism of binding by this domain remain to be determined with further experiments. We do, however, believe that there is strong evidence for the domain being involved in adhesion. A recent study (Reithofer et al. 2021) presented structural and phenotypic characterization of three Adhesin wall-like proteins (Awp1,2,3) in C. glabrata. In particular, experimental studies of CgAwp2, a Hil family protein, showed that its deletion resulted in the reversion of the hyperadhesive phenotype in one of the C. glabrata strains. Plastic was one of the substrates being evaluated, although, as the reviewer’s work pointed out, adhesion to plastics doesn’t indicate ligand binding, as it can be mediated by non-specific hydrophobic interactions (Hoyer and Cota 2016, PMID: 27014205). Nonetheless, the results presented in Reithofer et al. 2021 and other lines of evidence presented in the current work strongly supported adhesin functions of the Hil family.

      Table 1 attempts to offer evidence that the Iff/Hyr N-terminal domain has adhesive function but falls short of convincing the reader. One of the example structural templates is a sugar pyrophosphorylase that seems irrelevant to the current discussion. In the column called "Function", the word adhesin is found several times, but no detail is presented. The only entry that offers an example ligand indicates that the domain binds cellulose which is not likely relevant for mammalian pathogenesis, the main focus of the work. Other functions listed include self-association and cell aggregation--using the N-terminal domain. It is formally possible that Iff/Hyr proteins drive aggregation using the N-terminal domain and beta-aggregation sequences in the repeated region. The authors should develop these ideas further. Discussion of adhesive/aggregative function related to the ALS family can be found in Hoyer and Cota, 2016 (doi: 10.3389/fmicb.2016.00280).

      We appreciate the reviewer’s comments. In the revised manuscript, we removed Table 1, which was based on I-TASSER identified templates. Instead, we identified similar structures in the PDB50 database to the AlphaFold2 prediction for the Hyphal_reg_CWP domain in C. auris Hil1 using DALI (Table S3). We described the functional implications based on this list as follows:

      “We identified a number of bacterial adhesins with a highly similar β-helix fold but no α-crystallin domain (Table S3), e.g., Hmw1 from H. influenzae (PDB: 2ODL), Tāpirins from C. hydrothermalis (PDB: 6N2C), TibA from enterotoxigenic E. coli (PDB: 4Q1Q) and SRRP from L. reuteri (PDB: 5NY0). For comparison, the binding region of the Serine Rich Repeat Protein 100-23 (SRRP100-23) from L. reuteri was shown in Fig. 3F (Sequeira et al. 2018). Together, these results strongly suggest that the Hyphal_reg_CWP domain in the C. auris Hil family genes mediate adhesion.”

      One line of evidence that suggest the Hyphal_reg_CWP domain may have ligand-binding activity is from the L. reuteri SRRP-BR, which is one of the bacterial adhesins identified as having a highly similar β-helical structure (but missing the α-crystallin domain). In Sequeira et al. 2018 (PMID: 29507249), the authors showed via both in-vitro and in-vivo experiments that this domain “bound to host epithelial cells and DNA at neutral pH and recognized polygalacturonic acid (PGA), rhamnogalacturonan I, or chondroitin sulfate A at acidic pH”. However, the predicted structure for the Hyphal_reg_CWP domain in C. auris Hil1 and Hil7 lack a protruding, flexible loop in the β-helix, which was proposed to serve as a binding pocket for the ligand in SRRP-BR. We therefore commented in the text “Such a feature is not apparent in the predicted structure of the Hyphal_reg_CWP domain. Further studies are needed to elucidate the potential substrate for this domain and its mechanism of adhesion.”

      We also appreciate the reviewer’s suggestion to discuss the potential role of the Hil proteins in mediating adhesion vs cell aggregation. We now have a section in Discussion that focuses on the potential role of the β-aggregation sequences especially in the subset of Hil proteins led by C. auris Hil1-Hil4, which have an unusually large number of such sequences. We discuss the recent literature suggesting the potential of such features mediating cell-cell aggregation.

      The incredibly large number of figures that focus on the repeated sequences in the genes does not appear to include mention of the idea that these regions are frequently highly glycosylated. Knowing how much carbohydrate is added to these sequences in the mature protein would also have bearing on whether the beta-aggregation potential is realized. The Iff/Hyr proteins could stick to other things based on ligand binding (adhesion), hydrophobicity, aggregative activity, etc. Not much is really known about protein function so the conclusions are only speculative. The authors are largely accurate in presenting their conclusions as speculative, but the conclusions are not developed fully and always land on the idea that the N-terminal domain has adhesive function when that aspect clearly is not known.

      We appreciate the reviewer’s comment. We have performed N- and O-glycosylataion predictions for the Hil family proteins in C. auris as a focal example and presented the results in Figure 2 of the revised manuscript. Briefly, we found that all eight proteins are predicted to be heavily O-glycosylated (Fig. 2C). N-glycosylation is rare except in Hil5 and Hil6, in regions with a low Ser/Thr content (Fig. 2C). We also deemphasized the ligand-binding ability of the effector domain and its importance in assessing the adhesin function of the Hil family proteins. At the same time, we highlighted other mechanisms as the reviewer pointed out, such as aggregative activities, in our discussion on the potential importance of the large number of β-aggregation motifs.

      Another aspect of the analysis that is not mentioned is that several of the species discussed are diploid. What effect does ploidy have on the conclusions? Most draft genomes for diploid species are presented in a haploid display, so are not completely representative of the species. Additionally, some species such as C. parapsilosis are known to vary between strains in their composition of gene families, with varying numbers of loci in different isolates.

      We appreciate the reviewer raising this issue. The potential impact of having diploid genomes represented as haploids is twofold. First, if the genome sequencing was performed on a diploid cell sample with some highly polymorphic regions, that would present difficulties to the assembly and could result in poorly assembled sections. Second, either because of the first issue, or because the researchers used the haploid phase of the organism for sequencing, the representative haploid genome will not be “completely representative of the species” as the reviewer suggested. The second problem is not specific to diploids – even for haploids, any single or collection of genomes would represent just a slice of the genetic diversity in the species. We did two things to look into this. First, we analyzed multiple strains in C. auris to reveal both Hil family size variation and also sequence polymorphism, particularly in the tandem repeat region. We also, as part of the quality control, compared and searched assemblies for different strains of some species when available. We agree that characterizing multiple genomes in a species is important for fully revealing the gene pool diversity and could have important consequences on our understanding of the emergence of novel yeast pathogens.

      Regarding the first issue, we checked the original publications for two large-scale yeast genome sequencing projects that included 10 of the 32 species in the present study (Dujon et al. 2004, PMID: 15229592 and Butler et al. 2009, PMID: 19465905). In Dujon et al. 2004, the authors stated that haploid cells were used in cases where the species has both haploid and diploid phases. In Butler et al. 2009, the authors said in the Methods that “for highly polymorphic regions of diploid genomes, initial sequence assemblies were iteratively re-assembled in regions of high polymorphism to minimize read disagreement from the two haplotypes while maximizing coverage.”. Therefore, the potential issue of heterozygosity is likely minimal. In addition, many diploid yeasts have large regions in their genomes being homozygous, both as a result of clonal expansion and also due to loss of heterozygosity (LOH), as documented in C. albicans and other Candida species (e.g., PMID: 28080987). Nonetheless, we acknowledge that this issue is yet another challenge to having high-quality, complete genome assemblies. In the discussion, we fully acknowledge the limitation of our study by genome assemblies, and believe that ongoing improvement thanks to the development of long-read technologies will allow more in-depth studies, particularly in the subtelomeric regions and for repeat-rich sequences.

      The manuscript concludes that having more genes is better, that the gene family represents diversification that must be driven by its importance to pathogenesis, without recognizing that some species evolve toward lower pathogenesis. This concept could be explored in the Discussion. …My own experience makes me wonder if the authors found any examples of species that provide an exception to the idea that having more genes is better and positively associated with pathogenesis. The parallel between IFF/HYR and ALS genes is made many times in the manuscript. Spathaspora passalidarum, a species that is not pathogenic in humans, but clearly within the phylogenetic group examined here, has 29 loci with sequence similarity to ALS genes. How many IFF/HYR genes are in S. passalidarum?

      We appreciate the reviewer’s comment. We will address the two comments above together as they are related. First of all, S. passalidarum is now included in our extended BLAST search list and we identified a total of 3 homologs in this species. When compared with the related Candida/Lodderomyces clade, which includes C. albicans, the Hil family in this species is relatively small (3 vs. >10). More generally, we observe a significant correlation between the Hil family size and the species’ pathogenic potential (Figure 1B and the phylogenetic regression result in the text).

      Regarding the first comment, we did identify two species that had a large Hil family (>8 based on C. auris) and yet were not known to infect humans. One of them, M. bicuspidata, has 29 Hil homologs and is interestingly a parasite for freshwater animals, such as Daphnia. The other species, K. africana, has 10 homologs and its ecology is not well described in the literature. With respect to the relationship between adhesin family and pathogenicity, we would like to make two points. First, as mentioned above, we observed a strong correlation between the Hil family size and the pathogen status, after correcting for phylogenetic relatedness, suggesting that expansion of the Hil family is a shared trait among pathogenic species. This doesn’t rule out the possibility that some species may have an expanded adhesin family, such as the example the reviewer mentioned, for reasons other than infecting a human host. Second, a key point in our work is that expansion of the adhesin family is only the first step – the crucial contribution of gene duplications to adaptation is not just in the increase in copy number, but also in providing the raw materials for selection to generate novel phenotypes. On that front, we documented the rapid divergence of the central domains both between and within species, as well as signatures of relaxed selective constraint and positive selection acting on the effector domain following gene duplications in C. auris, both of which support the above theme.

      There are several current taxonomies for the species in this region of the tree. The source of the names used in this paper could be specific more completely.

      We appreciate the reviewer’s comment. We now gave the complete Latin names for all species in Figure 1 and only use abbreviated names, e.g., C. auris, after the first occurrence. For species with multiple names in the literature, we followed the species name and phylogenetic placement in Shen et al. 2018 (PMID: 30415838).

      The Results and Discussion sections are largely redundant. The tone of the paper is conversational, making it easy to read, but there seems little left to say in the Discussion that has not already been mentioned as the background for the various types of analyses. The authors should revise the paper to eliminate discussions of published literature from the Results and expand the Discussion to include some of the themes that have not been mentioned yet.

      We appreciate the reviewer’s comment. In the revised manuscript, we have moved discussion points from the Result to the Discussion section. We also overhauled the Discussion to focus on the implications based on, but not already covered, in the Result part, including the points the reviewer suggested, e.g., the implications of the structure on adhesion mechanism.

      Another point that the authors do not mention is documented recombination between IFF and ALS genes (doi: 10.3389/fmicb.2019.00781) and the effect of that process on evolution among these gene families.

      We appreciate the reviewer’s comment. We now mention this and related observations in Discussion as part of the discussion on the mutational mechanisms for the evolution of the family:

      “Diversification of adhesin repertoire within a strain can arise from a variety of molecular mechanisms. For example, chimeric proteins generated through recombination between Als family members or between an Als protein’s N terminal effector domain and an Hyr/Iff protein’s repeat region have been shown (Butler et al. 2009; Zhao et al. 2011; Oh et al. 2019). Some of the adhesins with highly diverged central domains may have arisen in this manner (Fig. S10).”

      My reading of the work by Xu et al. 2021 (doi: 10.1111/mmi.14707) does not match the direction of its presentation in the current paper. Oh et al., 2021 (doi: 10.3389/fcimb.2021.794529) discussed that point recently, providing another point for the Discussion in the current paper.

      We appreciate the reviewer’s comment and agree that our original reading of Xu et al. 2021 was incorrect. Instead of suggesting a higher mutation rates in the subtelomeric region, the authors instead suggested the evolution of the Epa family in the subtelomere was driven by Break-Induced Replication. We now update our discussion in the following way, also citing Oh et al. 2021

      “Finally, as reported by (Muñoz et al. 2021), we found that the Hil family genes are preferentially located near chromosomal ends in C. auris and also in other species examined (Fig 7), similar to previous findings for the Flo and Epa families (Teunissen and Steensma 1995; De Las Peñas et al. 2003; Xu et al. 2020; Xu et al. 2021) as well as the Als genes in certain species (Oh et al. 2021). This location bias of the Hil and other adhesin families is likely a key mechanism for their dynamic expansion and sequence evolution, either via ectopic recombination (Anderson et al. 2015) or by Break-Induced Replication (Bosco and Haber 1998; Sakofsky and Malkova 2017; Xu et al. 2021). Another potential consequence of the subtelomeric location of Hil family members is that the genes may be subject to epigenetic silencing, which can be derepressed in response to stress (Ai et al. 2002). Such epigenetic regulation of the adhesin genes was found to generate cell surface heterogeneity in S. cerevisiae (Halme et al. 2004) and leads to hyperadherent phenotypes in C. glabrata (Castaño et al. 2005).”

      I might have missed it, but I could not find what constitutes a BLAST-excluded sequence (Table S7). Additional explanation (or making the explanation easier to find) would help the reader.

      We apologize for the inadvertent mistake of leaving out Table S7. In the revised manuscript, we include all hits from species that are part of the 322 species phylogeny in Shen et al. 2018. Thus, we removed the original Table S7.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Ideally, validation of all sequences would provide a stronger foundation for the work. However, that request is not realistic in terms of time or resources.

      We agree with the reviewer and appreciate the understanding. In the revised manuscript, we performed additional analyses to evaluate the accuracy and correct the sequences of the BLASTP hits from RefSeq database by comparing them to long-read based assemblies when possible. Please see previous replies to reviewers’ comments and Text S1 for details.

      • Are the data and the methods presented in such a way that they can be reproduced?

      Yes, the data and methods are documented clearly and perhaps too thoroughly in many places. A considerable amount of confidence is placed in sequences that might not be accurate and tracking details down to the amino acid residue may not be reasonable in this context. A disclaimer might help--everyone probably already knows that genome sequences are not perfect but stating that the analysis is only as good as the genome sequence acknowledges that fact.

      We appreciate the reviewer’s comment. In the revised manuscript, we tried to strike a balance between providing enough methodological details for the readers to assess the conclusions and yet also keeping the flow of the paper. We also accepted the reviewer’s suggestion by adding a disclaimer in the Discussion:

      “we acknowledge the possibility of missing homologs in some species and having inaccurate sequences in the tandem-repeat region. We believe the expected improvements in genome assemblies due to advances in long-read sequencing technologies will be crucial for future studies of the adhesin gene family in yeasts.”

      • Are the experiments adequately replicated and statistical analysis adequate?

      The idea of replicates does not really apply to this analysis. I think that the species sampled are reasonable to represent the region of the phylogenetic tree on which the analysis is focused. The authors clearly documented their computational methods in an admirable way.

      We appreciate the reviewer’s comment.

      MINOR COMMENTS:

      Figure 1 has elements that would make a nice graphical summary, but most of it should not be part of the final manuscript. For example, Panel A is repeated in Figure 2. It is not clear what Panel C means until the reader gets to Figure 2. Panel D is unnecessary. The image in Panel B is a good graphic. Endothelial adhesion is not mentioned, though. It is also debatable whether the proteins bind directly to plastic or to the body fluids that coat the plastic.

      Based on this and another reviewer’s comments, we removed Figure 1 from the revised manuscript.

      Compared to Figure 1, the information in Figure 3 is inconsistent. The "central domain" in Panel A is not central to anything as drawn, located at the end of the protein. The figure should be revised to be consistent with the majority of the authors' results.

      We appreciate the reviewer’s suggestion. The terminologies used to describe the different parts of a typical yeast adhesin vary in the literature. In the Als family literature, central domain refers to the region after the N-terminal effector domain and before the C-terminal Ser/Thr-rich stalk domain. In the Hil family proteins, there is not a clear distinction between a “central” and a “stalk” region. In Boisramé et al. 2011 (PMID: 21841123), the authors referred to the region between the Hyphal_reg_CWP domain and the GPI-anchor as the central domain. We adopted that use. We realize that this can lead to confusion especially for Als researchers. In some other literature, e.g., Reithofer et al. 2021, this part of the protein is referred to as the B-region. But we couldn’t find wide use of that term. We decided to stay with “central domain” in this work and hope that by defining the term in Figure 2A, we would avoid any confusion within the scope of this work.

      Are the low-complexity repeats mentioned in the Figure 4 legend present anywhere else in the C. auris genome or elsewhere among the species used in this study? The answer to that question may also provide evolutionary clues.

      We did find one other putative GPI-anchored cell wall protein containing this ~44aa repeat unit, but with a different effector domain (GLEYA, PF10528). This protein (PIS58185.1 in C. auris B8441), appears to be a hybrid between the repeat region of C. auris Hil1 and an N-terminal effector domain of a different family. This result fits the theme of the reviewer’s work in C. albicans and C. tropicalis on the chimeric adhesins formed between the Als and Hyr/Iff families. Due to the scope of the current work, we omitted this finding from the main result.

      Figure S1 legend. How was the distance to C. glabrata measured to call it equal?

      The original Figure S1 was removed in the revised manuscript. A consistent set of criteria was employed in deciding which BLASTP hits to include as Hil family members.

      Figure S4 could be presented better. Both diagonals have the same information. One could be emptied or could alternatively present nucleotide identity.

      The original Figure S4 was removed in the revised manuscript.

      Italicize the species names in Panel C of Figure S8.

      The original Figure S8C is now Figure S9 and we systematically checked to make sure that species Latin names are italicized. Thanks for pointing this out.

      Lines 256-257: The paper selectively samples the Iff/Hyr family and does not examine the "entire" family. Please revise.

      We appreciate the reviewer’s comment. In the revised manuscript, we no longer selectively sample species. Instead, we only exclude three species that are not part of the 322-yeast species phylogeny in Shen et al. 2018 and Muñoz et al. 2018, namely Diutina rugosa, Kazachstania barnettii and Artibeus jamaicensis. Our extensive BLASTP searches also indicated that the family as defined in this work is specific to the budding yeast subphylum. We therefore believe it is accurate to describe the work as examining the entire Hil family.

      • Are prior studies referenced appropriately?

      I was disappointed to see that the paper does not reference my laboratory's work at all. When ALS genes are featured so strongly in a report, it seems reasonable to include something we have done over 30+ years. Our most-recent ALS paper (Oh et al., 2021 doi: 10.3389/fcimb.2021.794529) would be a reasonable source for defending the gene numbers used in Figure 2A. Other examples of our work that directly relate to concepts in this paper were mentioned above.

      We sincerely apologize for our negligence. We are new to the fungal adhesin field through an accidental finding, and despite our effort to digest the relevant literature, we did unfortunately overlook the extensive work done on the Als family, much of which came from the reviewer’s lab. We have carefully read the papers suggested by the reviewer as well as others, and now have better incorporated prior foundational and insightful work into our result and discussion sections (see previous replies to the reviewer’s comments).

      • Are the text and figures clear and accurate?

      Suggestions for improvement are incorporated into the comments above.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Please present Methods and Results in the past tense. I still make the same mistake when I try to get my ideas on the page but proofread one more time and ensure the verb tenses are accurate.

      We appreciate the reviewer’s comments and have edited the Methods and Results sections accordingly.

      SIGNIFICANCE

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The paper reads as if it is presenting preliminary data for a grant proposal. Perhaps Prof. He's lab wants to seek functional evidence for the role of the Iff/Hyr proteins. The current paper provides an exhaustive background for such a pursuit. As presented, there is little functional data for these proteins, genome sequences are not 100% accurate, but the trends noted are defendable.

      We appreciate the reviewer’s comments. We acknowledge that experimental studies will be needed to prove and further establish the functional importance of our findings. However, we believe our gene family evolutionary studies provided important novel insights and serve as an example for adhesin family evolution.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      The ideas presented here are similar to those pioneered in the Butler et al. Nature paper in 2009 (doi: 10.1038/nature08064). We now have the benefit of more genome sequences so the analysis can encompass more species. C. auris adds a newer focus on part of the phylogenetic tree that was not previously emphasized. The idea of "more is better" is very simplistic, though. Parallel work for the ALS family shows complexity in gene expression levels, suggesting that some adhesins are poised to make a large contribution while others are likely to have a scant presence on the cell surface. Those concepts are not really explored in the current paper, either. See Hoyer and Cota 2016 (doi: 10.3389/fmicb.2016.00280); Oh et al. (doi: 10.3389/fmicb.2020.594531).

      We appreciate the reviewer’s comments and have included a discussion about the potential diversity of the duplicated Hil family proteins, in terms of function and their regulation in the Discussion. Also see our response to the first comment of the reviewer regarding the novelty of our hypothesis and the significance of our findings.

      • State what audience might be interested in and influenced by the reported findings.

      Potential readers would come from the fields of fungal adhesion and pathogenesis, as well as evolutionary biology.

      We appreciate the reviewer’s comments.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I discovered and named the ALS gene family in C. albicans and have spent 30+ years characterizing it. Most recently, my lab has focused on providing an accurate gene census and validated gene sequences for the cell wall "adhesinome" in the pathogenic Candida species. Some families are expanded and some are not. Some proteins appear only in a few species and demonstrate key roles in host-fungus interactions. There are many nuances to interpretation of what these fungi are doing from the standpoint of cell-surface adhesins and we look forward to exploring these ideas across many genomes, using validated gene sequences. We have a tremendous dataset that might make good fuel for a collaboration with Prof. He, given his enthusiasm for this area of study, as well as his outstanding expertise and perspectives on evolutionary analyses.

      We sincerely thank the reviewer for the critical analysis of our manuscript and appreciate the many suggestions for improving the manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Smoak et al. provides considerable information gleaned from analysis of HYR/IFF genes in 19 fungal species. A specific focus is on Candida auris. The main conclusion is that this gene family repeatedly expanded in divergent pathogenic Candida lineages including C. auris. Analyses focus on the sequences encoding the protein's N-terminal domain and tracts of repeated sequences that follow. The authors conclude with the hypothesis that expansion and diversification of adhesin gene families underpin fungal pathogen evolution and that the variation among adhesin-encoding genes affects adhesion and virulence within and between species. The paper is easy to read, includes clear and attractive graphics, as well as a considerable number of supplementary data files that provide thorough documentation of the sources of information and their analysis.

      Major comments:

      • Are the key conclusions convincing?

      Overall, the authors' conclusions are supported by the information they present. However, the overall conclusion is stated as a hypothesis and that hypothesis is not particularly novel. The idea that expansion of gene families associated with pathogenesis occurs in the pathogenic species dates back at least to Butler et al. (2009; doi: 10.1038/nature08064) who first presented the genome sequences for many of the species considered here.

      One key issue with a manuscript of this type is whether genome sequence data are accurate. The authors are not the first research group to take draft genome sequence data at face value and attempt to draw major conclusions from it. The accuracy of public genome data continues to improve, especially with the emergence of PacBio sequencing. Because the IFF/HYR genes contain long tracts of repeated sequences, genome assemblies from short-read data are frequently inaccurate. For example, is it reasonable to have confidence that the number of copies of a tandemly repeated sequence in a specific ORF is exactly 21 (an example taken from Table 2) when each repeat is 40+ amino acids long and highly conserved? Table S6 would benefit from inclusion of the type of sequence data used to construct each draft genome sequence. It is also reasonable to question whether the genome of the type strain is used as a template to construct the draft genomes of the other strains. If that was standard practice, conservation of the repeat copy number among strains might be an artefact. Conservation of repeat sequences to the degree shown is not a feature of the ALS family, a point of contrast between gene families that could be explored in the Discussion. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Due to the nature of my comments, this review will not be anonymous. I will include some of the data from my laboratory to further illustrate the point about the quality of draft genome sequences, especially for gene families that contain repeated sequences. My laboratory group has spent the past several years looking at the families of cell wall genes in these species and know that the C. tropicalis genome sequence used in the current analysis is highly flawed. There is even a manuscript from several years ago that documents problems in the assembly (doi: 10.1534/g3.115.017566). There is a new PacBio sequence available that has considerably improved data for this group of genes, but still is not perfect. We designed primers and amplified the various coding regions to verify whether the IFF/HYR were correct in the draft genome sequences. For C. tropicalis, we know that 7 of the genes listed in this paper are broken (i.e. prematurely terminated) giving a false impression of their construction. The current study did not verify any gene sequences, so broken/incomplete genes are a stumbling block for developing conclusions.

      Similarly, the recent work from Cormack's lab features a PacBio C. glabrata sequence (doi: 10.1111/mmi.14707). The paper details how the authors focused on accurate assembly of the types of genes studied here. Sequences from the current project should be compared to the PacBio assembly to determine if they provide the same results.

      Another part of the study that deserves additional attention or perhaps altered presentation is the idea that the Iff/Hyr N-terminal domain binds ligands. The literature on the Iff/Hyr proteins is limited. In my opinion, though, the authors of this paper could more completely present the information that is known. The paper by Uppuluri et al. is cited (doi: 10.1371/journal.ppat.1007056), but I did not see any information about their data regarding interaction of C. albicans Hyr1 with bacterial proteins mentioned in the manuscript under review. It is formally possible that the N-terminal domain of Iff/Hyr proteins does not bind a ligand. The current manuscript includes a great deal of speculation on that point, suiting it better to a Hypothesis and Theory format rather than other types of publications.

      Table 1 attempts to offer evidence that the Iff/Hyr N-terminal domain has adhesive function but falls short of convincing the reader. One of the example structural templates is a sugar pyrophosphorylase that seems irrelevant to the current discussion. In the column called "Function", the word adhesin is found several times, but no detail is presented. The only entry that offers an example ligand indicates that the domain binds cellulose which is not likely relevant for mammalian pathogenesis, the main focus of the work. Other functions listed include self-association and cell aggregation--using the N-terminal domain. It is formally possible that Iff/Hyr proteins drive aggregation using the N-terminal domain and beta-aggregation sequences in the repeated region. The authors should develop these ideas further. Discussion of adhesive/aggregative function related to the ALS family can be found in Hoyer and Cota, 2016 (doi: 10.3389/fmicb.2016.00280).

      The incredibly large number of figures that focus on the repeated sequences in the genes does not appear to include mention of the idea that these regions are frequently highly glycosylated. Knowing how much carbohydrate is added to these sequences in the mature protein would also have bearing on whether the beta-aggregation potential is realized. The Iff/Hyr proteins could stick to other things based on ligand binding (adhesion), hydrophobicity, aggregative activity, etc. Not much is really known about protein function so the conclusions are only speculative. The authors are largely accurate in presenting their conclusions as speculative, but the conclusions are not developed fully and always land on the idea that the N-terminal domain has adhesive function when that aspect clearly is not known.

      Another aspect of the analysis that is not mentioned is that several of the species discussed are diploid. What effect does ploidy have on the conclusions? Most draft genomes for diploid species are presented in a haploid display, so are not completely representative of the species. Additionally, some species such as C. parapsilosis are known to vary between strains in their composition of gene families, with varying numbers of loci in different isolates.

      The manuscript concludes that having more genes is better, that the gene family represents diversification that must be driven by its importance to pathogenesis, without recognizing that some species evolve toward lower pathogenesis. This concept could be explored in the Discussion.

      The Results and Discussion sections are largely redundant. The tone of the paper is conversational, making it easy to read, but there seems little left to say in the Discussion that has not already been mentioned as the background for the various types of analyses. The authors should revise the paper to eliminate discussions of published literature from the Results and expand the Discussion to include some of the themes that have not been mentioned yet.

      My own experience makes me wonder if the authors found any examples of species that provide and exception to the idea that having more genes is better and positively associated with pathogenesis. The parallel between IFF/HYR and ALS genes is made many times in the manuscript. Spathaspora passalidarum, a species that is not pathogenic in humans, but clearly within the phylogenetic group examined here, has 29 loci with sequence similarity to ALS genes. How many IFF/HYR genes are in S. passalidarum?

      There are several current taxonomies for the species in this region of the tree. The source of the names used in this paper could be specific more completely.

      Another point that the authors do not mention is documented recombination between IFF and ALS genes (doi: 10.3389/fmicb.2019.00781) and the effect of that process on evolution among these gene families.

      My reading of the work by Xu et al. 2021 (doi: 10.1111/mmi.14707) does not match the direction of its presentation in the current paper. Oh et al., 2021 (doi: 10.3389/fcimb.2021.794529) discussed that point recently, providing another point for the Discussion in the current paper.

      I might have missed it, but I could not find what constitutes a BLAST-excluded sequence (Table S7). Additional explanation (or making the explanation easier to find) would help the reader. - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Ideally, validation of all sequences would provide a stronger foundation for the work. However, that request is not realistic in terms of time or resources. - Are the data and the methods presented in such a way that they can be reproduced?

      Yes, the data and methods are documented clearly and perhaps too thoroughly in many places. A considerable amount of confidence is placed in sequences that might not be accurate and tracking details down to the amino acid residue may not be reasonable in this context. A disclaimer might help--everyone probably already knows that genome sequences are not perfect but stating that the analysis is only as good as the genome sequence acknowledges that fact. - Are the experiments adequately replicated and statistical analysis adequate?

      The idea of replicates does not really apply to this analysis. I think that the species sampled are reasonable to represent the region of the phylogenetic tree on which the analysis is focused. The authors clearly documented their computational methods in an admirable way.

      Minor comments:

      • Specific experimental issues that are easily addressable.

      Figure 1 has elements that would make a nice graphical summary, but most of it should not be part of the final manuscript. For example, Panel A is repeated in Figure 2. It is not clear what Panel C means until the reader gets to Figure 2. Panel D is unnecessary. The image in Panel B is a good graphic. Endothelial adhesion is not mentioned, though. It is also debatable whether the proteins bind directly to plastic or to the body fluids that coat the plastic.

      Compared to Figure 1, the information in Figure 3 is inconsistent. The "central domain" in Panel A is not central to anything as drawn, located at the end of the protein. The figure should be revised to be consistent with the majority of the authors' results. Structures in Panels C to E would benefit from the "through the spiral" view that is featured in Figure S9. What experimental technique was used to solve the structure in Panel E? Adding that information to the legend would be helpful to the reader. Also, the secondary structure colors seem to be reversed between the legend and domain structure. Adding the coordinates of the domains shown would help the reader to understand their location in the mature protein.

      Are the low-complexity repeats mentioned in the Figure 4 legend present anywhere else in the C. auris genome or elsewhere among the species used in this study? The answer to that question may also provide evolutionary clues.

      Figure S1 legend. How was the distance to C. glabrata measured to call it equal?

      Figure S4 could be presented better. Both diagonals have the same information. One could be emptied or could alternatively present nucleotide identity.

      Italicize the species names in Panel C of Figure S8.

      Lines 256-257: The paper selectively samples the Iff/Hyr family and does not examine the "entire" family. Please revise. - Are prior studies referenced appropriately?

      I was disappointed to see that the paper does not reference my laboratory's work at all. When ALS genes are featured so strongly in a report, it seems reasonable to include something we have done over 30+ years. Our most-recent ALS paper (Oh et al., 2021 doi: 10.3389/fcimb.2021.794529) would be a reasonable source for defending the gene numbers used in Figure 2A. Other examples of our work that directly relate to concepts in this paper were mentioned above. - Are the text and figures clear and accurate?

      Suggestions for improvement are incorporated into the comments above. - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Please present Methods and Results in the past tense. I still make the same mistake when I try to get my ideas on the page but proofread one more time and ensure the verb tenses are accurate.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The paper reads as if it is presenting preliminary data for a grant proposal. Perhaps Prof. He's lab wants to seek functional evidence for the role of the Iff/Hyr proteins. The current paper provides an exhaustive background for such a pursuit. As presented, there is little functional data for these proteins, genome sequences are not 100% accurate, but the trends noted are defendable. - Place the work in the context of the existing literature (provide references, where appropriate).

      The ideas presented here are similar to those pioneered in the Butler et al. Nature paper in 2009 (doi: 10.1038/nature08064). We now have the benefit of more genome sequences so the analysis can encompass more species. C. auris adds a newer focus on part of the phylogenetic tree that was not previously emphasized. The idea of "more is better" is very simplistic, though. Parallel work for the ALS family shows complexity in gene expression levels, suggesting that some adhesins are poised to make a large contribution while others are likely to have a scant presence on the cell surface. Those concepts are not really explored in the current paper, either. See Hoyer and Cota 2016 (doi: 10.3389/fmicb.2016.00280); Oh et al. (doi: 10.3389/fmicb.2020.594531). - State what audience might be interested in and influenced by the reported findings.

      Potential readers would come from the fields of fungal adhesion and pathogenesis, as well as evolutionary biology. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I discovered and named the ALS gene family in C. albicans and have spent 30+ years characterizing it. Most recently, my lab has focused on providing an accurate gene census and validated gene sequences for the cell wall "adhesinome" in the pathogenic Candida species. Some families are expanded and some are not. Some proteins appear only in a few species and demonstrate key roles in host-fungus interactions. There are many nuances to interpretation of what these fungi are doing from the standpoint of cell-surface adhesins and we look forward to exploring these ideas across many genomes, using validated gene sequences. We have a tremendous dataset that might make good fuel for a collaboration with Prof. He, given his enthusiasm for this area of study, as well as his outstanding expertise and perspectives on evolutionary analyses.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Gene duplication and divergence of adhesin proteins are hypothesized to be linked with the emergence of pathogenic yeasts during evolution; however, evidence supporting this hypothesis is limited. Smoak et al. study the evolutionary history of Hil genes and show that expansion of this gene family is restricted to C. auris and other pathogenic yeasts. They identified eight paralogous Hil proteins in C. auris. All these proteins share characteristic domains of adhesin, and the structural prediction supports that their tertiary structures are adhesin-like. Evolutionary analysis of protein domains finds weak evidence of positive selection in the ligand-binding domain, and the central domain showed rapid changes in repeat copy number. However, performed tests cannot unambiguously distinguish between positive selection and relaxed selection of paralogs after gene duplication. Some alternative tests are suggested that may be able to provide more unambiguous evidence. Together with these additional tests, the detailed phylogenetic analyses of Hil genes in C. auris might be able to better support the hypothesis that the expansion and diversification of adhesin proteins could contribute to the evolution of pathogenicity in yeasts.

      Major Comments

      The authors present extensive analyses on the evolution of Hil genes in C. auris. There is significant merit in these analyses. However, the analyses conducted so far are incomplete, lacking proper consideration of other confounding factors. Detailed explanations of our major comments are listed below.

      1. First, the authors restricted genes in the Hil family to those only containing the Hyphal_reg_CWP domain. Yet, previous work included genes containing the ligand-binding domain or the repeat domain as Hil genes. More justification is needed whether the author's choice represents the natural evolutionary history of Hil genes appropriately. For instance, are the genes only containing the ligand-binding domain monophyletic or polyphyletic? We recommend including the phylogeny of all the Hil candidate genes, to discern whether evolutionary histories of the repeat domain and ligand-binding domain are congruent. Authors can use this phylogeny as justification to focus only on the ligand-binding domain containing genes.
      2. In the analysis of positive selection, the authors do not adequately control for the effect of recombination on the evolutionary histories of protein sequences, especially given that Hil genes are rich in repetitive sequences. To account for recombination, GARD, an algorithm detecting recombination, should be performed to detect any recombination breakpoints within a protein domain. If recombination did occur within a protein domain, the authors should treat the unrecombined part as a single unit and use the phylogenetic information of this part to proceed with PAML analysis, instead of using the phylogeny of the entire protein domain. The authors should consider doing GARD analysis for the ligand-binding and repeat domains. For the repeat domain, low BS values in Fig. 5C indicate recombination between repeat units. Thus, the authors should analyze each repeat unit with GARD and re-analyze dN/dS.
      3. The authors concluded positive selection in the ligand-binding domain based on the branch-wise model of PAML. Yet, w values were not higher than one, and it's unclear whether the difference in selective pressures the authors claimed here is biologically significant. Overall, what the authors present so far seems to be weak evidence of positive selection but is much more consistent with variation in the degree of purifying selection or evolutionary constraint. Using the site-wise model (m7 vs. m8) in PAML would allow the authors to detect which residues of the ligand-binding domain underwent recurrent positive selection. Combining the evolutionary information of protein residues and the predicted 3D structure will provide molecular insights into the biological impact of rapidly evolving residues. This would be a significant addition and raise the significance of the study, besides providing potentially stronger evidence of positive selection.

      Minor Comments

      1. In Fig 1c, the figure legend should include more specific details: which adhesin proteins are shown here? Please specify species names on the species tree
      2. In Fig 3E, secondary structures are labeled with the wrong colors. Sheet: purple, helix: yellow
      3. What's the ligand-binding activity of the b-solenoid fold? How structurally similar are C. auris PF 11765 domains compared to C. glabrata Awp domains? This information will support the role of adhesin for the ligand-binding domain of Hil genes.
      4. In lines 248-249, the authors should also consider the influence of evolutionary history. For instance, repeats within the same Hil protein appeared later in evolution, compared to Hil gene duplication, and therefore these repeats experience less time for sequence divergence.
      5. Although the bioinformatic evidence of C. auris Hil genes acting as adhesins is strong, it is still worthwhile to discuss the experiments of confirming the function of adhesins.

      Significance

      Overall, this study is interesting to investigate the evolutionary history of a crucial virulent gene in C. auris. Such evolutionary understanding will help us identify critical molecular changes associated with the pathogenicity of an organism during evolution, providing insights into the emergence of pathogens and novel strategies to cure fungal infections. The research question is important; however, the current analyses on the positive selection are incomplete, so the conclusion is modest so far. We recommend that the authors re-do the PAML analysis with the above considerations. This work will bring more significance to the mycology field if the functional impact of rapid evolution in protein domains can be supported or inferred.

      This manuscript is well-written, and the authors also did a great job specifying all the necessary details in the M&M.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Smoak et al., provides an analysis of the Hyr/Iff-like (Hil) genes in Candida species with a strong focus on C. auris. The authors demonstrate a repeated expansion of these genes in unique lineages of fungal species, many of which are associated with stronger clinical disease. There is evidence of selection operating on the gene family in the primary domain used for identification. These genes include a repeat just downstream of that core domain that changes frequently in copy number and composition. The location of these genes tends to cluster at chromosome ends, which may explain some aspects of their expansion. The study is entirely in silico in nature and does not include experimental data.

      Major points

      Altogether, many of the general findings could be convincing but there are some aspects of the analysis that need further explanation to ensure they were performed correctly.

      To start, a single Hil protein from C. auris was used as bait in the query to find all Hil proteins in yeast pathogens. Would you get the same outcome if you started with a different Hil protein? What is the basis for using Hil1 as the starting point? It also doesn't make sense to me to remove species just because there are already related species in the list. This may exclude certain evolutionary trends. Furthermore, it would be helpful to know how using domain presence and the conservation of position changes the abundance of the gene family across species? (beginning of results).

      A major challenge in the analysis like this one is in dealing with repetitive sequences present in amplified gene families.

      • For example, testing modes of selection on non-conserved sites is fraught. It's not clear if all sites used for these tests are positionally conserved and this should be clarified. Alignments at repeat edges will need to maintain this conservation and relatively good alignments as stated in lines 241-242 are concerning that this includes sequence that does not retain this structure necessary for making predictions of selection.
      • It's also unclear to me why Figure S12 is here. The parameters for this analysis should be tested ahead of building models so only one set of parameters should be necessary to run the test. The evolutionary tests within single genes and across strains is really nice!
      • A major challenge for expanded gene families is rooting based on the inability to identify a strong similarity match for the full length sequence. The full alignment mentioned would certainly include significant gaps. If those gaps are removed and conserved sites only are used, does it produce the same tree? Inclusion of unalignable sequences would be expected to significantly alter the outcomes of those analysis and may produce some spurious relationships in reconciling with the species trees.
      • Whether or not there are similar issues in the alignment of PF11765 need to be addressed as well. There's nothing in the methods that clarifies site selection. Figure 1A: the placement of evolved pathogenesis is a little arbitrary. It's just as feasible that a single event increased pathogenesis in the LCA of C. albicans and C. parapsilosis that was subsequently lost in L. elongisporus. These should be justified or I'd suggest removing. The assignment of Candida species here also seems incomplete. The Butler paper notes both D. hansineii and C. lusitaniae as Candida species whereas they are excluded here. It is tricky to include scaffolds in analysis of chromosomal location of the HIL genes. The break in the scaffold may be due to the assc repeats of these proteins alone or other, nearby repeats. Any statistics would be best done to include only known chromosomes or those that are strongly inferred by Munoz, 2021. This will change the display of Figure 7, but is unlikely to change the take home message.

      Minor points

      Line 18: "spp." Should be "spps."

      Line 41: I might rephrase this as "how pathogenesis arose in yeast..."

      I might use a yeast-centric example around line 40 for duplication and divergence. This could include genes for metabolism of different carbon sources in S. cerevisiae.

      The Butler paper referenced on line 51 compared seven Candida species and 9 Saccharomyces species The autors state no other evolutionary analysis of adhesins has been performed but do not acknowledge this study: https://academic.oup.com/mbe/article/28/11/3127/1047032

      The first paragraph of the Results could be condensed

      How was the species tree in Figure 1A obtained?

      Figure 2: In panel A, "DH" and "SS" are not defined. I'd be careful with use of "non-albicans Candida" in Figure 2B. This usually includes C. tropicalis and C. dubliniensis and may confuse the reader. How was the binding domain defined to extract those sequences are produce a phylogeny? In building a ML model, how were parameters chosen?

      Figure 3C/D could be just one panel.

      Can you relate more the fungal hit to the Hil proteins conveyed in lines 152-154?

      Line 168: Should read "Hence, ..." Label proteins along the top of Figure 4 too.

      Figure 5: for tests of selection, were sites conserved across the group? What does the black number at each node indicate? Dn and Ds are given as decimals. This is based on what attribute? For panel B, it is unclear what each tip denotes i.e., Hil1_tr6. Hil1 is the gene but what is "tr6"?

      It's unclear why comparison of the PF11765 domain includes the MRD proteins when those aren't included in the comparison to the repeats alone. Could that skew the comparison due to unequal sample numbers or changed variation frequencies in MDR relative to the other two groups?

      Table 2 doesn't add much. This section could probably be reduced to a few sentences since it's highly speculative (intraspecies variation).

      Table 3 is not needed.

      Figure 6 - color coding in 6A needs to be explained. I'm assuming this is a taxonomical coding.

      Figure 1B is unnecessary. A Model of the protein indicating domains is sufficient here. Figure 1C needs labels for all termini, not just the pathogenic red branches. The figure doesn't provide clear association between adhesin families and the associated species. This could be omitted, especially since Flo is often associated with Saccharomyces species. Figure 1D is unnecessary.

      Significance

      The work here is sorely needed in expanded gene families and in fungi specifically. No analysis at this level has been performed, to the best of my knowledge, in any fungal associated gene family and certainly not in relationship to pathogenic potential. The authors do a good job in citing the foundational literature upon which their study builds in most cases (one exception is noted above). It would be of general interest to those interested in the evolution of virulence, but the analysis is tricky. This is the biggest drawback I currently have as some of the information to assess the results is missing. Expertise: gene families, evolution dynamics, human fungal pathogens

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank the Reviewers for their valuable comments and constructive suggestions concerning our manuscript entitled " Drosophila pVALIUM10 TRiP RNAi lines cause undesired silencing of Gateway-based transgenes" (RC-2022-01629).

      Please find below our responses to the Reviewers' questions and comments. We have revised the Manuscript following the Reviewers' suggestions. The changes in the Manuscript are indicated in blue.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): ____ This manuscript by Uhlirova and colleagues identified an unwanted off-target effect in the pVALIUM10 TRiP RNAi lines that are commonly used in the fly community. The pVALIUM10 lines use long double-stranded hairpins and are useful vectors for somatic gene knock-down, hence they are widely used.

      Here the authors find that any pVALIUM10 TRiP RNAi line can create the silencing of any transgenes that were cloned with the commonly used Gateway system. this is caused by targeting attB1 and attB2 sequences, which are also present in other Drosophila stocks including the transgenic flyORF collection. Hence, this is an important and useful information for the fly community that should be published quickly. All experiments are well documented and well controlled. I only have a few minor comments.

      1. I recommend to mention the number of 1800 pVALIUM10 lines in Bloomington in the abstract rather than 11% to make clear that this is an important number of lines. (1800 of 13,698 lines in Bloomiongton are 13 and not 11 per cent?)

      We now include the absolute number of pVALIUM10 lines in the manuscript abstract. The percentages have been corrected. Furthermore, we updated/corrected the total number of RNAi lines available from various stock centers in the Discussion, L153-L156.

      The status on 23.10.2022

      VDRC - 23,411 in total (12,934 GD lines; 9,674 KK lines; 803 shRNA lines)

      Bloomington - 13,410 TRiP lines based on pVALIUM vectors (13,674 in total, including 264 non-pVALIUM, and 48 non-fly genes targeting lines)

      NIG - 12,365 in total (5,676 TRiP lines; 7,923 NIG RNAi lines)

      The authors may consider to call the 'unspecific' silencing effect an 'off-target' effect compared to intended 'on-target'. Such a nomenclature would be more consensus.

      We changed the wording in the manuscript as suggested by the reviewer.

      Ideally, all the imaging results in Figure 2 and 3 would be quantified. The simple 'V10' label in the Figure 3L and 3M is not the most intuitive, at least it took me a while to figure out what the authors compare.

      The labeling in the charts has been changed. We now provide quantifications for the data shown in Figure 2 and 3.

      Does the silencing also affect attR sequences? These are present after cassette exchange in many transgenes, most of the time not in the mRNA though, so it might not be so relevant.

      A 22 nucleotide stretch of the attB2 site indeed shows a 100% match to the attL2 site. See the example alignment below (availbale in word/PDF version of the Letter). While we did not assess this possibility experimentally, attL sites would likely be susceptible to the same undesirable off-target silencing effects if present in the nascent or mature transcript.

      Reviewer #1 (Significance (Required)): This is an important and useful information for the fly community that should be published quickly.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): ____ Stankovic, Csordas, and Uhlirova show that a specific subset of the TRiP RNAi lines available, namely the pVALIUM10 subset, can cause a knockdown of certain co-expressed transgenes that contain attB1 and attB2 sites. The authors demonstrate that while pVALIUM20 or Vienna KK lines for BuGZ or myc RNAi do not affect RNase H1:GFP expression, pVALIUM10 RNAi lines against BuGZ or myc significantly decrease expression of the RNAseH1:GFP transgene. The authors propose that, due to how these RNAi lines were constructed, the siRNA products could be targeting to attB1 and attB2 sites in transgenes that were made using similar methodology. To support this idea, they ubiquitously express mCherry transgenes encoding mRNAs either containing or lacking attB sites. They find that the knockdown of mCherry seen with several different pVALIUM10 RNAi lines is observed with the reporter mRNA containing attB sites, but is suppressed when the attB sites are removed from mCherry mRNA. They also find that the pVALIUM10 RNAi lines reduce the expression of the FlyORF transgene SmD3:HA.

      The paper is very clearly written and the data presented is convincing.

      Minor suggestions:

      1. Figure 3 L+M The labels for the ubi-mcherry and ubiΔattb-mcherry are switched in these graphs (i.e. ubiΔattb-mcherry should be the one with a higher intensity in the pouch compared to the notum).

      Figure 3M the labels don't match the RNAi lines used in H-K.

      We corrected the labelling in the charts.

      Figure 2 and 3. For the images of the transgenes, it seems as if the BuGZ RNAi line has a more drastic effect on RNaseH1 than mCherry, and vice versa for the myc RNAi lines. Did the authors notice a pattern with the decreased expression. Do some of the RNAi lines have a more consistent/severe impact, or might different transgenes be impacted to different extents?

      Throughout the study and multiple experimental trials, we did not observe that the BuGZRNAi and mycRNAi silencing efficiency would depend on whether the monitored reporter was RNase H1::GFP or mCherry. What has been reproducible is the differential impact of the three tested mycRNAi lines on ubi-RNaseH1::GFP transgene. While pVALIUM10-based mycRNAi[TRiP.JF01761] reduces RNaseH1::GFP signal Valium20 mycRNAi[TRiP.HMS01538] enhances it and GD mycRNAi[GD2948] has no effect, although the number of replicates for the latter is lower compared to the other tested lines. Why Valium20 mycRNAi[TRiP.HMS01538] increases RNaseH1::GFP signal remains unclear for now.

      We would like to refrain from directly quantitatively comparing the effects of phenotypically different RNAi lines on differently tagged mRNAs/proteins. As the RNAseH1::GFP fusion protein is nuclear while the mCherry is cytoplasmic, their distinct subcellular localization and/or turnover rate may give a different overall impression on the change in fluorescence intensity (Boisvert et al, 2012; Mathieson et al, 2018). Another confounding factor is the described roles of Drosophila Myc in regulating transcription, translation, and cell growth (Gallant, 2007).

      Line 150 unnecessary comma after Both Line 131 knockdown should be knocked down Line 133 should be "using an additional" Figure legend 1 wing disc should be at least written out when the abbreviation (WD) is first used.

      We thank the reviewer for pointing these out, the relevant corrections were performed.

      Reviewer #2 (Significance (Required)):

      Overall, this manuscript is an informative reminder that RNAi lines can have weaknesses that have not yet been considered, and we appreciate the authors work to inform the fly community about this specific issue. These insights are crucial for fly labs to consider when planning experiments that will use the pVALIUM10 RNAi lines in combination with other transgenesis modalities. The manuscript also provides a cautionary note for the usage of similar resources in other model organisms.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): Summary: In their manuscript "Drosophila pVALIUM10 TRiP RNAi lines cause undesired silencing of Gateway-base transgenes", Stankovic et al. describe off-target silencing of transgenes expressed from Gateway systems when expressed in transgenic RNAi drosophila lines from the VALIUM10 collection. Using fluorescence microscopy and immunostaining, the authors show that this unintended silencing is specific to VALIUM20 lines and is not observed with VALIUM20, KK or GD lines that also allow gene-specific RNAi silencing. This pleiotropic silencing effect was observed in 10 different VALIUM20 lines and affected Gateway-based transgene expressed from an ubiquitous promoter (poly-ubiquitin, ubi) or from Gal4/UAS systems. Finally, the authors identify the molecular basis of VALIUM20 pleiotropic silencing on Gateway transgenes as being due to the presence of short sequences used for PhiC31-based recombination in the Gateway and the VALIUM systems, and could lead to the production of siRNAs against PhiC31 recombination sites in VALIUM10 lines. Using Gateway transgenes lacking the recombination sites (attB1 and attB2), the authors could abrogate silencing of the transgene in VALIUM10 lines, confirming the recombination as shared targets between the Gateway and the VALIUM systems.

      Major comments: - The study is well designed and the key conclusions are convincing. - However, the authors provide only fluorescence microscopy data to show decreased transgene expression. To confirm pleiotropic RNAi effect on Gateway transgenes in VALIUM10, the authors should assess silencing with another technique. For instance, expression levels of proteins from Gateway transgenes could be measured by Western blot (e.g.: by assessing protein levels of GFP or other tags present in the Gateway transgenes).

      In the manuscript, we present microscopy data as this is the typical use case for fluorescent reporters. The strength of the microscopy, in contrast to Western Blot or RT-qPCR approach, is that it allows us to directly compare the impact of RNAi silencing on cells that express the dsRNA transgene (cell-autonomous) to surrounding neighbor cells. The fluorescent imaging of WDs where all cells express the reporter construct, but only a subset of cells trigger RNAi-mediated silencing, provides spatial resolution and means for normalization while minimizing artifacts that can arise during tissue processing for WB and RT-qPCR. We provide data on GFP and HA-tagged transgenes, respectively, and untagged mCherry expressed from Gateway vectors under ubiquitin or UAS regulatory sequences with the explicit reason to show that the silencing effect is independent of the type of the protein tag or the expression regulator sequence.

      In addition, the claim on line 141,"These results strongly indicate that the dsRNA hairpin produced from pVALIUM10 RNAi vectors generates attB1- and attB2-siRNAs" , should be modified. The authors only present fluorescence microscopy data to show decreased transgene expression and do not actually provide data on siRNA expression in the pVALUM20 lines. Therefore, with the current data, the authors should only say that their results suggest that the dsRNA hairpin produced from pVALIUM10 RNAi vectors generates attB1- and attB2-siRNAs.

      In order to substantiate their claim about pleiotropic RNAi effects from VALIUM lines on Gateway transgenes due to the production of attB1- and attB2 -siRNAs, the authors should perform an experiment to show attB1- and attB2 -siRNAs production in VALIUM10 lines and not in VALIUM20, KK or GD lines. Deep-sequencing analysis of siRNA (i.e.: miRNA-seq) from tissue expressing the corresponding RNAi transgenes would be an excellent approach to assess siRNA production in multiple samples at once. Alternatively, the authors could search published miRNA-seq datasets from VALIUM10 and other RNAi lines to assess the presence of attB1- and attB2 -siRNAs only in VALIUM10 lines. This would be free and require only a few days of data mining and analysis, if such datasets exist already. Another cheaper and faster approach (if lacking easy access to sequencing platform or bioinformatics capability) would be to perform small RNA northern blots analysis from fly tissues expressing VALIUM10 vs VALIUM20 (or KK or GD lines) and should take only a few days to do as described in doi: 10.1038/nprot.2008.67.

      If such experiments or analyses cannot be performed, then the authors can only conclude that their data suggest that the unintended silencing of Gateway transgenes in VALIUM10 is likely due to the production attB1- and attB2 -siRNAs production.

      We thank the reviewer for the valuable suggestions on experimental approcahes to identify the exact interfering RNAs produced by the VALIUM10-based RNAi constructs, which can be useful for controlling the specificity of knockdown of transgenes in studies using the resources mentioned in this report.

      We believe the fluorescence micrographs and quantifications demonstrate the off-target silencing effects of pVALIUM10-based RNAi lines on transgenic reporters generated using the Gateway LR cloning approach. Furthermore, we provide genetic evidence that removing the attB1 and attB2 sites from the reporter construct, which is otherwise identical to the original transgene (same promoter, same position of insertion, same genetic background), is sufficient to abolish the off-target effect. We would argue that the functional genetic experiments we performed with the original and mutated reporters represent the strongest possible evidence to confirm that silencing is taking effect via the attB sites.

      As we do not attempt to detect siRNA complementary to attB1/attB2 sites directly, we have changed the statements in question as per the recommendation of the reviewer.

      • The current data and methods are adequately detailed and presented, and the statistical analysis adequate.

      Minor comments:

      • The current manuscript does not have specific experimental issues.
      • Prior studies are referenced appropriately
      • Overall the text and figures are clear and accurate except for the following issues with Figure 3 and its legends On lines 396, 397, 399 and 403, the authors refer to "wild-type" ubi-mCherry. This transgene directs the ubiquitous expression of an heterologous reporter gene and thus can not as "wild type". It could instead be referred to as the "original" or "unmodified" transgene.

      We removed "wild-type" from the text.

      Fig.3 L: the x-axis labels are wrong. Decrease in the mCherry intensity ratio is observed with the ubi-mCherry construct and not in the ubi∆attB-mCherry, where the attB sequences thought to be targeted by the pVALIUM10 have been deleted.

      More space should be added between the first row of images (B-G), the second (H-L) and also the third (M-P) to avoid confusion between the labeling of the figures. Finally, to help contextualize their findings and gauging the extent of the risk of using VALIUM10 lines in RNAi screen where a Gateway transgene is involved, the authors could provide information on the overlap between the VALIUM10 collection and VALIUM20, GD and KK collections. Knowing how many genes are uniquely targeted by VALIUM10, could be helpful.

      We corrected the Figure panels according Reviewer 1 and 3’s observation.

      Of the TRiP pVALIUM-based RNAi stocks currently available in BDSC, 686 genes are targeted exclusively by pVALIUM10 RNAi lines. Considering KK, GD and shRNA transgenic lines from VDRC and NIG RNAi collection, 17 genes remain unique targets for pVALIUM10 lines. The graphical overview of the availbale lines is availbale in the word/PDF file of the Response to Reviewers Letter.

      Reviewer #3 (Significance (Required)):

      • The manuscript "Drosophila pVALIUM10 TRiP RNAi lines cause undesired silencing of Gateway-base transgenes" by Stankovic et al. is a technical study that sheds light on potential limitations of using common RNAi drosophila lines, namely the VALIUM10 collection.
      • The study provides information about very specific genetic screens conditions in Drosophila, that are likely to be rare. A rapid Pubmed search with the following terms: "drosophila TRiP screen" returns only 11 citations, while a similar search with "drosophila CRISPR screen" returns 99 citations. This suggests that in vivo RNAi screen in Drosophila using TRiP RNAi collections might not be as common or powerful as CRISPR-based screens.
      • The reported findings might be of interest mostly to a small group of scientists working with Drosophila melanogaster that specifically rely on VALIUM10 lines to perform in vivo RNAi screen in combination with Gateway transgene expression. This very specific combination of parameters is rare, since other RNAi fly stock collections exist (e.g.: VALIUM20, 21, KK, GD...). Furthermore, the advent of CRISPR tools that allows tissue-specific gene knock-out has led to the rapid expansion of CRISPR fly stock collections (https://doi.org/10.7554/eLife.53865). Regardless of the limited scope of the study, this kind information is still valuable, albeit to a very limited audience.
      • My relevant fields of expertise for this study are : insect RNAi, RNAi of RNAi screens and drosophila genetics.

      We would like to raise some points concerning the above comments.

      While TRiP-screen may not be an often-used keyword combination, the use of the TRiP lines is, in fact, ubiquitous in the Drosophila community. The tissue-specific RNA interference is still commonly utilized as a rapid, first-generation screening method that can be performed in a tissue-specific manner, representing one of the key advantages of the Drosophila model. To illustrate, since the submission of our manuscript a new study published by Rylee and co-workers investigated Drosophila pseudopupil formation by screening 3971 TRiP RNAi lines (Rylee et al, 2022). In contrast, genetic screens relying on mutant alleles usually require at least one additional cross, effectively doubling the time of the experiment. In addition, tissue-specific or temporarily restricted knockdown is sometimes required in screens, as full-body loss of function is often lethal or has developmental phenotypes incompatible with assessing gene function later in life.

      The use of tissue-specifically driven Cas9 with integrated gRNA-expressing vectors is indeed becoming more common. However, this technique, much like RNA interference, is not without flaws. First, this produces knockout instead of knockdown, which means it has to be induced early in order for the resulting mutation to take effect. Otherwise, the remaining mRNA/protein may prevent the development of a phenotype. Second, the Cas9 must be titrated as high Cas9 levels have adverse phenotypes (Huynh et al, 2018; Meltzer et al, 2019; Poe et al, 2019; Port et al, 2014). Third, in our personal experience, as well as literature reports (Mehravar et al, 2019; Port & Boutros, 2022), indicate that the resulting phenotype can produce mosaics in the tissue.

      Although the combination of Gateway-based reporters with TRiP-RNAi lines may seem like a fringe case, there are popular reporters that could be screening targets. Potentially the most well-known is the live cell cycle indicator fly-FUCCI system (Zielke et al, 2014), which allows the analysis of the cell cycle in real-time thanks to the expression of two fluorescently tagged degrons. As FUCCI transgenes were constructed with Gateway recombination, they represent targets of the pVALIUM10 TRiP lines. We now include the fly-FUCCI system as an example in addition to 3xHA-tagged FlyORF collection in the Discussion.

      REFERENCES

      Boisvert FM, Ahmad Y, Gierlinski M, Charriere F, Lamont D, Scott M, Barton G, Lamond AI (2012) A quantitative spatial proteomics analysis of proteome turnover in human cells. Mol Cell Proteomics 11: M111 011429

      Gallant P (2007) Control of transcription by Pontin and Reptin. Trends Cell Biol 17: 187-192

      Huynh N, Zeng J, Liu W, King-Jones K (2018) A Drosophila CRISPR/Cas9 Toolkit for Conditionally Manipulating Gene Expression in the Prothoracic Gland as a Test Case for Polytene Tissues. G3 (Bethesda) 8: 3593-3605

      Mathieson T, Franken H, Kosinski J, Kurzawa N, Zinn N, Sweetman G, Poeckel D, Ratnu VS, Schramm M, Becher I et al (2018) Systematic analysis of protein turnover in primary cells. Nature Communications 9: 689

      Mehravar M, Shirazi A, Nazari M, Banan M (2019) Mosaicism in CRISPR/Cas9-mediated genome editing. Developmental Biology 445: 156-162

      Meltzer H, Marom E, Alyagor I, Mayseless O, Berkun V, Segal-Gilboa N, Unger T, Luginbuhl D, Schuldiner O (2019) Tissue-specific (ts)CRISPR as an efficient strategy for in vivo screening in Drosophila. Nature Communications 10: 2113

      Poe AR, Wang B, Sapar ML, Ji H, Li K, Onabajo T, Fazliyeva R, Gibbs M, Qiu Y, Hu Y et al (2019) Robust CRISPR/Cas9-Mediated Tissue-Specific Mutagenesis Reveals Gene Redundancy and Perdurance in Drosophila. Genetics 211: 459-472

      Port F, Boutros M (2022) Tissue-Specific CRISPR-Cas9 Screening in Drosophila. In: Drosophila: Methods and Protocols, Dahmann C. (ed.) pp. 157-176. Springer US: New York, NY

      Port F, Chen HM, Lee T, Bullock SL (2014) Optimized CRISPR/Cas tools for efficient germline and somatic genome engineering in Drosophila. Proc Natl Acad Sci U S A 111: E2967-2976

      Rylee J, Mahato S, Aldrich J, Bergh E, Sizemore B, Feder LE, Grega S, Helms K, Maar M, Britt SG et al (2022) A TRiP RNAi screen to identify molecules necessary for Drosophila photoreceptor differentiation. G3 Genes|Genomes|Genetics: jkac257

      Zielke N, Korzelius J, van Straaten M, Bender K, Schuhknecht GFP, Dutta D, Xiang J, Edgar BA (2014) Fly-FUCCI: A versatile tool for studying cell proliferation in complex tissues. Cell Rep 7: 588-598

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In their manuscript "Drosophila pVALIUM10 TRiP RNAi lines cause undesired silencing of Gateway-base transgenes", Stankovic et al. describe off-target silencing of transgenes expressed from Gateway systems when expressed in transgenic RNAi drosophila lines from the VALIUM10 collection. Using fluorescence microscopy and immunostaining, the authors show that this unintended silencing is specific to VALIUM20 lines and is not observed with VALIUM20, KK or GD lines that also allow gene-specific RNAi silencing. This pleiotropic silencing effect was observed in 10 different VALIUM20 lines and affected Gateway-based transgene expressed from an ubiquitous promoter (poly-ubiquitin, ubi) or from Gal4/UAS systems. Finally, the authors identify the molecular basis of VALIUM20 pleiotropic silencing on Gateway transgenes as being due to the presence of short sequences used for PhiC31-based recombination in the Gateway and the VALIUM systems, and could lead to the production of siRNAs against PhiC31 recombination sites in VALIUM10 lines. Using Gateway transgenes lacking the recombination sites (attB1 and attB2), the authors could abrogate silencing of the transgene in VALIUM10 lines, confirming the recombination as shared targets between the Gateway and the VALIUM systems.

      Major comments:

      • The study is well designed and the key conclusions are convincing.
      • However, the authors provide only fluorescence microscopy data to show decreased transgene expression. To confirm pleiotropic RNAi effect on Gateway transgenes in VALIUM10, the authors should assess silencing with another technique. For instance, expression levels of proteins from Gateway transgenes could be measured by Western blot (e.g.: by assessing protein levels of GFP or other tags present in the Gateway transgenes). In addition, the claim on line 141,"These results strongly indicate that the dsRNA hairpin produced from pVALIUM10 RNAi vectors generates attB1- and attB2-siRNAs" , should be modified. The authors only present fluorescence microscopy data to show decreased transgene expression and do not actually provide data on siRNA expression in the pVALUM20 lines. Therefore, with the current data, the authors should only say that their results suggest that the dsRNA hairpin produced from pVALIUM10 RNAi vectors generates attB1- and attB2-siRNAs. In order to substantiate their claim about pleiotropic RNAi effects from VALIUM lines on Gateway transgenes due to the production of attB1- and attB2 -siRNAs, the authors should perform an experiment to show attB1- and attB2 -siRNAs production in VALIUM10 lines and not in VALIUM20, KK or GD lines. Deep-sequencing analysis of siRNA (i.e.: miRNA-seq) from tissue expressing the corresponding RNAi transgenes would be an excellent approach to assess siRNA production in multiple samples at once. Alternatively, the authors could search published miRNA-seq datasets from VALIUM10 and other RNAi lines to assess the presence of attB1- and attB2 -siRNAs only in VALIUM10 lines. This would be free and require only a few days of data mining and analysis, if such datasets exist already. Another cheaper and faster approach (if lacking easy access to sequencing platform or bioinformatics capability) would be to perform small RNA northern blots analysis from fly tissues expressing VALIUM10 vs VALIUM20 (or KK or GD lines) and should take only a few days to do as described in doi: 10.1038/nprot.2008.67.<br /> If such experiments or analyses cannot be performed, then the authors can only conclude that their data suggest that the unintended silencing of Gateway transgenes in VALIUM10 is likely due to the production attB1- and attB2 -siRNAs production.
      • The current data and methods are adequately detailed and presented, and the statistical analysis adequate.

      Minor comments:

      • The current manuscript does not have specific experimental issues.
      • Prior studies are referenced appropriately
      • Overall the text and figures are clear and accurate except for the following issues with Figure 3 and its legends On lines 396, 397, 399 and 403, the authors refer to "wild-type" ubi-mCherry. This transgene directs the ubiquitous expression of an heterologous reporter gene and thus can not as "wild type". It could instead be referred to as the "original" or "unmodified" transgene. Fig.3 L: the x-axis labels are wrong. Decrease in the mCherry intensity ratio is observed with the ubi-mCherry construct and not in the ubi∆attB-mCherry, where the attB sequences thought to be targeted by the pVALIUM10 have been deleted.
      • More space should be added between the first row of images (B-G), the second (H-L) and also the third (M-P) to avoid confusion between the labeling of the figures. Finally, to help contextualize their findings and gauging the extent of the risk of using VALIUM10 lines in RNAi screen where a Gateway transgene is involved, the authors could provide information on the overlap between the VALIUM10 collection and VALIUM20, GD and KK collections. Knowing how many genes are uniquely targeted by VALIUM10, could be helpful.

      Significance

      • The manuscript "Drosophila pVALIUM10 TRiP RNAi lines cause undesired silencing of Gateway-base transgenes" by Stankovic et al. is a technical study that sheds light on potential limitations of using common RNAi drosophila lines, namely the VALIUM10 collection.
      • The study provides information about very specific genetic screens conditions in Drosophila, that are likely to be rare. A rapid Pubmed search with the following terms: "drosophila TRiP screen" returns only 11 citations, while a similar search with "drosophila CRISPR screen" returns 99 citations. This suggests that in vivo RNAi screen in Drosophila using TRiP RNAi collections might not be as common or powerful as CRISPR-based screens.
      • The reported findings might be of interest mostly to a small group of scientists working with Drosophila melanogaster that specifically rely on VALIUM10 lines to perform in vivo RNAi screen in combination with Gateway transgene expression. This very specific combination of parameters is rare, since other RNAi fly stock collections exist (e.g.: VALIUM20, 21, KK, GD...). Furthermore, the advent of CRISPR tools that allows tissue-specific gene knock-out has led to the rapid expansion of CRISPR fly stock collections (https://doi.org/10.7554/eLife.53865). Regardless of the limited scope of the study, this kind information is still valuable, albeit to a very limited audience.
      • My relevant fields of expertise for this study are : insect RNAi, RNAi of RNAi screens and drosophila genetics.
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Stankovic, Csordas, and Uhlirova show that a specific subset of the TRiP RNAi lines available, namely the pVALIUM10 subset, can cause a knockdown of certain co-expressed transgenes that contain attB1 and attB2 sites. The authors demonstrate that while pVALIUM20 or Vienna KK lines for BuGZ or myc RNAi do not affect RNase H1:GFP expression, pVALIUM10 RNAi lines against BuGZ or myc significantly decrease expression of the RNAseH1:GFP transgene. The authors propose that, due to how these RNAi lines were constructed, the siRNA products could be targeting to attB1 and attB2 sites in transgenes that were made using similar methodology. To support this idea, they ubiquitously express mCherry transgenes encoding mRNAs either containing or lacking attB sites. They find that the knockdown of mCherry seen with several different pVALIUM10 RNAi lines is observed with the reporter mRNA containing attB sites, but is suppressed when the attB sites are removed from mCherry mRNA. They also find that the pVALIUM10 RNAi lines reduce the expression of the FlyORF transgene SmD3:HA.

      The paper is very clearly written and the data presented is convincing.

      Minor suggestions:

      1. Figure 3 L+M The labels for the ubi-mcherry and ubiΔattb-mcherry are switched in these graphs (i.e. ubiΔattb-mcherry should be the one with a higher intensity in the pouch compared to the notum).
      2. Figure 3M the labels don't match the RNAi lines used in H-K.
      3. Figure 2 and 3. For the images of the transgenes, it seems as if the BuGZ RNAi line has a more drastic effect on RNaseH1 than mCherry, and vice versa for the myc RNAi lines. Did the authors notice a pattern with the decreased expression. Do some of the RNAi lines have a more consistent/severe impact, or might different transgenes be impacted to different extents?
      4. Line 150 unnecessary comma after Both Line 131 knockdown should be knocked down Line 133 should be "using an additional" Figure legend 1 wing disc should be at least written out when the abbreviation (WD) is first used.

      Significance

      Overall, this manuscript is an informative reminder that RNAi lines can have weaknesses that have not yet been considered, and we appreciate the authors work to inform the fly community about this specific issue. These insights are crucial for fly labs to consider when planning experiments that will use the pVALIUM10 RNAi lines in combination with other transgenesis modalities. The manuscript also provides a cautionary note for the usage of similar resources in other model organisms.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript by Uhlirova and colleagues identified an unwanted off-target effect in the pVALIUM10 TRiP RNAi lines that are commonly used in the fly community. The pVALIUM10 lines use long double-stranded hairpins and are useful vectors for somatic gene knock-down, hence they are widely used.

      Here the authors find that any pVALIUM10 TRiP RNAi line can create the silencing of any transgenes that were cloned with the commonly used Gateway system. this is caused by targeting attB1 and attB2 sequences, which are also present in other Drosophila stocks including the transgenic flyORF collection. Hence, this is an important and useful information for the fly community that should be published quickly. All experiments are well documented and well controlled. I only have a few minor comments.

      1. I recommend to mention the number of 1800 pVALIUM10 lines in Bloomington in the abstract rather than 11% to make clear that this is an important number of lines. (1800 of 13,698 lines in Bloomiongton are 13 and not 11 per cent?)
      2. The authors may consider to call the 'unspecific' silencing effect an 'off-target' effect compared to intended 'on-target'. Such a nomenclature would be more consensus.
      3. Ideally, all the imaging results in Figure 2 and 3 would be quantified. The simple 'V10' label in the Figure 3L and 3M is not the most intuitive, at least it took me a while to figure out what the authors compare.
      4. Does the silencing also affect attR sequences? These are present after cassette exchange in many transgenes, most of the time not in the mRNA though, so it might not be so relevant.

      Significance

      This is an important and useful information for the fly community that should be published quickly.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to reviewers


      Reviewer #1 (Evidence, reproducibility and clarity (Required)): The authors develop a previously identified lead compound for the blocking of malaria transmission from humans to mosquitoes further and identified a protein target of the chemical. The protein target, Pfs16 is long known to be upregulated in gametocytes and has been speculated to be a target for small molecules. The work is well (if at time maybe too well/too detailed) described and potential shortfalls are highlighted.

      My major comment is that without a deletion mutation of Pfs16, the paper will remain somewhat preliminary. I would strongly encourage the authors to generate such a mutant and compare it to the parasites treated with their drug candidate. I feel the text can be much shortened and a lot of information moved to the materials and methods. The conclusions should be toned down on several occasions (abstract, introduction, discussion). Avoid adjectives, e.g. what is a 'powerful starting point' (abstract) or 'compelling interdisciplinary evidence' but hot air?

      We thank the reviewer for this comment. However, we would like to reiterate (as stated in the manuscript) that knockout of Pfs16 in P. falciparum is transmission lethal, i.e. you do not get progression of male gametogenesis. Thus, whilst re-generation of a Pfs16 KO would be interesting in terms of comparing phenotypically with the drug treated parasites, we are not convinced it would add any further evidence of support for or against our conclusion in terms of the ability of the N-4HCS scaffold to target this protein. E.g. we could drug treat a Pfs16 KO but this would not be expected to show gametogenesis irrespective of treatment. Therefore, whilst of academic interest, we believe it is satisfactory to judge our phenotypic work based on published accounts of the Pfs16 KO without having to engage in the costly experiments to regenerate the parasite and work on it side-by-side, especially given the limited resolution it would give towards the overall goal of the work in terms of defining the effect and likely target of this drug class on parasites.

      Addressing the second comment, we are happy to alter areas of the paper that may have over-stated the conclusions of the work including the abstract/introduction and discussion.

      CROSS-CONSULTATION COMMENTS I think these three reviews are pretty much in line with their overall assessment. I am happy if send as is to authors as it will help them shape a much better paper

      Reviewer #1 (Significance (Required)):

      The paper shows that very likely a new chemical with some potential for transmission inhibition of malaria parasites for mosquitoes binds to a Plasmodium protein that is specifically expressed in the sexual stages of the parasite.

      The paper compares to good papers published in journals like ACS Infectious Diseases or Antimicrobial Agents and Chemotherapy, but I am not sure which of the Review Commons sister journals it would fit to. I am a molecular parasitologist.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): Transmission blocking drugs are of high interest as a strategy to combat malaria but they are difficult to study. For instance it is problematic to raise resistant parasites to find mode of action of transmission blocking drugs and to identify their targets in the cell. In this manuscript Yahiya et al. build on previous work which identified the N-4HCS scaffold, of which DDD01035881 is the lead compound, as an inhibitor P. falciparum male gametocytes. Using PAL to enrich for target proteins Pfs16 was identified and validated as a possible target of DDD01035881. Binding was validated through CESTA. Determination of the phenotype following DDD01035881 treatment was found to partially match the previously published Pfs16 KO phenotype. However curiously no impact was seen in gametocytogenesis despite published evidence of Pfs16 being involved in sexual conversion. The authors speculate as to reasons but a direct experimental comparison with Pfs16 mutant parasites (which likely would have been revealing) is not provided. On the positive side, this analysis of the stage-specific effect of the drug pinpoint the stage inhibited during microgamete development which is a very interesting part of the manuscript.

      We thank the reviewer for this positive assessment of our work. Mirroring comments above, our challenge with Pfs16 knockout or mutation is that if we ablate Pfs16 function we cannot assess the effect of drug action. Definition of a mutant that would demonstrate precisely the drug mode of action would require structural resolution of drug bound to target (i.e. to identify which residues to target) – this is a major goal for our research group moving forwards, but likely many years’ work. In general, our core approach here has been one of chemo-proteomic based methods and phenotypic investigation of the novel antimalarial. Further evidence might be forthcoming from molecular genetics/structural biology, but we believe these are beyond the scope of the current work (and our available resources at present). We state future directions in the discussion and can add more to this in any revised manuscript.

      This work deepens the understanding of a novel class of transmission blocking drugs with reasonable potency (foremost (-)-DDD01028076, which has low nanomolar activity, the modified versions considerably less). Question on how to achieve serum concentrations for sufficient potency aside, these compounds will in the very least provide experimental tools to study their mode of action and might reveal interesting biology. This work is therefore of interest to the malaria field.

      The experimental methodology seems excellent but some of the results raise questions that make definite conclusions difficult and this should be addressed. Overall, this is very solid work but leaves some doubts whether Pfs16 is indeed the (only) target of this class of compounds.

      Major comments: 1. The reasons for excluding Etramp10.3 are not convincing. In fact it could be argued it is nearly as good a candidate as Pfs16. Contrary to the author's statements in the results section, etramp10.3 transcription is highly upregulated in gametocytes (see e.g. PMID: 22129310) with a generally very low transcription in asexual stages. It is argued that Etramp10.3 is essential in blood stages because MacKellar et al failed to disrupt the gene and because the PiggyBAC screen predicted it to be essential. However, if this is an argument for exclusion then this would also apply to Pfs16 which is also predicted by the PiggyBAC screen to be essential (likely both are non-essential in blood stages as they are barely expressed but Pfs16 and Etramp10.3 might by chance have not received an insertion in the PiggyBAC screen due to their very small size which may also explain failure of disrupting integration in MacKellar). Given the finding that the drug binds Pfs16 only in late gams it might also be argued that an essential function in asexuals might not be affected if they behave similarly to young gams and hence this criterion is not valid anyway.

      Further following this line of thought that ETRAMP10.3 could be a hit equivalent to Pfs16, Figure 2D shows a band below the band considered to be Pfs16. It would not be all surprising if this were ETRAMP10.3 (the size would fit).

      We don’t disagree with reviewer 2’s comments that ETRAMP10.3 could be an additional target. Although not traditionally related there is some similarity between these proteins and it may be that at the macroscopic level there is a structural homology between them. As stated elsewhere we are happy to tone down the assertion that Pfs16 is the only drug target candidate, leaving open the possibility of future follow up work that may yet reveal additional targets. This cannot be explored much further without extensive experimentation, which is beyond our current capacity. Given the strong phenotypic effect on gametocytes, whilst ETRAMP may be upregulated, this paper naturally focused its core attention on Pfs16 as a candidate target. We certainly subscribe to the view that absence of evidence is not evidence of absence.

      Both, Pfs16 and ETRAMP10.3 can be expected to be very abundant proteins in the parasite periphery in gams. Can the authors exclude that these simply are the first to encounter the N-4HCS photoaffinity probe and that this may have led to their enrichment in the target identification experiments. The biochemical data argues for a specific interaction with Pfs16, but by itself is not that strong. Given the discrepancies of the phenotype with the Pfs16 disruption and the peculiar finding that the drug binds Pfs16 only in later stage gametocytes, it might be a good idea to further caution the conclusion of Pfs16 as the inhibited target.

      We don’t necessarily agree that the evidence is not strong (three methods pointing to the same target is by many accounts solid evidence). Additionally, whilst it is true that the N-4HCS photoaffinity probes likely interact with PVM proteins in first instance, it is also worth noting that this doesn’t necessarily deduct from their likelihood to be true targets, but instead fits with the N-4HCS phenotype. We observe the compounds to inhibit microgametogenesis without any prior incubation and to retain this activity even beyond activation of microgametogenesis, specifically during the window in which the PVM remains associated with the parasite. Our phenotypic observations therefore fit with the notion that the molecules target proteins that lie within the PVM and interact with the molecules at first instance. Whilst we understand the concern that PVM proteins may be likely to be enriched given their abundance and localisation, we believe this to support our phenotypic findings.

      The phenocopy evidence of the NH compounds with the Pfs16 disruption is based on comparison with published evidence. It would have been much preferred to have a side-by-side comparison with the (or an) actual Pfs16 disruption parasite line. Although the authors stress that the phenotype with DD01035881 fits the phenotype of the targeted gene disruption in the results, this only partially matches the cited publication (PMID: 14698439) which concludes there is an effect on the number of gametocytes produced. The exflagellation phenotype in that publication was classified as preliminary. Although this is discussed, the main results text should be adapted to reflect this and the conclusion that Pfs16 may be the target should be further cautioned.

      As stated, we are happy to tone down conclusions in this direction. We also note comments above about Pfs16 disruption.

      Minor comments: 4. From the modifications of the compounds it seems the chemical space for further modification to achieve higher potency is limited with this scaffold. Maybe the authors can comment whether they envisage this to be a potential obstacle.

      The modification space of the compounds is explored extensively in previous work from our group, which we feel more than adequately addresses this question. See Rueda-Zubiaurre et al (2020) J Med Chem.

      Line 67: references are superscript.

      We can change this

      Line 77: I would recommend replacing 'quiescence' here, a cell that matures is not quiescent.

      We can change this

      Line 116: consider removing 'interdisciplinary'.

      We can change this

      Line 120: I would caution here (see major comments) and recommend a less definite proclamation of Pfs16 as a promising new drug target

      We can change this along with the general “tone” of the manuscript.

      Page 7: compounds 9 is still considered active ("retained micromolar activity"), but in Table 1 this is given as >1000nM. Please add the actual IC50.

      We can add this to the final version. The actual IC50 for this compound was 1.7uM. For the SAR study we grouped compounds with IC50 >1uM into discrete groups based on rough IC50 (>1uM, >10uM etc.) hence this fell in the intermediate group.

      Line 138- 173: The order in which this is discussed makes it unclear that the work described was done prior to, and guided, the synthesis of compound 1 and probe 2

      This can be addressed in a revised manuscript.

      Line 194: was the data deposited in a database?

      The proteomics data has not been deposited in a database but is accessible in the extended SI.

      Line 202: introduction as to the benefits of using a competition + probe condition here could aid reader understanding. The interpretation of this data is complicated by the covalent and reversible binding of the two compounds and the weight of this control is therefore difficult to gage.

      We can embellish the description here.

      Table 2 and Extended Data Table 1 show different p values and enrichments for the same hits. This is confusing. It would also be useful to label the hits in the scatter plots in Figure 2 for easy identification and comparison to the tables.

      We can amend this and label each hit within the scatter plot.

      Line 215-218, please correct the data on Etramp10.3 (see major points) and put in perspective to Pfs16 (Etramp10.3 is similarly upregulated in gams where it is highly expressed; PiggyBAC predicts essentiality for Pfs16 and Etramp10.3 in blood stages).

      We can discuss this to a limited extent for future exploration of Etramp10.3.

      Line 221: the results from the PiggyBAC screen are stated as fact, but what the screen provides is a prediction of the probability of importance for parasite growth. I would replace 'is' with 'is predicted' (even though in the case of Rab1b it seems likely the prediction is correct).

      We can change this

      Line 233 and elsewhere: define 'reversibility' (binding? activity?).

      We can change this

      Line 240: clarify what is in the cited paper (see major points).

      We can clarify this

      Line 297: We utilised in-lysate...... clunky sentence, please rephrase.

      We can change this

      Line 325: reference is missing the year.

      We can change this

      Line 343: It is utterly puzzling that binding is specific to Pfs16 in mature gametocytes and I do not find the explanation in the discussion convincing (see point 28 below). Do the authors have another explanation? Could Pfs16 be modified in later gams (or vice versa)?

      We believe that Pfs16 is functionally different at different stages of gametocyte development, this is either in terms of its presentation (e.g. perhaps due to complex formation, though this remains elusive) or the functionality of different domains, as per the effect of different truncation mutants. We can address some of these concerns in a revised manuscript.

      Line 388: Justification seems odd as a PV protein would be unlikely to directly impact DNA replication. Please rephrase the sentence.

      We can change this

      Line 405: remove the 'to'

      We can change this

      Line 411: it would be useful to the reader to state at what IC-value the drug was used in these experiments.

      We can state this

      Line 431: While the alpha-tubulin staining indicates exflagellation and is similar to the DMSO only control, the staining for the RBC membrane (Glycophorin A) and DNA (DAPI) appear different, yet this is ignored. One interpretation of this could be that while late treatment doesn't block exflagellation, it still impacts other aspects of microgamete development.

      We can make mention of this

      Line 436: IFA work was done with drug treatment post activation while EM was done post activation but drug treatment prior to activation. Is there a reason for this?

      The reviewer is astute to point this out. Limitations with access to the EM facility meant that whilst IFAs were completed for pre-activation treated samples, the post-activation EM became impossible as the EM facility closed during the COVID lockdown. Thus, we do not have a complete set here. However, we do not feel this takes away from the EM observations presented. We can clarify this incompleteness in the revised manuscript.

      Line 450: is this really CytB, or was it CytD?

      We did indeed used Cytochalasin B here, which whilst less potent than D does still target microfilament formation.

      Line 465: Pfs16 localised to vesicles: there is no data showing the dots in the micrograph are vesicles, please rephrase.

      We can change this

      Page 19 and 20, discussion on stage-specific differences of Pfs16 during gametocytogenesis to explain the difference in binding: without experimental data using H-4HCS in the parasites of the publication cited to explain this (PMID: 21498641), this is very speculative. The cited work used episomal expression of Pfs16 tagged with fluorescent proteins. This would be the first integral PVM protein that is actually inserted into the PV membrane when tagged in that way (usually this results in a PV location), casting some doubt on the findings in that paper. All in all the provided explanation is not very convincing.

      We can attempt to clarify this in a revised discussion.

      Line 519: if with the conserved part the N-terminus is meant, then this has for other PVM proteins already been shown to be PVM internal, not facing the erythrocyte (show in very early work; PMID: 1852170 but also multiple times after that).

      We can clarify this

      Line 534: consider replacing 'highly plausible' with something more cautious.

      We can change this

      Line 550: Given this discussion how stable are N- 4HCS compounds?

      We can clarify this.

      Table 1: Having all chemical structures in same orientation would be nicer visually. I assume blue indicates modification but this is not stated.

      We can change this

      Figure 1: Please use different colours or symbols. The dark green crosses and the blue Pfs16 cross are hard to distinguish.

      We can change this

      Figure 3d: Unclear as to why a difference temperature range is displayed here.

      We can clarify this

      Figure 3e: Unclear % Inhibition compared to what.

      We can clarify this

      Figure 5G: What is the white arrow pointing to?

      We can clarify this

      Figure 5j: Given how the explanation is written this would make more sense between current image 5G and 5J.

      We are not sure what the comment relates to here but we can endeavour to clarify this

      Figure 6: Erythrocyte membrane colour not stated in legend.

      We can change this

      Figure 6A: were the exposure times similar? How can so little be left after ~4-5.5 minutes but at later time points there seems to be much more Pfs16 signal left? Maybe amount of signal should be taken into consideration to establish the fate of Pfs16 in the process.

      We can endeavour to clarify this

      Figure 6B: is the second phenotype (successful but aberrant egress) shown? The only image where WGA is not circular around the parasite is an exact match of Pfs16 which is in dots (image at 7.5-8.5 minutes). The imaging data for this phenotype should be presented more clearly.

      We can attempt to clarify this

      Reviewer #2 (Significance (Required)):

      Nature and significance: a lot of weight has been placed on transmission blocking drugs although there are also a number of problems associated with them (ethics for testing and use etc; drugs acting on asexuals and transmission stages alike might be even more useful). Transmission blocking drugs are difficult to study and this work is therefore important. The experiments are well done, but the conclusions are not fully convincing, leaving some doubts in regard to Pfs16 being the actual target of the class of drugs studied.

      Compare to existing published evidence: it is a logic continuation of previous work and this is appropriately highlighted in the manuscript.

      Audience: medium interest for malaria researchers; high interest for researchers working on transmission blocking drugs and those studying microgametes.

      Your expertise: malaria, P. falciparum, biology of apicomplexans

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): The manuscript by Yahiya et al describes an extensive investigation of the mode of action of DDD01028076, which specifically inhibits microgametogenesis in Plasmodium falciparum. The phenotypic characterisation of the MOA uses some very nice imaging to demonstrate the point at which this compound inhibits microgametogenesis. The authors have also attempted to identify the molecular target using chemoproteomics and label-free CETSA techniques. The photoaffinity labelling and pull-down approach suggested the Pfs16 may be preferentially enriched by a PAL probe that is representative of this series. However, the data supporting the validation of this target is not very conclusive, and in some cases argues against Pfs16 being a specific target of DDD01028076. Whilst the presented data makes a significant contribution to the literature regarding a novel drug candidate that targets microgametogenesis, it does not support the author's claims that Pfs16 is the target.

      Major Concerns: The strongest evidence for Pfs16 being the target comes from the chemoproteomics pull-down study that found Pfs16 to be the most significantly enriched protein by compound 2 vs DMSO. However, this should be interpreted with caution as it is based on only 3 replicates and omics studies are prone to false-positives. That only 125 proteins were detected also raises questions about the coverage of the proteomics, it is quite possible that the actual target is not detectable using this method, and the Pfs16 appears because it is one of the more abundant proteins during this stage of the lifecycle.

      As discussed, we are happy to tone down the conclusions about Pfs16 being an exclusive target for the N-4HCS drug class, however, we feel the reviewer is being unnecessarily negative. There are myriad papers in the literature based on singular proteomics experiments (given their cost, complexity and time -consuming nature) that then facilitate downstream experiments that support findings. We have endeavoured to be as thorough as we could in the work and believe, like others, three replicates of a massive experimental pipeline should be sufficient to make a defined conclusion – whether the additional downstream evidence we have then leaned on is supportive of this (as we judge it to be) is another matter. We agree, proteomics often suffers with low protein abundance. The complexity of growing large quantities of gametocytes is familiar to anyone who has struggled to grow these finicky parasites at a larger scale than 10-25mL dishes. Given the scales we have reached, we believe these might in fact be some of the most comprehensive proteomics studies to date!

      Somewhat concerningly, the control with 1 as the competitor did not show significant enrichment of Pfs16, although a trend was observed. More concerning, was the lack of enrichment when using DDD01028076 as the competitor. This result essentially proves that Pfs16 is not the specific target (and the argument about reversibility is unlikely since most drugs are reversible binders, but many have worked with this type of approach). It is surprising that DDD01028076 (ideally the (-) form) wasn't used as the competitor for the proteomics study. This compound has ~100-fold better potency than the probe 2, which should provide much better competition that 1. It would also be more specific than 1, which is an important control considering that (-)-DDD01028076 has activity in the low nanomolar range, whereas 2 acts in the micromolar range. Non-specific interactions are an important consideration to exclude, and whilst 1 is structurally similar, it is not very potent and therefore not the best control to find the target associated with activity.

      Whilst we understand the concerns with insignificant enrichment in the competition labelling, we believe the enrichment in the presence of photoaffinity probe 2 over background (i.e. DMSO vs. probe experiments) to be of more value given the design of the experiment. The competition experiments were performed by co-treating gametocytes with photoaffinity probe 2 and parent molecule 1 prior to UV irradiation, to enable irreversible conjugation to protein target(s). However, given that both compounds, probe and parent, theoretically bind to Pfs16 at the PVM in a reversible manner (i.e. losing interaction with even gentle washing), UV irradiation is likely to favour probe-binding irrespective of competition with a marginally more potent parent molecule (in this case, parent molecule 1). This is especially true as treated parasites were very thoroughly washed after irradiation, so should the parent molecule have bound the target protein(s), these drug-target interactions were likely lost during stringent washing. The drug-target interactions with parent molecule 1 wouldn’t have been aided by UV irradiation, as the molecule lacks the functional group required for bioconjugation. So, even if parent molecule-target interactions were more abundant than probe-target interactions, interactions between parent molecule 1 were most likely lost and proteins bound by probe were enriched.

      This would have been true with more potent N-4HCS derivatives such and DDD01028076 and (-)-DDD01028076 (where potency is tested in the DGFA, independent of bioconjugation), and here we opted for a structurally similar compound of similar potency to not skew competition solely based on potency.

      We can embellish on this in the revised manuscript to make our conclusions from this part clear.

      A closer look at the gels in the supplementary data raises many questions that undermine the authors conclusions: - Fig S1a - The lane without probe (2) still identifies Pfs16 (or a protein at that MW) as the most abundant protein. Also, as the Pfs16 band increases, you can see that most other proteins also increase in abundance, so either the loading is inconsistent, or the probe actually causes non-specific enrichment of many proteins. This figure also indicates that the washing protocol is not sufficient to remove non-specific binders. Given the covalent nature of the PAL approach I would think a very thorough washing protocol could be employed.

      It is certainly the case that Pfs16 is abundant in gametocytes, a reason behind its early discovery. Thus it is challenging to remove it from background. We still believe the enrichment to be specific, highlighting the comparative work with Pfg377 in Figure 2. Further repetitions with more stringent washing might resolve the background, however, this is beyond our current resources to repeat.

      -- Running another negative control in the proteomics using one of the inactive controls from table 1 might help to disambiguate specificity.

      We don’t disagree with this though this would involve an entire re-running of the experimental workflow which is not possible.

      • Fig S2a - The anti-Pfs16 Western blots show that this protein is actually enriched more in the flow-through than the eluates. This shows that this protein is not specifically enriched by the PAL-CuAAC pull-down, it is just more abundant in the treated samples.

      Again, the presence of Pfs16 in the flow-through is unsurprising, given its abundance in stage V gametocytes. The relative abundance in the eluate is not an indication that the binding and subsequent enrichment is not specific, rather this shows the compound does not necessarily bind each and every protein – which is not unexpected. The crucial conclusion to be drawn here is the concentration-dependent enrichment of Pfs16 in the eluate in the presence of probe.

      • Fig S2b - The darkest Pfs16 spot is actually the sample with no UV treatment. This is a negative control, so should not enrich the target protein. This sample also has significant signal in replicates A and C.

      As we have noted above, it is not unsurprising that modification of the N-4HCS scaffold to yield this probe may introduce a level of irradiation-independent binding, which explains the presence of signal in the UV-independent sample.

      • Fig S2c - This blot is very messy and difficult to read, but in general the Pfs16 spots in the IGF don't correlate with the intensities in the anti-Pfs16 western.

      These experiments are extremely challenging (something that is perhaps beyond the expertise of the reviewer) and what is presented is the result of substantial optimisation. Loss of AzTB fluorescence in the gel which is subsequently analysed by western blot explains this.

      • Fig S2 - This data, and the main figures based on this data, generally don't support the hypothesis that Pfs16 is the specific target. The controls are not as would be expected, and there are no loading controls. Looking at the flow-throughs suggests that there was just more Pfs16 (and possibly total protein) in the treated samples before the enrichment step. The Pfg377 also appears quite variable in the different samples, with replicates B and C not consistent with A.

      We do not concur with the reviewer here and their dismissal of what was extremely thorough and well-executed experimtns. These are not like traditional western blots and require substantial optimisation. We refer them to our previous point in reference to the UV controls. With regards to the Pfg377 variability, the experiment itself is inherently variable with such large volumes of parasites. In many cases, for example, the male:female ratio within a mature gametocyte culture can vary and this can contribute to the variability in 377 abundance between replicates.

      The other major concern is with the CETSA analysis, which appears to show very minor stabilisation of Pfs16, but the specificity of this target is questionable, and the data has the following inconsistencies. - The supplementary data only shows n=1, yet there are error bars in the main figures. Where did these come from?

      The individual western blot replicates can be provided in a revised manuscript if judged important.

      • The samples with apparent destabilisation are all near the edge of large western blots, which often doesn't run straight and has no loading controls. We need to see the loading controls.

      Given all proteins within a lysate will aggregate with thermal treatment, antibody loading controls are not feasible with these experiments. Each sample is normalised prior to thermal stabilisation (ensuring the same protein quantity is treated in both DMSO and drug, at each temperature) and any protein that is not aggregated is loaded – the nature of CETSA itself is to compare the stabilisation between DMSO and drug.

      • The melting temperature of Pfs16 is extremely high at around 85 degrees C. Most plasmodium proteins melt at around 50-60 degrees (Dzekian et al, 2019). Even the cited work on membrane proteins didn't go to those temperatures (Kawatkar et al, 2019) Can this high temperature be explained, and has the CETSA approach been validated at such high temperatures where additional physical and chemical processes may be occurring in the sample?

      We agree that this temperature of stabilisation is unusually high and may require further biochemical validation. Without further investigation we cannot say definitively why the melting temperature of Pfs16 is so high, but suspect its size and membrane localisation may play a role.

      • The lack of difference between + and - isomers suggests that the very small stabilisation observed here is not specific to drug activity, but is more likely a non-specific binding effect. Additional negative control compounds might help here, but the + isomer is probably the best negative control (albeit the concentrations were not ideal in the presented data).

      Please we have already addressed this in the text – refer to line 312 and beyond.

      • The very high concentration (100uM) increases the chances of non-specific effects being observed here (especially since the authors claim to see stabilisation at about 10nM). The study should be repeated at lower concentrations (with negative controls) in order to confirm a specific binding effect.

      Whilst further replicates with different conditions might be preferable, as discussed extensively here, this would be beyond the scope of what we are able to achieve for a revision.

      • The concentration-ranging study was performed at 78.4 degrees, at which temperature very little denaturation of Pfs16 occurs fig S4a (and Fig 3b-c). Therefore, you would not expect to see any drug-induced stabilisation, and it is not plausible that significant stabilisation could occur at this temperature. Therefore, the apparent destabilisation at sub-10nM drug concentrations is highly questionable.

      We would have to agree to disagree on this point.

      • Stabilisation of Pfs16 did not occur in lysates from younger gametocytes (fig s4g-h), but this is a biophysical assay, so regardless of the function of this protein at different stages, the biophysical interaction between the drug and the protein should be the same regardless of the source of the protein. This data argues against Pfs16 being a specific binding target of Pfs16.

      We don’t agree with this statement, since the drug is binding the protein in native lysate – this may be a multi-meric complex (homo or hetero) which only exists at certain stages. As such we disagree with the reviewer that this argues against Pfs16 being the target.

      In addition to the above concerns, the fact that this compound doesn't inhibit the earlier functions of Pfs16 in gametocytogenesis, and that it doesn't inhibit P. berghei, also argue against this being the specific target of this drug. Whilst the authors have a valid argument that these findings don't exclude the possibility of stage-specific targeting of Pfs16, we could also argue that all the phenotypic data in figures 4-6 is merely correlative of a drug that acts at the same point in the lifecycle as Pfs16.

      We have discussed this in the manuscript and strongly feel the reviewer is being unnecessarily dismissive of a body of work that is coherent. We are happy to tone down the narrative of the paper with Pfs16 being the exclusive target. Structural homology of P. berghei Pfs16 orthologues has never been done but it would not be unprecedented if another target was functionally homologous (an idea we are currently pursuing). Stage specificity is also possible given the nature of Pfs16 (e.g. if it is in a complex). The reviewer appears fixated on a singular entity and unable to imagine a complex scenario where structure or protein-protein interactions might affect drug binding (as it does with other proteins present in complexes, e.g. proteasomal targeting drugs).

      Overall, I believe that significant additional studies would be required to identify the target of this compound. Either by repeating the included studies with additional controls and conditions, or by follow-up studies such as genetic manipulation (knock-down or overexpression) or heterologous expression and biophysical binding studies.

      Alternatively, the manuscript could be restructured as primarily a report on the phenotypic effect of this compound on microgametogenesis, with the target identification work reported as a hypothesis-generating chemoproteomics study that provides some ideas about possible targets, but requires substantial follow-up to confirm the target (which may be beyond the scope of this report?).

      We strongly disagree with this reviewer’s entire dismissal of an extensive body of work. In line with other reviewers comments we accept a need to tone down our conclusions, but do not consent to dropping the majority of the paper in favour of a phenotypic descriptive work.

      MINOR COMMENTS The manuscript is very well-written and presented.

      Several of the conclusions are overstated (as detailed above) and several statements should be tempered based on this data (e.g. statements linking DDD01028076 effects to Pfs16 function).

      We can address the overstatement of conclusions in a revised manuscript.

      I find the term 'crosslinking' confusing for the photo-affinity labelling, as crosslinking in proteomics often refers to crosslinking between proteins (not between protein and drug).

      This is simple to address – to minimise confusion for readers, we can simply state where photoaffinity labelling and bioconjugation were performed (and not refer to the latter as crosslinking).

      The data and terminology around activity (IC50) for compounds in table 1 is a little confusing. Some IC50 values are reported as >1000, while others have precise mean values reported over 1000, and others are >10,000 or >25,000. This is especially confusing where 9 is claimed to have retained activity, but is >1000. If consistent thresholds are not appropriate then perhaps including dose response curves in the supp data might be necessary to explain these?

      We can simply provide the provide IC50s for compounds of greater potency. We are also happy to provide the curves but with such a large body of work already, this might be unnecessary.

      Reviewer #3 (Significance (Required)):

      The work is potentially interesting to Plasmodium biology and drug discovery researchers. The concept of a transmission-blocking drug is quite attractive to this community, so the topic is highly relevant. Keeping in mind that this compound was reported previously, the main novelty is in defining it's window of activity during the microgametogenesis process, and differentiating this from other drugs/compounds that inhibit this process. There is clearly an advance in knowledge presented here.

      If Pfs16 were to be confirmed as the target of this series then I think that this study would have much greater impact and attract interest from a broad audience. However, at this stage I don't see strong evidence for this hypothesis, and some of this data casts significant doubt on the likelihood that Pfs16 is the direct target.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Yahiya et al describes an extensive investigation of the mode of action of DDD01028076, which specifically inhibits microgametogenesis in Plasmodium falciparum. The phenotypic characterisation of the MOA uses some very nice imaging to demonstrate the point at which this compound inhibits microgametogenesis. The authors have also attempted to identify the molecular target using chemoproteomics and label-free CETSA techniques. The photoaffinity labelling and pull-down approach suggested the Pfs16 may be preferentially enriched by a PAL probe that is representative of this series. However, the data supporting the validation of this target is not very conclusive, and in some cases argues against Pfs16 being a specific target of DDD01028076. Whilst the presented data makes a significant contribution to the literature regarding a novel drug candidate that targets microgametogenesis, it does not support the author's claims that Pfs16 is the target.

      Major Concerns:

      The strongest evidence for Pfs16 being the target comes from the chemoproteomics pull-down study that found Pfs16 to be the most significantly enriched protein by compound 2 vs DMSO. However, this should be interpreted with caution as it is based on only 3 replicates and omics studies are prone to false-positives. That only 125 proteins were detected also raises questions about the coverage of the proteomics, it is quite possible that the actual target is not detectable using this method, and the Pfs16 appears because it is one of the more abundant proteins during this stage of the lifecycle.

      Somewhat concerningly, the control with 1 as the competitor did not show significant enrichment of Pfs16, although a trend was observed. More concerning, was the lack of enrichment when using DDD01028076 as the competitor. This result essentially proves that Pfs16 is not the specific target (and the argument about reversibility is unlikely since most drugs are reversible binders, but many have worked with this type of approach). It is surprising that DDD01028076 (ideally the (-) form) wasn't used as the competitor for the proteomics study. This compound has ~100-fold better potency than the probe 2, which should provide much better competition that 1. It would also be more specific than 1, which is an important control considering that (-)-DDD01028076 has activity in the low nanomolar range, whereas 2 acts in the micromolar range. Non-specific interactions are an important consideration to exclude, and whilst 1 is structurally similar, it is not very potent and therefore not the best control to find the target associated with activity.

      A closer look at the gels in the supplementary data raises many questions that undermine the authors conclusions:

      • Fig S1a - The lane without probe (2) still identifies Pfs16 (or a protein at that MW) as the most abundant protein. Also, as the Pfs16 band increases, you can see that most other proteins also increase in abundance, so either the loading is inconsistent, or the probe actually causes non-specific enrichment of many proteins. This figure also indicates that the washing protocol is not sufficient to remove non-specific binders. Given the covalent nature of the PAL approach I would think a very thorough washing protocol could be employed. -- Running another negative control in the proteomics using one of the inactive controls from table 1 might help to disambiguate specificity.
      • Fig S2a - The anti-Pfs16 Western blots show that this protein is actually enriched more in the flow-through than the eluates. This shows that this protein is not specifically enriched by the PAL-CuAAC pull-down, it is just more abundant in the treated samples.
      • Fig S2b - The darkest Pfs16 spot is actually the sample with no UV treatment. This is a negative control, so should not enrich the target protein. This sample also has significant signal in replicates A and C.
      • Fig S2c - This blot is very messy and difficult to read, but in general the Pfs16 spots in the IGF don't correlate with the intensities in the anti-Pfs16 western.
      • Fig S2 - This data, and the main figures based on this data, generally don't support the hypothesis that Pfs16 is the specific target. The controls are not as would be expected, and there are no loading controls. Looking at the flow-throughs suggests that there was just more Pfs16 (and possibly total protein) in the treated samples before the enrichment step. The Pfg377 also appears quite variable in the different samples, with replicates B and C not consistent with A.

      The other major concern is with the CETSA analysis, which appears to show very minor stabilisation of Pfs16, but the specificity of this target is questionable, and the data has the following inconsistencies.

      • The supplementary data only shows n=1, yet there are error bars in the main figures. Where did these come from?
      • The samples with apparent destabilisation are all near the edge of large western blots, which often doesn't run straight and has no loading controls. We need to see the loading controls.
      • The melting temperature of Pfs16 is extremely high at around 85 degrees C. Most plasmodium proteins melt at around 50-60 degrees (Dzekian et al, 2019). Even the cited work on membrane proteins didn't go to those temperatures (Kawatkar et al, 2019) Can this high temperature be explained, and has the CETSA approach been validated at such high temperatures where additional physical and chemical processes may be occurring in the sample?
      • The lack of difference between + and - isomers suggests that the very small stabilisation observed here is not specific to drug activity, but is more likely a non-specific binding effect. Additional negative control compounds might help here, but the + isomer is probably the best negative control (albeit the concentrations were not ideal in the presented data).
      • The very high concentration (100uM) increases the chances of non-specific effects being observed here (especially since the authors claim to see stabilisation at about 10nM). The study should be repeated at lower concentrations (with negative controls) in order to confirm a specific binding effect.
      • The concentration-ranging study was performed at 78.4 degrees, at which temperature very little denaturation of Pfs16 occurs fig S4a (and Fig 3b-c). Therefore, you would not expect to see any drug-induced stabilisation, and it is not plausible that significant stabilisation could occur at this temperature. Therefore, the apparent destabilisation at sub-10nM drug concentrations is highly questionable.
      • Stabilisation of Pfs16 did not occur in lysates from younger gametocytes (fig s4g-h), but this is a biophysical assay, so regardless of the function of this protein at different stages, the biophysical interaction between the drug and the protein should be the same regardless of the source of the protein. This data argues against Pfs16 being a specific binding target of Pfs16.

      In addition to the above concerns, the fact that this compound doesn't inhibit the earlier functions of Pfs16 in gametocytogenesis, and that it doesn't inhibit P. berghei, also argue against this being the specific target of this drug. Whilst the authors have a valid argument that these findings don't exclude the possibility of stage-specific targeting of Pfs16, we could also argue that all the phenotypic data in figures 4-6 is merely correlative of a drug that acts at the same point in the lifecycle as Pfs16.

      Overall, I believe that significant additional studies would be required to identify the target of this compound. Either by repeating the included studies with additional controls and conditions, or by follow-up studies such as genetic manipulation (knock-down or overexpression) or heterologous expression and biophysical binding studies. Alternatively, the manuscript could be restructured as primarily a report on the phenotypic effect of this compound on microgametogenesis, with the target identification work reported as a hypothesis-generating chemoproteomics study that provides some ideas about possible targets, but requires substantial follow-up to confirm the target (which may be beyond the scope of this report?).

      Minor comments

      The manuscript is very well-written and presented.

      Several of the conclusions are overstated (as detailed above) and several statements should be tempered based on this data (e.g. statements linking DDD01028076 effects to Pfs16 function).

      I find the term 'crosslinking' confusing for the photo-affinity labelling, as crosslinking in proteomics often refers to crosslinking between proteins (not between protein and drug).

      The data and terminology around activity (IC50) for compounds in table 1 is a little confusing. Some IC50 values are reported as >1000, while others have precise mean values reported over 1000, and others are >10,000 or >25,000. This is especially confusing where 9 is claimed to have retained activity, but is >1000. If consistent thresholds are not appropriate then perhaps including dose response curves in the supp data might be necessary to explain these?

      Significance

      The work is potentially interesting to Plasmodium biology and drug discovery researchers. The concept of a transmission-blocking drug is quite attractive to this community, so the topic is highly relevant. Keeping in mind that this compound was reported previously, the main novelty is in defining it's window of activity during the microgametogenesis process, and differentiating this from other drugs/compounds that inhibit this process. There is clearly an advance in knowledge presented here.

      If Pfs16 were to be confirmed as the target of this series then I think that this study would have much greater impact and attract interest from a broad audience. However, at this stage I don't see strong evidence for this hypothesis, and some of this data casts significant doubt on the likelihood that Pfs16 is the direct target.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Transmission blocking drugs are of high interest as a strategy to combat malaria but they are difficult to study. For instance it is problematic to raise resistant parasites to find mode of action of transmission blocking drugs and to identify their targets in the cell. In this manuscript Yahiya et al. build on previous work which identified the N-4HCS scaffold, of which DDD01035881 is the lead compound, as an inhibitor P. falciparum male gametocytes. Using PAL to enrich for target proteins Pfs16 was identified and validated as a possible target of DDD01035881. Binding was validated through CESTA. Determination of the phenotype following DDD01035881 treatment was found to partially match the previously published Pfs16 KO phenotype. However curiously no impact was seen in gametocytogenesis despite published evidence of Pfs16 being involved in sexual conversion. The authors speculate as to reasons but a direct experimental comparison with Pfs16 mutant parasites (which likely would have been revealing) is not provided. On the positive side, this analysis of the stage-specific effect of the drug pinpoint the stage inhibited during microgamete development which is a very interesting part of the manuscript.

      This work deepens the understanding of a novel class of transmission blocking drugs with reasonable potency (foremost (-)-DDD01028076, which has low nanomolar activity, the modified versions considerably less). Question on how to achieve serum concentrations for sufficient potency aside, these compounds will in the very least provide experimental tools to study their mode of action and might reveal interesting biology. This work is therefore of interest to the malaria field.

      The experimental methodology seems excellent but some of the results raise questions that make definite conclusions difficult and this should be addressed. Overall, this is very solid work but leaves some doubts whether Pfs16 is indeed the (only) target of this class of compounds.

      Major comments:

      1. The reasons for excluding Etramp10.3 are not convincing. In fact it could be argued it is nearly as good a candidate as Pfs16. Contrary to the author's statements in the results section, etramp10.3 transcription is highly upregulated in gametocytes (see e.g. PMID: 22129310) with a generally very low transcription in asexual stages. It is argued that Etramp10.3 is essential in blood stages because MacKellar et al failed to disrupt the gene and because the PiggyBAC screen predicted it to be essential. However, if this is an argument for exclusion then this would also apply to Pfs16 which is also predicted by the PiggyBAC screen to be essential (likely both are non-essential in blood stages as they are barely expressed but Pfs16 and Etramp10.3 might by chance have not received an insertion in the PiggyBAC screen due to their very small size which may also explain failure of disrupting integration in MacKellar). Given the finding that the drug binds Pfs16 only in late gams it might also be argued that an essential function in asexuals might not be affected if they behave similarly to young gams and hence this criterion is not valid anyway. Further following this line of thought that ETRAMP10.3 could be a hit equivalent to Pfs16, Figure 2D shows a band below the band considered to be Pfs16. It would not be all surprising if this were ETRAMP10.3 (the size would fit).
      2. Both, Pfs16 and ETRAMP10.3 can be expected to be very abundant proteins in the parasite periphery in gams. Can the authors exclude that these simply are the first to encounter the N-4HCS photoaffinity probe and that this may have led to their enrichment in the target identification experiments. The biochemical data argues for a specific interaction with Pfs16, but by itself is not that strong. Given the discrepancies of the phenotype with the Pfs16 disruption and the peculiar finding that the drug binds Pfs16 only in later stage gametocytes, it might be a good idea to further caution the conclusion of Pfs16 as the inhibited target.
      3. The phenocopy evidence of the NH compounds with the Pfs16 disruption is based on comparison with published evidence. It would have been much preferred to have a side-by-side comparison with the (or an) actual Pfs16 disruption parasite line. Although the authors stress that the phenotype with DD01035881 fits the phenotype of the targeted gene disruption in the results, this only partially matches the cited publication (PMID: 14698439) which concludes there is an effect on the number of gametocytes produced. The exflagellation phenotype in that publication was classified as preliminary. Although this is discussed, the main results text should be adapted to reflect this and the conclusion that Pfs16 may be the target should be further cautioned.

      Minor comments:

      1. From the modifications of the compounds it seems the chemical space for further modification to achieve higher potency is limited with this scaffold. Maybe the authors can comment whether they envisage this to be a potential obstacle.
      2. Line 67: references are superscript.
      3. Line 77: I would recommend replacing 'quiescence' here, a cell that matures is not quiescent.
      4. Line 116: consider removing 'interdisciplinary'.
      5. Line 120: I would caution here (see major comments) and recommend a less definite proclamation of Pfs16 as a promising new drug target
      6. Page 7: compounds 9 is still considered active ("retained micromolar activity"), but in Table 1 this is given as >1000nM. Please add the actual IC50.
      7. Line 138- 173: The order in which this is discussed makes it unclear that the work described was done prior to, and guided, the synthesis of compound 1 and probe 2
      8. Line 194: was the data deposited in a database?
      9. Line 202: introduction as to the benefits of using a competition + probe condition here could aid reader understanding. The interpretation of this data is complicated by the covalent and reversible binding of the two compounds and the weight of this control is therefore difficult to gage.
      10. Table 2 and Extended Data Table 1 show different p values and enrichments for the same hits. This is confusing. It would also be useful to label the hits in the scatter plots in Figure 2 for easy identification and comparison to the tables.
      11. Line 215-218, please correct the data on Etramp10.3 (see major points) and put in perspective to Pfs16 (Etramp10.3 is similarly upregulated in gams where it is highly expressed; PiggyBAC predicts essentiality for Pfs16 and Etramp10.3 in blood stages).
      12. Line 221: the results from the PiggyBAC screen are stated as fact, but what the screen provides is a prediction of the probability of importance for parasite growth. I would replace 'is' with 'is predicted' (even though in the case of Rab1b it seems likely the prediction is correct).
      13. Line 233 and elsewhere: define 'reversibility' (binding? activity?).
      14. Line 240: clarify what is in the cited paper (see major points).
      15. Line 297: We utilised in-lysate...... clunky sentence, please rephrase.
      16. Line 325: reference is missing the year.
      17. Line 343: It is utterly puzzling that binding is specific to Pfs16 in mature gametocytes and I do not find the explanation in the discussion convincing (see point 28 below). Do the authors have another explanation? Could Pfs16 be modified in later gams (or vice versa)?
      18. Line 388: Justification seems odd as a PV protein would be unlikely to directly impact DNA replication. Please rephrase the sentence.
      19. Line 405: remove the 'to'
      20. Line 411: it would be useful to the reader to state at what IC-value the drug was used in these experiments.
      21. Line 431: While the alpha-tubulin staining indicates exflagellation and is similar to the DMSO only control, the staining for the RBC membrane (Glycophorin A) and DNA (DAPI) appear different, yet this is ignored. One interpretation of this could be that while late treatment doesn't block exflagellation, it still impacts other aspects of microgamete development.
      22. Line 436: IFA work was done with drug treatment post activation while EM was done post activation but drug treatment prior to activation. Is there a reason for this?
      23. Line 450: is this really CytB, or was it CytD?
      24. Line 465: Pfs16 localised to vesicles: there is no data showing the dots in the micrograph are vesicles, please rephrase.
      25. Page 19 and 20, discussion on stage-specific differences of Pfs16 during gametocytogenesis to explain the difference in binding: without experimental data using H-4HCS in the parasites of the publication cited to explain this (PMID: 21498641), this is very speculative. The cited work used episomal expression of Pfs16 tagged with fluorescent proteins. This would be the first integral PVM protein that is actually inserted into the PV membrane when tagged in that way (usually this results in a PV location), casting some doubt on the findings in that paper. All in all the provided explanation is not very convincing.
      26. Line 519: if with the conserved part the N-terminus is meant, then this has for other PVM proteins already been shown to be PVM internal, not facing the erythrocyte (show in very early work; PMID: 1852170 but also multiple times after that).
      27. Line 534: consider replacing 'highly plausible' with something more cautious.
      28. Line 550: Given this discussion how stable are N- 4HCS compounds?
      29. Table 1: Having all chemical structures in same orientation would be nicer visually. I assume blue indicates modification but this is not stated.
      30. Figure 1: Please use different colours or symbols. The dark green crosses and the blue Pfs16 cross are hard to distinguish.
      31. Figure 3d: Unclear as to why a difference temperature range is displayed here.
      32. Figure 3e: Unclear % Inhibition compared to what.
      33. Figure 5G: What is the white arrow pointing to?
      34. Figure 5j: Given how the explanation is written this would make more sense between current image 5G and 5J.
      35. Figure 6: Erythrocyte membrane colour not stated in legend.
      36. Figure 6A: were the exposure times similar? How can so little be left after ~4-5.5 minutes but at later time points there seems to be much more Pfs16 signal left? Maybe amount of signal should be taken into consideration to establish the fate of Pfs16 in the process.
      37. Figure 6B: is the second phenotype (successful but aberrant egress) shown? The only image where WGA is not circular around the parasite is an exact match of Pfs16 which is in dots (image at 7.5-8.5 minutes). The imaging data for this phenotype should be presented more clearly.

      Significance

      Nature and significance: a lot of weight has been placed on transmission blocking drugs although there are also a number of problems associated with them (ethics for testing and use etc; drugs acting on asexuals and transmission stages alike might be even more useful). Transmission blocking drugs are difficult to study and this work is therefore important. The experiments are well done, but the conclusions are not fully convincing, leaving some doubts in regard to Pfs16 being the actual target of the class of drugs studied.

      Compare to existing published evidence: it is a logic continuation of previous work and this is appropriately highlighted in the manuscript.

      Audience: medium interest for malaria researchers; high interest for researchers working on transmission blocking drugs and those studying microgametes.

      Your expertise: malaria, P. falciparum, biology of apicomplexans

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors develop a previously identified lead compound for the blocking of malaria transmission from humans to mosquitoes further and identified a protein target of the chemical. The protein target, Pfs16 is long known to be upregulated in gametocytes and has been speculated to be a target for small molecules. The work is well (if at time maybe too well/too detailed) described and potential shortfalls are highlighted.

      My major comment is that without a deletion mutation of Pfs16, the paper will remain somewhat preliminary. I would strongly encourage the authors to generate such a mutant and compare it to the parasites treated with their drug candidate. I feel the text can be much shortened and a lot of information moved to the materials and methods. The conclusions should be toned down on several occasions (abstract, introduction, discussion). Avoid adjectives, e.g. what is a 'powerful starting point' (abstract) or 'compelling interdisciplinary evidence' but hot air?

      Referees cross-commenting

      I think these three reviews are pretty much in line with their overall assessment. I am happy if send as is to authors as it will help them shape a much better paper

      Significance

      The paper shows that very likely a new chemical with some potential for transmission inhibition of malaria parasites for mosquitoes binds to a Plasmodium protein that is specifically expressed in the sexual stages of the parasite.

      The paper compares to good papers published in journals like ACS Infectious Diseases or Antimicrobial Agents and Chemotherapy, but I am not sure which of the Review Commons sister journals it would fit to. I am a molecular parasitologist.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: * Saha et al. characterize Drosophila egg chambers that are mutant for cup and identify an increase in the number of a specialized type of follicle cells, the border cells. They demonstrate that this increase correlates with an expanded domain of STAT activity and reduced Notch signaling in anterior follicle cells. Determining that cup is required in the germline cells, the authors postulate and provide some evidence that cup mutants prevent germline Delta from properly signaling to follicle cells. In line with this, they also show that blocking endocytosis phenocopies some aspects of cup mutants, particularly border cell numbers and Delta levels, which they monitor cytoplasmically and at the cell surface. Lastly, they demonstrate that activation of Rab11 can rescue Delta levels and border cell number in cup mutants. They conclude that a key function of Cup in the germline is to traffic Delta to signal to follicle cells, and that the endocytic processing of Delta is required for its function.*

      Major comments:

      • The findings of this study are interesting and novel. The authors have completed a lot of experiments and analyzed the results carefully and in great detail. Experimental design is described adequately and statistical analysis is sufficient. While the main results are largely convincing and support the conclusions, there are some weaknesses that need to be addressed.*

      Response: We thank the reviewer for appreciating our work and we have tried to address concerns of the reviewers to the maximal possible extent with the hope to strengthen our claims further.

      One major concern is that the vast majority of the experiments were conducted with a single homozygous allele for cup. The authors claim this was necessary because other alleles arrest oogenesis, which is understandable, but it leaves the potential problem that the allele, a P-element insertion, may affect other genes, or there may be other unidentified mutations on the mutant chromosome. The authors are able to partially rescue the border cell phenotype with overexpression of Cup and can also mimic the outcome with RNAi in the germline, which helps alleviate some of this concern, but this was only done for one set of experiments (those in figure 1). Similar experiments need to be included to demonstrate the same outcomes when cut is disrupted by other alleles/methods for at least some of the Notch/Delta analyses since this is key to the paper's conclusions.

      Response____: We acknowledge the concern raised by reviewer and to address it, we evaluated different allelic combination of Cup to rule out issues with background mutation. We evaluated the Delta count, NICD and border cell numbers in a different allelic background of cup8/ cup01355. Satisfyingly we observed similar results like that observed for cup01355/ cup01355 homozygotes. This result is included as (Fig S1E-G)

      In addition, we have specifically downregulated Cup function in the germline employing the RNAi approach and validated the non-cell autonomous effect of Cup function in border cell fate specification. This result is included in (Fig 1M-O)

      A second concern is that some evidence is circumstantial or indirect. Specifically, the authors argue that the effect of Cut is due to trafficking of Delta, but do not consider the possibility that Delta could be more directly regulated or that other factors may be relevant. Border cell specification is rescued by increasing recycling in cup mutants, but this could be due to recycling of more factors besides Delta. To address this more directly, the authors should overexpress Delta in the germline of cut mutants. It is possible the disruption of Delta in cut mutants is due to changes in Delta protein stability/levels, so the experiment may also clarify this issue. If this is the case, it may be that hypomorphic Delta mutants would have a defect on border cell number, which could be examined separately. If Delta levels are low, endocytosis and recycling increases may also rescue cut mutants indirectly, but the conclusion about what Cut regulates may differ.

      Response: As per the suggestion of the reviewer, we did attempt to over express Delta in the germline of cup mutants egg chambers. Unfortunately, we couldn’t record any Delta overexpression as the available vector (UASt- Delta) can drive stable expression only in the somatic cells but not in the germline cells. However, to check out the possibility if Delta was being directly regulated by Cup, we compared the levels of proteins between wild type and Cup mutant egg chambers (Figure 4E-G). Unlike our expectation we didn’t observe any significant differences in the levels of Delta in Cup compared to the control. This kind of supports our belief that Cup may not be directly regulating the levels of Delta in the germline.

      Another concern is that Cup's main role is a confusing since it regulates many things, including cytoskeleton and cytoskeleton is necessary for general health and vesicle trafficking in the egg chamber - how do the authors think Rab11 upregulation is overcoming these defects?

      Response: We appreciate the reviewer for raising this concern as it kind of intrigued us to examine if the overexpression Rab11CA was rescuing the cytoskeleton too. Interestingly, we observed that Rab11CA overexpression restored the actin filament in Cup mutant germline(figure S6H-K). This result is in line with report that Rab11 effector Nuf can modulate actin polymerization (Jian Cao et al.,2008).

      Rab11CA rescues Delta levels almost completely in cut mutants but only partially rescues Notch activation, suggesting there are other problems in these egg chambers that could contribute to the defects. While exploring possible other factors is beyond the scope of this work, the authors may want to acknowledge this issue.

      Response: We do agree with the reviewer that we only observe partial rescue of the NRE GFP with Rab11CA, it suggests that Cup can affect different aspect of egg chamber development independent of Rab11 function.

      Minor comments:

      It would help the presentation of the paper to introduce Notch/Delta signaling during oogenesis in the introduction. More introduction and clarity about the number of polar cells at early stages and their role in the border cell cluster may also be useful to the reader.

      Response: We have modified the introduction to highlight the role of Notch/ Delta signaling in early oogenesis.

      It is notable that the primary phenotype of a change in border cell numbers is quite subtle, often only affecting 1-2 cells, and the variation in different genotypes and experiments is sometimes also that large. The authors do a good job of being careful to count the cells at a specific developmental time and do appropriate statistical tests within an experiments. Still, it difficult to be sure that the effects are due to the gene being manipulated specifically or the genetic background. Related to this, a few issues should be addressed. Notably, at earlier stages, Notch signaling impacts cell division, so some of the phenotypes might be explained by there being more total cells in the domain instead of more signaling. The authors show Cut is in the same domain and pH3 is similar, but they didn’t seem assess overall numbers.

      Response: As per the suggestion of the reviewer, we assessed the total number of follicle cell nuclei in stage 8 egg chambers. This analysis was done each confocal z slide of the egg chamber taking care that each nuclei (DAPI) was counted only once. Satisfyingly we didn’t observe any significant difference in the number of follicle cell nuclei between wild type and cup mutant egg chambers supporting our earlier claims with pH3 and Cut antibody that cell proliferation is not responsible for the excessive border cell fate in Cup mutants. This result in included in (Fig S2O-Q)

      Secondly, for the stat suppression of cut (figure 2L), the authors need to show the stat-/+ control for comparison to make a conclusion about suppression versus additive effects.

      Response: As per the suggestion of the reviewer, we have included the data for statp1681/+ control in figure 2L.

      In addition, prior work (Wang et al 2007) expressed DN Kuz in border cells and did not see a change in specification, unlike what is claimed here. In the experiment in question, the control has lower than normal numbers of border cells and the DN Kuz has a number more typical of the controls in other experiments- so this is a concern that there is something else in the genetic background influencing the numbers. Other controls could help make this case, but ultimately this result is probably not necessary for the main argument. Thus the authors might consider leaving it out the Kuz analysis or perhaps can comment on the discrepancy with prior published results.

      Response: We have removed the data on Kuzbanian and have added data that suggests that Notch activation in the follicle cells downstream of Cup facilitates specification of appropriate number of migratory border cells (Fig 3K-N).

      Can the authors comment on why the volume of the border cell cluster increases more dramatically (>2x) than the number of cells (30% more)? * Does the increase in border cell number change the migratory capacity? That is, do the clusters in cut mutant egg chambers migrate normally while the egg chamber looks okay?*

      Response: We believe that dramatic increase in the volume of the border cell cluster I (>2x) than the number of cells (30% more) is due the loose arrangement of the cells in the border cell cluster. Interestingly, the cup mutant border cell clusters do exhibit migration defect that we are examine as part of separate study.

      Several of the figure legend titles state conclusions that are over interpretations of the data shown:

      - Figure 3 legend is overstated- these experiments do not assay STAT activity, only border cell number, so the title can be simplified to say that.

      Response: We have modified the Figure legend in line with the data presented.

      - For figure 4, both cytoskeleton and Delta are shown to be disrupted in cup mutants, but they are not directly linked, eg, the experiments do not show a change in Delta in cytoskeletal mutants alone. While it is interesting that cup mutants have disrupted cytoskeleton, ultimately this result is not well connected to the main issue of Notch/Delta signaling; in fact, it becomes confusing how anything can be trafficked to the cell surface if there is poor cytoskeletal organization. Since the authors favor the hypothesis that the cytoskeleton is not the key to the border cell specification difference, they may want to move this result out of figure 4.

      __Response: __We have included the data that suggests that cytoskeleton organization is critical for Delta trafficking. Specifically we demonstrate that treatment of egg chambers with Cytochalasin D exhibits accumulation of Delta in the nurse cell cytoplasm (Fig S5D-F).

      - The Figure 5 legend is also overstated- these experiments show that Delta is higher in cup mutants and endocytosis mutants AND that endocytosis (of something) is required in the germline for border cell number- but these results are not linked in this figure. More evidence for this connection does come later in figure 6. * Some figure legends are quite brief and could benefit from a little more detail on what is being shown*.

      __Response: __We have modified the title of the Figure legends with respect to data presented.

      Figure layout could be improved by keeping images consistent sizes and making sure graph text is large enough to read easily. Figures in general could be streamlined by having negative results and less pertinent results in supplemental data.

      Response: We have reorganized the figures and worked on the graph text for easy read.

      Not all papers cited in the text are in the reference list.

      Responses: We have modified the title of the figure legends and cross checked our reference list with the papers mentioned in the main text.

      CROSS-CONSULTATION COMMENTS

      I generally agree with the other reviewers that there are concerns with the precise function of cup in this context, and that some revision is needed, including editing of the writing. In response to reviewer 2, prior published studies only detected Cup in germline, but it is possible that it is expressed in follicle cells at a low level. The mutant clonal experiment in follicle cells that the authors did had no effect on border cells, so that provides some evidence the role is non-autonomous. I agree with reviewer 2's concern that the authors overstate the connection between cup and Delta and border cells based on their data and need a few more experiments to tie things together. I understand reviewer 3's concerns that the experimental effects on border cell numbers are very small and variable- I listed this as a minor concern, though, since this number is mainly being used as a read-out for STAT signaling levels and the data were extensively quantified and statistically tested.

      Reviewer #1 (Significance (Required)):

      My expertise is in cell migration, developmental biology, and Drosophila genetics. This paper will be of broad interest in these fields as it incorporates aspects of each in its characterization of a new regulatory mechanism to induce a motile cell population non-cell-autonomously, which is an exciting finding. Specifically, the work increases our understanding of the intersection between Notch and Jak/STAT signaling, which many researchers study - these were both known to be involved in border cell specification. The study provides more detailed characterization of the signaling and specification process in general, and makes significant advances in understanding how Delta signals are produced and presented from germline cells to receiving cells in the soma. Cut has not been previously implicated in these signaling pathways, so that is also novel, although its precise mechanistic role here is still somewhat unclear.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Saha et al. made a detailed description of the role of the mRNA binding protein Cup in specifying the number of Border Cells (BC) during Drosophila melanogaster oogenesis. First of all, they show that females homozygote for a hypomorph allele of cup have higher number of BCs compared to Wild Type (WT) females. They present a series of experiments that points towards the phenotype being due to a specific role of cup in the nurse cells that non-cell autonomously regulates BC specification. Also, they show that this phenotype is the result of an increase in the levels of JAK/STAT signalling in the BC, a major determinant of BC

      fate. In addition, they show that cup mutant egg chambers exhibit a downregulation of

      the Notch (N) pathway function in the BCs and that over-activating Notch results in the rescue of the number of BCs. Moreover, the authors present data on the effect of cup in Delta (Dl) trafficking in the nurse cells: They found that cup mutant egg chambers show increased number of Dl puncta within the cytoplasm of the nurse cells, but reduced numbers in the nurse cell-Anterior Follicle Cell (AFC) boundary as a result of defective Dl endocytosis. Finally, they were able to rescue the Dl trafficking phenotype, as well as the number of BC by overexpressing an active form of Rab11.

      Mayor points:

      In this study, the authors employed an hypomorph allele of Cup to generate egg chambers where both germline and somatic cells are mutant for Cup. They did a series of experiments to try to demonstrate that the Border Cell (BC) specification phenotype they observe is non-cell autonomous and that is due to the Loss of Function (LOF) of Cup exclusively in the nurse cells. Although I appreciate the difficulties of eliminating or reducing the levels of Cup specifically in the nurse cells only during mid-oogenesis, I feel like this is key to be able to claim that this effect of Cup in BC specification is really non-cell autonomous. The reasons why I still have some doubts that there might be some cell autonomous effects in the FCs are the following:

      o The authors show that cup01355 mutant egg chambers have a phenotype in Dl trafficking. Although they analysed in detail the effects on Dl in the nurse cells, their images show that there might be a defect in Dl levels/trafficking in the Follicle Cells (FCs) as well (Fig5A-B). It has been shown that Dl mut FCs have reduced levels of Notch activity due to reduced lateral inhibition (Poulton et al., 2011), so there is a possibility that the reduced levels of Notch activity in the cup01355 egg chambers might be due, partially, to defects in Dl trafficking/levels in the FCs, rather than in the nurse cells. o The authors tested the role of the Notch pathway in the cup mutant phenotypes by measuring the number of NICD puncta in the signal receiving cells as proxy for Notch activity (Fig4). Although I understand the rationale, I am not convinced that they can completely rule out that the changes in NICD puncta number in FCs is not due to some effect of cup LOF on Notch trafficking in these cells.

      o In figure 6, the authors show that expression of a constitutively active form of Rab11 specifically in the nurse cells restores the BC number to that of the WT. However, the levels of Dl particles and, especially the levels of NRE-GFP expression, remains slightly lower than in the WT conditions.

      Response: We do agree with the reviewer that we only observe partial rescue of the NRE GFP with Rab11CA, it suggests that Cup can affect different aspect of egg chamber development independent of Rab11 function. This has been acknowledged in the main text and it now reads as “We did note that irrespective of partial rescue in the levels of NRE-GFP and Delta puncta count, a complete reversion to wild type border cell numbers was observed when Rab11CA was overexpressed in the cup mutant germline. This may suggest either that border cell fate specification is quite robust beyond a certain base level of signaling or Cup may affect other aspects of egg chamber development independent of Rab11 function.”

      One of the main conclusions of this study is that cup regulates BC specification through a non-cell autonomous mechanism that involves communication between nurse cells and AFCs. For that reason, I think in order to conclusively say that, the authors need to try to remove the function of cup specifically in the nurse cells. They mentioned they have tried different ways of doing this unsuccessfully, but do not specify how they have tried. I suggest using the cup-RNAi line combined with a nurse cell specific Gal4 and a ubiquitous gal80ts line (tub-Gal80ts), if they have not try this. I do not expect the authors to repeat all the experiments with this condition, but at least they should test the main findings i.e. number of BCs, JAK/STAT overactivation and Notch attenuation.

      Response: To further support the non-autonomous role of Cup in border cell fate specification, we down regulated Cup function in germline nurse cells employing Mat-alpha GAL4 and Cup RNAi. Since Mat-alpha GAL4 driver has weak expression in the nurse cells of early stage chambers, it enabled us to evaluate Cup function during mid oogenesis. Consistent with our expectation, we observed higher number number of border cells in the migratory cluster compared to the control supporting our conclusion that germline Cup modulates the number of adjacent anterior follicle cells that acquire migratory border cell fate. The above results are included in (Fig 1M-O). In addition over expression if Cup cDNA in the anterior follicle cells failed to the rescue the excessive border cells observed in the Cup mutant egg chambers supporting the germline role of Cup further. This result in included in (Fig S1L-O).

      • The authors have shown in Figure 3 that there is a decrease in Notch signalling in the AFCs in cup01355 egg chambers. In order to test that the BC number phenotype observe in this condition is due to that effect on Notch signalling they have done a rescue experiment using the antimorphic Notch allele Nax-16. Since in this condition all cells (nurse cells and FCs) have increased levels of Notch, they cannot conclusively say that the increase in Notch function in the FCs rescues the cup

      phenotype. If they want to show that the function of Notch is specifically needed in the FCs, they should over-activate Notch exclusively in the AFCs. For instance, they could express a constitutively active form of Notch, such as UAS-NICD (Go et al., 1998) or UAS-NDECD (Fortini et al., 1993), specifically in the AFCs. Otherwise, they should re-write the text since they cannot conclusively say that the increase in Notch function in the FCs rescues the cup phenotype.

      Response: Following the suggestion of the reviewer, we attempted over expression of NICD in the follicle using driver slbo-GAL4 in the cup mutant background. Gratifyingly, we observed rescue in the border cell fate of Cup mutant egg chambers. However, we didn’t observe any rescue in the morphology of nurse cell nuclei of Cup mutants. This supports our conclusion that increase in Notch function in the FCs rescues the cup phenotype with respect to the border cell fate only. (Fig 3K-N).

      • The authors had made a great effort to prove that proper Delta endocytosis in the nurse cells is essential for adequate Notch signalling in the AFCs and right number of BCs recruitment. Specifically:

      o They checked the consequences on Dl trafficking of down-regulation of rab5 or auxilin, but they did not test the effect in BC numbers * o They show that downregulating the function of shi affects the number of BCs, but did not show the effect of this condition in Dl trafficking. * Consequently, they cannot conclusively say that effects on trafficking of Dl affect number of BCs, since they haven't really tested both effects on the same background. I think that for simplification, they should test both, effects on Dl trafficking and number of BCs in one of those genetic backgrounds and leave the other two for supplementary material. Alternatively, they should re-write their conclusion for this section.

      Response: As Rab11GTPase over expression rescued the excessive border cell fate in the cup mutants, to test the specificity we downregulated Rab11 function in the germline itself to check Delta trafficking and border cell fate specification. We employed a late expressing GAL4 driver in the germline and observed that down regulation of Rab11 function resulted in more number of follicle cell acquiring border cell fate and decrease in the number of Delta puncta at the interface of Anterior follicle cells and nurse cells. This phenotype is reminiscent of the Cup mutants suggesting that perturbing the recycling component of endocytosis perse affects border cell fate and Delta trafficking. This result in included in (Fig 6D-I)

      • Their results clearly show that Dl accumulates in puncta, suggesting that there might be a defect in Dl trafficking, and although their rescue experiments point towards an scenario where Rab11-dependent Dl recycling is being affected, I think there are some weak points on their arguments. The fact that Rab11-KD does not generally affect Notch signalling in the FCs, as shown in (Windler & Bilder, 2010) argues against their conclusion that the effect of cup in nurse cells on Rab11 function is responsible for the defects in Dl trafficking and, subsequently, on Notch activity in AFCs. An alternative explanation is that Rab11 overactivation in the Cup mutant background compensates for a different defect on Dl trafficking, for example, Rab4-dependent recycling pathway. Another possibility is that AFCs could be specially sensitive to changes in Rab11-dependent Dl trafficking defects in the nurse cells. To distinguish between these two possibilities, they should perform some of the following experiments:
      • o First of all, there are a number of endosome markers that can be used to check in which step of the endocytic route Dl is being accumulated, including (but not limited to) anti-Rab11 antibody, anti-Rab5, anti-Rab7, tub-Rab4-mcherry. They should do co-localization experiments with Dl and endosomal markers.*
      • o Also, they could check what happens to the number of BCs and Dl trafficking when Rab11 function is blocked in the nurse cells, in a similar way to what they did with Auxillin, Rab5 and Shi. They could use some of the tools described in (Satoh et al., 2005)*

      Response: We have perturbed Rab11 function during mid oogenesis which is quite distant from early stage egg chambers examined by Windler & Bilder. We observed that down regulation of Rab11 activity in germline affects both border cell fate in the AFCs and Delta trafficking in the germline itself. Protein Trap analysis of Rab11 in wild type and Cup mutant background suggests Rab11 is enriched in the trans-golgi network where the activity of Rab11 is modulated through nucleotide exchange. Over all our results suggest that Rab11 activity is diminished in the cup01355 egg chambers and thus stimulating the recycling endocytosis restores Notch signalling in the AFCs, limiting JAK-STAT activation and restricting BC cell fate specification.

      • The authors final model is one in which cup in the nurse cells regulates Rab11 function to ultimately control JAK/STAT signalling in the AFCs. However, they have not looked at the status of JAK/STAT signalling in their Rab11-CA rescue experiments. I think this experiment will really round-up their work.* Response: The border cell fate is linked to activation of JAK-STAT signaling in the anterior follicle cells. As we have already exhausted the STAT antibody, it will difficult to access the levels of STAT perse.

      Minor points:

      • The authors tested if the extra BC phenotype observed in the cup mutant egg chambers is due to defects in FCs endoreplication. I have two questions related to this section.*

      • o First of all, I do not understand the rationale behind this idea that defects in FCs endoreplication would result in extra BCs. Please explain and add any relevant references.*

      • o Secondly, they say that they used Cut and Phospho-Histone3 as endoreplication markers. I believe that what they mean is that the absent of these two markers indicates that FCs have exit the cell cycle and enter the endocycle (Sun & Deng, 2005), however they are not markers of endoreplication. Please, re-write to make this clear.*

      Response: The follicle cell exhibits a switch from mitotic to endocycle phase at a particular stage of oogenesis (Sun & Deng’ 2005). Our premise is that incase this switch is delayed, will the extra proliferation can account for the excessive border cell fate? In this context we have modified the text to render clarity to this section.

      • The authors tested whether the levels of Notch activity were altered in the cup mutant egg chambers. For that, they used an NRE-GFP construct that shows a clear reduction in the levels of Notch activity in the AFCs. They also used the number of NICD and NECD puncta in signal receiving and sending cells respectively, as proxy of Notch activity. Although I understand the rationale, there are other explanations for this phenotype as discussed above. Thus, if they want to have an alternative way of showing the dampening of Notch signalling, they could use the levels of expression of well characterised targets of Notch in the FCs, such us hnt and E(spl)mb-CD2 or E(spl)m7. Response: We believe that our new set of data with NICD over expression (in the AFCs) rescuing border cell fate in Cup mutants coupled with NRE-GFP, NICD, NECD data now lends stronger support to our claim that Notch signaling in the follicle cells is indeed downstream of Cup function in developing egg chambers.

      • In M&M the authors explain that NRE-GFP levels were expressed in Fold change. However, in figure 3C the units of the graph are Fluorescence Intensity in a.u. Please,*

      check this small inconsistency

      Response: We have modified this as per reviewer’s suggestion.

      • In figure 4, they show the quantification of tubulin fibres within the nurse cells, however they are missing a similar analysis of Phalloidin (Pha) fibres/levels. I think this experiment and figure will be more complete if the authors added such a quantification of the effects of cup LOF in Pha distribution. Also, the authors do not show the single Pha channel in Fig4C, which would greatly helped to appreciate the differences between the WT and Cup LOF nurse cells. I suggest modifying the figure to better show the changes in Pha distribution. Response: We have modified the figure and included quantitation of actin fibre length in Supplementary figure 6H- K.

      • In figure 4F-G the authors are showing the general effect of cup LOF in Delta distribution. They indicate with yellow arrowheads the cytoplasmic Dl puncta accumulation in the nurse cells, however it is almost impossible to see such puncta with that level of magnification/resolution. I suggest removing the arrowheads, since the figure 4H-I shows the same puncta more clearly. Response: We have modified the figure to render clarity

      • In the Dl trafficking experiments (Fig4 H-I,K,L and Fig5A-C), the authors measured the number of puncta in the anterior nurse cell-follicle cell junction. In order to do those types of quantifications they need to be able to tell the cell boundaries that separate FCs from the nurse cells. Please, clarify the criteria for determining if the puncta are within the FCs or the underlying nurse cells. Response: Delta, NICD, NECD proteins marks the apical surface of the follicle cells. We used this as a reference to segregate nurse cell puncta with respect to follicle cells. This has been elaborated in the Material & Method section.

      • In figure 6C-D the authors show example images of egg chambers expressing Rab11-CA-YFP using the germline specific nos-Gal4. However, in the images it looks like the YFP signal is coming from the surrounding stretched FCs. Please check that these are the right images or explain the inconsistency.

      Response: We have crosschecked the images and the YFP signaling is from nurse cell periphery which gives the wrong impression that it is from stretched follicle cells.

      • In figures 1R, 2L, 3Q, 6I, 6M, the authors should show the results of the statistical analysis between all the conditions tested. I think that this is crucial to be able to tell whether some of the rescues are complete or only partial. *Responses: To avoid cramming the Figures, we have including some of the p values in the Figure legends. *

      • Line 174: should say "mutant egg chambers".*
      • Line 281: There is a reference that is missing from reference list: Liu et al., 2010;*
      • Line 292: The reference for the NRE-GFP construct is not the correct one, since that references to a review article. Please, add the correct reference.*
      • In line 462 of the manuscript you have a reference that is missing from your reference list.*
      • In line 394 the authors say: "protein, it's enrichment in the cytoplasmic fraction of the cup mutant egg chambers", but I think that they meant mutant nurse cells.*

      Response: We have modified the text as per the all the suggestions above Reviewer #2 (Significance (Required)):

      The BC migration is an excellent model to study collective cell migration and how epithelial cells can acquire migratory behaviours. After years of study, there is good understanding of the signals and genetic circuits that regulate BCs specification and migration (Montell et al., 2012), but there are not many studies, to my knowledge, that describe a role of nurse cells in specifying or guiding the migration of these cells. Thus, this study by Saha and colleagues is one of the first studies that show a role for nurse cells in specifying the number of BCs.

      My field of expertise is in cell-cell communication through different pathways, including Notch and Integrin signalling. I have studied the role of endocytosis in regulating Notch signalling in various contexts, including follicular epithelium in Drosophila ovaries.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This manuscript describes an investigation into the signaling that induces the differentiation of follicle cells into border cells in the Drosophila ovary. Previous studies have established the border cells as an informative model for studying how epithelial cells delaminate and undergo collective cell migration, and have identified the JAK-STAT and Notch pathways as important regulators of the process. Here, the authors performed a forward genetic screen and identified cup as another gene that is involved in the regulation of border cell differentiation. Their findings are consistent with a model in which cup is required in germ cells for the endocytosis of the Notch ligand, Delta. In cup mutants, impaired trafficking of Delta leads to decreased Notch signaling in follicle cells, which allows for increased JAK-STAT expression in follicle cells and an increase in the number of follicle cells that differentiate into border cells. Overall, the approach is thorough and the phenotypes are clear and well-described. The quantification of phenotype penetrance and of aspects of the images, such as pixel intensities and the number of particles in a region is a strength of the paper. The use of multiple independent methods to test key points is another strength. However, there are several concerns that should be addressed before the paper is considered for publication:

        1. The central phenotype that this paper is based on is a difference in the number of border cells per cluster in wildtype and mutant genotypes. However, this phenotype is fairly subtle in some cases (e.g. in Fig. 2L, it varies by only about 10% between control and mutant) and it is somewhat variable. For example, the number of cells in border cell clusters of the controls range from 4.49 in Fig. 3M to 6.41 in Fig. 1F. Considering that the mutant values fall within this range in some cases (e.g. 5.98 in Fig 3M) and the difference between the means from control and mutant genotypes is often less than two, the significance of this phenotype is unclear. How does this compare to other mutants that have been described to affect border cell specification? Are there any consequences for the differentiation of the follicle or the function of the egg caused by this defect?*

      Response: We are using the border cell number as readout for the output of JAK-STAT signaling. Though the difference in numbers may appear to be subtle, we believe our data clearly demonstrates that Cup non cell autonomously regulates border cell fate by modulating Notch signaling in the follicle cells*. *

      • Wang, et al. (PMID 17010965) have described previously that Notch signaling, and*

      Kuzbanian specifically, is required for border cell migration. The authors should cite this paper and discuss their findings in light of this study. For example, if Notch signaling is impaired in cup mutants, is border cell migration also impaired? Likewise, the citation of the Assa-Kunik, 2007 study as evidence that Notch and JAK-STAT signaling act antagonistically (Line 286) is a bit of an oversimplification. While that study does show that Notch and JAK-STAT act antagonistically at earlier stages of follicle development, Fig. 6 of that paper shows that a Notch reporter and a JAK-STAT reporter are both expressed concomitantly in border cells of a Stage 10 follicle and in the anterior follicle cells of what looks like a Stage ~8 follicle. The authors should discuss the apparent contradiction between their findings and this study.

      Response: We provide genetic evidence to support our claims that Cup in the germline modulates Notch activation in the anterior follicle cells thus limiting border cell fate specification to a few. The overlap in the expression of Notch reporter m7-lacz and STAT in the follicle cells and border cells is interesting and will need further investigation in real time to decipher any comparison between the two studies.

      • Lastly, the manuscript contains many grammatical errors, incomplete sentences, improper punctuation and spacing, and informal writing, such as the use of contractions. It should be thoroughly edited for content and clarity.*

      Response: We have tried to edit the manuscript with the aim to improve on the language, grammar and punctuations.

      Reviewer #3 (Significance (Required)):

      Although the identification of cup as a contributor to the regulation of border cell differentiation is novel, the other main regulators investigated in this study, including Notch and JAK-STAT signaling, have been identified previously. The role of cup in this context seems to be to fine tune Notch signaling and it seems to play a relatively minor role in the process of border cell specification. In addition, the conclusions of this paper are not well-integrated into the existing literature on Notch and JAK-STAT signaling in border cells, and the discussion about the broader implications of this study for the understanding of Notch signaling was not well-developed. However, the careful documentation and quantification of the phenotypes reported in this study adds rigor and allows for firm conclusions. For these reasons, this study may have a lasting but perhaps somewhat incremental impact on the study of border cell migration in the Drosophila ovary.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript describes an investigation into the signaling that induces the differentiation of follicle cells into border cells in the Drosophila ovary. Previous studies have established the border cells as an informative model for studying how epithelial cells delaminate and undergo collective cell migration, and have identified the JAK-STAT and Notch pathways as important regulators of the process. Here, the authors performed a forward genetic screen and identified cup as another gene that is involved in the regulation of border cell differentiation. Their findings are consistent with a model in which cup is required in germ cells for the endocytosis of the Notch ligand, Delta. In cup mutants, impaired trafficking of Delta leads to decreased Notch signaling in follicle cells, which allows for increased JAK-STAT expression in follicle cells and an increase in the number of follicle cells that differentiate into border cells. Overall, the approach is thorough and the phenotypes are clear and well-described. The quantification of phenotype penetrance and of aspects of the images, such as pixel intensities and the number of particles in a region is a strength of the paper. The use of multiple independent methods to test key points is another strength. However, there are several concerns that should be addressed before the paper is considered for publication:

      1. The central phenotype that this paper is based on is a difference in the number of border cells per cluster in wildtype and mutant genotypes. However, this phenotype is fairly subtle in some cases (e.g. in Fig. 2L, it varies by only about 10% between control and mutant) and it is somewhat variable. For example, the number of cells in border cell clusters of the controls range from 4.49 in Fig. 3M to 6.41 in Fig. 1F. Considering that the mutant values fall within this range in some cases (e.g. 5.98 in Fig 3M) and the difference between the means from control and mutant genotypes is often less than two, the significance of this phenotype is unclear. How does this compare to other mutants that have been described to affect border cell specification? Are there any consequences for the differentiation of the follicle or the function of the egg caused by this defect?
      2. Wang, et al. (PMID 17010965) have described previously that Notch signaling, and Kuzbanian specifically, is required for border cell migration. The authors should cite this paper and discuss their findings in light of this study. For example, if Notch signaling is impaired in cup mutants, is border cell migration also impaired? Likewise, the citation of the Assa-Kunik, 2007 study as evidence that Notch and JAK-STAT signaling act antagonistically (Line 286) is a bit of an oversimplification. While that study does show that Notch and JAK-STAT act antagonistically at earlier stages of follicle development, Fig. 6 of that paper shows that a Notch reporter and a JAK-STAT reporter are both expressed concomitantly in border cells of a Stage 10 follicle and in the anterior follicle cells of what looks like a Stage ~8 follicle. The authors should discuss the apparent contradiction between their findings and this study.
      3. Lastly, the manuscript contains many grammatical errors, incomplete sentences, improper punctuation and spacing, and informal writing, such as the use of contractions. It should be thoroughly edited for content and clarity.

      Significance

      Although the identification of cup as a contributor to the regulation of border cell differentiation is novel, the other main regulators investigated in this study, including Notch and JAK-STAT signaling, have been identified previously. The role of cup in this context seems to be to fine tune Notch signaling and it seems to play a relatively minor role in the process of border cell specification. In addition, the conclusions of this paper are not well-integrated into the existing literature on Notch and JAK-STAT signaling in border cells, and the discussion about the broader implications of this study for the understanding of Notch signaling was not well-developed. However, the careful documentation and quantification of the phenotypes reported in this study adds rigor and allows for firm conclusions. For these reasons, this study may have a lasting but perhaps somewhat incremental impact on the study of border cell migration in the Drosophila ovary.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Saha et al. made a detailed description of the role of the mRNA binding protein Cup in specifying the number of Border Cells (BC) during Drosophila melanogaster oogenesis. First of all, they show that females homozygote for a hypomorph allele of cup have higher number of BCs compared to Wild Type (WT) females. They present a series of experiments that points towards the phenotype being due to a specific role of cup in the nurse cells that non-cell autonomously regulates BC specification. Also, they show that this phenotype is the result of an increase in the levels of JAK/STAT signalling in the BC, a major determinant of BC fate. In addition, they show that cup mutant egg chambers exhibit a downregulation of the Notch (N) pathway function in the BCs and that over-activating Notch results in the rescue of the number of BCs. Moreover, the authors present data on the effect of cup in Delta (Dl) trafficking in the nurse cells: They found that cup mutant egg chambers show increased number of Dl puncta within the cytoplasm of the nurse cells, but reduced numbers in the nurse cell-Anterior Follicle Cell (AFC) boundary as a result of defective Dl endocytosis. Finally, they were able to rescue the Dl trafficking phenotype, as well as the number of BC by overexpressing an active form of Rab11.

      Major points:

      • In this study, the authors employed an hypomorph allele of Cup to generate egg chambers where both germline and somatic cells are mutant for Cup. They did a series of experiments to try to demonstrate that the Border Cell (BC) specification phenotype they observe is non-cell autonomous and that is due to the Loss of Function (LOF) of Cup exclusively in the nurse cells. Although I appreciate the difficulties of eliminating or reducing the levels of Cup specifically in the nurse cells only during mid-oogenesis, I feel like this is key to be able to claim that this effect of Cup in BC specification is really non-cell autonomous. The reasons why I still have some doubts that there might be some cell autonomous effects in the FCs are the following:
        • The authors show that cup01355 mutant egg chambers have a phenotype in Dl trafficking. Although they analysed in detail the effects on Dl in the nurse cells, their images show that there might be a defect in Dl levels/trafficking in the Follicle Cells (FCs) as well (Fig5A-B). It has been shown that Dlmut FCs have reduced levels of Notch activity due to reduced lateral inhibition (Poulton et al., 2011), so there is a possibility that the reduced levels of Notch activity in the cup01355 egg chambers might be due, partially, to defects in Dl trafficking/levels in the FCs, rather than in the nurse cells.
        • The authors tested the role of the Notch pathway in the cup mutant phenotypes by measuring the number of NICD puncta in the signal receiving cells as proxy for Notch activity (Fig4). Although I understand the rationale, I am not convinced that they can completely rule out that the changes in NICD puncta number in FCs is not due to some effect of cup LOF on Notch trafficking in these cells.
        • In figure 6, the authors show that expression of a constitutively active form of Rab11 specifically in the nurse cells restores the BC number to that of the WT. However, the levels of Dl particles and, especially the levels of NRE-GFP expression, remains slightly lower than in the WT conditions.

      One of the main conclusions of this study is that cup regulates BC specification through a non-cell autonomous mechanism that involves communication between nurse cells and AFCs. For that reason, I think in order to conclusively say that, the authors need to try to remove the function of cup specifically in the nurse cells. They mentioned they have tried different ways of doing this unsuccessfully, but do not specify how they have tried. I suggest using the cup-RNAi line combined with a nurse cell specific Gal4 and a ubiquitous gal80ts line (tub-Gal80ts), if they have not try this. I do not expect the authors to repeat all the experiments with this condition, but at least they should test the main findings i.e. number of BCs, JAK/STAT overactivation and Notch attenuation. - The authors have shown in Figure 3 that there is a decrease in Notch signalling in the AFCs in cup01355 egg chambers. In order to test that the BC number phenotype observe in this condition is due to that effect on Notch signalling they have done a rescue experiment using the antimorphic Notch allele Nax-16. Since in this condition all cells (nurse cells and FCs) have increased levels of Notch, they cannot conclusively say that the increase in Notch function in the FCs rescues the cup phenotype. If they want to show that the function of Notch is specifically needed in the FCs, they should over-activate Notch exclusively in the AFCs. For instance, they could express a constitutively active form of Notch, such as UAS-NICD (Go et al., 1998) or UAS-NECD (Fortini et al., 1993), specifically in the AFCs. Otherwise, they should re-write the text since they cannot conclusively say that the increase in Notch function in the FCs rescues the cup phenotype. - The authors had made a great effort to prove that proper Delta endocytosis in the nurse cells is essential for adequate Notch signalling in the AFCs and right number of BCs recruitment. Specifically: - They checked the consequences on Dl trafficking of down-regulation of rab5 or auxilin, but they did not test the effect in BC numbers - They show that downregulating the function of shi affects the number of BCs, but did not show the effect of this condition in Dl trafficking. Consequently, they cannot conclusively say that effects on trafficking of Dl affect number of BCs, since they haven't really tested both effects on the same background. I think that for simplification, they should test both, effects on Dl trafficking and number of BCs in one of those genetic backgrounds and leave the other two for supplementary material. Alternatively, they should re-write their conclusion for this section. - Their results clearly show that Dl accumulates in puncta, suggesting that there might be a defect in Dl trafficking, and although their rescue experiments point towards an scenario where Rab11-dependent Dl recycling is being affected, I think there are some weak points on their arguments. The fact that Rab11-KD does not generally affect Notch signalling in the FCs, as shown in (Windler & Bilder, 2010) argues against their conclusion that the effect of cup in nurse cells on Rab11 function is responsible for the defects in Dl trafficking and, subsequently, on Notch activity in AFCs. An alternative explanation is that Rab11 overactivation in the Cup mutant background compensates for a different defect on Dl trafficking, for example, Rab4-dependent recycling pathway. Another possibility is that AFCs could be specially sensitive to changes in Rab11-dependent Dl trafficking defects in the nurse cells. To distinguish between these two possibilities, they should perform some of the following experiments: - First of all, there are a number of endosome markers that can be used to check in which step of the endocytic route Dl is being accumulated, including (but not limited to) anti-Rab11 antibody, anti-Rab5, anti-Rab7, tub-Rab4-mcherry. They should do co-localization experiments with Dl and endosomal markers.<br /> - Also, they could check what happens to the number of BCs and Dl trafficking when Rab11 function is blocked in the nurse cells, in a similar way to what they did with Auxillin, Rab5 and Shi. They could use some of the tools described in (Satoh et al., 2005) - The authors final model is one in which cup in the nurse cells regulates Rab11 function to ultimately control JAK/STAT signalling in the AFCs. However, they have not looked at the status of JAK/STAT signalling in their Rab11-CA rescue experiments. I think this experiment will really round-up their work.

      Minor points:

      • The authors tested if the extra BC phenotype observed in the cup mutant egg chambers is due to defects in FCs endoreplication. I have two questions related to this section.
        • First of all, I do not understand the rationale behind this idea that defects in FCs endoreplication would result in extra BCs. Please explain and add any relevant references.
        • Secondly, they say that they used Cut and Phospho-Histone3 as endoreplication markers. I believe that what they mean is that the absent of these two markers indicates that FCs have exit the cell cycle and enter the endocycle (Sun & Deng, 2005), however they are not markers of endoreplication. Please, re-write to make this clear.
      • The authors tested whether the levels of Notch activity were altered in the cup mutant egg chambers. For that, they used an NRE-GFP construct that shows a clear reduction in the levels of Notch activity in the AFCs. They also used the number of NICD and NECD puncta in signal receiving and sending cells respectively, as proxy of Notch activity. Although I understand the rationale, there are other explanations for this phenotype as discussed above. Thus, if they want to have an alternative way of showing the dampening of Notch signalling, they could use the levels of expression of well characterised targets of Notch in the FCs, such us hnt and E(spl)m-CD2 or E(spl)m7.
      • In M&M the authors explain that NRE-GFP levels were expressed in Fold change. However, in figure 3C the units of the graph are Fluorescence Intensity in a.u. Please, check this small inconsistency
      • In figure 4, they show the quantification of tubulin fibres within the nurse cells, however they are missing a similar analysis of Phalloidin (Pha) fibres/levels. I think this experiment and figure will be more complete if the authors added such a quantification of the effects of cup LOF in Pha distribution. Also, the authors do not show the single Pha channel in Fig4C, which would greatly helped to appreciate the differences between the WT and Cup LOF nurse cells. I suggest modifying the figure to better show the changes in Pha distribution.
      • In figure 4F-G the authors are showing the general effect of cup LOF in Delta distribution. They indicate with yellow arrowheads the cytoplasmic Dl puncta accumulation in the nurse cells, however it is almost impossible to see such puncta with that level of magnification/resolution. I suggest removing the arrowheads, since the figure 4H-I shows the same puncta more clearly.
      • In the Dl trafficking experiments (Fig4 H-I,K,L and Fig5A-C), the authors measured the number of puncta in the anterior nurse cell-follicle cell junction. In order to do those types of quantifications they need to be able to tell the cell boundaries that separate FCs from the nurse cells. Please, clarify the criteria for determining if the puncta are within the FCs or the underlying nurse cells.
      • In figure 6C-D the authors show example images of egg chambers expressing Rab11-CA-YFP using the germline specific nos-Gal4. However, in the images it looks like the YFP signal is coming from the surrounding stretched FCs. Please check that these are the right images or explain the inconsistency.
      • In figures 1R, 2L, 3Q, 6I, 6M, the authors should show the results of the statistical analysis between all the conditions tested. I think that this is crucial to be able to tell whether some of the rescues are complete or only partial.
      • Line 174: should say "mutant egg chambers".
      • Line 281: There is a reference that is missing from reference list: Liu et al., 2010;
      • Line 292: The reference for the NRE-GFP construct is not the correct one, since that references to a review article. Please, add the correct reference.
      • In line 462 of the manuscript you have a reference that is missing from your reference list.
      • In line 394 the authors say: "protein, it's enrichment in the cytoplasmic fraction of the cup mutant egg chambers", but I think that they meant mutant nurse cells.

      Significance

      The BC migration is an excellent model to study collective cell migration and how epithelial cells can acquire migratory behaviours. After years of study, there is good understanding of the signals and genetic circuits that regulate BCs specification and migration (Montell et al., 2012), but there are not many studies, to my knowledge, that describe a role of nurse cells in specifying or guiding the migration of these cells. Thus, this study by Saha and colleagues is one of the first studies that show a role for nurse cells in specifying the number of BCs.

      My field of expertise is in cell-cell communication through different pathways, including Notch and Integrin signalling. I have studied the role of endocytosis in regulating Notch signalling in various contexts, including follicular epithelium in Drosophila ovaries.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Saha et al. characterize Drosophila egg chambers that are mutant for cup and identify an increase in the number of a specialized type of follicle cells, the border cells. They demonstrate that this increase correlates with an expanded domain of STAT activity and reduced Notch signaling in anterior follicle cells. Determining that cup is required in the germline cells, the authors postulate and provide some evidence that cup mutants prevent germline Delta from properly signaling to follicle cells. In line with this, they also show that blocking endocytosis phenocopies some aspects of cup mutants, particularly border cell numbers and Delta levels, which they monitor cytoplasmically and at the cell surface. Lastly, they demonstrate that activation of Rab11 can rescue Delta levels and border cell number in cup mutants. They conclude that a key function of Cup in the germline is to traffic Delta to signal to follicle cells, and that the endocytic processing of Delta is required for its function.

      Major comments:

      The findings of this study are interesting and novel. The authors have completed a lot of experiments and analyzed the results carefully and in great detail. Experimental design is described adequately and statistical analysis is sufficient. While the main results are largely convincing and support the conclusions, there are some weaknesses that need to be addressed. One major concern is that the vast majority of the experiments were conducted with a single homozygous allele for cup. The authors claim this was necessary because other alleles arrest oogenesis, which is understandable, but it leaves the potential problem that the allele, a P-element insertion, may affect other genes, or there may be other unidentified mutations on the mutant chromosome. The authors are able to partially rescue the border cell phenotype with overexpression of Cup and can also mimic the outcome with RNAi in the germline, which helps alleviate some of this concern, but this was only done for one set of experiments (those in figure 1). Similar experiments need to be included to demonstrate the same outcomes when cut is disrupted by other alleles/methods for at least some of the Notch/Delta analyses since this is key to the paper's conclusions.

      A second concern is that some evidence is circumstantial or indirect. Specifically, the authors argue that the effect of Cut is due to trafficking of Delta, but do not consider the possibility that Delta could be more directly regulated or that other factors may be relevant. Border cell specification is rescued by increasing recycling in cup mutants, but this could be due to recycling of more factors besides Delta. To address this more directly, the authors should overexpress Delta in the germline of cut mutants. It is possible the disruption of Delta in cut mutants is due to changes in Delta protein stability/levels, so the experiment may also clarify this issue. If this is the case, it may be that hypomorphic Delta mutants would have a defect on border cell number, which could be examined separately. If Delta levels are low, endocytosis and recycling increases may also rescue cut mutants indirectly, but the conclusion about what Cut regulates may differ.

      Another concern is that Cup's main role is a confusing since it regulates many things, including cytoskeleton and cytoskeleton is necessary for general health and vesicle trafficking in the egg chamber - how do the authors think Rab11 upregulation is overcoming these defects? Rab11CA rescues Delta levels almost completely in cut mutants but only partially rescues Notch activation, suggesting there are other problems in these egg chambers that could contribute to the defects. While exploring possible other factors is beyond the scope of this work, the authors may want to acknowledge this issue.

      Minor comments:

      It would help the presentation of the paper to introduce Notch/Delta signaling during oogenesis in the introduction. More introduction and clarity about the number of polar cells at early stages and their role in the border cell cluster may also be useful to the reader.

      It is notable that the primary phenotype of a change in border cell numbers is quite subtle, often only affecting 1-2 cells, and the variation in different genotypes and experiments is sometimes also that large. The authors do a good job of being careful to count the cells at a specific developmental time and do appropriate statistical tests within an experiments. Still, it difficult to be sure that the effects are due to the gene being manipulated specifically or the genetic background. Related to this, a few issues should be addressed. Notably, at earlier stages, Notch signaling impacts cell division, so some of the phenotypes might be explained by there being more total cells in the domain instead of more signaling. The authors show Cut is in the same domain and pH3 is similar, but they didn't seem assess overall numbers. Secondly, for the stat suppression of cut (figure 2L), the authors need to show the stat-/+ control for comparison to make a conclusion about suppression versus additive effects. In addition, prior work (Wang et al 2007) expressed DN Kuz in border cells and did not see a change in specification, unlike what is claimed here. In the experiment in question, the control has lower than normal numbers of border cells and the DN Kuz has a number more typical of the controls in other experiments- so this is a concern that there is something else in the genetic background influencing the numbers. Other controls could help make this case, but ultimately this result is probably not necessary for the main argument. Thus the authors might consider leaving it out the Kuz analysis or perhaps can comment on the discrepancy with prior published results.

      Can the authors comment on why the volume of the border cell cluster increases more dramatically (>2x) than the number of cells (30% more)?

      Does the increase in border cell number change the migratory capacity? That is, do the clusters in cut mutant egg chambers migrate normally while the egg chamber looks okay?

      Several of the figure legend titles state conclusions that are over interpretations of the data shown:

      • Figure 3 legend is overstated- these experiments do not assay STAT activity, only border cell number, so the title can be simplified to say that.
      • For figure 4, both cytoskeleton and Delta are shown to be disrupted in cup mutants, but they are not directly linked, eg, the experiments do not show a change in Delta in cytoskeletal mutants alone. While it is interesting that cup mutants have disrupted cytoskeleton, ultimately this result is not well connected to the main issue of Notch/Delta signaling; in fact, it becomes confusing how anything can be trafficked to the cell surface if there is poor cytoskeletal organization. Since the authors favor the hypothesis that the cytoskeleton is not the key to the border cell specification difference, they may want to move this result out of figure 4.
      • The Figure 5 legend is also overstated- these experiments show that Delta is higher in cup mutants and endocytosis mutants AND that endocytosis (of something) is required in the germline for border cell number- but these results are not linked in this figure. More evidence for this connection does come later in figure 6. Some figure legends are quite brief and could benefit from a little more detail on what is being shown.

      Figure layout could be improved by keeping images consistent sizes and making sure graph text is large enough to read easily. Figures in general could be streamlined by having negative results and less pertinent results in supplemental data.

      Not all papers cited in the text are in the reference list.

      Referees cross-commenting

      I generally agree with the other reviewers that there are concerns with the precise function of cup in this context, and that some revision is needed, including editing of the writing. In response to reviewer 2, prior published studies only detected Cup in germline, but it is possible that it is expressed in follicle cells at a low level. The mutant clonal experiment in follicle cells that the authors did had no effect on border cells, so that provides some evidence the role is non-autonomous. I agree with reviewer 2's concern that the authors overstate the connection between cup and Delta and border cells based on their data and need a few more experiments to tie things together. I understand reviewer 3's concerns that the experimental effects on border cell numbers are very small and variable- I listed this as a minor concern, though, since this number is mainly being used as a read-out for STAT signaling levels and the data were extensively quantified and statistically tested.

      Significance

      My expertise is in cell migration, developmental biology, and Drosophila genetics. This paper will be of broad interest in these fields as it incorporates aspects of each in its characterization of a new regulatory mechanism to induce a motile cell population non-cell-autonomously, which is an exciting finding. Specifically, the work increases our understanding of the intersection between Notch and Jak/STAT signaling, which many researchers study - these were both known to be involved in border cell specification. The study provides more detailed characterization of the signaling and specification process in general, and makes significant advances in understanding how Delta signals are produced and presented from germline cells to receiving cells in the soma. Cut has not been previously implicated in these signaling pathways, so that is also novel, although its precise mechanistic role here is still somewhat unclear.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, authors establish a glyco-profiling platform for the functional analysis of genes involved in pseudaminic (Pse) and legionaminic (Leg) acid biosynthetic pathways. They used B. subvibroides and C. crescentus specific mutants in pseI and legI genes involved in the Pse and Leg biosynthesis, respectively, and cross-complementation assays with orthologous genes from different bacterial species, analysing motility and flagellin glycosylation. These assays show that Pse and Leg biosynthetic pathways are genetically different and recognize the LegX enzyme as a critical element in the Leg-specific enzymatic biosynthesis. Since that legX orthologous were only identified in the genome of bacteria with Leg biosynthetic pathways, it becomes a good marker to distinguish Leg from Pse biosynthesis pathways and a novel bioinformatic criterion for the assignment and discrimination of these two pathways. Reconstitution of Leg biosynthetic pathway of B. subvibroides in the C. crescentus mutant that lack flagellins, PseI and FlmG, complemented with both flagellin and FlmG of B. subvibroides, identified a new class of FlmG protein glycosyltransferases that modify flagellin with legionaminic acid. Furthermore, the construction of a chimeric FlmG through domain substitutions, allowed to reprogram a Pse-dependent FlmG into a Leg-dependent enzyme and reveal two modular determinants that govern flagellin glycosyltransferase specificity: a glycosyltransferase domain that accepts either Leg or Pse, and a specialized flagellin-binding domain to identify the substrate.

      Major comments:

      The conclusions obtained are convincing and well-supported. However, I think some points should be specify or clarify.

      1.- In the mutants (pseI, legI, flmG,...) the non-glycosylated flagellin are exported and assembled in a flagellum filament shorter than the WT strain. However, motility in plates is absent or very reduced. This might be produced by instability of the flagellum filament when rotating in a semi-solid surface. MET was performed from plates or liquid cultures? Do the author analyses motility in liquid media? If they did, changes in motility were observed?

      Response: The Caulobacter ΔpseI mutant accumulates low levels of flagellin in the supernatant. TEM analysis reveals that the flagellar filament is not assembled and only the hook structure is visible (PMID: 33108275). Brevundimonas subvibrioides ΔlegI or ΔflmG cells feature a shorter filament compared to WT by TEM. In all these analyses, TEM was performed on cells grown in broth to exponential growth phase as detailed in the Experimental procedures section. These mutant cells do not swim when analyzed by phase contrast microscopy. While is not known if swimming on semi-solid medium would further destabilize the flagellar structures seen in liquid cultures by TEM, there is more residual motility in B. subvibrioides mutants that make a short filament compared to C. crescentus mutants that lack the flagellar filament. Thus, our analyses point to a positive correlation between the residual motility and residual filament length when comparing the B. subvibrioides and C. crescentus mutants.

      2.- In page 5, lines 158-163, the analysis, by HPLC, of derivatized nonulosonic acid from B. subvibroides flagella, shows a major peak at 9.8 minutes retention and a minor peak at 15.3 minutes. Since that Pse-standard have retentions peaks at 9.7 and 13 minutes, and Leg-standard at 12.3 minutes, the authors cannot infer, only with these data, the flagella sugar is a legionaminic acid derivative. In my opinion, should be included that inference comes from the data obtained by HPLC analysis and genetic approaches. Thanks. Corrected. 3.- In page 5, line 173-175. Authors indicate, "While no difference in the abundance of flagellin was observed in extracts from mutant versus WT cells, flagellin was barely detectable in the supernatants of mutant cultures, suggesting flagellar filament formation is defective in these mutants". MET images show that the flagellum filament length is shorter in the mutants than in the WT strain. Therefore, if the same number of mutants and WT cells has been used in the immunodetection assays, there should be more flagellin monomers in the WT samples than in the mutants ones and flagellin bands should be less intense in mutant samples corresponding to the anchored flagellum. Why bands corresponding to flagellin in mutants and WT show similar intensity in the immunodetection assays (Figure 3C and D)? Furthermore, in lane 177-178, authors suggest that LegI and FlmG govern flagellin glycosylation and export (or stability after export). However, if filament stability is affected, the amount of flagellin monomers in the supernatant of mutants should be higher than in the WT. However, immunodetection assays show less abundance of flagelin monomers in the supernatant of mutants. Please, can you clarify this? In relation to this point, I suggest that authors include, in the experimental procedures, how they obtained the supernatants to flagellin immunodetection, as well as why they used anti- FljKCc anti-serum to detect the B. subvibroides flagellin.

      We thank the reviewer for raising this point. We have now clarified this question in the updated Experimental procedures section. Our immunoblots harbor the same number of cells harvested in exponential phase (OD=0.4). One mL of cells was harvested from cultures by centrifugation at full speed. The supernatant that was used for the immunodetection corresponds to the supernatant after the centrifugation. The supernatant fraction contains flagella that have been shed during the cell cycle at the swarmer cell to stalked cell (G1-S) transition of C. crescentus and B. subvibrioides.

      Thus, it is clear that the majority of flagellins detected by immunoblotting are in fact cell associated and specifically the intracellular flagellins. The evidence for this is that the levels are comparable between WT and ΔflmG mutant cells, even though the latter has shorter or no flagellar filaments. Moreover, while C. crescentus cells are not constantly flagellated during the cell cycle, flagellins are detectable on cell-associated samples by immunoblotting even when cells do not yet or no longer have a flagellar filament. Based on these two points, we conclude that the total flagellin levels associated with cells do not reflect the levels of flagellin assembled into a flagellar filament, but rather the flagellin bulk present in the cytoplasm.

      Consistent with this view, we previously reported that C. crescentus ΔpseI cells have the same amount of flagellins in cell lysates compared to the WT strain (PMID: 33108275), even though the mutant cells lack a flagellar filament. Thus, the results obtained here are consistent with previous observations and indicate that B. subvibrioides flagellin glycosylation mutants also still produce comparable amounts of flagellins intracellularly like the WT strain, despite the absence of flagellin glycosylation and inefficient assembly into a flagellar filament.

      Concerning the potential role of LegI and FlmG in flagellin stability after export, we were referring to protein stability (half-life), not filament stability. Glycosylation may impact the half-life of extracellular flagellins since glycosylation can protect from proteolytic degradation of proteins, possibly in this case by different proteases that may accumulate in the supernatant. Thus, non-glycosylated flagellins could be more easily degraded by extracellular proteases once they are exported, ultimately resulting in a lower amount in the supernatant.

      Addressing the final question about the specificity of the anti-FljKCc antiserum: we used this anti-serum because it detects the B. subvibrioides flagellins owing to the high sequence similarity between B. subvibrioides flagellins and C. crescentus flagellins. We previously showed that the anti-FljKCc anti-serum detects all six flagellins from C. crescentus, as determined by individually expressing each flagellin in a strain deleted for all six flagellin genes (Δfljx6) (PMID: 33108275). FljKCc (against which the antibody was raised) is 65% similar to the most distant C. crescentus flagellin, FljJ. As the similarity of FljKCc to the three B. subvibrioides flagellins ranges from 74% -67% sequence similarity, they should be even better recognized by the anti- FljKCc antibody than C. crescentus FljJ. However, on immunoblots we cannot attribute the signal to any individual B. subvibrioides flagellin as they could all co-migrate on SDS-PAGE and therefore all flagellins might reside in the same immunoblot band. However, we can clearly demonstrate that the immunoblot band corresponds to flagellins: a B. subvibrioides ΔflaF mutant (see below) that we constructed revealed that the flagellin signal is lost, as is the case for a C. crescentus ΔflaF mutant (PMID: 33113346). In the case of C. crescentus, the FlaF secretion chaperone is required for flagellin translation (synthesis) and we suspect that this also the case for B. subvibrioides FlaF. This experiment provides additional evidence that the B. subvibrioides flagellins are recognized by the anti-FljK (C. crescentus) anti-serum.

      4.- Authors demonstrate the specificity of the GT-B domain of FlmG, using a chimeric FlmGCc-Bs in a mutant of C. crescentus that lacks FlmG and harbour the Leg biosynthetic pathway of B. subvibroides. However, since that TPR comes from C. crescentus, this chimeric protein, could be transfer the legionaminic acid to the flagellin of B. subvibroides? Furthermore, the complementation of this mutant with the FlmGBs did not support efficient flagellin modification and this might be related to the TPRCc domain. Therefore, in my opinion, the chimeric protein should be introduced in the B. subvibroides∆flmG background. The answer to the first question is “No” or “very inefficiently” as determined from immunoblot analyses of B. subvibrioides ΔflmG cells expressing the chimeric FlmG_Cc-Bs protein that we now show in Fig S2B.

      Expression of the different FlmG (FlmG_Cc, FlmG_Bs, FlmG_Cc-Bs) in C. crescentus cells producing Pse or Leg revealed that FlmG_Bs does not support efficient flagellin modification with Pse in C. crescentus, likely because FlmG_Bs interacts poorly with the C. crescentus flagellins. By using the FlmG_Cc-Bs chimera we hoped to overcome this interaction problem with the C. crescentus flagellins (because the FlmG chimera harbors the C. crescentus TPR to bind the C. crescentus flagellins), however glycosyltransfer still does not occur efficiently because the GT domain from FlmG_Bs does not function with Pse. However, FlmG_Cc-Bs can modify the C. crescentus flagellins once C. crescentus is genetically modified to produce CMP-Leg (instead of CMP-Pse). This confirms that the FlmG TPR from C. crescentus is important for flagellin modification through the FlmG/flagellin interaction and that GT_B type glycosyltransferase only transfers Leg. In addition, we have now added as Fig S2B an immunoblot and as Fig S2C a motility test of B. subvibrioides ΔflmG cells expressing the FlmG_Cc-Bs chimeric protein in which we only observed little modification of B. subvibrioides flagellins and a poor motility, respectively. We extended our discussion of these results.

      5.- Page 8, line 299-301. Authors point out that C. crescentus that lacks FlmG and harbour the Leg biosynthetic pathway of B. subvibroides and the chimeric FlmGCc-Bs, although it has a glycosylated flagellin, whose mobility in SDS-PAGE is like the WT strain, is non-motile. They suggest that additional factors exist in the flagellation pathway that exhibit specificity towards the glycosyl group that is joined to flagellins. However, would be interesting to see if the flagellum filament has similar length to the WT strain or at least, it has increased in relation to the flagella length of the mutant. If flagella length has not increased, it could suggest that changes in the glycan type might affects the flagellin assembly or the stability of the flagellum filament. Therefore, would be also important to analyse its motility in liquid media.

      To investigate why the C. crescentus cells that produce Leg and express the chimeric FlmGCc-Bs glycosyltransferase are non-motile (Figure S5B) despite flagellin modification (by immunoblotting, Figure 7C), we employed two strategies. First, we performed immunoblot analyses on the supernatant fraction from these cells to determine if flagellins accumulate extracellularly. As now showed in Figure S5A, only low amounts of C. crescentus flagellins modified by Leg are present in the SN fraction. Second, we conducted TEM analyses of cells grown to exponential growth phase in broth. As shown in Figure S5C, the C. crescentus cells producing Leg and expressing FlmG_Cc-Bs glycosyltransferase harbor a shorter flagellum compared to those expressing the FlmG_Cc in which C. crescentus flagellins are modified by Pse. Altogether these results explain why these cells are non-motile both on soft agar plate and in liquid.

      Minor comments: 1.- Pag 3 line102. Please change ".....two predicted synthases, a PseI and LegI homolog, and C. crescentus only encodes only PseI...." to ".....two predicted synthases, a PseI and LegI homolog, and C. crescentus only encodes a PseI...." 2.- Figure 2 A. Plasmid nomenclature (Plac-neuB) is confusing because C.c. ΔpseI cells express predicted LegI or PseI synthases. Please change to Plac, as in Figure 2B and 4. Figure 2A and 2B do not contain any complementation with Bacillus subtilis (Basu), however two complementation are labelled as Bs in Figure 2A and 2B. Furthermore, no Bs are present in the Figure 2 legend. 3.- Legend of figure 3 should include B. subvibrioides abreviation Bs. Line 774: Please change ".......glycosylation and secretion in B. subvibrioides." to ".......glycosylation and secretion in B. subvibrioides (Bs)." 4.- Figure 3. In order to keep a similar nomenclature in all plasmids, plasmid Plac-legI syn and Plac-flmG should be labelled as Plac-legIBs syn and Plac-flmGBs.

      5.- Legend of figure 4 should include B. subvibrioides abreviation Bs. Line 791: Please change "....... complementation of the B.subvibrioides ΔlegI mutant with ...." to "....... complementation of the B.subvibrioides (Bs)ΔlegI mutant with ...." Furthermore, Legend of figure 4 indicate in line 795, that immunoblots reveal the intracellular levels of flagellin, however figure 2 and 3 show immunoblot of cell extracts. Please, correct this sentence. 6.- Legend of figure 5, 6 and 7 should include B. subvibrioides abreviation Bs. Line 808: Please change "Predicted Leg biosynthetic pathway in B. subvibrioides " to"Predicted Leg biosynthetic pathway in B. subvibrioides (Bs)" Line 834: Please change "....affects motility, flagellin glycosylation and secretion in B. subvibrioides."to "....affects motility, flagellin glycosylation and secretion in B. subvibrioides (Bs).Line 852: Please change "...acetyltransferase in flagellar motility of B. subvibrioides cells." to ""...acetyltransferase in flagellar motility of B. subvibrioides (Bs) cells." Furthermore, figure 5 should include C. crescentus abbreviation. Line 815: Please change "....whole cell lysates from C. crescentus mutant cultures......." to "....whole cell lysates from C. crescentus (Cc) mutant cultures......." 7.- In my opinion it would be useful to include a scheme of the gene organization involved in Leg biosynthesis in B. subvibrioides.

      8.- Legend of figure S1 should include B. subvibrioides (Bs) and C. crescentus (Cc) abbreviations. Line 888-867: Please change "...C. crescentus ΔpseI cells and B. subvibrioides ΔlegI cells with plasmids expressing..." to "...C. crescentus (Cc) ΔpseI cells and B. subvibrioides (Bs) ΔlegI cells with plasmids expressing..." Furthermore, the name and abbreviations (Mm, So, Ku, Pi, Dv) of the species used should be included in the legend. Why the authors used a plasmid with a Pvan promoter in these assays? Why the authors changed the code color of pseI and legI orthologous genes? It would be more useful and understandable follow the code color used in figure 2 and 4.

      Page 6 line 200, Please change ".....complementing synthases exhibit greater overall sequence similarity to LegI than Pse of C. jejuni. 22268,....." to ".....complementing synthases exhibit greater overall sequence similarity to LegI than PseI of C. jejuni. 22268,....." 10.- Page 7 line 231, Please change ".....negative bacteria A. baumannii LAC-4 (GCA_000786735.1)[38] and P. sp. Irchel 3E13..." to ".....negative bacteria A. baumannii LAC-4 (GCA_000786735.1)[38] and Pseudomonas sp. Irchel 3E13..." 11.- Introduce a line break between line 503 and 504. 12.- Page 14 line 543, please change "XbaI" to "XbaI" Thanks for the careful editing. We changed the text as suggested by the reviewer. We also added a scheme showing the genetic organization of the genes involved in Leg production and present as Figure 1B. When this study was initiated, the pMT335 plasmid with a Pvan promoter was used before we switched to using the pSRK plasmid with Plac promoter for better induction. Note that the results with Pvan or Plac are comparable regarding the PseI synthases interchangeability. Color code is now homogenous through the manuscript.

      Reviewer #1 (Significance (Required)):

      This is an interesting manuscript that contributes to the knowledge of the legionaminic biosynthetic pathway and establish a glyco-profiling platform for the functional analysis of genes involved in pseudaminic (Pse) and legionaminic (Leg) acid biosynthetic pathways. The analysis of Leg patway allowed to identify a gene (legX) that can be used to distinguish Leg from Pse biosynthesis pathways, becoming a bioinformatic tool for the assignment and discrimination of these two pathways. Furthermore, a new class of FlmG protein glycosyltransferases, able to transfer Leg to the flagellin, has been identified and its analysis reveal two modular determinants that govern flagellin glycosyltransferase specificity: a glycosyltransferase domain that accepts either Leg or Pse, and a specialized flagellin-binding domain to identify the substrate.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: Viollier and co-workers present a study in which they preform an elegant and rigorous genetic profiling of the the legionaminic and pseudaminic acid biosynthesis and flagellar glycosylation pathways in C. crescentus (native Pse) and B. subvibrioides (native Leg). They use motility as a representative readout for functional flagellar glycosylation with these microbial sialic acids. They discover orthologous Pse synthase genes can replace the function of the native synthase in C. crescentus and orthologous legionaminic acid synthase genes can achieve the same in B. subvibrioides. However, not vice versa indicating a strong preference for each microbial sialic acid stereoisomer in these species. For the Leg biosynthesis pathway, which requires GDP-GlcNAc, the authors also identify LegX as an essential component to synthesize this sugar nucleotide and thus a marker for Leg biosynthesis pathways. Upstream in theses pathways, they also identify a new class of FlmG flagellar protein glycosyltransferases. Importantly, through heterologous reconstitution experiments to uncovered that these glycosyltransferases possess two distinct domains, a transferase domain the determines specificity for either CMP-Leg or CMP-Pse, and a flagellin-binding domain to achieve selectivity for the substrate. Interestingly, by creating chimeric FlmG for these two domains between C. crescentus and B. subvibrioides they show that these two modular parts can be interchanged to adapt flagellin glycosyltransferase specificity in these species. Major comments: The key conclusions of the manuscript by Viollier and co-workers are convincing and well supported by their experiments and used methods, with respect to the insulation of the Leg and Pse biosynthetic pathways, they key role of LegX in launching the Leg pathway and the successful reconstitution of Leg glycosylation in a previously Pse-producing C. crescentus strain. Finally, they convincingly show that a chimeric version of the involved glycosyltransferases is functional, which besides intriguing future glycoengineering possibilities also emphasizes the two discrete domains in these transferases that dictate their sugar nucleotide and acceptor specificity. There is one additional experiment I would suggest with relation to the detection and confirmation of Pse and Leg on flagella of respectively, C. crescentus and B. subvibrioides. In the case of C. crescentus the detected DMB derivatized monosaccharide co-elutes with a validated standard of tri-acetylated Pse, which is convincing evidence of its identity. However, for B. subvibrioides. Their DMB derivatized monosaccharides from its flagella, results in a peak the does not co-elute with the only Leg standard (Leg5Ac7Ac) they have, it does elute at the same time as their Pse standard. Although it cannot of course be Pse as B. subvibrioides. Does not possess a Pse biosynthesis pathway, it also does not provide enough evidence to conclude that it is a Leg derivative. An MS(-MS) measurement of the eluted signal would not be a big investment in time and resources and would provide additional evidence to at least assign this peak to microbial sialic acid related to the present Leg biosynthesis pathway. It the identified mass would lead to identification of the derivative, it would also add to the proper characterization of the flagella glycosylation in the bacterium.

      We have now added the glycopeptide analyses as requested. They are described in the last experimental section and confirm our results.

      The data and the methods presented in this study are presented with sufficient detail so that they can be reproduced? However, I would suggest as is common nowadays in most journals that the authors include images of the raw unprocessed blot in de supporting info.

      *The motility pictures are representative of three independent experiments and the immunoblots are representative of at least two independent experiments. This has now been mentioned in the Experimental procedures. The raw unprocessed blots have now been added as supporting info. *

      Minor comments: There are a few textual errors that the authors should fix: -page 2, line 70: change "used" to "use" -page 11, line 407: add the word "are" after Pse On page 2, line 36, the authors state that "most eubacteria and the archaea typically decorate their cell surface structures with (5-, 7-)diacetamido derivatives, either pseudaminic acid (Pse) and/or its stereoisomer legionaminic acid (Leg,". This should be nuanced as to my knowledge it is not most eubacteria, but more a subset as identified by Varki in his seminal PNAS paper. The authors clearly present their data and conclusions in the figures of this manuscript. However, I would recommend the take a critical look at the drawing of their monosaccharide chair conformations and the positioning of the axial and equatorial groups on these chairs in Figure 1 and 5, as these are in most cases drawn a bit crooked, which can easily be corrected. We corrected the text as the reviewer suggested. We changed the sentence in the introduction to be more nuanced. The drawing of the monosaccharide has been improved.

      Reviewer #2 (Significance (Required)):

      The family of carbohydrates called sialic acids was long thought to exclusively occur in glycoproteins and glycolipids of vertebrates, but has since also been found in specific microbes. Especially symbiotic and pathogenic microbes associated with the humans express a wide array of unique microbial sialic acids for which their functional roles are not well understood and the associated glycosylhydrolase and glycosyltransferase have in most cases not been identified yet. The authors present an impressive insight into flagellar glycosylation with Pseudaminic and Legionaminic acid in two bacterial species, using genomic analysis, rewiring, immunoblots and motility assays as their main tools. They provide compelling evidence on the insulation of the Pse of Leg pathway in these species, the flexibility in exchanging between biosynthetic enzymes from the same pathway between various species. Crucially, most glycosyltransferases that add the Pse or Leg glycoform onto various acceptor sites in bacteria, have up to this point remained elusive in most cases. It is therefore very valuable information that the authors here provide on the involved glycosyltransferases. Especially, on the two domains that govern their sugar nucleotide and acceptor specificity, and that these can be reengineered as chimeric glycosyltransferases. To me as a chemical glycobiologist this provides compelling possibilities for glycoengineering possibilities in future studies in the field to elucidate the functional roles of Pse and Leg glycosylation.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary of the findings and key conclusions (including methodology and model system(s) where appropriate): Kint et al describe a neat study of bacterial flagellin glycosylation by a recently identified class of protein glycosyltransferases called FlmG. The experiments are well designed, the data presented is convincing and the conclusions drawn are mostly in line with the experimental evidence presented. These are the key findings. Kint et al show that genetic tools and motility can be used as a readout to probe the sugar biosynthesis pathway in bacteria. Using the recently characterized system of Caulobacter crescentus, they have performed a survey of different PseI/LegI/NeuB genes from various bacteria, checking whether they could rescue the motility defect in C. crescentus ΔpseI cells. They found that those genes that did confer motility also had higher sequence similarity to C. jejuni PseI than to C. jejuni LegI or C. jejuni NeuB. They also found that these genes also restored flagellin glycosylation as checked by mobility shift on gel electrophoresis with immunoblotting to anti-FljK antibody. This survey brought up an interesting finding that the PseI/LegI/NeuB orthologs of the closely related Brevundimonas species were unable to confer motility to C. crescentus ΔpseI cells, and were more similar to C. jejuni LegI than to C. jejuni PseI or C. jejuni NeuB. They also performed similar glycoprofiling experiments using B. subvibrioides ΔlegIBs cells and various PseI/LegI/NeuB orthologs from different bacteria, which indicated the restoration of motility by putative LegI synthases. Kint et al demonstrate flagellin glycosylation in B. subvibrioides by performing in-frame deletions of FlmG, and LegI genes in B. subvibrioides and checking for motility, presence of flagella, and flagellin glycosylation by motility shift on gel electrophoresis. Further, they confirm the critical nature of GDP-GlcNAc for Leg biosynthesis by assessing flagellin glycosylation and motility in B. subvibrioides with an in-frame deletion in PtmE/LegX and by performing heterologous complementation with an M. humiferra PtmE ortholog. They also reconstitute the legionaminic acid biosynthesis pathway in C. crescentus cells that lack flagellins, PseI and FlmG, and show that the heterologously expressed B. subvibrioides flagellin is glycosylated by heterologously expressed B. subvibrioides FlmG. Finally, they also show that whereas the CcFlmG cannot substitute for BsFlmG and vice versa, a chimeric FlmG bearing the TPR domain from C. crescentus FlmG (that recognizes C. crescentus FljK) and the GT domain from B. subvibrioides FlmG (that transfers CMP-Leg) modifies CcFljK in C. crescentus cells that lack CcFlmG but express both Pse (endogenously) and Leg (from the reconstituted pathway). This demonstrates the modularity of the FlmG glycosyltransferases. Kint et al provide the chemical nature of C. crescentus flagellin glycosylation. Kint et al have analyzed the glycans released from the flagellin by acid hydrolysis and clearly shown the nature of the glycan in C. crescentus flagellin to be Pse4Ac5Ac7Ac by use of Pse standards. The glycan from B. subvibrioides was distinct from the Leg standard used, and could be a Leg derivative distinct from Leg5Ac7Ac.

      Major comments: 1. Table 1 and Text in Results, lines 116-119, "In support of the notion that derivatization occurs after the PEP-dependent condensation reaction to form Pse or Leg, our glyco profiling analysis revealed that putative PseI proteins (identified by sequence comparisons to C. jejuni 11168, Table S1) conferred motility to C. crescentus ΔpseI cells, whereas putative LegI synthases did not." Not clear how putative PseI and LegI synthases were identified. Table 1 only lists overall percent sequence identity and similarity to Cj PseI, LegI and NeuB, and percent identities and similarities of the various nonulosonic synthases to these proteins are in the similar range, as expected. In the absence of sequence alignments indicating the presence of conserved residues, particularly related to the substrate binding region, that are distinct in these paralogs, calling out the type of synthase based on the highest percent identity (to Cj PseI, LegI or NeuB) is speculative. Also, Shewanella oneidensis does not follow the pattern of highest similarity to NeuB3. Second, in the absence of data showing that the Leg and Pse found in these different organisms actually are different derivatives, this does not support that "derivatization occurs after the PEP-dependent condensation reaction to form Pse or Leg". Putative PseI and LegI were proposed based on BlastP analyses in which the protein sequences of interest were aligned to the three experimentally validated synthases from C. jejuni 11168: PseI, LegI, NeuB as well as PseI from C. crescentus, as indicated in Table S1. While, the assignment of the donor sugar is based only on the sequence identity and similarity to LegI or PseI, this assignment corresponds well according to the restoration of the motility of the C. crescentus ΔpseI mutant upon expression of PseI ortholog and B. subvibrioides ΔlegI mutant with heterologous LegI expression.

      It is true that for Shewanella oneidensis the assignment as PseI or LegI is ambiguous, exhibiting nearly identical similarity, but it is quite distinct from NeuB. This actually makes the S. oneidensis synthase a very interesting case to explore the enzymology of its Pse/LegI ortholog, knowing that it has been previously shown that this bacterium glycosylates its flagellins with Pse derivatives (PMID: 24039942). The results from our genetic complementation analysis are however very clear (PseI ortholog) and very consistent with the functional analysis in S. oneidensis.

      Concerning the different derivatives of Pse or Leg: McDonald and Boyd (PMID 32950378) recently published a review giving some examples of Bacteria/Archaea experimentally shown to contain Pse/Leg-derivatives: C. jejuni 11168 modifies its flagellin with 5,7-N-acetyl Pse, Sinorhizobium fredii NGR234 (not used in this study but in our previous work PMID 33113346 and showed to restore the motility of C. crescentus ΔpseI cells) modifies its capsule with 5-acetamido-7-3-hydroxybutyramido-Pse), Treponema denticola modifies its flagellin with 7-(2-metoxy-4,5,6-trihydroxy-hexanoyl-Pse, A. baumannii LAC-4 produces 5,7-N-acetyl-8-epi-Leg to decorate the capsule, Halorubrum sp. PV6 modifies the LPS with N-formylated Leg and L. pneumophila produces 5-acetamidino-Leg.

      The reviewer is right in that we do not know the exact version of Pse or Leg produced in C. crescentus and B. subvibrioides, HOWEVER, the fact that complementation works with the majority of the orthologs of PseI and LegI including many from bacteria that are known to produce modified Pse derivatives for example in Shewanella oneidensis and Treponema denticola, the most likely explanation is that derivatization occurs after the PseI or LegI step, but we concede that the results are also compatible with a promiscuous enzyme that can accept different Pse derivatives or different Leg derivatives.

      1. Related to (1), Text in Results, lines 130-131, "We conclude from our survey that (heterologous) PseI synthase activity generally confers motility to C. crescentus ΔpseI cells, whereas LegI-type (or NeuB-type) synthases are unable to do so." There is no a priori evidence provided indicating that these were PseI or LegI type synthases. So the conclusion really is that assuming only PseI type synthases would be able to rescue the motility defect in C. crescentus ΔpseI cells, this glyco-profiling motility assay now provides the first biochemical evidence telling us which synthases are Pse-type, and which are Neu/Leg-type. And in my view, this is the conclusion of greater significance in the field - to be able to now identify which is a PseI and which a LegI based on these complementation assays. However, if the authors still wish to retain their original conclusion, they could cite or provide evidence (either biochemical evidence in this work or reported literature regarding the sugar synthesized or bioninformatics analysis regarding the presence of distinct genes such as the Ptm genes for legionaminic acid biosynthesis pathway or genes that differ in their enzyme activities and overall fold such as PseB/LegB or PseG/LegG in the gene neighborhood) indicating or suggesting the PseI/LegI/NeuB nature of the different synthases. Also, methods for the bioinformatics analysis (eg. BLASTp settings used, dates of searches, whether regular BLAST or PSI-BLAST was used, etc.) are missing in the manuscript, and need to be included. We agree that for many PseI or LegI tested, there is no provided biochemical evidence. HOWEVER, this is not the case for some of them including the PseI, LegI and NeuB from Campylobacter jejuni (PMID 19282391), some A. baumannii strains (α-epi-legionaminic acid for A. baumannii LAC-4 PMID 24690675), Shewanella oneidensis (Pseudaminic acid with methylation PMID 23543712), Legionella pneumophila (Legionaminic acid PMID 18275154) or Halorubrum sp. PV6 (N-formylated legionaminic acid PMID 30245679). Thus, we maintain the two conclusions: the PseI and LegI synthases are generally interchangeable and the complementation assays can enable to identify and assign PseI and LegI function. BLAST2P was used to compare the protein sequences of the tested NeuB-like synthases with NeuB1, LegI (NeuB2) and PseI (NeuB3) from Campylobacter jejuni but also with PseI from C. crescentus. BLOSUM62 matrix was used as well as a word of size 3 for the comparison. We have now added this procedure in the legend of the Table S1.
      2. It is interesting that there is still a signification amount of flagellin secretion/assembly in the B. subvibrioides LegI and FlmG mutants. It will be good to see a discussion about whether this is likely from due to low level of function despite the in-frame deletion of genes; how many flagellin subunits are likely to have managed secretion and assembly in these short flagella; whether there is any redundancy of LegI / FlmG (perhaps with lower levels of expression); considering Parker and Shaw's findings of glycosylation being required for flagellin binding to the chaperone and subsequence secretion in A. caviae whether there is a FlaJ homolog in B. subvibrioides. Also, can the authors rule out the possibility that absence of glycosylation does not affect flagellin assembly but makes the flagellum prone to shear/breaks in B. subvibriodes, resulting in smaller flagella? How many flagellins are there in B. subvibrioides? Is it possible that one is glycosylated but another/others are not, and that is the reason for the small flagellum in these mutants? The number of flagellin subunits that are assembled into a full-length flagellar filament is unknown in C. crescentus and in B. subvibrioides. There are 3 different flagellin genes that are now presented schematically in Figure 1C. No redundancy has been found for LegI or FlmG. It is possible that the B. subvibrioides is better in exporting non-glycosylated flagellin or that the capping proteins can function better with sugar modification or that the filament of B. subvibrioides mutants is less fragile when it is non-glycosylated or that its flagellins “stick” better. It is also possible that short filaments are not actually containing flagellins mounted on the hook but another protein that polymerizes aberrantly in the absence of Leg or FlmG. This remains to be investigated and compared to the situation of Pse and FlmG mutants of C. crescentus.

      B. subvibrioides possesses an ortholog of the C. crescentus flagellin secretion chaperon FlaF (PMID 33113346). As observed in C. crescentus, FlaF likely has a role in flagellin translation as its inactivation totally prevents flagellins production (see answer to reviewer #1). For C. crescentus, bacterial two hybrid experiments revealed that FlaF can interact with non-glycosylated flagellins in E. coli. Thus, it is strongly possible that FlaF/flagellins interaction is not dependent on the flagellins glycosylation state. In addition, the short flagellum filament observed in B. subvibrioides ΔlegI or ΔflmG mutants argues that at least some flagellins are secreted while not glycosylated.TEM pictures have been performed in liquid medium from exponential growth phase. In this condition, no fragment of flagella was observed in the culture medium by TEM but only small flagella with a hook structure attached. Also, flagella breaks might result in more random length of flagellum.

      Three flagellins are in B. subvibrioides (Bresu_2403 is 59% identical with FljLCc, Bresu_2638 is 57% identical with FljKCc and Bresu_2636 is 62% identical with FljJCc). We now show this genetic organization of the flagellins in Fig. 1C. The three flagellins are all detected by the anti-FljKCc anti serum (see answer and figure to reviewer #1). We cannot attribute the immunoblot signal to any individual B. subvibrioides flagellin as they could all co-migrate on SDS-PAGE. However, the signal often looks like a doublet (as shown in Figure 4B for example) suggesting that at least two flagellins are detected and this doublet is always found to migrate faster in absence of glycosylation that could indicate that all B. subvibrioides flagellins (or at least 2) are modified.

      Text in Results, lines 170-171, "We then probed the resulting ΔlegIBs and ΔflmGBs single mutants for motility defects in soft agar and analyzed flagellin glycosylation by immunoblotting using antibodies to FljKCc". Was the antibody to FljKCc determined to also specifically bind to FljKBs? Also, how many flagellins are there in B. subvibrioides? Are all detected with this antibody? Antibodies raised to FljKCc were raised against His6-FljK produced in E. coli (previously published in Ardissone et al, 2020). This serum recognizes the 6 flagellins from C. crescentus (PMID: 33108275). It recognized the three flagellins from B.s. (see answer to reviewer #1).

      It is interesting that C. cresentus cells expressing Pse (endogenously) and Leg (reconstituted pathway), and BsFlmG and BsFljK (corresponding to Figure 5C) are not motile. Was the motility assay done for the experiment of figure 5B as well? Are the C. crescentus cells lacking Pse and FlmG but with heterologous expression of Leg and BsFljK and BsFlmG also non-motile? Also, it will be good to see the TEM images for these cells.

      C. crescentus cells that produce Pse (endogenously) or Leg (reconstituted pathway) and BsFlmG and BsFljK (formerly Figure 5C and now as Figure 7C) are indeed not motile as shown by the motility tests presented in Figure S5B. Motility assays with cells used in the former Figure 5B (now Figure 7B) have also been done and are now presented Figure S4B. These cells are non-motile because BsFljK is not efficiently secreted (or unstable after secretion) as shown on the immunoblot of the supernatant fraction in Figure S4A lower panel. As a result, flagellar filament is not properly assembled as only a short flagellum was observed by TEM in such cells compared to the WT C. crescentus (Figure S4C and S4D).

      Immunoblotting of the supernatants should be shown (in addition to the cell extracts) for Figures 5B and 5C so that the reader can appreciate whether glycosylation has taken place but secretion/assembly has not. Further, HPLC of the acid extracts from flagellin could be done to unambiguously show whether the CcFlmG has transferred Pse and the BsFlmG and Cc-BsFlmG have transferred Leg on to the CcFljK in Figure 5c, and the identity of the sugar, if any, transferred by CcFlmG in the absence of Pse, and BsLeg genes or BsLegX gene in figure 5B.

      *__ Immunoblots of the supernatants for Figure 5B (now Figure 7B) have been done and been added (Figure S4A lower panel). BsFljK is barely detected in the supernatant whatever its glycosylation state (with or without Leg). Note that in the supporting info where the raw unprocessed blot used for this panel is shown, a positive control of blotting (C. crescentus Δfljx6 mutant expressing CcFljK from pMT463) has been used. Immunoblots of the supernatant from Figure 5C (now 7C) have been done and been added in figure S5A. The CcFljK modified with Leg is poorly secreted (or unstable after secretion). As a result, these cells only harbor a short flagellum compared to those that are able to modify CcFljK with Pse (Figure S5C).

      HPLC of the acid extracts from flagellins have been performed on purified flagella obtained by ultracentrifugation. As C. crescentus cells expressing BsFlmG and Cc-BsFlmG harbor no or short flagellar filament, the purification by ultracentrifugation is limited. Thus, to further confirm that CcFlmG has transferred Pse and Cc-BsFlmG (and BsFlmG) has transferred Leg on CcFljK (former Figure 5C and now Figure 7C), we performed immunoblots on the cell extracts of C. crescentus ΔflmG ΔpseI cells that cannot produce Pse but able to produce Leg (reconstituted pathway). These experiments, now presented in Figure 7C (lower panel) confirmed that no modification of CcFljK was observed in C. crescentus cells expressing CcFlmG whereas CcFljK is modified in C. crescentus expressing Cc-BsFlmG, confirming that Cc-BsFlmG has transferred Leg (the only NulO produced in this condition).__*

      Text in discussion, lines 334-338, "By extension, having recognized the LegX/PtmE enzyme as a critical element in the Leg-specific enzymatic biosynthesis step (Figure 6) likewise offers another functional, but also a novel bioinformatic, criterion for the correct assignment and discrimination of predicted stereoisomer biosynthesis routes residing in ever-expanding genome databases" It will be nice to see a discussion on the prevalence of PtmE versus GlmU (or equivalent gene), PtmF, PtmA, PgmL in the Leg synthesizing organisms. Is the PtmE but not the other genes found in all cases, which makes it better as a molecular determinant for bioinformatics predictions of the type of pathway? Also, on whether PtmE has any homology to genes in other pathways (not associated with flagellin glycosylation) and how reliable a marker it is to differentiate Leg biosynthesis from Neu5Ac biosynthesis pathways.

      GlmU is a potential bifunctional UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase that can be part of both Pse and Leg pathway (PMID 19282391). Accordingly, a GlmU ortholog is found in C. crescentus and B. subvibrioides that we showed are producing Pse and Leg, respectively. Thus, GlmU cannot be attributed to a Leg pathway signature. On the other hand, PtmE is barely found in the organisms from which PseI orthologs restore the motility of C. crescentus ΔpseI cells.

      PtmF, PtmA, PgmL and GlmS are proposed to act upstream of the production of GlcN-1-P that is a precursor of both UDP-GlcNAc and GDP-GlcNAc, the precursors of Pse and Leg respectively. In addition, orthologs of these genes are not prevalent in the Leg synthetizing organisms present in Table S2 using BlastP analyses with C. jejuni proteins as templates.PtmE ortholog is found in most of the Leg synthetizing organisms as shown in Table S2 and often genetically linked with other genes coding for proteins involved in Leg production (shown with the asterisk * in table S2). Of note, PtmE is found not only in organisms that modify flagellin(s) with Leg but also in organisms that add Leg on capsule such as A. baumannii LAC-4.

      It is not clear from the methods or the figure legends how many times the immunoblotting, motility experiments were done; how many experiments/trials are the images representative of? The motility pictures are representative of three independent experiments. The immunoblots are representative of at least two independent experiments. This information is now added in the Experimental procedures section.

      Minor comments:

      1. The gene for GlcN-1-P guanylyltransferase in the Leg-specific enzymatic biosynthesis step is already known as PtmE from the work of Schoenhofen's group. For the sake of consistency, it would be better to retain the nomenclature as PtmE throughout the manuscript instead of introducing the name LegX, which makes it sound like it is a previously unknown gene.

      2. Text in abstract, lines 15-17: "Sialic acids commonly serve as glycosyl donors, particularly pseudaminic (Pse) or legionaminic acid (Leg) that prominently decorate eubacterial and archaeal surface layers or appendages" The glycosyl donor is the nucleotide sugar and not the nonulosonic acid or sialic acid... rephrasing required for accuracy. Done

      3. Text in abstract, lines 18: "a new class of FlmG protein glycosyltransferases that modify flagellin" The authors are presumably referring to FlmG as the new class of protein glycosyltransferases... rephrasing required for accuracy Corrected
      4. Text in introduction, lines 41-42 "Pse and Leg derivatives synthesized in vitro can be added exogenously in metabolic labeling experiments" It should be "derivatives of Pse and Leg precursors" and not "Pse and Leg derivatives" corrected
      5. Text in introduction, line 46 "Pse- or Leg-decorated flagella may also be immunogenic." This sentence is not referenced and it is not clear why it is written here.

      6. Text in introduction, lines 63-66 "The synthesis of CMP-Pse or CMP-Leg proceeds enzymatically by series of steps [20-22], ultimately ending with the condensation of an activated 6-carbon monosaccharide (typically N-acetyl glucosamine, GlcNAc) with 3-carbon pyruvate (such as phosphoenolpyruvate, PEP) by Pse or Leg synthase paralogs, PseI or LegI, respectively" The synthesis begins with activated GlcNAc. The substrate for condensation is not activated GlcNAc. It is 2,4-diacetamido-2,4,6-trideoxy-D-mannopyranose in case of LegI and 2,4-diacetamido-2,4,6-trideoxy-b-L-altropyranose in case of PseI. Indeed, we modified the sentence.

      7. Text in introduction, line 70 "for used as glycosyl donors" Typographical error, "for use as glycosyl donors" Corrected
      8. Text in Results, line 102, "C. crescentus only encodes only PseI" Do the authors mean "only one PseI"? Corrected
      9. Text in Results, lines 108 and 109, "Such modifications could occur before the PseI synthase acts or afterwards. In the latter case, most (if not all) synthases would be predicted to produce the same Pse molecule," Do the authors know of any reports of modifications occurring before the PseI synthase? Please cite references, if known. Why "most (if not all)"? If the former case is true, the PseI synthase might not be able to accept the substrate. Correct. Because we cannot test all enzymes we must keep the statement non-committing.

      “Most (if not all)” refers to the latter case i.e. the modification occurs after PseI synthase. In this context, PseI should do the same reaction, however, there might be some exceptions.

      There is, to our knowledge, no reports showing that modifications occur before the PseI synthase. The glyco-profiling experiments all suggest that modification occurs after Pse production based on our motility readout. It is possible that PseI enzymes that condense a modified precursor would not be functional in our motility assay.

      Text in Results, lines 141-143, "our bioinformatic searches using C. jejuni 11168 as reference genome identified all six putative enzymes in the B. subvibrioides ATCC15264 genome (CP002102.1) predicted to execute the synthesis of Leg from GDP-GlcNAc" Not clear how this was done. Do the authors mean that they used the genes from C. jejuni 11168 as the query genes to identify homologs in B. subvibrioides ATCC15264 genome (CP002102.1)? Or did they use putative genes from B. subvibrioides ATCC15264 genome (CP002102.1) and pull out homologs from C. jejuni 11168 by using C. jejuni 11168 as the reference genome? We now have modified the sentence to make it clearer.

      At first reading, the flow of the manuscript is difficult to follow due to the figures not appearing in full in order of their occurrence. For instance, Figures 5B and 5C are discussed only in the end of the manuscript after the results of Figures 6 and 7. Other instances also exist. The authors may consider re-ordering the figure parts if possible so that all parts of each figure appear in order of occurrence in the manuscript text. Thanks for raising this issue. We have now tried to address this concern by re-organizing the order of occurrence of the figures. Notably we have now exchanged Figure 5 (on Leg pathway reconstitution and FlmG rewiring) with Figure 7 (on LegB and LegH). We modified the text accordingly. We hope that it makes the manuscript and corresponding figures easier to follow.

      Reviewer #3 (Significance (Required)):

      The nonulosonic acids, Pseudaminic acid and Legionaminic acid, are abundant in bacterial systems in the capsular and lipopolysaccharides as well as in glycoprotein glycans. The Ser/Thr-O-nonulosonic acid glycosylation of flagellins has been studied with respect to the system of Maf glycosyltransferases in Campylobacter jejuni, C. coli, Helicobacter pylori, Aeromonas caviae, Magnetospirillum magneticum, Clostridium botulinum and Geobacillus kaustophilus, and recently with respect to the system of FlmG glycosyltransferases by Viollier's group in Caulobacter crescentus. However, the determinants that govern the glycosyltransferase function are not still well known. Kint et al have performed excellent work using bacterial genetics tools to (1) highlight the "functional insulation" of the Leg and Pse biosynthesis pathways, (2) demonstrate the modularity of the FlmG glycosyltransferase proteins with respect to the flagellin binding and glycosyltransferase domains. This work makes a significant advance in the field with respect to (1) understanding flagellin glycosylation by FlmG, (2) making designer protein Ser/Thr-O-glycosyltransferases, and (3) bioinformatics analysis of genomes with respect to the Pse/Leg/Neu nonulosonic acid biosynthetic potential encoded. The findings will be of great interest to scientific audiences working in the areas of glycobiology and bacteriology. My area of expertise: Maf flagellin glycosyltransferases

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary of the findings and key conclusions (including methodology and model system(s) where appropriate): Kint et al describe a neat study of bacterial flagellin glycosylation by a recently identified class of protein glycosyltransferases called FlmG. The experiments are well designed, the data presented is convincing and the conclusions drawn are mostly in line with the experimental evidence presented.<br /> These are the key findings. Kint et al show that genetic tools and motility can be used as a readout to probe the sugar biosynthesis pathway in bacteria. Using the recently characterized system of Caulobacter crescentus, they have performed a survey of different PseI/LegI/NeuB genes from various bacteria, checking whether they could rescue the motility defect in C. crescentus ΔpseI cells. They found that those genes that did confer motility also had higher sequence similarity to C. jejuni PseI than to C. jejuni LegI or C. jejuni NeuB. They also found that these genes also restored flagellin glycosylation as checked by mobility shift on gel electrophoresis with immunoblotting to anti-FljK antibody. This survey brought up an interesting finding that the PseI/LegI/NeuB orthologs of the closely related Brevundimonas species were unable to confer motility to C. crescentus ΔpseI cells, and were more similar to C. jejuni LegI than to C. jejuni PseI or C. jejuni NeuB. They also performed similar glycoprofiling experiments using B. subvibrioides ΔlegIBs cells and various PseI/LegI/NeuB orthologs from different bacteria, which indicated the restoration of motility by putative LegI synthases. Kint et al demonstrate flagellin glycosylation in B. subvibrioides by performing in-frame deletions of FlmG, and LegI genes in B. subvibrioides and checking for motility, presence of flagella, and flagellin glycosylation by motility shift on gel electrophoresis. Further, they confirm the critical nature of GDP-GlcNAc for Leg biosynthesis by assessing flagellin glycosylation and motility in B. subvibrioides with an in-frame deletion in PtmE/LegX and by performing heterologous complementation with an M. humiferra PtmE ortholog. They also reconstitute the legionaminic acid biosynthesis pathway in C. crescentus cells that lack flagellins, PseI and FlmG, and show that the heterologously expressed B. subvibrioides flagellin is glycosylated by heterologously expressed B. subvibrioides FlmG. Finally, they also show that whereas the CcFlmG cannot substitute for BsFlmG and vice versa, a chimeric FlmG bearing the TPR domain from C. crescentus FlmG (that recognizes C. crescentus FljK) and the GT domain from B. subvibrioides FlmG (that transfers CMP-Leg) modifies CcFljK in C. crescentus cells that lack CcFlmG but express both Pse (endogenously) and Leg (from the reconstituted pathway). This demonstrates the modularity of the FlmG glycosyltransferases. Kint et al provide the chemical nature of C. crescentus flagellin glycosylation. Kint et al have analyzed the glycans released from the flagellin by acid hydrolysis and clearly shown the nature of the glycan in C. crescentus flagellin to be Pse4Ac5Ac7Ac by use of Pse standards. The glycan from B. subvibrioides was distinct from the Leg standard used, and could be a Leg derivative distinct from Leg5Ac7Ac.

      Major comments:

      1. Table 1 and Text in Results, lines 116-119, "In support of the notion that derivatization occurs after the PEP-dependent condensation reaction to form Pse or Leg, our glyco profiling analysis revealed that putative PseI proteins (identified by sequence comparisons to C. jejuni 11168, Table S1) conferred motility to C. crescentus ΔpseI cells, whereas putative LegI synthases did not." Not clear how putative PseI and LegI synthases were identified. Table 1 only lists overall percent sequence identity and similarity to Cj PseI, LegI and NeuB, and percent identities and similarities of the various nonulosonic synthases to these proteins are in the similar range, as expected. In the absence of sequence alignments indicating the presence of conserved residues, particularly related to the substrate binding region, that are distinct in these paralogs, calling out the type of synthase based on the highest percent identity (to Cj PseI, LegI or NeuB) is speculative. Also, Shewanella oneidensis does not follow the pattern of highest similarity to NeuB3. Second, in the absence of data showing that the Leg and Pse found in these different organisms actually are different derivatives, this does not support that "derivatization occurs after the PEP-dependent condensation reaction to form Pse or Leg".
      2. Related to (1), Text in Results, lines 130-131, "We conclude from our survey that (heterologous) PseI synthase activity generally confers motility to C. crescentus ΔpseI cells, whereas LegI-type (or NeuB-type) synthases are unable to do so." There is no a priori evidence provided indicating that these were PseI or LegI type synthases. So the conclusion really is that assuming only PseI type synthases would be able to rescue the motility defect in C. crescentus ΔpseI cells, this glyco-profiling motility assay now provides the first biochemical evidence telling us which synthases are Pse-type, and which are Neu/Leg-type. And in my view, this is the conclusion of greater significance in the field - to be able to now identify which is a PseI and which a LegI based on these complementation assays. However, if the authors still wish to retain their original conclusion, they could cite or provide evidence (either biochemical evidence in this work or reported literature regarding the sugar synthesized or bioninformatics analysis regarding the presence of distinct genes such as the Ptm genes for legionaminic acid biosynthesis pathway or genes that differ in their enzyme activities and overall fold such as PseB/LegB or PseG/LegG in the gene neighborhood) indicating or suggesting the PseI/LegI/NeuB nature of the different synthases. Also, methods for the bioinformatics analysis (eg. BLASTp settings used, dates of searches, whether regular BLAST or PSI-BLAST was used, etc.) are missing in the manuscript, and need to be included.
      3. It is interesting that there is still a signification amount of flagellin secretion/assembly in the B. subvibrioides LegI and FlmG mutants. It will be good to see a discussion about whether this is likely from due to low level of function despite the in-frame deletion of genes; how many flagellin subunits are likely to have managed secretion and assembly in these short flagella; whether there is any redundancy of LegI / FlmG (perhaps with lower levels of expression); considering Parker and Shaw's findings of glycosylation being required for flagellin binding to the chaperone and subsequence secretion in A. caviae whether there is a FlaJ homolog in B. subvibrioides. Also, can the authors rule out the possibility that absence of glycosylation does not affect flagellin assembly but makes the flagellum prone to shear/breaks in B. subvibriodes, resulting in smaller flagella? How many flagellins are there in B. subvibrioides? Is it possible that one is glycosylated but another/others are not, and that is the reason for the small flagellum in these mutants?
      4. Text in Results, lines 170-171, "We then probed the resulting ΔlegIBs and ΔflmGBs single mutants for motility defects in soft agar and analyzed flagellin glycosylation by immunoblotting using antibodies to FljKCc". Was the antibody to FljKCc determined to also specifically bind to FljKBs? Also, how many flagellins are there in B. subvibrioides? Are all detected with this antibody?
      5. It is interesting that C. cresentus cells expressing Pse (endogenously) and Leg (reconstituted pathway), and BsFlmG and BsFljK (corresponding to Figure 5C) are not motile. Was the motility assay done for the experiment of figure 5B as well? Are the C. crescentus cells lacking Pse and FlmG but with heterologous expression of Leg and BsFljK and BsFlmG also non-motile? Also, it will be good to see the TEM images for these cells.
      6. Immunoblotting of the supernatants should be shown (in addition to the cell extracts) for Figures 5B and 5C so that the reader can appreciate whether glycosylation has taken place but secretion/assembly has not. Further, HPLC of the acid extracts from flagellin could be done to unambiguously show whether the CcFlmG has transferred Pse and the BsFlmG and Cc-BsFlmG have transferred Leg on to the CcFljK in Figure 5c, and the identity of the sugar, if any, transferred by CcFlmG in the absence of Pse, and BsLeg genes or BsLegX gene in figure 5B.
      7. Text in discussion, lines 334-338, "By extension, having recognized the LegX/PtmE enzyme as a critical element in the Leg-specific enzymatic biosynthesis step (Figure 6) likewise offers another functional, but also a novel bioinformatic, criterion for the correct assignment and discrimination of predicted stereoisomer biosynthesis routes residing in ever-expanding genome databases" It will be nice to see a discussion on the prevalence of PtmE versus GlmU (or equivalent gene), PtmF, PtmA, PgmL in the Leg synthesizing organisms. Is the PtmE but not the other genes found in all cases, which makes it better as a molecular determinant for bioinformatics predictions of the type of pathway? Also, on whether PtmE has any homology to genes in other pathways (not associated with flagellin glycosylation) and how reliable a marker it is to differentiate Leg biosynthesis from Neu5Ac biosynthesis pathways.
      8. It is not clear from the methods or the figure legends how many times the immunoblotting, motility experiments were done; how many experiments/trials are the images representative of?

      Minor comments:

      1. The gene for GlcN-1-P guanylyltransferase in the Leg-specific enzymatic biosynthesis step is already known as PtmE from the work of Schoenhofen's group. For the sake of consistency, it would be better to retain the nomenclature as PtmE throughout the manuscript instead of introducing the name LegX, which makes it sound like it is a previously unknown gene.
      2. Text in abstract, lines 15-17: "Sialic acids commonly serve as glycosyl donors, particularly pseudaminic (Pse) or legionaminic acid (Leg) that prominently decorate eubacterial and archaeal surface layers or appendages" The glycosyl donor is the nucleotide sugar and not the nonulosonic acid or sialic acid... rephrasing required for accuracy.
      3. Text in abstract, lines 18: "a new class of FlmG protein glycosyltransferases that modify flagellin" The authors are presumably referring to FlmG as the new class of protein glycosyltransferases... rephrasing required for accuracy
      4. Text in introduction, lines 41-42 "Pse and Leg derivatives synthesized in vitro can be added exogenously in metabolic labeling experiments" It should be "derivatives of Pse and Leg precursors" and not "Pse and Leg derivatives"
      5. Text in introduction, line 46 "Pse- or Leg-decorated flagella may also be immunogenic." This sentence is not referenced and it is not clear why it is written here.
      6. Text in introduction, lines 63-66 "The synthesis of CMP-Pse or CMP-Leg proceeds enzymatically by series of steps [20-22], ultimately ending with the condensation of an activated 6-carbon monosaccharide (typically N-acetyl glucosamine, GlcNAc) with 3-carbon pyruvate (such as phosphoenolpyruvate, PEP) by Pse or Leg synthase paralogs, PseI or LegI, respectively" The synthesis begins with activated GlcNAc. The substrate for condensation is not activated GlcNAc. It is 2,4-diacetamido-2,4,6-trideoxy-D-mannopyranose in case of LegI and 2,4-diacetamido-2,4,6-trideoxy-b-L-altropyranose in case of PseI.
      7. Text in introduction, line 70 "for used as glycosyl donors" Typographical error, "for use as glycosyl donors"
      8. Text in Results, line 102, "C. crescentus only encodes only PseI" Do the authors mean "only one PseI"?
      9. Text in Results, lines 108 and 109, "Such modifications could occur before the PseI synthase acts or afterwards. In the latter case, most (if not all) synthases would be predicted to produce the same Pse molecule," Do the authors know of any reports of modifications occurring before the PseI synthase? Please cite references, if known. Why "most (if not all)"? If the former case is true, the PseI synthase might not be able to accept the substrate.
      10. Text in Results, lines 141-143, "our bioinformatic searches using C. jejuni 11168 as reference genome identified all six putative enzymes in the B. subvibrioides ATCC15264 genome (CP002102.1) predicted to execute the synthesis of Leg from GDP-GlcNAc" Not clear how this was done. Do the authors mean that they used the genes from C. jejuni 11168 as the query genes to identify homologs in B. subvibrioides ATCC15264 genome (CP002102.1)? Or did they use putative genes from B. subvibrioides ATCC15264 genome (CP002102.1) and pull out homologs from C. jejuni 11168 by using C. jejuni 11168 as the reference genome?
      11. At first reading, the flow of the manuscript is difficult to follow due to the figures not appearing in full in order of their occurrence. For instance, Figures 5B and 5C are discussed only in the end of the manuscript after the results of Figures 6 and 7. Other instances also exist. The authors may consider re-ordering the figure parts if possible so that all parts of each figure appear in order of occurrence in the manuscript text.

      Significance

      The nonulosonic acids, Pseudaminic acid and Legionaminic acid, are abundant in bacterial systems in the capsular and lipopolysaccharides as well as in glycoprotein glycans. The Ser/Thr-O-nonulosonic acid glycosylation of flagellins has been studied with respect to the system of Maf glycosyltransferases in Campylobacter jejuni, C. coli, Helicobacter pylori, Aeromonas caviae, Magnetospirillum magneticum, Clostridium botulinum and Geobacillus kaustophilus, and recently with respect to the system of FlmG glycosyltransferases by Viollier's group in Caulobacter crescentus. However, the determinants that govern the glycosyltransferase function are not still well known. Kint et al have performed excellent work using bacterial genetics tools to (1) highlight the "functional insulation" of the Leg and Pse biosynthesis pathways, (2) demonstrate the modularity of the FlmG glycosyltransferase proteins with respect to the flagellin binding and glycosyltransferase domains. This work makes a significant advance in the field with respect to (1) understanding flagellin glycosylation by FlmG, (2) making designer protein Ser/Thr-O-glycosyltransferases, and (3) bioinformatics analysis of genomes with respect to the Pse/Leg/Neu nonulosonic acid biosynthetic potential encoded. The findings will be of great interest to scientific audiences working in the areas of glycobiology and bacteriology. My area of expertise: Maf flagellin glycosyltransferases

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Viollier and co-workers present a study in which they preform an elegant and rigorous genetic profiling of the the legionaminic and pseudaminic acid biosynthesis and flagellar glycosylation pathways in C. crescentus (native Pse) and B. subvibrioides (native Leg). They use motility as a representative readout for functional flagellar glycosylation with these microbial sialic acids. They discover orthologous Pse synthase genes can replace the function of the native synthase in C. crescentus and orthologous legionaminic acid synthase genes can achieve the same in B. subvibrioides. However, not vice versa indicating a strong preference for each microbial sialic acid stereoisomer in these species. For the Leg biosynthesis pathway, which requires GDP-GlcNAc, the authors also identify LegX as an essential component to synthesize this sugar nucleotide and thus a marker for Leg biosynthesis pathways. Upstream in theses pathways, they also identify a new class of FlmG flagellar protein glycosyltransferases. Importantly, through heterologous reconstitution experiments to uncovered that these glycosyltransferases possess two distinct domains, a transferase domain the determines specificity for either CMP-Leg or CMP-Pse, and a flagellin-binding domain to achieve selectivity for the substrate. Interestingly, by creating chimeric FlmG for these two domains between C. crescentus and B. subvibrioides they show that these two modular parts can be interchanged to adapt flagellin glycosyltransferase specificity in these species.

      Major comments:

      The key conclusions of the manuscript by Viollier and co-workers are convincing and well supported by their experiments and used methods, with respect to the insulation of the Leg and Pse biosynthetic pathways, they key role of LegX in launching the Leg pathway and the successful reconstitution of Leg glycosylation in a previously Pse-producing C. crescentus strain. Finally, they convincingly show that a chimeric version of the involved glycosyltransferases is functional, which besides intriguing future glycoengineering possibilities also emphasizes the two discrete domains in these transferases that dictate their sugar nucleotide and acceptor specificity. There is one additional experiment I would suggest with relation to the detection and confirmation of Pse and Leg on flagella of respectively, C. crescentus and B. subvibrioides. In the case of C. crescentus the detected DMB derivatized monosaccharide co-elutes with a validated standard of tri-acetylated Pse, which is convincing evidence of its identity. However, for B. subvibrioides. Their DMB derivatized monosaccharides from its flagella, results in a peak the does not co-elute with the only Leg standard (Leg5Ac7Ac) they have, it does elute at the same time as their Pse standard. Although it cannot of course be Pse as B. subvibrioides. Does not possess a Pse biosynthesis pathway, it also does not provide enough evidence to conclude that it is a Leg derivative. An MS(-MS) measurement of the eluted signal would not be a big investment in time and resources and would provide additional evidence to at least assign this peak to microbial sialic acid related to the present Leg biosynthesis pathway. It the identified mass would lead to identification of the derivative, it would also add to the proper characterization of the flagella glycosylation in the bacterium. The data and the methods presented in this study are presented with sufficient detail so that they can be reproduced? However, I would suggest as is common nowadays in most journals that the authors include images of the raw unprocessed blot in de supporting info.

      Minor comments:

      There are a few textual errors that the authors should fix:

      • page 2, line 70: change "used" to "use"
      • page 11, line 407: add the word "are" after Pse
      • On page 2, line 36, the authors state that "most eubacteria and the archaea typically decorate their cell surface structures with (5-, 7-)diacetamido derivatives, either pseudaminic acid (Pse) and/or its stereoisomer legionaminic acid (Leg,". This should be nuanced as to my knowledge it is not most eubacteria, but more a subset as identified by Varki in his seminal PNAS paper. The authors clearly present their data and conclusions in the figures of this manuscript. However, I would recommend the take a critical look at the drawing of their monosaccharide chair conformations and the positioning of the axial and equatorial groups on these chairs in Figure 1 and 5, as these are in most cases drawn a bit crooked, which can easily be corrected.

      Significance

      The family of carbohydrates called sialic acids was long thought to exclusively occur in glycoproteins and glycolipids of vertebrates, but has since also been found in specific microbes. Especially symbiotic and pathogenic microbes associated with the humans express a wide array of unique microbial sialic acids for which their functional roles are not well understood and the associated glycosylhydrolase and glycosyltransferase have in most cases not been identified yet. The authors present an impressive insight into flagellar glycosylation with Pseudaminic and Legionaminic acid in two bacterial species, using genomic analysis, rewiring, immunoblots and motility assays as their main tools. They provide compelling evidence on the insulation of the Pse of Leg pathway in these species, the flexibility in exchanging between biosynthetic enzymes from the same pathway between various species. Crucially, most glycosyltransferases that add the Pse or Leg glycoform onto various acceptor sites in bacteria, have up to this point remained elusive in most cases. It is therefore very valuable information that the authors here provide on the involved glycosyltransferases. Especially, on the two domains that govern their sugar nucleotide and acceptor specificity, and that these can be reengineered as chimeric glycosyltransferases. To me as a chemical glycobiologist this provides compelling possibilities for glycoengineering possibilities in future studies in the field to elucidate the functional roles of Pse and Leg glycosylation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, authors establish a glyco-profiling platform for the functional analysis of genes involved in pseudaminic (Pse) and legionaminic (Leg) acid biosynthetic pathways. They used B. subvibroides and C. crescentus specific mutants in pseI and legI genes involved in the Pse and Leg biosynthesis, respectively, and cross-complementation assays with orthologous genes from different bacterial species, analysing motility and flagellin glycosylation. These assays show that Pse and Leg biosynthetic pathways are genetically different and recognize the LegX enzyme as a critical element in the Leg-specific enzymatic biosynthesis. Since that legX orthologous were only identified in the genome of bacteria with Leg biosynthetic pathways, it becomes a good marker to distinguish Leg from Pse biosynthesis pathways and a novel bioinformatic criterion for the assignment and discrimination of these two pathways. Reconstitution of Leg biosynthetic pathway of B. subvibroides in the C. crescentus mutant that lack flagellins, PseI and FlmG, complemented with both flagellin and FlmG of B. subvibroides, identified a new class of FlmG protein glycosyltransferases that modify flagellin with legionaminic acid. Furthermore, the construction of a chimeric FlmG through domain substitutions, allowed to reprogram a Pse-dependent FlmG into a Leg-dependent enzyme and reveal two modular determinants that govern flagellin glycosyltransferase specificity: a glycosyltransferase domain that accepts either Leg or Pse, and a specialized flagellin-binding domain to identify the substrate.

      Major comments:

      The conclusions obtained are convincing and well-supported. However, I think some points should be specify or clarify.

        • In the mutants (pseI, legI, flmG,...) the non-glycosylated flagellin are exported and assembled in a flagellum filament shorter than the WT strain. However, motility in plates is absent or very reduced. This might be produced by instability of the flagellum filament when rotating in a semi-solid surface. MET was performed from plates or liquid cultures? Do the author analyses motility in liquid media? If they did, changes in motility were observed?
        • In page 5, lines 158-163, the analysis, by HPLC, of derivatized nonulosonic acid from B. subvibroides flagella, shows a major peak at 9.8 minutes retention and a minor peak at 15.3 minutes. Since that Pse-standard have retentions peaks at 9.7 and 13 minutes, and Leg-standard at 12.3 minutes, the authors cannot infer, only with these data, the flagella sugar is a legionaminic acid derivative. In my opinion, should be included that inference comes from the data obtained by HPLC analysis and genetic approaches.
        • In page 5, line 173-175. Authors indicate, "While no difference in the abundance of flagellin was observed in extracts from mutant versus WT cells, flagellin was barely detectable in the supernatants of mutant cultures, suggesting flagellar filament formation is defective in these mutants". MET images show that the flagellum filament length is shorter in the mutants than in the WT strain. Therefore, if the same number of mutants and WT cells has been used in the immunodetection assays, there should be more flagellin monomers in the WT samples than in the mutants ones and flagellin bands should be less intense in mutant samples corresponding to the anchored flagellum. Why bands corresponding to flagellin in mutants and WT show similar intensity in the immunodetection assays (Figure 3C and D)? Furthermore, in lane 177-178, authors suggest that LegI and FlmG govern flagellin glycosylation and export (or stability after export). However, if filament stability is affected, the amount of flagellin monomers in the supernatant of mutants should be higher than in the WT. However, immunodetection assays show less abundance of flagelin monomers in the supernatant of mutants. Please, can you clarify this? In relation to this point, I suggest that authors include, in the experimental procedures, how they obtained the supernatants to flagellin immunodetection, as well as why they used anti- FljKCc anti-serum to detect the B. subvibroides flagellin.
        • Authors demonstrate the specificity of the GT-B domain of FlmG, using a chimeric FlmGCc-Bs in a mutant of C. crescentus that lacks FlmG and harbour the Leg biosynthetic pathway of B. subvibroides. However, since that TPR comes from C. crescentus, this chimeric protein, could be transfer the legionaminic acid to the flagellin of B. subvibroides? Furthermore, the complementation of this mutant with the FlmGBs did not support efficient flagellin modification and this might be related to the TPRCc domain. Therefore, in my opinion, the chimeric protein should be introduced in the B. subvibroides∆flmG background.
        • Page 8, line 299-301. Authors point out that C. crescentus that lacks FlmG and harbour the Leg biosynthetic pathway of B. subvibroides and the chimeric FlmGCc-Bs, although it has a glycosylated flagellin, whose mobility in SDS-PAGE is like the WT strain, is non-motile. They suggest that additional factors exist in the flagellation pathway that exhibit specificity towards the glycosyl group that is joined to flagellins. However, would be interesting to see if the flagellum filament has similar length to the WT strain or at least, it has increased in relation to the flagella length of the mutant. If flagella length has not increased, it could suggest that changes in the glycan type might affects the flagellin assembly or the stability of the flagellum filament. Therefore, would be also important to analyse its motility in liquid media.

      Minor comments:

        • Pag 3 line102. Please change ".....two predicted synthases, a PseI and LegI homolog, and C. crescentus only encodes only PseI...." to ".....two predicted synthases, a PseI and LegI homolog, and C. crescentus only encodes a PseI...."
        • Figure 2 A. Plasmid nomenclature (Plac-neuB) is confusing because C.c. ΔpseI cells express predicted LegI or PseI synthases. Please change to Plac, as in Figure 2B and 4. Figure 2A and 2B do not contain any complementation with Bacillus subtilis (Basu), however two complementation are labelled as Bs in Figure 2A and 2B. Furthermore, no Bs are present in the Figure 2 legend.
        • Legend of figure 3 should include B. subvibrioides abreviation Bs. Line 774: Please change ".......glycosylation and secretion in B. subvibrioides." to ".......glycosylation and secretion in B. subvibrioides (Bs)."
        • Figure 3. In order to keep a similar nomenclature in all plasmids, plasmid Plac-legI syn and Plac-flmG should be labelled as Plac-legIBs syn and Plac-flmGBs.
        • Legend of figure 4 should include B. subvibrioides abreviation Bs. Line 791: Please change "....... complementation of the B.subvibrioides ΔlegI mutant with ...." to "....... complementation of the B.subvibrioides (Bs)ΔlegI mutant with ...." Furthermore, Legend of figure 4 indicate in line 795, that immunoblots reveal the intracellular levels of flagellin, however figure 2 and 3 show immunoblot of cell extracts. Please, correct this sentence.
        • Legend of figure 5, 6 and 7 should include B. subvibrioides abreviation Bs. Line 808: Please change "Predicted Leg biosynthetic pathway in B. subvibrioides " to"Predicted Leg biosynthetic pathway in B. subvibrioides (Bs)" Line 834: Please change "....affects motility, flagellin glycosylation and secretion in B. subvibrioides."to "....affects motility, flagellin glycosylation and secretion in B. subvibrioides (Bs).Line 852: Please change "...acetyltransferase in flagellar motility of B. subvibrioides cells." to ""...acetyltransferase in flagellar motility of B. subvibrioides (Bs) cells." Furthermore, figure 5 should include C. crescentus abbreviation. Line 815: Please change "....whole cell lysates from C. crescentus mutant cultures......." to "....whole cell lysates from C. crescentus (Cc) mutant cultures......."
        • In my opinion it would be useful to include a scheme of the gene organization involved in Leg biosynthesis in B. subvibrioides.
        • Legend of figure S1 should include B. subvibrioides (Bs) and C. crescentus (Cc) abbreviations. Line 888-867: Please change "...C. crescentus ΔpseI cells and B. subvibrioides ΔlegI cells with plasmids expressing..." to "...C. crescentus (Cc) ΔpseI cells and B. subvibrioides (Bs) ΔlegI cells with plasmids expressing..." Furthermore, the name and abbreviations (Mm, So, Ku, Pi, Dv) of the species used should be included in the legend. Why the authors used a plasmid with a Pvan promoter in these assays? Why the authors changed the code color of pseI and legI orthologous genes? It would be more useful and understandable follow the code color used in figure 2 and 4.
      1. Page 6 line 200, Please change ".....complementing synthases exhibit greater overall sequence similarity to LegI than Pse of C. jejuni. 22268,....." to ".....complementing synthases exhibit greater overall sequence similarity to LegI than PseI of C. jejuni. 22268,....."
        • Page 7 line 231, Please change ".....negative bacteria A. baumannii LAC-4 (GCA_000786735.1)[38] and P. sp. Irchel 3E13..." to ".....negative bacteria A. baumannii LAC-4 (GCA_000786735.1)[38] and Pseudomonas sp. Irchel 3E13..."
        • Introduce a line break between line 503 and 504.
        • Page 14 line 543, please change "XbaI" to "XbaI"

      Significance

      This is an interesting manuscript that contributes to the knowledge of the legionaminic biosynthetic pathway and establish a glyco-profiling platform for the functional analysis of genes involved in pseudaminic (Pse) and legionaminic (Leg) acid biosynthetic pathways. The analysis of Leg patway allowed to identify a gene (legX) that can be used to distinguish Leg from Pse biosynthesis pathways, becoming a bioinformatic tool for the assignment and discrimination of these two pathways. Furthermore, a new class of FlmG protein glycosyltransferases, able to transfer Leg to the flagellin, has been identified and its analysis reveal two modular determinants that govern flagellin glycosyltransferase specificity: a glycosyltransferase domain that accepts either Leg or Pse, and a specialized flagellin-binding domain to identify the substrate.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      the authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Wong et al., provides the first report of a homozygous loss-of-function mutation of the RAF1 gene in humans. The mutation (T543M) was found in two siblings of a consanguineous family in association with perinatal death and multiple developmental abnormalities. These abnormalities show strong similarities with a rare congenital malformation syndrome with unknown aetiology named Acro-cardio-facial syndrome (ACFS, MIM600460). Conversely, reported abnormalities are different from those observed in RASopathies, congenital diseases caused by gain-of-function mutations in either the RAF gene or genes implicated in the same signaling pathway (MAPK) and that induce its ectopic/over-activation. By performing functional experiments in cellular systems (cell line where the mutated RAF was either overexpressed or knocked-in) and in Xenopus Laevis embryos, the authors demonstrate that the reason for the phenotypic differences compared to RASopathies is that the RAF1T543M variant impairs the signaling activity of RAF and blunts MAPK pathway activation. In particular, the RAF1T543M variant: 1) is not actively phosphorylated at key activating residues, 2) is unable to transduce MAPK signaling towards MEK/ERK substrates, 3) is inherently unstable and prone to proteasome-mediated degradation and 4) is unable to block stress-induced apoptosis. On the basis of increased apoptosis detected in cellular systems and some morphological defects observed in the probands, the author classify this novel syndrome as a segmental progeroid syndrome.

      Major comments:

      The authors perform a thorough analysis of the structural and functional defects of the RAF1T543M protein, using in silico analyses and in vitro systems. The data are corroborated by an elegant and clear-cut experiment in Xenopus embryos that demonstrates that the mutated RAF is not able to transduce MAPK signaling when overexpressed in an in vivo model. The lack of patients' material prevents a validation in human cells, but I think the evidence collected in the manuscript is supportive of the loss-of-function mutation in RAF as the causative mutation of the observed phenotype.

      The concept illustrated in the last sentence of the discussion, i.e. that different mutations in the same genes can either cause cancer (when overactivating) or premature aging (when blunting the activity of the enzyme) is fascinating. I think discussing this concept in the context of this manuscript is appropriate. However, the authors classify the syndrome caused by the RAF1T543M variant as a "novel segmental progeroid syndrome" in the title, abstract and first sentence of the discussion. I don't think that presented data are convincing for this classification. Indeed, two affected siblings die at very early post-natal stages (7 and 50 days, respectively) likely because of malformations that are not compatible with life and that are due to altered in uthero development. Progeroid syndromes are a heterogeneous group of syndromes, a minority of which is characterized by malformations at birth. The more general concept is that physical abnormalities are progressively acquired during post-natal life, in specific tissues that show typical features associated with aging, including senescence, decline of the stem cell compartment, increased inflammation. In the case of individuals carrying the RAF1T543M variant, none of these tissue abnormalities have been reported (due to the lack of material from patients) and the classification as a progeroid syndrome is based on external inspection of organs. Reported phenotypes are not specific and a failure of in uthero development of heart, limbs and other organs seems like the absolutely predominant trait. This in uthero phenotype is perfectly consistent with the physiological role of the MAPK pathway downstream multiple RTKs that transduce morphogenetic, in addition to mitogenic, signals. As for the authors words: "endogenous FGF/FGFR signaling is required for proper mesoderm and neural induction" and supports the author claim that the analysis of individuals carrying the RAF1T543M variant "underscores the importance of RTK signaling during human development". I think that the "progeroid" phenotypes are less clear. It is still important to present the phenotypes reminiscent of progeroid syndromes and discuss them and perhaps more clearly explain the logical connection with the increased apoptosis observed in in vitro experiments.

      In regard to the apoptosis experiment in figure 4, it supports the claim that RAF1 activity is necessary for protection from stress-induced apoptosis. However, the cartoon presented in figure 4C also shows that this process is mediated by increased ASK1/MST2 signaling. This part of the cartoon is based on the literature and has not been formally demonstrated. The authors can try to rescue the apoptotic status by silencing ASK1 in the RAF1T543M cells, similar to what has been done in Yamaguchi, O., 2004, Cardiac-specific disruption of the c-raf-1 gene induces cardiac dysfunction and apoptosis. J. Clin. Invest. 114, 937-943. Literature also suggests that RAF1-mediated inhibition of apoptosis is kinase-independent. This part is particularly interesting since the variant produces a protein that is both kinase-inactive and unstable. It would be a nice addition to the matuscript if the authors could clarify, at least in their cellular model, whether the increased apoptosis is due to the loss of either the protein or the kinase activity.

      Minor comments:

      A minor comment to the Xenopus ISH: the number of embryos that have been analyzed per condition is not reported anywhere. Figure legend explains that presented images are "representative pictures", but knowing the number of tested embryos and the penetrance of the phenotype would help undertsand the relevance of the RAF mutation in the signaling pathway under investigation.

      Significance

      This work by Wong and colleagues provides conceptual advance and will be of interest for researchers dealing with RTK/MAPK signaling in multiple contexts including oncology, developmental biology and cardiomyopathy.

      The gene under investigation is widely studied in cancer, where gain-of-function, oncogenic mutations are common. The role of RAF1 during embryonic development is less known. A couple of studies have investigated the developmental phenotype of mouse models carrying either a knock-out allele (Mikula, M, et al. Embryonic lethality and fetal liver apoptosis in mice lacking the c-raf-1 gene. EMBO J. 2001. 20:1952-1962) or an allele producing a truncated protein (Wojnowski L et al., (1998) Craf-1 protein kinase is essential for mouse development. Mech Dev, 76, 141-149). A few studies have investigated the conditional inactivation in specific tissues, such as the cardiac muscle (Yamaguchi, O., 2004, Cardiac-specific disruption of the c-raf-1 gene induces cardiac dysfunction and apoptosis. J. Clin. Invest. 114, 937-943.). And one study has found heterozygous carriers of a loss-of-function mutation among cohorts of children affected by dilated cardiomyopathy (Dhandapany, P.S., et al. (2014). RAF1 mutations in childhood-onset dilated cardiomyopathy. Nat. Genet. 46, 635-639). The manuscript reports the spectrum of defects acquired during embryonic development by carriers of a pathogenic mutation in the RAF1 gene. The mutation impairs both stability and kinase activity of the protein. The manuscript points out the non-redundant role of RAF1-mediated signaling in specific organs during embryonic development.

      The person who is reviewing the manuscript has expertise in cellular signaling, mouse embryonic development and human aging.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors describe a homozygous missense variant in RAF1 identified in a child with a lethal malformation syndrome. The T543M variant not only blocks the kinase activity, but also destabilizes the RAF1 protein. In combination, this leads not only to a loss of MAPK signaling, but also to elevated apoptosis upon cell stress, which both together very likely explain the dramatic phenotype, which may correspond to the rarely described acro-cardio-facial syndrome.

      Major comments:

      The results of the functional analysis presented by the authors are highly convincing and clearly demonstrate a LOF effect of the identified variant. The experiments are described in sufficient detail and the statistical analysis is sound. Depending on the target journal the clinical information might be a bit scarce. There are neither clinical pictures nor molecular data from the first affected sibling. For a more clinically oriented journal, a summary of clinical features in form of a table and a comparison to ACFS will be a prerequisite and facilitates appreciation of this very special phenotype. The affected child seems much older than his chronological age, which seems due to the hypoplastic viscerocranium and the loss of subcutaneous fat tissue. The heart and limb defects have no link to chronological aging. Whether such phenotypes should be really called progeroid can be debated, but it is common practice (and always a good selling point). The authors do not discuss their findings in the light of chronological aging. The hypothesis that the RAF1 LOF liberates ASK1 leading to increased apoptosis is very attractive. It would be very nice to show some experimental data proving this, but such data is not pivotal for the paper.

      Minor comments:

      The facial and overall progeroid phenotype of the affected child is quite different from most of the described ACFS patients. In contrast, the prominent neurocranium with low hairline and the hypoplastic viscerocranium remind this reviewer of Gorlin-Chaudhry-Moss/Fontaine-Petty syndrome. This differential diagnosis is also interesting since the disorder is caused by GOF variants in a mitochondrial ATP transporter increasing the sensitivity for apoptosis. This underlines the suggested link between RAF1 and the apoptosis inducer ASK1. There is probably no cellular function that has not been linked to the MAPK pathway. One of these connections is to cellular aging. Strongly activating variants in pathway members lead to oncogene-induced cellular senescence. Here, we have a progeroid phenotype, but a variant with LOF effect. This might be worthwhile to discuss. The discussion could also use some sentences on the SHSF mechanism. FGFs produced by the AER are important for proliferation of the limb mesenchyme. What is the connection between MAPK and p63?

      Significance

      This is a central signaling pathway and a prominent and historically outstanding oncogene. Up to now all variants in the MAPK pathway have been described as GOF and this is the first phenotype related to a LOF. Thus, this is a landmark paper widening the view on the pathway. The paper is interesting for clinical and molecular geneticists, tumor biologists, and cell biologists. This reviewer is a biochemist and clinical geneticist with research focus on mechanisms of skeletal and connective tissue disorders.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the present manuscript, Wong et al describe for the first time the human phenotype resulting from the bi-allelic germline T543M mutation in the RAF1 proto-oncogene. Although somatic RAF1 mutations are outnumbered by BRAF mutations, they also occur in human cancer, but so far germline RAF1 mutations, usually dominant-acting ones abrogating RAF1 autoinhibition, have been mainly associated with RASopathies, in particular with Noonan syndrome. Here, Wong et al describe for the first time that the two individuals with a homozygous T543M mutation is associated with a neonatal lethal progeroid syndrome presenting with cutaneous, craniofacial, cardiac and limb anomalies. Moreover, the affected residue, despite its conservation across RAF1 orthologues and in the ARAF and BRAF paralogues, has neither been described as a target of somatic and germ-line mutation in cancer and RASopathies, respectively. The authors then use a combination of ectopic expression experiments in HEK293T cells and Xenopus embryos as well as CRISPR/Cas9 genome editing and structural bioinformatics to provide several lines of evidence that RAF1 T543M represents a hypomorph suppressing ERK activation. In summary, this is a very interesting and well-written manuscript with a stimulating discussion of the novel and convincing findings and the mechanisms driving the described pathology. The suggestions below might further strengthen this otherwise already very advanced manuscript.

      Major:

      1. The data presented in this manuscript imply that RAF1 T543M represents a hypomorph suppressing ERK pathway activity. Nevertheless, it remains unclear whether this mutation reduces/abolishes the intrinsic activity of RAF1 or acts by a dominant-negative mechanism, e.g. by sequestrating critical components of RAF complexes, such as KSR proteins, or MEKs. To distinguish between both possibilities, it would be helpful, if the authors were able to document the intrinsic kinase activity of immunoprecipitated RAF1 T543M (and appropriate controls such as wildtype RAF1, the S257L gain-of-function allele and a truly kinase-dead variant, e.g. by mutating the aspartate of the DFG motif) towards recombinant and commercially available GST-MEK1.
      2. RAF kinases engage in complex homo- and heterodimerization events. Does RAF1 T543M form homo- and heterodimers, e.g. with BRAF or ARAF, to a similar or different extent than wildtype RAF1?

      Minor:

      1. Abstract. To my knowledge, RAS but not RAF was the first human oncogene identified, albeit the discovery of RAF and its viral counterparts also took place very early in the oncogene discovery era.
      2. Introduction, p. 2 "...while Braf-/- mice are embryonic lethal due to vascular defects.7" For a more balanced review, I would suggest citing additional and more recent papers describing the complex phenotypes of Braf deficient mice, e.g. PMIDs: 18332218 and PMID: 16432225
      3. Figure 1D. Here, CRAF is used instead of RAF1. As the latter is the official gene/protein name RAF1 should be used to avoid confusion to readers outside of the RAF field.

      Significance

      Very interesting novel findings to researchers being active in the signalling, cancer and developmental biology fiels as well as to human geneticists and pediatricians.

      My expertise: MAPK signalling, functional characterization of oncogenic mutations

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      In this paper we exploit the model system of the Y loops in Drosophila spermatocytes to provide a super-resolution view of the topology of active transcription. We are pleased that Reviewer 1 assesses the work as “highly significant”. They also mention the caveat that findings may be “specific to the model system”. Reviewer 2 provides a counter to this caveat by pointing out that our findings are in line with some previous reports in mammalian cells. The interpretation in previous studies has however been hindered by the density of chromatin in the nucleus making it difficult to clearly identify individual fibres and Reviewer 2 points out that our work provides a significant advance as in our system we are able to “visualize the organization of chromatin within an individual, isolated loop”.

      *Reviewer #1 *

      ∙ Have you assessed if you are under-labeling the nucleosomes? or RNA Polymerase?

      Yes, we tried a range of antibody concentrations which did not change the observations. We have added a statement in the M&M; “To control for under-labelling of structures, we tested a range of antibody concentrations and found no significant difference in the observed nucleosome or RNAPol distribution.”

      *∙ Can you stimulate and inhibit transcription to see how changes in RNA Polymerase and nucleosome organisation occur? *

      We are not aware of an approach to further stimulate transcription in this system, however we have some preliminary analyses of the effects of transcription inhibition that we plan to firm up within weeks. We agree with the reviewer that it may potentially be interesting and informative to include such studies but we do not consider this as central to the significance of the paper. So our plan is to attempt to quickly gather data on this for potential inclusion but we would plan to only include this material if we can rapidly achieve a clear understanding of the effects of transcription inhibition on nucleosome cluster organisation.

      ∙ The 300 msec exposure time is very long for typical STORM experiments - blinking would usually be lost at this time scale?

      Apologies, this was a typo. Now corrected to 30ms.

      ∙ Please quote laser power in absolute values rather than percentage

      Done.

      ∙ Collecting variable number of frames (20K - 50K) can impact the number of detections/clusters. How can you account for this variation in your analysis?

      Although data was collected with variable number of frames (20K-50K), in the original paper we only used data from 50K samples for quantitative analysis. In response to reviewer #2, we have now performed the analysis on a larger data set with variable frame number. Analysis of the two data sets controls for any impact of frame number variation and makes very little difference to the results of the analysis e.g the median cluster FWHM remains at 52 nm.

      Reviewer #2

      ∙ Pages 2/3, the authors estimate the number of nucleosomes per cluster identified in the Y loops. They do so "based on a simple volume calculation" but the quantification method and reference values used to reach the estimate of a "maximum of 158 nucleosomes" is not reported in the methods. Also, the authors should discuss how do they account for multiple blinking effect in the estimate of cluster measures. Methods based on normalization curves with nucleosome arrays (Ricci et al Cell 2015) and on DNA origami (Cella Zanacchi et al Nat Meth 2017) among others were previously used to address similar questions. The authors should consider applying a normalization approach to achieve a more accurate estimate or at least discuss the caveats of their strategy.

      We have now specified the details of our estimation of the number of nucleosomes per cluster in the Materials and Methods. Techniques to quantify clusters based on localisation number have been used in the past on SMLM data, but the normalisations are difficult to validate as there is uncertainty around numbers of antibodies binding, differences in frame numbers affecting blink number, as well as unknown binding efficiencies of antibodies. Our cluster quantification instead is based on the size of the clusters (cluster width), as this does not vary significantly across frame number once signal saturation has been achieved and avoids the difficulties associated with normalisation of localisation number. As we have mentioned in the Discussion, our “simple volume calculation” does have some caveats arising from the STORM precision, the labelling method and the EM-derived estimate of packing density; nevertheless, as we say in the Discussion we consider it provides a useful “starting point for interpretation of the observed labelling in terms of underlying chromatin structure.”

      ∙ Methods, STORM imaging: the authors should provide the imaging buffer composition as the papers they refer to make use of an array of buffers. If possible, the authors should include laser powers in standard units (kW/cm^2) to increase clarity and reproducibility.

      We have provided the imaging buffer composition and the laser power in standard units.

      ∙ Methods, Cluster analysis: the authors should specify the values for the parameters used in "First a description of the average cluster was established using spatial statistics, then a clustering algorithm using the average parameters provided by the initial spatial statistics further refined the description"

      As detailed in response to the next comment, we have clarified the text in the Materials and Methods to specify the parameters and workflow of the analysis.

      ∙ Given the availability of broadly used methods for clustering such as DBScan (Ester, et al. A density-based algorithm for discovering clusters in large spatial databases with noise. Kdd, 1996), Bayesian methods (Rubin-Delanchy et al. Bayesian cluster identification in single-molecule localization microscopy data. Nature methods, 2015), (Ricci et al. Cell 2015), Voronoi based tessellation (Levet et al. SR-Tesseler: a method to segment and quantify localization-based super-resolution microscopy data. Nature methods, 2015), and more recently machine learning approaches (Williamson et al. Machine learning for cluster analysis of localization microscopy data. Nature communications, 2020) the clustering method of choice is peculiar. It makes more difficult the direct comparisons with previous work. The authors could justify the choice or compare results using other more broadly used methods.

      We have added text to justify our selected analysis method and to clarify the analysis workflow. We do not consider our approach “peculiar” as very similar approaches have been employed previously (e.g. Sengupta et al Nat. Methods 8, 969–975 (2011), Veatch et al PLoS ONE 7, e31457 (2012) and Cisse Science 341:664-667 (2013)). In selecting this approach we compared various approaches and we found Meanshift to perform best with respect to identifying clusters closely positioned along a fibre. (Example data comparing the performance of Meanshift with DBscan in cluster identification with varying radius input parameter was provided to reviewers). We concluded that MeanShift provides a more robust cluster identification and that the radius value of 2Xsigma derived from the PCF is appropriate.

      ∙ No statistical tests were performed in the study, although no comparisons between experimental conditions are shown. The authors mention the number of replicates performed only on Fig 3 ("13 regions of interest (ROI) were selected from 3 cells"), which is a low number of cells analyzed. The authors should enlarge the analyzed datasets and mention how many replicates were used throughout the work.

      As mentioned above, the quantitative analysis originally presented used the data with the constant, highest frame number (3 cells, 895 clusters). In the revised paper we present the analysis of the wider data set (12 cells, 2473 clusters). In practice this makes very little difference to the results, e.g the median cluster FWHM remains at 52 nm.

      ∙It is unclear why the authors did not perform STORM on EU-labeled samples immunolabeled with Pol2 antibodies. In principle, they could have applied the same imaging protocol used in previous figures gaining spatial resolution to better characterize Y nascent transcripts and how Pol2 is arranged with respect to nascent RNA.

      There are indeed a number of questions that can be approached by super-resolution imaging of nascent transcripts in association with co-labelling for RPol. We are exploring this interesting direction but this is a whole separate project and we propose to report our findings on this in a subsequent publication.

      ∙ The text is well organized and clear as well as the figures but the authors should revise the text in the section "linking RNA polymerase distribution and Y loop transcription" to include missing references to Figure 10. For example, the statement "This distribution of nascent transcript along the Y loops fits with the distribution of RNA-PSer2," refers to results shown in Fig 10A.

      We are pleased the reviewer found the text and figures well organised and clear. We have added/modified figure references in this section to clarify the relationship between text and figures.

      ∙ At end of page 5 the authors describe the location of RNA Pol2 pSer2 with respect to nucleosomes/RNA. As these results are related to findings reported by Castells-Garcia Nucl Acids Res 2022, cited elsewhere in the manuscript, the authors could cite the work here.

      Done.

      It is unclear for which experiments goat anti-mouse Ig-Alexa Fluor 405 has been used, remove from antibody list if not relevant.

      This antibody has been removed.

      ∙ The use of the 488 laser as the activatory laser is peculiar, can the authors better explain this choice?

      Empirically we found the 488 laser performed better as an activatory laser- as it gave a higher signal to noise than the more commonly used 405 laser. We have added this statement to the M&M.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The authors investigate the nucleosome and RNA Pol2 arrangement along the Y loop of Drosophila primary spermatocytes with super-resolution (STORM) and confocal microscopy. Given the peculiarities of the model system (extremely long transcriptional unit, transcriptional expression restricted to spermatogenesis, well-characterized nuclear location), Y loops are an interesting choice to investigate the organization of chromatin loops as they allow to physically isolate one specific DNA filament from the rest of the genome. Ball and colleagues found that, although this loop undergoes transcription and is under a decondensed state, chromatin is arranged as a chain of nucleosome clusters of variable sizes and not as a 10 nm fiber. They also show that RNA Pol 2 pSer2 is interspersed along the loop, preferentially at the periphery of nucleosome clusters and at linker sites and do not form transcription factories. RNA Pol 2 pSer5 instead is less associated with the loop and is preferentially found close to the nucleolus consistent with its role in transcriptional initiation. Given the relative scarcity of RNA Pol 2 with respect to nucleosomes along the Y loop, the authors conclude that nucleosome arrangement in chains of clusters is not determined by the transcriptional activity.

      Major comments

      • Pages 2/3, the authors estimate the number of nucleosomes per cluster identified in the Y loops. They do so "based on a simple volume calculation" but the quantification method and reference values used to reach the estimate of a "maximum of 158 nucleosomes" is not reported in the methods. Also, the authors should discuss how do they account for multiple blinking effect in the estimate of cluster measures. Methods based on normalization curves with nucleosome arrays (Ricci et al Cell 2015) and on DNA origami (Cella Zanacchi et al Nat Meth 2017) among others were previously used to address similar questions. The authors should consider applying a normalization approach to achieve a more accurate estimate or at least discuss the caveats of their strategy.
      • Methods, STORM imaging: the authors should provide the imaging buffer composition as the papers they refer to make use of an array of buffers. If possible, the authors should include laser powers in standard units (kW/cm^2) to increase clarity and reproducibility.
      • Methods, Cluster analysis: the authors should specify the values for the parameters used in "First a description of the average cluster was established using spatial statistics, then a clustering algorithm using the average parameters provided by the initial spatial statistics further refined the description"
      • Given the availability of broadly used methods for clustering such as DBScan (Ester, et al. A density-based algorithm for discovering clusters in large spatial databases with noise. Kdd, 1996), Bayesian methods (Rubin-Delanchy et al. Bayesian cluster identification in single-molecule localization microscopy data. Nature methods, 2015), (Ricci et al. Cell 2015), Voronoi based tessellation (Levet et al. SR-Tesseler: a method to segment and quantify localization-based super-resolution microscopy data. Nature methods, 2015), and more recently machine learning approaches (Williamson et al. Machine learning for cluster analysis of localization microscopy data. Nature communications, 2020) the clustering method of choice is peculiar. It makes more difficult the direct comparisons with previous work. The authors could justify the choice or compare results using other more broadly used methods.
      • No statistical tests were performed in the study, although no comparisons between experimental conditions are shown. The authors mention the number of replicates performed only on Fig 3 ("13 regions of interest (ROI) were selected from 3 cells"), which is a low number of cells analyzed. The authors should enlarge the analyzed datasets and mention how many replicates were used throughout the work.
      • It is unclear why the authors did not perform STORM on EU-labeled samples immunolabeled with Pol2 antibodies. In principle, they could have applied the same imaging protocol used in previous figures gaining spatial resolution to better characterize Y nascent transcripts and how Pol2 is arranged with respect to nascent RNA.

      Minor comments

      • The text is well organized and clear as well as the figures but the authors should revise the text in the section "linking RNA polymerase distribution and Y loop transcription" to include missing references to Figure 10. For example, the statement "This distribution of nascent transcript along the Y loops fits with the distribution of RNA-PSer2," refers to results shown in Fig 10A.
      • At end of page 5 the authors describe the location of RNA Pol2 pSer2 with respect to nucleosomes/RNA. As these results are related to findings reported by Castells-Garcia Nucl Acids Res 2022, cited elsewhere in the manuscript, the authors could cite the work here.
      • It is unclear for which experiments goat anti-mouse Ig-Alexa Fluor 405 has been used, remove from antibody list if not relevant.
      • The use of the 488 laser as the activatory laser is peculiar, can the authors better explain this choice?

      Referees cross-commenting

      I believe the comments of the other reviewer are very good and in line with the comments I made. I do not have much to add.

      Significance

      The conclusions reached by the authors are in line with previous reports (Ricci et al. Cell 2015, Ou et al. Science 2017, Castells-Garcia et al Nucl Acids Res 2022, among others). With different SR techniques, these works show that chromatin in mammalian cells is arranged in heterogeneous groups of nucleosomes and that RNA Pol 2 is not organized in transcription factories but rather binds the chromatin fiber forming small clusters at the periphery of nucleosomes. The use of Y loops as a paradigm of transcriptionally active allows the authors to confirm previous findings on another organism, Drosophila Melanogaster, and to visualize the organization of chromatin within an individual, isolated loop, without the need for FISH-like labeling. The work is potentially interesting to the field of genome and chromatin organization and SR microscopy, it is also relevant to scientists working on transcriptional regulation.

      Field of expertise: Chromatin organization - SR microscopy - transcription.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors use STORM imaging to investigate transcription/spatial organisation of Drosophila Y loops. They find that the Y loop is organised in to small sections of clustered nucleosomes. Active RNAP is then found on the periphery of these clusters - furthermore, RNAP is distributed along the fibre rather than in to distinct "factory" units. The RNAP foci were less frequent than the nucleosome clusters therefore RNAP is not directing organising of the chromatin at these sites.

      Comments:

      1. Have you assessed if you are under-labeling the nucleosomes? or RNA Polymerase?
      2. Can you stimulate and inhibit transcription to see how changes in RNA Polymerase and nucleosome organisation occur?
      3. The 300 msec exposure time is very long for typical STORM experiments - blinking would usually be lost at this time scale?
      4. Please quote laser power in absolute values rather than percentage.
      5. Collecting variable number of frames (20K - 50K) can impact the number of detections/clusters. How can you account for this variation in your analysis?

      Referees cross-commenting

      I agree with the other reviewer statements/comments.

      Significance

      Highly significant work to visualise active transcription. It provides an interesting finding that transcription is not acting to determine chromatin organisation. Nevertheless, this may be specific to the model system.

      Field of expertise: Transcription, Super resolution imaging, single molecule imaging, spatial organisation.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The reviews are on balance an accurate, thoughtful, thorough assessment of the manuscript. We appreciate the careful engagement with the B cell differentiation aspect of our work. We identify 2 major critiques from the reviews:

      1. The manuscript should make stronger connections with existing literature on ____in-vitro _and _in-vivo ____B cell differentiation. We agree the manuscript should be revised to interact more holistically and carefully with relevant B cell differentiation research. In this respect, the reviewers both help by pointing to high-quality and relevant literature that will be discussed and cited.

      The cytokine mixture we used on the B cells was not defined / described in the manuscript. This fact hinders the interpretation of the data because B cells will respond to diverse stimuli in quite different ways.

      We agree this hinders interpretation of the data, and the reviewers bring up astute points about different types of stimuli (TD vs. TI vs. TLR vs. BCR). Unfortunately, the manufacturer of the product, Stem Cell Technologies, will not disclose exactly what is in the product. Given we are in strong agreement with the reviewers on this point, we analyzed the cytokine contents of the cocktail and our cell culture supernatants using a luminex cytokine panel. We present a discussion of our findings on this data in a supplementary note and figure. We acknowledge this analysis is non-exhaustive, because it does not include possible additions of non-cytokine stimulants. However, we maintain it adds much clarity to the interpretation of the data.

      We note that the contents of the stimulation cocktail are knowable and well-defined. These attributes are in contrast to almost all B cell stimulation protocols of which are aware. Typical stimulation protocols use various types of feeder cells, cytokines, and FBS (Fetal Bovine Serum). In particular, the feeder cells and FBS, are highly variable between labs, lots, and even experiments. FBS has a myriad of issues which are described here (____https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8349753/____). Major variability, from genomic to phenotypic, has been described in laboratory cell lines like the ones used as feeder cells. With respect to B cells specifically, large differences in B cell activation programs are observed between lots of FBS, as described here (____https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7854248/#r5____). Additionally, we have observed the presence of bovine viruses and other contaminants in FBS (unpublished data). Thus, the stimulation protocol we used is reproducible and robust in ways generally unseen by us in B cell stimulation literature. In summary, we view this cocktail as useful in a similar way to how FBS is useful to biologists – a major difference being that this cocktail is better defined and controlled. We provide similar thoughts in our supplementary note.

      A final general point we will make is about the significance of our work, which appears to be lost on Reviewer #1. Similar technical and conceptual advances by our lab have been cited 1000s of times. Thus, we think the impact of our scientific approach speaks for itself. Many of our results confirm and expand on previous literature about B cells. We deliberately chose to make this novel technical and conceptual advance in the well-studied system of B cell differentiation. This allows us to integrate our findings with prior literature and helps validate the general approach. Reviewer #1 has performed a scholarly service by independently verifying our findings are coherent with existing literature, and we thank them for that.

      In response to the reviews, we have edited the manuscript to reference even more of the papers in the field which report similar findings. Thus, our concordance with prior literature should be viewed as a strength of the manuscript. It shows readers of the manuscript the conceptual framework we use here is valid and can generate similar insights in less well-studied systems. For example, the approach developed here could be used in non-B cells, non-human immune systems, or even non-model organisms. In response to the reviewers critique, we modified the discussion of our work in multiple places to emphasize these points.

      Description of the planned revisions
      

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      1) Which B cell activation protocol was used? No information is provided in the main text or supplementary information. Yet, this information is key to fully understand many of the conclusions of this work (e.g., ... memory B cells are intrinsically two-fold more persistent in vitro (2A)), which largely depend on the nature of the stimuli used in the in vitro B cell culture.

      We used the B cell activation protocol developed by StemCell Technologies as described in our methods section. We agree the reader and scientific community would benefit from additional information about this cocktail. To this end, we added a discussion of the cocktail to the supplementary information. We also used a cytokine analysis panel to analyze the cocktail, which provided detailed although non-exhaustive information about what is in the cocktail.

      2) It would be informative to use more than one B cell activation program, e.g., CD40L with or without a cytokine as well as CD40L vs. CpG-DNA. Authors make broad statements about B cell fates without discussing the impact of a given signal on a given B cell fate. For instance, do memory B cells follow the same differentiation program upon stimulation with CD40L, IL-4 or a combination of CD40L and IL-4? How about differences between a TD signaling program such as that provided by CD40L and IL-10 and TI signaling program such as that provided by CpG-DNA and IL-10?

      This is a good point. We agree stimulation using a panel of different agents would be a worthwhile experiment. It stands as a goalpost for future studies. Currently, performing single cell RNA sequencing on so many samples is both beyond the scope of this manuscript and very resource intensive. ____

      3) Page 3, first line: After low quality and non-B cells (Fig S1A & B). What does this statement mean? The sentence seems incomplete.

      Thank you for catching this typo. It is now clarified in the manuscript that we removed these cells bioinformatically.

      4) First, we noted that non-B cells present in the input rapidly became undetectable by day 4, which shows the specificity of the cytokines for B cell expansion. Which cytokines are we talking about? No detail is provided.

      We now provide our analysis of the stimulation cocktail, in Supplementary note 1 and Supplementary figure 1A. We still believe it is an interesting observation that this cocktail specifically stimulates B cells because many cytokines are not specifically B cell division signals, there were some impurities in the input population, and many cytokines are produced by the cultured cells themselves.

      5) Plasmablasts were not distinguished from plasma cells.

      We agree this is an interesting and important distinction to make. We have now distinguished between these classes of cells.

      6) Critically, we observed no appreciable evidence of hypermutation in vitro (S2C), consistent with prior literature (Bergthorsdottir et al. 2001). This statement is vague, misleading and likely inaccurate for the following reasons. (a) The B cell culture conditions used by the authors are completely unknown. (b) It was shown that SHM can be achieved under specific in vitro B cell culture conditions that include the presence of activated CD4+ T cells (PMID: 9052835; PMID: 10092799; PMID: 10878357; PMID: 12145648). Did authors try to recapitulate those culture conditions?

      We see how this statement could be misunderstood. We only claim not to observe evidence of hypermutation in our specific culture conditions, which is important for the inferences we make. We added language to make this more clear.

      We did not try to recapitulate the conditions in the references supplied by the reviewer. We note that these references use cell lines and not B cells. While there is immensely valuable work done on cell lines, they behave very differently from actual cells and these findings may not be relevant to our human B cells.

      7) Some of the reported findings are repetitive of previously published results and provide no additional new information. For example:

      1. a) "Interestingly, we found mutated B cells were far more likely to express genes involved in T cell interaction (2B), suggesting Memory B cells are intrinsically licensed to enter an inflammatory state which activates T cells". This evidence is already published (PMID: 7535180 among many other published studies).

      We will cite this paper, which is a landmark study. We don’t claim we are the first to discover a propensity of mutated B cells to present help to T cells, but note that we were able to observe this fact via lineage tracing in a single experiment, which is a conceptual and technical advance. Additionally, we report an entire transcriptional module of genes which are upregulated in memory B cells vs. Naive B cells exposed to the same stimulus. This adds to the systematic understanding of the Memory B Cell activation program.

      1. b) "Instead, Naive B cells were biased toward expressing lectins and CCR7, suggesting Naive B cells are intrinsically primed to home into the lymphatic system and germinal centers (2B)". This evidence is already published (PMID: 9585422 among many other published studies).

      While this is an interesting and important paper referenced by the reviewer, we are unable to find anything similar to our claim about naive B cells in the reference provided. The investigators do not discuss intrinsic differences between memory and naive subsets when responding to the same stimulus.

      8) We quantified the in vitro dynamics of CSR through the lens of mutation status, which revealed strongly different fate biases between germline and mutated cells (2D). Most strikingly, B cells which switched to IGHE were almost exclusively derived from germline progenitors: the ratio of germline IGHE cells to mutated IGHE cells was (8-fold - inf, 95 % CI). Also this evidence is not novel (PMID: 34050324 among other published studies) and, again, must reflect the presence of specific culture conditions that remain completely undisclosed. This is incredibly confusing.

      Thank you for providing this reference, we were not aware of this interesting study. These studies are quite different and complementary. Differences between these studies likely reflect the fact that their B cells are isolated from a niche, rather than generated ____in-vitro_. Most of the tissue-resident cells in their study are quite mutated, and thus are not the Naive B cells we are making a claim abountj. In fact, despite their claim of low mutational load, these cells would fall into the “mutated” or even “heavily mutated” categories we defined in our paper. Cells with mutation levels of 5% are not thought to be Naive in any classification scheme. Our study showed that, _in vitro____, IGHE B cells effectively came exclusively from germline progenitors, their study shows no such result. The novelty of this finding was appreciated by reviewer 2.

      9) Authors should mention that non-switched memory B cells include IgDlowIgM+CD27+ and IgD-IgM+CD27+ memory B cells. Some authors define these distinct memory B cell fractions as marginal zone (MZ) or MZ-like B cells (please, notice that splenic MZ B cells recirculate in humans) and IgM-only B cells, respectively (PMID: 28709802; PMID: 9028952; PMID: 10820234; PMID: 11158612; PMID: 26355154; PMID: 15191950; and PMID: 24733829 among many other published studies).

      We appreciate these points. We attempted to classify our B cells within this taxonomy and found no such separation clearly exists in single-cell RNA-seq profiles. Instead, we opted to re-classify our data with a state-of-the-art algorithm called celltypist (DOI: 10.1126/science.abl5197____ )____ which harmonizes cell annotations across a growing number of single-cell RNA sequencing studies. While this classification system is not currently mutually exclusive / completely exhaustive, we believe using this system provides standardization and data availability that are key for sharing results. As single-cell RNA-seq and flow/mass cytometry harmonize their classification systems, anyone should be able to transfer their preferred classification scheme to the cells profiled here.

      10) Thus, CSR from IGHM cells did not meaningfully contribute to the abundance of IGHA+ cells in the population. Also this conclusion may be misleading and/or inaccurate. Indeed, an efficient class switching to IgA requires the exposure of naïve B cells to the cytokine TFG-beta in addition to a robust TD (CD40L) or TI (CpG DNA or BAFF or APRIL) co-signal. Was TGF-beta present in this culture?

      This is a good point about TGF-beta and switching to IgA. Here is a clear example of the novelty and power of our approach, as well as the benefits of using a well-characterized system such as B cell differentiation. Lineage tracing clarifies between two explanations for why there are IgA cells in the output population. One explanation is that non-IgA B cells in the input switch to IgA, driven by TGF-beta. Another explanation is that IgA cells in the input expand modestly and account for IgA cells in the output. Lineage tracing offers clear evidence that the latter explanation is true. Following from this, our approach allows us to make a strong inference that TGF-beta is not present in the incompletely determined cytokine mixture. We are not sure how this conclusion may be misleading or inaccurate, as it is a clear and simple description of our data, not a claim about what factors are necessary for switching.

      11) In contrast, we noted that many intraclonal class-switching events appeared to be directly from IGHM to IGHE. Explanations involving unobserved cells with intermediate isotypes notwithstanding, these data illustrate the relative ease with which B cells can switch directly to IGHE. It is very difficult to interpret this statement, as no information regarding the B cell-stimulating conditions used is provided. In addition, relevant literature is not quoted (e.g., PMID: 34050324).

      We clarify our discussion here to claim the ease with which peripheral blood IGHM B cells switch to IGHE. Again, lineage tracing has allowed us to distinguish between two very different population-level phenomena. One explanation is that undetected IGHE+ progenitors in the input population expanded rapidly and account for the IGHE+ cells. Another explanation is that cells class-switch to IGHE. Our data are consistent with the latter. We note that this validates the conceptual use of lineage tracing to understand rapid population dynamics in immune responses and cell differentiation protocols. This is a strength of our manuscript. We appreciate the the reviewer has furnished relevant studies, which we will cite.

      12) Our data for IGHE cells contrasts with in vivo data which show IgE B cells to be: (1) very rare, (2) apparently derived from sequential switching (e.g. from IgG1 to IgE) (Horns et al., 2016; Looney et al., 2016), and (3) often heavily hypermutated (Croote et al., 2018).

      While this reviewer agrees with the first comment (switching to IgE is relatively rare in vivo, at least in healthy individuals), the other statements are quite inaccurate. Indeed, unmutated extrafollicular naïve B cells from tonsils and possibly other mucosal districts directly class switch from IgM to IgE in healthy individuals, thereby generating a low-affinity IgE repertoire. In principle, low-affinity IgE antibodies may protect against allergy by competing with high-affinity IgE specificities. In allergic patients, high-affinity IgE clones emerge from class-switched and hypermutated memory B cells that sequentially switch from IgG1 or IgA1 to IgE as a result of specific environmental conditions, including an altered skin barrier (PMID: 22249450; PMID: 30814336; PMID: 32139586).

      Moreover, in contrast to what stated by authors, sequential IgG1/IgA1-to-IgE class switching mostly occurs in allergic patients but is less frequent in healthy individuals, where IgE specificities are less mutated (PMID: 30814336). Along the same lines, IgE is heavily mutated only in allergic individuals with significant molecular evidence of sequential IgG1/IgA1-to-IgE class switching (PMID: 30814336; PMID: 32139586). Overall, the data provided by Swift M. et al. are largely confirmatory of previously published evidence.

      We appreciate the clarification of this complex field and will cite the relevant literature. We also agree with the reviewers assessment that our data are validated by other approaches and groups. We see that our discussion of IgE B cells should have included that caveat that we are discussing IgE B cells detected in the peripheral blood. We have restricted claim to the suggestion that if our conditions mimic such niches where B cells switch to IgE, there are clearly efficient mechanisms which limit the amount of circulating IGHE B cells mechanisms in comparison to other isotypes.____

      Taken together, these data suggest that while direct switching to IGHE from Naive progenitors is trivial in vitro, niche factors or intrinsic death programs efficiently limit their generation or lifetime in vivo." I cannot understand this conclusion, which seems to contradict earlier statements.

      We hope we have clarified via the above comment.

      13) I am not sure I learned much regarding the "cell-intrinsic" fate bias and transcriptional memory of B cells after reading this elegantly presented but confusing and superficially discussed manuscript (please, see also comments 15-23).

      We understand the reviewer is confused about various aspects of our manuscript and appreciate the opportunities to clarify. We show cells with broad identities (such as germline vs. mutated or naive vs. memory) respond differently to the same stimulus. These are cell intrinsic fate biases. We quantify them and provide statistical bounds on the effect sizes of these differences, which to our knowledge has not been done. We agree with the reviewer that in the case of memory and naive B cells, much is already known about their biases. We recapitulate some of this knowledge, while adding a quantitative and an unbiased transcriptomic lens with which to view the biases. However, our analysis moves beyond cell types broadly defined, and focuses on the concept that each clone is a cell state or identity, where some of the identity may be faithfully propagated over generations and other information may not be. To this end, we tracked the transcriptome of clones during differentiation. We show that B cell clones share highly similar cell fates, implicating cell-intrinsic heterogeneity as a major contribution to diversity in immune responses. We note this reviewer did not critique this aspect of our work. The review also did not critique Figure 3 or 4, in which we present a quantitative analysis of which transcriptional programs are maintained by B cells and contribute to their clonal identity. Finally, via our analysis of human long-lived plasma cells, we report these transcriptional identities are observed in-vivo, over long time scales. This type of cell-intrinsic bias has not been studied or described to our knowledge. These findings were of particular interest to reviewer 2 and other readers of the manuscript.

      MINOR COMMENTS

      Thank you for reading the manuscript carefully and providing these comments and observations. We have fixed all clerical errors that were pointed out. We also responded to some of these minor comments here, and made changes to the manuscript to clarify.

      1) Figures 1B, S1C and S1D are not referred to in the text. x

      2) 2B in the Text is 2D in the Figure. x

      3) 2D in the Text is 2E in the Figure. x

      4) Figure 2D seems to show only 10 genes. Please, clarify.

      We clarify in the manuscript that we present the top differentially expressed genes

      5) 2E in the text is 2F in the Figure. x

      6) Figure S3B is not indicated in the text. x

      7) Figure 3E is not indicated in the text. x

      8) Figure S4A is not indicated in the text. x

      9) In some sections of the text, Figure panels are not sequentially discussed, which makes the text very difficult to follow. x

      Reviewer 2:

      Major comments:

      On p.3 the authors assume that a B cell with an unmutated BCR in the time course arose from a naive B cell progenitor. However, it is also possible that it arose from a IgM memory B cells since they also contain a non-negligible proportion of cells with 0 mutations. This was initially seen already in the Klein, J Exp Med, 1998 paper and later confirmed by e.g. Weller et al, J Exp Med, 2008 and Wu et al, Front Immunol, 2011. And since the authors herein and others have demonstrated that IgM memory B cells have a high proliferative capacity it is possible that IgM memory B cells are overrepresented among those unmutated BCRs seen in the cultures.

      The finding that IgM memory B cells are highly proliferative is not novel. It has been demonstrated by other groups before and one good example is Seifert et al, PNAS, 2015 where IgM memory B cells proliferated significantly more to BCR stimulation than naive or IgG memory B cells. However, it is also shown that IgG memory B cells are more responsive to TLR9 stimulation than IgM memory B cells as demonstrated by e.g. Marasco et al, Eur J Immunol, 2017. This is not discussed by the authors and should be added into the discussion for context of their finding by scRNAseq methods.

      These are astute points. We incorporated a more nuanced discussion of the prior literature about highly proliferative IgM memory B cells, which have been reported before. We also added a figure which identifies the genes associated with proliferative clones in Figure 3d, which adds to our understanding of the gene regulatory networks which govern IgM memory B cell behavior. We appreciate the reference to the Seifert et al paper, which is relevant and high quality work. We concur that a discuss of Marasco would be helpful, especially because it is unknown if a TLR9 agonist is in the stimulation cocktail, but their data would suggest there is not.

      The notion that a memory transcriptional program can be induced without SHM is not novel and this should be brought up in the discussion. One paper showing a memory transcriptional program in unmutated memory B cells is Kibler et al, Front Immunol, 2022.

      We were not aware of this literature and have now cited it in our discussion of this finding.

      The observation that memory B cells are more likely to enter an inflammatory state and support T cells has been suggested by other groups (Seifert et al, PNAS, 2015; Magri et al, Immunity, 2017, Grimsholm et al, Cell Reports, 2020).

      We have now cited and discussed a number of papers which contain similar findings. We note that we add to the holistic understanding of this phenomenon via our single cell transcriptomic approach.

      Please provide the age distribution of the peripheral blood samples as well.

      We have now provided the age distribution of the peripheral blood samples

      Please show flow cytometry analysis of the cultures to assist in assessing subset distribution, viability and plasma cell differentiation for each time point. This can be provided as supplementary information.

      We did not use flow cytometry for subset distribution and measurements of differentiation per se, only to exclude non-viable cells and we have now made this clearer in the methods section. We also now include representative plots show our sorting strategy.

      The stimulation cocktail used for this study, what does it contain? This needs to be specified in the manuscript and not only referring to the manufacturer. This has major impact on the results since different stimulatory agents will induce different pathways.

      This is a valid point that we addressed in our response to reviewer 1. See supplementary note 1 and Figure S1A for our analysis of the stimulation cocktail.

      Minor comments:

      Please avoid the term plasma B cells, does it refer to plasmablasts and/or plasma cells?

      Thank you for the suggestion, we have modified our language to refer to plasmablasts and plasma cells separately.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      I have with great interest read the manuscript "Fate bias and transcriptional memory of human B cells" by Swift et al where they use an in vitro approach and single cell RNA sequencing to elucidate fate decisions of human B cells. The authors demonstrate that genes related to cell fate determination in human B cells are persistent both in their in vitro culture and in ex vivo plasma cells from bone marrow. This is a very important finding in this manuscript and the technical execution of the paper is overall of high quality. The finding of switching to IgE in vitro from unmutated precursor is very interesting and does not necessarily contradict existing data in the field but rather adds new important information. However, some of the findings are not totally novel and need to be put in context of earlier findings (outlined below). The discussion does not acknowledge literature on memory B cell transcriptional programs and mutation status in a balanced manner. Important literature by leading groups in the field are left out.

      Major comments:

      On p.3 the authors assume that a B cell with an unmutated BCR in the time course arose from a naive B cell progenitor. However, it is also possible that it arose from a IgM memory B cells since they also contain a non-negligible proportion of cells with 0 mutations. This was initially seen already in the Klein, J Exp Med, 1998 paper and later confirmed by e.g. Weller et al, J Exp Med, 2008 and Wu et al, Front Immunol, 2011. And since the authors herein and others have demonstrated that IgM memory B cells have a high proliferative capacity it is possible that IgM memory B cells are overrepresented among those unmutated BCRs seen in the cultures.

      The finding that IgM memory B cells are highly proliferative is not novel. It has been demonstrated by other groups before and one good example is Seifert et al, PNAS, 2015 where IgM memory B cells proliferated significantly more to BCR stimulation than naive or IgG memory B cells. However, it is also shown that IgG memory B cells are more responsive to TLR9 stimulation than IgM memory B cells as demonstrated by e.g. Marasco et al, Eur J Immunol, 2017. This is not discussed by the authors and should be added into the discussion for context of their finding by scRNAseq methods. The notion that a memory transcriptional program can be induced without SHM is not novel and this should be brought up in the discussion. One paper showing a memory transcriptional program in unmutated memory B cells is Kibler et al, Front Immunol, 2022.

      The observation that memory B cells are more likely to enter an inflammatory state and support T cells has been suggested by other groups (Seifert et al, PNAS, 2015; Magri et al, Immunity, 2017, Grimsholm et al, Cell Reports, 2020). Please provide the age distribution of the peripheral blood samples as well.

      Please show flow cytometry analysis of the cultures to assist in assessing subset distribution, viability and plasma cell differentiation for each time point. This can be provided as supplementary information.

      The stimulation cocktail used for this study, what does it contain? This need to be specified in the manuscript and not only referring to the manufacturer. This has major impact on the results since different stimulatory agents will induce different pathways.

      Minor comments:

      Please avoid the term plasma B cells, does it refer to plasmablasts and/or plasma cells?

      Referees cross-commenting

      Comments on reviewer 1 report: The reviewer raised the same significant objection to the manuscript as it stands today i.e. the content of the stimulation cocktail is not described. Without this information the manuscript is hard to put into the perspective of the existing literature.

      Significance

      See above comments.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Swift M. et al. explored how gene expression responds to extracellular stimuli by dissecting the transcriptional programs that underlie cell-intrinsic clonal fate bias. They also verified how extrinsic signals, intrinsic state, and clonal population structure interacted to determine the dynamics of the B cell immune response. To address these questions, lineage and single-cell RNA sequencing measurements were performed in human B cells stimulated in vitro through an undisclosed protocol. The B cell receptor (BCR) gene was used as a genetic lineage marker and this information was paired with that gained from transcriptomics, which provided an additional readout of cellular identity. Authors concluded that B cells have intrinsic biases towards specific cell fates and identified specific transcriptional memory states.

      Major comments

      While this is an elegantly presented manuscript that presents a wealth of data obtained from state-of-the-art transcriptomics and bioinformatics, some issues considerably weaken the main conclusions. These weaknesses entail an almost complete absence of details as to the stimuli being used for the in vitro activation of B cells (e.g., were they exposed to IL-4, TGF-beta and/or CD40L?). In the absence of this information, the discussion of intrinsic biases towards specific cell fates cannot be fully interpreted and understood, as different stimuli induce different B cell fates. In addition, there seems to be some misunderstanding about IgE class switching and somatic hypermutation, which differ in healthy and allergic individuals. Also, the novelty of this work is reduced by the fact that some of its conclusions are either largely expected or already known from studies published previously. Altogether, these issues make some statements and conclusions confusing and, sometimes, misleading. The following additional major comments are provided.

      1. Which B cell activation protocol was used? No information is provided in the main text or supplementary information. Yet, this information is key to fully understand many of the conclusions of this work (e.g., ... memory B cells are intrinsically two-fold more persistent in vitro (2A)), which largely depend on the nature of the stimuli used in the in vitro B cell culture.
      2. It would be informative to use more than one B cell activation program, e.g., CD40L with or without a cytokine as well as CD40L vs. CpG-DNA. Authors make broad statements about B cell fates without discussing the impact of a given signal on a given B cell fate. For instance, do memory B cells follow the same differentiation program upon stimulation with CD40L, IL-4 or a combination of CD40L and IL-4? How about differences between a TD signaling program such as that provided by CD40L and IL-10 and TI signaling program such as that provided by CpG-DNA and IL-10?
      3. Page 3, first line: After low quality and non-B cells (Fig S1A & B). What does this statement mean? The sentence seems incomplete.
      4. First, we noted that non-B cells present in the input rapidly became undetectable by day 4, which shows the specificity of the cytokines for B cell expansion. Which cytokines are we talking about? No detail is provided.
      5. Plasmablasts were not distinguished from plasma cells.
      6. Critically, we observed no appreciable evidence of hypermutation in vitro (S2C), consistent with prior literature (Bergthorsdottir et al. 2001). This statement is vague, misleading and likely inaccurate for the following reasons. (a) The B cell culture conditions used by the authors are completely unknown. (b) It was shown that SHM can be achieved under specific in vitro B cell culture conditions that include the presence of activated CD4+ T cells (PMID: 9052835; PMID: 10092799; PMID: 10878357; PMID: 12145648). Did authors try to recapitulate those culture conditions?
      7. Some of the reported findings are repetitive of previously published results and provide no additional new information. For example:

        • a) "Interestingly, we found mutated B cells were far more likely to express genes involved in T cell interaction (2B), suggesting Memory B cells are intrinsically licensed to enter an inflammatory state which activates T cells". This evidence is already published (PMID: 7535180 among many other published studies).
        • b) "Instead, Naive B cells were biased toward expressing lectins and CCR7, suggesting Naive B cells are intrinsically primed to home into the lymphatic system and germinal centers (2B)".

      This evidence is already published (PMID: 9585422 among many other published studies). 8. We quantified the in vitro dynamics of CSR through the lens of mutation status, which revealed strongly different fate biases between germline and mutated cells (2D). Most strikingly, B cells which switched to IGHE were almost exclusively derived from germline progenitors: the ratio of germline IGHE cells to mutated IGHE cells was (8-fold - inf, 95 % CI). Also this evidence is not novel (PMID: 34050324 among other published studies) and, again, must reflect the presence of specific culture conditions that remain completely undisclosed. This is incredibly confusing. 9. Authors should mention that non-switched memory B cells include IgDlowIgM+CD27+ and IgD-IgM+CD27+ memory B cells. Some authors define these distinct memory B cell fractions as marginal zone (MZ) or MZ-like B cells (please, notice that splenic MZ B cells recirculate in humans) and IgM-only B cells, respectively (PMID: 28709802; PMID: 9028952; PMID: 10820234; PMID: 11158612; PMID: 26355154; PMID: 15191950; and PMID: 24733829 among many other published studies). 10. Thus, CSR from IGHM cells did not meaningfully contribute to the abundance of IGHA+ cells in the population. Also this conclusion may be misleading and/or inaccurate. Indeed, an efficient class switching to IgA requires the exposure of naïve B cells to the cytokine TFG-beta in addition to a robust TD (CD40L) or TI (CpG DNA or BAFF or APRIL) co-signal. Was TGF-beta present in this culture? 11. In contrast, we noted that many intraclonal class-switching events appeared to be directly from IGHM to IGHE. Explanations involving unobserved cells with intermediate isotypes notwithstanding, these data illustrate the relative ease with which B cells can switch directly to IGHE. It is very difficult to interpret this statement, as no information regarding the B cell-stimulating conditions used is provided. In addition, relevant literature is not quoted (e.g., PMID: 34050324). 12. Our data for IGHE cells contrasts with in vivo data which show IgE B cells to be: (1) very rare, (2) apparently derived from sequential switching (e.g. from IgG1 to IgE) (Horns et al., 2016; Looney et al., 2016), and (3) often heavily hypermutated (Croote et al., 2018). While this reviewer agrees with the first comment (switching to IgE is relatively rare in vivo, at least in healthy individuals), the other statements are quite inaccurate. Indeed, unmutated extrafollicular naïve B cells from tonsils and possibly other mucosal districts directly class switch from IgM to IgE in healthy individuals, thereby generating a low-affinity IgE repertoire. In principle, low-affinity IgE antibodies may protect against allergy by competing with high-affinity IgE specificities. In allergic patients, high-affinity IgE clones emerge from class-switched and hypermutated memory B cells that sequentially switch from IgG1 or IgA1 to IgE as a result of specific environmental conditions, including an altered skin barrier (PMID: 22249450; PMID: 30814336; PMID: 32139586). Moreover, in contrast to what stated by authors, sequential IgG1/IgA1-to-IgE class switching mostly occurs in allergic patients but is less frequent in healthy individuals, where IgE specificities are less mutated (PMID: 30814336). Along the same lines, IgE is heavily mutated only in allergic individuals with significant molecular evidence of sequential IgG1/IgA1-to-IgE class switching (PMID: 30814336; PMID: 32139586). Overall, the data provided by Swift M. et al. are largely confirmatory of previously published evidence.

      Taken together, these data suggest that while direct switching to IGHE from Naive progenitors is trivial in vitro, niche factors or intrinsic death programs efficiently limit their generation or lifetime in vivo." I cannot understand this conclusion, which seems to contradict earlier statements. 13. I am not sure I learned much regarding the "cell-intrinsic" fate bias and transcriptional memory of B cells after reading this elegantly presented but confusing and superficially discussed manuscript (please, see also comments 15-23).

      Minor comments

      1. Figures 1B, S1C and S1D are not referred to in the text.
      2. 2B in the Text is 2D in the Figure.
      3. 2D in the Text is 2E in the Figure.
      4. Figure 2D seems to show only 10 genes. Please, clarify.
      5. 2E in the text is 2F in the Figure.
      6. Figure S3B is not indicated in the text.
      7. Figure 3E is not indicated in the text.
      8. Figure S4A is not indicated in the text.
      9. In some sections of the text, Figure panels are not sequentially discussed, which makes the text very difficult to follow.

      Significance

      NATURE AND SIGNIFICNCE OF THE ADVANCE

      I could not grasp a significant advance in this study as provided.

      COMPARE TO EXISTING PUBLISHED KNOWLEDGE

      As detailed in my specific comments, many of the conclusions of this work are largely expected or already published. It is difficult to evaluate the substance and originality of other conclusions in manuscript as provided.

      AUDIENCE

      If adequately and extensively revised, this work could be interesting to a broad audience.

      MY EXPERTISE

      B cell biology; B cell subsets; regulation of antibody production, including class switching; signals skewing antibody responses to a specific isotype.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Reply to the Reviewers

      I thank the Referees for their...

      Referee #1

      1. The authors should provide more information when...

      Responses + The typical domed appearance of a hydrocephalus-harboring skull is apparent as early as P4, as shown in a new side-by-side comparison of pups at that age (Fig. 1A). + Though this is not stated in the MS 2. Figure 6: Why has only...

      Response: We expanded the comparison

      Minor comments:

      1. The text contains several...

      Response: We added...

      Referee #2

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study addresses the newly appreciated role of E-cadherin in tumor cell invasion. A strength of this study is their inducible RasV12 Drosophila model of transformation, which allows them to follow cell dissemination at the basal midgut. The authors demonstrate, convincingly, that depletion of E-cadherin in this model impairs cell dissemination. However, their efforts to demonstrate a role for two Ca2+ signaling pathways mediated by IP3R and Piezo channels are not convincing given the use of blunt and non-specific pharmacological tools, lack of Ca2+ monitoring, and generally descriptive observations.

      A significant weakness in this study is the sole and unsubstantiated use of Arm (the Drosophila b-catenin ortholog) as a proxy for e-cadherin localization and abundance. This leads to unsubstantiated statements such as, "disassembly of adherens junctions rather than E-cad levels" is involved in dissemination of cells. The role of E-cad is complicated by the observation that both knockdown and overexpression of E-cad perturb cell dissemination in their model.

      In Figure 4, the role of Piezo in Arm distribution is presented in confocal images of a few cells. Is this statistically significant? This experiment needs to be quantified with more cells to substantiate the claim that Arm signals redistribute from cell boundary to cytosol in Piezo and Calpain knockdowns. Piezo channels are non-selective for cations. This means that the results of the knockdown cannot be assigned to changes in cytosolic Ca2+. At the least, cytosolic Ca2+ levels should be monitored. The authors suggest that calpains work with Piezo through Ca2+ signaling on the basis of a single experiment in which calpain knockdown has a similar effect on Arm localization to Piezo knockout. This conclusion seems speculative and is not adequately supported.

      Thapsigargin is a blunt tool that causes a large, non-physiological and irreversible increase in cytosolic Ca2+. The observation that high Ca2+, resulting from Tg treatment, mediates disassembly of cadherins junction is not new, and indeed has been known for at least 20 years. Therefore, the effect of Tg in causing relocation of Arm signals from the adherens junctions is not insightful.

      Use of GdCl3 is a non-specific inhibitor of many ion channels, including voltage gated Ca2+ channels, TRP channels, leak K+ channels, etc. It cannot be used to infer the role of Piezo when used as described in this study. The observation that Tg + GdCl3 somewhat preserves basal/junction ratios of Arm, relative to Tg alone, is interesting but uninterpretable without additional experiments, including measurements of cytoplasmic Ca2+ levels. None of the findings related to Figure 5 are mechanistically insightful and the imputed role of Piezo is not convincing.

      Significance

      Overall, although this study addresses a topic of high significance and interest, the limitations in experimental approach do not allow mechanistic insights that advance our understanding of the role of E-cadherin in tumor invasion.

      Referee Cross-commenting

      Dear fellow reviewers,

      My expertise is in calcium signaling and ion transporters/channels. Although I was excited to review this work based on the abstract, I do not think that the link to calcium signaling (figs 4-5) is insightful or mechanistic, as I explain in my review. I realize that experimental approaches may be limited by the fly model, but the conclusions made by the authors would not be persuasive in other models.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Cabrera et. al. examine the role of DE-cadherin in cell dissemination, using a model of RasV12-transformed cells in the Drosophila hindgut. They show that E-cad relocates in the transformed cells from cell-cell junctions to basal invasive protrusions, and this relocation depends on calcium release from the ER by the PLC-IP3R-CAMK pathway. DE-cad is not required for the formation of actin-rich protrusions. However, upon RNAi-mediated DE-cad depletion (or depletion of the PLC-IP3R pathway), the basement membrane is not degraded and cell dissemination is inhibited. Furthermore, piezo and calpain are also shown to be required for DE-cad to assemble at invasive protrusions.

      Major comments:

      The data in figures 1-3 is overall convincing, although some of the points are missing quantification. The data supporting a role for piezo in regulating DE-cad assembly at adherens junctions is weaker. The data derived from ex-vivo pharmacological treatments (figure 5) is difficult to interpret or draw conclusions from.

      Specific comments:

      1. The data on DE-cad overexpression is puzzling. When DE-Cad is overexpressed in Ras-transformed cells it relocates from AJ to invadopodia and the ECM is degraded just like in Ras-transformed cells, yet cell dissemination is reduced by 50%. Why? The authors don't offer any explanation and in the absence of one I don't see how the DE-cad overexpression data adds to the story.
      2. DE-cad is knocked down by RNA interference using two different sequences. Admittedly, I am not a fly person, but isn't there a way for fly geneticists to show how well the KD worked? What percentage of WT DE-cad is left? This same question can be applied to all the RNAi experiments.
      3. Arm redistribution and co-localization with F-actin should be quantified. While the "representative" images are of high quality and their presentation in XY and XZ or basal, middle, ortho views is very helpful, the data from multiple cells, from multiple flies, from at least 3 different experiments must be quantified to show by how much does the level of E-cad decrease at AJ and by how much does it increase at the basal protrusions. Also, the degree of co-localization of F-actin and E-cad at invadopodia should be quantified. Perhaps F-actin is not the best marker for invadopodia since it is also present in other structures.
      4. The data presented in Figure 3c-h also needs to be properly quantified for level of DE-cad at lateral sides of cells and on basal side from multiple cells in multiple animals and at least 3 experiments.
      5. The connection made between piezo and DE-cadherin (figure 4) is tenuous. Piezo is known (based on the authors previous work as well as others) to be upstream of invadopodia formation, so it is not surprising that E-cad does not localize to invadopodia when they don't exist. The authors do not provide any evidence to directly link piezo activity or calcium entry to DE-cadherin localization at invadopodia or at AJ and therefore their claim that "calcium signaling mediated by the Piezo-calpain pathway plays a distinct role in the DE-cad/Arm remodeling process" is purely speculative. The images of fuzzy DE-cadherin in figure 4b are hardly proof.
      6. The experiment in fig 5a,b showing that calcium release from the ER leads to AJ disassembly, nicely complements the in vivo data in fig 3c-h. However, the remaining of figure 5, dealing with piezo, is entirely unconvincing. The images in 5f don't resemble invadopodia and have nothing to do with them. The data in 5i regarding apical delamination is interesting, but does not support their point about piezo and AJ. They don't even show DE-cadherin.

      Minor comments:

      Have a figure "for the reviewer" should be avoided. The authors must write the paper and include diagrams or images so that any reader is able to understand the model system just as well as the reviewer.

      Significance

      In my assessment of significance I am only taking into account the conclusions that I feel are well supported. Depending on when the preprint they cite (ref. 52) will be published, this could be the first report of E-cad localizing in invadopodia and, importantly, being essential for ECM degradation and cell dissemination. Also of significance is the identification of intracellular calcium release as a signal for AJ disassembly and relocation of E-cad to invasive protrusions. While the molecular mechanism through which E-cad regulates ECM degradation is not elucidated, the paper still represents a significant advance for the fields of cancer cell biology and cell and developmental biology and would be of interest to a wide audience.

      I am a cell and developmental biologist with expertise in cell adhesion and morphogenesis. I am not an expert on Drosophila.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This interesting manuscript examines how intracellular calcium signaling and cation channels (piezo) may affect the remodelling of adherens junctions in tumor cells. The authors use the Drosophila midgut model to examine the dissemination of RasV12 cells. Authors show that at day 2 of Ras expression, DE-cadherin/b-catenin complex redistributes from adherens junction to basal invasive protrusions. Depletion of DE-cadherin attenuates the dissemination and local invasion of RasV12 cells, as manifest in damage to underlying visceral muscles. The authors attribute disassembly of cadherin from cell-cell junctions to IP3R-dependent release of calcium from ER stores and subsequent activation of CaMKs. Surprisingly, the authors found an apparently separate role for Piezo (potentially mediating calcium signaling) to support the assembly of DE-cad at the invasive protrusions.

      Overall, this is a well-presented report that outlines the basic features of the phenomena with data of good quality. Over the years, there have been intermittent efforts to characterize the impact of calcium signaling on cell-cell junctions but much remains to be learnt. This manuscript is informative for focusing attention on a model of cadherin downregulation (and redistribution) in early tumors. However, it is somewhat descriptive and lacks depth of mechanistic analysis. For example, how might the cadherin in invasive protrusions be contribution to matrix degradation? What are the targets of either the IP3/CamK pathway or piezo/calpain signaling? Clearly, these foundational observations could be taken in many different directions. But some development would enrich the current package. Some suggestions follow.

      Major comments:

      1. What is the role of E-cadherin in invasive protrusions? The authors use DE-cadherin depletion to support a role for this relocated pool of cadherin in tumor dissemination. However, this doesn't distinguish between a direct action of the cadherin complex at the protrusions from some indirect effect. Further analysis of the "protrusion" cadherin could be informative. For example, one possibility is that E-cadherin is essential to localise other proteins to the invasive protrusion. Authors have previously reported (Lee et al. 2020) that cortactin is essential for dissemination of the RasV12 cells. Considering that the phenotype of cortactin KD (presented in Lee et al. 2020) and DE-cad KD mostly overlap, and literature suggests that E-cadherin and cortactin colocalise and can influence each others' localisation in cultured cells (e.g. Ren et al. 2009; PMCID: PMC2707247). Could KD of DE-cad perturb cortactin localisation at the invasive protrusions? This could be doable in a reasonable time, as according to Lee et al. 2020 authors already have a relevant Drosophila strain, that lets visualisation of cortactin.
      2. Another route to explore in greater depth is the impact of DE-cadherin depletion on the protrusion. The authors show that upon DE-cad KD, some actin-rich puncta are still able to form at the basal surface of the Ras cells. This should be characterised in more details to assess whether DE-cad KD affects the number, size and dynamics of forming protrusions. At current stage, it's not obvious why DE-cad depleted Ras cells do not disseminate, even though they are able to form the invasive protrusions. As authors may potentially use movies/images they already have, this should be doable within reasonable time frames.
      3. Authors could expand the discussion section to address the issue of the unusual DE-cad localisation. The open, basic question that the authors haven't addressed at any stage is "what does the DE-cad bind to at the invasive protrusions?". Do authors speculate that DE-cad is able to interact with some components of ECM/VM (as would be suggested by the cartoon Fig S5)? Have authors considered that E-cadherin transinteractions forming eg. between small adjacent protrusions may serve to stabilise the initial structures and let them develop into more prominent invasive protrusions?
      4. Given the rapid intracellular diffusivity of calcium, it is interesting that two different sources of calcium have such different effects on the cadherin pool. Why does calcium released from ER by IP3R is enough to disassemble the cadherin complex at AJs, but signaling from Piezo is needed to regulate assembly of the cadherin complex at the invasive protrusions. Can authors comment on why calcium released by ER can't control both processes? Is it possible to visualize the dynamic subcellular distribution of calcium in these cells to test if there are different pools involved?
      5. Further, the discrepancy between the influence of the piezo1 KD and GdCl3 treatment on the localisation of the DE-cad/Arm is quite striking. As literature suggests an existence of a physical interaction between Piezo channel and E-cadherin, blocking Piezo vs depleting it may potentially affect E-cadherin differently. To address this, can authors e.g. add GdCl3 treatment (covering same timeframes as the Piezo KD) to data presented in Fig 4b (this would also eliminate the ambiguity caused by possible differences between in vivo vs ex vivo conditions - Fig 4 vs Fig 5). Also, is the phenotype of loss of junctional DE-cad/Arm upon Piezo KD limited to the Ras cells or does it also happen in GFP control cells?
      6. Is the observed phenotype (dissemination and involvement of DE-cad) limited to RasV12 oncogene? Do authors have any experience with other oncogenes (e.g. Src)? If an adequate Drospihila strain is available to authors, it would be worth confirming that the presented findings are not limited to only one oncogene.
      7. Finally, the paper feels like a small, follow up story that is hugely dependent on author's 2020 paper (Lee et al. 2020). It may be quite hard to follow if the reader is not familiar with the 2020 paper, as eg. the current paper does not describe what the "dissemination process" actually is, instead it directs the reader to the 2020 paper (similarly the figure for reviewers consists of images taken from the 2020 paper, rather than the current one).

      Minor comments:

      1. How was junctional intensity of proteins quantified in samples that don't have obvious junctional staining (eg. Fig5b images and quantification for TG/ TG+GdCl3 in c and d or Image 5f and quantification in h). Do authors use an additional junctional marker to define where the junction is? If not, how is the position of junctions determined when the measured protein is dispersed from junctions?
      2. Is the calculation for IP3R-iGD1676 in Fig S3a correct? It doesn't match representative images presented in Fig3a or data in Fig3b, where IP3R-iGD1676 is not any different from other IR3R knock downs. If the quantification in Fig S3a is correct, the image presented in Fig3a is not representative of the majority of phenotype in this condition.
      3. Can delamination of Ras cells (Fig S1b and S3b) be quantified? It's quite challenging to assess the number of delaminated cells (especially in S3b).
      4. It would be good to quantitate the change in Arm (Fig 3d-h), in addition to showing representative images.
      5. Can authors describe the names used for labelling graphs in the figure captions? Some of the names used are not obvious for readers not already familiar with them eg. labels presented FigS3c - it's not obvious what norpA is, and the figure captions don't help at all.
      6. Is the antibody used for DE-cad able to distinguish between DE-cad and other cadherins (especially N-cadherin, which is commonly upregulated during EMT).

      Significance

      We believe that after addressing the above concerns, the paper may provide an informative advance to current knowledge about dissemination of early cancer cells leading to cancer metastasis. As authors mentioned, literature is not clear about involvement of E-cadherin in cancer metastasis. It was first suggested that loss of E-cadherin increases the metastatic potential of tumourigenic cells (Berx et al. 1995; Bogenrieder et al. 2003), however, more recent work indicated that the loss of E-cadherin, while increasing invasiveness, decreases metastatic potential, as well as cell proliferation and survival of circulating tumour cells (Padamanaban et al. 2019), indicating that the role of E-cadherin in cancer is much more complicated than previously appreciated. Thus, assessing the role of E-cadherin during cancer cell delamination in vivo may be a powerful and informative tool to bring some clarity to the cancer biology field.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the reviewers

      We sincerely thank the reviewers for their thoughtful and helpful input. Below were have included our response to the points raised by our reviewers.

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: Reichling et al. used flow-injection time-of-flight mass spectrometry (FIA-MS), a chromatography-free high-throughput method of untargeted metabolomics, to assess the temporary changes in the concentrations of polar metabolites within budding yeast exposed to the TORC1 (target of rapamycin complex 1) inhibitor rapamycin. The water-soluble metabolomes of the rapamycin-treated wild-type (WT) strain, many mutants in the non-essential genes lacking known protein components of the TORC1-upstream and -downstream signaling and numerous gene-deletion mutants missing the redundant small-molecule protein receptors that might participate in TORC1 signaling were analyzed by FIA-MS. The quality of the quantitative metabolome profiling performed by the FIA-MS method was validated with the help of a time-consuming liquid chromatography/mass spectrometry-based metabolomics of WT cells. *

      * Using the FIA-MS-based metabolome profiling, the authors revealed that rapamycin treatment upregulates or downregulates the polar metabolomes specific to a distinct set of cellular processes. Similar patterns of the temporal dynamics of rapamycin-induced changes in the metabolomes characteristic of these cellular processes were observed in WT cells and mutants deficient in known protein components of TORC1 signaling. Remarkably, the authors found that three mutants (i.e., BCK1, CLA4 and CFF1) impaired in the small-molecule protein receptors that were unknown for their roles in TORC1 signaling exhibit the temporal dynamics of rapamycin-induced metabolome changes comparable to the mutants deficient in the known positive protein regulators of TORC1 signaling. Furthermore, the comparative metabolome-based analyses of relationships between rapamycin-treated cells defective in the many known and several unknown protein regulators of TORC1 signaling allowed authors to conclude that such analyses objectively reflect the functional connectivity of these protein regulators. Moreover, the authors provided evidence that comparing the metabolome profiles in rapamycin-treated WT and TORC1 signaling mutant cells enables the identification of new metabolic reactions and pathways affected by and/or contributing to the TORC1-dependent nutrient signaling network. *

      * In this proof-of-principle study, the authors also employed the liquid chromatography-tandem mass spectrometry for a quantitative proteomic comparison of rapamycin-treated WT cells and mutants impaired in the known and previously unknown protein components of TORC1 signaling. They found that rapamycin induces comparable changes in the proteomes of all these cells. This proteomic analysis confirmed that the small-molecule protein receptors previously unknown as TORC1 signaling components and identified as such components only with the help of dynamic metabolome are integrated into the TORC1 signaling network. The authors further confirmed the essential contribution of the novel protein components of the TORC1 signaling to this type of nutrient signaling in the experiments on measuring growth rate changes following a nitrogen source upshift and assessing metabolome and proteome alterations after a nitrogen source downshift.

      Comments: The manuscript is clearly written and of high technical quality. All claims are convincing, fully supported by the experimental data and appropriately discussed in the context of previous literature. The authors have been fair in their treatment of previous literature. They provided the methodological detail sufficient for others to reproduce the experiments. No additional experiments are needed to support the claims made in the manuscript. The experiments are adequately replicated and statistical analysis is appropriately performed. *

      * Reviewer #1 (Significance (Required)):

      This pioneering study represents a major conceptual advancement in the fields of using condition-specific, dynamic metabolome profiling for the deep understanding of relationships between genes integrated into a signaling pathway, discovering novel genes incorporated into a cellular signaling network and its small-molecule regulators, and identifying the metabolic pathways governed by a signal transduction network. *

      * The treatment of the existing literature on the research topic by the manuscript's authors is balanced and fair. *

      * This well-organized and clearly written manuscript is a must-read for anyone interested in the molecular mechanisms of cellular signaling. *

      * My field of expertise involves exploring the molecular dynamics of complex cellular processes using advanced genetic, cell biological and biochemical approaches (including the mass spectrometry-based analyses of the cellular proteomes, lipidomes and water-soluble metabolomes). *

      * I recommend accepting this manuscript for publication in any journal affiliated with Review Commons. **

      *

      We thank reviewer 1 heartily for their comments and for the time they spent reviewing our work.*

      ** Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Rechiling et al., have used a unique approach by exploiting a temporal dynamic high-throughput metabolome profiling (using flow-injection time-of-flight mass spectrometry) to measure the metabolome profiles of many mutants in yeast, which allow them to newly annotate 3 genes in the TOR signaling pathway. This work demonstrate elegantly that dynamic perturbation of the cell allows inferring gene function when using a metabolomics-based guilt-by-association scheme. They were able to successfully find genes like CFF1, BCK1 and CLA4 which might act as positive regulators in the TOR pathway. This an interesting study since it provides an alternative approach to annotate gene function and their contribution to known signaling pathways by analyzing the dynamic of the soluble metabolome. Overall, the manuscript was concisely well written, and the finding has a great potential to improve our understanding of gene function and genetic determinism of metabolism in model organisms.

      Major comments: As the cellular response to rapamycin is not restricted to changes at the level of the metabolome, the authors investigated the proteomic response of each mutant to reaffirm their functional relationship to the TOR pathway. In this regard, it is not clear why the author did not consider a time-course analysis of the proteome as they did for the metabolome. The measurable steady-state proteomic signature might also reflect a buffered cellular state which might hide other response.

      *

      A time course analysis of the proteome for all mutants would be extremely interesting. However, due to the time and cost of proteomic analysis we instead focused on one time point for rapamycin treatment in the context of proteomics analysis. The proteome response data from mutants treated with rapamycin for 1 hour was used to identify the altered proteome responses of the mutants presented in figure 3A. This allowed us to explore the proteome responses of the mutants to rapamycin, and overcome the cellular buffering that our reviewer correctly points to.

      * **Minor comments: Many typos in the methodology section (e.g., potassium phosphate ...) *

      The typos in the methods section have been corrected.

      * **Supplementary Fig 1: Not clear what this figure show and how it supports the author's claim that FIA-MS was validated by LC-MS. *

      The positive linear relationship between the values for the measured metabolites by FIA-MS and LC-MS indicate that similar results were obtained with both methods. We have altered the text to make this clearer.*

      **Reviewer #2 (Significance (Required)):

      This an interesting study since it provides an alternative approach to annotate gene function and their contribution to known signaling pathways by analyzing the dynamic of the soluble metabolome. The finding has a great potential to improve our understanding of gene function and genetic determinism of metabolic adaptation in model organisms. *

      We thank reviewer 2 for their helpful comments, and the time they spent reviewing our work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Rechiling et al., have used a unique approach by exploiting a temporal dynamic high-throughput metabolome profiling (using flow-injection time-of-flight mass spectrometry) to measure the metabolome profiles of many mutants in yeast, which allow them to newly annotate 3 genes in the TOR signaling pathway. This work demonstrate elegantly that dynamic perturbation of the cell allows inferring gene function when using a metabolomics-based guilt-by-association scheme. They were able to successfully find genes like CFF1, BCK1 and CLA4 which might act as positive regulators in the TOR pathway.

      This an interesting study since it provides an alternative approach to annotate gene function and their contribution to known signaling pathways by analyzing the dynamic of the soluble metabolome. Overall, the manuscript was concisely well written, and the finding has a great potential to improve our understanding of gene function and genetic determinism of metabolism in model organisms.

      Major comments:

      As the cellular response to rapamycin is not restricted to changes at the level of the metabolome, the authors investigated the proteomic response of each mutant to reaffirm their functional relationship to the TOR pathway. In this regard, it is not clear why the author did not consider a time-course analysis of the proteome as they did for the metabolome. The measurable steady-state proteomic signature might also reflect a buffered cellular state which might hide other response.

      Minor comments:

      Many typos in the methodology section (e.g., potassium phosphate ...) Supplementary Fig 1: Not clear what this figure show and how it supports the author's claim that FIA-MS was validated by LC-MS.

      Significance

      This an interesting study since it provides an alternative approach to annotate gene function and their contribution to known signaling pathways by analyzing the dynamic of the soluble metabolome. The finding has a great potential to improve our understanding of gene function and genetic determinism of metabolic adaptation in model organisms.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Reichling et al. used flow-injection time-of-flight mass spectrometry (FIA-MS), a chromatography-free high-throughput method of untargeted metabolomics, to assess the temporary changes in the concentrations of polar metabolites within budding yeast exposed to the TORC1 (target of rapamycin complex 1) inhibitor rapamycin. The water-soluble metabolomes of the rapamycin-treated wild-type (WT) strain, many mutants in the non-essential genes lacking known protein components of the TORC1-upstream and -downstream signaling and numerous gene-deletion mutants missing the redundant small-molecule protein receptors that might participate in TORC1 signaling were analyzed by FIA-MS. The quality of the quantitative metabolome profiling performed by the FIA-MS method was validated with the help of a time-consuming liquid chromatography/mass spectrometry-based metabolomics of WT cells.

      Using the FIA-MS-based metabolome profiling, the authors revealed that rapamycin treatment upregulates or downregulates the polar metabolomes specific to a distinct set of cellular processes. Similar patterns of the temporal dynamics of rapamycin-induced changes in the metabolomes characteristic of these cellular processes were observed in WT cells and mutants deficient in known protein components of TORC1 signaling. Remarkably, the authors found that three mutants (i.e., BCK1, CLA4 and CFF1) impaired in the small-molecule protein receptors that were unknown for their roles in TORC1 signaling exhibit the temporal dynamics of rapamycin-induced metabolome changes comparable to the mutants deficient in the known positive protein regulators of TORC1 signaling. Furthermore, the comparative metabolome-based analyses of relationships between rapamycin-treated cells defective in the many known and several unknown protein regulators of TORC1 signaling allowed authors to conclude that such analyses objectively reflect the functional connectivity of these protein regulators. Moreover, the authors provided evidence that comparing the metabolome profiles in rapamycin-treated WT and TORC1 signaling mutant cells enables the identification of new metabolic reactions and pathways affected by and/or contributing to the TORC1-dependent nutrient signaling network.

      In this proof-of-principle study, the authors also employed the liquid chromatography-tandem mass spectrometry for a quantitative proteomic comparison of rapamycin-treated WT cells and mutants impaired in the known and previously unknown protein components of TORC1 signaling. They found that rapamycin induces comparable changes in the proteomes of all these cells. This proteomic analysis confirmed that the small-molecule protein receptors previously unknown as TORC1 signaling components and identified as such components only with the help of dynamic metabolome are integrated into the TORC1 signaling network. The authors further confirmed the essential contribution of the novel protein components of the TORC1 signaling to this type of nutrient signaling in the experiments on measuring growth rate changes following a nitrogen source upshift and assessing metabolome and proteome alterations after a nitrogen source downshift.

      Comments:

      The manuscript is clearly written and of high technical quality. All claims are convincing, fully supported by the experimental data and appropriately discussed in the context of previous literature. The authors have been fair in their treatment of previous literature. They provided the methodological detail sufficient for others to reproduce the experiments. No additional experiments are needed to support the claims made in the manuscript. The experiments are adequately replicated and statistical analysis is appropriately performed.

      Significance

      This pioneering study represents a major conceptual advancement in the fields of using condition-specific, dynamic metabolome profiling for the deep understanding of relationships between genes integrated into a signaling pathway, discovering novel genes incorporated into a cellular signaling network and its small-molecule regulators, and identifying the metabolic pathways governed by a signal transduction network.

      The treatment of the existing literature on the research topic by the manuscript's authors is balanced and fair.

      This well-organized and clearly written manuscript is a must-read for anyone interested in the molecular mechanisms of cellular signaling.

      My field of expertise involves exploring the molecular dynamics of complex cellular processes using advanced genetic, cell biological and biochemical approaches (including the mass spectrometry-based analyses of the cellular proteomes, lipidomes and water-soluble metabolomes).

      I recommend accepting this manuscript for publication in any journal affiliated with Review Commons.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 CROSS-CONSULTATION COMMENTS

      All reviewers have similar doubts on quality and quantification of the IHC analyses, so I would agree to give a chance to the authors for a revision, and it is probably doable in between 3 and 6 months (I wrote >6 before).

      Response: In our fist submission, our full PDF file did not use the high-resolution figures due to large file size of some images. For re-submission, we uploaded our original, high-resolution figures. We also re-performed several immunostaining experiments and took confocal images to increase the quality of our figures. Please see below for detailed response.

      Reviewer #1 (Evidence, reproducibility, and clarity (Required)):

      Summary:

      In this study, Wu et al. confirmed the prominent cross talk of the Hippo pathway in beta-cell development, namely of Ngn3-YAP and beta-cell maturation. Deletion of the Hippo kinases Lats1 and 2 in Ngn3-expressing endocrine progenitor cells activated YAP1/TAZ transcriptional activity and reduced islets size and morphology and induced pancreas inflammation. This is in line with the previously established role of YAP/TAZ as regulator of viral response pathways In contrast, deletion of Lats1&2 later in development had no effect on beta-cell function and morphology. The authors conclude that Hippo pathway-mediated YAP1/TAZ inhibition in endocrine progenitors is a prerequisite for beta-cell maturation.While the effect of Lats1 and 2 in beta-cell development has never been investigated, the outcome of the study is largely confirmative; namely YAP's necessity to switch off whenever an endocrine cell is formed while it also balances inflammation.

      The study depends on cell-specific Lats1 and 2-KO mouse models and in-vitro assessments of mechanisms how loss of LATS leads to beta-cell derangement at the time of ngn3 expression and how the inflammatory pathway is activated are missing. Observations are mostly based on stainings of quite low quality and it is unclear how authors performed quantitative evaluations and how many cells from how many mice were counted. Figures show high background, e.g. CD45 in Fig.3 which would make a robust evaluation impossible. Also YAP staining, usually well-expressed in ductal cells, shows low quality and high background. Usually the antibody is well-know for its weak performance in fluorescence and should only be used with chromogenic labels.

      Response: ____ To show the co-localization of multiple proteins, immunofluorescence staining is necessary. ____We added and described, in detail, how we performed macrophage quantification in the Methods section.

      Reviewer #2 (Evidence, reproducibility, and clarity (Required)):

      Summary:

      Wu et al. examined the roles of Hippo pathway mediated YAP1/TAZ inhibition in the development stages of endocrine specification and differentiation in vitro and in vivo. This study concluded Hippo pathway-mediated YAP1/TAZ inhibition in endocrine progenitors is a prerequisite for endocrine specification and differentiation. The present study is conducted by solid experiment in some parts, however, there are several major concerns as follows.

      Major comments:

      Authors concluded that proper Hippo activity was required for the Ngn3 driven differentiation program, further expanding our fundamental understanding of Hippo pathway participation in pancreatic endocrine development. This sentence is too vague. Previous study has already reported that Ngn3 expression and Yap loss occur in parallel within the same cell during development of the endocrine pancreas (Mol Endocrinol Baltim Md. 2015;29: 1594-607. doi:10.1210/me.2014-1375).

      Response: ____ As reviewer pointed out, the previous study only reported that Ngn3 expression and YAP1 loss occur in parallel within the same cells, and NGN3 can turn off Yap1 transcription in cell culture setting. However, our study showed that NGN3 alone is not sufficient to turn off Yap1 gene in vivo. Using our genetic model, we revealed that the Hippo pathway controling the nuclear localization of YAP1 is essential at the initiation of endocrine differentiation and allows YAP1 to be turned off at the transcriptional level. ____Our rescue model (NLTY mice) confirmed that the endocrine development defects observed in NL pancreas are caused by YAP1/TAZ, suggesting the necessity of controlling YAP1/TAZ by the Hippo pathway during endocrine development. Failure to sequester YAP1/TAZ outside of nuclei by the Hippo pathway in Ngn3-expressing endocrine progenitors halts the development of these cells.

      Reviewer #2:

      In the present study, removal of YAP1/TAZ rescued the defect in endocrine specification and differentiation in LATS1/2-null pancreas. These results indicate that both of LATS1/2 and YAP/TAZ are not essential for the normal endocrine specification and differentiation. Furthermore, these results suggest that Hippo pathway-mediated YAP1/TAZ inhibition is also unnecessary for proper pancreatic endocrine specification and differentiation. Additionally, these results suggest that just a deletion of YAP/TAZ is sufficient for endocrine specification and differentiation. How is the endocrine specification and differentiation in Ngn3creyap1fl/flTazfl/fl mice?

      Response: ____ Our NL model (deletion of Lats1/2 in Ngn3-expressing cells) showed endocrine development defects, suggesting that Lats1/2 are essential for normal endocrine specification and differentiation. Removal of YAP1/TAZ in Lats1/2 null cells rescued the endocrine defects of NL pancreas. Genetic removal of YAP1/TAZ mimics Hippo pathway-mediated sequestration/degradation of YAP1/TAZ by LATS1/2. Uncontrolled YAP1/TAZ due to genetic removal of Lats1/2 blocked the endocrine specification and differentiation. This further demonstrates the necessity for functional Lats1/2 for inhibition of YAP1/TAZ in Ngn3-expressing cells during endocrine development. Ngn3CreYap1fl/flTazfl/fl mice have no endocrine defects, which is consistent with previous reports that Ngn3 expression and YAP1 loss occur in parallel within the same cells. Thus, we did not present this result.

      Reviewer #2:

      In Figure 2B and 3B, YFP expression (an indicator for LATS1/2 deletion) is more detectable in control compared to LATS1/2 null mice, suggesting that the LATS1/2 expression is more decreased in control mice. Is this true?

      Response:____ YFP expression shows Cre activity which can be used as an indicator for Lats1/2 deletion. However, the expression level of YFP does not correlate with the expression level of Lats1/2 because YFP is controlled by the Rosa26 gene promoter. We did observe a higher level of YFP expression in endocrine cells compared with non-endocrine cells in NL pancreas. It is possible that the Rosa26 promoter is more active in endocrine cells. However, this is beyond the scope of our research.

      Reviewer #2:

      Please show the expression of YFP, NGN3, and YAP/TAZ in figure 6A to confirm their expression status.

      Response:____ In Figure 6, we intend to show that loss of YAP1/TAZ can rescue NL mice defects at P1 where Ngn3 expression is low. The NLTY pancreas showed much more endocrine cells compared to NL pancreas, suggesting that Lats1&2 functions to control YAP1/TAZ. Genetic ablation of YAP1/TAZ in Ngn3-expressing endocrine cells equals sequestering of YAP1/TAZ outside of nuclei by LATS1/2. Thus, we did not show expression of YFP, NGN3, and YAP1/TAZ in Figure 6A. Instead, we reperformed immunostaining for Figure 5B and took high-magnification confocal images to show expression pattern of NGN3 and YAP1 in endocrine progenitors.

      Reviewer #2:

      Please present the sequential changes of LATS1/2, YAP/TAZ, NGN3 expressions in the control mice and LATS1/2 null mice at least E12.5, E16.5, and P1, to make ii easy to understand them for general readers.

      Response:____ At E12.5, the Hippo pathway plays important roles in pancreas development. See reference 11. Ngn3 peaks at E15.5 and only expresses in endocrine progenitors. We have published a review paper (ref. 7) “Wu Y, Aegerter P, Nipper M, Ramjit L, Liu J, Wang P. Hippo Signaling Pathway in Pancreas Development. Front Cell Dev Biol. 2021;9: 663906. doi:10.3389/fcell.2021.663906” as a detailed background for this manuscript, including pancreas development, up-to-date publication on Hippo pathway in pancreas development, and a model for YAP1 function in endocrine development. Thus, we did not include this background information in this manuscript.

      Reviewer #2:

      Minor comments: Please describe the affiliation of authors (number 3 and 4).

      Response:____ It has been included in the manuscript: 3Department of Molecular Medicine, 4Department of Population Health Sciences.

      Reviewer #2:

      In table 1, the details of secondary antibodies were not described.

      Response:____ We have added tables to show all primary and secondary antibodies used in the paper.

      Reviewer #2 (Significance (Required)): Previous studies have already reported that Ngn3 expression and Yap loss occur in parallel manner during development of the endocrine pancreas. The primary aim of the present study is whether the YAP loss is mediated by LATS1/2 in Ngn3 positive cells.

      Response:____ The primary aim of our study is to understand if the Hippo pathway plays an important function in endocrine pancreas development.

      Reviewer #2:

      Furthermore, although authors concluded that Hippo pathway (LATS1/2)-mediated YAP1/TAZ inhibition is essential for proper pancreatic endocrine specification and differentiation, the null condition both of LATS1/2 and YAP/TAZ in Ngn3 positive cells provided the normal endocrine specification and differentiation (Figure 6). These findings do not support the conclusion.

      Response:____ The defects in our NL (Lats1/2 null) pancreas in endocrine development strongly suggest the essential role of Hippo pathway in endocrine development. The null conditions of both Lats1/2 and Yap1/Taz (NLTY mice) mimic the effects of LATS1/2-mediated control of YAP1/TAZ in endocrine progenitors via genetic ablation rather than through the canonical Hippo kinase cascade. This further demonstrates the importance of proper Hippo signaling during endocrine development to inhibit YAP1/TAZ via the Hippo kinase cascade in Ngn3-expressing endocrine progenitors.

      Reviewer #3 (Evidence, reproducibility, and clarity (Required)):

      Summary: In this study, Wu et al. use murine Cre-lox model systems to demonstrate that the core Hippo pathway components, Lats1/2, promote pancreatic endocrine specification and differentiation through Yap1/Taz. The roles of the Hippo pathway in mammals are complicated and context-dependent. Prior studies have implicated the core kinase cascade of the Hippo pathway, containing MST1/2, LATS1/2 and YAP1/TAZ, in pancreatic cell lineage differentiation and morphogenesis. However, the necessity of the Hippo pathway in the development of the endocrine pancreas in vivo remains unsettled. This study by Wu et al. demonstrates that the Hippo pathway is essential for endocrine progenitor specification and differentiation but not for pancreatic beta cell function in mice. Their results are in line with prior studies, but some major issues ensue largely due to the lack of data quantification to substantiate the authors' claims, the use of ambiguous/imprecise terminologies, and making unsubstantiated claims.

      Major comments:

      Immunofluorescence was used to evaluate the expression, co-localization, and subcellular localization of proteins of interest (Figures 2~7). However, except for the estimation of macrophage densities (4D and 6C), none of the other immunofluorescence experiments are accompanied by quantifications and statistical analyses to substantiate the authors' claims. In addition, the red channel in several figure panels (eg. 2A and 4A) was suboptimal making interpretations very difficult. Quantification of immunofluorescence data can be done by manual counting or automated counting using software such as ImageJ or QuPath, followed by statistical analyses to provide objective evidence.

      Response:____ We did not upload high resolution images for Bioxriv publication. We now uploaded the high resolution images. We also repeated a few immunostaining experiments and have taken confocal images to increase clarity.

      Reviewer #3:

      The endocrine component constitutes only a portion of the pancreas. The authors use whole pancreases to compare the expression levels of cell type-specific or Yap1 target genes between the knockout and control mice through qPCR. While there may be technical feasibility reasons limiting the direct assessments of gene expression in the endocrine progenitor cells, the caveats of the experiments (eg. inferring cell type-specific gene expression changes from whole organ) should be highlighted along with any inconsistent results. For example, among the three ductal genes tested in Figure 1C, only Krt19 has significantly higher mRNA expression in NL vs control, but the possible reasons for such discrepancy between ductal markers were not discussed. The authors used immunofluorescence, which is only semi-quantitative, to determine that Yap1 protein abundance is increased in Lats1&2 knockout cells. qPCR should also be performed for Yap1 in addition to Yap1 target genes to augment their claim of Yap1 expression being increased. More details on how the relative mRNA expression was computed is also necessary for readers to accurately interpret the results.

      Response:____ As reviewer pointed out that there are technical feasibility reasons limiting the direct assessments of gene expression in the endocrine progenitor cells. We can only quantify gene expression level through RT-qPCR. We have discussed the high level of Krt19 and unchanged level of Sox9 in Figure 3B and 3C when we performed immunostaining. The results from the two experiments are consistent. We observed KRT19 positive staining but no SOX9 staining in Lats1/2 null Ngn3-expressing cells. This result is consistent with RT-qPCR result. We have postulated in the Discussion section that Krt19 may be directly controlled by YAP1.

      Furthermore, Lats1/2 controls the protein level and localization of YAP1/TAZ, thus immunostaining can show the cellular localization.

      In addition, we added the n’s used to perform RT-qPCR experiments to our manuscript and figures. We also added details to provide clarity on calculations made for relative mRNA expression from RT-qPCR experiments in the Methods section.

      Reviewer #3:

      The increased immunofluorescent detection of Yap1 in the NL pancreases at E16.5 is interesting but warrants further investigation. Is the increase restricted to endocrine progenitor cells or all endocrine compartments? George et al. (Mol Endocrinol. 2015) used RNA in situ hybridization and found that Yap1 mRNA expression is undetectable in the endocrine pancreas at E16.5, but here the authors observe increased Yap1 protein detected by immunofluorescence in the pancreases of animals with Lats1&2 knockout at E16.5. Although the authors speculate on a possible mechanism that may explain the discrepancies, it is important to evaluate whether the knockout really results in reactivation of Yap1 transcription and whether Yap1 auto-regulates on the transcriptional level.

      Response:____ The increase of YAP1 staining is restricted to endocrine progenitor cells and blocks the endocrine differentiation. The endocrine cells in NL pancreas are escapers of Lats1/2 deletion and have no YAP1 expression, which is consistent with George et al. findings. The model we proposed is that in Ngn3-expressing cells, Lats1/2 are required to keep YAP1/TAZ out of nuclei so that Ngn3 can repress YAP1 expression. Loss of Lats1/2 led to high nuclear YAP1 which may block Ngn3’s ability to suppress YAP1. ChIP-seq or ChIP-PCR on YAP1 promoter will answer the question whether YAP1 auto-regulates on the transcriptional level. However, the small number of cells in mouse pancreas limits us to perform this experiment.

      Reviewer #3:

      The numbers of animals used are unclear except for those depicted by the barplots. There is also no mention of the number of cells or fields analyzed for immunofluorescence experiments, which is essential for any quantitative comparisons and claims. The n's should be added throughout.

      Response:____ We added n’s to all figures and in our manuscript.

      Reviewer #3:

      The authors repeatedly refer to the dataset from Cebola et al. (Nat Cell Biol 2015), which was generated from human embryonic pancreatic progenitor cells, to speculate that TEAD1 binds to the promoter regions of YAP1, CDH1 and KRT19 and therefore may promote YAP1 autoregulation or expression of CDH1 and KRT19, explaining some of their immunofluorescence observations. However, they fail to acknowledge potential differences that may ensue due to the different species examined. Co-staining of Yap1 and Cdh1/Krt19 would indicate whether co-expression of Yap1 and Cdh1/Krt19 is indeed evident in the context of their study and provide further evidence to support their speculations.

      Response:____ Reviewer is correct that the dataset of TEAD1 ChIP-seq from Cebola et al. was generated from human embryonic pancreatic progenitor cells. These data are in line with our mouse experimental results where cells with high YAP1, due to loss of Lats1&2, continue to have high YAP1, CDH1 and KRT19. We do not have evidence to point out the potential differences between mouse and human.

      Reviewer #3:

      Several observations are over-interpreted or over-stated, and should be qualified as preliminary or speculative with proper wording. For example, P14: "differentiation...was blocked by lack of expression of ISL1 and NKX2.2". Although the authors observe low Isl1 and Nkx2.2 staining in NL vs control pancreases, no experiments were done to substantiate the claim that the reduction in ISL1 and NKX2.2 directly block the differentiation in this context.

      Response:____ We did not claim that the reduction in ISL1 and NKX2.2 directly block the differentiation in NL pancreas. ISL1 and NKX2.2 are markers for endocrine differentiation. The lack of ISL1 and NKX2.2 expression indicates that endocrine differentiation has been blocked.

      Reviewer #3:

      P15: "...KRT19 expression is not controlled by SOX9, but instead by YAP1". The authors observe that Krt19 proteins are increased in Lats1&2-null Ngn3+ cells, whereas Sox9 proteins were unchanged. However, they do not provide evidence that Yap1 controls Sox9 expression.

      __Response:____ No Sox9 expression was observed in Ngn3-expressing cells in both Control and NL pancreases (Figure 3B and 3C) while we observed YAP1 nuclei staining in NGN3-positive cells (Figure 5B), suggesting that YAP1 does not control Sox9 expression.

      __

      Reviewer #3:

      Minor comments:

      It is mentioned that deletion of Lats1&2 in Ngn3+ cells results in fewer acinar cells and smaller islets, evident by reduction in Ins+ or Gcg+ cells, whereas such genetic ablation in pancreatic beta cells does not result in any phenotype. Did the deletion of Lats1&2 in Ngn3+ cells similarly lead to reduction in other endocrine cell types?

      Response:____ We showed that there was no positive staining for ISL1 and NKX2.2 in progeny of Lats1&2 null Ngn3-expressing cells, suggesting the block of endocrine differentiation including all endocrine cells. The INS+ and GCG+ cells in NL pancreas are the escapers in which Lats1&2 were not deleted. Other endocrine cells should be affected too. We observed smaller sized NL mice with low blood glucose levels. We have postulated that brain expression of Ngn3Cre may contribute to these phenotypes.

      Reviewer #3:

      The authors use the word "expression" without specification to refer to both mRNA expression and marker fluorescence levels throughout the text. This is inaccurate and potentially confusing. More specific terminologies should instead be used to avoid ambiguity.

      Response:____ We have added mRNA in the appropriate places where we discuss results from RT-qPCR experiments. All other places are protein expression results by immunostaining. We followed the general guideline for formatting gene and protein name throughout the manuscript: mouse gene symbols are italicized, with only the first letter in upper-case; protein symbols are not italicized, and all letters are in upper-case.

      Reviewer #3:

      Figure S1D is mis-referred to as S1E in the text. Figure 7G is missing. Table S1 is mis-referred to as S Table 2 in the text. Typo on page 19: "Controlcontrol" Figure S6B is mis-referred to as Figure S6A in the text.

      Response:____ We have made all appropriate changes to the manuscript to correct these mistakes.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      In this study, Wu et al. use murine Cre-lox model systems to demonstrate that the core Hippo pathway components, Lats1/2, promote pancreatic endocrine specification and differentiation through Yap1/Taz. The roles of the Hippo pathway in mammals are complicated and context-dependent. Prior studies have implicated the core kinase cascade of the Hippo pathway, containing MST1/2, LATS1/2 and YAP1/TAZ, in pancreatic cell lineage differentiation and morphogenesis. However, the necessity of the Hippo pathway in the development of the endocrine pancreas in vivo remains unsettled. This study by Wu et al. demonstrates that the Hippo pathway is essential for endocrine progenitor specification and differentiation but not for pancreatic beta cell function in mice. Their results are in line with prior studies, but some major issues ensue largely due to the lack of data quantification to substantiate the authors' claims, the use of ambiguous/imprecise terminologies, and making unsubstantiated claims.

      Major comments:

      1. Immunofluorescence was used to evaluate the expression, co-localization, and subcellular localization of proteins of interest (Figures 2~7). However, except for the estimation of macrophage densities (4D and 6C), none of the other immunofluorescence experiments are accompanied by quantifications and statistical analyses to substantiate the authors' claims. In addition, the red channel in several figure panels (eg. 2A and 4A) was suboptimal making interpretations very difficult. Quantification of immunofluorescence data can be done by manual counting or automated counting using software such as ImageJ or QuPath, followed by statistical analyses to provide objective evidence.
      2. The endocrine component constitutes only a portion of the pancreas. The authors use whole pancreases to compare the expression levels of cell type-specific or Yap1 target genes between the knockout and control mice through qPCR. While there may be technical feasibility reasons limiting the direct assessments of gene expression in the endocrine progenitor cells, the caveats of the experiments (eg. inferring cell type-specific gene expression changes from whole organ) should be highlighted along with any inconsistent results. For example, among the three ductal genes tested in Figure 1C, only Krt19 has significantly higher mRNA expression in NL vs control, but the possible reasons for such discrepancy between ductal markers were not discussed. The authors used immunofluorescence, which is only semi-quantitative, to determine that Yap1 protein abundance is increased in Lats1&2 knockout cells. qPCR should also be performed for Yap1 in addition to Yap1 target genes to augment their claim of Yap1 expression being increased. More details on how the relative mRNA expression was computed is also necessary for readers to accurately interpret the results.
      3. The increased immunofluorescent detection of Yap1 in the NL pancreases at E16.5 is interesting but warrants further investigation. Is the increase restricted to endocrine progenitor cells or all endocrine compartments? George et al. (Mol Endocrinol. 2015) used RNA in situ hybridization and found that Yap1 mRNA expression is undetectable in the endocrine pancreas at E16.5, but here the authors observe increased Yap1 protein detected by immunofluorescence in the pancreases of animals with Lats1&2 knockout at E16.5. Although the authors speculate on a possible mechanism that may explain the discrepancies, it is important to evaluate whether the knockout really results in reactivation of Yap1 transcription and whether Yap1 auto-regulates on the transcriptional level.
      4. The numbers of animals used are unclear except for those depicted by the barplots. There is also no mention of the number of cells or fields analyzed for immunofluorescence experiments, which is essential for any quantitative comparisons and claims. The n's should be added throughout.
      5. The authors repeatedly refer to the dataset from Cebola et al. (Nat Cell Biol 2015), which was generated from human embryonic pancreatic progenitor cells, to speculate that TEAD1 binds to the promoter regions of YAP1, CDH1 and KRT19 and therefore may promote YAP1 autoregulation or expression of CDH1 and KRT19, explaining some of their immunofluorescence observations. However, they fail to acknowledge potential differences that may ensue due to the different species examined. Co-staining of Yap1 and Cdh1/Krt19 would indicate whether co-expression of Yap1 and Cdh1/Krt19 is indeed evident in the context of their study and provide further evidence to support their speculations.
      6. Several observations are over-interpreted or over-stated, and should be qualified as preliminary or speculative with proper wording. For example, a. P14: "differentiation...was blocked by lack of expression of ISL1 and NKX2.2". Although the authors observe low Isl1 and Nkx2.2 staining in NL vs control pancreases, no experiments were done to substantiate the claim that the reduction in ISL1 and NKX2.2 directly block the differentiation in this context. b. P15: "...KRT19 expression is not controlled by SOX9, but instead by YAP1". The authors observe that Krt19 proteins are increased in Lats1&2-null Ngn3+ cells, whereas Sox9 proteins were unchanged. However, they do not provide evidence that Yap1 controls Sox9 expression.

      Minor comments:

      1. It is mentioned that deletion of Lats1&2 in Ngn3+ cells results in fewer acinar cells and smaller islets, evident by reduction in Ins+ or Gcg+ cells, whereas such genetic ablation in pancreatic beta cells does not result in any phenotype. Did the deletion of Lats1&2 in Ngn3+ cells similarly lead to reduction in other endocrine cell types?
      2. The authors use the word "expression" without specification to refer to both mRNA expression and marker fluorescence levels throughout the text. This is inaccurate and potentially confusing. More specific terminologies should instead be used to avoid ambiguity.
      3. Figure S1D is mis-referred to as S1E in the text.
      4. Figure 7G is missing.
      5. Table S1 is mis-referred to as S Table 2 in the text.
      6. Typo on page 19: "Controlcontrol"
      7. Figure S6B is mis-referred to as Figure S6A in the text.

      Significance

      The exact roles of the Hippo pathway in pancreas organogenesis are far from being delineated. The current study proposes a new model wherein the Hippo pathway kinase cascade acts to sequester and inhibit Yap1 in the cytosol for Ngn3 to drive endocrine specification, including inhibition of Yap1 expression. This work is potentially valuable to further our understanding of the physiological roles of the Hippo pathway in the context of endocrine pancreas development.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Wu et al. examined the roles of Hippo pathway mediated YAP1/TAZ inhibition in the development stages of endocrine specification and differentiation in vitro and in vivo. This study concluded Hippo pathway-mediated YAP1/TAZ inhibition in endocrine progenitors is a prerequisite for endocrine specification and differentiation. The present study is conducted by solid experiment in some parts, however, there are several major concerns as follows.

      Major comments:

      • Authors concluded that proper Hippo activity was required for the Ngn3 driven differentiation program, further expanding our fundamental understanding of Hippo pathway participation in pancreatic endocrine development. This sentence is too vague. Previous study has already reported that Ngn3 expression and Yap loss occur in parallel within the same cell during development of the endocrine pancreas (Mol Endocrinol Baltim Md. 2015;29: 1594-607. doi:10.1210/me.2014-1375).
      • In the present study, removal of YAP1/TAZ rescued the defect in endocrine specification and differentiation in LATS1/2-null pancreas. These results indicate that both of LATS1/2 and YAP/TAZ are not essential for the normal endocrine specification and differentiation. Furthermore, these results suggest that Hippo pathway-mediated YAP1/TAZ inhibition is also unnecessary for proper pancreatic endocrine specification and differentiation. Additionally, these results suggest that just a deletion of YAP/TAZ is sufficient for endocrine specification and differentiation. How is the endocrine specification and differentiation in Ngn3creyap1fl/flTazfl/fl mice?
      • In Figure 2B and 3B, YFP expression (an indicator for LATS1/2 deletion) is more detectable in control compared to LATS1/2 null mice, suggesting that the LATS1/2 expression is more decreased in control mice. Is this true?
      • Please show the expression of YFP, NGN3, and YAP/TAZ in figure 6A to confirm their expression status.
      • Please present the sequential changes of LATS1/2, YAP/TAZ, NGN3 expressions in the control mice and LATS1/2 null mice at least E12.5, E16.5, and P1, to make ii easy to understand them for general readers.

      Minor comments:

      Please describe the affiliation of authors (number 3 and 4).

      In table 1, the details of secondary antibodies were not described.

      Significance

      Previous studies have already reported that Ngn3 expression and Yap loss occur in parallel manner during development of the endocrine pancreas. The primary aim of the present study is whether the YAP loss is mediated by LATS1/2 in Ngn3 positive cells.

      Furthermore, although authors concluded that Hippo pathway (LATS1/2)-mediated YAP1/TAZ inhibition is essential for proper pancreatic endocrine specification and differentiation, the null condition both of LATS1/2 and YAP/TAZ in Ngn3 positive cells provided the normal endocrine specification and differentiation (Figure 6). These findings do not support the conclusion.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, Wu et al. confirmed the prominent cross talk of the Hippo pathway in beta-cell development, namely of Ngn3-YAP and beta-cell maturation. Deletion of the Hippo kinases Lats1 and 2 in Ngn3-expressing endocrine progenitor cells activated YAP1/TAZ transcriptional activity and reduced islets size and morphology and induced pancreas inflammation. This is in line with the previously established role of YAP/TAZ as regulator of viral response pathways In contrast, deletion of Lats1&2 later in development had no effect on beta-cell function and morphology. The authors conclude that Hippo pathway-mediated YAP1/TAZ inhibition in endocrine progenitors is a prerequisite for beta-cell maturation.

      While the effect of Lats1 and 2 in beta-cell development has never been investigated, the outcome of the study is largely confirmative; namely YAP's necessity to switch off whenever an endocrine cell is formed while it also balances inflammation. The study depends on cell-specific Lats1 and 2-KO mouse models and in-vitro assessments of mechanisms how loss of LATS leads to beta-cell derangement at the time of ngn3 expression and how the inflammatory pathway is activated are missing. Observations are mostly based on stainings of quite low quality and it is unclear how authors performed quantitative evaluations and how many cells from how many mice were counted. Figures show high background, e.g. CD45 in Fig.3 which would make a robust evaluation impossible. Also YAP staining, usually well-expressed in ductal cells, shows low quality and high background. Usually the antibody is well-know for its weak performance in fluorescence and should only be used with chromogenic labels.

      Referees cross-commenting

      All reviewers have similar doubts on quality and quantification of the IHC analyses, so I would agree to give a chance to the authors for a revision, and it is probably doable in between 3 and 6 months (I wrote >6 before).

      Significance

      Largely confirmative. Single model approach with relatively low quality of the IHC analyses.

      My expertise: inflammation, beta-cell, islets, Hippo pathway

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors compared the various multinucleated cells, osteoclasts, LCG and FBGC. Overall, the manuscript shows rigor in the analyses, and also very interesting approaches for retrieving mononuclear cells, for instance using DC-STAMP siRNA. This work adds very much to understanding the biological differences, as summarized in figure 6h. A lot of work in osteoclast field with for instance qPCR is hampered because, inevitably, a mix of mononuclear and multinucleated cells is always measured. Here, a solid attempt to separate those mixes with cell sorting and subsequent analysis on the mononuclear and multinucleated isolates, really adds. Choice of figures is good, also the extra info of the supplementary figures is relevant and makes it easy to read.

      Major and minor concerns:

      1. For osteoclasts, various markers exist for their biological characterization, for instance the ability to resorb bone. What, apart from the arrangement and number of nuclei, were the biological parameters that confirmed that the cells made by addition of IFN or IL-4 were LCG and FBGC? [Authors’ reply]. In order to address this point, we focused on gene sets that characterize LCGs and FBGCs. By doing so, we aimed to identify (i) lineage dependent factors and (ii) markers of LGCs and FBGCs. (See new Supplementary Figure 1B and C, New Supplementary Table 1 and highlighted text in Results). As expected, and in line with the lineage-determining factors, the transcriptomics comparison between mononucleated/multinucleated IFN-γ and IL-4-differentiated macrophages showed predominance of IFN-γ and IL-4-related pathways, respectively (Supplementary Figure 1B and C and Supplementary Table 1). Among known LGC and FBGC markers, we confirmed up-regulation of CCL7 [1] and CD86 [2], respectively.* As per the biological parameters, we indeed confirm that FBGCs show enhanced phagocytosis properties (Figure 5C) while LGCs can form granuloma-like clusters in vitro (Figure 4D and E). Altogether, we characterize LGCs and FBGCs with (i) polykaryon-specific nuclear arrangement, (ii) polykaryon-specific gene expression markers, (iii) previously shown and new phenotypic characteristics such as LGCs’ unique ability to form in vitro clusters containing CD3+ cells. *

      In fig 2c: did the authors perform stainings with isotype control antibodies? In my experience, quite often, antibodies stain mononuclear cells much intenser, since the cytoplasm is much more condense, less spread over a large area.

      [Authors’ reply]. According to the reviewer’s suggestion, we provide isotype control staining for MRC1 in IFN-g-stimulated mononucleated/multinucleated cells by ImageStream (left panel) and immunofluorescence in LGCs, FBGCs and osteoclasts (right panel). There was negligible staining with the isotype control antibody for MRC1 in both settings (Figure provided to the journal).

      *We did not observe a potential artefact of staining in multinucleated cells when compared to mononuclear cells. In fact, some markers of multinucleation such as B7-H3 is augmented in LGCs (Figure 4E). *

      Resorption assay in 6 is not clear. It is weird that osteoclasts apparently display so limited resorption? Also the traces are not typical for osteoclasts. Please explain.

      [Authors’ reply]. Human osteoclasts are cultured for 2 days on hydroxyapatite-coated plates and the amount of resorption is dependent on the healthy donor the peripheral blood is derived from. In addition to genetic variability, the support (hydroxyapatite) is different from dentine, which is also widely used for measuring osteoclast resorptive activity. The visualization of the human osteoclast resorption is made by transparency (area not coated by hydroxyapatite due to its resorption) on image J.

      Provide a better image Supplementary 2A, even at 250% the lettering is vague. What do the colours in 2A mean?

      [Authors’ reply]. *According to the reviewer’s suggestion, we now provide the Supplementary Figure 2A with better resolution. In STRING protein-protein interaction analysis, there is no particular meaning of the node color itself. *

      CROSS-CONSULTATION COMMENTS

      I have read the comments of the other two reviewers, and together. I absolutely agree with their additions, Indeed, supplementary tables are lacking, as well as there could be a bit more emphasis on the fact that it is all in vitro work. Together, I think the three of us are complementary in our comments, with good overlap as well. Any effort to stain for instance pathology material with the markers that have been found, would be great, especially for the LGC and the FBGC, that are much less studied in the field of MNGs. Having said that ,I can also live without this addition, but then it could be highlighted in the discussion that these are the future avenues that should be considered. Collaborate with Pathology!

      [Authors’ reply]. We appreciate that the reviewer provides cross-consultation comments which we address in our revised manuscript. As such, we discuss future avenues regarding the translatability of these results to human pathology involving MGCs.

      Reviewer #1 (Significance (Required)):

      This manuscript is particularly interesting to those who are interested in the BIOLOGY of MNCs. In essence, three types of MNCs were cultured and compared, with each of them a specific function.

      I am an osteoclast expert (76 publications), and have two publications on FBGCs

      [Authors’ reply]. *We sincerely thank the reviewer for his/her pertinent comments, enthusiasm for our findings and for providing us an overall summary of our findings in view of all other reviewer comments. *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, the authors performed a comparative transcriptome analysis of mononuclear and multinuclear human osteoclasts, LGCs and FBGCs. They found that multinucleation triggers a significant downregulation of macrophage identity in all three types of MGCs. Furthermore, RNA-seq data and in-vitro functional analysis of multinucleated cells showed that macrophage cell-cell fusion and multinucleation enhance phagocytosis and contribute to lysosome-dependent intracellular iron homeostasis. Furthermore, multinucleation of osteoclasts promoted mitochondrial activity and oxidative phosphorylation, resulting in maximal respiration. This unique and interesting study addresses the fundamental question of how cell-cell fusion and multinucleation contribute to cellular activity and biological homeostasis.

      Major comments

      1 The authors generated mature multinucleated cells by stimulating human PBMC-derived macrophages with either IFN-g, IL-4, or RANKL. However, no quantitative data have been presented to determine how effectively IL-4, IFN-g, and RANKL can induce multinucleated giant cells from mononuclear macrophages. Quantitative data showing induction efficiency would provide a more detailed picture of the overall experiment.

      [Authors’ reply]. According to the reviewer’s suggestion, quantitative data showing the efficiency of these cytokines to induce multinucleation (i.e. fusion index) is now provided as part of the revised Supplementary Figure 1A (right panel).

      2 The authors mentioned, "The distinct morphological appearance of these three types of MGCs (Figure 1B) suggested cell type-specific functional properties and shared mechanisms underlying macrophage multinucleation". However, there is no discussion or data showing how the nuclear arrangement and intracellular location affect the biological function of multinucleated cells.

      [Authors’ reply]. This is good point and is now discussed in the revised manuscript (see highlighted text in revised manuscript and below).

      Whether MGC-specific nuclear arrangements and/or numbers are indicative of specialized function is currently unclear. Intracellular nuclei arrangement is likely to be important for the sealing zone formation in a polarized bone-resorbing osteoclast. Furthermore, whether distinct transcriptional activities are assigned to different nuclei of the MGC also remain to be tested. Recent elegant work performed in multinucleated skeletal myofibers suggest transcriptional heterogeneity among the different nuclei of the polykaryon [3].

      3 Based on the results of DC-stamp knockdown experiments, the authors concluded that cell-cell fusion and multinucleation suppress the mononuclear phagocytic gene signature. However, to strengthen this hypothesis, it would be necessary to provide at least data showing that DC-stamp knockdown reduces the number of multinucleated cells.

      [Authors’ reply]. According to the reviewer’s suggestion, we provide data showing that DCSTAMP knockdown reduces multinucleation in LGCs and FBGCs (see below and new Supplementary Figure 2F). For human osteoclasts, the data was included in our previously published paper ([4] and figure provided to the journal).

      4 In Figure4, the authors showed that transcripts in LGCs were enriched for antigen presentation and adaptive immune system pathways. In addition, multinucleation of LGCs increased the surface expression of B7-H3 (CD276) and colocalized with CD3+ cells, suggesting that LGC multinucleation potentiates T cell activation. However, the authors did not present enough data to demonstrate the antigen-presenting ability of LGCs or their specific T cell activating capacity.

      [Authors’ reply]. We agree with the reviewer that our data on a potential role of LGCs’ on T cell activation is based on increased surface expression of B7-H3 and the unique CD3+ cluster forming ability of LGCs. In order to check for further markers of antigen presentation, we have performed MHC-1 and MHC-2 quantification by ImageStream in 3 types of MGCs (figure provided to the journal).

      Although there was no difference in MHC-I/MHC-2 between the mononucleated and multinucleated macrophages, the mean fluorescent intensity (MFI) range was the highest in IFN-g-stimulated macrophages, suggesting that LGCs may be better equipped for antigen presentation than the other 2 types of MGCs. A more comprehensive analysis of antigen presentation requires enzymatic digestion and isolation and phenotyping of LGCs from clusters in vitro and human tissues in vivo. This is a program of research that we have initiated as part of a separate study, which will focus on the in vivo relevance of the current findings such as the unique Ag presentation ability of LGCs in a non-sterile tissue environment.

      5 Figure 6 clearly shows that mature multinucleated osteoclasts exhibit increased ATP production and maximal respiration. However, the glycolytic pathway did not differ between mononuclear and multinuclear osteoclasts. No explanation for this observation has been provided. It is easy to understand that osteoclasts acquire ATP through aerobic respiration during multinucleation. But how NADPH, which is essential for its redox reaction, is produced? Is it by acquiring αKG from the glutamine pathway?

      [Authors’ reply]. This is a point worth expending (see also discussion; highlighted text). Osteoclast multinucleation is characterized by increased mitochondrial gene expression which also translates into increased spare respiratory capacity (SRC or maximal respiration). This metabolic rewiring does not modify glycolysis and basal respiration rate. As the reviewer correctly states, increased SRC may be a way to supply more ATP to the energy-demanding polykaryon.

      As per the production of NAD(P)H as an electron source for ETC, it could indeed be through glutamine rather than glucose usage in multinucleated osteoclasts. Furthermore, as iron is an essential cofactor for ETC activity through activity of iron-sulfur clusters, the mitochondrial concentration of iron is likely to be critical for the mitochondrial activity of multinucleated osteoclasts (see also discussion).

      Minor comments:

      6 Supplementary tables 1-6 were not provided.

      [Authors’ reply]. We apologize for this. The revised versions of supplementary tables are provided as part of the revised manuscript.

      7 Figure 2D right panel, difficult to see DAPI+ nuclei.

      [Authors’ reply]. Thanks for pointing this out. We have now replaced Figure 2D with a more pronounced DAPI+ nuclei.

      Reviewer #2 (Significance (Required)):

      Although it is well known that multinucleation of cells constantly occurs, especially in osteoclasts, skeletal muscle, and trophoblasts of the placenta, the biological significance of multinucleation and the intracellular functions of multinucleation are not well understood. In this unique study, three types of multinucleated cells were generated from human peripheral blood to elucidate the genetic and functional differences between mononucleated and multinucleated cells. Furthermore, by demonstrating the possibility that the morphological peculiarity of multinucleation can regulate cell function, this paper provides clues to understanding the underlying biology of multinucleated cells and how they maintain cell function in homeostatic and pathological settings.

      [Authors’ reply]. We thank the reviewer for finding our study unique and biologically meaningful. We also thank the reviewer for all the suggestions that improved significantly the overall message of the manuscript.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      The manuscript of Ahmadzadeh and Pereira et al is an interesting study of the fusion process key to the formation of multinucleated giant cells (MGCs). Our current ability to discriminate between different types of MGCs is limited, and there are gaps in our understanding of the molecular determinants of cell fusion. In this study, the authors isolated different MGC variants - osteoclasts, Langhans giant cells (LGCs) and foreign body giant cells (FBGCs) and identified common, as well as MGC-specific genes and pathways involved in the process of cell fusion. The approach of isolating and comparing different types of MGCs is novel, and the manuscript is well presented and written. However, due to the in vitro nature of the study, the physiological significance of the findings is unclear. I have further minor and major points for the authors to address, as detailed below.

      Minor comments:

      1. The approach to isolate the different MGCs using FACS and imaging technique is highly novel. However the difference between MGC subtypes isolated isn't immediately apparent beyond the morphological comparisons. In my opinion some of the results of MGC-specific assays from Figures 4, 5 and 6 can be included in Figure 1, e.g. TRAP staining and hydroxyapatite resorption for osteoclasts, to provide evidence of purity and specificity of each MGC subtype early on in the manuscript. Classical or canonical genes associated with each MGC subtype can also be highlighted in the volcano plots in Figure 1C, e.g ACP5, CTSK, TNFRSF11A for osteoclasts. [Authors’ reply]. We thank the reviewer for this point and we agree it is important to highlight markers for each polykaryon early in the manuscript. In accordance with this reviewers’ comment (and also with Reviewer 1’s point), we first verified existence of lineage-dependent factors and markers of LGCs and FBGCs as these cells are relatively less well-defined compared to osteoclasts. (New Supplementary Figure 1B and C and New Supplementary Table 1). As expected, and in line with the lineage-determining effects, the transcriptomics comparison between mononucleated/multinucleated IFN-γ and IL-4-differentiated macrophages showed predominance of IFN-γ and IL-4-related pathways, respectively (New Supplementary Figure 1B and C and New Supplementary Table 1). Among known LGC and FBGC markers, we confirmed up-regulation of CCL7 [1] and CD86 [2], respectively (New Supplementary Table 1). We have added this information in the revised manuscript (see highlighted text). Osteoclast phenotyping is provided by TRAP staining and resorption assay (Figure 6C) and we also confirm that CTSK is indeed significantly up-regulated upon multinucleation (LogFc=1.69; P=9.2 x 10E-6; highlighted in the revised manuscript).

      The overall decrease in phagocytic identity of all the MGCs, and the specific upregulation of phagocytic pathways in the FBGCs are conflicting. Are there subsets of phagocytic pathways that were down and upregulated during the formation of FBGCs?

      [Authors’ reply]. This is a very good point. As the reviewer indicates, the results suggest that subsets of phagocytic pathways are changed upon multinucleation. All three types of MGCs show a downregulation of transcripts that belong to Fc receptors and complement C1Q family. However only FBGCs show an up-regulation of S. Aureus bioparticle-mediated phagocytosis. Hence the exact surface receptors responsible for this pathogen clearance remain to be identified. FBGC phagocytosis is a complex process including non-canonical phagocytosis pathways and participation of increased membrane area and endoplasmic reticulum [5, 6]*. Whether these pathways are specifically induced in human FBGCs remain to be identified. We now discuss this point in the revised manuscript (see highlighted text in Discussion). *

      What are the identities of the mononuclear cells in each of the MGC experiment? They appeared to be quite heterogeneous based on the DEGs identified, beyond the common phagocyte signature. Can the authors comment on the difference between the mononuclear cells and whether this will affect the DEG analysis?

      [Authors’ reply]. This is also a very relevant point that we now address in the revised manuscript (New Supplementary Figure 1B and C; New Supplementary Table 1 and highlighted revised text in Results). The reviewer is correct that MGC-specific pathways are in line with the known function of each polykaryon (Figure 4A, 5A and 6A). To what extent lineage-dependent effects (e.g. IFN-g and IL-4) are conserved between the mononucleated and multinucleated state is yet to be determined. In order to address this point, we compared DEG in IFN-g and IL-4-differentiated mononucleated macrophages to the ones obtained in multinucleated macrophages (New Supplementary Figure 1B and C; New Supplementary Table 1). The results showed that the multinucleated cell state preserves the majority of the lineage-dependent pathways which are very significantly represented at the mononucleated cell state (e.g. IFN-g and IL-4-related pathways). Interestingly, although less significant, this analysis also showed pathways that were specific to the mononucleated or multinucleated state in IFN-γ-differentiated macrophages when compared to IL-4-differentiated ones and vice versa. (Supplementary Figure 1B and C). For instance, TRAF3-dependent IRF activation pathway is specific to mononucleated IFN-g-differentiated macrophages (Supplementary Figure 1B).

      The authors should also frame/discuss the findings in the context of diagnostic and therapeutic potentials to highlight the clinical significance of this study.

      [Authors’ reply]. We thank the reviewer for this point and we now discuss our results from a clinical/diagnostic perspective (see highlighted text in the Discussion and below).

      From a clinical perspective, since lysosome-regulated intracellular iron homeostasis appears to be a general condition for macrophage multinucleation across different tissues, its blockade may hold therapeutic potential. However, it is still unclear whether granulomatous disease can benefit from targeting LGC fusion. For non-granulomatous inflammatory diseases, inhibiting MGC formation by targeting lysosomes may be a therapeutic avenue. This approach would avoid FBGC-related adverse effects during foreign body reaction or inhibit the formation of MGCs of white adipose tissue during obesity. v-ATPase inhibitors have been previously proposed to inhibit osteoclast activity and bone resorption [7]* so their selective targeting in the lysosomal compartment may be generalized to other MGCs such as FBGCs. In addition to potential clinical translation, the results presented in this study require confirmation in tissues originating from human pathology involving MGCs. *

      Major comments:

      • As mentioned before, the physiological significance of the findings is unclear. Some form of in vivo data is needed to support some of the key conclusions of the study, e.g validating some of the markers of the pathways identified (common and MGC subtype-specific), and the role of lysosome-mediated iron homeostasis in multinucleation. The authors can make use of the FACs and imaging approaches they developed to look at MGCs in relevant tissues. [Authors’ reply]. This is an important point that we would like to explore in a comprehensive way. We have initiated a 2-year program to undertake a Multiplexed Immunohistochemistry (mIHC) using MILAN (Multiple Iterative Labeling by Antibody Neodeposition) https://www.lpcm.be/multiplex-ihc-milan approach in human biopsies using >100 antibodies. The current study is pivotal in selecting the gene targets (i.e. common and MGC-specific markers) for prioritization. We foresee to gain critical pathophysiological information about the tissue characteristics of MGCs. The reviewer would acknowledge that these high-throughput and biopsy-based initiatives are lengthy and not the primary scope of our current findings which set the foundation of major cellular events governing multinucleation in macrophages.

      Reviewer #3 (Significance (Required)):

      Significance:

      • The approach of isolating and comparing different types of MGCs is novel, and the findings certainly improved our understanding of the fusion processes of MGCs. However, the physiological role of these processes in health and disease that involve MGCs is still unclear due to the lack of in vivo data. The findings were discussed in quite a bit of detail in the context of current literature, though clinical impact was not explored. [Authors’ reply]. *We are grateful to Reviewer 3 for raising relevant and constructive points regarding the main findings. His/her review significantly improved the clarity of the overall manuscript. *

      We recognize our study lacks human clinical association, but we highlight the prospective translatability of our findings and the usage of donor-based human macrophages throughout the manuscript. As also recommended by Reviewer 1 in his/her cross-consultation, we discuss the potential clinical impact of our findings in the Discussion of our revised manuscript.

      • My background is bone biology with a very keen interest in osteoclast biology so arguably my knowledge on other MGCs eg LGCs and FBGCs is limited. References

      • Chen Y, Jiang H, Xiong J, Shang J, Chen Z, Wu A, Wang H: Insight into the Molecular Characteristics of Langhans Giant Cell by Combination of Laser Capture Microdissection and RNA Sequencing. J Inflamm Res 2022, 15:621-634.

      • McNally AK, Anderson JM: Foreign body-type multinucleated giant cells induced by interleukin-4 express select lymphocyte co-stimulatory molecules and are phenotypically distinct from osteoclasts and dendritic cells. Exp Mol Pathol 2011, 91(3):673-681.
      • Petrany MJ, Swoboda CO, Sun C, Chetal K, Chen X, Weirauch MT, Salomonis N, Millay DP: Single-nucleus RNA-seq identifies transcriptional heterogeneity in multinucleated skeletal myofibers. Nat Commun 2020, 11(1):6374.
      • Pereira M, Ko JH, Logan J, Protheroe H, Kim KB, Tan ALM, Croucher PI, Park KS, Rotival M, Petretto E et al: A trans-eQTL network regulates osteoclast multinucleation and bone mass. Elife 2020, 9.
      • McNally AK, Anderson JM: Multinucleated giant cell formation exhibits features of phagocytosis with participation of the endoplasmic reticulum. Exp Mol Pathol 2005, 79(2):126-135.
      • Milde R, Ritter J, Tennent GA, Loesch A, Martinez FO, Gordon S, Pepys MB, Verschoor A, Helming L: Multinucleated Giant Cells Are Specialized for Complement-Mediated Phagocytosis and Large Target Destruction. Cell Rep 2015, 13(9):1937-1948.
      • Qin A, Cheng TS, Pavlos NJ, Lin Z, Dai KR, Zheng MH: V-ATPases in osteoclasts: structure, function and potential inhibitors of bone resorption. Int J Biochem Cell Biol 2012, 44(9):1422-1435.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manuscript of Ahmadzadeh and Pereira et al is an interesting study of the fusion process key to the formation of multinucleated giant cells (MGCs). Our current ability to discriminate between different types of MGCs is limited, and there are gaps in our understanding of the molecular determinants of cell fusion. In this study, the authors isolated different MGC variants - osteoclasts, Langhans giant cells (LGCs) and foreign body giant cells (FBGCs) and identified common, as well as MGC-specific genes and pathways involved in the process of cell fusion. The approach of isolating and comparing different types of MGCs is novel, and the manuscript is well presented and written. However, due to the in vitro nature of the study, the physiological significance of the findings is unclear. I have further minor and major points for the authors to address, as detailed below.

      Minor comments:

      • The approach to isolate the different MGCs using FACS and imaging technique is highly novel. However the difference between MGC subtypes isolated isn't immediately apparent beyond the morphological comparisons. In my opinion some of the results of MGC-specific assays from Figures 4, 5 and 6 can be included in Figure 1, e.g. TRAP staining and hydroxyapatite resorption for osteoclasts, to provide evidence of purity and specificity of each MGC subtype early on in the manuscript. Classical or canonical genes associated with each MGC subtype can also be highlighted in the volcano plots in Figure 1C, e.g ACP5, CTSK, TNFRSF11A for osteoclasts.

      • The overall decrease in phagocytic identity of all the MGCs, and the specific upregulation of phagocytic pathways in the FBGCs are conflicting. Are there subsets of phagocytic pathways that were down and upregulated during the formation of FBGCs?

      • What are the identities of the mononuclear cells in each of the MGC experiment? They appeared to be quite heterogeneous based on the DEGs identified, beyond the common phagocyte signature. Can the authors comment on the difference between the mononuclear cells and whether this will affect the DEG analysis?

      • The authors should also frame/discuss the findings in the context of diagnostic and therapeutic potentials to highlight the clinical significance of this study

      Major comments:

      • As mentioned before, the physiological significance of the findings is unclear. Some form of in vivo data is needed to support some of the key conclusions of the study, e.g validating some of the markers of the pathways identified (common and MGC subtype-specific), and the role of lysosome-mediated iron homeostasis in multinucleation. The authors can make use of the FACs and imaging approaches they developed to look at MGCs in relevant tissues.

      Significance

      Significance:

      • The approach of isolating and comparing different types of MGCs is novel, and the findings certainly improved our understanding of the fusion processes of MGCs. However the physiological role of these processes in health and disease that involve MGCs is still unclear due to the lack of in vivo data. The findings were discussed in quite a bit of detail in the context of current literature, though clinical impact was not explored.

      • My background is bone biology with a very keen interest in osteoclast biology so arguably my knowledge on other MGCs eg LGCs and FBGCs is limited.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, the authors performed a comparative transcriptome analysis of mononuclear and multinuclear human osteoclasts, LGCs and FBGCs. They found that multinucleation triggers a significant downregulation of macrophage identity in all three types of MGCs. Furthermore, RNA-seq data and in-vitro functional analysis of multinucleated cells showed that macrophage cell-cell fusion and multinucleation enhance phagocytosis and contribute to lysosome-dependent intracellular iron homeostasis. Furthermore, multinucleation of osteoclasts promoted mitochondrial activity and oxidative phosphorylation, resulting in maximal respiration. This unique and interesting study addresses the fundamental question of how cell-cell fusion and multinucleation contribute to cellular activity and biological homeostasis.

      Major comments:

      1. The authors generated mature multinucleated cells by stimulating human PBMC-derived macrophages with either IFN-g, IL-4, or RANKL. However, no quantitative data have been presented to determine how effectively IL-4, IFN-g, and RANKL can induce multinucleated giant cells from mononuclear macrophages. Quantitative data showing induction efficiency would provide a more detailed picture of the overall experiment.

      2. The authors mentioned, "The distinct morphological appearance of these three types of MGCs (Figure 1B) suggested cell type-specific functional properties and shared mechanisms underlying macrophage multinucleation". However, there is no discussion or data showing how the nuclear arrangement and intracellular location affect the biological function of multinucleated cells.

      3. Based on the results of DC-stamp knockdown experiments, the authors concluded that cell-cell fusion and multinucleation suppress the mononuclear phagocytic gene signature. However, to strengthen this hypothesis, it would be necessary to provide at least data showing that DC-stamp knockdown reduces the number of multinucleated cells.

      4. In Figure4, the authors showed that transcripts in LGCs were enriched for antigen presentation and adaptive immune system pathways. In addition, multinucleation of LGCs increased the surface expression of B7-H3 (CD276) and colocalized with CD3+ cells, suggesting that LGC multinucleation potentiates T cell activation. However, the authors did not present enough data to demonstrate the antigen-presenting ability of LGCs or their specific T cell activating capacity.

      5. Figure 6 clearly shows that mature multinucleated osteoclasts exhibit increased ATP production and maximal respiration. However, the glycolytic pathway did not differ between mononuclear and multinuclear osteoclasts. No explanation for this observation has been provided. It is easy to understand that osteoclasts acquire ATP through aerobic respiration during multinucleation. But how NADPH, which is essential for its redox reaction, is produced? Is it by acquiring αKG from the glutamine pathway?

      Minor comments:

      1. Supplementary tables 1-6 were not provided.

      2. Figure 2D right panel, difficult to see DAPI+ nuclei.

      Significance

      Although it is well known that multinucleation of cells constantly occurs, especially in osteoclasts, skeletal muscle, and trophoblasts of the placenta, the biological significance of multinucleation and the intracellular functions of multinucleation are not well understood. In this unique study, three types of multinucleated cells were generated from human peripheral blood to elucidate the genetic and functional differences between mononucleated and multinucleated cells. Furthermore, by demonstrating the possibility that the morphological peculiarity of multinucleation can regulate cell function, this paper provides clues to understanding the underlying biology of multinucleated cells and how they maintain cell function in homeostatic and pathological settings.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The authors compared the various multinucleated cells, osteoclasts, LCG and FBGC. Overall, the manuscript shows rigor in the analyses, and also very interesting approaches for retrieving mononuclear cells, for instance using DC-STAMP siRNA. This work adds very much to understanding the biological differences, as summarized in figure 6h. A lot of work in osteoclast field with for instance qPCR is hampered because, inevitably, a mix of mononuclear and multinucleated cells is always measured. Here, a solid attempt to separate those mixes with cell sorting and subsequent analysis on the mononuclear and multinucleated isolates, really adds. Choice of figures is good, also the extra info of the supplementary figures is relevant and makes it easy to read.

      Major and minor concerns:

      For osteoclasts, various markers exist for their biological characterization, for instance the ability to resorb bone. What, apart from the arrangement and number of nuclei, were the biological parameters that confirmed that the cells made by addition of IFN or IL-4 were LCG and FBGC? In fig 2c: did the authors perform stainings with isotype control antibodies? In my experience, quite often, antibodies stain mononuclear cells much intenser, since the cytoplasm is much more condense, less spread over a large area. Resorption assay in 6 is not clear. It is weird that osteoclasts apparently display so limited resorption? Also the traces are not typical for osteoclasts. Please explain. Provide a better image Supplementary 2A, even at 250% the lettering is vague. What do the colours in 2A mean?

      CROSS-CONSULTATION COMMENTS

      I have read the comments of the other two reviewers, and together. I absolutely agree with their additions, Indeed, supplementary tables are lacking, as well as there could be a bit more emphasis on the fact that it is all in vitro work. Together, I think the three of us are complementary in our comments, with good overlap as well. Any effort to stain for instance pathology material with the markers that have been found, would be great, especially for the LGC and the FBGC, that are much less studied in the field of MNGs. Having said that ,I can also live without this addition, but then it could be highlighted in the discussion that these are the future avenues that should be considered. Collaborate with Pathology!

      Significance

      This manuscript is particularly interesting to those who are interested in the BIOLOGY of MNCs. In essence, three types of MNCs were cultured and compared, with each of them a specific function.

      I am an osteoclast expert (76 publications), and have two publications on FBGCs

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This paper provides a detailed step by step protocol of the CUT&RUN technique, which enables high-resolution chromatin mapping and probing, adapted to the malaria parasite Plasmodium falciparum. In particular, Kafsack and colleagues apply the CUT&RUN protocol to infected red blood cells from in-vitro culture and obtain very good quality genome-wide profiles of two histone modifications, H3K4me3 and H3K9me3. The results are congruent with previous ChIP-seq data with a substantial improvement in terms of coverage and chip-to-input noise. The protocol is very detailed and the figures are great.

      Major comments: 1. Authors successfully adapted the CUT&RUN protocol in P. falciparum. First, the binding profiles obtained by CUT&RUN for H3K4me3 and H3K9me3 are very similar to those reported by previous ChIP-seq studies. Secondly, by down-sampling 4X and 16X the test samples, authors demonstrate that 1M PE reads of sequencing depth would be enough to obtain accurate profiling of these histone modifications.

      Despite this data is convincing, only one region in chr. 8 is shown as an example in figures 2, 3 and 4.

      Different regions should be included, at least as supplementary figures, to reinforce their conclusions.

      __Response: __We chose that locus on chromosome 8 to provide a gene-level resolution view at a locus encompassing genes in both eu- and heterochromatin states. We have now included Supplementary Figure 1, which shows these tracks for full length chromosomes 4 and 7. Additionally, genome-wide enrichment tracks for all data sets in this study are available for download at NCBI Gene Expression Omnibus under accession number GSE210062.

      Related to this, there is evidence of the impact of chromatin structure on ChIP-seq analysis. Specifically, heterochromatin is typically depleted in ChIP input controls because of technical and experimental issues and this can result in a false enrichment of heterochromatic regions in the tested sample. How represented is heterochromatin (i.e. sub-telomeric and telomeric regions) in the test and control samples using the cut&run protocol?

      __Response: __The reviewer is correct that chromatin structure may alter accessibility which may bias absolute measurements but since the accessibility biases based to chromatin-structure are identical for both the histone PTM-specific antibody and the isotype control and cancel out in the enrichment score.

      How biased is the cut&run sample compare to the ChIP-seq sample?

      __Response: __We have included this in Supplementary Figure 1. The H3K4me3 and H3K9me3 enrichment scores are strongly correlated both between CUT&RUN replicates and between CUT&RUN and previously published ChIP-seq results.

      In this sense, it would be desirable if authors provide more information about the quality analysis results, for example the chip to input signal ratio and the coverage for heterochromatic (telomeric, centromeric and subtelomeric) regions.

      __Response: __We agree that this would be of interest to the reader. For this reason, the full genome-wide enrichment tracks were made available for all datasets in this study. We have added language to draw further attention to this availability.

      Additionally, loci typically biased in ChIP-seq samples, i.e. clonally variant gene families in sub-telomeric regions, should be shown as examples.

      __Response: __We chose that locus on chromosome 8 to provide a gene-level resolution view at a locus encompassing genes in both eu- and heterochromatin states, including 2 var genes (PF3D7_0808600 and PF3D7_0808700) and two rifin genes (PF3D7_0808800 and PF3D7_0808900). We have now included Supplementary Figure 1, which shows these tracks for full length chromosomes 4 and 7, which also include subtelomeric and non-subtelomeric heterochromatin loci containing these genes. Additionally, genome-wide enrichment tracks for all data sets in this study are available for download at NCBI Gene Expression Omnibus under accession number GSE210062.

      1. For P. falciparum WGS a PCR-free library preparation is strongly recommended. We wonder if it would be possible to try to integrate this step in their CUT&RUN protocol.

      __Response: __Since such biases are sequence dependent, they would impact raw coverage but cancel out in the enrichment plots since the sequence-based biases are identical in both samples. While PCR-free amplification may be desirable for some applications, we feel this is outside the scope of this study to implement these changes.

      It would have been desirable to have tried the CUT&RUN protocol on other type of proteins, different to hPTMs which are highly abundant, for example one of the Pf Api-AP2 transcription factors. Assaying the CUT&RUN protocol on a different type of protein shouldn't be cost/time consuming and would provide evidence of the versatility of the approach.

      Response: As indicated by the title, this protocol was optimized specifically for profiling of histone modifications. CUT&RUN has been used in other systems to profile genome-wide binding of other proteins but this was not our aim and outside the scope of this study.

      1. The step by step protocol is very detailed, however there are some parts that need to be better explained:

      In the section "Binding cells to Concanavalin A-coated beads": it's not mentioned the harvest time and the stage of the parasites used. In addition, several methods are proposed for iRBCs enrichment, but is not mentioned which method was used and the life stage of the parasites. In this part of the protocol authors state "resuspend cells containing 1-5x107 nuclei to a cell density to 1x107 cells/mL".

      According to our calculations, to guarantee this nuclei number it would be necessary to enrich in iRBCs and late stages. Otherwise the red blood cells density should be much larger. Could you please clarify this point?

      Response: For this study we enriched for trophozoites using a percoll/sorbitol density gradient, which we have now clarified in step 8. However, whether and which enrichment strategy is employed will vary based on the desired parasite stages and experimental design.

      In the section "P. falciparum culturing and synchronization of erythrocytic stages" the authors indicate that the method used for synchronization was double-synchronization with sorbitol treatment to achieve a {plus minus} 6 h synchrony. The details provided appear insufficient to replicate the procedure. E.g. it's not explained how the double step synchronization was performed and for how long the culture was incubated after the synchronization.

      The number of parasite cells and the life-stage used is mentioned at the end (in the section of expected outcomes). It would be more useful if this information is specified at the beginning together with the most appropriate procedure to get an iRBC culture well synchronised and enriched in late stages.

      Response: The stage, synchrony and growth conditions are determined by the scientific question the experimenter is asking, not by the assay. For this reason, we provide the number of infected erythrocytes and nuclei used in our studies so that other experimenters can aim for similar numbers regardless of the stage and synchrony. For this study we used asexual blood-stages at 36±4 hp.i. We have clarified this in step 11.

      • With regards to reproducibility, all experiments were done in replicate (3 Rs) and the statistics appear adequate.

      Minor comments: - Abstract. A closing bracket is missing.

      Response: Corrected - Step 11: Split each sample into 1mL aliquots at ?

      Response: Corrected

      • The affinity of proteins A/G to IgG antibodies varies based on host species and IgG subtype (see link). This link does not seem to work

      Response: Corrected

      • Low bind tubes are mentioned several times. Please clarify whether it refers to low bind protein or low bind DNA. Step 77.

      Response: The vendor and catalog number for the low-bind tubes are specified in the Reagents, Materials & Equipment list.

      Which was the desired sequencing depth per library? It could be mentioned here. It is mentioned later in "Quantification and statistical analysis" that the initial desired depth was of 40M read pairs, but what was the real depth obtained? from the Figure 4 seems to be less than 17M read pairs per sample.

      Response: Thank you for catching that error. The target was 10M read pairs per library but since CUT&RUN is so specific the isotype controls release less DNA and the resulting libraries produces fewer clusters than aimed for leading to slight over sequencing of the remaining samples.

      • Step 79. Please clarify/justify why 50 bp paired-end reads were chosen as sequence length. Response: After excluding the telomere repeats 100 bp (50+50) are sufficient for uniquely mapping 98.3% of the nuclear. Paired-end sequencing was chosen over single-end because it provides the actual size of each fragment.

      In the section "Quantification and statistical analysis", references to Figure 3 and 4 are inverted or do not correspond with the actual figures 3 and 4.

      Response: Corrected

      • Figure 2. Among the replicates, sample 2 seems to have higher background, could you comment why? Response: It is inherent in biological replicates that one would have the greatest amount of noise but we unfortunately have no further insight into why Sample 2 had a higher elevated background that Samples 1 and 3. Furthermore, even with this slightly higher background the relative enrichment of signal to noise ratio enrichment peaks are readily identifiable.

      • Below some suggestions that may help the authors improve the presentation of their data and conclusions:

      The limitations and potential shortcomings of the protocol are mentioned along the text (e.g. the use of different antibodies, different targets, weak interactions..), but could be good if they are included in a different section, preferably at the end.

      Response: A “Limitations” section was added.

      Also in this section it would be good if they develop further (or at least speculate) on the differences in the protocol or things to consider if other type of proteins are assayed (i.e. TFs).

      Response: As mentioned above, we have not applied to CUT&RUN to profile chromatin other than Histone PTMs, as this was not the aim of our study. Since chromatin-bound histones always occur within a nucleosomal context, we are hesitant to make claims to the utility of this specific protocol for profiling DNA-binding proteins with smaller DNA-binding footprints. That said, CUT&RUN has been used to great success in other systems to profile a wide range of chromatin-bound proteins. We have included mention of this at the end of the introduction.

      Authors should better comment on the potential impact of chromatin structure and DNA sequence (i.e. AT richness) on the biased representation of heterochromatic regions in the data, the level of background and the peak calling analysis.

      Response: For the enrichment scores, sequence and accessibility biases cancel out since they are the same for both the PTM-specific antibody and the isotype controls.

      The coverage of critical loci, like those belonging to clonally variant gene families, should be calculated and examples of tracks included as supplemental figures.

      Response: Gene Expression Omnibus under accession number GSE210062 as indicated in the Quantification & Statistical Analysis and data availability sections.

      Authors claim that the CUT&RUN protocol has exceptionally low background and has been successfully used to profile chromatin interactions from very small numbers of cells. But it is not specified how many. That is, which is the standard in other fields and how it compares with the number of cells used here.

      Response: As stated in the note following step 10, we did not optimize the minimum number of parasites required in this study since at the 1e7 iRBC required for each sample correspond as little as 1mL of bloodstage culture 2% parasitemia and 5% hematocrit. The down-sampling analysis in figure 3 suggests that the number of input cells can likely be reduced at least 10-fold.

      Information about synchronisation, estimation of iRBCs density and nuclear content appears insufficiently described and has been fragmented in different sections so it is difficult to replicate. For example, within the section "Binding cells to Concanavalin A-coated beads" different alternative protocols for iRBCs synchronisation and enrichment are mentioned but it is not clear whether authors actually perform that step. It could be convenient to describe it and include it in the step-by-step protocol.

      Response: The stage, synchrony and growth conditions are determined by the scientific question the experimenter is asking, not by the assay. For this reason, we provide the number of infected erythrocytes and nuclei used in our studies so that other experimenters can aim for similar numbers regardless of the stage and synchrony. For this study we used asexual blood-stages at 36±4 hp.i. We have clarified this in step 11.

      Significance (Required) The CUT&RUN is a novel technique to profile chromatin modifications genome-wide that has been successfully adapted to P. falciparum by the authors. This technique overcomes important limitations of the traditional ChIP-seq and provides better quality data. First, fewer cells and lower sequencing depths are required which is fundamental for the analysis of certain parasite life stages. Second, the binding step is carried out in-situ using unfixed and intact cells. This allows to avoid crosslinking, which can interfere with target recognition that results in unspecific background, and also avoids the random fragmentation of the chromatin, that can bias in the analysis.

      This work is significant since it represents the first CUT&RUN step by step protocol adapted to P. falciparum. The results are important for researchers from the malaria field and parasitologists in general who could eventually leverage this protocol to other Apicomplexa.

      Our expertise is on transcriptional regulation, molecular parasitology, genomics and epigenomics, of malaria parasites. We hope the comments above will help the authors to improve the ms. Congratulations on the work.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): ____ In general the paper is very clear and convincing and I have only minor comments for the authors to address.

      Introduction The authors state that 'crosslinking presents another challenge as it can interfere with antibody recognition.' Would it be possible to provide a reference to strengthen this statement? Response: Additional references (Baranello et al, O’Neill et al) were added.

      In the third paragraph the authors mention that CUT&RUN can be used to profile chromatin interactions from very small numbers of cells. This argument would be strengthened by adding references or examples from mammalian systems, and the authors might mention that the slight modification CUT&TAG has been employed for single cell sequencing. https://doi.org/10.1038/s41587-021-00865-z.

      Response: The reference was added.

      Figure 2 A label of Relative fold enrichment should be added to the y axis. This applies also to Figure 4. In the legend, it isn't entirely clear from what control the fold enrichment is being generated. Based on the other figures I assume it's the isotype control, and it would be helpful to state that in the legend.

      Response: Thank you for the suggestion. We have made these changes.

      Figure 3 An HP1 ChIP is included but there is no track for HP1 using CUT&RUN. It isn't entirely clear to me why HP1 is included; is it to make the point that it overlaps with H3K9me3? There is a sentence at the end of the Quantifcation and statistical analysis section that indicates that an HP1 CUT&RUN experiment was performed ('Representative tracks of H3K9me3, H3K4me3, and HP1 produced using CUT&RUN or ChIPseq at chromosome 8 of P. faliciparum are shown in Figure 4), but I don't see an HP1 track for Figure 4 and I don't see CUT&RUN HP1 tracks on Figure 3.

      Response: Correct, no HP1 CUT&RUN was performed, we are just trying to show that H3K9me3 CUT&RUN recapitulates ChIP-seq of both H3K9me3 and HP1, which binds to H3K9me3.

      Figure 4 The downsampling of reads is a nice demonstration that low numbers of reads are required for the CUT&RUN technique. It might be helpful to include downsampling of ChIP-seq reads within this figure to compare the two techniques more directly.

      Response: The reason we included the down-sampling of CUT&RUN sequence reads was to explore whether were over-sequencing our CUT&RUN libraries not to provide a comparison to ChIP-seq. For simplicity we have therefore kept the figure as is.

      DNA purification by Phenol/Chloroform extraction In step 38, I noticed that RNAse was not added at this step, as described in the original paper by Skene et al. Can the authors make a brief note about why they omit this reagent?

      Response: RNAse A is already present since it was included in the STOP buffer at step 35.

      The authors mark the TE buffer in bold, but I don't see a description of its makeup in the buffer section, though possibly I missed it. While this is a pretty standard buffer, it might still be nice to include it for completeness.

      Response: TE buffer recipe was added.

      Clean-up of PCR amplified library Between step 70 and 71 the authors include a warning to not discard the beads. However, this warning is not included in the Post-ligation Clean-up, which involves much the same procedure. Response: Corrected

      Typos and writing It might be helpful to define CUT&RUN in the abstract by spelling out the acronym there. Response: It is defined in the 3rd paragraph of the introduction

      I've mostly seen ChIP-seq with a dash between the IP and the seq.

      Response: Corrected

      Powerful is used twice in consecutive sentences in the first paragraph of the introduction. Consider substituting the words 'important tool' for 'powerful tool.' Response: Corrected

      Figure 2 legend. “(purple) in of three biological replicates” should be “(purple) in three biological replicates” Response: Corrected

      Figure 3 legend Last sentence should include 'to' between the words 'shown' and 'the'. Response: Corrected

      Post-ligation Clean-up “Wash twice with 200µl of 80% Ethanol freshly prepared” should be “Wash twice with 200µl of freshly prepared 80% Ethanol” Response: Corrected

      Library PCR amplification “Fragments are PCR amplified using Kapa polymerase, which it is more efficient” should be “Fragments are PCR amplified using Kapa polymerase, which is more efficient” Response: Corrected

      Figure 7 legend “Indicated in the tope left panel” Should be “Indicated in the top left panel” Response: Corrected

      Expected outcomes “to dismiss any sort of contamination” should be “To dismiss contamination” Response: Corrected

      Potential Solution After the sentence 'Incubation buffer is added.' The next letter 'i' should be capitalized in the word Isolate. Response: Corrected

      CROSS-CONSULTATION COMMENTS Plasmodium is not my model organism, so I'd defer to Reviewer 2 on the comments regarding additional detail for synchronization and Plasmodium culture conditions. I have nothing further to add, and I'm excited to see what experiments come from the addition of this technique to the parasite field.

      Reviewer #3 (Significance (Required)):

      This excellent methods paper describes a detailed protocol for the adaptation of the Cleavage Under Targets & Release Using Nuclease (CUT&RUN) technique to Plasmodium falciparum, the causative agent of malaria. CUT&RUN is an alternative to ChIP-seq, and has the advantage that it does not require crosslinking of targets, which can introduce artifacts and cause issues with antibody recognition. CUT&RUN can also be performed with low numbers of cells and has an excellent signal to noise ratio, which the authors demonstrate by downsampling the number of reads used in their analysis. The authors also clearly demonstrate that profiling of histone modifications using CUT&RUN yields comparable results to ChIP-seq. Because it can be difficult to obtain large numbers of cells from Plasmodium cultures, CUT&RUN is especially useful in this important model system. Publication of a detailed protocol will help other Plasmodium researchers answer important questions regarding genomic localization for their targets of interest.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In general the paper is very clear and convincing and I have only minor comments for the authors to address.

      Introduction

      The authors state that 'crosslinking presents another challenge as it can interfere with antibody recognition.' Would it be possible to provide a reference to strengthen this statement?

      In the third paragraph the authors mention that CUT&RUN can be used to profile chromatin interactions from very small numbers of cells. This argument would be strengthened by adding references or examples from mammalian systems, and the authors might mention that the slight modification CUT&TAG has been employed for single cell sequencing. https://doi.org/10.1038/s41587-021-00865-z.

      Figure 2

      A label of Relative fold enrichment should be added to the y axis. This applies also to Figure 4. In the legend, it isn't entirely clear from what control the fold enrichment is being generated. Based on the other figures I assume it's the isotype control, and it would be helpful to state that in the legend.

      Figure 3

      An HP1 ChIP is included but there is no track for HP1 using CUT&RUN. It isn't entirely clear to me why HP1 is included; is it to make the point that it overlaps with H3K9me3? There is a sentence at the end of the Quantifcation and statistical analysis section that indicates that an HP1 CUT&RUN experiment was performed ('Representative tracks of H3K9me3, H3K4me3, and HP1 produced using CUT&RUN or ChIPseq at chromosome 8 of P. faliciparum are shown in Figure 4), but I don't see an HP1 track for Figure 4 and I don't see CUT&RUN HP1 tracks on Figure 3.

      Figure 4

      The downsampling of reads is a nice demonstration that low numbers of reads are required for the CUT&RUN technique. It might be helpful to include downsampling of ChIP-seq reads within this figure to compare the two techniques more directly.

      DNA purification by Phenol/Chloroform extraction In step 38, I noticed that RNAse was not added at this step, as described in the original paper by Skene et al. Can the authors make a brief note about why they omit this reagent?

      The authors mark the TE buffer in bold, but I don't see a description of its makeup in the buffer section, though possibly I missed it. While this is a pretty standard buffer, it might still be nice to include it for completeness.

      Clean-up of PCR amplified library

      Between step 70 and 71 the authors include a warning to not discard the beads. However, this warning is not included in the Post-ligation Clean-up, which involves much the same procedure.

      Typos and writing

      It might be helpful to define CUT&RUN in the abstract by spelling out the acronym there.

      I've mostly seen ChIP-seq with a dash between the IP and the seq.

      Powerful is used twice in consecutive sentences in the first paragraph of the introduction. Consider substituting the words 'important tool' for 'powerful tool.'

      Figure 2 legend.

      (purple) in of three biological replicates

      Should be

      (purple) in three biological replicates

      Figure 3 legend Last sentence should include 'to' between the words 'shown' and 'the'.

      Post-ligation Clean-up Wash twice with 200µl of 80% Ethanol freshly prepared

      Should be

      Wash twice with 200µl of freshly prepared 80% Ethanol

      Library PCR amplification Fragments are PCR amplified using Kapa polymerase, which it is more efficient

      Should be

      Fragments are PCR amplified using Kapa polymerase, which is more efficient

      Figure 7 legend Indicated in the tope left panel

      Should be

      Indicated in the top left panel

      Expected outcomes to dismiss any sort of contamination

      Should be

      To dismiss contamination

      Potential Solution After the sentence 'Incubation buffer is added.' The next letter 'i' should be capitalized in the word Isolate.

      CROSS-CONSULTATION COMMENTS

      Plasmodium is not my model organism, so I'd defer to Reviewer 2 on the comments regarding additional detail for synchronization and Plasmodium culture conditions. I have nothing further to add, and I'm excited to see what experiments come from the addition of this technique to the parasite field.

      Significance

      This excellent methods paper describes a detailed protocol for the adaptation of the Cleavage Under Targets & Release Using Nuclease (CUT&RUN) technique to Plasmodium falciparum, the causative agent of malaria. CUT&RUN is an alternative to ChIP-seq, and has the advantage that it does not require crosslinking of targets, which can introduce artifacts and cause issues with antibody recognition. CUT&RUN can also be performed with low numbers of cells and has an excellent signal to noise ratio, which the authors demonstrate by downsampling the number of reads used in their analysis. The authors also clearly demonstrate that profiling of histone modifications using CUT&RUN yields comparable results to ChIP-seq. Because it can be difficult to obtain large numbers of cells from Plasmodium cultures, CUT&RUN is especially useful in this important model system. Publication of a detailed protocol will help other Plasmodium researchers answer important questions regarding genomic localization for their targets of interest.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This paper provides a detailed step by step protocol of the CUT&RUN technique, which enables high-resolution chromatin mapping and probing, adapted to the malaria parasite Plasmodium falciparum. In particular, Kafsack and colleagues apply the CUT&RUN protocol to infected red blood cells from in-vitro culture and obtain very good quality genome-wide profiles of two histone modifications, H3K4me3 and H3K9me3. The results are congruent with previous ChIP-seq data with a substantial improvement in terms of coverage and chip-to-input noise. The protocol is very detailed and the figures are great.

      Major comments:

      • Authors successfully adapted the CUT&RUN protocol in P. falciparum. First, the binding profiles obtained by CUT&RUN for H3K4me3 and H3K9me3 are very similar to those reported by previous ChIP-seq studies. Secondly, by down-sampling 4X and 16X the test samples, authors demonstrate that 1M PE reads of sequencing depth would be enough to obtain accurate profiling of these histone modifications. Despite this data is convincing, only one region in chr. 8 is shown as an example in figures 2, 3 and 4. Different regions should be included, at least as supplementary figures, to reinforce their conclusions. Related to this, there is evidence of the impact of chromatin structure on ChIP-seq analysis. Specifically heterochromatin is typically depleted in ChIP input controls because of technical and experimental issues and this can result in a false enrichment of heterochromatic regions in the tested sample. How represented is heterochromatin (i.e. sub-telomeric and telomeric regions) in the test and control samples using the cut&run protocol? How biased is the cut&run sample compare to the ChIP-seq sample? In this sense, it would be desirable if authors provide more information about the quality analysis results, for example the chip to input signal ratio and the coverage for heterochromatic (telomeric, centromeric and subtelomeric) regions. Additionally, loci typically biased in ChIP-seq samples, i.e. clonally variant gene families in sub-telomeric regions, should be shown as examples.

      • For P. falciparum WGS a PCR-free library preparation is strongly recommended. We wonder if it would be possible to try to integrate this step in their CUT&RUN protocol. It would have been desirable to have tried the CUT&RUN protocol on other type of proteins, different to hPTMs which are highly abundant, for example one of the Pf Api-AP2 transcription factors. Assaying the CUT&RUN protocol on a different type of protein shouldn't be cost/time consuming and would provide evidence of the versatility of the approach.

      • The step by step protocol is very detailed, however there are some parts that need to be better explained: In the section "Binding cells to Concanavalin A-coated beads": it's not mentioned the harvest time and the stage of the parasites used. In addition, several methods are proposed for iRBCs enrichment, but is not mentioned which method was used and the life stage of the parasites. In this part of the protocol authors state "resuspend cells containing 1-5x107 nuclei to a cell density to 1x107 cells/mL". According to our calculations, to guarantee this nuclei number it would be necessary to enrich in iRBCs and late stages. Otherwise the red blood cells density should be much larger. Could you please clarify this point? In the section "P. falciparum culturing and synchronization of erythrocytic stages" the authors indicate that the method used for synchronization was double-synchronization with sorbitol treatment to achieve a {plus minus} 6 h synchrony. The details provided appear insufficient to replicate the procedure. E.g. it's not explained how the double step synchronization was performed and for how long the culture was incubated after the synchronization. The number of parasite cells and the life-stage used is mentioned at the end (in the section of expected outcomes). It would be more useful if this information is specified at the beginning together with the most appropriate procedure to get an iRBC culture well synchronised and enriched in late stages.

      • With regards to reproducibility, all experiments were done in replicate (3 Rs) and the statistics appear adequate.

      Minor comments:

      • Abstract. A closing bracket is missing.

      • Step 11: Split each sample into 1mL aliquots at ?

      • The affinity of proteins A/G to IgG antibodies varies based on host species and IgG subtype (see link). This link does not seem to work

      • Low bind tubes are mentioned several times. Please clarify whether it refers to low bind protein or low bind DNA. Step 77. Which was the desired sequencing depth per library? It could be mentioned here. It is mentioned later in "Quantification and statistical analysis" that the initial desired depth was of 40M read pairs, but what was the real depth obtained? from the Figure 4 seems to be less than 17M read pairs per sample.

      • Step 79. Please clarify/justify why 50 bp paired-end reads were chosen as sequence length. In the section "Quantification and statistical analysis", references to Figure 3 and 4 are inverted or do not correspond with the actual figures 3 and 4.

      • Figure 2. Among the replicates, sample 2 seems to have higher background, could you comment why?

      • Below some suggestions that may help the authors improve the presentation of their data and conclusions: The limitations and potential shortcomings of the protocol are mentioned along the text (e.g. the use of different antibodies, different targets, weak interactions..), but could be good if they are included in a different section, preferably at the end. Also in this section it would be good if they develop further (or at least speculate) on the differences in the protocol or things to consider if other type of proteins are assayed (i.e. TFs). Authors should better comment on the potential impact of chromatin structure and DNA sequence (i.e. AT richness) on the biased representation of heterochromatic regions in the data, the level of background and the peak calling analysis. The coverage of critical loci, like those belonging to clonally variant gene families, should be calculated and examples of tracks included as supplemental figures. Authors claim that the CUT&RUN protocol has exceptionally low background and has been successfully used to profile chromatin interactions from very small numbers of cells. But it is not specified how many. That is, which is the standard in other fields and how it compares with the number of cells used here. Information about synchronisation, estimation of iRBCs density and nuclear content appears insufficiently described and has been fragmented in different sections so it is difficult to replicate. For example, within the section "Binding cells to Concanavalin A-coated beads" different alternative protocols for iRBCs synchronisation and enrichment are mentioned but it is not clear whether authors actually perform that step. It could be convenient to describe it and include it in the step-by-step protocol.

      Significance

      The CUT&RUN is a novel technique to profile chromatin modifications genome-wide that has been successfully adapted to P. falciparum by the authors. This technique overcomes important limitations of the traditional ChIP-seq and provides better quality data. First, fewer cells and lower sequencing depths are required which is fundamental for the analysis of certain parasite life stages. Second, the binding step is carried out in-situ using unfixed and intact cells. This allows to avoid crosslinking, which can interfere with target recognition that results in unspecific background, and also avoids the random fragmentation of the chromatin, that can bias in the analysis.

      This work is significant since it represents the first CUT&RUN step by step protocol adapted to P. falciparum. The results are important for researchers from the malaria field and parasitologists in general who could eventually leverage this protocol to other Apicomplexa.

      Our expertise is on transcriptional regulation, molecular parasitology, genomics and epigenomics, of malaria parasites. We hope the comments above will help the authors to improve the ms. Congratulations on the work.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      This reviewer did not leave any comments

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): *In the manuscript " Functional peroxisomes are required for heat shock-induced hormesis in Caenorhabditis elegans" the authors show that a hormetic heat shock requires the peroxidase catalase ctl-2 for longevity and thermotolerance. Furthermore the authors characterize the hormetic stress response in ctl-2 mutants and show that ctl-2 is required for the proper formation of HSF-1 stress granules, and that ctl-2 mutants have a changed transcriptional response to heat shock and a changed induction of heat-stress induced oxidative stress. The authors go on to characterize peroxisomes, and peroxidase-regulated fatty acids, as well as mitochondria in both ctl-2 mutants and upon heat shock. The authors conclude that functional peroxisomes play an important role in the hormetic heat stress response.

      This is an interesting study, that demonstrates that the loss of the peroxidase catalase ctl-2 plays an important role in the heat stress response. The authors need to provide more details on experimental repeats and experiments strengthening the conclusion that peroxisomes are generally important for the hormetic stress response would also improve this manuscript.

      Furthermore, some textual changes would clarify some specifics in this manuscript.*

      **Major issues:** *1. Throughout the manuscript it is unclear how many times the experiments were conducted (e.g. Lifespan, thermotolerance) or how many biological replicates were used (qRT-PCR, peroxisome characterization etc). This critical information should be included. A table with the individual lifespan experiments and thermotolerance experiments is expected. *

      We are grateful to the Reviewer for drawing our attention to this and we have now included this information throughout the manuscript.

      • The authors are characterizing the ctl-2 mutant upon heat stress and find some compelling differences to wild-type animals. The authors state "We measured the expression levels of two peroxisomal transport proteins, PRX-5 and PRX-11, and found no differences in expression between WT and Δctl-2 strains, suggesting that morphogenesis should not be impaired and that proteins needed for their proper function should be present, with the exception of ctl-2 in the Δctl-2 strain (Figure 3A)." The authors however conclude that functional peroxisomes are required for the benefits of a hormetic HS. What the authors are in fact demonstrating is that ctl-2 specifically is required for the hormetic HS response. To demonstrate that peroxisomes in general are required they should disrupt peroxisome function by additional means. What the authors also demonstrate is that heat shock has an effect on peroxisomes. These differences need to be clarified. It is also not fully clear whether peroxisomes are functional in ctl-2 mutants? Further peroxisome-relevant enzymes should be tested for levels and functionality in ctl-2 mutants and upon HS. *

      This is correct and we have now addressed the above-mentioned issues throughout the manuscript. As the Reviewer noted, the peroxisomes are functional if monitored as the transcript levels of two peroxisomal transport proteins, implying that they have the potential to import their components. On the other hand, they are lacking the catalase, so they do not have a complete set of their components. Our phrasing in the original manuscript indeed was not optimal and we have now modified the conclusions, stating the peroxisomal catalase is important for the proper heat shock response.

      Moreover, it is partially correct to say that a thorough explanation is missing as to what heat shock does to peroxisomes. We would like to emphasize that we measured the transcript levels of three enzymes from the very long chain fatty acid oxidation: straight-chain acyl-CoA oxidase, ACOX-1, MAO-C-like dehydratase domain protein, MAOC-1, and propanoyl-CoA C-acyltransferase, DAF-22. We have now measured the transcript levels of several other peroxisomal enzymes in the WT and the ctl-2 mutant under optimal conditions and heat shock and reported about it in the manuscript.

      • The TORC1 experiments are too indirect. The authors should perform WesternBlots with a S6Kinase -Phospho antibody to determine whether TORC1 is inhibited or not. Alternatively, the authors may choose to remove this experiment from the manuscript without disrupting the main message of the manuscript, since no link between TORC1 and peroxisome function nor with HS has been established.

      *

      We are aware that the classical experiment to measure the activity of TORC1 is the Western Blot based assay to observe the phosphorylation of S6 kinase. However, we find the available antibodies not good enough to perform this experiment. We, therefore, conducted a different experiment to estimate the TORC1 activity, and it is the one described in our manuscript. We suggest that, as the phosphorylation of S6 is a downstream readout of TORC1 activity, so is the transcript level of the group of genes we measured, as reported previously (Das, Melo et al. 2017; Kenyon 2010; Kenyon 2011; Robida-Stubbs, Glover-Cutter et al. 2012; from the manuscript). While we are aware of the shortcomings of the assay we used, we find it valuable to report that the heat shock has an inhibitory effect on the TORC1 activity in WT C. elegans but not in the peroxisomal mutant. We would like to ask the Reviewer to reconsider this comment.

      **Textual changes:** *1. The authors should adhere to C. elegans nomenclature convention (at least once), by referring to the ctl-2 deletion with the specific allele name [ctl-2(xx)] (unlike in yeast, ∆ is not commonly used). Furthermore a more thorough description of the specific allele (complete gene deletion? Point mutation? Truncation?) should be included. *

      We have now taken care of this issue.

      • Throughout the manuscript the authors should refer back to their specific hormetic heat shock paradigm, since several studies (also cited here) have shown differences in the specific physiological changes (e.g. HS on day 1 of adulthood has profound effects on broodsize, whereas the authors show here that a HS at L4 does not, other differences in terms of activation of other stress reporters has also been reported). *

      Since our data regarding brood size were not put into context of the present study and were not studied in-depth, we have decided to remove these results from the present manuscript.

      • The abbreviation OGT (original growth temperature??) should be defined *

      OGT stands for optimal growth temperature and we have now defined it in the manuscript.

      • The authors should change the sentence "The binding of HSF-1 to the HSEs can be visualized by a HSF-1::GFP fusion protein". This is not technically true. While HS has been shown to induce HSF-1 stress granules and these stress granules have been indirectly shown to correlate with transcription, it has however not been demonstrated that HSF-1 is bound to HSEs or whether it is bound to other DNA stretches or what is in fact transcribed by HSF-1 when it localizes to these foci. *

      We are grateful to the Reviewer for this clarification, which we have included into the manuscript.

      • The authors describe the morphological changes of peroxisomes upon HS, however they missed the opportunity to fully interpret these results in the discussion. Again, their conclusion that peroxisomes may be impaired in ctl-2 mutants is very vague. *

      We agree with this comment and we have now taken care of this issue by introducing additional clarifications.


      Reviewer #1 (Significance (Required)):

      *Hormesis and the role of peroxisomes in stress responses is an important and interesting topic. *

      The role of peroxisomes in stress responses has not been addressed.

      *Researchers with interest in stress responses will be interested in this work.

      My expertise lies in C.elegans stress and longevity with a specific focus on hormetic mechanisms.*

      **Referee Cross-commenting**

      While I agree with most comments from the other referees, I don't believe it is feasible to ask the authors to generate c. Elegans cell cultures for any follow up experiments. I would be satisfied with a more thorough comparison of the HS response between WT and Ctl-2 mutants, I.e: compare preconditiöning: 1h and 4h HS at L4 and d1 and then do thermotolerance experiments

      We thank the Reviewer for this comment. We agree that to generate C. elegans cell cultures is not feasible. We have, however, performed the HS treatments in mammalian cells in culture.

      HEK cells were transfected with Lipofectamine 3000 for 48h with either negative siRNA control (siCont), siPEX5 or siCAT at 100 nM, according to the manufacturer’s protocol. For HSR induction, cells were moved for 1h to a humidified incubator containing 5% CO2 at 42,5ºC, or maintained at 37ºC. After, RNA was immediately isolated with NucleoSpin RNA Columns for RNA purification (Macherey-Nagel), and 1000 ng of total RNA was converted to cDNA with iScriptTM cDNA Synthesis Kit (Biorad). Relative gene expression of PEX5, catalase (CAT), HSP90AA1, HSP70 (Hspa1b) and small HSPs (Hspb1, Hsph1 and Hspe1) were determined by qPCR, which were performed in a CFX Opus 384 Real-Time PCR System (Biorad) in a final volume of 5 µL using 2× SYBR Green PCR master mix and 0.3 μM of each primer pair. Relative gene expressions were calculated according to the Pfaffl method, with the results being normalized to the three most stable housekeeping genes in this experimental setting (HPRT, B2M and RPL7), as determined with NormFinder. The Ct averages of the four siCont samples at 37ºC were used as the calibrators to determine ΔCt for each gene. Data comparisons were conducted with two-way analysis of variance (ANOVA) followed by Bonferroni post hoc tests and data is presented as mean ± SEM (Figures 1 and 2).

      Although the siRNA-mediated silencing of CAT and PEX5 was successful (93% and 60% respectively), as determined by qPCR (Figure 1), the expression of several HSPs (HSP70, HSP90 and the small HSPs Hspb1, Hsph1 and Hspe1) remained similar following HS between either CAT- or PEX5- silenced cells and siCont-transfected cells (Figure 2). Preliminary data from SH-SY5Y cells treated as described above yielded similar results (data not shown). These observations may suggest that catalase and/or functional peroxisomes are not necessary for HSR induction and hormesis, but these experiments were limited to two cell lines and one form of HS (1h at 42,5ºC). Therefore, it is possible that different protocols may yet unravel a still undescribed link between peroxisomes, HSR, and life extension, but we to pursue this will take a long time and would be far outside the scope of this manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): **Summary:** In the manuscript entitled "Functional peroxisomes are required for heat shock-induced hormesis in Caenorhabditis elegans", Musa et al. explored the role of functional peroxisomes in the heat shock (HS)-induced hormesis in C. elegans using peroxisomal catalase ctl-2 deletion mutants. They showed that ctl-2 deletion abolished HS-induced longevity and suppressed the HS-induced upregulation of small heat shock proteins and HS-induced HSF-1 nuclear accumulation. Furthermore, the authors analyzed the differences in HS-induced phenotypes between wild type animals and ctl-2 mutants: the activation of the antioxidant response, the pentose phosphate pathway, and increased triglyceride content, which are the most prominent changes observed during heat shock in wild-type animals. However, there is only weak evidence supporting their claim that functional peroxisomes are required for HS-induced hormesis. * **Major comments:** 1. The authors conclude that the functional peroxisomes are essential for the HS-induced hormesis based only on one observation that the transient heat shock treatment did not increase the lifespan in the short-lived ctl-2 mutants. *

      This is only partially correct. In addition to the absence of lifespan extension in the ctl-2 mutants, our conclusion was based also on the lower extent of the heat shock protein expression during HS in ctl-2 mutant. In the revision, we have reinforced this conclusion by adding the measurements of the kinetics of the heat shock response activation in the WT and the ctl-2 mutant, whereby the possible differences in the kinetics between the strains have been excluded as a possible interpretation. The results are now reported in the manuscript.

      *Because the ctl-2 mutants are short-lived, the author should have carefully examined the relevant role of functional peroxisomes in hormesis response by conducting lifespan measurements using ctl-2 RNAi-treated animals or mutants deficient in other genes essential for peroxisome function. *

      • *

      We entirely agree with this comment. Since we are unable to perform such an elaborate study in this moment, and due to other similar comments from other Reviewers, we have decided to modify the conclusion, pointing out that it is the lack of the peroxisomal catalase that is associated to the lower extent of the heat shock response activation.

      • The authors examined the difference between wild-type animals and ctl-2 mutants in many aspects; however, there is no rational explanation or examination that their observations have a role in HS-induced hormesis. *

      We acknowledge that we are lacking the mechanistic insight into the reported phenomenon. This will be the focus of future studies, while in our present study, we have modified the conclusions to match the presented results.

      • There is a severe shortage in the explanation of their experiments, especially for the methods of the experiments and statistical procedures. The authors should clarify the details of the experiments with proper statistical analyses. Follows are the points to be clarified for Figure 1 as insufficiency examples in the explanation in their study.

      *

      We have now taken care of this issue throughout the manuscript.

      *Fig. 1A, B: Is this the representative survival curve of several experiments? The authors should clarify this point. The authors should include a table with the statistical numbers: the number of animals they used, the number of animals they measure in each experiment, and the p-value. *

      • *

      We have now clarified this issue in the manuscript.

      *Fig. 1C: Is this a representative brood size of one animal, the mean brood size of some animals of one representative experiment, or the mean brood sizes from some experiments? The authors should clarify this point. Also, the authors should perform statistical analysis. *

      We have removed the results related to the brood size from the present version of the manuscript.

      *Fig. 1D-F: The authors should state the number of experiments they conducted. The author should perform multiple comparison statistical analyses instead of t-test without any correction. *

      Done.

      *Fig. 1H: Is this a representative plot of a representative experiment or the plot of the mean values from some experiments? The authors should clarify this point. The author should also clarify the tissue they analyzed and the exact number of the experiment (the number of nuclei in each animal they analyze, the number of animals they analyze in each experiment). The author should also clarify the detailed statistical procedure: one-way ANOVA or two-way ANOVA; which multiple comparison method they used in their analyses. *

      We have now clarified all these issues.

      **Minor comments:** *The author should explain the details for the experiments in either the Methods section or the figure legends. The authors should integrate the figures 6B-J (6B-D -> one figure, 6E-G -> one figure, and 6H-J -> one figure) into three figures because there are redundancies in these figures. In addition, the authors should perform the statistical analyses with multiple comparison procedures instead of a simple t-test. *

      We have now done as suggested by the Reviewer.

      *The authors should perform statistical analyses on Figs. 6N and 6O. *

      Done.

      *The authors would better cite relevant articles when referring to representative target genes of UPRER, UPRmt, and TORC1. *

      Done.

      Reviewer #2 (Significance (Required)): *Understanding the mechanism underlying the HS-induced hormesis in a multicellular organism is essential in the research field. Their finding that functional peroxisomes play a pivotal role in the HS-induced hormesis, if properly demonstrated, would provide us with the significant progress in this field; however, much more proper experiments are required to support their conclusion. Nevertheless, their finding can stimulate the attention of researchers who study the aging process, stress response, and especially peroxisome function.

      I am an expertise in the study of aging in C. elegans. *

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): In this MS, the authors aim at the understanding of the role of peroxisome related-anti-oxidant capacity for the effect of a heat conditioning treatment on the HSR and associated longevity. Despite the finding that ctl-2 mutants they used show reduced resistance upon heat pre-conditioning, neither the mechanism (Ctl-2 is required for HSR) nor the claim that functional peroxisomes are required for heat shock-induced hormesis are, in my view, fully proven by the data in this MS. What the data basically show is that, related to a fragile status of the ctl-2 mutants, pre-conditioning was either to severe (toxic) or/and lead to development defects such that it was no longer effective in priming organismal resistance, likely to HSR-independent features. * **Major comments:** Figure 1: The level of pre-conditioning induced resistance (as opposed to intrinsic sensitivity) that can be induced in a given genetic background is dependent on a number of things, one of which is the severity of the priming dose. A more severe heat shock (that initially causes more damage) leads to a slower rate of tolerance development but the level of tolerance (of the surviving cells) is much higher. However, if too toxic, the priming treatment will result is loss of cells, which - at the organismal level- not or less reveal the resistance of the surviving primed and thus tolerant cells. *

      Agreed.

      Furthermore, such intrinsic sensitivity (un-primed) is determined by many more factors that only the capacity to induce the HSR. It is thus important to better evaluate the relevance of differences in intrinsic sensitivity of wildtype and ctl-2 mutants to the priming heat shock (figure 1F). Albeit interesting that Ctl-2 strains are hypersensitive to heat, this data also could imply that the real mechanism of being able to build up induced-resistance and longevity is not mechanistically due to an altered regulation of the HSR, but merely a reflection of that intrinsic difference in the sensitivity to the damage inflicted by the priming heat treatment. *

      Also, as the treatment was given during the L4 stage, many of the effects may be blurred by differences in the sensitivity to heat treatment on developmental processes, conditions under which also many HSP are differently regulated (also HSF-1 independently). Whilst still interesting, this is even more complex to interpret mechanistically.

      *

      We chose the late L4 stage as opposed to adults since worms are mostly developed at L4, and have no embryos, which is a big advantage. The presence of embryos may confound the measurements, especially in the qPCR experiments since they may express HSPs due to their role in developmental processes as opposed to heat stress. It was also reported elsewhere that L4 stage is the most sensitive to HS in terms of HSR, while exhibiting higher survival rate than younger or older worms following HS. We have included this explanation in the manuscript.

      *In fact, to conclude on the mechanistic involvement on peroxisome redox status for resistance inducting by heat priming, one of would require to e.g. derive cell lines from wildtype and ctl-2 mutant worms and perform an induced-thermotolerance / survival experiments (a iso-toxic and iso-dose heat priming treatments) to see whether ctl-2 truly have an impaired HSR due to cell autonomous features.

      *

      We are not able to establish cell lines from C. elegans, however, we have performed the suggested experiment in the mammalian cells in culture.

      HEK cells were transfected with Lipofectamine 3000 for 48h with either negative siRNA control (siCont), siPEX5 or siCAT at 100 nM, according to the manufacturer’s protocol. For HSR induction, cells were moved for 1h to a humidified incubator containing 5% CO2 at 42,5ºC, or maintained at 37ºC. After, RNA was immediately isolated with NucleoSpin RNA Columns for RNA purification (Macherey-Nagel), and 1000 ng of total RNA was converted to cDNA with iScriptTM cDNA Synthesis Kit (Biorad). Relative gene expression of PEX5, catalase (CAT), HSP90AA1, HSP70 (Hspa1b) and small HSPs (Hspb1, Hsph1 and Hspe1) were determined by qPCR, which were performed in a CFX Opus 384 Real-Time PCR System (Biorad) in a final volume of 5 µL using 2× SYBR Green PCR master mix and 0.3 μM of each primer pair. Relative gene expressions were calculated according to the Pfaffl method, with the results being normalized to the three most stable housekeeping genes in this experimental setting (HPRT, B2M and RPL7), as determined with NormFinder. The Ct averages of the four siCont samples at 37ºC were used as the calibrators to determine ΔCt for each gene. Data comparisons were conducted with two-way analysis of variance (ANOVA) followed by Bonferroni post hoc tests and data is presented as mean ± SEM (Figures 1 and 2).

      Although the siRNA-mediated silencing of CAT and PEX5 was successful (93% and 60% respectively), as determined by qPCR (Figure 1), the expression of several HSPs (HSP70, HSP90 and the small HSPs Hspb1, Hsph1 and Hspe1) remained similar following HS between either CAT- or PEX5- silenced cells and siCont-transfected cells (Figure 2). Preliminary data from SH-SY5Y cells treated as described above yielded similar results (data not shown). These observations may suggest that catalase and/or functional peroxisomes are not necessary for HSR induction and hormesis, but these experiments were limited to two cell lines and one form of HS (1h at 42,5ºC). Therefore, it is possible that different protocols may yet unravel a still undescribed link between peroxisomes, HSR, and life extension, but we to pursue this will take a long time and would be far outside the scope of this manuscript.

      *Related to the actual data in figure 1A-c, also relative effects need to be taken into account as the Ctl-2 mutants are short lived. E.g., if one looks at maximum life span, the differences between the strains seem minimal (20/17 = 1,17 fold for wildtype and 15/13 = 1,15 fold for Ctl-2), so to conclude on no effect of heat pre-conditioning in ctl-2 strains seems an overstatement. *

      This is not entirely correct. Usually, in lifespan measurements in C. elegans, we compare the median lifespan. For WT worms, we measured increased median and maximum lifespan; median lifespan was increased from 11 to 14 days post HS (≈20% increase), and maximum lifespan from 17 to 20 days (15% increase) (Figure 1A). In contrast, while maximum lifespan of the ctl-2(ua90)II strain after HS was increased from 13 to 15 days (≈13% increase), the median lifespan was unchanged and was 10 days for both HS and OGT ctl-2(ua90)II worms (Figure 1B). Overall, mild HS did not significantly affect ctl-2(ua90)II strain lifespan.

      *Regarding to the brood size, it is not only true that these are smaller for clt-2 worms, but also that there was no effect of the pre-conditioning treatment in wildtype whereas a reduction was caused by the pre-conditioning of the ctl-2 worms. What does this imply?

      *

      Since the results of brood size were not sufficiently understood and out of context of the present study, we have removed them from the current version of the manuscript.

      *Regarding the HSF/HSP data. First of all, a better visual insight in the quantitative differences in basal transcription levels between the strains should be provided. It looks as if they could be significantly lower for at least HSP16.1 in clt-2 strains. *

      We have now included description of these results in the manuscript.

      *Next, it would then be essential to evaluate the responses relative to these basal levels in the ctl-2 lines themselves (and not relative to that in wildtype animals). Second, looking at Hsp70, the HSP being most dependent of HSF1 upon a heat shock, the data imply that there is nothing wrong with the heat shock activated HFS-1 response in ctl-2 as such. As stated above, magnitude differences might also be a matter of kinetics, so measuring this at a single time point (4h after HS) may e.g. too early for being at its peak in ctl-2 cells. *

      • *

      We agree with this comment. Therefore, we have performed the experiment where we evaluated the kinetics of the heat shock response in the WT and the ctl-2 mutant worm. The measurements of the kinetics of the heat shock response activation in the WT and the ctl-2 mutant reveal that there are no differences in the kinetics between the strains, whereby the possible differences in the kinetics between the strains have been excluded as a possible interpretation. The results are now reported in the manuscript.

      *Third, it must be emphasized that HSF-1 foci/granules formation is a well-known feature of the response of HSF-1 to heat shock, but these granules are not the site of Hsp transcription, i.e they are not functionally related to HSP expression and not necessarily correlate quantitatively to HSR activation. So, the conclusion that HSF-1 activation/the HSR is truly attenuated in ctl-2 strains is -in my view- not fully proven. In fact, there is an intricate related between small HSP and oxidative stress and the lower (if correct) Hsp16.1 and Hsp16.2 expression could rather be related to such features.

      *

      We thank the Reviewer for drawing our attention to this issue, which we have now addressed in the manuscript by modifying the conclusion of a specific set of results related to the HSF-1 foci formation during heat shock. We have also discussed more extensively the connection with the oxidative stress.

      *Figure 2: As for the HSR, kinetics may be different for wildtype and ctl-2 strains for all these endpoints, reflecting the higher intrinsic, non-primed heat sensitivity of the ctl-2 strain. Again, whilst interesting phenotypically and maybe relevant physiologically (i.e. being able to be primed as weak animal to show organismal resistance), this means that the data are elusive in terms of mechanisms.

      *

      We acknowledged from the beginning that we are missing mechanistic details of the presented result, and we therefore agree with this comment. As for the possible differences in the kinetics of the heat shock response activation in the WT and the ctl-2 worms: the measurements of the kinetics of the heat shock response activation in the WT and the ctl-2 mutant reveal that there are no differences in the kinetics between the strains, whereby the possible differences in the kinetics between the strains have been excluded as a possible interpretation. The results are now reported in the manuscript.

      *In panel 2E, the lack of elevation in CellROX fluorescence by heat shock in wildtype cells in explained as due to the result in activation of the antioxidant defenses. Whereas this may sound OK, it is contradictory the reasoning given above that heat stress induced oxidative stress and hence cause G6PD upregulation (data panel 2D). In addition, whist the authors suggest they are the same, to me the CellROX data for ctl-2 strains appear to be lower (rather than, if anything, higher) for unstressed ctl-2 strains than wildtype strains. Is this not surprising given they are used as model for an impaired oxidation status? And does this not (also) indicate that the knockout lines have developed compensating strategies? Anyway: I got confused here.

      *

      First, there is no significant difference between the CellROX signal of the unstressed WT and the ctl-2 mutants. Why exactly the WT and ctl-2 mutant worms have the same CellROX signal is hard to say; the absence of the peroxisomal catalase could have triggered the activation of the cytosolic antioxidant defenses in optimal conditions already, or other compensatory strategies. It is also possible that the absence of the peroxisomal catalase has its consequences only in stressful conditions. In addition, our results during heat shock suggest that the WT strain is able to neutralize the ROS produced during heat shock, unlike the ctl-2 mutant. The increase in the G6PD expression may have helped with that, since it fed the activation of the pentose phosphate pathway. Our reasoning is that the ctl-2 mutant was for some reason not able to respond in the same way as the WT, and not that there was no need. The anti-ROS protection seems to have been insufficient during heat shock in the ctl-2 strain.

      *Figure 3: First of all, it seems dangerous to conclude anything on peroxisome NUMBER here as what is measured is the presence of an imported FGP-tagged protein into peroxisomes and hence difference may be (also) due to import related effects. In fact, EM data would be required to make firmer conclusions on peroxisomal morphology, size and numbers. *

      • *

      In principle we agree with this comment, however, we demonstrate in Figure 3 that Prx-5 and Prx-11 expression levels do not display any differences in the WT and ctl-2 worms. Therefore, we remain confident about the analysis of the peroxisome number. We hesitate to perform an EM analysis due to the lack of specific markers; anything even slightly resembling a peroxisome could falsely be counted as one. In other words, every methodological approach comes with its own imperfections.

      *In panel C, it is unclear what is significantly different from what; is the signal in clt-2 strains truly lower in ctl-2 strains that in wildtype strains? *

      We thank the Reviewer for drawing our attention to this. In Figure 3C, for the significance analysis, everything was compared to the WT in optimal growth conditions. We have now clarified this in the text. The difference between the ctl-2 mutant and the WT is debatable: while the medians of the data sets are significantly different, this difference may not have any biological significance. Still, we reported the result of the statistical analysis.

      *Figure 4 - 6: I sympathize with the comprehensive analysis presented in these figures. It is clear to me that different things are either (not) up or down in unprimed ctl-2 strains and that heat shock does or does not cause similar effects on these endpoints in wildtype and ctl-2 strains. Whilst this indeed shows that they do respond differently, I do not understand from these that what it all means and, in particular if and how it related causally or consequentially to the impaired priming effects (if true) of the heat shock in ctl-2 strains. *

      The results presented in Figure 4-6 have not been put into a mechanistic context in the present study. In the mentioned figures, we do not report any causal or consequential relationships, but we do wish to report on the differential phenotypes of lipid metabolism and storage, as well as mitochondrial morphology in the WT and ctl-2 mutant worms. However, the presented analyses were performed well and we believe that these results may benefit to other researchers working in related topics.

      **Minor additional comments (textual only)**

      *Whereas this paper discusses the possibility of pre-heat conditioning to induce (long term) resistance, the generality of this as being a hormesis response (or general stress responses) related to other challenges is not warranted and all text concerning that should be deleted. Stresses damaging primarily DNA (ionizing radiation) or proteins & lipid (heat shock) are fundamentally different and each of them requires entirely different and largely independent systems to respond to. *

      Done.

      *Moreover, whilst the induced HSR is clearly an established hermetic response, this is still far less clear for e.g DNA damage responses. Therefore, it is also relevant define the types of stress to which is referred to e.g. when mentioning the envorinmental stimuli to which ctl-2 mutants are apparently hypersensitive. *

      Done.

      Also, the text related to cell non-autonomous response is irrelevant to this study and should be deleted. *

      *

      Done.

      *Page 3 lines 1-4: It is incorrect to write that the role HSP has been put forward as most relevant to heat-induced hormesis. This is how they were discovered, but it is now clear that they play a role in many other pre-conditioning induced resistant phenotypes as well as in the cell-intrinsic sensitivity to proteotoxic stresses in general. Please also be aware that for unprimed, intrinsic resistance to e.g. heat shock, pre-existing levels of HSP are more relevant than the ability to activate the HSR. Activating the HSR is more relevant to resistance to more chronic temperature elevations and acquired resistance via the priming (hormesis).

      *We thank the Reviewer for drawing our attention to this. We agree about the relevance of these points and have modified the manuscript accordingly.

      Reviewer #3 (Significance (Required)): * Regulation of the cell intrinsic heat shock response is an important item to understand how cels may be primed to become resilient to (certain) other stresses. Besides the main studied regulator (HSF-1), many other levels of regulation likely exist and intra-organellar communication and proteostasis might be an important aspect for controlling such regulation.

      As such, peroxisome proteostasis (that, unlike other organelles, are not (also) controlling a organellar unfolded protein response) is an interesting organelle that could co-control the cytosolic heat shock response. So, the aim of this study per se is quite interesting, However, although for evaluation of physiological relevance c. elegance is a good model of choice, for the mechanistic studies, cellular experiments would have been better suited. *

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this MS, the authors aim at the understanding of the role of peroxisome related-anti-oxidant capacity for the effect of a heat conditioning treatment on the HSR and associated longevity. Despite the finding that ctl-2 mutants they used show reduced resistance upon heat pre-conditioning, neither the mechanism (Ctl-2 is required for HSR) nor the claim that functional peroxisomes are required for heat shock-induced hormesis are, in my view, fully proven by the data in this MS. What the data basically show is that, related to a fragile status of the ctl-2 mutants, pre-conditioning was either to severe (toxic) or/and lead to development defects such that it was no longer effective in priming organismal resistance, likely to HSR-independent features.

      Major comments:

      Figure 1: The level of pre-conditioning induced resistance (as opposed to intrinsic sensitivity) that can be induced in a given genetic background is dependent on a number of things, one of which is the severity of the priming dose. A more severe heat shock (that initially causes more damage) leads to a slower rate of tolerance development but the level of tolerance (of the surviving cells) is much higher. However, if too toxic, the priming treatment will result is loss of cells, which - at the organismal level- not or less reveal the resistance of the surviving primed and thus tolerant cells.

      Furthermore, such intrinsic sensitivity (un-primed) is determined by many more factors that only the capacity to induce the HSR. It is thus important to better evaluate the relevance of differences in intrinsic sensitivity of wildtype and ctl-2 mutants to the priming heat shock (figure 1F). Albeit interesting that Ctl-2 strains are hypersensitive to heat, this data also could imply that the real mechanism of being able to build up induced-resistance and longevity is not mechanistically due to an altered regulation of the HSR, but merely a reflection of that intrinsic difference in the sensitivity to the damage inflicted by the priming heat treatment.

      Also, as the treatment was given during the L4 stage, many of the effects may be blurred by differences in the sensitivity to heat treatment on developmental processes, conditions under which also many HSP are differently regulated (also HSF-1 independently). Whilst still interesting, this is even more complex to interpret mechanistically.

      In fact, to conclude on the mechanistic involvement on peroxisome redox status for resistance inducting by heat priming, one of would require to e.g. derive cell lines from wildtype and ctl-2 mutant worms and perform an induced-thermotolerance / survival experiments (a iso-toxic and iso-dose heat priming treatments) to see whether ctl-2 truly have an impaired HSR due to cell autonomous features.

      Related to the actual data in figure 1A-c, also relative effects need to be taken into account as the Ctl-2 mutants are short lived. E.g., if one looks at maximum life span, the differences between the strains seem minimal (20/17 = 1,17 fold for wildtype and 15/13 = 1,15 fold for Ctl-2), so to conclude on no effect of heat pre-conditioning in ctl-2 strains seems an overstatement. Regarding to the brood size, it is not only true that these are smaller for clt-2 worms, but also that there was no effect of the pre-conditioning treatment in wildtype whereas a reduction was caused by the pre-conditioning of the ctl-2 worms. What does this imply?

      Regarding the HSF/HSP data. First of all, a better visual insight in the quantitative differences in basal transcription levels between the strains should be provided. It looks as if they could be significantly lower for at least HSP16.1 in clt-2 strains. Next, it would then be essential to evaluate the responses relative to these basal levels in the ctl-2 lines themselves (and not relative to that in wildtype animals). Second, looking at Hsp70, the HSP being most dependent of HSF1 upon a heat shock, the data imply that there is nothing wrong with the heat shock activated HFS-1 response in stl-2 as such. As stated above, magnitude differences might also be a matter of kinetics, so measuring this at a single time point (4h after HS) may e.g. too early for being at its peak in ctl-2 cells. Third, it must be emphasized that HSF-1 foci/granules formation is a well-known feature of the response of HSF-1 to heat shock, but these granules are not the site of Hsp transcription, i.e they are not functionally related to HSP expression and not necessarily correlate quantitatively to HSR activation. So, the conclusion that HSF-1 activation/the HSR is truly attenuated in ctl-2 strains is -in my view- not fully proven. In fact, there is an intricate related between small HSP and oxidative stress and the lower (if correct) Hsp16.1 and Hsp16.2 expression could rather be related to such features.

      Figure 2: As for the HSR, kinetics may be different for wildtype and ctl-2 strains for all these endpoints, reflecting the higher intrinsic, non-primed heat sensitivity of the ctl-2 strain. Again, whilst interesting phenotypically and maybe relevant physiologically (i.e. being able to be primed as weak animal to show organismal resistance), this means that the data are elusive in terms of mechanisms.

      In panel 2E, the lack of elevation in CellROX fluorescence by heat shock in wildtype cells in explained as due to the result in activation of the antioxidant defenses. Whereas this may sound OK, it is contradictory the reasoning given above that heat stress induced oxidative stress and hence cause G6PD upregulation (data panel 2D). In addition, whist the authors suggest they are the same, to me the CellROX data for ctl-2 strains appear to be lower (rather than, if anything, higher) for unstressed ctl-2 strains than wildtype strains. Is this not surprising given the are used as model for an impaired oxidation status? And does this not (also) indicate that the knockout lines have developed compensating strategies? Anyway: I got confused here.

      Figure 3: First of all, it seems dangerous to conclude anything on peroxisome NUMBER here as what is measured is the presence of an imported FGP-tagged protein into peroxisomes and hence difference may be (also) due to import related effects. In fact, EM data would be required to make firmer conclusions on peroxisomal morphology, size and numbers.

      In panel C, it is unclear what is significantly different from what; is the signal in clt-2 strains truly lower in ctl-2 strains that in wildtype strains?

      Figure 4 - 6: I sympathize with the comprehensive analysis presented in these figures. It is clear to me that different things are either (not) up or down in unprimed ctl-2 strains and that heat shock does or does not cause similar effects on these endpoints in wildtype and ctl-2 strains. Whilst this indeed shows that they do respond differently, I do not understand from these that what it all means and, in particular if and how it related causally or consequentially to the impaired priming effects (if true) of the heat shock in ctl-2 strains.

      Minor additional comments (textual only)

      Whereas this paper discusses the possibility of pre-heat conditioning to induce (long term) resistance, the generality of this as being a hormesis response (or general stress responses) related to other challenges is not warranted and all text concerning that should be deleted. Stresses damaging primarily DNA (ionizing radiation) or proteins & lipid (heat shock) are fundamentally different and each of them requires entirely different and largely independent systems to respond to.

      Moreover, whilst the induced HSR is clearly an established hermetic response, this is still far less clear for e.g DNA damage responses. Therefore, it is also relevant define the types of stress to which is referred to e.g. when mentioning the envorinmental stimuli to which ctl-2 mutants are apparently hypersensitive.

      Also, the text related to cell non-autonomous response is irrelevant to this study and should be deleted.

      Page 3 lines 1-4: It is incorrect to write that the role HSP has been put forward as most relevant to heat-induced hormesis. This is how they were discovered, but it is now clear that they play a role in many other pre-conditioning induced resistant phenotypes as well as in the cell-intrinsic sensitivity to proteotoxic stresses in general. Please also be aware that for unprimed, intrinsic resistance to e.g. heat shock, pre-existing levels of HSP are more relevant than the ability to activate the HSR. Activating the HSR is more relevant to resistance to more chronic temperature elevations and acquired resistance via the priming (hormesis).

      Significance

      Regulation of the cell intrinsic heat shock response is an important item to understand how cels may be primed to become resilient to (certain) other stresses. Besides the main studied regulator (HSF-1), many other levels of regulation likely exist and intra-organellar communication and proteostasis might be an important aspect for controlling such regulation.

      As such, peroxisome proteostasis (that, unlike other organelles, are not (also) controlling a organellar unfolded protein response) is an interesting organelle that could co-control the cytosolic heat shock response. So, the aim of this study per se is quite interesting, However, although for evaluation of physiological relevance c. elegance is a good model of choice, for the mechanistic studies, cellular experiments would have been better suited.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In the manuscript entitled "Functional peroxisomes are required for heat shock-induced hormesis in Caenorhabditis elegans", Musa et al. explored the role of functional peroxisomes in the heat shock (HS)-induced hormesis in C. elegans using peroxisomal catalase ctl-2 deletion mutants. They showed that ctl-2 deletion abolished HS-induced longevity and suppressed the HS-induced upregulation of small heat shock proteins and HS-induced HSF-1 nuclear accumulation. Furthermore, the authors analyzed the differences in HS-induced phenotypes between wild type animals and ctl-2 mutants: the activation of the antioxidant response, the pentose phosphate pathway, and increased triglyceride content, which are the most prominent changes observed during heat shock in wild-type animals. However, there is only weak evidence supporting their claim that functional peroxisomes are required for HS-induced hormesis.

      Major comments:

      1. The authors conclude that the functional peroxisomes are essential for the HS-induced hormesis based only on one observation that the transient heat shock treatment did not increase the lifespan in the short-lived ctl-2 mutants. Because the ctl-2 mutants are short-lived, the author should have carefully examined the relevant role of functional peroxisomes in hormesis response by conducting lifespan measurements using ctl-2 RNAi-treated animals or mutants deficient in other genes essential for peroxisome function.
      2. The authors examined the difference between wild-type animals and ctl-2 mutants in many aspects; however, there is no rational explanation or examination that their observations have a role in HS-induced hormesis.
      3. There is a severe shortage in the explanation of their experiments, especially for the methods of the experiments and statistical procedures. The authors should clarify the details of the experiments with proper statistical analyses. Follows are the points to be clarified for Figure 1 as insufficiency examples in the explanation in their study.

      <Points for Figure 1: they should also carefully consider the clarification on other figures> Fig. 1A, B: Is this the representative survival curve of several experiments? The authors should clarify this point. The authors should include a table with the statistical numbers: the number of animals they used, the number of animals they measure in each experiment, and the p-value. Fig. 1C: Is this a representative brood size of one animal, the mean brood size of some animals of one representative experiment, or the mean brood sizes from some experiments? The authors should clarify this point. Also, the authors should perform statistical analysis. Fig. 1D-F: The authors should state the number of experiments they conducted. The author should perform multiple comparison statistical analyses instead of t-test without any correction. Fig. 1H: Is this a representative plot of a representative experiment or the plot of the mean values from some experiments? The authors should clarify this point. The author should also clarify the tissue they analyzed and the exact number of the experiment (the number of nuclei in each animal they analyze, the number of animals they analyze in each experiment). The author should also clarify the detailed statistical procedure: one-way ANOVA or two-way ANOVA; which multiple comparison method they used in their analyses.

      Minor comments:

      The author should explain the details for the experiments in either the Methods section or the figure legends. The authors should integrate the figures 6B-J (6B-D -> one figure, 6E-G -> one figure, and 6H-J -> one figure) into three figures because there are redundancies in these figures. In addition, the authors should perform the statistical analyses with multiple comparison procedures instead of a simple t-test. The authors should perform statistical analyses on Figs. 6N and 6O. The authors would better cite relevant articles when referring to representative target genes of UPRER, UPRmt, and TORC1.

      Significance

      Understanding the mechanism underlying the HS-induced hormesis in a multicellular organism is essential in the research field. Their finding that functional peroxisomes play a pivotal role in the HS-induced hormesis, if properly demonstrated, would provide us with the significant progress in this field; however, much more proper experiments are required to support their conclusion. Nevertheless, their finding can stimulate the attention of researchers who study the aging process, stress response, and especially peroxisome function.

      I am an expertise in the study of aging in C. elegans.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the manuscript " Functional peroxisomes are required for heat shock-induced hormesis in Caenorhabditis elegans" the authors show that a hormetic heat shock requires the peroxidase catalase ctl-2 for longevity and thermotolerance. Furthermore the authors characterize the hormetic stress response in ctl-2 mutants and show that ctl-2 is required for the proper formation of HSF-1 stress granules, and that ctl-2 mutants have a changed transcriptional response to heat shock and a changed induction of heat-stress induced oxidative stress. The authors go on to characterize peroxisomes, and peroxidase-regulated fatty acids, as well as mitochondria in both ctl-2 mutants and upon heat shock. The authors conclude that functional peroxisomes play an important role in the hormetic heat stress response.

      This is an interesting study, that demonstrates that the loss of the peroxidase catalase ctl-2 plays an important role in the heat stress response. The authors need to provide more details on experimental repeats and experiments strengthening the conclusion that peroxisomes are generally important for the hormetic stress response would also improve this manuscript.

      Furthermore, some textual changes would clarify some specifics in this manuscript.

      Major issues:

      1. Throughout the manuscript it is unclear how many times the experiments were conducted (e.g. Lifespan, thermotolerance) or how many biological replicates were used (qRT-PCR, peroxisome characterization etc). This critical information should be included. A table with the individual lifespan experiments and thermotolerance experiments is expected.
      2. The authors are characterizing the ctl-2 mutant upon heat stress and find some compelling differences to wild-type animals. The authors state "We measured the expression levels of two peroxisomal transport proteins, PRX-5 and PRX-11, and found no differences in expression between WT and Δctl-2 strains, suggesting that morphogenesis should not be impaired and that proteins needed for their proper function should be present, with the exception of ctl-2 in the Δctl-2 strain (Figure 3A)." The authors however conclude that functional peroxisomes are required for the benefits of a hormetic HS. What the authors are in fact demonstrating is that ctl-2 specifically is required for the hormetic HS response. To demonstrate that peroxisomes in general are required they should disrupt peroxisome function by additional means. What the authors also demonstrate is that heat shock has an effect on peroxisomes. These differences need to be clarified. It is also not fully clear whether peroxisomes are functional in ctl-2 mutants? Further peroxisome-relevant enzymes should be tested for levels and functionality in ctl-2 mutants and upon HS.
      3. The TORC1 experiments are too indirect. The authors should perform WesternBlots with a S6Kinase -Phospho antibody to determine whether TORC1 is inhibited or not. Alternatively, the authors may choose to remove this experiment from the manuscript without disrupting the main message of the manuscript, since no link between TORC1 and peroxisome function nor with HS has been established.

      Textual changes:

      1. The authors should adhere to C. elegans nomenclature convention (at least once), by referring to the ctl-2 deletion with the specific allele name [ctl-2(xx)] (unlike in yeast, ∆ is not commonly used). Furthermore a more thorough description of the specific allele (complete gene deletion? Point mutation? Truncation?) should be included.
      2. Throughout the manuscript the authors should refer back to their specific hormetic heat shock paradigm, since several studies (also cited here) have shown differences in the specific physiological changes (e.g. HS on day 1 of adulthood has profound effects on broodsize, whereas the authors show here that a HS at L4 does not, other differences in terms of activation of other stress reporters has also been reported).
      3. The abbreviation OGT (original growth temperature??) should be defined
      4. The authors should change the sentence "The binding of HSF-1 to the HSEs can be visualized by a HSF-1::GFP fusion protein". This is not technically true. While HS has been shown to induce HSF-1 stress granules and these stress granules have been indirectly shown to correlate with transcription, it has however not been demonstrated that HSF-1 is bound to HSEs or whether it is bound to other DNA stretches or what is in fact transcribed by HSF-1 when it localizes to these foci.
      5. The authors describe the morphological changes of peroxisomes upon HS, however they missed the opportunity to fully interpret these results in the discussion. Again, their conclusion that peroxisomes may be impaired in ctl-2 mutants is very vague.

      Significance

      Hormesis and the role of peroxisomes in stress responses is an important and interesting topic.

      The role of peroxisomes in stress responses has not been addressed.

      Researchers with interest in stress responses will be interested in this work.

      My expertise lies in C.elegans stress and longevity with a specific focus on hormetic mechanisms.

      Referee Cross-commenting

      While I agree with most comments from the other referees, I don't believe it is feasible to ask the authors to generate c. Elegans cell cultures for any follow up experiments. I would be satisfied with a more thorough comparison of the HS response between WT and Ctl-2 mutants, I.e: compare preconditiöning: 1h and 4h HS at L4 and d1 and then do thermotolerance experiments

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time. We aim to provide a revised version of the manuscript within two month.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In recent years, significant progress has been made in defining the molecular details of many structural features of the nuclear pore complex (NPC). However, one area that remains ill-defined is the interface between the core structures of the NPC and the pore membrane domain. This is an especially intriguing area when one considers that the NPC contains several integral proteins and numerous peripheral membrane proteins contain amphipathic helices whose functions and interactions with the membrane, as well as with one another, remain largely undefined.

      In this manuscript by Amm et al., the authors have examined the functional role of the integral membrane Nup Ndc1 and its interactions with various peripheral membrane Nups, including members of the Nup84 complex (termed the Y-complex) and the linker Nups Nup53 and Nup59. The authors show that Ndc1 interacts with specific members of the Nup84 complex, namely Nup120 and Nup133, supporting the idea that Ndc1 functions, in part, to anchor this NPC substructure to the pore membrane. In addition, they identified an amphipathic helix (AH) within the C-terminal half of Ndc1, and they showed that it can directly bind to membranes. Importantly, they have used genetic assays to show that the Ndc1-AH functionally interacts with AHs present at the C-terminus of Nup53 and Nup59. Strikingly, they show that the lethal phenotype detected in strains lacking Ndc1 can be suppressed by the deletion of NUP53, but not NUP59, and, more specifically, only the loss of the C-terminal AH Nup53 was required to suppress the lethal phenotype of the ndc1 null mutation. Further ultrastructural analysis of these mutants revealed that, while these mutants were viable, they exhibited extensive NE expansion phenotypes.

      Overall, the data presented in this manuscript are of high quality, and the experiments are well controlled. My specific comments are relatively minor and listed below.

      Minor points

      1) The authors state "Serial ultrathin sections of fixed yeast cells overexpressing ProtA-CtNdc1 revealed that these unusual extranuclear membrane proliferations exhibited pore-like structures with diameters similar to the diameter of NPCs within the nuclear membrane (Fig. 2C)." This is not entirely clear from the data. I suggest the authors provide direct measurements that support their statement.

      2) The authors examined the total cellular lipid content following overexpression of Ndc1-AH-containing constructs, as well as ProtA-ScHmg1. There is little discussion of the significance of these results, which would provide a clear justification for including these data in the manuscript.

      3) There are numerous typographical and grammatical errors throughout the manuscript that need to be addressed.

      Significance

      The results presented in this manuscript provide further insight into the molecular interactions between Nups and the pore membrane. They suggest that AHs present in a subset of Nups perform linked functions and contribute, in part, to nuclear membrane biogenesis. As such, these results are an important advance in our knowledge of NPC structure and function. They will be of general interest to those studying the function of NPCs and, more generally, NE and organelle biogenesis.

      Reviewer expertise: NPC structure and function, NE biogenesis, yeast model system.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Amm et al report on the role of new motifs and interactions between the essential and conserved integral nuclear pore membrane protein Ndc1 and other key components of the yeast nuclear pore complex. They show that members of the Y-subcomplex that coats the pore membrane bind directly to Ndc1 and identify an amphipathic helix at the C-terminus of Ndc1 that displays genetic interactions with other nucleoporins carrying analogous amphipathic helices. The authors find that cells can survive without Ndc1 when these related amphipathic helices from other nups are coincidentally deleted.

      Despite significant recent advances in our structural understanding of the nuclear pore complex, how the NPC associates with the curved nuclear membrane remains poorly understood. Previous studies in yeast have uncovered significant redundancy in this association but again the basis for this remains unclear. Therefore, I find this study on the amphipathic helix of Ndc1 and its interaction with other membrane binding components of the NPC an important and timely contribution to the field. Technically, the paper is solid and I find that most of the authors' conclusions are well supported by the evidence they provide (but see below for few experimental issues). Overall, the paper is well written, and despite the use of several mutants and methodologies, it is easy to read. I think the paper's significance would improve if the authors could present some "larger picture" view on how the Ndc1 helix and/or domains they describe interact with the Nup84 complex and the pore membrane or other elements of the NPC. For example, the authors make the remarkable finding that removal of Nup53 makes ndc1 nulls able to survive. Would it be possible to use existing models of the yeast NPC and provide some structural explanation of why that is? However, I would like to emphasize that this is not required to support the main claims of the paper and should only be considered if the authors wish to provide a more "molecular" view of their findings.

      Specific experimental issues and clarifications:

      • A major part of the manuscript describes a detailed structure-function analysis of Ndc1. The link between the two domains of ScNdc1 studied and their effects on membrane proliferation could be better defined: specifically, can the authors exclude that the N-domain of Ndc1 that includes its transmembrane domain, is not also involved in the membrane proliferation phenotype shown in Fig2A and C? It also seems as if GAL-ProtA-ScNdc1 (1-260) also causes growth inhibition (Fig. 2F). How do cells with GAL-ProtA-ScNdc1 (1-260) look like? Finally, although the authors convincingly show that overexpression of 261-655 inhibits growth, from the EM it seems as its effects on membrane proliferation is not the same as that of the overexpression of full-length Ndc1 (compare Fig. 3D vs Fig. 2D).

      • Figure 1A: Do the CtNups shown under "Input" represent 100% of what used in the binding reaction? If so, please indicate at the figure.

      • CtNup120 and CtPom133 would migrate close to CtPom152, which could make visualization by Coomassie stain a bit tricky - if the authors could provide SDS PAGE gels with lower %, that would be helpful. Along similar lines, how do the authors know that CtNup120beta does not bind the CtNdc1 if these two appear to migrate at the same size (Fig. 1D)?

      • Figure 1B, GUVs: Why do the authors use CtNup85 for the GUV experiment instead of CtNup84 that was used in Fig. 1A?

      • Moreover, CtNup120 and CtNup133 ...BC08/SCL1 (Fig. 1C)" Don't see this in Fig. 1C

      • The imaging of ProtA-AHNdc1-eGFP (Fig. 3C) is not great and the localization of the AH does not look very clear - can the authors provide better micrographs? Perhaps co-expression of a red ER reporter or similar reporter would also help.

      • The ndc1 nup53 double mutant appears to display a striking cold-sensitive growth defect (Supplemental Figure 6A, compare 23 vs 30C). Can the authors comment on this?

      Significance

      Despite significant recent advances in our structural understanding of the nuclear pore complex, how the NPC associates with the curved nuclear membrane remains poorly understood. Previous studies in yeast have uncovered significant redundancy in this association but again the basis for this remains unclear. Therefore, I find this study on the amphipathic helix of Ndc1 and its interaction with other membrane binding components of the NPC an important and timely contribution to the field. Technically, the paper is solid and I find that most of the authors' conclusions are well supported by the evidence they provide (but see below for few experimental issues). Overall, the paper is well written, and despite the use of several mutants and methodologies, it is easy to read. I think the paper's significance would improve if the authors could present some "larger picture" view on how the Ndc1 helix and/or domains they describe interact with the Nup84 complex and the pore membrane or other elements of the NPC. For example, the authors make the remarkable finding that removal of Nup53 makes ndc1 nulls able to survive. Would it be possible to use existing models of the yeast NPC and provide some structural explanation of why that is? However, I would like to emphasize that this is not required to support the main claims of the paper and should only be considered if the authors wish to provide a more "molecular" view of their findings.

      Audience: Mostly the following - Nuclear pore complex, nuclear envelope, and possibly some membrane biologists.

      My field of expertise: Cell biology, Nuclear envelope.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Ndc1 is a transmembrane nucleoporin, essential for insertion of the nuclear pore complex (NPC) and spindle pole body (SPB) into the nuclear envelope (NE). How NE-associated proteins contribute to the bending and fusion of membranes during NPC insertion has not been fully elucidated. Here, the authors report a number of loosely connected, interesting observations related to Ndc1 function. Their main findings are the following: (i) The N-terminal transmembrane domain of Ndc1 mediates the membrane recruitment of two Y-complex nucleoporins. Therefore, these interactions are likely to contribute to NPC biogenesis. (ii) Over-expression of a novel amphipathic helix (AH) in the non-essential C-terminus of Ndc1, and of a similar AH in the non-essential nucleoporin Nup53, alters the lipid composition and nuclear morphology of yeast cells, although the underlying mechanisms remain unknown. (iii) The essential function of Ndc1 can be suppressed by deleting the amphipathic helix from Nup53, or by deleting the transmembrane nucleoporin POM34. Surviving strains have altered nuclear morphology (NE expansions), and are sensitive to membrane-fluidizing drugs, suggesting that NPC assembly is somehow linked to lipid homeostasis.

      Overall, the experiments are of high technical quality, are presented in a clear way, and the conclusions are well-supported by the data. I have some minor suggestions for clarifications, which can be addressed by textual changes or by additional experiments.

      1. When overexpressed in budding yeast, the C-terminal domain of Ndc1 is toxic and induces membrane expansion with NPC-like openings, which the authors describe as enlarged ER membranes (Figure 2). Could these be NE expansions instead? ER and NE membranes are continuous but perhaps this issue could be addressed by examining the distribution of fluorescent markers specific for each compartment.

      2. The essential function of Ndc1 can be suppressed by deleting the amphipathic helix from Nup53 or by deleting POM34. These experiments are done using a plasmid shuffle strategy, in which Ndc1 is temporarily expressed from a low copy plasmid. I wonder if surviving strains are stable, or whether they survive for a limited time only due to stabilisation of the Ndc1 protein in the absence of Nup53 or Pom34. Could the authors discard this possibility, for example by checking whether viable double mutants are recovered after backcrossing of the survivor strains?

      3. Cells over-expressing Ndc1, and surviving ndc1-delta strains display ER and/or NE expansions. It would be interesting to discuss these observations in the context of nuclear morphology studies by the Cohen-Fix and Liakopoulos labs, among others, showing NE expansion is partially dependent on the coordination between lipid synthesis, cell growth rate, and cell cycle progression (doi: 10.1091/mbc.E18-04-0204, 10.1091/mbc.e05-09-0839, 10.1016/j.cub.2012.04.022).

      4. Related to the previous point: nuclear membrane expansions caused by metaphase arrest usually overlap with the nucleolus, and appear DAPI-negative. Did the authors examine nucleolar distribution relative to NE expansion in cells shown in figure 4C? Along the same lines, what is the cell cycle distribution of cells with ER/NE expansion? If they are delayed in mitosis, nuclear morphology defects may be a secondary consequence of cell cycle progression defects, themselves due to NPC and/or SPB insertion problems.

      5. I suggest to rephrase the last sentence of the abstract: "nuclear membrane biogenesis dependent on a balanced ratio between amphipathic motifs in diverse nucleoporins is essential for interphase NPC biogenesis". This study does not directly assess NPC biogenesis and therefore, the interesting link between lipids and NPC biogenesis remains correlative.

      6. It would be useful to include some information on the number of cells observed in the EM figures.

      7. Results, first page: "Moreover, CtNup120 and CtNup133 did not associate with GUVs containing the unrelated inner nuclear membrane protein BC08/SCL1 (Fig. 1C)" should be Figure S1C.

      8. P. 19: "Prompted by the finding that Ndc1 and Nup53/Nup59 amphipathic motifs may (modify?) the nuclear ... "

      I am an expert in yeast genetics and cell cycle progression.

      Significance

      Significance: This report describes novel functional motifs in the Ndc1 protein that may be important for NPC assembly, and intriguing genetic interactions between NPC assembly and lipid homeostasis pathways. Although the mechanisms linking Ndc1 motifs with NE expansion and lipid composition remain unclear, these observations will be interesting for researchers working on NPC biogenesis and nuclear morphology.

      Reviewer Expertise: yeast genetics, cell cycle progression and NPCs.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Major #1

      This study primarily uses the genetic mouse model in which LSD1 gene is inactivated after tamoxifen injection in 8 weeks old mice, as shown in supplemental figure 1 B and C. 8 weeks after birth postnatal growth of muscle is not complete and the contribution of satellite cells to muscle growth is still significant. Therefore the timing of tamoxifen injection used cannot discriminate if the observed phenotype involves the function of LSD1 during the post-natal growth of the muscle or in the muscle fibers or both. One way to demonstrate the real contribution of LSD1 in the maintenance of muscle fibers plasticity under environmental stress would be to inject Tamoxifen later (around 10-12 weeks of age), in order to remove a possible bias caused by the contribution of satellite cells during the post-natal growth. At least key findings should be confirmed at this later stage.

      In this study, we used ACTA1-CreERT mice to conditionally knockout LSD1 in the skeletal muscle. The ACTA1 promoter is derived from human muscle actin gene, which is not expressed in the satellite cells, and has been widely used for the transgene expression in myofibers (Stantzou et al. Development 2017). Thus, the inactivation of LSD1 occurs in the existing myofibers, and alterations in satellite cell function, if any, would be indirect effects of the loss of LSD1 in mature myocytes or differentiating myoblasts.

      To test whether postnatal muscle growth was affected in our LSD1-mKO mice, we administrated tamoxifen (4OHT) to pre-weaning mice (11 days old). LSD1 depletion did not affect the expression of muscle fiber genes, when muscle tissues were isolated from mice 11 days after the start of 4OHT (Additional Data).

      These evidences exclude the contribution of satellite cells in the phenotypes observed in the LSD1-mKO mice. __Additional Data __will be added in the revised manuscript.

      Major #2

      LSD1 m-KO muscles seem to have more type I and IIA fibers than WT, even without DEX treatment. Is it possible to quantify the results in supplemental figure 4C?

      As suggested, we quantitatively analyzed the fiber type compositions in Supplemental Fig. 4C using the data from WT (n=4) and LSD1-mKO (n=5) mice (Additional Data). We did not find a significant difference between these mice, confirming our finding that the loss of LSD1 accelerates the Dex-driven phenotypic changes. __Additional Data__will be added in the revised manuscript.

      Major #3

      The effect on fiber type is convincing, while variations in gene expression are of quite low amplitude. However, the atrophy should be induced by other means to ensure that the effects are specific to GC/nuclear receptors pathways; Denervation? Starvation? Not all the experiments need to be repeated, just key results such as, for example, exacerbation of atrophy in LSD1 m-KO, Foxk1 increase.

      We agree that testing alternative atrophy models is important for generalizing our findings. For this, we employed a model for diabetes-related muscle atrophy. A pro-diabetic agent streptozotocin (STZ) disturbs the function of pancreatic islet leading to fast-fiber atrophy (O’Neill et al. Diabetes 2019). LSD1-mKO did not affect the muscle weight in STZ-treated mice (Additional Data). Consistently, there were no major difference in the expression of atrophy genes in STZ-treated WT and LSD1-mKO mice (Additional Data). These results suggest that the LSD1 function depends on the source of atrophy-inducing stress, and that the loss of LSD1 sensitized the muscle to GC-mediate signaling. Additional Data will be added in the revised manuscript.

      Major #4

      Autophagy data: the effect on the LC3I/LC3II ratio are modest. The autophagy part should be removed or completed with additional data to convincingly show that autophagy is affected. Links between LSD1 and mTOR have been published, so the mTOR pathway could be investigated in the model (S6k, S6 and 4EBP1 phosphorylation). Given AKT levels and phosphorylation are affected by the absence of LSD1 + DEX, it can be predicted that mTOR activity will change.

      We have analyzed the expression of additional autophagy markers p62 and phosphorylated 4EBP1. Consistent with the upregulated expression of atrophy genes and increased LC3I/II ratio, LSD1-mKO mice had elevated levels of p62 and phosphorylated 4EBP1 (Additional Data). Altogether, the data suggest that Dex-induced muscle atrophy was exacerbated by the loss of LSD1. Additional Data will be added in the revised manuscript.

      Major #5 The ability of LSD1 to retain FOXK1 in the nucleus is an important information that should be better supported experimentally. In the absence of such information, no mechanism can be proposed for the effect of LSD1 of FOXK1. The immunofluorescence images provided are not convincing and moreover they could be interpreted by a reduction in the level of FOXK1 protein (degradation?) rather than by a nuclear exclusion in the presence of DEX. This point should be addressed, authors could include western blot of nuclear and cytoplasmic fractions to better quantify the nuclear level of FOXK1 in absence of LSD1.

      We agree that performing the suggested experiment would further enhance the quality of our study.

      Major #6 The absence of centralized nuclei indicates that there is no fiber regeneration but it does not exclude the possibility that satellite cells were recruited to existing fibers and thus participated to hypertrophy. To eliminate this possibility, the average nuclei/cytoplasm volume should decrease if hypertrophy results from increased protein synthesis and not myonuclei accretion. This should be checked.

      We histologically analyzed the sections of Gas muscles after Dex treatment and found that there is no evidence of central nuclei in either WT or KO mice (Supplemental Fig. 4D).

      As mentioned above (Major #1), it is unlikely that the satellite cell function was responsible for the enhanced atrophic phenotype.

      Major #7 The upregulation of ERR____g in the absence of LSD1 is convincing in the VWR conditions. ERR____g level should be evaluated in the sedentary LSD1 KO mice.

      We have analyzed the expression of ERRg in sedentary mice, and found no significant difference between WT and KO mice (Additional Data). This suggests that the loss of LSD1 in combination with VWR training led to the increased expression of ERRg. Additional Data will be added in the revised manuscript.

      Minor #1

      There is a clear difference in the number of mouse replicates between treated (Dex or VWR) and non-treated mice, regardless the genotype. Experiments with non-treated mice lack adequate numbers to make a definitive conclusion. For example, there is a huge spread in the data in Figure 1 B and 4 D. If the number of animals would have been increased, would the absence of difference hold up?

      We increased the number of non-treated animals in Figures 1B and 4B as suggested. Nonetheless, we did not find any significant differences in the muscle weight (Additional Data). These changes will be reflected on original Figures 1B and 4B.

      Minor #2 The authors claim that: "Consistent with the results of the augmented endurance capacity, the Sol muscle in the KO mice showed enhanced succinate dehydrogenase (SDH) staining, indicating that the number of oxidative fibers increased (Figure 4F and Supplemental Figure 8F)". However, supplemental figure 8 D indicates that the number of type I fibers does not change compared to WT. Authors should clarify this statement.

      Indeed, we found that the area of type I fiber but not the number was increased in the LSD1-KO Sol (Fig. 4D and Supplemental Fig. 8D). Because SDH staining reflects the OXPHOS capacity in all fiber types, it is possible that the OXPHOS capacity in the fibers other than type I had been augmented by LSD1-KO. Thus, for clarification, we will change the statement as follows: OXPHOS capacity of Sol was enhanced by the loss of LSD1.

      Reviewer #2

      __Methods

      1__

      The authors used the Cre-lox system with tamoxifen to generate skeletal muscle-specific LSD1 KO mice. It is clear that both the mRNA and protein levels of LSD1 in various muscles were dramatically reduced, but there is still some LSD1 expressed in skeletal muscle, especially in Sol muscle (Supplemental Figure 1C). The author needs to think about whether it is appropriate to use the term "LSD1 knockout" or "LSD1 deficiency".

      We thank the reviewer for this comment. In this study, we crossed LSD1-floxed mice with ACTA1-creERT mice. This enables the deletion of critical exons of LSD1 in mature myocytes and myogenic precursors that have initiated the differentiation program. LSD1 is a ubiquitously expressed gene, and it is known that immature myogenic cells (e.g., satellite cells, Tosic et al. Nat Commun. 2018) and other non-myogenic cells such as hematopoietic and vascular cells abundantly express LSD1 (Kerenyi et al. Elife 2013, Yuan et al. Biochem Pharmacol. 2022). Thus, it is likely that LSD1 expression by these cell types were detected in our whole muscle western blots. We will add these statements in the text for clarification.

      __Results

      2__

      To identify the transcriptional regulators that mediate the regulation of atrophy-associated genes by LSD1, the authors performed motif analyses on the promotor regions of upregulated genes in LSD1-mKO Gas. Based on the results and other reports, they focused on Foxk1 and proved LSD1 and Foxk1 cooperatively regulate the atrophy transcriptome in the presence of Dex. However, Figure 3C showed that a transcription factor Nfatc1 is also reduced in Sol muscle similar to Foxk1. Also, other studies demonstrated that the transcription factor NFATc1 controls fiber type composition and is required for fast-to-slow fiber type switching in response to exercise in vivo. More specifically, NFATc1 inhibits MyoD-dependent fast fiber gene promoters by physically interacting with the N-terminal activation domain of MyoD and blocking recruitment of the essential transcriptional coactivator p300 (Cell Rep. 2014 Sep 25; 8(6): 1639-1648). Furthermore, it has been reported that LSD1 Controls Timely MyoD Expression via MyoD Core Enhancer Transcription (Cell Rep. 2017 Feb 21;18(8):1996-2006. doi: 10.1016/j.celrep.2017.01.078). It is unclear how the authors exclude Nfatc1 for the LSD1-mediated effects in different muscle fibers. Further experiments may be necessary to exclude Nfatc1.

      We thank the reviewer for an insightful comment. In addition to Foxk1, we tested the involvement of NFATc1 in the gene regulation under LSD1-depleted state. We treated C2C12 with an LSD1 inhibitor S2101 in combination with a calcium ionophore that promotes the transcriptional function of NFATc1 by inducing its nuclear localization (Meissner et al. J Cell Physiol. 2007). While LSD1 inhibition promoted the expression of Pgc1a and Myh7, ionophore treatment had no additive effects (Additional Data). Because we found a physical association of Foxk1 with LSD1, we focused on the functional involvement of Foxk1 in LSD1-mediated repression of atrophy genes. We recently performed an ATAC-seq analysis in Dex-treated muscle, and found that the Foxk1 motif but not the NFATc1 motif was enriched in the LSD1-KO-specific open chromatin regions. This data further suggests the significant contribution of Foxk1 in the transcriptional regulation under LSD1 depletion.

      #3

      In figure 3D, only merged images were colored. It would be better to show colored images for Foxk1 and DAPI.

      We will replace the images with the colored ones.

      #4

      Immunofluorescence analysis in C2C12 myotubes showed that Dex exposure reduced the nuclear retention of Foxk1, which was further promoted by the addition of T-3775440, an LSD1 inhibitor (Figure 3D). The author also used Foxk1-KO C2C12 myotubes to prove LSD1 and Foxk1 cooperation to regulate the expression of type I /IIA fiber and atrophy genes in Foxk1-KO cells. Are the effects of LSD1 dependent on Foxk1 or synergistically acting with Foxk1? The treatment of LSD1 inhibitor in Foxk1-KO C2C12 may be helpful to answer this question.

      As suggested, we will examine the combination effect of LSD1 inhibition and Foxk1-KO. In addition, we will analyze chromatin association of LSD1 in Foxk1-KO cells by ChIP experiments, to test whether the function of LSD1 depends on Foxk1.

      #5

      In supplementary figure 2, body weight in the mKO+Dex group was reduced in comparison to that of WT+Dex. How about the body weight of mKO mice without Dex injection compared to that of WT? This data will be helpful to understand the effect of muscle-specific LSD1 deficiency on whole-body energy balance.

      We measured the body weight of untreated mice, and found that there is no genotype effect (Additional Data). Thus, we think that LSD1-mKO alone does not influence the whole-body energy balance. We will include this data in the revised version.

      #6

      The authors analyzed the size distribution of myofibers and mentioned that large type I and type IIA fibers preferentially increased in the LSD1-mKO muscle, whereas large type IIB + IIX fibers decreased (Supplemental Figure 4, B, E, and F). It is better to show the results of statistics. If no significance were found, it should be mentioned in the result section.

      We have performed statistical analyses on Supplemental Fig. 4E and F, and found that a fraction of large type I fibers was significantly larger in KO mice. This result will be added in the next version.

      #7

      Page 11, To reveal the genes regulated by LSD1 under the VWR condition, the authors performed additional RNA-seq analysis using Sol muscle. The non-hierarchical clustering analysis was informative and showed signaling pathways related to ‘mitochondrion’, ‘mitochondrion organization’, and ‘oxidative phosphorylation’ were altered in the Sol muscle deficient in LSD1 under the VWR condition (Figure 5B). However, it is unclear why they focus on Err-gamma to explain LSD1-KO phenotypes in Sol muscle. Is this gene also derived from RNA seq? It is better to show whether Err-gamma expression is also significantly altered based on RNA seq data.

      Indeed, ERRg was upregulated by LSD1-KO+VWR and was included in the Cluster 6 genes together with the OXPHOS and mitochondria-related genes (Additional Data and Fig. 5A). These data prompted us to focus on ERRg as a potential factor that explains the LSD1-KO phenotype. Additional Data will be included in the revised version.

      #8

      The authors claim that LSD1 serves as an "epigenetic barrier" that optimizes fiber type-specific responses and muscle mass under stress conditions. This claim is derived from the loss of function studies. To generalize the functions of LSD1, the gain of function studies will be also necessary. Adding the characteristics of LSD1 overexpression in C2C12 cells will further improve the quality of the manuscript.

      We agree that the gain of function studies will further strengthen the quality of our manuscript. As suggested, we will perform an LSD1 overexpression experiment using C2C12 cells and analyze the expression of atrophy and fast fiber related genes. Because Esrrg is completely silenced in C2C12 cells, it is difficult to monitor ERRg-mediated gene regulation in these cells. To overcome this, we will use a cardiomyocyte cell line, in which ERRg is functionally involved in differentiation (Sakamoto et al. Nat Commun 2022). We will overexpress LSD1 in these cells and examine whether the expression of ERRg and its downstream targets are altered.

      __Discussion

      9__

      The authors mentioned supplementary figure 10 only at the end of the manuscript of the discussion section (page 15) without a specific explanation of the figures in the result section. The data are important in that LSD1 expression in human muscles declined with age and showed a negative correlation with the expression of the atrophy gene. It should be presented in the result section with a more detailed description.

      We agree that these data are important and need further explanations. We will describe the details in the Results section and move the entire figure to the main figure.

      #10

      There are other studies to examine LSD1 and muscle regeneration or functions (e.g. Nat Commun 9, 366 (2018). ____https://doi.org/10.1038/s41467-017-02740-5____). More discussion to compare the current study and other studies will be necessary.

      We thank the reviewer for this comment. We will add the discussion accordingly.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors investigated the functions of LSD1 in skeletal muscle in response to different stimulations. The study is quite interesting in that it revealed the dual roles of an epigenetic regulator LSD1 in the types of muscle fibers using a muscle-specific LSD1 KO mouse model and related cell lines with big data analysis. The LSD1 deficiency sensitized the skeletal muscle to Dex-induced atrophy and VWR-induced hypertrophy. LSD1 appears to suppress the catabolic pathways and the formation of type I fibers. Also, LSD1 may exert its effects together with Foxk1 to suppress genes related to protein catabolism, while repressing ERRg and its downstream oxidative genes, depending on the stimulus and muscle location. These findings will shed light on related fields in that LSD1 may be an epigenetic molecular switch that induces fiber-type-specific responses. Nevertheless, some questions need to be further answered to make up for the shortcomings of the study. The specific comments are as follows;

      Methods

      1. The authors used the Cre-lox system with tamoxifen to generate skeletal muscle-specific LSD1 KO mice. It is clear that both the mRNA and protein levels of LSD1 in various muscles were dramatically reduced, but there is still some LSD1 expressed in skeletal muscle, especially in Sol muscle (Supplemental Figure 1C). The author needs to think about whether it is appropriate to use the term "LSD1 knockout" or "LSD1 deficiency".

      Results

      1. To identify the transcriptional regulators that mediate the regulation of atrophy-associated genes by LSD1, the authors performed motif analyses on the promotor regions of upregulated genes in LSD1-mKO Gas. Based on the results and other reports, they focused on Foxk1 and proved LSD1 and Foxk1 cooperatively regulate the atrophy transcriptome in the presence of Dex. However, Figure 3C showed that a transcription factor Nfatc1 is also reduced in Sol muscle similar to Foxk1. Also, other studies demonstrated that the transcription factor NFATc1 controls fiber type composition and is required for fast-to-slow fiber type switching in response to exercise in vivo. More specifically, NFATc1 inhibits MyoD-dependent fast fiber gene promoters by physically interacting with the N-terminal activation domain of MyoD and blocking recruitment of the essential transcriptional coactivator p300 (Cell Rep. 2014 Sep 25; 8(6): 1639-1648). Furthermore, it has been reported that LSD1 Controls Timely MyoD Expression via MyoD Core Enhancer Transcription (Cell Rep. 2017 Feb 21;18(8):1996-2006. doi: 10.1016/j.celrep.2017.01.078). It is unclear how the authors exclude Nfatc1 for the LSD1-mediated effects in different muscle fibers. Further experiments may be necessary to exclude Nfatc1.
      2. In figure 3D, only merged images were colored. It would be better to show colored images for Foxk1 and DAPI.
      3. Immunofluorescence analysis in C2C12 myotubes showed that Dex exposure reduced the nuclear retention of Foxk1, which was further promoted by the addition of T-3775440, an LSD1 inhibitor (Figure 3D). The author also used Foxk1-KO C2C12 myotubes to prove LSD1 and Foxk1 cooperation to regulate the expression of type I /IIA fiber and atrophy genes in Foxk1-KO cells. Are the effects of LSD1 dependent on Foxk1 or synergistically acting with Foxk1? The treatment of LSD1 inhibitor in Foxk1-KO C2C12 may be helpful to answer this question.
      4. In supplementary figure 2, body weight in the mKO+Dex group was reduced in comparison to that of WT+Dex. How about the body weight of mKO mice without Dex injection compared to that of WT? This data will be helpful to understand the effect of muscle-specific LSD1 deficiency on whole-body energy balance.
      5. The authors analyzed the size distribution of myofibers and mentioned that large type I and type IIA fibers preferentially increased in the LSD1-mKO muscle, whereas large type IIB + IIX fibers decreased (Supplemental Figure 4, B, E, and F). It is better to show the results of statistics. If no significance were found, it should be mentioned in the result section.
      6. Page 11, To reveal the genes regulated by LSD1 under the VWR condition, the authors performed additional RNA-seq analysis using Sol muscle. The non-hierarchical clustering analysis was informative and showed signaling pathways related to 'mitochondrion', 'mitochondrion organization', and 'oxidative phosphorylation' were altered in the Sol muscle deficient in LSD1 under the VWR condition (Figure 5B). However, it is unclear why they focus on Err-gamma to explain LSD1-KO phenotypes in Sol muscle. Is this gene also derived from RNA seq? It is better to show whether Err-gamma expression is also significantly altered based on RNA seq data.
      7. The authors claim that LSD1 serves as an "epigenetic barrier" that optimizes fiber type-specific responses and muscle mass under stress conditions. This claim is derived from the loss of function studies. To generalize the functions of LSD1, the gain of function studies will be also necessary. Adding the characteristics of LSD1 overexpression in C2C12 cells will further improve the quality of the manuscript.

      Discussion

      1. The authors mentioned supplementary figure 10 only at the end of the manuscript of the discussion section (page 15) without a specific explanation of the figures in the result section. The data are important in that LSD1 expression in human muscles declined with age and showed a negative correlation with the expression of the atrophy gene. It should be presented in the result section with a more detailed description.
      2. There are other studies to examine LSD1 and muscle regeneration or functions (e.g. Nat Commun 9, 366 (2018). https://doi.org/10.1038/s41467-017-02740-5). More discussion to compare the current study and other studies will be necessary.

      Significance

      The study is quite interesting in that it revealed the dual roles of an epigenetic regulator LSD1 in the types of muscle fibers using a muscle-specific LSD1 KO mouse model and related cell lines with big data analysis.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, Araki et. al show that LSD1 is required for muscle fiber maintenance upon chemical and environmental stress. By using a LSD1 conditional KO mouse model, along with C2C12 cell culture, the authors demonstrate that loss of LSD1 exacerbates glucocorticoid-induced muscle atrophy in fast fiber-dominant muscle. On the other hand, loss of LSD1 together with endurance exercise induces a hypertrophy in slow fiber-dominant muscle. Thus, these data suggest that LSD1 acts as a modulator of the muscle response to chemical and environmental stress, might be a target for therapeutic strategies against stress-induced myopathies. However, some concerns should be addressed in a revised version.

      Major comments:

      This study primarily uses the genetic mouse model in which LSD1 gene is inactivated after tamoxifen injection in 8 weeks old mice, as shown in supplemental figure 1 B and C. 8 weeks after birth post natal growth of muscle is not complete and the contribution of satellite cells to muscle growth is still significant. Therefore the timing of tamoxifen injection used cannot discriminate if the observed phenotype involves the function of LSD1 during the post-natal growth of the muscle or in the muscle fibers or both. One way to demonstrate the real contribution of LSD1 in the maintenance of muscle fibers plasticity under environmental stress would be to inject Tamoxifen later (around 10-12 weeks of age), in order to remove a possible bias caused by the contribution of satellite cells during the post-natal growth. At least key findings should be confirmed at this later stage.

      • LSD1 m-KO muscles seem to have more type I and IIA fibers than WT, even without DEX treatment. Is it possible to quantify the results in supplemental figure 4C?
      • The effect on fiber type is convincing, while variations in gene expression are of quite low amplitude. However, the atrophy should be induced by other means to ensure that the effects are specific to GC/nuclear receptors pathways; Denervation? Starvation? Not all the experiments need to be repeated, just key results such as, for example, exacerbation of atrophy in LSD1 m-KO, Foxk1 increase.
      • Autophagy data: the effect on the LC3I/LC3II ratio are modest. The autophagy part should be removed or completed with additional data to convincingly show that autophagy is affected. Links between LSD1 and mTOR have been published, so the mTOR pathway could be investigated in the model (S6k, S6 and 4EBP1 phosphorylation). Given AKT levels and phosphorylation are affected by the absence of LSD1 + DEX, it can be predicted that mTOR activity will change.
      • The ability of LSD1 to retain FOXK1 in the nucleus is an important information that should be better supported experimentally. In the absence of such information, no mechanism can be proposed for the effect of LSD1 of FOXK1. The immunofluorescence images provided are not convincing and moreover they could be interpreted by a reduction in the level of FOXK1 protein (degradation?) rather than by a nuclear exclusion in the presence of DEX. This point should be addressed, authors could include western blot of nuclear and cytoplasmic fractions to better quantify the nuclear level of FOXK1 in absence of LSD1.
      • The absence of centralized nuclei indicates that there is no fiber regeneration but it does not exclude the possibility that satellite cells were recruited to existing fibers and thus participated to hypertrophy. To eliminate this possibility, the average nuclei/cytoplasm volume should decrease if hypertrophy results from increased protein synthesis and not myonuclei accretion. This should be checked
      • The upregulation of ERR in the absence of LSD1 is convincing in the VWR conditions. ERR level should be evaluated in the sedentary LSD1 KO mice.

      Minor comments:

      • There is a clear difference in the number of mouse replicates between treated (Dex or VWR) and non-treated mice, regardless the genotype. Experiments with non-treated mice lack adequate numbers to make a definitive conclusion. For example, there is a huge spread in the data in Figure 1 B and 4 D. If the number of animals would have been increased, would the absence of difference hold up?
      • The authors claim that: "Consistent with the results of the augmented endurance capacity, the Sol muscle in the KO mice showed enhanced succinate dehydrogenase (SDH) staining, indicating that the number of oxidative fibers increased (Figure 4F and Supplemental Figure 8F)". However, supplemental figure 8 D indicates that the number of type I fibers does not change compared to WT. Authors should clarify this statement.

      Significance

      Conceptually, Araki and colleagues have described the potential contribution of LSD1 in the maintenance of muscle fiber typing upon environmental stress. However, the technical approach used only partially allows to demonstrate the concept.

      Field of expertise: Muscle atrophy and regeneration, Muscle stem cells, Epigenetic regulation in muscle, LSD1.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Thank you for giving us the opportunity to submit a revised draft of the manuscript “Tup1 is Required for Transcriptional Repression Necessary in Quiescence in S. cerevisiae” to Review Commons. We appreciate the time and effort that you and the other reviewers dedicated to providing feedback on our manuscript and are grateful for the insightful comments on and valuable improvements to our paper. We believe that the experiments suggested in these comments would bring clarity to the manuscript, and wish that we had the ability to perform them all. Unfortunately our lab is closing, and the only remaining lab member is the PI, so we are only able to perform limited experiments to address some of the concerns raised during review. We have incorporated several changes in response to comments from the reviewers. Those changes are highlighted within the manuscript. Please see below, in blue, for a point-by-point response to the reviewers’ comments and concerns. All page numbers refer to the revised manuscript Word file with tracked changes.

      Of particular note is the discussion of cellular morphology of the tup1∆ and sds3∆ strains. We realize that our findings are purely descriptive, and are not surprised that all three reviewers had comments on this data. This was the source of much discussion among the authors and consultation with other labs; we debated even including these observations in the manuscript, since we were unable to figure out the underlying mechanism. Ultimately we decided that it was worth reporting in case other labs may benefit from the knowledge, and we have altered the language in the manuscript (page 6) to better reflect this. However, if the reviewers feel that this observation would be better left out of the manuscript, we would be willing to remove Figure 6 and any discussion of these images.

      Reviewer #1 Comments:1. The authors chose to examine 3-day SP cells to interrogate quiescence because tup1∆ cells are highly flocculant, interfering with the isolation of purified quiescent cells. These cells are a mixture of both nonquiescent and quiescent cells, so it is not correct to state that they represent a quiescent cell population. The addition of EDTA to the gradients used to isolate quiescent cells could eliminate flocculation and permit the isolation of quiescent cells. EDTA is also often added to media in low amounts to reduce flocculation. The authors need to indicate the proportion of quiescent cells in their SP cultures by applying these tools.

      We appreciate the suggestion, but the phenotype of this strain is not typical flocculation (see photo below, also added to the paper as Supplementary Figure 1). We did add EDTA (pH 8.0) to a final concentration of 10 mM to two separate tup1∆ and it did not visibly affect the clumping of cells. Furthermore, changes to the cell wall are a distinct feature of quiescent S. cerevisiae and contribute to the ability to separate different cell types by density-gradient centrifugation, so it is difficult to anticipate how EDTA would affect our ability to isolate Q cells. We have provided more explanation in the manuscript to better explain this (page 3).

      1. The authors reported that while Xpb1 and Tup1 share many overlapping binding sites, but that Xbp1 does not regulate Tup1's binding. What other factors might be responsible for their shared binding? Could histone deacetylation play a role? This could be addressed by a Tup1 ChIP in an sds3∆ mutant.

      This is a good thought; histone acetylation levels may have a role in regulating Tup1 localization and we would have liked to address this if we had more time. Unfortunately, we had some difficulty performing ChIP of Tup1, because initially we used a FLAG tag which caused a phenotype similar to deletion of Tup1, and had to switch to making myc-tagged strains. This delay meant we did not have time to pursue creating myc-tagged Tup1 in an sds3∆ strain, and now we do not have the ability to follow up on this for revisions.

      1. Has PolII occupancy been examined in Log vs SP cells of tup1∆ to determine if Tup1 inhibits PolII association with its genes that are repressed ?

      We did not look at PolII occupancy in our Tup1 deletion, and could not find any existing datasets with this information. It is our hope that another lab is able to carry out this experiment, because it could be very enlightening, but it is beyond the scope of this work.

      1. The observation that tup1∆ cells have several nuclear puncta is intriguing, although the cytological images need to be improved.

      The nuclear puncta we see in the tup1 deletion are definitely a puzzle. We had limited time to investigate this phenomena, and in discussing the matter with some other labs it seemed doubtful that more advanced imaging would yield anything of use to us. We realized that we accidentally omitted important details for this figure and have updated the manuscript to add them. We imaged 2 biological replicates for each strain and imaged many yeast samples for each strain (which has been added to the caption for Figure 6) and found that our findings were statistically significant (p

      Reviewer #2 Comments1. The authors acknowledge that it would be better to work with purified quiescent cells but couldn't isolate pure populations. As a result, a mixture of quiescent and nonquiescent cells are analyzed in stationary phase. They say this is because Tup1 deletion strains are flocculent. But they performed ChIP-Seq on Myc-tagged Tup1 strain. Don't these cells express Tup1? If not, could this be performed in wild-type yeast with Myc-tagged Tup1? It seems important to separate quiescent from nonquiescent yeast for the authors' conclusions.

      It is true that we could have done ChIP-seq for Tup1 in purified Q cells. We considered it, but decided to look at the mixed population so that we could directly compare our RNA-seq results from the tup1∆ strain. It’s a balance between having some results that are specific to quiescence, versus being able to directly compare the effects of deletion of Tup1 at the sites where it binds. We are now unable to perform this experiment, but we have updated the language in the manuscript (page 3) to better reflect this choice.

      1. The Chipseq data in Fig 1B do not have a y axis and it is consequently not clear whether these data are normalized and shown with the same axis.

      Thank you for pointing this out - these data are normalized to RPKM during processing, and we have updated the caption for figure 1 and the methods on page 10 to reflect this information. Normalizing the data in IGB itself, however, causes an adjustment in the y-axis that makes the tracks appear to be inconsistent. In any case, we are not making claims about the relative amount of signal, and as it is common in the field to not include y-axes on IGB tracks, we have opted to keep the y-axis for Figure 1B as-is.

      1. In Fig 2, it seems important to determine how many genes are different between WT and Tup1 deletion strains in log phase. Are just as many genes different? Or is Tup1 more important in diauxic shift and stationary phase than log phase?

      We did intend to focus only on diauxic shift and stationary phase data for this paper, since there has already been so much work on the role of Tup1 in log phase. As mentioned above, comparisons of RNA between log and DS/Q is difficult. We attempted to find a publicly available dataset to perform some analysis for revisions, but unfortunately most previous work on the effect of Tup1 on transcription was performed via tiling arrays, which is not comparable.

      1. Are the genes that are regulated by Tup1 normally regulated during diauxic shift or stationary phase compared with log growth?

      Because there is a massive global decrease in the level of total RNA in diauxic shift and quiescence (McKnight, Boerma, et al., 2015) it is impossible to directly compare transcript levels between these states in our experiments. If there was time, we could have attempted to repeat these experiments with an external spike-in control; this is potentially something another lab could do to follow up on our findings.

      1. What fraction of the genes that are differentially expressed in Tup1 knockout yeast have Tup1 binding at the promoter? Enhancer? What fraction can be explained by Tup1, Hap1, Nrg1, Mig1 individually and together?

      We have added the number of genes that are differentially expressed in Tup1 knockout yeast during DS to the manuscript (page 3). Regarding enhancers, the genome of S. cerevisiae is very compact, and there is not evidence of long-distance activation of genes as seen in metazoans (Dujon, 1996; Dobi & Winston, 2007; Spiegel and Arnone, 2021). Upstream activating sequences (UASs) are generally considered the closest equivalent to enhancers in cerevisiae, and they tend to function within a few hundred base pairs of the promoter. Our analysis only identifies the nearest gene; it would be difficult to parse out locations in the promoter versus a UAS without a more advanced analysis that is beyond our capabilities now.

      As for the effect of Hap1, Nrg1, and Mig1, we were able to look for their motifs in the genes that are differentially expressed in the Tup1 knockout but we do not have binding data for these factors in quiescence or stationary phase so it is impossible to conclusively state what role those TFs play. This would be a very interesting followup to our work, but is outside the scope of this manuscript.

      References:

      Dujon, B. 1996. The Yeast Genome Project: What did we learn? Trends Genet. 12, 263-270.

      Dobi, K.C.; Winston, F. 2007. Analysis of Transcriptional Activation at a Distance in Saccharomyces cerevisiae. Mol Cell Biol. 27(15), 5575-5586. https://doi.org/10.1128/MCB.00459-007

      Spiegel, JA; Arnone, J.T. 2021.Transcription at a Distance in the Budding Yeast Saccharomyces cerevisiae. Appl. Microbiol. 1(1), 142-149. https://doi.org/10.3390/applmicrobiol1010011

      1. The methodology used to generate the gene ontology enrichments should be described in the methods.

      Thank you for noticing this omission; we have added the relevant information to the manuscript (page 10) and have also added the related citation (page 11).

      1. The authors should provide genomewide data to support the statement that Tup1 and Rpd3 ChIP datasets have substantial overlap. They should also provide genomewide data to support the statement that there is substantial overlap between Rpd3 and Tup1. How much overlap is observed and how much is expected by chance?

      We have compared the existing ChIP data for Rpd3 binding in quiescent cells to our ChIP data for Tup1 in 3-day cultures and included this in the manuscript (page 4, Supplementary Figure 2B), along with a p-value.

      1. For Sds3, similar to Tup1 inactivation, it would be helpful to know how many genes change in with Sds3 inactivation in log phase in addition to diauxic shift and stationary phase.

      As with our response to comment #3, we focused only on diauxic shift and stationary phase data for this paper, and analysis of this data would be difficult without a spike-in control. While there are some existing datasets for RNA-seq of Rpd3 knockouts, this would include both Rpd3L and Rpd3S activity, rather than just Rpd3L, which is our focus with the Sds3 deletion strains. As such, we did not perform RNA-seq of sds3∆ in log phase.

      1. If the argument is that Sds3 and Xbp1 cooperate with Tup1 to affect gene expression, testing the gene expression changes that are associated with Tup1 in Sds3 or Xbp1 knockout strains would help the authors make this point.

      We do not have tup1∆/sds3∆ or tup1∆/xbp1∆ double knockout strains. We attempted to make these strains but could not, which may indicate that these double deletions are synthetic lethal. Deletion of sds3 alone causes a significant reduction in growth rate, so it is perhaps not surprising that we could not create the double knockouts.

      1. The final phenotype of extra DAPI positive blobs in the nucleus is not very specific or clear.

      We agree, please see our comments at the top of this letter.

      Reviewer #3 (Major comments):Did tup1∆/sds3∆ double mutant show the same phenotype with tup1∆ (or sds3∆) single mutant in G0? If Tup1 actually plays role in tandem with Sds3 in the gene regulation during G0, the epistatic relationship might be estimated.

      We do not have tup1∆/sds3∆ or tup1∆/xbp1∆ double knockout strains. We attempted to make these strains but could not, which may indicate that these double deletions are synthetic lethal. Deletion of sds3 alone causes a significant reduction in growth rate, so it is perhaps not surprising that we could not create the double knockouts.

      The histone acetylation was not synergistically augmented in the above double mutant?

      Please see the response above.

      The authors showed that tup1∆ but not sds3∆ cells contain multiple DAPI signals but sds3∆ cells show abnormal cell shape in G0 phase. These phenotypic abnormalities in these mutants suggest a potential mitotic defect. Both mutants showed very similar abnormalities in H3K23 acetylation and gene expressions in quiescent state. Why these showed distinctly different abnormality in cell morphology during G0?

      Unfortunately we were unable to investigate this further.

      Did iswi2∆ cells also show abnormality in G0 phase?

      No, they did not; thank you for asking, this is a good question. We have added this information to the manuscript (page 6).

      (Minor comments)Supplementary figure1. This data seems to be very important. I recommend to use this data in the main figure with statistical analysis (p-values) to show the significant overlap of Tup1 and Rdp3 distribution.

      We have compared the existing ChIP data for Rpd3 binding in quiescent cells to our ChIP data for Tup1 in 3-day cultures and included this in the manuscript (page 4, Supplementary Figure 2B) . We do feel that this data belong in the supplement, however, because the data is not exactly equivalent to our studies: quiescent cells and 3-day cultures are not the same, and knockout of Rpd3 eliminates function of both Rpd3L and Rpd3S complexes, while knocking out Sds3 targets only the Rpd3L complex.

      Figure 4. Histone acetylation level data in Figure 4A and the data for gene repressions by Tup1 and Sds3 in Figure 4C seem to be very important. However, statistical analysis data (p-values) was not presented. Please show the statistical analysis data (p-values) as in figure 3 to show that the Tup1 and Sds3 contribute similarly in histone deacetylation and repression. The author did not find the significant changes of histone deacetylation in xbp1∆ cells but said that when filtered in Xbp1 binding motif Xbp1 depletion has similar effect on the acetylation level. Please show this data.

      We have added language in the manuscript comparing genes with altered acetylation levels to those that are differentially expressed in our RNA-seq datasets, along with a p-value, to page 5.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, the authors found that Tup1 corepressor coordinates with Rpd3L HDAC complex in the deacetylation of histone H3K23 during quiescence entry. They also found that Tup1 coordinates with ISWI2 to regulate +1 nucleosome at HXT family genes for the gene repression during G0. Finally, they showed that loss of Tup1 results in abnormal shape of nuclei during G0. These data suggest a critical role of Tup1 corepressor in the proper gene regulations in tandem with Rpd3 HDAC and ISWI2 remodeler upon quiescence entry. This study seems to be a critical progress from their previous study on the roles of Rpd3 complex in the quiescence entry (McKnight et al. 2015 Mol. Cell). My comments were listed bellow.

      Major comments

      Did tup1∆/sds3∆ double mutant show the same phenotype with tup1∆ (or sds3∆) single mutant in G0? If Tup1 actually plays role in tandem with Sds3 in the gene regulation during G0, the epistatic relationship might be estimated.

      The histone acetylation was not synergistically augmented in the above double mutant?

      The authors showed that tup1∆ but not sds3∆ cells contain multiple DAPI signals but sds3∆ cells show abnormal cell shape in G0 phase. These phenotypic abnormalities in these mutants suggest a potential mitotic defect. Both mutants showed very similar abnormalities in H3K23 acetylation and gene expressions in quiescent state. Why these showed distinctly different abnormality in cell morphology during G0?

      Did iswi2∆ cells also show abnormality in G0 phase?

      Minor comments

      Supplementary figure1. This data seems to be very important. I recommend to use this data in the main figure with statistical analysis (p-values) to show the significant overlap of Tup1 and Rdp3 distribution.

      Figure 4. Histone acetylation level data in Figure 4A and the data for gene repressions by Tup1 and Sds3 in Figure 4C seem to be very important. However, statistical analysis data (p-values) was not presented. Please show the statistical analysis data (p-values) as in figure 3 to show that the Tup1 and Sds3 contribute similarly in histone deacetylation and repression. The author did not find the significant changes of histone deacetylation in xbp1∆ cells but said that when filtered in Xbp1 binding motif Xbp1 depletion has similar effect on the acetylation level. Please show this data.

      Significance

      The current work by the authors significantly progressed their work (McKnight et al. 2015 Mol. Cell) showing important role of Rpd3 deacetylase complex in the quiescence entry. This study will significantly contribute to understanding the gene regulation mechanisms in the G0 entry.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper, the authors investigate the role of the Tup1 transcriptional co-repressor in transcriptional repression after glucose depletion in quiescent S. cerevisiae. Tup1 N terminal helix facilitates oligomerization and interacts with histone tails and a beta propeller domain interacts with histone deacetylases and DNA binding factors. Tup1 interacts with nucleosomes by binding histone H3 and H4 tails, binds to hypoacetylated tails and stabilizes +1 nucleosomes repositioned by Isw2. Tup1 had been identified as a gene essential for viability of cells in G0 and implicated in glucose repression in yeast. The authors found that Tup1-Ssn6 binds new targets when S cerevisiae are glucose deprived and as they enter G0. Tup1 was found to repress the expression of genes involved in glucose metabolism and glucose transport. Tup1 was discovered to coordinate with the Rpd3L complex to deacetylate H3K23. Tup1 was also found to coordinate with Isw2 to affect nucleosome positions at hexose transporter family genes. Some cells with Tup1 deletion had multiple DAPI puncta in stationary phase, suggesting a possible role for Tup1 in mitosis.

      Overall Comments

      In Fig 1, the shift in Tup1 binding with diauxic shift and stationary phase is clear. The data clearly show an effect of Tup1 on histone acetylation and nucleosome positioning. However, whether Tup1 has an important functional role in quiescence is not clear.

      Comments

      1. The authors acknowledge that it would be better to work with purified quiescent cells but couldn't isolate pure populations. As a result, a mixture of quiescent and nonquiescent cells are analyzed in stationary phase. They say this is because Tup1 deletion strains are flocculent. But they performed ChIP-Seq on Myc-tagged Tup1 strain. Don't these cells express Tup1? If not, could this be performed in wild-type yeast with Myc-tagged Tup1? It seems important to separate quiescent from nonquiescent yeast for the authors' conclusions.

      2. The Chipseq data in Fig 1B do not have a y axis and it is consequently not clear whether these data are normalized and shown with the same axis.

      3. In Fig 2, it seems important to determine how many genes are different between WT and Tup1 deletion strains in log phase. Are just as many genes different? Or is Tup1 more important in diauxic shift and stationary phase than log phase?

      4. Are the genes that are regulated by Tup1 normally regulated during diauxic shift or stationary phase compared with log growth?

      5. What fraction of the genes that are differentially expressed in Tup1 knockout yeast have Tup1 binding at the promoter? Enhancer? What fraction can be explained by Tup1, Hap1, Nrg1, Mig1 individually and together?

      6. The methodology used to generate the gene ontology enrichments should be described in the methods.

      7. The authors should provide genomewide data to support the statement that Tup1 and Rpd3 ChIP datasets have substantial overlap. They should also provide genomewide data to support the statement that there is substantial overlap between Rpd3 and Tup1. How much overlap is observed and how much is expected by chance?

      8. For Sds3, similar to Tup1 inactivation, it would be helpful to know how many genes change in with Sds3 inactivation in log phase in addition to diauxic shift and stationary phase.

      9. If the argument is that Sds3 and Xbp1 cooperate with Tup1 to affect gene expression, testing the gene expression changes that are associated with Tup1 in Sds3 or Xbp1 knockout strains would help the authors make this point.

      10. The final phenotype of extra DAPI positive blobs in the nucleus is not very specific or clear.

      Significance

      Discovering transcription factors that are critical for the entry into and maintenance of quiescence is an important area of discovery as relatively little is known about gene regulation during quiescence and understanding this process is fundamentally important for our understanding of cell biology. Previous studies had implicated Tup1 in stationary phase in yeast. These studies provide additional insight into the mechanism.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, the authors have investigated the role of the Tup1 corepressor in transcriptional repression that occurs during the establishment of quiescence in S. cerevisiae. Using ChIP-seq, they found that Tup1 is present at many sites during log phase growth, and that it localizes to new targets upon glucose exhaustion at a point called diauxie; post-diuaxie, Tup 1 remains associated with these sites in stationary phase (SP), which they define as growth of yeast cells for 3 days after inoculation in glucose-rich medium. To understand the significance of Tup1 relocalization during quiescence establishment, they performed differential expression (DE) analysis following RNA-seq in WT and tup1∆ cells at the diauxic shift and 3-day SP. The large number of DE genes in both cell states were consistent with a role for Tup1 in the regulation of key targets associated with the initiation of quiescence, although there was not a strict correlation between Tup1 occupancy at these DE genes. Next, they investigated the relationship of Tup1 to Xbp1 and Rpd3, factors that play a role in transcriptional repression during quiescence. They noted significant overlap between the binding sites for the three factors in either isolated quiescent cells or 3-day SP cells using published and newly acquired ChIP-seq data. However, the absence of Xbp1, a transcription factor that recruits the Rpd3 histone deacetylase in quiescent cells, did not alter Tup1 binding to its targets in these cells. RNA-seq analysis of xbp1∆ and tup1∆ SP cells found a significant overlap between the genes that were repressed in the absence of the two transcription factors. Moreover, Tup1, Xbp1, and Rpd3 were noted to be required for H3K23 deacetylation at repressed genes in SP. Finally, the authors asked if the Tup1 affects the position of nucleosomes at TSSs in SP cells by performing MNase-seq. They found that Tup1 and the chromatin remodeler Isw2, which interacts with Tup1, are required to position nucleosomes at the promoters of a family of glucose transporter genes, targets of Tup1 during the initiation of quiescence.

      Comments:

      1. The authors chose to examine 3-day SP cells to interrogate quiescence because tup1∆ cells are highly flocculant, interfering with the isolation of purified quiescent cells. These cells are a mixture of both nonquiescent and quiescent cells, so it is not correct to state that they represent a quiescent cell population. The addition of EDTA to the gradients used to isolate quiescent cells could eliminate flocculation and permit the isolation of quiescent cells. EDTA is also often added to media in low amounts to reduce flocculation. The authors need to indicate the proportion of quiescent cells in their SP cultures by applying these tools.

      2. The authors reported that while Xpb1 and Tup1 share many overlapping binding sites, but that Xbp1 does not regulate Tup1's binding. What other factors might be responsible for their shared binding? Could histone deacetylation play a role? This could be addressed by a Tup1 ChIP in an sds3∆ mutant.

      3. Has PolII occupancy been examined in Log vs SP cells of tup1∆ to determine if Tup1 inhibits PolII association with its genes that are repressed ?

      4. The observation that tup1∆ cells have several nuclear puncta is intriguing, although the cytological images need to be improved.

      Significance

      This manuscript presents some new information that links Tup1, a multifacteted transcriptional co-repressor, to the repression of transcription that occurs during the establishment of yeast quiescence. The McKnight lab has previously shown that the Rpd3 HDAC plays an important role in this process, in part through its recruitment by the TF Xbp1. Tup1 also binds to Rpd3, and the overlap between the sites where Tup1 and Xpb1 bind and the targets that they repress, suggests a shared function in establishing transcriptional reprogramming. Whether this shared function is based only on the recruitment of the Rpd3 histone deacetylation is unclear, and, more importantly, there is also some question if deacetylation is in fact the main factor driving transcriptional repression during the initiation of quiescence. The finding that Tup1 is responsible for repositioning nucleosomes at a class of glucose transporters to repress the transcription of these genes suggests a specific function for Tup1 during the establishment of quiescence; however, this is apparently not a broad function. Thus, we are left with a lot of nice descriptive information as a result of well-done experiments that has not revealed much about the mechanism by which Tup1 regulates the global transcriptional reprogramming that occurs during quiescence.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01536

      Corresponding author(s): Michael Glotzer

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements We thank the reviewers for their thoughtful and helpful comments. In general, the reviews were highly positive, although their reviews indicated parts of the manuscript that needed further clarification. We have made extensive changes that improve the clarity and rigor of this submission. We have performed several additional experiments which have extended our analysis in several ways detailed below. None of the conclusions have changed.

      The following is a list of eight major changes implemented during the revisions. Point-by-point responses to the reviewers comments follow on subsequent pages.

      1. The reviews made clear that we needed to more explicitly discuss the AIR-1 depletion phenotype. This phenotype is complex, it does not result in a complete loss of asymmetry, unlike, for example, depletion of the centrosome component SPD-5. This is because, in AIR-1 depleted embryos, a PAR-2 and cortical flow-dependent pathway induces PAR-2 accumulation at both anterior and posterior poles that induces flows from each pole to the lateral region (Reich 2019, Kapoor 2019, Zhao 2019, Klinkert 2019; PMIDs 31155349, 31636075, 30861375, 30801250). These flows also modulate ECT-2 localization. To clarify this point which came up in multiple reviews, we now include an explanation of the complexity of the AIR-1 phenotype and we present an analysis of ECT-2 localization in embryos depleted of both AIR-1 and PAR-2.

      In addition to the 95% confidence intervals that were present on our graphs, we now include indications of the results of statistical tests of significance to the results of different treatments.

      We have revised the analysis ECT-2 accumulation in two ways. First, in the previous draft, we assessed the anterior accumulation over the anterior 40% and the posterior 15% of the embryo. We have revised this analysis comparing the anterior and posterior 20% of the cortex, respectively. This is simpler and more logical in contexts where embryos are symmetric. In addition, we altered the measurements of the length of the posterior boundary. Previously we used a common threshold value, below which we counted pixels to assess boundary length. During the revisions, we noticed that this value was not appropriate for our mutant transgenes which accumulated to higher levels. Therefore, we revised our analysis pipeline such that, for each embryo, we measure the average intensity of the cortex in the anterior 60% of the embryo. We set a threshold of 0.85* this average anterior intensity value. As before, cortical positions below this threshold contribute to the boundary length. This is a more robust and simpler means of evaluating the size of the posterior domain. Neither of these changes affect any of our conclusions, but they are simpler and more rigorous.

      Most of our figures include quantification of the degree of ECT-2 asymmetry as well as the average anterior and posterior accumulation of ECT-2 as a function of time. While the images show the intensity profiles across the embryo, previously, we did not explicitly show a quantification of the average intensity of ECT-2 as a function of position along the embryo. A new graph, Figure 2Bv, shows this for control embryos and embryos in which tubulin is depleted and depolymerized. This shows that the MT depolymerization results in lower accumulation at the posterior of the embryo and higher accumulation at the anterior.

      We provide documentary and quantitative evidence that ZYG-9 depletion induces potent cortical flows (Figure 3c and Figure 3, supplement 3), further bolstering the central role of cortical flows in inducing ECT-2 asymmetry.

      As requested by reviewer 2 (R2b), we have included the analysis of ECT-2 distribution in Gα depleted embryos. As expected due to the lack of spindle elongation, the displacement of ECT-2 from the posterior cortex is greatly attenuated.

      As requested by reviewer 2 (R2d), we now show that ECT-2C fragments accumulate on the cortex in embryos depleted of ECT-2.

      One other important point raised by several reviewers concerns the behavior of the ECT-2 T634E allele. This allele, due to the substitution of a phosphomimetic residue, accumulates on the cortex at about 50% the level of the wild-type version. To investigate the possibility that this quantitative difference was the cause of the phenotype, we depleted both the wild-type and mutant ECT-2 constructs by RNAi (these are the sole sources of ECT-2 in the animals). First, we find that wild-type ECT-2 can be depleted to 20% of wild type levels with only a 13% rate of cytokinesis failure (when T634E is depleted to 20%, embryos fail more than 50% of the time). Thus the two-fold reduction in cortical ECT-2 seen in T634E not likely highly significant (ECT-2 is not haploinsufficient). In addition, embryos with ECT-2 T634E initiate ingression in a timely manner, but the furrows ingress more slowly than wild-type. In contrast, depletion of ECT-2 to 20% results in a delay in furrow initiation, but once these furrows form, they ingress at rates similar rates to wild-type. Thus, the T634E variant exhibits a behavior that is quite distinct from that resulting from a (strong) reduction in the levels of wild-type ECT-2.

      Point-by-point description of the revisions

      This section is mandatory. Please insert a point-by-point reply describing the revisions that were already carried out and included* in the transferred manuscript. *

      (Reviewer comments: italicized 9 pt font, author response: plain text 10 pt font. Numbers have been added to the reviewer comments e.g. R2c=Reviewer 2, third comment)

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary

      R1a* In this study the authors addressed how Ect2 localization is controlled during polarization and cytokinesis in the one-cell C. elegans embryo. Ect2 is a central regulator of cortical contractility and its spatial and temporal regulation is of uttermost importance. After fertilization, the centrosome induces removal of Ect2 from the posterior plasma membrane. During cytokinesis Ect2 activity is expected to be high at the cell equator and low at the cell poles. Similarly to polarization, the centrosome provides an inhibitory signal during cytokinesis that clears contractile ring components from the cell poles. Whether and how the centrosomes regulate Ect2 localization is not know and investigated in the study. *

      This is an accurate summary of the goals of this study.

      R1b *The authors start by filming endogenously-tagged Ect2 and find that Ect2 localizes asymmetrically, with high anterior and low posterior membrane levels during polarization and cytokinesis. They reveal that the centrosome together with myosin-dependent flows results in asymmetric Ect2 localization. Previous studies had suggested that Air1, clears Ect2 from the posterior during polarization and the authors expand those finding by showing that Air1 function is also required to displace Ect2 from the posterior membrane during cytokinesis. *

      *To elucidate if Ect2 displacement is induced by phosphorylation of Ect2 by Air1, the authors investigate the localization of a C-terminal Ect2 fragment containing the membrane binding PH domain. When the predicted Air1 phosphorylation sites are mutated to alanine, the Ect2 fragment still localizes asymmetrically but exhibits increased membrane accumulation. *

      *Finally, they investigate the functional role of Air-1 during furrow ingression. They demonstrate that embryos deficient of Air1 and NOP1 have impaired furrow ingression. Lastly, the authors sought to confirm that there is a direct effect of Air1 on Ect2 function by generating a phosphomimetic point mutation of Ect2 using Crispr. They find that the membrane localization of phosphomimetic Ect2 is reduced and consequently furrow ingression is impaired. *

      This is an accurate summary of our results.

      Major comments

      R1c *It is not convincing that the six putative phosphorylation sites are targeted by the Air1. If Air1 phosphorylation displaces Ect2 from the membrane, a reduction in Ant/Post Ect2 ratio is expected in the phosphodeficient mutants, like after air1 RNAi. However this is not observed for cytokinesis or polarization (Fig. 5D(i); E). This suggests that phosphorylation of those sites is not essential for the asymmetric Ect2 localization. *

      In otherwise wild-type embryos, phosphorylation of these sites is not required for asymmetric ECT-2 localization. Non-phosphorylatable ECT-2 variants exhibit asymmetric localization because these proteins relocalize due to myosin-directed flows. To test the role of phosphorylation, we examine the distribution of ECT-2 and ECT-2C fragments in myosin-depleted embryos in which the flows are blocked, under these conditions, transient local depletion is observed with the phosphorylatable variants, Fig 5E.

      While AIR-1 promotes normal polarity establishment, as shown in several recent papers, cortical changes nevertheless occur in the absence of AIR-1. Specifically, a parallel PAR-2 dependent pathway induces weaker flows from both poles toward the equator. To further substantiate the effect of PAR-2 accumulation on ECT-2 accumulation in AIR-1 depleted embryos, we assayed ECT-2 accumulation in air-1(RNAi); par-2(RNAi) embryos (Figure 4, supplement 2). These results show that ECT-2 is nearly symmetric in these double depleted embryos. In addition we have edited the text to describe the unusual bi-polar PAR-2 accumulation that occurs in AIR-1 depleted embryos.

      R1d *The authors aim to demonstrate that phosphorylation of the identified sites is important for cytokinesis. For this they investigate contractile ring ingression in the phosphomimetic point mutation. Since ring ingression is slower and fails in nop1 mutant they authors conclude that this demonstrates a functional importance of this site. I am not surprised that embryos ingress slower in this mutant since Ect2 localization to the membrane is reduced. This however does not show that this phosphorylation site is the target of the centrosome signal. Importantly, authors would need to demonstrate that Rho signaling and thus Ect2 activity, is increased at the poles, when phosphodeficient Ect2 is the only Ect2 in the embryo. *

      The fact that a phosphomimetic residue at this site leads to reduced membrane localization is highly relevant, as we suggest that phosphorylation of this site contributes to the mechanism by which AIR-1 generates asymmetric ECT-2. Given the role of AIR-1 in regulating polarity, a version of ECT-2 that can not be phosphorylated would be predicted to be dominant lethal, necessitating a conditional expression strategy which does not currently exist in the early C. elegans embryo system (indeed we were unable to recover a T-> A allele at this site, despite extensive efforts). To avoid this issue, we used a viable, fertile, hypomorphic allele that is predicted to be less responsive to AIR-1 activity. The goal of this experiment was to evaluate whether the putative AIR-1 sites affect not only the NOP-1 pathway for furrow ingression, but also impact furrowing that is centralspindlin-dependent.

      To complement this finding have performed experiments in which ECT-2 was partially depleted We used RNAi to partially deplete ECT-2 and ECT-2 T634E and measured the total embryo fluorescence of each ECT-2 variant and the kinetics of furrow ingression. Partial depletion of wt ECT-2, to ~ 20% of control levels leads to delay in furrow formation and all but 2/18 (11%) of embryos complete cell division. In contrast, a similar depletion of ECT-2T634E depletion results in a failure of furrow ingression in ~52 % of embryos. Furthermore, while ECT-2T634E embryos initiate furrowing with normal kinetics, they exhibit a slower rate of furrow ingression, in contrast, partial depletion of WT ECT-2 results in a delay in furrow initiation, but once initiated, the rate of furrow ingression is not significantly affected. These results demonstrate that ECT-2T634E behavior can not simply be explained by a modest reduction in membrane binding.

      R1e *The authors use the Aurora A inhibitor MLN8237: It was shown prior (De Groot et al., 2015) that this inhibitor is not highly specific for Aurora A, and that it also inhibits Aurora B. Thus experiments need to be repeated with MK5108 or MK8745. They should also be conducted during polarization. Why does Aurora A inhibition not abolish asymmetry? That would be expected? *

      The role of AIR-1 in symmetry breaking during polarization is previously published, including with chemical inhibitors (Reich 2019, Kapoor 2019, Zhao 2019, Klinkert 2019, PMID 31155349, 31636075, 30861375, 30801250). ECT-2 localization depends on both the spatial regulation of AIR-1 activity and the distribution of cortical factors that contribute to ECT-2 cortical association, as a result of cortical flows. During acute, chemical perturbation of AIR-1 it is likely that these factors, which were polarized prior to drug treatment, remain polarized, allowing the residual cortical ECT-2 to remain asymmetric. The reviewer is correct about the specificity of MLN8237 and we do not rely on it alone to demonstrate the role of AIR-1. Rather this experiment is a complement to our AIR-1 depletion studies, which are sufficient to establish specificity. We present this experiment merely to show that AIR-1 acutely regulates ECT-2 during cytokinesis in embryos that were entirely unperturbed during polarization.

      R1f *There is no statistical analysis of the results in the entire study. For all claims stating a change in Ant/Post Ect2 ratio or Ect2 membrane localization selected time points should be statistically compared: for example the main point of Fig.1 is that Ect2 becomes more asymmetric during anaphase. Thus a statistical analysis of the Ect2 ratio at anaphase onset (t=0s) and eg. t=90 s after anaphase onset should be performed; or Fig. 3A nop-1 mutant Ant/Post Ect2 ratio during polarization: again statistical analysis of control and nop-1 mutant embryos is needed at a particular time point. *

      All of the graphs were presented with the mean of ~10 embryos per condition and included the 95% confidence intervals. In the revised manuscript, we have included tests of statistical significance, at each time point. While non-overlapping confidence intervals generally suggest statistical significance, we include these analyses on the graphs as it can be difficult to assess statistical significance when the confidence intervals overlap.

      R1g *The aim of Fig. 2B is to demonstrate that Ect2 localization is independent of microtubules, however they still observe some microtubules with the Cherry-tubulin marker and those are even very close to the membrane and therefore could very well influence Ect2 on the membrane. Therefore I am not convinced that this experiment rules out that microtubules have no role in regulating Ect2 localization. *

      We do not exclude that microtubules play a contributing role in ECT-2 phosphoregulation, but rather we conclude that the primary cue is the centrosome. Indeed, microtubules can play an important role in controlling spindle positioning which affects the proximity of the centrosome to the cortex.

      The manuscript states, “Despite significant depletion of tubulin and near complete depolymerization of microtubules (Figure 2B, insets), we observed strong displacement of ECT-2 from a broad region of the posterior cortex during anaphase (Figure 2B).” Thus, despite dramatic reductions in microtubules, not only does ECT-2 become polarized, it becomes hyperpolarized. In contrast, were microtubules directly involved in ECT-2 displacement, one would expect a reduction in polarization as a result microtubule depolymerization. Conversely, though SPD-5 depleted embryos contain far more microtubules than embryos in which microtubule assembly is suppressed, ECT-2 is not polarized in SPD-5 depleted embryos. Thus in the manuscript, we conclude, “Collectively, these studies suggest that ECT-2 asymmetry during anaphase is centrosome-directed.” This conclusion is well supported by the results shown.

      R1h *Throughout the paper the authors should tone down their statement that Air1 breaks symmetry by phosphorylating Ect2, since phosphorylation of Ect2 by Air2 is not shown. *

      We agree with this comment and will make the necessary edits to the text. Indeed, this is the reason why we had included the final section in our original draft, “Limitations of this study” which makes this point explicitly.

      R1i *I understand that the establishment of Ect2 asymmetry is important for polarization. However, how does asymmetric Ect2 localization result in more active Ect2 at the cell equator, which is required for the formation of the active RhoA zone? Would we not expect an accumulation of Ect2 at the cell equator, or if that is not the case more active Ect2 at the equator versus the poles? *

      The pseudocleavage furrow forms as a result of the anterior enrichment of active RHO-1 and its downstream effectors. There is no evidence for a local accumulation of active RHO-1 specifically at the site of the pseudocleavage furrow. Rather, this furrow forms at the boundary between the portion of the embryo where RHO-1 is active and the posterior of the embryo where RHO-1 is far less active (Figure 1 Supplement 2). We suggest that aster-directed furrowing during cytokinesis likewise results from asymmetric accumulation of the same components, without them necessarily being specifically enriched solely at the furrow.

      While cytokinesis generally involves an equatorial contractile ring, furrow formation can be driven by an asymmetric - i.e. non-equatorial - accumulation of actomyosin. This behavior is exemplified during pseudocleavage during which the entire anterior cortex is enriched for actomyosin and the posterior is depleted of myosin (Figure 1 Supplement 2). Several published studies provide evidence that the asymmetric pattern of myosin accumulation contributes to cytokinesis (PMID 22918944, 17669650).

      Minor comments

      R1j *Can the authors explain why the quantification of Ant/Post Ect2 ratio in control embryos differs in different figures? For example: in Fig. 1D i) a slight increase of Ect2 asymmetry ratio is seen at around 80 s after anaphase onset. In comparison, in Fig. 2C (i) this increase is not obvious. Are those different genetic backgrounds? *

      In figure 1 D, time 0 begins at anaphase onset, whereas in 2C, time 0 is specified at the time of nuclear envelope breakdown (NEBD). The duration between NEBD and anaphase onset is ~130 sec and an increase in ECT-2 polarization is observed at 220 s post NEBD, ie 90 sec post anaphase onset comparable to that seen in Fig 1D.

      R1k *One key point of the paper is that myosin-dependent cortical flows amplify Ect2 asymmetry during polarization and cytokinesis. During polarization the data is convincing, however during cytokinesis Ect2 ratio is only slightly decreased after nmy-2 depletion, again is this decrease even significant? *

      Figure 3 supplement 1 shows a significant difference in ECT-2 asymmetry between control and myosin-depleted embryos.

      R1l *In the introduction: "Centralspindlin both induces relief of ECT-2 auto-inhibition and promotes Ect2 recruitment to the plasma membrane" it should be added 'Equatorial' membrane, since Ect2 membrane binding is, to my knowledge, not compromised in centralspindlin mutants or in Ect2 mutants that cannot bind centralspindlin. *

      Generally speaking, the reviewer is correct that cortical accumulation of ECT-2 globally is centralspindlin independent. However, as seen in e.g. ZYG-9 depleted embryos, ECT-2 is recruited to the posterior cortex in a centralspindlin-dependent manner. Thus centralspindlin can promote ECT-2 accumulation to the cortex and the site of that accumulation will be dictated by the position of the spindle midzone.

      R1m *Labels in the figures are often very small eg Fig. 1 ii-v) and difficult to read. In addition it is easier for the reader if the proteins shown in the fluorescent images is also labeled in the figure (eg Fig. 2B add NG-Ect2). *

      These useful suggestions have been incorporated.

      R1n *Material and methods it should be mentioned which IPTG concentration was used. *

      The IPTG concentration (1 mM) has been added to the revised text.

      R1o *The authors speculate that the Air1 phosphorylation sites in Ect2 PH domain prevent binding to phospholipid due the negative charge. At the same time, the authors propose that the PH domain binds to a more stable protein on the membrane, which is swept along with the cortical flows and they propose anillin could be that additional binding partner. I might miss something, but do the authors suggest Ect2 has two binding partners: anillin and the phospholipids? It would be necessary to explain this better. *

      *The authors should test if anillin represents the suggested myosin II dependent Ect2 anchor. For this they should check if Ect2 localization to the membrane is altered upon on anillin RNAi. *

      This summary of our model is largely correct, though we do not know the identity of the more stable cortical anchor(s). While we suspect the PH domain binds to a phospholipid, ECT-2 cortical localization also requires ~100 residues C-terminal to the PH domain. It is likely that this domain interacts with a cortical component.

      In preliminary experiments, ECT-2 accumulation is not strictly anillin-dependent. However, functional redundancy may obscure a contribution of anillin. Anillin was mentioned simply because of the evidence for a physical interaction between ECT-2 and anillin (Frenete PMID 22514687). In the revised manuscript we also include the possibility that ECT-2 accumulations involves one or more anterior PAR proteins. The identity of the cortical anchor(s) is an interesting question for future studies. We consider this question beyond the scope of the current manuscript.

      R1p *The title of fig. 3 does not fit the statement the authors want to make, since the key point is how Ect2 polarization is affected and not membrane localization in general. *

      Thank you for this suggestion. The title has been changed to “Cortical flows contribute to asymmetric cortical accumulation of ECT-2”

      R1q *In Fig 4A/C. After air1 depletion the authors observe a reduction in Ect2 asymmetry. Why are the centrosomes not marked in the figures? Because they cannot be detected? The authors would also need to show that the mitotic spindle and centrosomes are no altered by air1 RNAi in the zyg9 mutant. Otherwise the observed effect might be indirect. *

      Centrosomes are perturbed by depletion of AIR-1 (Hannak, PMID 11748251), but they are still detectable and their positions will be added to figure 4. As has been extensively demonstrated, AIR-1 depletion does lead to attenuated spindles and defects in spindle assembly, some of which are also seen TPXL-1 depleted embryos. These consequences of AIR-1 depletion does complicate the analysis, but this is typical of factors that regulate many processes. This is one of the key reasons why we used ZYG-9 depletion in combination with AIR-1 depletion to overcome these indirect effects.

      R1r *The authors state that tpxl-1 depletion attenuates Ect2 asymmetry, this is not seen in the quantification ((Fig. 4B(i)). The main phenotype they observe is that Ect2 levels on the membrane increase (Fig. 4 (ii) and (iii). They go on testing the function of tpxl1 by depleting tpxl1 in the zyg9 mutant, where the centrosomes are close to the posterior cortex. Here they see no effect on Ect2 asymmetry. Based on that they conclude that tpxl1 has no role in this process. To me this finding is not surprising since the centrosome is close the cortex in zyg9 mutant embryos. Therefore sufficient amounts of active Air1 could reach the membrane and displace Ect2. Thus an amplification of the inhibitory signal by tpxl1 on astral microtubules might not be required. The authors need to mention this possibility and tone down their statment (also in the discussion) that tpxl1 is not required for this process. *

      In the text, we state, “Cortical ECT-2 accumulation is enhanced by TPXL-1 depletion, though the degree of ECT-2 asymmetry is unaffected (Figure 4B).… we observed robust depletion of ECT-2 at the posterior pole in zyg-9 embryos depleted of TPXL-1, but not AIR-1 (Figure 4C). We conclude that while AIR-1 is a major regulator of the asymmetric accumulation of ECT-2, the TPXL-1/AIR-1 complex does not play a central role in this process.” We consider this to be an accurate description of the results. In sum, we have found no evidence that TPXL-1 contributes to generating ECT-2 asymmetry, beyond its well established role in regulating spindle length and position. The are several other processes that are known to be AIR-1 dependent and TPXL-1 independent; these primarily involve the centrosome (Ozlu, PMID 16054030). Given that TPXL-1 associates with astral microtubules, the fact that microtubule depletion can enhance ECT-2 asymmetry also argues against a requirement for TPXL-1.

      R1s *It was shown that the C-terminus of Ect2 is sufficient and the PH domain is required for Ect2 membrane localization in C. elegans (Chan and Nance, 2013; Gomez-Cavazos et al., 2020). Papers should be cited. *

      Thank you for this helpful comment. Chan and Nance 2013 indeed shows that the ECT-2 C-term is sufficient to localize to the cell cortex. In contrast, the Gomez-Cavasos paper (PMID 32619481) shows in figure S2 that the PH domain is required for cortical localization of ECT-2; this paper does not focus extensively on cortical accumulation of ECT-2. We have cited Chan and Nance in the revised manuscript.

      R1t *The authors find that nmy-2 depletion results in loss of asymmetry for the Ect2 C-term and Ect2 3A fragment during polarization. Why is the same experiment not shown for cytokinesis? *

      Strong depletion of NMY-2 prevents polarity establishment, resulting in symmetric spindles, which in turn results in symmetric ECT-2 accumulation. Thus, the requested experiment would not provide significant additional information.

      R1u *Air1 is targeted to GFP-C-term Ect2 fragment via GFP-binding to determine the influence on GFP-C-term Ect2 localization (Fig. 5F). They state that they see a reduction of Ect2 C-term but not of C-term 3A after targeting. The reader has to compare Fig. 5D with F. Since the differences are not big, they need to compare the Ect2 C-term and Ect2 C-term 3A with and without Air1 targeting in the same graph (plus statistics). Otherwise this statement is not convincing. *

      It is not straightforward to directly compare ECT-2C in the presence and absence of GBP-mCherry-AIR-1, because the GBP:AIR-1 fusion protein recruits a large fraction of ECT-2C to the centrosome. For this reason we think it is best to compare the behavior over time of ECT-2C and ECT-2C3A in the presence of GBP-mCherry-AIR-1. At the onset of anaphase, these two fragments localize similarly, but they then diverge over time.

      R1v *In Fig. 6A the authors determine the contribution of air1 to furrowing. For this they deplete air1 in the nop1 mutant. According to previous studies, air1 mutants have a monopolar spindle. How can the authors analyze the function of air1 in cytokinesis when the spindle is monopolar? Did the authors do partial air1 depletion? They authors need to show that there is not major effect on the spindle and centrosome for their conditions. For comparison air1(RNAi) alone has to be included, otherwise the experiment is not conclusive. *

      AIR-1 depletion does not result in a monopolar spindle in C. elegans embryos, though the spindle is attenuated and disorganized (PMID 9778499). TPXL-1 depletion also results in short, well organized spindles (PMID 19889842). The concerns are the reason we performed the ZYG-9 depletion experiments in Figure 4C to ensure the centrosomes are proximal to the cortex.

      R1w *Upon air1(RNAi) in the nop1 mutant NMY2 intensity seems decreased and not increased. Can the authors comment on that, since that is opposite of what is expected. *

      This is expected as previous studies have shown that NOP-1 contributes to RHO-1 activation during polarization and cytokinesis (Tse, PMID 22918944). (NOP stands for No Pseudocleavage).

      R1x *In Fig 6B they introduce a phosphomimetic point mutation in S634 [sic, T634] in the endogenous Ect2 locus. It not clear why the authors chose this site out of the six putative sites and why they only chose one and not 3 or 6 sites? This needs some explanation. *

      In our early work with ECT-2 transgenes, we found that a T634E mutation strongly affected cortical ECT-2C, so we decided to assess its affect on the function and localization of endogenous ECT-2. While we were able to recover a T634E variant, we were not able to recover a T634A variant, despite considerable effort. Based on these experiences, we anticipated that we would be unable to recover a mutant version of ECT-2 in which all sites were changed to phosphomimetic.

      R1y *In the model (fig. 7) no astral microtubules are shown during pronuclear meeting and metaphase. Astral microtubules are present at this stage and should be added to the schematic. *

      MTs will be added to the figure.

      Reviewer #1 (Significance (Required)):

      R1z *The centrosomes inhibit cortical contractility during polarization and cytokinesis in the one-cell C. elegans embryo. Centrosome localized Air1 was proposed to be part of this inhibitory signal, however the phosphorylation target of Air1 is not known. The identification of Ect2 as a phosphorylation target of Air1 would be a great advancement in the field. However, the presented manuscript lacks convincing data that Ect2 is the phosphorylation target of Air1 during polarization and cytokinesis. *

      We explicitly acknowledge that we have not directly shown that AIR-1 phosphorylates ECT-2. However, we have shown that (i) AIR-1 inhibits cortical ECT-2 localization, (ii) the negative regulator of AIR-1, SAPS-1, promotes AIR-1 cortical accumulation, (iii) that the cortical localization domain of ECT-2 has putative AIR-1 sites, which, when mutated to non-phosphorylatable residues leads to increased cortical accumulation of ECT-2 (and (iv) phosphomimetic residues reduce its cortical accumulation), and (v) that these AIR-1 sites are required to render GFP-ECT-2C responsive to GBP-AIR-1. For these reasons we feel that our data makes a strong, albeit indirect, case that AIR-1 regulates ECT-2, even though we clearly acknowledge that we do not directly show that AIR-1 directly phosphorylates ECT-2.

      Direct proof would require the demonstration that AIR-1 phosphorylates ECT-2 in vivo. This would be difficult to show as ECT-2 phosphorylation is likely transient, it likely affects only a subset of the total ECT-2 pool, and it likely results in loss of membrane association of ECT-2. As it it not possible to synchronize C. elegans embryos, biochemical analysis would be very difficult. Even a phosphospecific antibody for the putative ECT-2 phosphosites might not be particularly informative, as it would be predicted to give a diffuse cytoplasmic signal.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      R2a* In this work, Longhini and Glotzer investigate the localization of an essential regulator of polarity and cytokinesis, RhoGEF ECT-2, in the one-cell C. elegans embryo. The authors show that centrosome localized Aurora A kinase (AIR-1 in C. elegans) and myosin-dependent cortical flows are critical in asymmetric ECT-2 accumulation at the membrane. Since membrane interaction of ECT-2 is dependent on the Pleckstrin homology domain present at the C-terminus of ECT-2, they further analyzed the importance of putative AIR-1 consensus sites present in this domain. The authors linked the relevance of these sites in controlling ECT-2 localization and its significance on cytokinesis. The manuscript is well written, the work is interesting, and the data quality is high. *

      We thank the reviewer for their critique.

      Major comments:

      R2b - In Fig. 2, the authors claim that the centrosomes and the position of the mitotic spindle are critical in regulating the asymmetric enrichment of ECT-2 at the membrane. To test the relevance of spindle positioning on ECT-2 localization, the authors depleted PAR-3 and PAR-2. The authors observed that the ECT-2 asymmetry is affected in these settings. However, PAR-3 or PAR-2 depletion impacts polarity, which is critical for many cellular processes, including spindle positioning. Can the authors try to specifically misposition the spindle without affecting polarity? For instance, by depleting Galpha/GPR-1/2 and assessing the impact of such depletion on ECT-2 localization.

      Thank reviewer for good suggestion. We have performed the suggested experiment (presented in Figure 2, supplement 2). As one might predict, ECT-2 starts out polarized as Gα is not required for polarity establishment. During anaphase, ECT-2 becomes more symmetric in Gα depleted embryos as compared to wild-type.

      R2c *-I wonder why the intensity of ECT-2 at the anterior and posterior membrane decreases in air-1(RNAi) post anaphase onset (Fig. 4A)? Moreover, I fail to observe a significant asymmetric distribution of ECT-2 in embryos depleted for PERM-1. Therefore it appears that the difference between DMSO and MLN8237-treated embryos is not substantial (at least in the images)? *

      We do not have a complete or rigorous explanation for all the changes in cortical ECT-2, but they are highly reproducible. We speculate that there are cell cycle regulated changes in ECT-2 accumulation, in addition to its regulation by AIR-1. For example, in figure 1, a strong reduction in both anterior and posterior cortical ECT-2 is evident beginning at approximately -350 sec, which may reflect the initial stages of Cdk1 activation. This may result from cell cycle regulated modulation of ECT-2, as there is evidence that mammalian ECT-2 is subject to a very potent inhibition membrane association by Cdk1 (PMID 22172673). Alternatively, there could be cell cycle modulation of the cortical factor that serves as the “co-anchor” of ECT-2. The ability of GBP-AIR-1 to induce GFP-ECT-2C dissociation also appears cell cycle regulated.

      Consistent with a cell cycle regulated component, note that NEBD is delayed in AIR-1 depleted embryos (PMID 17669650, 17419991, 30861375). This delay results in a shorter interval between NEBD and e.g. the peak in Cdk1 activation, explaining the earlier decrease in AIR-1(RNAi) embryos vs. control, relative to NEBD.

      Our quantitative analysis indicates a significant increase of cortical ECT-2 upon treatment with MLN8237. In addition, the quantitation in the previous version did show a significant polarization of ECT-2 in PERM-1-depleted embryos prior to treatment. We have revised this figure to simply show an acute increase in cortical ECT-2 upon drug treatment, as the focus of this experiment was solely to show that ECT-2 cortical accumulation is acutely responsive to chemical inhibition during cytokinesis in otherwise normal embryos.

      *-The data in Fig. 5 and 6 are exciting but raise a few concerns: *

      R2d *a). The authors show that ECT-2C localization mimics the localization of endogenous tagged ECT-2. However, all these analyses with ECT-2C and various mutants are performed in the presence of endogenous ECT-2. Can the author check the localization of these mutant strains in conditions where the endogenous proteins are depleted? I understand that the cortical flow would be perturbed in conditions where endogenous ECT-2 is depleted. However, I suspect that one can analyze the anaphase-specific distribution. *

      We have examined ECT-2C localization in embryos depleted of ECT-2. Cortical localization of ECT-2C is not dependent upon endogenous ECT-2. This result is now shown in figure 5 supplement 1. However, as the reviewer suggested, embryos depleted of ECT-2 do not show a high degree of ECT-2C asymmetry as ECT-2 is required for the cortical flows that amplify the symmetry breaking during polarization. During cytokinesis, ECT-2C does show a modest change in localization at the poles; the extent of the polar reduction is limited and the changes are symmetric as ECT-2 displacement causes spindles to be symmetrically positioned and limits their elongation during anaphase.

      R2e *b). Can the author comment on why ECT-2C does not accumulate at a similar level as ECT-2C(3A or 6A) at the cell membrane when AIR-1 is depleted (compare Fig. 5D with Supplemental Fig. 5)? *

      When ECT-2C(3A or 6A) are expressed in otherwise wild-type embryos, embryo polarization occurs, resulting in anterior-directed flows that concentrate the factor(s) that enables the anterior enrichment of ECT-2 (and ECT-2C 3A/6A). By contrast, when AIR-1 is depleted, most embryos exhibit a “bipolar” phenotype in which PAR-2 is recruited to both anterior and posterior poles, and the actomyosin network becomes somewhat concentrated laterally (PMID 30801250, 30861375, 31636075). The differential positioning of the actomyosin network in AIR-1 depleted embryos is likely responsible for the interesting difference that the reviewer points out. This section of the results states. “Nevertheless, these variants accumulated in an asymmetric manner. ECT-2C asymmetry temporally correlated with anteriorly-directed cortical flows (Figure 5 D,E), raising the possibility that asymmetric accumulation of endogenous ECT-2 drives flows that cause asymmetry of the transgene, irrespective of its phosphorylation status.”

      R2f *c). Does the cortical localization of the ECT-2C(6A) mutant become symmetric upon further depletion of AIR-1? Of course, if the asymmetric distribution of ECT-2C(6A) is dependent on the presence of endogenous protein in the cellular milieu, the point raised earlier will help address this concern. *

      We have not performed this exact experiment with ECT-2C-3A though we have performed it with a longer ECT-2 C-terminal fragment (aa 559-924). As expected, due to the considerations described above, the asymmetry of ECT-2C-3A is reduced when AIR-1 is depleted. Likewise, ECT-2C-6A is becomes symmetric when endogenous ECT-2 is depleted due to the dependence of its asymmetry on cortical flows, as discussed above.

      In the revised manuscript, we provide additional explanation of the AIR-1 depletion phenotype which will explain the origin of the asymmetric distribution of ECT-2.

      R2g *d). The authors predict that the AIR-1 mediated phosphorylation delocalizes ECT-2 from the polar region of the cell cortex. Since the posterior spindle pole is much closer to the posterior cortical region, the delocalization is much more robust at the posterior cell membrane. I wonder why targetting AIR-1 at the membrane (GBP-mCherry-AIR-1) does not entirely abolish GFP-ECT-2C membrane localization? Can the author include the localization of GBP-mCherry-AIR-1 in the data? Also, do we know for sure if GBP-mCherry-AIR-1 is kinase active? *

      The GBP-mCherry-AIR-1 transgene was obtained from the Gönczy lab which demonstrated that it has some activity (PMID 30801250). Given that centrosomal AIR-1 (as compared to astral AIR-1) is the primary pool of AIR-1 responsible for modulating cortical ECT-2 levels, it is a not clear that the GBP-fused form of AIR-1 is as active as the centrosomal pool of AIR-1; indeed we suspect it is significantly less active, similar to the manner in which TPXL-1/AIR-1 appears less active towards ECT-2 than centrosomal AIR-1. Indeed as the reviewer suggests, were this pool of AIR-1 highly active, we would expect that its cortical recruitment would preclude embryo polarization, and this transgene would cause lethality when expressed with a GFP-tagged cortical protein. These concerns notwithstanding, we do observe a specific reduction in the anterior accumulation of ECT-2C as compared to ECT-2C3A, suggesting that this form of the kinase has some ability to modulate ECT-2C.

      Co-expression of GFP-ECT-2C with GBP-mCherry-AIR-1 induces the centrosomal/astral accumulation of GFP-ECT-2C, which is highly visible in the figure and not seen in the absence of GBP-mCherry-AIR-1. Not surprisingly, the co-expression also induces a cortical pool of GBP-mCherry-AIR-1 that is not seen in the absence of GFP-ECT-2C. These redistributions indicate formation of the complex between GFP-ECT-2C and GBP-mCherry-AIR-1. The mCherry-AIR-1 images could be added as insets to the figure, but in our opinion, they would not make a substantive contribution, given the dramatic accumulation of centrosomal GFP-ECT-2C.

      R2h *e). The authors show that centrosomal enriched AIR-1 [spd-5(RNAi)], but not the astral microtubules localized AIR-1 [tpxl-1(RNAi)], is vital for ECT-2 membrane localization. Interestingly, the authors showed that AIR-1 acts in the centralspindlin-directed furrowing pathway (Fig. 6A). I wonder if the authors can combine NOP-1 depletion with TPXL-1 depletion? I guess this will further help to exclude the function of TPXL-1 in the centralspindlin-directed furrowing pathway. *

      We would like to clarify that our data indicates that AIR-1 acts on both the centralspindlin-independent furrowing (e.g. the anterior furrow in 4C), as well as centralspindlin-dependent furrowing (Figure 6).

      While the experiment the reviewer proposes appears simple in theory, the interpretation is potentially a bit more complex, due to the role of TPXL-1 in spindle elongation, which can affect centralspindlin-directed furrowing. That said, there are two published experiments and one experiment in the manuscript that indicate that centralspindlin dependent furrowing can occur in TPXL-1 depleted embryos. First, Lewellyn et. al. showed that while tpxl-1(RNAi) embryos furrow, tpxl-1(RNAi); zen-4(RNAi) embryos do not, suggesting centralspindlin can function in the absence of TPXL-1. Second, the same paper shows that embryos doubly depleted of TPXL-1 and GPR-1/2 exhibit multiple furrows. Our previous work has shown that furrowing in Galpha-depleted embryos is centralspindlin dependent (Dechant and Glotzer). Furthermore, in the current manuscript we found that embryos depleted of both TPXL-1 and ZYG-9 form posterior furrows (8/8 embryos, 6/8 furrows were strong furrows) although the appearance of these furrows is delayed, presumably due to the reduction in spindle elongation due to TPXL-1-depletion. As described in the manuscript, these posterior furrows have been previously shown to be centralspindlin dependent and NOP-1 independent.

      In accordance with these results, and in direct response to the reviewer’s specific suggestion, we do observe furrowing in nop-1(it142); TPXL-1(RNAi) embryos (10/10 embryos furrow, 9/10 complete cytokinesis) . Thus, all of the available results indicate that TPXL-1 is largely dispensable for centralspindlin dependent furrowing. However, the role of TPXL-1 in centralspindlin-dependent furrowing is not a focus of the manuscript, thus we do not favor including this result, as it distracts from the primary focus of the study.

      R2i *f). Why do NMY-2-GFP cortical levels appear lower in 30% of the embryos that show various degrees of cytokinesis defects (Fig. 6A)? *

      There are a number of possible origins of the variability. As shown in (Reich 2019, Kapoor 2019, Zhao 2019, Klinkert 2019, PMID 31155349, 31636075, 30861375, 30801250), AIR-1 depletion results in variable polarization (unpolarized PAR-2, bipolarized PAR-2, anterior PAR-2, posterior PAR-2). Furthermore, spindles in AIR-1 depleted embryos exhibit somewhat variable positioning. While we were unable to correlate these sources of variability with furrow formation, these results demonstrate that AIR-1 depletion impairs furrowing directed by centralspindlin, which was not entirely expected, given that (i) AIR-1 depletion potently suppresses NOP-1 dependent flows of cortical myosin, as evidenced by the loss of an anterior furrow in AIR-1(RNAi); nop-1(it142) embryos and (ii) centralspindlin directed furrowing can occur in the posterior in ZYG-9 depleted embryos both in the presence or absence of AIR-1 (Figure 4C).

      R2j *g). The authors report that phosphomimetic mutation at the phospho-acceptor residue in ECT-2 impacts its cortical accumulation. This strain, together with NOP-1 depletion, affects furrow ingression. One explanation for this phenotype is that phosphomimetic mutant weakly accumulates at the membrane. However, one interesting observation is that ECT-2T634E enriches at the central spindle (Fig. 6B, panel 120 sec), which somehow I could not find in the text. Could this additional localization of ECT2 at the central spindle contribute to the cytokinesis defects that the authors have observed? The microscopy images the authors have included show that ECT-2T634E significantly localizes at the equator at the time of furrow initiation. Can the authors add the localization of ECT2 wild-type and ECT-2T634E in NOP-1 depleted conditions where they see an apparent impact on the cytokinesis? Similarly, if the authors include the localization of NMY-2 in these conditions-it will further add more weightage to the data. *

      We regularly detect trace amounts of ECT-2 on the central spindle and this is slightly enhanced at in the ECT-2T634E mutant. However, given the large cytoplasmic pool of ECT-2, it seems unlikely that the slight enrichment of ECT-2 on the central spindle significantly affects the cortical pool of ECT-2, though the reduction in cortical ECT-2 may facilitate its enrichment on the central spindle.

      As shown in figure 3B, depletion of NOP-1 does not dramatically affect cortical ECT-2 levels in wild-type embryos. Likewise, we did not observe a significant effect of NOP-1 depletion in ECT-2 T634E, thus we decided not to include this negative result.

      As discussed in general point 8, we suggest the modest reduction in the membrane pool of ECT-2 is unlikely to be the primary cause of the T634E, but rather the ability of AIR-1 to modulate induce its relocalization. Consistent with this interpretation, the embryos that failed ingression tended to have more symmetric spindles, which could limit the residual cortical flows that facilitate furrow ingression.

      Minor comments:

      R2k -An explanation of how the timing of NEBD was analyzed in multiple settings would be helpful.

      Depending on the experiment, we used either ECT-2:mNG fluorescence (it is excluded from the nucleus until NEBD) and/or the Nomarski images to score NEBD.

      R2l ____-*The authors mentioned on p. 6-'Despite significant depletion of tubulin.....during anaphase'. These experiments are performed in the near complete depolymerization of microtubules; thus, regular anaphase will not establish. I understand that the authors are monitoring localization wrt the timing similar to anaphase in the non-perturbed condition, and thus a bit of change in the sentence is required. *

      Thank you for highlighting this point. We have substituted “following mitotic exit” for “anaphase”. In these images, mitotic exit can be scored by the emergence of contractility.

      R2m*-After testing the relevance of SPD-5 (that primarily acts on PCM and not on centrioles)-the authors write on p. 6 that 'two classes of explanation...early embryo'. I did not understand the importance of this sentence here. *

      To clarify, we deleted the words “classes of” from the sentence in question and following that sentence we added the word, “first” indicating that we were explaining the first of the two possible explanations

      R2n*-The observed impact of spd-5 (RNAi) on ECT-2 localization could be because of the effects of SPD-5 depletion on centrosomal AIR-1? The authors can link the impact of SPD-5 depletion not only with the centrosome but also with AIR-1 in the discussion. *

      Indeed, it is well established that SPD-5 is required for centrosomal AIR-1 (Hamill DR, et. Al Dev Cell (2002). The revised discussion now states, “Specifically, during both processes, ECT-2 displacement requires the core centrosomal component SPD-5, which is required to recruit AIR-1 to centrosomes{Hamill et al., 2002, #1201}, but ECT-2 displacement is not inhibited by depolymerization of microtubules and it does not require the AIR-1 activator TPXL-1 (see below).”

      R2o-In the various Figure legends, sometimes the authors mention time '0' as anaphase, and other time as anaphase onset.

      In all cases, anaphase onset was intended and the legends will be corrected.

      Reviewer #2 (Significance (Required)):

      R2p *The manuscript is well written, the work is interesting, and the data quality is of good quality. *

      We thank the reviewer for their encouragement as well as for their thoughtful critique!

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      R3a* Symmetry breaking is the process by which uniformity of the system is broken. Many biological systems, such as the body axes establishment and cell divisions in embryos, undergo symmetry breaking to pattern cellular interior design. C. elegans zygote has been a classic model system to study the molecular mechanism of symmetry breaking. Previous studies demonstrated critical roles of centrosomes and microtubules in breaking symmetry in the actin cytoskeleton during anterior-posterior polarization and cytokinesis. It, however, remains elusive how centrosomes and/or microtubules regulate the assembly and contractility of the actin cytoskeleton. Recent reports identified Aurora-A AIR-1 as the key centrosomal kinase that suppresses the function of the actin cytoskeleton, but little is known about a substrate of the kinase during symmetry breaking events. *

      Longhini and Glotzer proposed in this manuscript that RhoGEF ECT-2 plays a critical role in symmetry breaking of the actin cytoskeleton under the control of AIR-1 kinase. Kapoor and Kotak (2019) previously proposed the same GEF as a downstream effector of centrosomes, but this work did not provide direct evidence for ECT-2 as the AIR-1 effector. This manuscript identified three putative phospho-acceptor sites in the PH domain of ECT-2 that render ECT-2 responsive to inhibition by AIR-1. Although this manuscript lacks direct in vivo and in vitro evidence for phosphorylation of ECT-2 by AIR-1 kinase, the above findings reasonably support a model where in AIR-1 promotes the local inhibition of ECT-2 on the cortex. Design of the experiments, the quality of images, and data analysis are reasonable, and the main text was written very well. The main conclusion of this work will attract many readers in cell and developmental biology fields. I basically support its publication in the journals supported by Review Commons with minor revisions (see below).

      We thank the reviewer for their encouraging remarks and helpful comments.

      Minor comments

      R3b 1) In Figures 2A and 2B, the authors claimed apparent correlation between spindle rocking and ECT-2 displacement. However, because both MTs and ECT-2 in Fig2AB images are blur, I cannot convince myself whether ECT-2 intensities on the cortex showed negative correlation with the distance between the posterior centrosome and the cortex. The authors may want to provide quantitative data set and use a statistical test to support this conclusion.

      Only figure 2A focuses on the rocking. The important structure to assess is the position of the centrosome, as the astral arrays of microtubules are largely radially symmetric (except towards the spindle midzone). As this point in the manuscript were were not discriminating between the astral microtubules and the centrosomes, rather focusing on the overall position of the aster as a whole. Figures 2B, 2D, Fig 2 Supplements 1 and 2, Fig 3C, and Fig 4B, summarized in figure 7A provide quantitive evidence that the centrosome-cortex distance is an important determinant of ECT-2 cortical accumulation.

      R3c *2) Figure 2D would [sic; presumably should] show a ratio between the anterior/posterior pole and the lateral cortex. *

      The reviewer is presumably noticing that the lateral cortex is brighter than the poles when PAR-3 is depleted. While we agree with this assessment, the point of this experiment was to evaluate whether both centrosomes are equally capable of regulating cortical ECT-2 at the respective poles. It appears to us that comparing the anterior and posterior poles is the appropriate measurement to make to address this point and comparison of the poles to the lateral cortex in par-3(RNAi) vs control would be confusing to readers.

      R3d *3) In Figure 3D, the authors need to clarify why they measured ECT-2 dynamics only within the "anterior pole". It would be reasonable to measure ECT-2 dynamics by FRAP and cortical high-speed live imaging on the posterior and the lateral cortex during symmetry breaking. *

      We measured ECT-2 recovery at a variety of sites with similar recovery kinetics. The comparison of ECT-2 dynamics on anterior and posterior furrows were shown in order to compare ECT-2 dynamics on centralspindlin-dependent and -independent furrows.

      We now provide additional supplemental data on ECT-2 dynamics during symmetry breaking. When ECT-2 is polarized, the residual signal is too low to obtain a measure of its recovery.

      R3e 4) In Figure 4 supplement, a difference between with or without ML8237 seems marginal. The authors need to show a statistical test to claim "rapid enhancement of cortical ECT-2 after ML8237 treatment".

      We will provide a statistical analysis. As the inhibitor affects ECT-2 globally, the anterior/posterior ratio doesn’t change significantly. To avoid confusion, we now present total cortical ECT-2 levels upon anaphase onset in this experiment as this is the most relevant parameter.

      R3f *5) I would strongly suggest the authors to clearly state in the first paragraph of discussion that "this working hypothesis is not supported by direct evidence for phosphorylation of ECT-2 by AIR-1 kinase in vitro and in vivo." It should be reasonable to weaken the statement "by Aurora A-dependent phosphorylation of the ECT-2 PH domain" in p13. *

      We agree with the underlying sentiment (as indicated by the “limitations” section that was present in the original version) and we have revised these sentences accordingly: “Our studies suggest that asymmetric, posteriorly-shifted, spindle triggers an initial focal displacement of ECT-2 from the posterior cortex by Aurora A-dependent phosphorylation of the ECT-2 PH domain, though the evidence for this phosphorylation event is indirect.”

      Reviewer #3 (Significance (Required)):

      *See the second paragraph of the Evidence, Reproducibility, and Clarity section. *

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Symmetry breaking is the process by which uniformity of the system is broken. Many biological systems, such as the body axes establishment and cell divisions in embryos, undergo symmetry breaking to pattern cellular interior design. C. elegans zygote has been a classic model system to study the molecular mechanism of symmetry breaking. Previous studies demonstrated critical roles of centrosomes and microtubules in breaking symmetry in the actin cytoskeleton during anterior-posterior polarization and cytokinesis. It, however, remains elusive how centrosomes and/or microtubules regulate the assembly and contractility of the actin cytoskeleton. Recent reports identified Aurora-A AIR-1 as the key centrosomal kinase that suppresses the function of the actin cytoskeleton, but little is known about a substrate of the kinase during symmetry breaking events.

      Longhini and Glotzer proposed in this manuscript that RhoGEF ECT-2 plays a critical role in symmetry breaking of the actin cytoskeleton under the control of AIR-1 kinase. Kapoor and Kotak (2019) previously proposed the same GEF as a downstream effector of centrosomes, but this work did not provide direct evidence for ECT-2 as the AIR-1 effector. This manuscript identified three putative phospho-acceptor sites in the PH domain of ECT-2 that render ECT-2 responsive to inhibition by AIR-1. Although this manuscript lacks direct in vivo and in vitro evidence for phosphorylation of ECT-2 by AIR-1 kinase, the above findings reasonably support a model where in AIR-1 promotes the local inhibition of ECT-2 on the cortex. Design of the experiments, the quality of images, and data analysis are reasonable, and the main text was written very well. The main conclusion of this work will attract many readers in cell and developmental biology fields. I basically support its publication in the journals supported by Review Commons with minor revisions (see below).

      Minor comments

      1. In Figures 2A and 2B, the authors claimed apparent correlation between spindle rocking and ECT-2 displacement. However, because both MTs and ECT-2 in Fig2AB images are blur, I cannot convince myself whether ECT-2 intensities on the cortex showed negative correlation with the distance between the posterior centrosome and the cortex. The authors may want to provide quantitative data set and use a statistical test to support this conclusion.
      2. Figure 2D would show a ratio between the anterior/posterior pole and the lateral cortex.
      3. In Figure 3D, the authors need to clarify why they measured ECT-2 dynamics only within the "anterior pole". It would be reasonable to measure ECT-2 dynamics by FRAP and cortical high-speed live imaging on the posterior and the lateral cortex during symmetry breaking.
      4. In Figure 4 supplement, a difference between with or without ML8237 seems marginal. The authors need to show a statistical test to claim "rapid enhancement of cortical ECT-2 after ML8237 treatment".
      5. I would strongly suggest the authors to clearly state in the first paragraph of discussion that "this working hypothesis is not supported by direct evidence for phosphorylation of ECT-2 by AIR-1 kinase in vitro and in vivo." It should be reasonable to weaken the statement "by Aurora A-dependent phosphorylation of the ECT-2 PH domain" in p13.

      Significance

      See the second paragraph of the Evidence, Reproducibility, and Clarity section.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this work, Longhini and Glotzer investigate the localization of an essential regulator of polarity and cytokinesis, RhoGEF ECT-2, in the one-cell C. elegans embryo. The authors show that centrosome localized Aurora A kinase (AIR-1 in C. elegans) and myosin-dependent cortical flows are critical in asymmetric ECT-2 accumulation at the membrane. Since membrane interaction of ECT-2 is dependent on the Pleckstrin homology domain present at the C-terminus of ECT-2, they further analyzed the importance of putative AIR-1 consensus sites present in this domain. The authors linked the relevance of these sites in controlling ECT-2 localization and its significance on cytokinesis. The manuscript is well written, the work is interesting, and the data quality is high.

      Major comments:

      • In Fig. 2, the authors claim that the centrosomes and the position of the mitotic spindle are critical in regulating the asymmetric enrichment of ECT-2 at the membrane. To test the relevance of spindle positioning on ECT-2 localization, the authors depleted PAR-3 and PAR-2. The authors observed that the ECT-2 asymmetry is affected in these settings. However, PAR-3 or PAR-2 depletion impacts polarity, which is critical for many cellular processes, including spindle positioning. Can the authors try to specifically misposition the spindle without affecting polarity? For instance, by depleting Galpha/GPR-1/2 and assessing the impact of such depletion on ECT-2 localization.
      • I wonder why the intensity of ECT-2 at the anterior and posterior membrane decreases in air-1 (RNAi) post anaphase onset (Fig. 4A)? Moreover, I fail to observe a significant asymmetric distribution of ECT-2 in embryos depleted for PERM-1. Therefore it appears that the difference between DMSO and MLN8237-treated embryos is not substantial (at least in the images)?
      • The data in Fig. 5 and 6 are exciting but raise a few concerns:
        • a). The authors show that ECT-2C localization mimics the localization of endogenous tagged ECT-2. However, all these analyses with ECT-2C and various mutants are performed in the presence of endogenous ECT-2. Can the author check the localization of these mutant strains in conditions where the endogenous proteins are depleted? I understand that the cortical flow would be perturbed in conditions where endogenous ECT-2 is depleted. However, I suspect that one can analyze the anaphase-specific distribution.
        • b). Can the author comment on why ECT-2C does not accumulate at a similar level as ECT-2C(3A or 6A) at the cell membrane when AIR-1 is depleted (compare Fig. 5D with Supplemental Fig. 5)?
        • c). Does the cortical localization of the ECT-2C(6A) mutant become symmetric upon further depletion of AIR-1? Of course, if the asymmetric distribution of ECT-2C(6A) is dependent on the presence of endogenous protein in the cellular milieu, the point raised earlier will help address this concern.
        • d). The authors predict that the AIR-1 mediated phosphorylation delocalizes ECT-2 from the polar region of the cell cortex. Since the posterior spindle pole is much closer to the posterior cortical region, the delocalization is much more robust at the posterior cell membrane. I wonder why targetting AIR-1 at the membrane (GBP-mCherry-AIR-1) does not entirely abolish GFP-ECT-2C membrane localization? Can the author include the localization of GBP-mCherry-AIR-1 in the data? Also, do we know for sure if GBP-mCherry-AIR-1 is kinase active?
        • e). The authors show that centrosomal enriched AIR-1 [spd-5(RNAi)], but not the astral microtubules localized AIR-1 [tpxl-1(RNAi)], is vital for ECT-2 membrane localization. Interestingly, the authors showed that AIR-1 acts in the centralspindlin-directed furrowing pathway (Fig. 6A). I wonder if the authors can combine NOP-1 depletion with TPXL-1 depletion? I guess this will further help to exclude the function of TPXL-1 in the centralspindlin-directed furrowing pathway.
        • f). Why do NMY-2-GFP cortical levels appear lower in 30% of the embryos that show various degrees of cytokinesis defects (Fig. 6A)?
        • g). The authors report that phosphomimetic mutation at the phospho-acceptor residue in ECT-2 impacts its cortical accumulation. This strain, together with NOP-1 depletion, affects furrow ingression. One explanation for this phenotype is that phosphomimetic mutant weakly accumulates at the membrane. However, one interesting observation is that ECT-2T634E enriches at the central spindle (Fig. 6B, panel 120 sec), which somehow I could not find in the text. Could this additional localization of ECT2 at the central spindle contribute to the cytokinesis defects that the authors have observed? The microscopy images the authors have included show that ECT-2T634E significantly localizes at the equator at the time of furrow initiation. Can the authors add the localization of ECT2 wild-type and ECT-2T634E in NOP-1 depleted conditions where they see an apparent impact on the cytokinesis? Similarly, if the authors include the localization of NMY-2 in these conditions-it will further add more weightage to the data.

      Minor comments:

      • An explanation of how the timing of NEBD was analyzed in multiple settings would be helpful.
      • The authors mentioned on p. 6-'Despite significant depletion of tubulin.....during anaphase'. These experiments are performed in the near complete depolymerization of microtubules; thus, regular anaphase will not establish. I understand that the authors are monitoring localization wrt the timing similar to anaphase in the non-perturbed condition, and thus a bit of change in the sentence is required.
      • After testing the relevance of SPD-5 (that primarily acts on PCM and not on centrioles)-the authors write on p. 6 that 'two classes of explanation...early embryo'. I did not understand the importance of this sentence here.
      • The observed impact of spd-5 (RNAi) on ECT-2 localization could be because of the effects of SPD-5 depletion on centrosomal AIR-1? The authors can link the impact of SPD-5 depletion not only with the centrosome but also with AIR-1 in the discussion.
      • In the various Figure legends, sometimes the authors mention time '0' as anaphase, and other time as anaphase onset.

      Significance

      The manuscript is well written, the work is interesting, and the data quality is of good quality.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      In this study the authors addressed how Ect2 localization is controlled during polarization and cytokinesis in the one-cell C. elegans embryo. Ect2 is a central regulator of cortical contractility and its spatial and temporal regulation is of uttermost importance. After fertilization, the centrosome induces removal of Ect2 from the posterior plasma membrane. During cytokinesis Ect2 activity is expected to be high at the cell equator and low at the cell poles. Similarly to polarization, the centrosome provides an inhibitory signal during cytokinesis that clears contractile ring components from the cell poles. Whether and how the centrosomes regulate Ect2 localization is not know and investigated in the study.

      The authors start by filming endogenously-tagged Ect2 and find that Ect2 localizes asymmetrically, with high anterior and low posterior membrane levels during polarization and cytokinesis. They reveal that the centrosome together with myosin-dependent flows results in asymmetric Ect2 localization. Previous studies had suggested that Air1, clears Ect2 from the posterior during polarization and the authors expand those finding by showing that Air1 function is also required to displace Ect2 from the posterior membrane during cytokinesis. To elucidate if Ect2 displacement is induced by phosphorylation of Ect2 by Air1, the authors investigate the localization of a C-terminal Ect2 fragment containing the membrane binding PH domain. When the predicted Air1 phosphorylation sites are mutated to alanine, the Ect2 fragment still localizes asymmetrically but exhibits increased membrane accumulation.

      Finally, they investigate the functional role of Air-1 during furrow ingression. They demonstrate that embryos deficient of Air1 and NOP1 have impaired furrow ingression. Lastly, the authors sought to confirm that there is a direct effect of Air1 on Ect2 function by generating a phosphomimetic point mutation of Ect2 using Crispr. They find that the membrane localization of phosphomimetic Ect2 is reduced and consequently furrow ingression is impaired.

      Major comments

      It is not convincing that the six putative phosphorylation sites are targeted by the Air1. If Air1 phosphorylation displaces Ect2 from the membrane, a reduction in Ant/Post Ect2 ratio is expected in the phosphodeficient mutants, like after air1 RNAi. However this is not observed for cytokinesis or polarization (Fig. 5D(i); E). This suggests that phosphorylation of those sites is not essential for the asymmetric Ect2 localization.

      The authors aim to demonstrate that phosphorylation of the identified sites is important for cytokinesis. For this they investigate contractile ring ingression in the phosphomimetic point mutation. Since ring ingression is slower and fails in nop1 mutant they authors conclude that this demonstrates a functional importance of this site. I am not surprised that embryos ingress slower in this mutant since Ect2 localization to the membrane is reduced. This however does not show that this phosphorylation site is the target of the centrosome signal. Importantly, authors would need to demonstrate that Rho signaling and thus Ect2 activity, is increased at the poles, when phosphodeficient Ect2 is the only Ect2 in the embryo.

      The authors use the Aurora A inhibitor MLN8237: It was shown prior (De Groot et al., 2015) that this inhibitor is not highly specific for Aurora A, and that it also inhibits Aurora B. Thus experiments need to be repeated with MK5108 or MK8745. They should also be conducted during polarization. Why does Aurora A inhibition not abolish asymmetry? That would be expected?

      There is no statistical analysis of the results in the entire study. For all claims stating a change in Ant/Post Ect2 ratio or Ect2 membrane localization selected time points should be statistically compared: for example the main point of Fig.1 is that Ect2 becomes more asymmetric during anaphase. Thus a statistical analysis of the Ect2 ratio at anaphase onset (t=0s) and eg. t=90 s after anaphase onset should be performed; or Fig. 3A nop-1 mutant Ant/Post Ect2 ratio during polarization: again statistical analysis of control and nop-1 mutant embryos is needed at a particular time point.

      The aim of Fig. 2B is to demonstrate that Ect2 localization is independent of microtubules, however they still observe some microtubules with the Cherry-tubulin marker and those are even very close to the membrane and therefore could very well influence Ect2 on the membrane. Therefore I am not convinced that this experiment rules out that microtubules have no role in regulating Ect2 localization.

      Throughout the paper the authors should tone down their statement that Air1 breaks symmetry by phosphorylating Ect2, since phosphorylation of Ect2 by Air2 is not shown.

      I understand that the establishment of Ect2 asymmetry is important for polarization. However, how does asymmetric Ect2 localization result in more active Ect2 at the cell equator, which is required for the formation of the active RhoA zone? Would we not expect an accumulation of Ect2 at the cell equator, or if that is not the case more active Ect2 at the equator versus the poles?

      Minor comments

      Can the authors explain why the quantification of Ant/Post Ect2 ratio in control embryos differs in different figures? For example: in Fig. 1D i) a slight increase of Ect2 asymmetry ratio is seen at around 80 s after anaphase onset. In comparison, in Fig. 2C (i) this increase is not obvious. Are those different genetic backgrounds?

      One key point of the paper is that myosin-dependent cortical flows amplify Ect2 asymmetry during polarization and cytokinesis. During polarization the data is convincing, however during cytokinesis Ect2 ratio is only slightly decreased after nmy-2 depletion, again is this decrease even significant?

      In the introduction: "Centralspindlin both induces relief of ECT-2 auto-inhibition and promotes Ect2 recruitment to the plasma membrane" it should be added 'Equatorial' membrane, since Ect2 membrane binding is, to my knowledge, not compromised in centralspindlin mutants or in Ect2 mutants that cannot bind centralspindlin.

      Labels in the figures are often very small eg Fig. 1 ii-v) and difficult to read. In addition it is easier for the reader if the proteins shown in the fluorescent images is also labeled in the figure (eg Fig. 2B add NG-Ect2).

      Material and methods it should be mentioned which IPTG concentration was used.

      The authors speculate that the Air1 phosphorylation sites in Ect2 PH domain prevent binding to phospholipid due the negative charge. At the same time, the authors propose that the PH domain binds to a more stable protein on the membrane, which is swept along with the cortical flows and they propose anillin could be that additional binding partner. I might miss something, but do the authors suggest Ect2 has two binding partners: anillin and the phospholipids? It would be necessary to explain this better. The authors should test if anillin represents the suggested myosin II dependent Ect2 anchor. For this they should check if Ect2 localization to the membrane is altered upon on anillin RNAi.

      The title of fig. 3 does not fit the statement the authors want to make, since the key point is how Ect2 polarization is affected and not membrane localization in general.

      In Fig 4A/C. After air1 depletion the authors observe a reduction in Ect2 asymmetry. Why are the centrosomes not marked in the figures? Because they cannot be detected? The authors would also need to show that the mitotic spindle and centrosomes are no altered by air1 RNAi in the zyg9 mutant. Otherwise the observed effect might be indirect.

      The authors state that tpxl-1 depletion attenuates Ect2 asymmetry, this is not seen in the quantification ((Fig. 4B(i)). The main phenotype they observe is that Ect2 levels on the membrane increase (Fig. 4 (ii) and (iii). They go on testing the function of tpxl1 by depleting tpxl1 in the zyg9 mutant, where the centrosomes are close to the posterior cortex. Here they see no effect on Ect2 asymmetry. Based on that they conclude that tpxl1 has no role in this process. To me this finding is not surprising since the centrosome is close the cortex in zyg9 mutant embryos. Therefore sufficient amounts of active Air1 could reach the membrane and displace Ect2. Thus an amplification of the inhibitory signal by tpxl1 on astral microtubules might not be required. The authors need to mention this possibility and tone down their statment (also in the discussion) that tpxl1 is not required for this process.

      It was shown that the C-terminus of Ect2 is sufficient and the PH domain is required for Ect2 membrane localization in C. elegans (Chan and Nance, 2013; Gomez-Cavazos et al., 2020). Papers should be cited.

      The authors find that nmy-2 depletion results in loss of asymmetry for the Ect2 C-term and Ect2 3A fragment during polarization. Why is the same experiment not shown for cytokinesis?

      Air1 is targeted to GFP-C-term Ect2 fragment via GFP-binding to determine the influence on GFP-C-term Ect2 localization (Fig. 5F). They state that they see a reduction of Ect2 C-term but not of C-term 3A after targeting. The reader has to compare Fig. 5D with F. Since the differences are not big, they need to compare the Ect2 C-term and Ect2 C-term 3A with and without Air1 targeting in the same graph (plus statistics). Otherwise this statement is not convincing.

      In Fig. 6A the authors determine the contribution of air1 to furrowing. For this they deplete air1 in the nop1 mutant. According to previous studies, air1 mutants have a monopolar spindle. How can the authors analyze the function of air1 in cytokinesis when the spindle is monopolar? Did the authors do partial air1 depletion? They authors need to show that there is not major effect on the spindle and centrosome for their conditions. For comparison air1(RNAi) alone has to be included, otherwise the experiment is not conclusive.

      Upon air1(RNAi) in the nop1 mutant NMY2 intensity seems decreased and not increased. Can the authors comment on that, since that is opposite of what is expected.

      In Fig 6B they introduce a phosphomimetic point mutation in S634 in the endogenous Ect2 locus. It not clear why the authors chose this site out of the six putative sites and why they only chose one and not 3 or 6 sites? This needs some explanation.

      In the model (fig. 7) no astral microtubules are shown during pronuclear meeting and metaphase. Astral microtubules are present at this stage and should be added to the schematic.

      Significance

      The centrosomes inhibit cortical contractility during polarization and cytokinesis in the one-cell C. elegans embryo. Centrosome localized Air1 was proposed to be part of this inhibitory signal, however the phosphorylation target of Air1 is not known. The identification of Ect2 as a phosphorylation target of Air1 would be a great advancement in the field. However, the presented manuscript lacks convincing data that Ect2 is the phosphorylation target of Air1 during polarization and cytokinesis.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

      We will provide the point-by-point response when we submit the full revision of our manuscript

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This paper presents an investigation of the mechanisms of how chitin is synthesized in Drosophila by investigating the chitin synthetase Kkv and two proteins related/redundant proteins that are required for chitin production Exp and Reb.

      The authors show that synthesis of nascent chitin polymers is separable from the secretion of chitin and that Ex/Reb is specifically required for chitin translocation/secretion. To understand the functions of Exp/Reb, the authors perform structure/function analyses and examine the localization of the proteins. They find that Na-MH2 domain in Exp/Reb is required for chitin translocation, and that a motif the authors name CM2 is required for Exp localization. For Kkv, they show the WGTRE domain is required for ER exit and that a coiled-coiled domain is required for KKV localization and full Kkv activity. By using live imaging and mutations that disrupt membrane trafficking, the authors show that Kkv, which is a transmembrane protein, cycles to the membrane, and like most membrane proteins, is endocytosed and transits through the endocytic system and is returned to the apical surface. Interestingly, despite being dynamically moved around the cell, chitin synthesis produces highly organized extracellular matrixes. Considering that constitutive production of chitin by Kkv everywhere in the cell would create a mess, these results underscore that regulated organized secretion/translocation of chitin is central to generating patterned extracellular matrixes (as the saying goes, "location, location, location"). Consistent with Exp/Reb being important regulators in extracellular matrix patterning, Exp/Reb not only are required for export of chitin, in the absence of Exp/Reb, the pattern of Kkv localization at the apical surface is altered. Unexpectedly however, by using super resolution microscopy the authors show that Kkv and Exp/Reb have complementary rather than matching localizations. Thus, while it is not clear exactly how Exp/Reb are regulating Kkv, they are doing something very interesting.<br /> Overall, this paper will be of broad interest to the cell biology and developmental biology communities, and to the translational community working to develop chitin as a commercial biopolymer. It is also generally clearly written, although I think there are some inaccuracies in the how some points are phrased. The experiments are well done, and subject to the revisions out lined below.

      Major concerns:

      • A major conclusion of the paper is that Exp/Reb are not required for chitin synthesis. On the most basic level this statement is well supported, because chitin grains are made in the cytoplasm in the Exp/Reb mutants. However, I think the field would be better served with a more nuanced consideration or the role of Reb/Exp. From the data presented, it seems that in the absence of Reb/Exp, the total amount of chitin produced is greatly reduced. I think it would be worth considering Exp/Reb, or the synthesis process in general, as having processivity or duty cycle or quality control such that in the absence of Exp/Reb while Kkv may make short chitin polymers, or occasional long polymers, the major production of chitin doesn't get going without Exp/Reb. Thinking of Reb/Exp as processivity factors in addition to export factors dramatically changes how one thinks of the proteins and the process of chitin synthesis. While these considerations can be handed with some discussion, it would be very interesting to look at the length of the chitin polymers in the Reb/Exp mutants and see if the average chain length is much reduced. This would help distinguish between Exp/Reb reving up the total number of Kkv molecules that produce chitin and Exp/Reb allowing the same number of Kkv molecules to stay active and produce much longer chitin chains. A caveat here is that I have no idea how hard this is to do, so I won't put this at the level of a required revision, but this result would significantly deepen the analysis in the paper.
      • In looking that the subcellular localization of the Kkv and Reb in regular and super resolution, the authors I think the authors missed an important, but straight forward way to gain insight into the apparent complementary distribution of Kkv and Exp/Reb. In stage 16 WT embryos, Kkv has a distinct ringed pattern that corresponds to the tanedial ridges (e.g. clearly visible in Fig. 6A and 6G). How those ridges are set up is unclear, although there are some interesting Turing-pattern models out there. One prediction might be that Exp/Reb should be in between the Kkv rings. If so, maybe Exp/Reb are key components of patterning chitin secretion to make this 3D patterned matrix? Alternatively, maybe Exp/Reb act on a smaller length scale and will match the Kkv ring pattern, just not overlapping with Kkv at the very fine scale. These are straightforward experiments and again could provide key insights into the function of Exp/Reb.
      • In general, most of the figures do not include WT or a control for comparison. This makes it hard for non-experts to assess what the effect of a mutation or condition is. For example, there are no examples of WT or Df(exp reb) in Figures 1-4. I realize this would increase the number of panels, but the paper would be more accessible if comparisons were within figures instead of comparing between main and supplementary figures and other papers.
      • To bolster the case the Exp/Reb directly regulate Kkv distribution, the authors should examine the distribution of Kkv in a catalytically null Kkv mutant, or drugs that block Kkv, or mutations in other genes required for Kkv activity to show that the altered distribution of Kkv in Exp/Reb mutants is a direct consequence of the lack of Exp/Reb rather than in indirect consequence of lack of extracellular chitin, which causes gross perturbations in the trachea. Also, are there differences in the distributions of Kkv in salivary glands with or without the presence of Exp/Reb? If Exp/Reb change the distribution of Kkv in the salivary glands, which normally do not express Kkv and presumably many other components of the chitin ECM system, this would be a powerful argument that there is a direct effect.

      Minor concerns.

      • Page 5 "These intracellular chitin punctae disappeared from stage 14, when chitin is then deposited extracellularly (Fig 1B')." Fig. 1B' is stage 15 embryos.
      • Page 5 "lead to tracheal morphogenetic defects". It would be helpful to the reader if the text or legend told the reader what they were looking for? Broken tubes? Inflated tubes? Variable tubes?
      • Fig. 1H. Main text says "co-expression of Kkv and expMH2/rebMH2 did not lead to tracheal morphogenetic defects (Fig 1H, ...". The tracheal dorsal trunk in Fig. 1H does not look WT. The legend does not state the stage, but the DT looks to have an enlarged diameter and it might be too long. Please present measurements on stage 16 trachea to confirm that there is no effect on tracheal morphology.
      • Fig. 3E there is a lot of GFP-Kkv that is not in co-localized with the KDEL marker. Can the authors clarify what compartment all the other staining is? ER?
      • Section 3.1. The authors imply that the WGTRE domain is specifically required for ER exit. However, an alternative is that absent the WGTRE domain, the protein just does not fold correctly, which would also preclude ER exit, but would be a different problem for the protein to make chitin if it isn't folded.
      • Page 15. I disagree with statement "At stage 16, control embryos showed a highly homogeneous apical distribution of Kkv in stripes, corresponding to the taenidial folds, and Kkv vesicles were largely absent (Fig 6G)." In Fig. 6G, the tandeal ring pattern is clearly visible, as are the fusion cells. If Kkv distribution were "highly homogeneous" these structures/pattern would not be visible.
      • Page 15. I also disagree with the characterization of the apical Kkv distribution in st 15 embryos. "In control embryos we detected a very uniform and homogenous pattern of apical Kkv (Fig 6I).". To my eye, the pattern is punctate and random for the clumps of stain, with the underlying beginnings of the tanidial pattern starting to be visible. The pattern appears neither uniform nor homogenous.
      • P16. The degree of order in the distribution of Kkv is overstated. The authors state that "The results of this analysis, showed that Kkv on the apical membrane, is evenly distributed following a regular pattern (Fig. 6L,L',L',M)." However, given that there is barely a visibly perceptible difference between the actual distribution of Kkv in 6L' and a calculated random distribution in 6L", and that the pattern is neither visibly even or regular, it would be more representative to say something to the effect that the analysis shows there is "underlying order" or "some degree of order" or a "non-random pattern". Visually, the key difference between 6L ' and L" is that there are fewer closely clustered Kkv dots. You could still have an uneven distribution of Kkv that maintains minimum spacing, which is a kind of ordered organization, but not one that would be assumed from the description. It would be helpful if the authors instead of just saying a "regular pattern" also stated the nature of the pattern they observe, i.e. Grid? Stripes? Minimum spacing?
      • Discussion. Another model for the role of Exp/Reb could be to bind and neutralize an inhibitor of Kkv activity. This would account complementary distribution of Kkv and Exp/Reb.
      • Fig. 6L. what tissue is being analyzed? Presumably trachea, but this should be specified as salivary glands are also mentioned in the legend.
      • Fig. 7 C models. I believe that the super resolution data is not accurately accounted for in the models. In both model 1 and model 2, Kkv and Exp/Reb are shown to be in close proximity, but the super resolution data suggests that most Kkv and Exp/Reb are separated hundreds of nanometers. Further, showing Kkv and Exp/Reb as touching was not supported by the coIP experiments, which failed to detect an interaction. It is possible that only a small fraction of Exp/Reb that is in close proximity to Kkv is active, but if so, this should be explicitly mentioned in the models to reconcile the data showing that Kkv and Exp/Reb are mostly not anywhere near each other.
      • -Image analysis. Please detail the criteria for "apical" and "basal" regions were the basis for freehand segmentation. What was counted as apical and what was basal?
      • Abstract and Introduction: The authors state that "We find that Kkv activity in chitin translocation, but not in polymerization, requires the activity of Exp/Reb, and in particular of its conserved Na-MH2 domain.", but then follow that with the statement that "Furthermore, we find that Kkv and Exp/Reb display a largely complementary pattern at the apical domain, and that Exp/Reb activity regulates the topological distribution of Kkv at the apical membrane." Many readers, will find the use of "furthermore" confusing because they will take furthermore as the about to be described data logically following the previous data, but then run headlong into the fact the Kkv and Exp/Reb show a complementary distribution, which does not obviously follow from Kkv activity requiring Exp/Reb. The authors could clarify this and highlight the interesting, unexpected and exciting nature of their results by replacing "Furthermore" with "Unexpectedly" or "Surprisingly", and emphasizing the important role of Exp/Reb in Kkv organization. Maybe something like: Unexpectedly, we find that although Kkv and Exp/Reb display largely complementary patterns at the apical domain, Exp/Reb activity nonetheless regulates the topological distribution of Kkv at the apical membrane.

      Significance

      The topic is interesting from the aspect of cell biology in terms of how a long polymer is created intracellularly, secreted and spatially organized to create a sophisticated extracellular matrix. The topic is also of general interest because chitin is central to the body plan of all insects, crustaceans and many other species, and chitin is of increasing interest as a biopolymer that could have extensive commercial uses.

      In addition to an informative structure/function analysis of the Kvv and Exp/Reb, the results identify what is, to my knowledge, the first regulator of the spatial organization of chitin sythase in insects and it unexpectedly shows a complementary pattern to the the synthase. This highlights just how little we understand about how complex extracellular matrixes are synthesized.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This paper deciphers very nicely the genetics and cellular events where and how chitin polymers become synthesized and translocated towards the apical cell membrane for further release into the extracellular space. Altogether this is fundamental work of high significance explaining how chitin is produced and released.

      The authors initially detected unusual intracellular chitin by overexpression of the chitin synthase kkv in tracheal cells before regular chitin deposition occurs. In addition, they recognized that the kkv gain of function mediated unusual intracellular chitin vesicles and in later stages in exp/reb mutants. These findings were the starting point of further experiments, suggesting that Kkv synthesizes chitin and that Kkv-mediated chitin deposition requires Exp/Reb activity to translocate and release chitin. Their genetic studies further show that chitin polymerization and translocation are uncoupled.

      All primary studies were tested in embryonic tracheal cells and as a proof of principle control in salivary glands which do not express chitin. Elegant rescue experiments in the mutant background showed that the Exp/Reb Nα-MH2 domains are required but not sufficient for chitin translocation and deposition and are dispensable for protein localization. Additionally, they identified another conserved domain (CM2) which is required Exp/Reb localization but not for chitin translocation. Similarly, they investigated Kkv domains by rescue experiments in kkv mutant embryos. They identified the WGTRE domain as essential for ER exit and the coiled-coil region for proper apical Kkv localization. Altogether they provide evidence that Kkv requires proper localization at the apical membrane, which is likely coordinated by Exp/Reb. This precise Kkv localization is linked with Kkv activity and chitin deposition. Therefore, this work is new and fundamental to many disciplines such as insect biology, chitin biology, cell & developmental biology, and others. Thus, the work is worth publishing but requires some changes from my point of view.

      Major Comments

      1. The authors nicely show that Kkv is able to synthesize chitin in a constitutive manner and that it can accumulate intracellularly. However, this needs some more input to understand the underlying biological sense. For example, what are the chitin vesicles' nature of early vesicles (st 13) and the unusual late (st15) chitin vesicles? Exocytosis, endocytosis or recycling? This can be clarified to understand chitin translocation and that synthesis and translocation are uncoupled. The authors tested rab5DN mutant salivary glands to exclude endocytosis. However, the chitin-positive vesicle size and amount in the rab5 mutant appear different from the control experiment, where much more intracellular chitin accumulates. Thus, it may suggest that some chitin vesicles are independent of Rab5-mediated endocytosis, others probably not. Indeed, the authors identified some KKv and some other chitin vesicles in all discussed intracellular processes; however, additionally, chitin appears to accumulate also in the cytoplasm. The authors conclude that Kkv protein might be able to polymerize chitin at all different intercellular stages, including endocytosis and degradation pathway. First, I wonder why chitin was found within membranous vesicles and, at the same time, within the cytoplasm. Second, does it make sense in the biological context when tracheal cells or other chitin-producing organs want to secrete chitin at the apical membrane while Kkv has the ability to produce chitin in all cellular areas, even in endosomes? In this context, another fundamental question concerning chitin secretion and subsequent organization could be investigated with the author's tools. Are the chitin vesicles loaded with chitin binding proteins or deacetylases that organize the formation of the nano and makro fibrillar chitin matrix in tracheal tubes? For example, previous research showed a reduced luminal accumulation of the 2A12 antigen in kkv mutants and expRNAi knockdown embryos.
      2. Observation of Extracellular kkv-GFP: does extracellular anti-GFP staining co-localize with the anti-Kkv antibody?
      3. Putative Kkv microvesicles: the authors state that extracellular GFP staining could be Kkv located in microvesicles. I wonder whether the observed extracellular GFP puncta contain a membrane or other membranous proteins.
      4. Fig.1:
      5. general remarks: some images of this figure could be improved by showing the single channels of CBP to judge whether chitin is secreted and/or vesicles appear. -In addition, some images show higher magnifications, others overview only. It would be beneficial to visualize the small vesicles additionally with higher magnifications.
      6. Fig.1M: This image is problematic due to the epidermal background staining. The tracheal system is hard to recognize. A single channel of Cbp ist not indicated.
      7. Fig. 1P: Apical membrane marker or any cytoplasmic marker would be extremely useful to judge subcellular Exp localization in this experiment - this image is hard to compare with Exp localization in Fig S2D.

      Fig. 2O: Apical/Basal accumulation, what are the numbers at the Y-axis?

      Fig. 3F: The authors state that the WGTRE domain is required for ER exit of Kkv based on colocalization studies with KDEL and FK2. However, the study with FK2 is not convincing as immunostainings are of poor quality. The GFP construct appears not to be expressed in all tracheal cells, and moreover, the FK2 staining is faint. Thus, judging whether the protein is not ubiquitinated from the presented image is challenging. However, it does not change the key message, Kkv does not exit ER. By the way, there is a new paper showing Serca to be essential for ER exit of Kkv, which would fit the discussion of the kkv domains.

      Fig. 7 - model: First, Since the authors do not show that Kkv is part of a membranous microvesicle, I'm skeptical whether this should be part of a model that explains the shown data. Therefore, I'm asking the authors to delete it or to show it more clearly. Second, the meaning of the yellow arrowheads is not indicated. Third, the explanation in the legend is sound, but showing the two options could be improved.

      Minor comments:

      1. Missing reference: Chirin has been recognized importance in physiology (Zhao et al., 2019; Zhu et al., 2016) but also as a biomaterial (? Reference). Suggestion: DOI: 10.3390/ma15031041 (Improving Polysaccharide-Based Chitin/Chitosan-Aerogel Materials by Learning from Genetics and Molecular Biology) This paper discusses the current usage and potential of chitin as a biomaterial in many disciplines.)
      2. 3.1, second paragraph final sentence: double point

      Referees cross-commenting

      Rev#1 asks the same questions as I do. Technical questions and the idea to compare endogenous Kkv.The same is true with Rev#3. Overlapping questions concerning technical things about figure illustration and clarity of presented stainings. Altogether, the criticism will improve the manuscript

      Significance

      This paper deciphers very nicely the genetics and cellular events where and how chitin polymers become synthesized and translocated towards the apical cell membrane for further release into the extracellular space. Altogether this is fundamental work of high significance explaining how chitin is produced and released.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, the authors provide data on the function of exp/reb and Kkv in chitin deposition. They show chitin polymerization and deposition are uncoupled and exp/reb are required for the deposition of the chitin by regulating the distribution of kkv at the apical membrane. However, there is no direct interaction between Kkv and Exp/reb. The functional analysis of Kkv and Exp/reb is interesting.

      The overexpression lines are used throughout the manuscript to analyze protein functions. Since ectopic expression of kkv and exp leads to chitin synthesis and deposition. Authors use this overexpression system to analyze the functional domain of kkv and Exp/reb. It is reasonable. However overexpression line might not represent the endogenous protein perfectly, it might cause some issues to answer certain questions.

      Major comments

      1. Fig. 4 Does ectopic overexpression of Kkv-GFP have the same expression pattern as the endogenous Kkv? The overexpression line may lead to ectopic expression. the colocalization of endogenous Kkv and intracellular vesicles would be more accurate.
      2. Are Kkv and Exp/reb expressed at the same time endogenously? If kkv is expressed earlier than Exp, can intracellular chitin be detected in wild-type embryos at early stages? Fig. 1b shows overexpression of Kkv at S13 has intracellular chitin (exp is not expressed at this stage).
      3. Fig. 1B no intracellular chitin is detected. Fig. 1H intracellular chitin is detected. Does Overexpression of exp-MH2 interfere with the endogenous Exp function?
      4. For measurement, some detailed info is needed, for example, what is your area of interest?

      Minor comments:

      1. Fig. 2 and Fig. 3. how do you define the region of apical and basal? An apical marker is needed here. N is the total number of embryos or the number of sections in the same embryo?
      2. Fig. 5H What is your area of interest to measure vesicles? Which tracheal segment do you measure? Some details need to be provided here.
      3. Fig. 5 what is your area of interest when you measure Kkv?

      Significance

      This work further advance the knowledge about chitin synthesis and deposition

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      All three reviewers demonstrated similar concerns and provided clear guidance on potential revisions. All three reviewers recognized the importance of our study to the mitosis, genome instability and DNA damage fields, and identify limitations regarding potential therapeutic implications of the study. To mitigate these limitations and extend the breadth of our study, the revised manuscript now includes colony formation assays to more directly evaluate the impact of mitotic DNA damage therapies in cancer cell proliferation. It also includes several of the experimental suggestions/clarifications proposed by the reviewers. Lastly, we include here a clear plan of additional experiments that we agree to conduct.

      2. Description of the planned revisions

      Reviewer #1

      Major comments:

      1) In the opinion of the reviewer, the study is somewhat unbalanced as it starts with a high throughput analysis of a large number of compounds but only etoposide treatment is investigated in detail in the key experiments shown in Figures 5 and 6. The effect of topoisomerase II inhibitor on kinetochore-MT stability has already been demonstrated by Bakhoum et al, 2014. If authors wish to generalize that similar phenotypes are observed after various types of DNA damage, they should test additional compounds (such as Lomustine, Mitomycin C, and Carboplatin). In addition, measuring of the kinetochore-MT half-life in figure 5 should be performed with better time resolution within the first 5 minutes. This would allow better comparison of the measured half lives that are much shorter than 5 minutes.

      R: We agree with the reviewer and we will perform additional measurements of kinetochore microtubule half-life with these different compounds and include shorter time points to increase the temporal resolution within the first 5 min.

      3) Involvement of Kid and Kif4A in arm ejection of the polar chromosomes is an interesting observation in context of mitotic DNA damage. However, it is unclear how the distribution of the chromokinesines was evaluated in Figure 6A-D. Was the signal quantified at the metaphase plate or at polar chromosomes? It seems that Kif4A localizes to the polar chromosome caused by etoposide treatment whereas no signal is visible in DMSO control (Fig. 6C).

      R: Since whole cells were exposed to DNA damage, we had previously quantified total fluorescence intensity of Kid and Kif4a on all chromosomes (aligned and unaligned). We nevertheless concede that some chromosomes might have been more exposed (or are more susceptible) to DNA damage and we will therefore provide a quantification of the fluorescence intensity ratio of Kid and Kif4A levels on polar chromosomes relative to the metaphase plate, with and without mitotic DNA damage.

      4) Authors convincingly showed that SAC is activated by mitotic damage and this is also consistent with previous reports. However they did not address if DDR pathways contribute to the activation of SAC. This would be interesting especially in context of a recent report that showed Bub3 as a direct substrate of ATM (Xiao et al. 2022, JBC). I wonder if the polar chromosomes are formed and missegregate also in the absence of ATM activity.

      R: This is an interesting suggestion and we have already obtained data from one experiment regarding the formation of polar chromosomes upon ATM inhibition. We will perform additional independent validation of these data and include our findings in the fully revised manuscript.

      Reviewer #2

      Major comments:

      …They need to present the % of cells with polar chromosomes, and it would also be informative to understand the rate of cells with lagging chromosomes, or that underwent anaphase with polar chromosomes with and without chromokinesin depletion.

      R: We will quantify the frequency of the different segregation errors upon DNA damage, with and without Chromokinesin depletion.

      It would be very helpful for them to provide a schematic model between DNA damage, overstable microtubules, satisfied SAC, monotelic attached chromosomes, and the role of chromokinesins. At present these connections are very unclear.

      R: We thank the reviewer for drawing attention to this point. We will provide a step-by-step model in the fully revised manuscript.

      Reviewer #3

      Major comments:

      According to Figure 2C, the ratio of "Exit with micronuclei (from misaligned chromosome(s))" is relatively low compared to other phenotypes such as "Mitotic arrest" or "Cell death." I wonder if polar chromosome phenotype is also correlated with these other cell fates. Please clarify which fate is correlated with polar chromosome formation after DNA damage.

      R: We will provide these correlations. We already know that those cells that arrest in mitosis is due to misaligned chromosomes. We will also perform the correlation between cells that died and the presence of misaligned chromosomes.

      In Figure 3, the authors used Nocodazole-treated background to assess the involvement of SAC in DNA-damaging compound-induced mitotic delay. However, as shown in Figure 2B, DNA-damaging compounds cause a minor delay in mitosis, which might be challenging to analyze in the presence of Nocodazole. There is also a possibility that DNA damage response (DDR) works independently and adjunctly to delay mitosis. Because one of the major claims of the authors is that "the SAC is the only mechanism that is required to delay mitosis in the presence of long-term mitotic DNA damage (page 10, line278)", I recommend Nocodazole wash-out (as in Figure 2B) to examine the effect of MPS1-IN-1 (and ideally an inhibitor of the DDR pathway, such as ATMi) on mitotic delay induced by DNA-damaging compounds.

      R: We now clarify that the observed mitotic delay in the presence of DNA damaging compounds occurred after nocodazole washout. As so, nocodazole was no longer present in the system. We also draw the attention that DNA damage in the presence of nocodazole, a condition that promotes maximal SAC activity, was fully dependent on MPS1 activity (Figure 4A). We have also obtained data from one experiment regarding the formation of polar chromosomes after nocodazole wash-out and ATM inhibition. We will perform additional independent validation of these data and include our findings in the fully revised manuscript.

      Figure 6E-G: I wonder whether siKid+siKif4a affected %polar chromosomes or not.

      R: We will perform this experiment and include the results addressing this point in the fully revised manuscript.

      Page 10, line 287: the authors claim that "we show that long-term mitotic DNA damage..., causing the missegregation of polar chromosomes due to the action of arm-ejection forces by chromokinesisns,...." However, only Mad1 localization data is provided in Figure 6E-G, and whether siKid + siKif4a rescues the missegregation of polar chromosomes is not clear. The authors should either provide supporting evidence or revise this sentence for clarity.

      R: We will determine whether Kid+Kif4a depletion rescues the missegregation of polar chromosomes (i.e. reduces the frequency of cells that exit with polar chromosomes in the presence of mitotic DNA damage).

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1

      Major comments:

      2) Authors claim that the observed phenotype of the chromosomal missegregation following the mitotic DNA damage occurs specifically in cancer cells but the data supporting this statement is poor. They need to show more data in non-transformed cells or remove the statement.

      R: Data from non-transformed RPE-1 cells are now included in the revised manuscript.

      Minor comments:

      1) Specificity of Aurora-B pT232 antibody should be validated to exclude cross-reactivity with Aurora-A at spindle pole (Fig S8C)

      R: Antibody specificity against active Aurora A and B was validated with specific kinase inhibitors. These data are now included in the revised manuscript.

      Reviewer #2

      Major comments:

      1. The term 'long-term DNA damage during mitosis' is confusing. DNA damage occurs when? (before, or during mitosis?). Their live cell data shows they are following cells that underwent mitosis within a certain time window after damage occurred, but it is not clear if it occurs only before, or sometimes during mitosis.

      R: We realize that this was indeed confusing. The only experiment where mitotic DNA damage might have been introduced before mitosis is in Figure 1A-C. All other experiments were directly or indirectly based on live-cell data in which only cells that were already in mitosis when they were exposed to DNA damage were analysed. Because these live-cell experiments revealed that DNA damage before mitosis prevented mitotic entry at the used concentration of the DNA-damaging agents, mitotic cells scored upon DNA damage in fixed cell experiments must have been already in mitosis when DNA damage was applied. We also now include a scheme of each experimental set up in the respective figures to clearly indicate how the data was obtained and to facilitate the respective interpretations. This is now clarified in the main text.

      All damage occurred in cells treated with nocodazole - could this have impacted the results? Was similar DNA damage induced if cells were arrested in monastrol/STLC, or MG132 for example? They show the distal Mad2 data in STLC but not the damage. Also Figure S4A/B appears to be from one experiment only which makes it difficult to interpret.

      R: We have additionally induced DNA damage when the cells were arrested in STLC. yH2AX levels as inferred by western blot analysis were indistinguishable from cells treated with nocodazole. These data are now included in the revised manuscript. We also clarify that data presented in previous Figure S4 are from 2 independent experiments.

      Why not show the RPE1 data for polar chromosomes?

      R: Data from non-transformed RPE-1 cells are now included in the revised manuscript.

      Confusing interpretation - monotelic attachments, yet also stable attachments,..? Please can they clarify what is meant by these terms.

      R: Monotelic attachments in which a single chromatid is attached to microtubules are intrinsically unstable due to the lack of centromeric tension. However, mitotic DNA damage alters this scenario and stabilizes monotelic attachments. We have now clarified this point in the main text.

      They focus most of the study on understanding why polar chromosomes arise after DNA damage. However, this phenotype seems to be a relatively minor effect. Eg. In Figure 2b only a few cells exhibit very extended mitosis, and in Figure 2c only a small percentage exit mitosis with misaligned chromosomes. Furthermore, in Figure 4B the percentage of polar chromosomes with only distal Mad2 is low. In Figure 4A the images are not clear whether a 'both' or 'distal' is being shown. It does not seem as if any are distal only, and an example of this would be helpful.

      R: We thank the reviewer for alluding to this important point. The frequency of cells with polar chromosomes with/without DNA damage is indicated in Figure 2C. The respective duration of mitosis due to polar chromosomes is clearly and significantly increased as shown in Figure 2B. In fixed material we chose 70 min after nocodazole washout in these experiments because there is no difference in the frequency of cells with polar chromosomes between DMSO and DNA-damage-treated cells, allowing a direct comparison of the types of kinetochore-microtubule attachments on polar chromosomes only (please see new Figure 5). We now clarify that beyond this time frame, only DNA-damage-treated cells show polar chromosomes. We also draw the reviewer’s attention to the B&W panels where the different types of attachments are clearly highlighted.

      It is also not clear what 'other' means in Figure 2C.

      R: The phenotypes classified as ‘other’ in Figure 2C are detailed in Figure S1C. This was indicated in the figure 2 legend, but we have now clarified it also in the main text.

      They conclude that chromokinesin depletion rescues the polar chromosomes phenotype. However they do not directly assess the rate of polar chromosome formation, only the % of polar chromosomes that are only distal Mad2 as far as I can see?

      R: Similar to reviewer 1, we thank this reviewer for alluding to this important point. The frequency of cells with polar chromosomes with/without DNA damage is indicated in figure 2C. The respective duration of mitosis due to polar chromosomes is clearly and significantly increased as shown in Figure 2B. In fixed material we chose 70 min after nocodazole washout in these experiments because there is no difference in the frequency of cells with polar chromosomes between DMSO and DNA-damage-treated cells, allowing a direct comparison of the types of kinetochore-microtubule attachments (please see new Figure 5). We now clarify that beyond this time frame, only DNA-damage-treated cells show polar chromosomes.

      It is not clear from how many experiments data are shown for the Mad1 distal experiments in Figures 4 and S4 (there are no error bars, so is this one experiment only?). This should be indicated in figure legends, and repeated if performed only once.

      R: Data presented in Figure 4 is a pool from 3 independent experiments and data presented in Figure S4 is a pool from 2 independent experiments. For these reasons, there are no error bars to include. This is now clarified in the figure legends.

      Reviewer #3

      Major comments:

      1. Page 6, line 155: the authors claim that "In contrast, among other defects, treatment with any of the DNA-damaging compounds caused a significant mitotic delay due to the presence of misaligned chromosomes near the spindle poles." Although Figure 2A shows a representative image of polar chromosomes, I do not find quantitative data that analyze %polar chromosomes in mitosis treated with DNA-damaging compounds. I also do not find the data supporting the claim that polar chromosomes caused a mitotic delay. Because most subsequent analyses were performed based on this result, the quantitative data should be provided here. For the latter, I suggest showing "time in mitosis (Fig 2B)" separately with or without polar chromosomes.

      R: The frequency of cells with polar chromosomes with/without DNA damage is indicated in figure 2C. The respective duration of mitosis due to polar chromosomes is clearly and significantly increased as shown in Figure 2B. In fixed material we chose 70 min after nocodazole washout in these experiments because there is no difference in the frequency of cells with polar chromosomes between DMSO and DNA-damage-treated cells, allowing a direct comparison of the types of kinetochore-microtubule attachments (please see new Figure 5). We now clarify that beyond this time frame, only DNA-damage-treated cells show polar chromosomes. We now highlight in figure 2C what fraction of cells underwent mitotic arrest due to polar chromosomes, as well as those that exited mitosis with polar chromosomes.

      In Figure 3, the authors used Nocodazole-treated background to assess the involvement of SAC in DNA-damaging compound-induced mitotic delay. However, as shown in Figure 2B, DNA-damaging compounds cause a minor delay in mitosis, which might be challenging to analyze in the presence of Nocodazole. There is also a possibility that DNA damage response (DDR) works independently and adjunctly to delay mitosis. Because one of the major claims of the authors is that "the SAC is the only mechanism that is required to delay mitosis in the presence of long-term mitotic DNA damage (page 10, line278)", I recommend Nocodazole wash-out (as in Figure 2B) to examine the effect of MPS1-IN-1 (and ideally an inhibitor of the DDR pathway, such as ATMi) on mitotic delay induced by DNA-damaging compounds.

      R: We now clarify that the observed mitotic delay in the presence of DNA damaging compounds occurred after nocodazole washout. As so, nocodazole was no longer present in the system. We also draw the attention that DNA damage in the presence of nocodazole, a condition that promotes maximal SAC activity, was fully dependent on MPS1 activity (Figure 4A).

      Line 226, (our unpublished observations): because the authors claim that "the formation of polar chromosomes due to the stabilization of kinetochore-microtubule attachments upon long-term mitotic DNA damage is likely exclusive to cancer cells," the authors should present data on RPE-1 cells at least for %polar chromosome formation (as suggested in comment 1) and Mad1 localization. Plus, even though the data is provided, the statement "exclusive to cancer cells (page 8, line 230)" is speculative and should be toned down. Mad1 localization data is also important because the authors claim that "long-term mitotic NA damage specifically stabilized kinetochore-microtubule attachments in cancer cells (page 10, line 288)" in the discussion.

      R: Data from non-transformed RPE-1 cells are now included in the revised manuscript.

      For the Mad1 assay, such as in Fig. 4A, the authors analyzed the CENP-C pair with two or one Mad1 foci formation. However, in some representative pictures, for example, Fig S4A-Etoposide, I found pairs of CENP-C signals on the polar chromosome without any Mad1 foci (the one next to the pairs shown in the square). As the authors argue, these kinetochores may represent polar chromosomes that eventually satisfy SAC and may be important. I, therefore, wonder why those kinetochores are omitted from the assay. Please explain this point in the manuscript if there is any reason.

      R: We have now provided a clearer example and clarified in the main text that only chromosomes outside the spindle area were considered polar chromosomes.

      Minor comments:

      Page 7, line 168: the authors claim that "regardless of the type of DNA lesion, long-term mitotic DNA damage persists throughout mitosis and promotes micronuclei formation from polar chromosomes." However, the former claim is not fully supported by Figure S3, which addressed the effect of Etoposide only; the latter claim is not fully supported by Figure 2C, which lacks clarity (as pointed out in comment 2) and statistical analysis. Please revise this sentence.

      R: We now present the levels of yH2AX after treatment with Lomustine, Mitomycin C, and Carboplatin and compared it with DMSO-treated controls and Etoposide. We also include statistics for the cell fate and respective chromosome segregations errors after treatment with the different DNA damaging agents.

      Line 182: it would be helpful for readers to explain why MG132 was used.

      R: This is now explained in the main text.

      Line 210: it would be helpful for readers to explain briefly what PA-GFP means and how the assay works.

      R: This is now explained in the main text.

      Figure 1E: some color codes for each compound are difficult to distinguish. I also found it challenging to locate some lines on the graph. I recommend separating this graph, for example, by types of DNA lesions caused by compounds, and color codes that are easy to distinguish should be used.

      R: We have now changed the most confusing colors and provide a higher temporal resolution chart in the low yH2AX region to facilitate visualization.

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      R: None

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the manuscript entitled "Long-term mitotic DNA damage promotes chromokinesin-mediated missegregation of polar chromosomes in cancer cells," the authors propose that DNA damage on mitotic chromosomes causes chromokinesin-mediated polar chromosomes, which eventually results in missegregation and micronuclei formation. They first performed screening of compounds that cause DNA damage on mitotic chromosomes and found that DNA damage delayed mitosis in the nocodazole wash-out experiment. The authors found that several DNA damage-inducing compounds all caused an increase of asymmetric Mad1 localization on polar chromosomes. Using photoactivatable GFP-a-tubulin, the authors showed that a-tubulin stabilizes after Etoposide treatment. They finally showed that chromokinesin Kid and Kif4a knockdown rescues the asymmetric Mad1 localization.

      Major comments:

      1. Page 6, line 155: the authors claim that "In contrast, among other defects, treatment with any of the DNA-damaging compounds caused a significant mitotic delay due to the presence of misaligned chromosomes near the spindle poles." Although Figure 2A shows a representative image of polar chromosomes, I do not find quantitative data that analyze %polar chromosomes in mitosis treated with DNA-damaging compounds. I also do not find the data supporting the claim that polar chromosomes caused a mitotic delay. Because most subsequent analyses were performed based on this result, the quantitative data should be provided here. For the latter, I suggest showing "time in mitosis (Fig 2B)" separately with or without polar chromosomes.
      2. According to Figure 2C, the ratio of "Exit with micronuclei (from misaligned chromosome(s))" is relatively low compared to other phenotypes such as "Mitotic arrest" or "Cell death." I wonder if polar chromosome phenotype is also correlated with these other cell fates. Please clarify which fate is correlated with polar chromosome formation after DNA damage.
      3. In Figure 3, the authors used Nocodazole-treated background to assess the involvement of SAC in DNA-damaging compound-induced mitotic delay. However, as shown in Figure 2B, DNA-damaging compounds cause a minor delay in mitosis, which might be challenging to analyze in the presence of Nocodazole. There is also a possibility that DNA damage response (DDR) works independently and adjunctly to delay mitosis. Because one of the major claims of the authors is that "the SAC is the only mechanism that is required to delay mitosis in the presence of long-term mitotic DNA damage (page 10, line278)", I recommend Nocodazole wash-out (as in Figure 2B) to examine the effect of MPS1-IN-1 (and ideally an inhibitor of the DDR pathway, such as ATMi) on mitotic delay induced by DNA-damaging compounds.
      4. Line 226, (our unpublished observations): because the authors claim that "the formation of polar chromosomes due to the stabilization of kinetochore-microtubule attachments upon long-term mitotic DNA damage is likely exclusive to cancer cells," the authors should present data on RPE-1 cells at least for %polar chromosome formation (as suggested in comment 1) and Mad1 localization. Plus, even though the data is provided, the statement "exclusive to cancer cells (page 8, line 230)" is speculative and should be toned down. Mad1 localization data is also important because the authors claim that "long-term mitotic NA damage specifically stabilized kinetochore-microtubule attachments in cancer cells (page 10, line 288)" in the discussion.
      5. For the Mad1 assay, such as in Fig. 4A, the authors analyzed the CENP-C pair with two or one Mad1 foci formation. However, in some representative pictures, for example, Fig S4A-Etoposide, I found pairs of CENP-C signals on the polar chromosome without any Mad1 foci (the one next to the pairs shown in the square). As the authors argue, these kinetochores may represent polar chromosomes that eventually satisfy SAC and may be important. I, therefore, wonder why those kinetochores are omitted from the assay. Please explain this point in the manuscript if there is any reason.

      Minor comments:

      1. Page 7, line 168: the authors claim that "regardless of the type of DNA lesion, long-term mitotic DNA damage persists throughout mitosis and promotes micronuclei formation from polar chromosomes." However, the former claim is not fully supported by Figure S3, which addressed the effect of Etoposide only; the latter claim is not fully supported by Figure 2C, which lacks clarity (as pointed out in comment 2) and statistical analysis. Please revise this sentence.
      2. Line 182: it would be helpful for readers to explain why MG132 was used.
      3. Line 210: it would be helpful for readers to explain briefly what PA-GFP means and how the assay works.
      4. Figure 6E-G: I wonder whether siKid+siKif4a affected %polar chromosomes or not.
      5. Page 10, line 287: the authors claim that "we show that long-term mitotic DNA damage..., causing the missegregation of polar chromosomes due to the action of arm-ejection forces by chromokinesisns,...." However, only Mad1 localization data is provided in Figure 6E-G, and whether siKid + siKif4a rescues the missegregation of polar chromosomes is not clear. The authors should either provide supporting evidence or revise this sentence for clarity.
      6. Figure 1E: some color codes for each compound are difficult to distinguish. I also found it challenging to locate some lines on the graph. I recommend separating this graph, for example, by types of DNA lesions caused by compounds, and color codes that are easy to distinguish should be used.

      Referees cross-commenting

      I generally agree with other reviewers' comments and confirmed that they raised similar concerns.

      Significance

      It has been described previously that mitotic arrest induces DNA damage and that the DDR pathway during mitosis is attenuated. The data presented in this manuscript provide a potentially novel cellular response against DNA damage during mitosis. The manuscript will be of interest to those in the field of the cell cycle (especially mitosis), the DDR, and tumor chemotherapies. While the finding that DNA damage during mitosis causes polar chromosomes is potentially interesting, the manuscript is still rather descriptive, and data that address the molecular mechanism is insufficient for the level that the authors conclude. Although the data quality is high, I think some essential data supporting their conclusion and clarity of the description are missing from the manuscript, which can be addressed before publication.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Novais-Cruz et al present an interesting and generally well-performed study on the impact of DNA damage just prior to mitosis on chromosome segregation fidelity. Overall, the experiments are performed and presented to a high standard, and the key findings are potentially of interest. However, in its present form the manuscript is overall quite confusing and it is difficult to assess the robustness of their conclusions. Their observations and mechanistic model connecting the observations is not clear at all in the current form. Several key pieces of data are missing that would help craft the story, and more explanation is needed to connect their overarching hypothesis to the data better. The below points serve as an illustration of the missing information that would be needed in order to make a proper judgement of whether the data support their main conclusions.

      1. The term 'long-term DNA damage during mitosis' is confusing. DNA damage occurs when? (before, or during mitosis?). Their live cell data shows they are following cells that underwent mitosis within a certain time window after damage occurred, but it is not clear if it occurs only before, or sometimes during mitosis.
      2. All damage occurred in cells treated with nocodazole - could this have impacted the results? Was similar DNA damage induced if cells were arrested in monastrol/STLC, or MG132 for example? They show the distal Mad2 data in STLC but not the damage. Also Figure S4A/B appears to be from one experiment only which makes it difficult to interpret.
      3. Why not show the RPE1 data for polar chromosomes?
      4. Confusing interpretation - monotelic attachments, yet also stable attachments,..? Please can they clarify what is meant by these terms.
      5. They focus most of the study on understanding why polar chromosomes arise after DNA damage. However, this phenotype seems to be a relatively minor effect. Eg. In Figure 2b only a few cells exhibit very extended mitosis, and in Figure 2c only a small percentage exit mitosis with misaligned chromosomes. Furthermore, in Figure 4B the percentage of polar chromosomes with only distal Mad2 is low. In Figure 4A the images are not clear whether a 'both' or 'distal' is being shown. It does not seem as if any are distal only, and an example of this would be helpful.
      6. It is also not clear what 'other' means in Figure 2C.
      7. They conclude that chromokinesin depletion rescues the polar chromosomes phenotype. However they do not directly assess the rate of polar chromosome formation, only the % of polar chromosomes that are only distal Mad2 as far as I can see? They need to present the % of cells with polar chromosomes, and it would also be informative to understand the rate of cells with lagging chromosomes, or that underwent anaphase with polar chromosomes with and without chromokinesin depletion.
      8. It would be very helpful for them to provide a schematic model between DNA damage, overstable microtubules, satisfied SAC, monotelic attached chromosomes, and the role of chromokinesins. At present these connections are very unclear.
      9. It is not clear from how many experiments data are shown for the Mad1 distal experiments in Figures 4 and S4 (there are no error bars, so is this one experiment only?). This should be indicated in figure legends, and repeated if performed only once.

      Significance

      Novais-Cruz et al present an interesting and generally well-performed study on the impact of DNA damage just prior to mitosis on chromosome segregation fidelity. Overall, the experiments are performed and presented to a high standard, and the key findings are potentially of interest to the mitosis, and genomic instability fields.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In the presented manuscript, authors investigated consequences of DNA damage on progression through mitosis. In agreement with previous reports, they observed a mitotic delay that was dependent on activation of the spindle assembly checkpoint (SAC). Starting with a high throughput screening authors concluded that the SAC-dependent mitotic delay is a common feature and is not limited to a specific type of DNA damage. They followed with a more detailed analysis using selected compounds and showed that mitotic DNA damage promotes formation of polar chromosomes with stable kinetochore-microtubule attachments. The live-cell imaging revealed that the cells carrying DNA damage eventually exited mitosis with the misaligned chromosomes forming micronuclei in the daughter cells. Most of the conclusions are supported by the experimental data with some exceptions detailed below.

      Major comments:

      1. In the opinion of the reviewer, the study is somewhat unbalanced as it starts with a high throughput analysis of a large number of compounds but only etoposide treatment is investigated in detail in the key experiments shown in Figures 5 and 6. The effect of topoisomerase II inhibitor on kinetochore-MT stability has already been demonstrated by Bakhoum et al, 2014. If authors wish to generalize that similar phenotypes are observed after various types of DNA damage, they should test additional compounds (such as Lomustine, Mitomycin C, and Carboplatin). In addition, measuring of the kinetochore-MT half-life in figure 5 should be performed with better time resolution within the first 5 minutes. This would allow better comparison of the measured half lives that are much shorter than 5 minutes.
      2. Authors claim that the observed phenotype of the chromosomal missegregation following the mitotic DNA damage occurs specifically in cancer cells but the data supporting this statement is poor. They need to show more data in non-transformed cells or remove the statement.
      3. Involvement of Kid and Kif4A in arm ejection of the polar chromosomes is an interesting observation in context of mitotic DNA damage. However, it is unclear how the distribution of the chromokinesines was evaluated in Figure 6A-D. Was the signal quantified at the metaphase plate or at polar chromosomes? It seems that Kif4A localizes to the polar chromosome caused by etoposide treatment whereas no signal is visible in DMSO control (Fig. 6C).
      4. Authors convincingly showed that SAC is activated by mitotic damage and this is also consistent with previous reports. However they did not address if DDR pathways contribute to the activation of SAC. This would be interesting especially in context of a recent report that showed Bub3 as a direct substrate of ATM (Xiao et al. 2022, JBC). I wonder if the polar chromosomes are formed and missegregate also in the absence of ATM activity.

      Minor comments:

      1. Specificity of Aurora-B pT232 antibody should be validated to exclude cross-reactivity with Aurora-A at spindle pole (Fig S8C)

      Significance

      This study addresses the impact of mitotic DNA damage on chromosome segregation which is an important but largely unexplored topic. Authors extend earlier observations by Bakhoum et al, 2014 and demonstrate that also the misaligned polar chromosomes result in formation of micronuclei and may promote chromosomal instability. The study will be of interest mainly for the mitosis filed. The possibility that the described phenotype may have implications for cancer therapies is interesting but will surely require more detailed studies.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01406

      Corresponding author(s): Harris, Reuben

      1. General Statements [optional]

      We would like to thank all three reviewers for their time and thorough assessment of our manuscript. We appreciate their constructive feedback and believe our work has been considerably strengthened by addressing the comments, suggestions, and concerns raised during peer review. In the following responses, we address the reviewer’s critiques point-by-point.

      2. Point-by-point description of the revisions

      Reviewer 1

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Moraes et al. build upon their recent studies of APOBEC3 antagonism by EBV BORF2 by showing that additional RNR subunits encoded by other herpesviruses share this activity, suggesting that the host-virus arms races involving APOBEC3 proteins is more widespread than previously thought. Furthermore, the authors show that herpesviruses infecting primates that lack A3B (New World Monkeys) do not apparently exhibit a capacity to antagonize human A3B, suggesting that this function was not required during the evolution of those viruses (while it seemingly was important for viruses infecting hosts that encode A3B). Overall, this is a technically sound submission that combines confocal immunofluorescence, co-immunoprecipitation, and enzymatic assays to comprehensively test the sensitivity of different A3s to counteraction by viral RNRs. The enzymatic (deamination) assays performed prove to be the most insightful, since co-IP and colocalization microscopy was not entirely sufficient to reveal which domains of A3 are important for targeting by RNRs. It is well-written, well-organized, and well-referenced, and will be of interest to readers who study APOBEC3s, herpesviruses, and host-virus arms races more generally.

      Response: Thank you for appreciating the technical aspects and broader impact of our studies.

      Major points:

      1. Figure 4B: the fluorescence microscopy data does not match well with the Co-IP (in Figure 4A). For example, L1B in A3A enhances the A3A-BORF2 co-IP but no clear differences are observed in colocalization. Or are the authors claiming the presence of L1B results in greater colocalization between A3A and BORF2, because there is slightly less diffuse BORF2 in the cytoplasm under these conditions? If that is the case, then quantitative colocalization analysis will need to be performed. In general, virtually none of the colocalization analysis in Figure 4B matches well with the co-IP results of Figure 4A. The authors take this to suggest that L7, and not L1, is most important determinant for BORF2 binding to A3s, but in that case, then the colocalization data is disconnected functionally from co-IP results. This is not necessarily a large problem, since the authors ultimately test the enzymatic activity of A3s in the presence of different RNRs. These latter functional experiments more objectively define what regions of A3s are important for antagonism by RNRs.

      Response: Thank you for giving us an opportunity to clarify these important points. We understand how the fluorescence microscopy data in Figure 4B may at first appear to disagree with the co-IP results in Figure 4A. However, we would like to point out that WT A3A—which has a shorter L1 region and binds less strongly to BORF2 compared to A3B—is nevertheless efficiently relocalized by BORF2 (PMID 35476445, 31534038, 31493648). We believe that this observation can be explained by compensatory avidity interactions during cytoplasmic aggregate formation in living cells, a process mediated by the formation of a non-canonical BORF2-BORF2 dimer as detailed in our recent cryo-EM studies (PMID 35476445). These avidity interactions explain how a weaker interaction (as indicated by weaker co-IP levels) can still result in the formation of large cytoplasmic aggregates. We have therefore revised our text to explain this apparent incongruity (page 11, lines 10-13).

      Can the authors discuss/cite more about the actual subcellular compartments that the A3s are being relocated towards by the RNPs? In general, the authors' comments are limited to whether the A3 is predominantly in the nucleus, or not.

      Response: Previous imaging studies with markers for cytoplasmic organelles by our lab suggested that BORF2-A3B aggregates accumulate within the endoplasmic reticulum (ER) (PMID 30420783). However, in our recent cryo-EM studies of the BORF2-A3B complex (PMID 35476445), we discovered that disrupting BORF2-BORF2 dimerization prevents aggregate formation but does not affect EBV BORF2’s ability to bind to A3B and relocalize the complex to the cytoplasmic compartment. In other words, dimerization-deficient mutants of BORF2 clearly cause A3B-BORF2 heterodimers to appear diffusely cytoplasmic. Therefore, we no longer have a reason to implicate the ER and we aim to clarify this in future studies aiming to define the full molecular composition of the large cytoplasmic aggregates.

      Since the authors draw a connection between the absence of A3B in New World Monkeys and the fact that New World Monkey-specific viruses don't seem to counteract A3s, can the authors discuss what could be learned by studying human individuals who lack A3B and the evolution of herpesviruses in those individuals?

      Response: This is a very interesting point, but we would prefer not to speculate on this in our manuscript. Although there is indeed an A3B deletion allele in the human population (predominantly southeast Asia), its worldwide allele frequency is quite low and most people still have 1 or 2 copies of this antiviral gene. Thus, the deletion allele frequency is not high enough to remove the selective pressure on the virus to maintain A3B counteraction activity through its RNR.

              However, we did discover one Old World monkey species that completely lacks *A3B* (*Colobus angolensis*). We showed that the RNR from the gamma-herpesvirus that infect these monkeys (ColHV-1) lacks the ability to antagonize human A3B, ancestral A3B, human A3A, or the endogenous A3A of its natural host (__Figure 8__ and __Figure 8—figure supplement 3__). Thus, as you predicted, relieving the selective pressure within a species over an evolutionary period of time likely resulted in loss of A3B-antagonism activity by the viral RNR (page 19, lines 8-16).
      

      Minor points:

      1. I'm not sure it makes sense to call out Figures 1A-D in the Introduction section, rather than the Results section.

      Response: We have changed the Introduction and removed the original Figure 1 completely. We have however added a new Figure 1, which provides a structural rationale for our overall experimental approach.

      Reviewer #1 (Significance (Required)):

      This work represents a step-wise advance from the authors' previous work on herpesvirus RNPs and counteraction of host APOBEC3s. I study host-virus molecular arms race on evolutionary scales and this article is of interest and significance to me, and I assume to others in the field as well. The findings found within the submission are interesting but not necessarily informative about human health and disease. However, the subsequent work that this manuscript inspires is likely to tell us more about herpesvirus evolution in human patients and the mechanisms by which APOBEC3s promote cancer.

      Response: We thank you again for appreciating the broader significance of our work and how the results present here may inspire important future studies.

      Reviewer 2

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: Building off the groups prior work on A3B and EBV BORF2 interactions, here they have expanded their studies to examine additional herpesvirus RNRs, demonstrating which features are conserved. Using a combination of IP experiments and IF, they have included KSHV ORF61 and HSV-1 ICP6 RNRs, and demonstrated that the A3 loop structures, L1, L3, and L7 from A3A, A3B, and A3G play varying roles in determining the ability to interact with the different RNRs. They then go on to demonstrate that the ability of BORF2 to block the deaminase activity of A3B is dependent on the tyrosine at position 481. Lastly, and most interestingly, they show that RNRs from Old World monkeys, but not New World monkeys, can bind to A3A and A3B, lead to their re-localization, and block deaminase activity.

      Response: We thank you for appreciating the molecular details and broader impact of our studies. Please note that we have revised the paper to focus on gamma-herpesviruses by removing the less informative results with HSV-1 and adding new studies on Old/New World viral RNRs including comparisons with ancestral A3B.

      Major comments: The vast majority of this work is very convincing. The authors claims are clearly reflected in the data presented for the most part. However, the work done with HSV-1 ICP6 co-IP is not very convincing. The authors claim that L7 and L3 swaps from A3Bctd to A3Gctd decreases pulldown (lines 5-12, p.7; lines 18-21, p.8; line 17, p.16). The figures (2A, 3A, 4A) however show only A3A being pulled down with ICP6. The re-localization data however does seem more consistent with the above claims. The authors note this in line 9, p.8. However, they come to a different conclusion in line 2, p.8, regarding the discrepancy between IP and IF data.

      Response: As mentioned above, we have removed the less convincing results with HSV-1 ICP6. We believe uncovering the mechanistic details of HSV-1 ICP6 interaction with A3B will require significant additional work and, therefore, would prefer to address this question in future studies.

      The data and methods are clearly presented, with the exception of the supplemental figures, where it is unclear how the predicted modeling was conducted.

      Response: We apologize for the brief description in our earlier submission. We have revised our Methods section and included a more detailed description regarding the generation of protein structural models (page 29, lines 20-23; page 30, lines 1-6).

      Experiments all seem to be sufficiently replicated.

      Response: Thank you.

      Minor comments:

      The references to prior studies seem comprehensive. Text and figures were all very clear. Introducing the supplemental figure 1 earlier, may provide clarity to the argument about degree of relatedness (line 2, p.7).

      Response: We agree with this suggestion and have made changes to introduce the structural model of KSHV ORF61 in our new Figure 1.

      The suggestion of ORF61 interaction with L3 as an anchor region (line 10-12, p.9) was not very clear/could benefit from a bit more elaboration.

      Response: We agree with this comment and have placed the predicted structural model of KSHV ORF61 bound to A3Bctd in our new Figure 1 and we have changed the text to clarify the role of A3B L3 in binding to KSHV ORF61 (page 10, lines 4-8).

      Reviewer #2 (Significance (Required)):

      This work builds on the conceptual framework of host-pathogen interactions and co-evolution, adding new examples of co-divergence of primate herpesviruses with their respective host restriction factors. Following up on past findings (Cheng et al., 2019; Shaban et al., 2021), and reports from others (Stewart et al., 2019), they outline the degree to which their initial findings (BORF2 and A3B interactions) are conserved across other herpesvirus RNRs, and place them in the context of the evolution of the A3 gene locus and expansion.

      This work will be of great interest to virologists. Especially those that work in the field of host pathogen evolution and the molecular arms race.

      My background is in host-pathogen interactions and herpesvirus evolution. I lack the sufficient expertise to evaluate the predicted modeling.

      Response: We thank you again for appreciating the novelty and significance of our work. We are also hopeful that it will be of great interest to virologists.


      Reviewer 3


      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Moraes and colleagues build upon previous publications from this group to 1) characterize the variation in the ability of orthologs of BORF2 from six different herpesviruses to bind and/or relocalize and/or inhibit the deaminase activity of A3A and A3B; 2) use swaps and other mutagenesis to measure whether various regions and amino acids in A3A, A3B, and A3G contribute to the observed variation in the ability of RNR subunits from different viruses to bind these A3s.

      The data convincingly show that different regions of different A3s contribute differently to binding of RNRs from different viruses. These same regions also have variable effects on RNR-mediated relocalization and inhibition of the deamination activity of A3A, A3B, and A3G. In the last set of experiments presented in the manuscript, the authors show that the RNR from the four viruses isolated from humans and rhesus macaques are able to bind human A3B, while the RNRs from two New World monkey viruses are unable to bind human A3B. Finally, the authors suggest a correlation between the timing of the birth of A3B in the branch leading to the last common ancestor of hominoids/Old World monkeys and the gain of A3 binding/antagonism by herpesvirus RNRs. However, these evolutionary implications are not convincingly supported by the current datasets and would require a significant burden of initial experiments to test.

      Response: We thank the reviewer for the nice summary of our work and for appreciating the loop swaps experiments showing differential RNR binding to APOBEC3s. In our original submission, we compared the RNRs of 4 viruses infecting Catarrhini primates and 2 viruses infecting New World primate species. We found that only the RNRs from viruses that infect Catarrhini primates bind, relocalize, and inhibit human A3B. We have now performed additional experiments to further investigate this remarkable association (Figures 7, 8, and associated supplementary material) which are detailed in our revised manuscript and summarized below:

      First, we have expanded the scope of our experiments to include all publicly available RNR sequences from primate gamma-herpesviruses (i.e., 11 RNRs in contrast to our initial 6 RNRs). Second, we tested this whole panel against human A3B and found that only the RNRs from viruses that infect Old World primates that encode A3B are able to bind, relocalize, and inhibit human A3B (Figures 6 & 8). In comparison, binding to human A3A in co-IP experiments is invariably weaker and/or not detectable, relocalization phenotypes are less pronounced, and DNA deaminase activity is not inhibited (Figure 6 and Figure 8—figure supplement 2B). __Third, a subset of this RNR panel was tested against the A3A enzymes of their natural host species (__Figure 7) and, again, only the RNRs form viruses that infect Old World primates bind and relocalize the A3A enzymes tested. Fourth, as an addition test of this idea, we grafted a short helical loop structure (HLS) from EBV BORF2 into the marmoset CalHV-3 RNR and showed that this small change enabled the chimeric protein to bind to both human A3B and A3A (likely through L7), though not to the natural marmoset A3A protein. Fifth, we used all available present-day primate A3B sequences to reconstruct the most likely ancestral A3B sequence and showed that this enzyme is nuclear and as active (if not more active) than human A3B (Figure 8). This ancestral A3B protein is also bound, relocalized, and inhibited by most present day RNRs from gamma-herpesviruses that infect species with A3B, but not by the RNRs of any of the NWM-infecting viruses tested. The only exception to this association between A3B and Catarrhini-infecting gamma-herpesviruses is the RNR of the African Colobus virus ColHV-1, which we found can likely be explained by the loss of A3B in its host species due to a deletion that occurred approximately 10-14 mya after the split of the Colobinae subfamily into African and Asian tribes, which further supports the idea that A3-antagonism by gamma-herpesvirus RNRs is maintained by the selective pressure imposed by the antiviral activity of A3B.

      Major Comments

      1) The use of only the human orthologs of A3A and A3B limit the inferences that can be made regarding the ability of RNRs from various viruses to bind the A3s from the host species of that virus. For example, human A3A (and other hominoid A3As) have a rather distinct Loop 1 sequence, where that same loop in rhesus A3A is a much more similar to A3B. It follows that the RNRs from rhesus-tropic viruses could very well bind and inhibit A3A from rhesus. Likewise, the A3B-RNR interactions within and between species could differ markedly. Indeed, we know that the loops of A3s are some of the most rapidly evolving regions of these genes.

      Response: We agree fully with these points and have addressed them through several new experiments. We have now tested the RNRs from rhesus macaque and NWM gamma-herpesviruses against the A3A enzymes of their natural host species (Figure 7) and found that only RNRs form viruses that infect Old World primates bind and relocalize the A3A enzymes tested. In addition, we used all available present-day primate A3B sequences to reconstruct the most likely ancestral sequence and showed that this ancestral A3B enzyme is antagonized exclusively by the RNRs of present-day gamma-herpesviruses that infect A3B-encoding primates.

      2) If RNR's ability to bind A3s correlated or was driven by the birth of A3B in catarrhine primates, the evolution of the binding/antagonism trait would be highly unparsimonious. The most parsimonious scenario would be emergence of A3 antagonism in the LCA of alpha and gammaherpesviruses (since the authors show A3 binding in HSV-1 and several gammaherpesviruses) with a loss of the trait in NWM-infecting viruses; alternatively, the trait could have been horizontally transferred gained 3 independent times, but this is certainly unlikely and not supported by any data. However, it is also possible that the RNRs from NWM infecting viruses do, in fact, bind/antagonize the A3 orthologs from NWMs. This needs to be tested before addressing the complexities of the birth of the antagonism trait.

      Response: Please see our responses above. All of our results support a model in which the birth of A3B in an ancestral primate selected for a gamma-herpesvirus with A3B binding and neutralization activity and that this activity has been maintained through evolution and still manifests today in all of the tested present-day RNRs of gamma-herpesviruses that infect species with A3B.

      Minor Comments

      1) The authors should state more discreetly what is new to this paper and what was shown in previous paper and in some cases repeated here. For example, figure 1 is all repeated experiments from previous papers which is unusual for a manuscript.

      Response: This is a fair point and we have removed the original Figure 1 and replaced it with a structural model that provides a strong rationale for the rest of our studies. For the sake of clarity, we have also revised our text and made the necessary changes to ensure a clear distinction between new and repeated results.

      2) The authors conclude that RNRs bind to A3s via partially distinct surfaces, but they don't actually test binding. Swaps and mutations do not show that the site of mutation is a site of interaction. but they do test the requirement of these AAs or regions for binding. Formally, these mutations could be exerting an allosteric effect on the binding interface of RNR and A3. In combination with the CryoEM data, these new data do support the model that these are different surfaces of interaction, but the wording should be more precise to present this.

      Response: In our revised manuscript we use a combination of in silico protein structure prediction and docking to model the binding interface between human A3Bctd and KSHV ORF61 (new Figure 1). This approach predicts an interaction with the L3 region of A3B, which we validate through co-IP and co-localization experiments (Figure 3). In contrast, EBV BORF2 requires the L7 region to bind to A3Bctd and this interaction is additionally strengthened by L1 residues (Figures 2 & 4). Taken together with our prior cryo-EM data, these results point to a model in which EBV BORF2 and KSHV ORF61 bind to different surfaces of A3B (albeit near the active site and likely due to evolutionary “wobbling”). We therefore believe it to be unlikely that this mechanism is allosteric given our prior structural studies and the likely common evolutionary origin of this A3B antagonism mechanism.

      3) Similar to point 1, the authors repeatedly discuss the "most critical determinant of EBV BORF2 binding" and other "most critical" interactions. This is not supported by the data and should be changed to something along the lines of 'the site of largest effect among the sites we analyzed'.

      Response: We have endeavored to change this text as suggested except in cases referring to the interactions between EBV BORF2 and A3Bctd, since the results presented here together with our cryo-EM structure of the BORF2-A3Bctd complex (PMID 35476445) allow us to confidently say that L7 and L1 are the most critical determinants.

      4) All microscopy figures need an A3 only panel (no RNR) to be able to judge relocalization.

      Response: Changed as suggested.

      5) The matrix of labels above each IP blot is excessive since each lane only has one component that differs from the other lanes. A single label for each lane would make the plot easier to discern. These figures would also benefit from clearer labels including which virus each blot panel corresponds to (these could be along the left side of each blot; currently, the RNR gene name is provided, but this is a bit hard to find within the figure). Figures 2-4 would benefit from a label above each panel A indicating "L1" "L3" "L7".

      Response: Changed as suggested.

      6) If the authors comment on pg9 ln 8 about intermediate relocalization effect, they should also mention 1C A3B L7G against BORF2

      Response: Changed as suggested (page 8, lines 16-19).

      7) Why is there no quantification of 6D relocalization? Could be supplemental if needed.

      Response: We have performed quantification of the relocalization phenotypes in Figures 2, 3 & 4 in order to allow direct comparison between WT and chimeric A3 enzymes in the presence of the same viral RNR (EBV BORF2 or KSHV ORF61). On the other hand, the images in Figure 6D are representative of the A3B/A relocalization phenotypes elicited by a larger panel of different viral RNRs. These representative images should be interpreted together with the co-IP and ssDNA deaminase activity assay data in Figures 6C & 6E, respectively.

      8) Pg 6 ln 13 and Pg 8 ln2-4, ICP6 doesn't coIP w A3B; this should be clarified.

      Response: A similar concern was also raised by reviewer #2, and we agree that the ICP6 data present in the original version of this manuscript are not as easily interpretable compared to results with the RNRs from gamma-herpesviruses such as EBV and KSHV. For the sake of clarity and cohesion, we decided to remove all of the HSV-1 ICP6 data from the revised version of our manuscript and focus on the A3B interactions with gamma-herpesviruses.

      9) Pg 8 ln21-23, the authors assume loss of function for A3G, but this swap could be functionally equivalent, but necessary for binding; it should be clarified that this is different than changing sequence and still binding.

      Response: Rephrased (page 9, lines 13-16).

      10) Pg 9, ln 11, what is an anchor region?

      Response: We have removed the term “anchor region” and rephrased our text to more clearly describe the importance of L3 in KSHV ORF61 binding (page 10, lines 4-8).

      11) Pg 9, ln 23, speculative - this might be explained by this 3 AA motif but it has not been tested.

      Response: Changed wording (page 10, lines 17-20).

      12) Pg 10 ln 8, this doesn't show that this region is dispensable for binding, only that there is equivalent contribution or lack of contribution of function by the A and B loops, again assuming that the G loop is LOF

      Response: Removed the phrase “indicating that this region may be dispensable for the interaction” (page 11, 1-2).

      13) Pg 10 ln 10 - "can be explained by presence of bulky tryp" - this should be reworded to 'could or is likely caused by'.

      Response: Changed as suggested (page 11, lines 4-6).

      14) Pg 11 ln 21, "can be explained by our cryo-EM" should be reworded to 'is supported by these contacts in cryo-EM'

      Response: Changed as suggested (page 12, lines 15-18).

      15) Pg 13 ln 10 (and other places) dissociation rates are only part of affinity, Ka is equally important (pg 18 ln 8 also)

      Response: We agree and have revised our text account for this suggestion (page 13, lines 22-23; page 14, lines 1-2; page 22, lines 12-15).

      16) Pg 14, ln 15 should be reworded to 'relocalize HUMAN cellular A3s'

      Response: Changed as suggested (page 15, line 11).

      17) Pg 16 Ln 16, this should be reworded as the data can't say it is completely dispensable without deletion of the loop.

      Response: Changed as suggested (page 21, lines 10-11).

      18) New World monkeys have high activity of transposable elements of distinct types relative to catarrhines. It would be useful to mention that A3s restrict endogenous elements as well and how this might be a factor in the proposed evolutionary model.

      Response: This is a very interesting point that we plan to discuss in a future review. In addition, many future experiments will be needed to test the potential relationship between the birth of A3B and its potential impact on different classes of endogenous transposable elements.

      19) Are the New World monkey viruses pathogenic in their native hosts? Perhaps not based on previous literature (reviewed in PMID: 11313011). This should be included in the discussion as it could certainly effect the evolutionary model for the birth/retention of A3 antagonism in these viruses.

      Response: While interesting, the observation that NWM herpesviruses do not cause disease in their native hosts is not unusual. In fact, most gamma-herpesviruses (including human viruses like EBV and KSHV) have limited pathogenic potential when infecting their natural hosts (Fleckenstein B, Ensser A. Gammaherpesviruses of new world primates. Human Herpesviruses: Biology, Therapy, and Immunoprophylaxis. 2007). Additionally, although the mentioned study (PMID: 11313011) reports asymptomatic infection of squirrel monkeys with SaHV-2, pathogenic infection/ oncogenic transformation have been reported following natural infection of marmosets with CalHV-3 (PMID: 11158621).

      20) In previous papers on this topic, the lab has tested the effect of mutations on viral titers. While this may be beyond the scope of this paper, this would certainly elevate the paper and should be more clearly discussed.

      Response: As noted by the reviewer, we have previously demonstrated that A3B restricts EBV replication though a mutation-dependent mechanism and that this is counteracted by EBV BORF2 (PMID 30420783). While we completely agree that investigating the effect of A3B-catalyzed mutations on the titers of different gamma-herpesviruses would be interesting, this would be technically challenging as we are currently not equipped to work with KSHV or any non-human primate herpesvirus.

      21) What is the degree of sequence similarity among these and other RNRs? Is there any sense of what region of RNR binds A3s from the CryoEM structures and differences within these regions that might explain the functional differences?

      Response: We thank the reviewer for raising this important point. We have now included a new Figure 1 where we leverage the cryo-EM structure of the EBV BORF2-A3Bctd complex to make inferences about which regions of KSHV ORF61 may be involved in binding A3B/A. As described above, we also graft a short helical loop structure (HLS) from EBV BORF2 into the marmoset CalHV-3 RNR and showed that this small change enables the chimeric protein to bind to bind both human A3B and A3A (likely through L7), though not to the natural host marmoset A3A protein (Figure 7). Many additional interspecies chimeras could be constructed but we feel these are better suited for future studies (and specially to accompany future structural work in this area).


      Reviewer #3 (Significance (Required)):

      Significance

      Previous work from the Harris lab showed that a subunit of the ribonucleotide reductase of some herpesviruses acts as an antagonist of several human APOBEC3s. Mechanistically, these viral protein block A3 inhibition by relocalizing nuclear A3s as well as inhibiting A3 deamination by binding and occluding the A3 active site. For Epstein-Barr virus, deletion of the antagonist (BORF2) results in a decrease in viral replication and accumulation of mutations likely introduced by host A3B that is no longer inhibited. However, deletion of the A3 antagonist from herpes simplex virus-1 (ICP6) had no effect on viral titers. Most recently, this group published a cryoEM structure of BORF2 in complex with the c-terminal half of A3B. This structure showed extensive contacts between BORF2 and two loops of A3B - L1 and L7.

      The manuscript under review focuses on the previously suggested differences in the ability of different RNRs to bind A3A and A3B. This work provides an important contribution to this topic in defining specific regions of A3A and A3B and A3G that are necessary for viral RNRs to bind them. The variability in these interactions is surprising and likely testament to the impactful coevolution of herpesviruses and primate A3s. This manuscript will be of particular interest to virologists studying A3s or herpesviruses as well as evolutionary biologists interested in the rules of engagement between host restriction factors and viruses.

      Response: We thank you again for these thoughtful comments and for appreciating the overall significance of our work.


    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      In this manuscript, Moraes and colleagues build upon previous publications from this group to 1) characterize the variation in the ability of orthologs of BORF2 from six different herpesviruses to bind and/or relocalize and/or inhibit the deaminase activity of A3A and A3B; 2) use swaps and other mutagenesis to measure whether various regions and amino acids in A3A, A3B, and A3G contribute to the observed variation in the ability of RNR subunits from different viruses to bind these A3s.

      The data convincingly show that different regions of different A3s contribute differently to binding of RNRs from different viruses. These same regions also have variable effects on RNR-mediated relocalization and inhibition of the deamination activity of A3A, A3B, and A3G. In the last set of experiments presented in the manuscript, the authors show that the RNR from the four viruses isolated from humans and rhesus macaques are able to bind human A3B, while the RNRs from two New World monkey viruses are unable to bind human A3B. Finally, the authors suggest a correlation between the timing of the birth of A3B in the branch leading to the last common ancestor of hominoids/Old World monkeys and the gain of A3 binding/antagonism by herpesvirus RNRs. However, these evolutionary implications are not convincingly supported by the current datasets and would require a significant burden of initial experiments to test.

      Major Comments

      1. The use of only the human orthologs of A3A and A3B limit the inferences that can be made regarding the ability of RNRs from various viruses to bind the A3s from the host species of that virus. For example, human A3A (and other hominoid A3As) have a rather distinct Loop 1 sequence, where that same loop in rhesus A3A is a much more similar to A3B. It follows that the RNRs from rhesus-tropic viruses could very well bind and inhibit A3A from rhesus. Likewise, the A3B-RNR interactions within and between species could differ markedly. Indeed, we know that the loops of A3s are some of the most rapidly evolving regions of these genes.
      2. If RNR's ability to bind A3s correlated or was driven by the birth of A3B in catarrhine primates, the evolution of the binding/antagonism trait would be highly unparsimonious. The most parsimonious scenario would be emergence of A3 antagonism in the LCA of alpha and gammaherpesviruses (since the authors show A3 binding in HSV-1 and several gammaherpesviruses) with a loss of the trait in NWM-infecting viruses; alternatively, the trait could have been horizontally transferred gained 3 independent times, but this is certainly unlikely and not supported by any data. However, it is also possible that the RNRs from NWM infecting viruses do, in fact, bind/antagonize the A3 orthologs from NWMs. This needs to be tested before addressing the complexities of the birth of the antagonism trait.

      Minor Comments

      1. The authors should state more discreetly what is new to this paper and what was shown in previous paper and in some cases repeated here. For example, figure 1 is all repeated experiments from previous papers which is unusual for a manuscript.
      2. The authors conclude that RNRs bind to A3s via partially distinct surfaces, but they don't actually test binding. Swaps and mutations do not show that the site of mutation is a site of interaction. but they do test the requirement of these AAs or regions for binding. Formally, these mutations could be exerting an allosteric effect on the binding interface of RNR and A3. In combination with the CroEM data, these new data do support the model that these are different surfaces of interaction, but the wording should be more precise to present this.
      3. Similar to point 1, the authors repeatedly discuss the "most critical determinant of EBV BORF2 binding" and other "most critical" interactions. This is not supported by the data and should be changed to something along the lines of 'the site of largest effect among the sites we analyzed'.
      4. All microscopy figures need an A3 only panel (no RNR) to be able to judge relocalization.
      5. The matrix of labels above each IP blot is excessive since each lane only has one component that differs from the other lanes. A single label for each lane would make the plot easier to discern. These figures would also benefit from clearer labels including which virus each blot panel corresponds to (these could be along the left side of each blot; currently, the RNR gene name is provided, but this is a bit hard to find within the figure). Figures 2-4 would benefit from a label above each panel A indicating "L1" "L3" "L7".
      6. If the authors comment on pg9 ln 8 about intermediate relocalization effect, they should also mention 1C A3B L7G against BORF2
      7. Why is there no quantification of 6D relocalization? Could be supplemental if needed.
      8. Pg 6 ln 13 and Pg 8 ln2-4, ICP6 doesn't coIP w A3B; this should be clarified.
      9. Pg 8 ln21-23, the authors assume loss of function for A3G, but this swap could be functionally equivalent, but necessary for binding; it should be clarified that this is different than changing sequence and still binding.
      10. Pg 9, ln 11, what is an anchor region?
      11. Pg 9, ln 23, speculative - this might be explained by this 3 AA motif but it has not been tested.
      12. Pg 10 ln 8, this doesn't show that this region is dispensable for binding, only that there is equivalent contribution or lack of contribution of function by the A and B loops, again assuming that the G loop is LOF
      13. Pg 10 ln 10 - "can be explained by presence of bulky tryp" - this should be reworded to 'could or is likely caused by'.
      14. Pg 11 ln 21, "can be explained by our cryo-EM" should be reworded to 'is supported by these contacts in cryo-EM'
      15. Pg 13 ln 10 (and other places) dissociation rates are only part of affinity, Ka is equally important (pg 18 ln 8 also)
      16. Pg 14, ln 15 should be reworded to 'relocalize HUMAN cellular A3s'
      17. Pg 16 Ln 16, this should be reworded as the data can't say it is completely dispensable without deletion of the loop.
      18. New World monkeys have high activity of transposable elements of distinct types relative to catarrhines. It would be useful to mention that A3s restrict endogenous elements as well and how this might be a factor in the proposed evolutionary model.
      19. Are the New World monkey viruses pathogenic in their native hosts? Perhaps not based on previous literature (reviewed in PMID: 11313011). This should be included in the discussion as it could certainly effect the evolutionary model for the birth/retention of A3 antagonism in these viruses.
      20. In previous papers on this topic, the lab has tested the effect of mutations on viral titers. While this may be beyond the scope of this paper, this would certainly elevate the paper and should be more clearly discussed. What is the degree of sequence similarity among these and other RNRs? Is there any sense of what region of RNR binds A3s from the CryoEM structures and differences within these regions that might explain the functional differences?

      Significance

      Previous work from the Harris lab showed that a subunit of the ribonucleotide reductase of some herpesviruses acts as an antagonist of several human APOBEC3s. Mechanistically, these viral protein block A3 inhibition by relocalizing nuclear A3s as well as inhibiting A3 deamination by binding and occluding the A3 active site. For Epstein-Barr virus, deletion of the antagonist (BORF2) results in a decrease in viral replication and accumulation of mutations likely introduced by host A3B that is no longer inhibited. However, deletion of the A3 antagonist from herpes simplex virus-1 (ICP6) had no effect on viral titers. Most recently, this group published a cryoEM structure of BORF2 in complex with the c-terminal half of A3B. This structure showed extensive contacts between BORF2 and two loops of A3B - L1 and L7.

      The manuscript under review focuses on the previously suggested differences in the ability of different RNRs to bind A3A and A3B. This work provides an important contribution to this topic in defining specific regions of A3A and A3B and A3G that are necessary for viral RNRs to bind them. The variability in these interactions is surprising and likely testament to the impactful coevolution of herpesviruses and primate A3s. This manuscript will be of particular interest to virologists studying A3s or herpesviruses as well as evolutionary biologists interested in the rules of engagement between host restriction factors and viruses.

      Expertise keywords: restriction factors, APOBEC3 evolution, evolutionary genomics, genetic conflict

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Building off the groups prior work on A3B and EBV BORF2 interactions, here they have expanded their studies to examine additional herpesvirus RNRs, demonstrating which features are conserved. Using a combination of IP experiments and IF, they have included KSHV ORF61 and HSV-1 ICP6 RNRs, and demonstrated that the A3 loop structures, L1, L3, and L7 from A3A, A3B, and A3G play varying roles in determining the ability to interact with the different RNRs. They then go on to demonstrate that the ability of BORF2 to block the deaminase activity of A3B is dependent on the tyrosine at position 481. Lastly, and most interestingly, they show that RNRs from Old World monkeys, but not New World monkeys, can bind to A3A and A3B, lead to their re-localization, and block deaminase activity.

      Major comments:

      The vast majority of this work is very convincing. The authors claims are clearly reflected in the data presented for the most part. However, the work done with HSV-1 ICP6 co-IP is not very convincing. The authors claim that L7 and L3 swaps from A3Bctd to A3Gctd decreases pulldown (lines 5-12, p.7; lines 18-21, p.8; line 17, p.16). The figures (2A, 3A, 4A) however show only A3A being pulled down with ICP6. The re-localization data however does seem more consistent with the above claims. The authors note this in line 9, p.8. However, they come to a different conclusion in line 2, p.8, regarding the discrepancy between IP and IF data.

      The data and methods are clearly presented, with the exception of the supplemental figures, where it is unclear how the predicted modeling was conducted.

      Experiments all seem to be sufficiently replicated.

      Minor comments:

      The references to prior studies seem comprehensive. Text and figures were all very clear. Introducing the supplemental figure 1 earlier, may provide clarity to the argument about degree of relatedness (line 2, p.7).

      The suggestion of ORF61 interaction with L3 as an anchor region (line 10-12, p.9) was not very clear/could benefit from a bit more elaboration.

      Significance

      This work builds on the conceptual framework of host-pathogen interactions and co-evolution, adding new examples of co-divergence of primate herpesviruses with their respective host restriction factors. Following up on past findings (Cheng et al., 2019; Shaban et al., 2021), and reports from others (Stewart et al., 2019), they outline the degree to which their initial findings (BORF2 and A3B interactions) are conserved across other herpesvirus RNRs, and place them in the context of the evolution of the A3 gene locus and expansion.

      This work will be of great interest to virologists. Especially those that work in the field of host pathogen evolution and the molecular arms race.

      My background is in host-pathogen interactions and herpesvirus evolution. I lack the sufficient expertise to evaluate the predicted modeling.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Moraes et al. build upon their recent studies of APOBEC3 antagonism by EBV BORF2 by showing that additional RNR subunits encoded by other herpesviruses share this activity, suggesting that the host-virus arms races involving APOBEC3 proteins is more widespread than previously thought. Furthermore, the authors show that herpesviruses infecting primates that lack A3B (New World Monkeys) do not apparently exhibit a capacity to antagonize human A3B, suggesting that this function was not required during the evolution of those viruses (while it seemingly was important for viruses infecting hosts that encode A3B). Overall, this is a technically sound submission that combines confocal immunofluorescence, co-immunoprecipitation, and enzymatic assays to comprehensively test the sensitivity of different A3s to counteraction by viral RNRs. The enzymatic (deamination) assays performed prove to be the most insightful, since co-IP and colocalization microscopy was not entirely sufficient to reveal which domains of A3 are important for targeting by RNRs. It is well-written, well-organized, and well-referenced, and will be of interest to readers who study APOBEC3s, herpesviruses, and host-virus arms races more generally.

      Major points:

      1. Figure 4B: the fluorescence microscopy data does not match well with the Co-IP (in Figure 4A). For example, L1B in A3A enhances the A3A-BORF2 co-IP but no clear differences are observed in colocalization. Or are the authors claiming the presence of L1B results in greater colocalization between A3A and BORF2, because there is slightly less diffuse BORF2 in the cytoplasm under these conditions? If that is the case, then quantitative colocalization analysis will need to be performed. In general, virtually none of the colocalization analysis in Figure 4B matches well with the co-IP results of Figure 4A. The authors take this to suggest that L7, and not L1, is most important determinant for BORF2 binding to A3s, but in that case, then the colocalization data is disconnected functionally from co-IP results. This is not necessarily a large problem, since the authors ultimately test the enzymatic activity of A3s in the presence of different RNRs. These latter functional experiments more objectively define what regions of A3s are important for antagonism by RNRs.
      2. Can the authors discuss/cite more about the actual subcellular compartments that the A3s are being relocated towards by the RNPs? In general, the authors' comments are limited to whether the A3 is predominantly in the nucleus, or not.
      3. Since the authors draw a connection between the absence of A3B in New World Monkeys and the fact that New World Monkey-specific viruses don't seem to counteract A3s, can the authors discuss what could be learned by studying human individuals who lack A3B and the evolution of herpesviruses in those individuals?

      Minor points:

      1. I'm not sure it makes sense to call out Figures 1A-D in the Introduction section, rather than the Results section.

      Significance

      This work represents a step-wise advance from the authors' previous work on herpesvirus RNPs and counteraction of host APOBEC3s. I study host-virus molecular arms race on evolutionary scales and this article is of interest and significance to me, and I assume to others in the field as well. The findings found within the submission are interesting but not necessarily informative about human health and disease. However, the subsequent work that this manuscript inspires is likely to tell us more about herpesvirus evolution in human patients and the mechanisms by which APOBEC3s promote cancer.

  2. Sep 2022
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      See cover letter for more details.

      Summary of response to reviewers:

      We were immensely pleased that the reviewers considered our conclusions “well supported” and our study “beautifully executed”. Reviewers also recognized the significance of our work. Reviewer 1 stated that “building a model that describes one of these pathways will allow us to begin to test therapies to treat or prevent scoliosis” then noted that we “help to build a larger model of normal spine morphogenesis” and that this is “important”. Reviewer 2 called our work an “exciting advance in our understanding of one of the essential signaling pathways that help regulate body axis straightening and spine morphogenesis in zebrafish” and mentioned that our work “may also help to further our understanding of the etiology and pathophysiology of multiple forms of neuromuscular scoliosis in humans”. Reviewer 3 agreed, stating that our work “adds important information on the role of urotensin signaling in spine formation” and noted that it is timely: “findings are of special significance in the light of recent reports that mutations in UTS2R3 show association with spinal curvature in patients with adolescent idiopathic scoliosis”.

      We thank the three reviewers for reading our research and providing feedback. In all cases, we have incorporated (or plan to incorporate) their suggestions, and we believe this has (will) make our manuscript much stronger. Indeed, reviewers had only a small number of “major points”, and all are easily addressed as summarized below. We have already addressed some of those “major points”, as well as the majority of “minor points” raised by reviewers, in our current draft. We expect that all comments can be fully addressed within around 1 month.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are plannedto address the points raised by the referees.

      • *

      We have divided our responses by whether the reviewers considered their points major or minor. All points have already been, or will soon be, fully addressed.


      Major points


      Reviewer 1

      • *

      The key conclusions are well supported, see below for my two major issues.

      Please don't call this lordosis. Lordosis or hyperlordosis effects lumbar vertebra. The curve in the lumbar region shifts body weight so that human gait is more efficient that that in the great apes, or so the story goes. Zebrafish do not have lumbar vertebra equivalents or a natural curve in the caudal region. Similarly, fish do not have the equivalent vertebra to generate kyphosis, which is again a hyper flexion of a normal human spinal curve. Instead zebrafish have Weberian, precaudal and caudal vertebra. It would be so much more useful for the field if the authors used these terms and specified ranges, i.e. numbered vertebrae, that are effected so we can directly and accurately compare regions of defects between zebrafish mutants. It would help to make the point that the uts2r3 mutant has more caudally located curves than urp1/2 double mutants. We appreciate this point and agree with the reviewer. Lordosis (or hyperlordosis) is indeed the accentuation of a curve which naturally exists in humans but not zebrafish. We called the phenotype of Urotensin pathway mutants ‘lordosis’ or ‘lordosis-like’ because of the position of the curves — in caudal vertebrae, which are evolutionarily and positionally equivalent to lumbar vertebrae, though they are structurally different to human lumbar vertebrae. To address this comment, we will no longer refer to the phenotype as lordosis in our Introduction or Results sections and we will expand our Discussion to include this point raised by the reviewer.

      1. The observation that urp1/2 double mutants have curves only in the D/V plane and almost completely lack side-to-side curves is noteworthy. Does the urp1-/-urp2-/- mutant uncouple two systems for posture? If this separate a DV from side-to-side postural control system, that would be very interesting. It is particularly important to describe how penetrant the phenotype is and how many times it was observed. See 9 minor comments. It would help the reader if the authors explicitly described the features that they see in the cfap298 mutant that constitute lateral curves and that are lacking in urp1/2 (e.g. in figure 4E).

      We plan to expand the figure and analysis describing D/V curves and M/L curves. While our first draft included only cfap298 and urp1-∆P;urp2-∆P mutants, our next draft will also incorporate uts2r3 and pkd2l1 mutants. We have already scanned cohorts of all mutant fish, and so the remaining work to render and quantify the degree of lateral curvature will not take long. This will allow us to conclusively determine whether these different mutations indeed uncouple two systems controlling posture in different directions. As the reviewer requests, we will include all fish analyzed in either main or supplementary figures, include numbers in figure legends, and quantify the penetrance of M/L and D/V curves.

      We have also generated cfap298;urp1-∆P;urp2-∆P triple mutants and are currently scanning them to reveal skeletal form. Preliminary data suggests triple mutants have three-dimensional curves but D/V curves are more severe in triple mutants than in cfap298 mutants alone. This makes sense if Urp1/Urp2 are important for controlling D/V spinal shape and, as our qPCR shows, Urp1/Urp2 are downregulated but not lost completely in cfap298 mutants. It also furthers the notion that cilia motility controls D/V and M/L curves by separable mechanisms. * *

      • *

      Reviewer 2

      Need to show that the CRISPANT targeting was effective for mutagenesis at each loci screened in the work presented in Figure 1E. In Figure 1E, we presented the phenotypes of crispant embryos (i.e. embryos injected with four gRNAs targeting a specific gene alongside relatively high doses of Cas9 protein; see schematic in Figure 1G). In positive controls (cfap298 and sspo), crispants showed the expected phenotype in all cases (Figure 1E and see Figure 1H for quantitation). As for germline mutants, urp1 and urp2 crispants showed no early axial phenotypes (Figure 1E and 1H). As such, the reviewer requests that we perform molecular assays to determine whether mutagenesis was successful in these embryos. To do so, we will perform either T7 assays or next-generation/Sanger sequencing of mutated loci. This will allow us to determine and quantify the effectiveness of our mutagenesis. Results will be shared in a new supplementary figure. These assays are straightforward and we expect they will not take very long to complete. Indeed, we have performed these assays previously for other genes (e.g. Grimes et al., 2019 and several unpublished genes). We have achieved high levels of mutagenesis in all cases, making us very confident that we will achieve similarly high levels of mutagenesis in this case.

      Reviewer 3


      The addition of the F0 crispant experiment to show that the pro-peptide of urp1/2 does not have a function and is responsible for the difference between the observed morpholino and the crispr phenotype was important. However, since no phenotype was observed in crispants it is important to add evidence of induced cuts for all guide RNAs used in the crispant experiment. These control experiments might have been done already. If not, they can easily be done in a short period of time by performance of T7 assays on injected fish and would not require additional reagents. This is the same point raised by reviewer 2 and so we refer to the response above. In summary, we agree with the reviewer and we are currently performing these suggested experiments which are straightforward and working well.

      The authors claim that there were no structural defects observed in urp1/2 double mutants. However, the hemal arch in figure 3 E seems to be deformed. This could be normal variance or a phenotype. This can be addressed by simple reinspection of the scans.

      We believe there are no major vertebral structural defects that could be attributed to causing the spinal curves because vertebrae are well-formed in mutants and we see no defects in the initial patterning of vertebrae in our calcein experiments. However, since urp1-∆P;urp2-∆P and uts2r3 mutant spines are curved, the vertebrae are a little misshapen. We plan two revisions, one textual and one analytical.

      First, we will make clear in our textual edits that some vertebrae are slightly misshapen, as occurs in non-congenital forms of human spinal curve disease (in congenital forms, the shape defects are more striking and likely causative in the curvature). We agree with the reviewer that stating that there is a lack of vertebral structural defects lacked nuance, so we will expand on this in our next draft.

      Second, we will quantify vertebral shapes in spinal curve mutants and report these data in our next draft. After reinspection of the scans, as the reviewer suggested, we believe it would be informative for our readers to see quantitation of vertebral shape. We expect these data to more rigorously back up our statements about ‘minor structural differences’ of vertebrae between uncurved and curved individuals. We have already begun this work, and completing it should only take a few more weeks. As an example, we have measured the shape of centra by calculating aspect ratios in wild-type and urp1-∆P;urp2-∆P double mutants in curved regions of the spine:

      These preliminary data already make clear that there are indeed subtle morphological differences between vertebrae in mutants and wild-type, as occurs in human spinal curve deformities. We will present completed versions of these data (several parameters that describe vertebral shape) in our next draft and provide comments about whether such changes could be causative in spinal curve etiology as occurs in congenital-type scoliosis.

      Minor points


      Reviewer 1

      Supplementary FigS3B How to measure the Cobb Angle is unclear. Why is the first curve not counted? I count 3 curves. First a ventral displacement, then a dorsal to ventral return, then a sharp flex before the tail. How to measure Cobb angle might be easier to explain if the figure is expanded into steps. Identify the apical vertebra, then showing how the lines are drawn parallel to those vertebrae, then where the measured angle forms between the lines perpendicular to the drawn parallel lines.

      We will more thoroughly explain how Cobb angle is measured in our next draft.

      5a. I think we (zebrafish biologists) need be explicit about what we mean with "without vertebral defects." What do we count as defects? Vertebrae can be fused, bent, shortened or the growing edges can be slanted. In Figure 3E, and movie7, it is clear that the highlighted mutant vertebrae are shorter than WT. The growing ends of normal vertebra are perpendicular to the long axis of the vertebra. In the mutants the ends are slanted. Please define in the text what you consider a relevant vertebral defect, because these vertebrae have defects. Or are you only considering the calcein stained centra at 10dpf?

      We strongly agree with the reviewer. As described more thoroughly above in response to Major Comment – Reviewer 3, we plan both textual edits and new quantitation of vertebral shape to address this comment. Our quantitation indeed shows some vertebrae are shorter in mutants as the reviewer noticed. We also plan a new paragraph in the Discussion section which will speak about the issue of what zebrafish biologists might mean by “without vertebral defects”.

      5b. Do you want to base your patterning conclusion on primarily the calcein data as these are closer to the notochord patterning time window. Please anchor this conclusion to a specific time or standard length e.g. 10dpf/5.6mm.

      When we edit our descriptions of vertebral defects, and include new quantitative data on the shape of vertebrae, we will be clear that the vertebrae are slightly structurally malformed. In addition, when we speak of the calcein data, we will anchor those conclusions to the specific timepoint best studied by this method, as the reviewer suggests.

      "At 30 dpf... several mutants exhibited a significant curve in the pre-caudal vertebrae, in addition to a caudal curve (Fig. 3D and S3C). Since pre-caudal curves were rare in mutants at 3-months, this suggested that curve location is dynamic".The frequency of this observation is important. Does it effect all or a fraction of mutants? Can you provide some numbers to anchor these observations? Maybe fractions e.g.. 3 of 4 fish had precaudal curves at 30pdf, and 0 of 10 fish had precaudal curves by 3 mpf?

      In our next draft, we will provide numbers of fish examined at 30 dpf and also show graphical summaries of curve position (as we did for younger fish). Last, all scans will be included in a new supplementary figure.

      The description of the pkd2l1 mutant, instead of terming it kyphosis can you tell the reader the vertebra number at the peak of the curve. The authors say the pkd2l1 mutant is highly distinct from urp1/urp2-/-, but the reader needs to hear exactly what is distinct. For example, does this mutant have both lateral and D/V curves?

      We have now scanned several pkd2l1 mutant fish and we will include images of pkd2l1 mutants at two different timepoints together with quantitation of curve position. Our results agreed with those previously published for this mutant line (Sternberg et al., 2018) but we believe it is important for our readers to see side-by-side images and quantitation so they can see the distinctions.

      At 3-months of age, pkd2l1 mutants essentially appear wild-type but by around 12-months they have developed a D/V curve in the pre-caudal vertebrae. They do not exhibit M/L curves; we will quantify this and include these data in our Figure about M/L deviation.

      We called the phenotype displayed by pkd2l1 mutants “kyphosis” to be in line with a previous publication describing these mutants (Sternberg et al., 2018). We will add new wording in the Discussion about whether or not zebrafish can truly model kyphosis and lordosis (see response to Reviewer 1 major comment above), and we make clear in our Results that the phenotype has “been argued to model kyphosis (Sternberg et al., 2018)” rather than “is kyphosis”.

      It is intriguing that pkd2l1 mutants do not exhibit any curves until much later in life than urp1-∆P;urp2-∆P and uts2r3mutants. Inspired by this finding, we aged urp1-∆P and urp2-∆P single mutants and found that they go on to develop D/V curves by 12-months i.e.

      • *

      • *3-months 12-months Position of curve

      urp1-∆P no curves mild D/V curves Mostly caudal

      urp2-∆P mild D/V curves intermediate D/V curves Mostly caudal

      urp1-∆P;urp2-∆P severe D/V curves severe D/V curves Mostly caudal

      uts2r3 severe D/V curves severe D/V curves Mostly caudal

      cfap298 severe 3D curves severe 3D curves Caudal and pre-caudal

      pkd2l1 no curves mild D/V curves Mostly pre-caudal

      Phenotypes in urp1-∆P and urp2-∆P single mutants upon aging shows: 1) Urp1 and Urp2 are not entirely redundant in long-term spine maintenance and 2) proper Urp1/Urp2 dose is essential. We will include these new data in our next draft.

      Does uts2r3-/- have no /minimal side-to-side curves like urp1/urp2-/-?

      This is an interesting question. To address it, we will add images of uts2r3 mutant spines from the dorsal aspect and include them with our new quantitation of lateral curvature. To sum, the reviewer’s suggestion is correct – there are minimal side-to-side curves in uts2r3 mutants.

      One finding that deserves more discussion is the observation that urp1/urp2 double mutants have almost no side-to-side defects and all the obvious bends are in the D/V plane. Does this uncouple two systems for posture? Please consider the following paper. It shows a proprioception system that maintains normal side-to-side posture. A spinal organ of proprioception for integrated motor action feedback. Picton LD, Bertuzzi M, Pallucchi I, Fontanel P, Dahlberg E, Björnfors ER, Iacoviello F, Shearing PR, El Manira A. Neuron. 2021 Apr 7;109(7):1188-1201.e7. doi: 10.1016/j.neuron.2021.01.018. Epub 2021 Feb 11. PMID: 33577748

      Thank you for pointing out this manuscript. We will include it in our expanded Discussion.

      Reviewer 2

      Fig 3F: might be improved by making the images black and white and possibly inverted. It is not easy to clearly see the vertebrae as is. * *

      Thanks for the suggestion, we will make this change.

      • *

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Minor points


      Reviewer 1

      • *

      Figure 1D legend says urp1 is expressed in dorsal while urp2 is express in all CSF-cNeurons, but the image for urp1 shows only ventral cells in WT, while the image for urp2 shows the same cells ...and more dorsal cells. Please replace image with one that matches the text. Apologies for this, we have now corrected it. The image was correct but we accidentally wrote “dorsal” instead of “ventral” when describing the CSF-cN sub-population harboring urp1 transcripts.

      In Figure 2H, the position of curve apex graphic, how many fish were examined? In 2f it looks like n=8 and n=9. Can this info be added to the figure?

      We have now included the number examined in the legend.

      I did not find legends for the movies. The first call to the movies calls movies 1-3 without explaining what each shows. The labels on the downloaded files are not informative.

      Apologies for forgetting to submit these. We have now added informative Movie legends.

      Reviewer 3

      • *

      It would be helpful to the reader to add a little more information on urp1 and upr2 proteins and their processing to make it clear while only the 3' region of the protein was targeted to induce mutations. We have incorporated textual edits to make this more clear. We now state in the second sentence of the Results section:

      Urp1 and Urp2 are encoded by 5-exon genes with the final exon coding for the 10-amino acid peptides that are released by cleavage from the pro-domain (Fig. 1A).

      Together with Fig. 1A and Supplementary Fig. 1, we hope it is now clear to readers how Urp1 and Urp2 are generated from a 5-exon gene encoding the pro-domain and the peptide, which are separated by cleavage.

      It would also be helpful to the reader to have a schematic indicating the guide target sites (they could be added to figure S1 C + D) in the protein to be able to interpret the result more easily.

      Done!

      Figure 5: Addition of a square to H would help understand were the pictures in D-F were taken.

      Done!

      4. Description of analyses that authors prefer not to carry out

      N/A. We are performing all experiments/analyses requested by reviewers.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The presented work by Bearce et al. is based on the hypothesis that urp1 and urp2, and their receptor uts2r3 play a role during zebrafish spine development. Previously it had been shown that cilia function as well as Reissner fiber formation are important for spine development and that both cilia motility and the Reissner fiber influence urp1/2 expression. Further, morpholino knock-down of upr1/2 did show the typical curly down phenotype observed in cilia and RF mutants. The authors generate CRISPR mutants for urp1, urp2 by targeting the 10-amino acid secreted peptides and do not find an early phenotype in single, double or maternal zygotic mutants or cripants. However, they observe a late onset curvature of the spine in urp1/2 double mutants and in generated uts2r3 single mutant. Spinal curvature was assessed through measurement of the Cobb angel in microCT scans and compared with other scoliosis mutants. This analysis revealed similarities between urp1/2 and uts2r3 mutants and differences with curvatures observed in cilia motility (cfap298) or Reissner fiber (sspo) mutants, which did show decreased expression levels of urp1 and 2. These differences in spine curvature do indicate that the phenotypes are not caused by the same mechanism. Analysis of the Reissner fiber in transgenic animals did show no defects.

      Major points:

      The paper is generally well written and easy to follow. All experiments are described in sufficient detail and reagents are listed. However, there are two points that should be addressed to strengthen the conclusion of the paper.

      1. The addition of the F0 crispant experiment to show that the pro-peptide of urp1/2 does not have a function and is responsible for the difference between the observed morpholino and the crispr phenotype was important. However, since no phenotype was observed in crispants it is important to add evidence of induced cuts for all guide RNAs used in the crispant experiment. These control experiments might have been done already. If not, they can easily be done in a short period of time by performance of T7 assays on injected fish and would not require additional reagents.
      2. The authors claim that there were no structural defects observed in urp1/2 double mutants. However, the hemal arch in figure 3 E seems to be deformed. This could be normal variance or a phenotype. This can be addressed by simple reinspection of the scans.

      Minor points:

      1. It would be helpful to the reader to add a little more information on urp1 and upr2 proteins and their processing to make it clear while only the 3' region of the protein was targeted to induce mutations.
      2. It would also be helpful to the reader to have a schematic indicating the guide target sites (they could be added to figure S1 C + D) in the protein to be able to interpret the result more easily.
      3. Figure 5: Addition of a square to H would help understand were the pictures in D-F were taken.

      Significance

      While scoliosis in human patients is very prevalent, our understanding on the mechanism that lead to the development of spinal curvature are very limited and so are the treatment strategies. The zebrafish has emerged as an important model to study spine development and formation of scoliosis. While not all findings in the presented work are novel, this work adds important information on the role of urotensin signaling in spine formation. These findings are of special significance in the light of recent reports that mutations in UTS2R, the human ortholog of uts2r3, show association with spinal curvature in patients with adolescent idiopathic scoliosis. As such, this work will be of interest not only to basic researches but also the medical field.

      My field of expertise: zebrafish, CRISPR/Cas, genetics, skeletal development, spine formation

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Major concern:

      1. Need to show that the CRISPANT targeting was effective for mutagenesis at each loci screened in the work presented in Figure 1E.

      Minor Concern:

      1. Fig 3F : might be improved by making the images black and white and possibly inverted. It is not easy to clearly see the vertebrae as is.

      Significance

      Summary:

      This is a beautifully executed study on the role of Urp signaling in spine morphogenesis in zebrafish. This work also challenges the model that Urp1/ 2 controls the extension and straightening of the body axis of the zebrafish embryos. Here, using a double mutant in urp1 and urp2, they show that urp1/2 are dispensable for axial straightening. Moreover, they provide redundant roles during larval development in particular for maintaining a straight spine. They go on to show that scoliosis observed in urp1/2 double mutant fish are distinct - showing only dorsal-ventral lordosis , whereas previously published scoliosis phenotypes _showing curvates in dorsal-ventral and medial-lateral axes as observed in cilia- and Reissner fiber-related scoliosis mutants. They provide clear evidence that loss of Urp signaling does not affect the stability of the Reissner fiber as it does in cilia-related scoliosis mutants. Underscoring the distinct regulation of Urp signaling on spine morphology during larval development. Altogether, this is an exciting advance in our understanding of one of the essential signaling pathways that help to regulate body axis straightening and spine morphogenesis in zebrafish. These studies may also help to further our understanding of the etiology and pathophysiology of multiple forms of neuromuscular scoliosis in humans. I recommend it for publication after revisions.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The authors investigate the role of Urotensin Related Peptides (Urp1 and Urp2) on zebrafish spine straightness. One model of normal spinal morphogenesis proposes that when the spine bends, material in the central canal of the spinal cord (the Reissner Fiber, RF, mostly composed of scospondin) stimulates surrounding Cerebral Spinal Fluid contacting neurons (CSF-cN), that in turn release Urotensin like peptides that cause dorsl muscles to contract and straighten the spine. It is clear that motile cilia in the central canal are responsible for forming/compacting the RF from monomers of scospondin. Mutations were generated that removed the peptide-coding portion of Urp1, Urp2 and that removed most of the Urp receptor Uts2r3 and made a missense scospondin gene. They used cfap298-/- as an immotile cilia control and scospondin-/- as a Reissner Fiber absent control. The authors show Urotensin peptides and receptor Uts2r3 function in juvenile but not embryonic axis straightening. They defined the timecourse of spinal curves onset and change during larval life, i.e., 9 dpf to 17 dpf and found that curves were dynamic between 30dpf and 3 mpf. Unlike cfap mutants, urotensin mutants show no sex bias in scoliosis expression. The authors used a temperature sensitive mutation in cfap298 and the GFP-tagged scospondin gene to show that active cilia are required for both initial formation of the RF before 28 hpf and to maintain the RF between 6 and 12 dpf. Finally the authors demonstrated that the receptor Uts2r3 is not required for establishment or maintenance of the RF at 28 hpf and 12 dpf.

      Major comments

      The key conclusions are well supported, see below for my two major issues.

      1. Please don't call this lordosis. Lordosis or hyperlordosis effects lumbar vertebra. The curve in the lumbar region shifts body weight so that human gait is more efficient that that in the great apes, or so the story goes. Zebrafish do not have lumbar vertebra equivalents or a natural curve in the caudal region. Similarly, fish do not have the equivalent vertebra to generate kyphosis, which is again a hyper flexion of a normal human spinal curve. Instead zebrafish have Weberian, precaudal and caudal vertebra. It would be so much more useful for the field if the authors used these terms and specified ranges, i.e. numbered vertebrae, that are effected so we can directly and accurately compare regions of defects between zebrafish mutants. It would help to make the point that the uts2r3 mutant has more caudally located curves than urp1/2 double mutants.
      2. The observation that urp1/2 double mutants have curves only in the D/V plane and almost completely lack side-to-side curves is noteworthy. Does the urp1-/-urp2-/- mutant uncouple two systems for posture? If this separate a DV from side-to-side postural control system, that would be very interesting. It is particularly important to describe how penetrant the phenotype is and how many times it was observed. See 9 minor comments. It would help the reader if the authors explicitly described the features that they see in the cfap298 mutant that constitute lateral curves and that are lacking in urp1/2 (e.g. in figure 4E).

      Minor comments

      1. Figure 1D legend says urp1 is expressed in dorsal while urp2 is express in all CSF-cNeurons, but the image for urp1 shows only ventral cells in WT, while the image for urp2 shows the same cells ...and more dorsal cells. Please replace image with one that matches the text.
      2. In Figure 2H, the position of curve apex graphic, how many fish were examined? In 2f it looks like n=8 and n=9. Can this info be added to the figure?
      3. Supplementary FigS3B How to measure the Cobb Angle is unclear. Why is the first curve not counted? I count 3 curves. First a ventral displacement, then a dorsal to ventral return, then a sharp flex before the tail. How to measure Cobb angle might be easier to explain if the figure is expanded into steps. Identify the apical vertebra, then showing how the lines are drawn parallel to those vertebrae, then where the measured angle forms between the lines perpendicular to the drawn parallel lines.
      4. I did not find legends for the movies. The first call to the movies calls movies 1-3 without explaining what each shows. The labels on the downloaded files are not informative.
      5. a. I think we (zebrafish biologists) need be explicit about what we mean with "without vertebral defects." What do we count as defects? Vertebrae can be fused, bent, shortened or the growing edges can be slanted. In Figure 3E, and movie7, it is clear that the highlighted mutant vertebrae are shorter than WT. The growing ends of normal vertebra are perpendicular to the long axis of the vertebra. In the mutants the ends are slanted. Please define in the text what you consider a relevant vertebral defect, because these vertebrae have defects. Or are you only considering the calcein stained centra at 10dpf?

      5b. Do you want to base your patterning conclusion on primarily the calcein data as these are closer to the notochord patterning time window. Please anchor this conclusion to a specific time or standard length e.g. 10dpf/5.6mm. 6. "At 30 dpf... several mutants exhibited a significant curve in the pre-caudal vertebrae, in addition to a caudal curve (Fig. 3D and S3C). Since pre-caudal curves were rare in mutants at 3-months, this suggested that curve location is dynamic" The frequency of this observation is important. Does it effect all or a fraction of mutants? Can you provide some numbers to anchor these observations? Maybe fractions e.g.. 3 of 4 fish had precaudal curves at 30pdf, and 0 of 10 fish had precaudal curves by 3 mpf? 7. The description of the pkd2l1 mutant, instead of terming it kyphosis can you tell the reader the vertebra number at the peak of the curve. The authors say the pkd2l1 mutant is highly distinct from urp1/urp2-/-, but the reader needs to hear exactly what is distinct. For example, does this mutant have both lateral and D/V curves? 8. Does uts2r3-/- have no /minimal side-to-side curves like urp1/urp2-/-? 9. One finding that deserves more discussion is the observation that urp1/urp2 double mutants have almost no side-to-side defects and all the obvious bends are in the D/V plane. Does this uncouple two systems for posture? Please consider the following paper. It shows a proprioception system that maintains normal side-to-side posture. A spinal organ of proprioception for integrated motor action feedback. Picton LD, Bertuzzi M, Pallucchi I, Fontanel P, Dahlberg E, Björnfors ER, Iacoviello F, Shearing PR, El Manira A. Neuron. 2021 Apr 7;109(7):1188-1201.e7. doi: 10.1016/j.neuron.2021.01.018. Epub 2021 Feb 11. PMID: 33577748

      Significance

      Scoliosis effects about 3% of children worldwide. Mammals have not been good models for this condition. Zebrafish seem to have an intrinsic susceptibility to scoliosis, as well as several technical advantages. Scoliosis is likely caused by disruption of several different and independent pathways. Building a model that describes one of these pathways will allow us to begin to test for therapies to treat or prevent scoliosis.

      1. The authors demonstrate that urp1 and 2 are required for normal adult spine straightness. While loss of the uts2r3 receptor (A.K.A. uts2ra, Zhang et.al., Nat Genet, 2018) and the uts4 (receptor, Alejevski, et.al, Open Bio, 2021) lead to adult spinal bends or scoliosis, of the four described urotensin ligand paralogs, only urp, not uts2, urp1 or urp2 have been tested by deletion for a role in scoliosis (Quan et.al., Peptides 2021). In the current work, the authors help to build a larger model of normal spine morphogenesis and show that mutations effecting later steps do not have typical cilia associated phenotypes. Contributing a step to this model is important.
      2. The authors show that juvenile or adult scoliosis can be independent of the embryonic curves, Curly Tail Down phenotype. This result is somewhat in conflict with previous work from Zhang, in which Curly Tail Down phenotype from a cilia defective mutant (ZMYND10) was rescued by overexpression of urp1 peptide. It is possible that urp1 functions in place of the natural peptide for this function. As before there are four paralogs of urotensin peptides. The second conflicting observation from Zhang is that embryos injected with morpholino to urp1 shows Curly Tail Down phenotype. It is well known that morpholinos can have off-target effects.
      3. The authors observe that urp1/urp2 double mutants have almost no side-to-side defects and all the obvious bends are in the D/V plane. Does this uncouple two systems for posture? If this separate a DV from side-to-side postural control system, that would be amazing.
      4. The authors provide evidence that curves are dynamic and erasable between 30 dpf and 3 mpf. This could be a time window to apply therapeutics.
      5. The authors provide a new graphic tool, a chart that logs the location of the apical curve vertebra (Figure 2H and SFigure 3C). This will allow better comparison between various scoliosis mutants.
      6. The authors describe 3 different version of scoliosis in 3 mutants. In cfap298 mutants (immotile cilia) curves effect all 3 dimensions. In urp1/urp2-/- mutants, curves only appear in the D/V plane. In uts2r3 mutants, curves appear more caudal than those in urp1-/-,urp2-/- mutants, though it is not clear if these are 3D curves.

      Audience: Biologists and physicians interested in 1) scoliosis, 2) normal morphogenesis, and 3) maintenance of the spine, 4)neurophysiologists interested in postural control and regulation of repetitive movements, like walking and swimming.

      My expertise: zebrafish genetics, scoliosis, gastrulation

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01574

      Corresponding author(s): Casey, Greene

      1. General Statements [optional] We thank the reviewers for their thorough feedback. We have addressed all the points raised, revised the manuscript accordingly, and explained our changes below. To aid readability, the reviewers’ comments have been converted to italics, and our responses have been bolded.

      Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors systematically evaluate the performance of linear and non-linear ML methods for making predictions from gene expression data. The results are interesting and timely, and the experiments are well designed.

      I have a few minor comments:

      - It was hard for me to understand Figure 1B. I think a figure like this would be very helpful however. What do the numbers represent? If sample ID, then I am not sure why x-axis label is also "samples"

      - For analysis of GTEx data, not sure what "studywise splitting" would mean, since the GTEx dataset is one study? Do you leave out the same individuals from all tissues for evaluation?

      We thank the reviewer for their input on these two points. To make Figure 1B clearer and to elaborate on our stratified splitting methods, we have amended its description to “We stratify the samples into cross-validation folds based on their study (in Recount3) or donor (in GTEx). We also evaluate the effects of sample-wise splitting and pretraining (B).”

      - I found the sample size on x-axis of Fig 2a confusing. If I understand correctly, GTEx has a total of ~1000 subjects. So in some sense, effective sample size can not be bigger than 1000. If you are counting subjects x tissue as sample, then it can be misleading in terms of the effective sample size.

      We thank the reviewer for this point. To incorporate it into the manuscript, we’ve added the following text to the description of Fig. 2: “It is worth noting that "Sample Count" in these figures refers to the total number of RNA-seq samples, some of which share donors. As a result, the effective sample size may be lower than the sample count. “

      - Would be interesting to assess out-of-sample generalizability of linear and non-linear models. Have you tried training on GTEx and predicting on Recount3 or vice versa?

      This question intrigued us. We reran the tissue prediction experiments from the manuscript on a subset of the GTEx and Recount3 datasets in which we performed an intersection over tissues and genes. We found that in the out-of-sample domain the logistic regression model and the three layer neural network performed similarly, while the five layer net generally had a lower accuracy despite having similar accuracy in the training domain. We also found (consistent with our results in the paper) that GTEx predictions are an easier task than their Recount counterparts. Below are plots demonstrating these findings:

      [These plots appear in the PDF but do not appear to work in the ReviewCommons Form].

      Reviewer #1 (Significance (Required)):

      Important and timely study, evaluating linear vs non-linear methods for predicting phenotype from gene expression datasets.

      We appreciate the reviewer’s positive comments on the timeliness of our manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary

      The authors want to assess the presence of non-linear signal in gene expression values in the task of tissue and sex classification. They use logisitic regression classifiers and two types of neural networks, with 3 and 5 layers, and assess classification performance on two large expression datasets from Recount3 and GTEX and three simulated datasets.

      The authors carefully construct their learning setup in such a way that one can reason about the removal of linear signal from the expression features. The interesting conclusion is, that although the linear approach works well on both datasets, and sometimes even better than the more complex models. The authors convingly show, that there is a significant non-linearity in the gene expression data. However, just because it is "there" does not imply that any non-linear methods performs better.

      Major comments:

      - Are the key conclusions convincing?

      The authors did a good job in showing, that there is non-linear signal in gene expression features for the classification problems studied.

      We thank the reviewer for their positive feedback.

      - Should the authors qualify some of their claims as preliminary or speculative, or

      remove them altogether?

      The overall claims of the authors are justified, the discussion may be improved.

      We appreciate the reviewer’s support for our overall claims and we have adjusted the manuscript as noted point by point below.

      - Would additional experiments be essential to support the claims of the paper?

      No, additional experiments are not essential. But the authors did not compare to other non-linear methods such as SVM or knn-classifiers in the resulst or conclusion section. It is unlikely that the main conclusion would change if those methods were tried. But it is possible that other "simpler" non-linear methods, such as knn for example, are able to outperform the logistic regression classifier on the GTEX and Recount3 data set. Thus, the authors should at least mention this as part of the conclusion and could extend their discussion on the implications of their study concerning other tasks or models.

      We agree that there should be more discussion of other models in the conclusion section. We have updated the fifth paragraph of the conclusion accordingly:

      “We are also unable to make claims about all problem domains or model classes. There are many potential transcriptomic prediction tasks and many datasets to perform them on. While we show that non-linear signal is not always helpful in tissue or sex prediction, and others have shown the same for various disease prediction tasks, there may be problems where non-linear signal is more important. It is also possible that other classes of models, be they simpler nonlinear models or different neural network topologies are more capable of taking advantage of the nonlinear signal present in the data.”

      - Are the suggested experiments realistic in terms of time and resources?

      Not applicable.

      - Are the data and the methods presented in such a way that they can be reproduced?

      There is a separate github repo which has the code to reproduce the analyses. This is good. However, would be nice to explain in more detail in the manuscript how the limma function was used for removing the linear signal, as they mention the "removeBatchEffect" function was used, but it would be good to tell the reader how that works, as this is their way for assessing the effect of linear-signal removal. Are there any limitations for the assessment of signal removal in this way?

      We thank the reviewer for their input, and have updated the model training section on signal removal to read: “We also used Limma[24] to remove linear signal associated with tissues in the data. We ran the ‘removeBatchEffect’ function on the training and validation sets separately, using the tissue labels as batch labels. This function fits a linear model that learns to predict the training data from the batch labels, and uses that model to regress out the linear signal within the training data that is predictive of the batch labels.”

      We have also elaborated on the limitations of signal removal by updating the sentence “This experiment supported our decision to perform signal removal on the training and validation sets separately, as removing the linear signal in the full dataset induced predictive signal (supp. fig. 6)” to read “This experiment supported our decision to perform signal removal on the training and validation sets separately. One potential failure state when using the signal removal method would be if it induced new signal as it removed the old. This state can be seen when removing the linear signal in the full dataset(supp. fig. 6).”

      - Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      Minor comments:

      - Specific experimental issues that are easily addressable.

      no

      - Are prior studies referenced appropriately?

      Yes

      - Are the text and figures clear and accurate?

      *Also, they conducted 3 different experiments in Figure 3. It would be useful to separate the figure into 3) A, 3) B, and 3) C and link that specifically in the text. Figure 4 is an extended version of Figure 2, just with the additional results of the signal removed performances. *

      We appreciate the feedback. To make the figure and the text more clear, we have added A, B, and C subheadings to figure 3, and updated the subfigure’s references within the text accordingly.

      First, the pairwise results in 4B are hard to read as the differences in colors and line type are difficult to see as some lines are short. Second, we did not find it helpful to reproduce the full signal approach in Figure 4. We would suggest to make Figure 4 as Figure 2, and simply only talk about the Full signal mode in the beginning, how it is in the text.

      We agree. We have made Figure 4 our new Figure 2 and updated the references in the text.

      Further, it would be nice to give better names in the legends of these plots. Pytorch_lr is not a nice name.

      We thank the reviewer for pointing this out. We have updated the names in the legends to be “Five Layer Network”, “Three Layer Network”, and “Logistic Regression”

      - Do you have suggestions that would help the authors improve the presentation of

      their data and conclusions?

      As the Recount3 dataset is different in quality and complexity it would be reasonable to show the results of the binary classifcation also in the main paper. In particular, as this behaves different to the GTEX binary classification.

      We have now moved the Recount binary classification figure from the supplement to join the GTEx binary classification data as the new figure 4.

      -The title is somewhat unprecise. It may induce the impression that the paper is about expression-prediction, although that is not the case. Further, in the abstract they don't mention what prediction problem they solve and that these are classification problems. After reading the paper it is clear why the authors choose that, but we are suggesting an alternative title that the authors may consider:

      The effect of nonlinear signal in classification problems using gene expression values

      We agree with the reviewer’s comment and have updated our title to “The effect of non-linear signal in classification problems using gene expression”

      Further, they should give more details on the problem learned in the abstract.

      We thank the reviewer for their feedback, and have added details to the abstract about the problem domains. The relevant sentence now reads “We verified the presence of non-linear signal when predicting tissue and metadata sex labels from expression data by removing the predictive linear signal with Limma, and showed the removal ablated the performance of linear methods but not non-linear ones.”

      *-In addition, the conclusion section, which may be title as Disucssion and Conclusion, could contain additional points concerning the topology and training of the neural networks. *

      We have updated the heading of the final section to Discussion and Conclusion. To expand on the potential drawbacks of our neural network topologies, we have also updated the limitation portion of Discussion and Conclusion to read “We are also unable to make claims about all problem domains or model classes. There are many potential transcriptomic prediction tasks and many datasets to perform them on. While we show that non-linear signal is not always helpful in tissue or sex prediction, and others have shown the same for various disease prediction tasks, there may be problems where non-linear signal is more important. It is also possible that other classes of models, be they simpler nonlinear models or different neural network topologies are more capable of taking advantage of the nonlinear signal present in the data.”

      Obviously, it is possible that other simpler or more complex neural networks have a better performance on the GTEX and Recount3 data sets compared to logistic regression. In fact, the results from Figure4 suggest that, as there is clearly useful non-linear signal in those datasets for the classification problems studied. However, optimizing a non-linear model is inherently more complex and time-consuming, and thus may not be done thoroughly in previously published papers. Compared to a linear model that is easier and faster to optimize, this may be one reason why studies find that, despite non-linear signal, the linear model performs better. Other factors such as the samples size, which the authors already mention, of course also plays a big role, and if hundreds of thousands of datasets would be there , e.g. from single cell measurements, non-linear methods may have a better chance of outcompeting linear models.

      We agree, which is why we consider the signal removal experiment to be so important. By demonstrating that the non-linear methods we used were in fact learning non-linear signal we were able to show that there was something that non-linear models were able to learn that logistic regression was unable to. That is to say that while the presence of non-linearity in the decision boundary is necessary for non-linear models to outperform linear ones, it is not by itself sufficient. Perhaps with more data or a different model non-linear methods would perform better, but there is certainly a class of models and problems where logistic regression is preferable.

      Reviewer #2 (Significance (Required)):

      The submitted manuscript adds to the discussion of the necessity of non-linear models when solving classification problems using gene expression data. The significance is mostly technically, as a comparison of logistic regression and two neural network topologies that are being compared on two large expression datasets. However, there is also a conceptual part of the contribution, which is with regards to the implications of their experiments.

      Interested audience would be computer scientists and bioinformaticians or others, that are involved in creating or interpreting these or similar prediction models.

      Our field of expertise is in the creation of machine learning models using different types of OMICs data. All aspects of the work could be assessed.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the authors discuss an interesting problem regarding the comparative performance of linear and non-linear machine learning models. The main conclusion is that logistic regression (linear model) and neural networks (non-linear model) have comparable performance if the data contain both linear and non-linear relations between the features (X) and the prediction target (Y), however, if the linear component in the X-Y relation is removed (e.g. regressed out) the neural networks will outperform logistic regression. This conclusion implies that linear models such as logistic regression mainly relies on the linearity in the X-Y relation.

      However, whether X-Y relation has a linear component and whether the data (e.g. for different Y classes) are linearly separable are two different questions. For example, consider a data generating mechanism, y=x^2+x and label the data points using two classes (y1). Clearly, the data is linearly separable, and any machine learning algorithm should perform very well on this problem. Now remove the linear component form the X-Y relation and use y=x^2 to generate the data. The data is still linearly separable, and the performance of logistic regression should not be affected.

      We agree that there is a difference between optimal linear decision boundaries and linear relationships between elements in the training data. Our use of the term “relationship” in place of “decision boundary” was imprecise. To make this more clear, we have made the following changes:

      Introduction:

      “Unlike purely linear models such as logistic regression, non-linear models should learn more sophisticated representations of the relationships between expression and phenotype.” -> “Unlike purely linear models such as logistic regression, non-linear models can learn non-linear decision boundaries to differentiate phenotypes.”

      “However, upon removing the linear signals relating the phenotype to gene expression we find non-linear signal in the data even when the linear models outperform the non-linear ones.” -> “However, when we remove any linear separability from the data, we find non-linear models are still able to make useful predictions even when the linear models previously outperformed the nonlinear ones.”

      Discussion and conclusion:

      We removed the following paragraph: “Given that non-linear signal is present in our problem domains, why doesn’t that signal allow non-linear models to make better predictions? Perhaps the signal is simply drowned out. Recent work has shown that only a fraction of a percent of gene-gene relationships have strong non-linear correlation despite a weak linear one [23].”

      The point is that the performance of linear models is mainly dependent on whether the data are linearly separable instead of the linearity in X-Y relation as the manuscript suggests.

      We agree that this is the key point and appreciate the reviewer for helping us to more carefully hone the language to convey this point.

      Reviewer #3 (Significance (Required)):

      The performance comparison between linear and non-linear machine learning models is important.

      We appreciate the reviewer’s recognition of the significance of the work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, the authors discuss an interesting problem regarding the comparative performance of linear and non-linear machine learning models. The main conclusion is that logistic regression (linear model) and neural networks (non-linear model) have comparable performance if the data contain both linear and non-linear relations between the features (X) and the prediction target (Y), however, if the linear component in the X-Y relation is removed (e.g. regressed out) the neural networks will outperform logistic regression. This conclusion implies that linear models such as logistic regression mainly relies on the linearity in the X-Y relation. However, whether X-Y relation has a linear component and whether the data (e.g. for different Y classes) are linearly separable are two different questions. For example, consider a data generating mechanism, y=x^2+x and label the data points using two classes (y<=1 and y>1). Clearly, the data is linearly separable, and any machine learning algorithm should perform very well on this problem. Now remove the linear component form the X-Y relation and use y=x^2 to generate the data. The data is still linearly separable, and the performance of logistic regression should not be affected. <br /> The point is that the performance of linear models is mainly dependent on whether the data are linearly separable instead of the linearity in X-Y relation as the manuscript suggests.

      Significance

      The performance comparison between linear and non-linear machine learning models is important.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The authors want to assess the presence of non-linear signal in gene expression values in the task of tissue and sex classification. They use logisitic regression classifiers and two types of neural networks, with 3 and 5 layers, and assess classification performance on two large expression datasets from Recount3 and GTEX and three simulated datasets. The authors carefully construct their learning setup in such a way that one can reason about the removal of linear signal from the expression features. The interesting conclusion is, that although the linear approach works well on both datasets, and sometimes even better than the more complex models. The authors convingly show, that there is a significant non-linearity in the gene expression data. However, just because it is "there" does not imply that any non-linear methods performs better.

      Major comments:

      • Are the key conclusions convincing?

      The authors did a good job in showing, that there is non-linear signal in gene expression features for the classification problems studied. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The overall claims of the authors are justified, the discussion may be improved. - Would additional experiments be essential to support the claims of the paper?

      No, additional experiments are not essential. But the authors did not compare to other non-linear methods such as SVM or knn-classifiers in the resulst or conclusion section. It is unlikely that the main conclusion would change if those methods were tried. But it is possible that other "simpler" non-linear methods, such as knn for example, are able to outperform the logistic regression classifier on the GTEX and Recount3 data set. Thus, the authors should at least mention this as part of the conclusion and could extend their discussion on the implications of their study concerning other tasks or models. - Are the suggested experiments realistic in terms of time and resources?

      Not applicable. - Are the data and the methods presented in such a way that they can be reproduced?

      There is a separate github repo which has the code to reproduce the analyses. This is good. However, would be nice to explain in more detail in the manuscript how the limma function was used for removing the linear signal, as they mention the "removeBatchEffect" function was used, but it would be good to tell the reader how that works, as this is their way for assessing the effect of linear-signal removal. Are there any limitations for the assessment of signal removal in this way? - Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      Minor comments:

      • Specific experimental issues that are easily addressable.

      no - Are prior studies referenced appropriately?

      Yes - Are the text and figures clear and accurate?

      Also, they conducted 3 different experiments in Figure 3. It would be useful to separate the figure into 3) A, 3) B, and 3) C and link that specifically in the text. Figure 4 is an extended version of Figure 2, just with the additional results of the signal removed performances. First, the pairwise results in 4B are hard to read as the differences in colors and line type are difficult to see as some lines are short. Second, we did not find it helpful to reproduce the full signal approach in Figure 4. We would suggest to make Figure 4 as Figure 2, and simply only talk about the Full signal mode in the beginning, how it is in the text. Further, it would be nice to give better names in the legends of these plots. Pytorch_lr is not a nice name. - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      As the Recount3 dataset is different in quality and complexity it would be reasonable to show the results of the binary classifcation also in the main paper. In particular, as this behaves different to the GTEX binary classification. - The title is somewhat unprecise. It may induce the impression that the paper is about expression-prediction, although that is not the case. Further, in the abstract they don't mention what prediction problem they solve and that these are classification problems. After reading the paper it is clear why the authors choose that, but we are suggesting an alternative title that the authors may consider:

      The effect of nonlinear signal in classification problems using gene expression values

      Further, they should give more details on the problem learned in the abstract. - In addition, the conclusion section, which may be title as Disucssion and Conclusion, could contain additional points concerning the topology and training of the neural networks. Obviously, it is possible that other simpler or more complex neural networks have a better performance on the GTEX and Recount3 data sets compared to logistic regression. In fact, the results from Figure4 suggest that, as there is clearly useful non-linear signal in those datasets for the classification problems studied. However, optimizing a non-linear model is inherently more complex and time-consuming, and thus may not be done thoroughly in previously published papers. Compared to a linear model that is easier and faster to optimize, this may be one reason why studies find that, despite non-linear signal, the linear model performs better. Other factors such as the samples size, which the authors already mention, of course also plays a big role, and if hundreds of thousands of datasets would be there , e.g. from single cell measurements, non-linear methods may have a better chance of outcompeting linear models.

      Significance

      The submitted manuscript adds to the discussion of the necessity of non-linear models when solving classification problems using gene expression data. The significance is mostly technically, as a comparison of logistic regression and two neural network topologies that are being compared on two large expression datasets. However, there is also a conceptual part of the contribution, which is with regards to the implications of their experiments.

      Interested audience would be computer scientists and bioinformaticians or others, that are involved in creating or interpreting these or similar prediction models.

      Our field of expertise is in the creation of machine learning models using different types of OMICs data. All aspects of the work could be assessed.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors systematically evaluate the performance of linear and non-linear ML methods for making predictions from gene expression data. The results are interesting and timely, and the experiments are well designed.

      I have a few minor comments:

      • It was hard for me to understand Figure 1B. I think a figure like this would be very helpful however. What do the numbers represent? If sample ID, then I am not sure why x-axis label is also "samples"
      • For analysis of GTEx data, not sure what "studywise splitting" would mean, since the GTEx dataset is one study? Do you leave out the same individuals from all tissues for evaluation?
      • I found the sample size on x-axis of Fig 2a confusing. If I understand correctly, GTEx has a total of ~1000 subjects. So in some sense, effective sample size can not be bigger than 1000. If you are counting subjects x tissue as sample, then it can be misleading in terms of the effective sample size.
      • Would be interesting to assess out-of-sample generalizability of linear and non-linear models. Have you tried training on GTEx and predicting on Recount3 or vice versa?

      Significance

      Important and timely study, evaluating linear vs non-linear methods for predicting phenotype from gene expression datasets.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      We thank the reviewers for their excellent suggestions and constructive comments. We now added new data on PE15/PPE20 binding to Ca2+, the PDIM status of mutant strains, additional controls, added to the discussion, added detail to the Methods, and provide all RNA-seq data. Please see replies to the comments in detail below:

      Reviewer 1:

      Major points

      1. Cellular localization:
      2. “The authors do not describe the cellular fractionation method…”, “The authors show some Western blot data in Fig. S3, though the legend is superficial (abbreviations not explained) and the controls with markers for cellular localization appear to be lacking”. “Further, the authors do not prove that FLAG-tagged PE20 is functional.”

      We included a description of the fractionation method in Materials and Methods (lines 475-485). We also added detail to the legend of Fig. 4A to explain the abbreviations and controls used. The same cell fractions were used in Fig. 4A and Fig. S3A, as mentioned in the Figure S3 legend (“The same cell fractions as in Fig. 4A were used, see controls therein”). We know that the FLAG-tagged PPE20 is functional because the strain used in this experiment is the same we used for genetic complementation experiments in which FLAG-tagged PPE20 functionally complements ppe20 deletion in all three assays (ATP consumption, biofilm, Ca2+ influx, Fig.4 B,C,D,G).

      • “The authors should extend discussion part of the manuscript. Several proteomic studies.” “Did authors analyze culture filtrate fraction by Western?

      We thank the reviewer for the references and extended the Discussion to include results from existing proteomic studies on PE15/PPE20 (lines 229-234). We did not test for PE15/PPE20 in culture filtrate, and previous proteomic results are contradictory. Several PE/PPE proteins, including PE15/PPE20 have been detected in the cell wall and in the CFP, but not consistently. The functional significance of this dual localization is unclear.

      1. Is PE15/PPE20 a channel?

      2. “PPE20 purified alone from the cytosol of E. coli?”

      We did not purify either protein by itself. As the reviewer correctly notes, PE/PPE proteins are refractory to individual purification. We clarified that we purified and used the complex for experiments even if only PPE20 is shown, as in Figure 3C,D, and E (Lines 124-127). See also Methods line 382 ff.

      • “…a positive control of a mutant that is indeed deficient in Mg2+ import (and thus showing a phenotype) is lacking.”

      Lacking a specific Mg2+ import mutant, and because it is a relatively minor point, we removed the statements about selectivity.

      1. Thermal melting assay

      2. It is surprising to see that the thermal melting assays was done for PPE20 and PE15 as separately purified proteins.

      We co-purified PE15 and PPE20 for all biochemical experiments. We clarified that point (see also point 2 above).

      • “the thermal melting assay only seemed to give some results for PPE20 alone, and not for PE15”

      PE15 did not produce interpretable results in this assay, as mentioned in line 144. We clarified in the Fig. 3 legend that the complex was used although only PPE20 is detected by Western blot and shown in Figure 3C.

      • “…the results are counter-intuitive… How can the authors be sure that the presence of Ca2+ does not simply lead to more protein precipitation (via rather unspecific interactions) at elevated temperatures? Some positive controls with bona fide calcium binding protein in the same thermal melting setup would have helped to clarify this.”

      The effect of Ca2+ on PPE20 is somewhat counterintuitive, although not unprecedented. Proteins can be stabilized or destabilized by ligand binding, and a recent proteome-wide study on the basis of thermal shift analysis showed that ~17% of proteins were destabilized by ligand (ATP). For a channel in particular, ligand binding might be expected to be coupled to protein relaxation in the process of channel opening, which could well translate to lower thermal stability. We added the positive control showing the behavior of a known Ca2+ binding protein (new Fig. S2A). In addition, we included a negative control showing that Ca2+ does not generally increase protein denaturation (Fig. S2B). We think that this control addresses the reviewer’s concern more directly.

      • If the authors want to stick to their claims regarding Ca2+ binding to PE15/PPE20, they have to perform additional assays (e.g. equilibrium dialysis or ITC) with the entire PE15/PPE20 complex. Further, they have to show that PE15/PPE20 forms a proper oligomeric protein that is membrane bound and reasonably behaved on size exclusion chromatography, when expressed in and purified from E. coli.

      Detecting Ca2+ binding to proteins is not trivial, and we thank the reviewer for suggesting equilibrium dialysis as another, orthogonal assay. We now show an equilibrium dialysis experiment that confirms Ca2+ binding by the PE15/PPE20 complex. Please see the new Fig. 3F. and G. and lines 146-152 (Results) and 429-443 (Methods).

      The PE/PPE proteins are generally difficult to express and purify recombinantly, likely due to the typically large unstructured regions. Also, the yield of PE15/PPE20 when expressed in E. coli was very low so that we were not able to detect the complex by SEC. However, data in Fig. 3 conclusively show that PE15 and PPE20 bind.

      1. RNA-seq data

      2. The authors should include a table with all other identified genes that are potentially involved in calcium homeostasis

      We provided all other significant differentially expressed genes in the new Table S1.

      Minor points:

      1. “what is the binding affinity of the Ca sensor?”

      We added the Ca2+ binding affinity of Twitch-2B (KD: 200nM) in line 176.

      1. Figure 4D: “one would expect a drop in FRET signal after EGTA addition… Can the authors explain?”

      We do see a clear drop in FRET signal after EGTA addition, in particular in 7H9 medium (black versus red line, Fig. 5B). Given the high affinity of Twitch-2B for Ca2+ (200nM), however, it is not surprising that the drop is not more pronounced, as intracellular Ca2+ is expected to be tightly bound to Twitch.

      1. The experiments showing outer membrane localization of PE15/PPE20 are very important, but results of these experiments (western-blot and FRET) are shown in supplementary figures. They should be transferred/integrated into the main Figures.

      We agree and moved Figure S3A to the main Figures as Figure 4A.

      1. Line 166: the authors claim that the assay did not work in 7H9 due to low Ca2+ concentration in this medium. Why did the authors not just add a bit more calcium to show whether this claim holds true?

      7H9 is not a suitable medium for these experiments because the baseline Ca2+ concentration is too high, not too low (see Fig. 5B, grey versus black line). Adding more Ca2+ to 7H9 medium resulted in precipitation, probably due to its interaction with phosphates. Our use of “low” in this context was confusing, we changed the wording of this sentence (line 180-181).

      1. Line 183: more detailed description on cellular fractionation and subsequent anti-FLAG Western needed here.

      We added more detail in the Materials section (lines 475 ff).

      Reviewer 2:

      • A major concern regarding the importance of the data: there are considerable technical challenges in generating Ca2+ depleted media. This is clear in that M. tuberculosis seems to be unaffected by Ca2+ in the medium - similar growth seems in Ca2+-free media to media with up to 10mM Ca2+ (Fig. S1). This raises a concern about the physiological relevance of the data (mammalian cells have intracellular Ca2+ of 0.01-0.1mM, extracellular free Ca2+ is around 1mM).

      If we correctly understand this comment, the reviewer is unconvinced that we fully and reproducibly depleted Ca2+ from medium based on a lack of an effect of Ca2+ on in vitro growth. We tested for baseline Ca2+ levels and depletion in media by inductively coupled plasma optical emission spectrometry and added these data showing precise quantitation of Ca2+ in medium (see new Fig. S1B).

      • The role of PE15/PPE20 in Ca2+ acquisition may be clearer if the authors ensure that the PDIM layer is intact. Specifically, there is a technical issue: The authors use Tween80 as a detergent. Tween-80 partially strips the outer cell wall of M. tuberculosis resulting in shedding of PDIM and PE/PPE proteins. Tyloxapol is a somewhat milder detergent. Some of the experiments would possibly show clearer phenotypes by use of Tyloxapol.

      We share the concern about PDIM, as PDIM loss is common in in vitro culture. We analyzed the total lipids by thin layer chromatography and confirmed the presence of PDIM in all three strains (Fig S3C, lines 198-201). We repeated experiments with Tyloxapol and did not see differences to Tween-80. We nonetheless now show the Tyloxapol data (Fig 5D).

      • The authors could increase the impact of their work be exploring the role of PE15/PPE20 during pathogenesis of resting versus activated bone marrow macrophages where Ca2+ fluxes of the host cell play a role in host responses.

      We agree. In vivo or macrophage experiments are a logical next step to fully characterize the function of PE15/PPE20, but we think it is beyond the scope of this manuscript. The main contribution of this paper is the identification of channel function of a PE/PPE protein pair that extends the novel channel paradigm for these proteins. These data support that transport might indeed be a shared function of the entire PE/PPE family with 169 members.

      Minor:

      • The authors should consider citing Sharma et al (2021)

      We cited the paper.

      • Are there Ca2+ binding motifs in PPE20?

      We did not detect canonical Ca2+ binding motifs in PPE20.

      • RNAseq data may need to be deposited in a public database.

      RNA-seq data have been deposited to NCBI - GEO accession GSE214266

      Link: https://urldefense.com/v3/https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE214266;!!NuzbfyPwt6ZyPHQ!tCf4MS_HRKJFn6qV2orkDAkXTWvx9IIU11fAV7TguYE2ietoMBpBgRC7rvfnM9bsoiVdIvDBUHdPmHZliDP2o5sRZR2ziK4$

      Token: cvmhakcgbpmbfuz

      • In its current state, the work is somewhat incremental

      The function of the large PE/PPE protein family of Mtb has been one of the most longstanding and perplexing puzzles in Mtb biology. For more than 20 years, speculation about their potential role, for example in antigenic variation, abounded but no conclusive evidence for this or another shared function emerged. A recent landmark paper then conclusively showed that a subset of the PE/PPE proteins function as nutrient channels (Wang et al., Science 2020). However, whether transporter function is a general function of the family of 169 PE/PPE proteins remains untested. Our PE/PPE pair is associated with a different type VII secretion system (Esx-3) and belongs to a different subfamily than the previous examples, suggesting a shared function across families and perhaps even all of these proteins. Given the intense interest and many false leads that have plagued the identification of PE/PPE function in the last 20 years, the difficulty of working with them biochemically, as well as the almost complete absence of understanding of Ca2+ homeostasis in Mtb, we do not consider our work incremental.

      Reviewer 3

      • My only slight concern is the meaning attached to the "biofilm" assays. It is never very clear to me that this is anything more than formation of a surface pellicle and general hydrophobicity of the mycobacterial cells.

      We fully agree that Mtb biofilms remain poorly defined. However, the term biofilm as used in our study has already found its way into the literature and we would rather not cause confusion by calling the same phenomenon by a different name. Whatever the term used, we do not suggest any other relevance other than it being a Ca2+-dependent phenotype that serves as one of several tests to parse PE15/PPE20’s role in Ca2+ homeostasis.

      Cross-consultation comments:

      • We agree with the concerns of reviewer#2 that the role of PDIM and use of detergent should be looked at more closely.

      We tested the roles of PDIM and detergent, see reviewer 2.

      • Likewise, the paper would strongly benefit from some further insights into the potential physiological role of PPE20/PE15 in calcium homeostasis.

      We show PE15/PPE20 function in the transport of Ca2+ and the first Ca2+-related cellular phenotypes in Mtb. Testing the role of the complex in an infection model is outside of the scope of this manuscript and mouse infection experiments would take many months and would likely be intractable because of the expected extensive redundancy among the 169 PE/PPE proteins.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Review of Boradia et al, "Calcium transport by the Mycobacterium tuberculosis PE15/PPE20 proteins" This manuscript describes studies aimed at understanding the role of calcium in the pathogenesis of tuberculosis. The authors begin by showing that, analogous to the situation in other bacteria, ATP levels are directly (and rather dramatically) affected by extracellular calcium levels. The authors then look at the effect on "biofilm formation" and, again analogous to other bacteria, find a link. The authors then perform RNAseq on bacterial cells with and without 1mM Ca++ and identify a pair of genes that is strongly downregulated by calcium sufficiency. These genes are PE/PPE family members which have been recently associated with channel formation in the mycomembrane to allow transport of small molecule solutes across the outer cell envelope. The authors show these proteins are associated in a complex by reciprocal pull-down experiments in tagged proteins and show directly that they bind calcium by a thermal stability change of this complex in the presence of calcium. Finally, they show, using a calcium sensitive FRET reporter expressed in Mtb, that these two proteins allow calcium influx and that such an influx is blocked in a strain where they have been deleted.

      Overall, the study is excellent and convincingly establishes the transport function of another pair of PE/PPE proteins. My only concern with this is that they stop just short of delving into the actual infection biology of calcium, but I suppose that will be next. The tools they developed in this study, specifically the knockout strain and the FRET reporter, put them in a strong position to explore the role of calcium during growth in macrophages and other in vivo studies that are surely planned.

      My only slight concern is the meaning attached to the "biofilm" assays. It is never very clear to me that this is anything more than formation of a surface pellicle and general hydrophobicity of the mycobacterial cells. I wonder if the presence of calcium alters the aggregation state of the bacilli and or affects the surface in some more subtle manner. I am not convinced that the word "biofilm" as it is used commonly in other bacteria, has anything to do with the physical properties that are being observed in the case of Mtb.

      Significance

      The manuscript clearly establishes that this pair of PE/PPE proteins plays a direct role in calcium transport in MTB and provides several useful tools to begin to understand the role of calcium in TB pathogenesis. The work is outstanding and novel.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors demonstrate that M. tuberculosis responds to increasing Ca2+ concentrations by increasing ATP levels as well as increased ability to form biofilms. Culturing of M. tuberculosis with Ca2+ results in downregulation of pe15/ppe20. The proteins are recombinantly expressed and the results show that PE15 and PPE20 form a complex. In addition, PPE20 seems to be destabilized by Ca2+ suggesting that it interacts with this metal ion. A pe15/ppe20 knockout shows lower levels of ATP increase upon incubation with Ca2+ although the differences with would-type are very modest. Similarly, the knockout shows an impaired ability to form biofilms at 1mM and 10mM Ca2+. Finally, the authors make a FRET reporter of intracellular Ca2+ concentrations based on the Twitch system which nicely shows that intracellular Ca2+ levels are lower in the knockout mutant.

      Overall, the data suggest that PE15/PPE20 are involved in Ca2+ uptake which contributes to our evolving understanding of the role of the different PE/PPE proteins in nutrient acquisition. The highlight of the paper is the Twitch bioreporter for Ca2+ which could be useful in exploring the role of Ca2+ in mycobacteria.

      A major concern regarding the importance of the data: there are considerable technical challenges in generating Ca2+ depleted media. This is clear in that M. tuberculosis seems to be unaffected by Ca2+ in the medium - similar growth seems in Ca2+-free media to media with up to 10mM Ca2+ (Fig. S1). This raises a concern about the physiological relevance of the data (mammalian cells have intracellular Ca2+ of 0.01-0.1mM, extracellular free Ca2+ is around 1mM). The role of PE15/PPE20 in Ca2+ acquisition may be clearer if the authors ensure that the PDIM layer is intact. Specifically, there is a technical issue: The authors use Tween80 as a detergent. Tween-80 partially strips the outer cell wall of M. tuberculosis resulting in shedding of PDIM and PE/PPE proteins. Tyloxapol is a somewhat milder detergent. Some of the experiments would possibly show clearer phenotypes by use of Tyloxapol. In experiments where clumping is not a concern (ATP measurement), the cells can be pre-grown as indicated but then transferred to the multiwell plates in detergent-free media. At the time of processing of the cells for readout of, for example ATP, detergent can be used as needed. The authors could increase the impact of their work be exploring the role of PE15/PPE20 during pathogenesis of resting versus activated bone marrow macrophages where Ca2+ fluxes of the host cell play a role in host responses.

      Minor:

      The authors should consider citing Sharma et al (2021): PGRS Domain of Rv0297 of Mycobacterium tuberculosis functions in A Calcium Dependent Manner

      Are there Ca2+ binding motifs in PPE20?

      RNAseq data may need to be deposited in a public database.

      Significance

      In its current state, this work is somewhat incremental: the authors have provided data that suggest that PE15/PP20 are involved in Ca2+ uptake (data could be strengthened as suggested above). The physiological relevance of the PE15/PPE20 system remains unclear - no data on its role in pathogenesis.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      PE/PPE proteins build up 10% of genome of Mtb, but the function of these proteins is only currently investigated in more detail. Recent studies show the involvement of individual PE/PPE proteins in the transport of nutrients, and more data supporting this functional role are about to emerge. Using RNA-seq, Boradia et al identified the pe15/ppe20 genes to be downregulated in response to calcium exposure. Purified PPE20 (but not PE15) appear to bind calcium in a thermal stability assay (though this claim needs further experimental support). The authors generated a Mtb pe15/ppe20 knockout strain and convincingly show in three types of assays (ATP levels, biofilm formation and lower signal in FRET measurements corresponding to lower calcium concentrations compared to the wild type strain) that the PE15/PPE20 proteins are involved in cellular calcium import, but do not appear to import magnesium. All phenotypes could be restored to the behavior of wildtype Mtb by complementing the KO strain with pe15/ppe20.

      The manuscript is clearly written and easy to follow. The authors combined molecular biology (RNA-seq), biochemistry (proteins purification), biophysics (FRET) and microbiology (knockout generation, in vivo measurements) to reach their conclusions. Overall, the study reports novel and interesting data and as such is of high interest for the mycobacterial research community. However, some of the claims have a rather experimental basis, and thus the study needs to be strengthened with further experiments (or statements have to be removed) as outlined below.

      Major points:

      1. Cellular localization of PE15/PPE20.

      The authors do not describe the cellular fractionation method they applied (no mentioning of cellular localization experiments in materials and methods). The same applies to the main text (very superficial description). The authors show some Western blot data in Fig. S3, though the legend is superficial (abbreviations not explained) and the controls with markers for cellular localization appear to be lacking. Further, the authors do not prove that the FLAG-tagged PPE20 is functional.

      The authors should extend discussion part of the manuscript. Several proteomic studies did not identify PE15 or PPE20 in the cell wall - doi: 10.1021/pr1005873, doi: 10.1091/mbc.E04-04-0329. At the same time PE15 (but not PPE20) is membrane or membrane-associated protein according to this work: doi: 10.1186/1471-2180-10-132. Quite recent work (https://doi.org/10.1073/pnas.1523321113) showed that PE15/PPE20 are secreted substrates of ESX-3 and these proteins have been found in the culture filtrate. Did authors analyze culture filtrate fraction by Western blotting? 2. Is PE15/PPE20 a channel?

      A major claim of the authors is that PE15/PPE20 forms a (specific) channel for Ca2+ and not a porin-like protein that is permeable to a large set of solutes. However, this claim has its main experimental backing that PPE20 (purified alone from the cytosol of E. coli?) binds to Calcium in a (rather weirdly looking) "thermal melting assay" (further comments on these assays, see below). The second experiment supporting this idea is a lack of difference between wt and KO strain of Mtb in an assay that should report Mg2+ transport deficiency (Fig. S3). But here, a positive control of a mutant that is indeed deficient in Mg2+ import (and thus showing a phenotype) is lacking. In conclusion, the experimental basis on the grounds of which the authors claim PE15/PPE20 to be a specific Mg2+ channel is weak. On the other hand, the functional data clearly show a link between PE15/PPE20 and calcium uptake: Hence the data are solid enough to claim that PE15/PPE20 facilitates Ca2+ transport across the mycomembrane. 3. Thermal melting assay.

      It is surprising to see that the thermal melting assays was done for PPE20 and PE15 as separately purified proteins. How did you purify PPE20 alone for this assay? It is broadly accepted for PE/PPE proteins that they only can be purified as pairs, including for PE15/PPE20 (https://doi.org/10.1073/pnas.0602606103). As for the cellular localization, the method section falls short in providing relevant information on how PPE20 and PE15 were purified in separate forms (it states they were co-expressed using a pETDuet vector). Further, the thermal melting assay only seemed to give some results for PPE20 alone, and not for PE15. There is no mentioning of the PE15/PPE20 complex in this assay. Further, the results are counter-intuitive, as Ca2+ addition leads to more precipitation at higher temperatures (and it does seem to weaken the stability of PPE20 instead of stabilizing it). How can the authors be sure that the presence of Ca2+ does not simply lead to more protein precipitation (via rather unspecific interactions) at elevated temperatures? Some positive controls with bona fide calcium binding protein in the same thermal melting setup would have helped to clarify this.

      If the authors want to stick to their claims regarding Ca2+ binding to PE15/PPE20, they have to perform additional assays (e.g. equilibrium dialysis or ITC) with the entire PE15/PPE20 complex. Further, they have to show that PE15/PPE20 forms a proper oligomeric protein that is membrane bound and reasonably behaved on size exclusion chromatography, when expressed in and purified from E. coli. As it is doubtful that the authors can meet such quality standards, I would recommend to remove all statements regarding Ca2+ binding to PPE20 from the manuscript, as the underlying experiments are of poor quality. 4. RNA-seq data

      The authors should include a table with all other identified genes that are potentially involved in calcium homeostasis. This is of interest because the KO strain is still capable of calcium import, hence other Ca2+ transport systems likely exist.

      Minor comments:

      1. FRET experiments What is the binding affinity of the sensor for calcium?
      2. Figure 4D: one would expect a drop in FRET signal after EGTA addition, because this reverts the Ca2+ gradient from out-to-in (thus facilitating calcium flow into the cells) to in-to-out (EGTA actually acting as a sink into which all Ca2+ (also the one from within the cell) would flow). Can the authors explain?
      3. The experiments showing outer membrane localization of PE15/PPE20 are very important, but results of these experiments (western-blot and FRET) are shown in supplementary figures. They should be transferred/integrated into the main Figures.
      4. Line 166: the authors claim that the assay did not work in 7H9 due to low Ca2+ concentration in this medium. Why did the authors not just add a bit more calcium to show whether this claim holds true?
      5. Line 183: more detailed description on cellular fractionation and subsequent anti-FLAG Western needed here.

      Referees cross-commenting

      We agree with the concerns of reviewer#2 that the role of PDIM and use of detergent should be looked at more closely.

      Likewise, the paper would strongly benefit from some further insights into the potential physiological role of PPE20/PE15 in calcium homeostasis.

      Significance

      Slow-growing mycobacteria like Mtb lack porins. Therefore, it is not clear how nutrients can be transported through the outer membrane. More and more data hint on PE/PPE protein family that can fulfill this function (Wang et al., Science 367, 1147-1151 (2020)). In the current work, the authors show that PE15/PPE20 are involved in calcium transport in Mtb. Mtb is a difficult model organism to work with because of its pathogenicity and slow rate of growth. Therefore, any information on nutrients transport in Mtb is highly appreciable.

      The RNA-seq experiments as well as the genetic/functional experiments clearly show that PE15/PPE20 facilitates calcium import in Mtb. The corresponding sections and figures are convincing.

      The experimental data attempting to show PE15/PPE20's cellular localization and its interaction with Ca2+ are currently weak, and need to be strengthened.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This paper demonstrates a link between oxidative stress, lipid biosynthesis, and targeted histone acetylation in fission yeast. In mutant cells with defects in lipid synthesis (cbf11, mga2 lacking transcription factors, and cut6 lacking acetyl-CoA carboxylase), transcripts of a number of genes implicated in resistance to oxidative stress are increased. This is associated with higher levels of H3K9 acetylation and increased tolerance to oxidative stress. These effects are mediated through Sty1, a stress-activated MAP kinase and the transcription factor Atf1.

      It is also shown that H3K9 acetylation levels in the promoter region and just downstream of the transcriptional start site are increased in cbf11 mutants (Fig. 5A).

      By mutational analysis, the authors implicate the acetyl transferases Mst1 and Gcn5 in this transcriptional effect. Other related acetyl transferases, Hat1, Elp3, Mst2, Rtt109 have been ruled out as main contributors to the dysregulation in unstressed cbf11 mutants. That specific acetyl transferases have been shown to be required is a strength of the investigation.

      Major comments:

      The hypothesis is put forward in the manuscript that altered acetyl-CoA levels in cbf1 mutants would underlie the dysregulation of genes induced by oxidative stress. Histone acetyl transferases compete for acetyl-CoA with lipid biosynthesis, and so with increased demand for acetyl-CoA underacetylation in the concerned promoters would result - specifically at H3K9. These results do not directly support the hypothesis, on the other hand they are not sufficient to rule it out.

      Actually, we view this phenomenon the other way round: We primarily focus on exponentially growing cells, which have substantial demand for fatty acid (FA) production (= high acetyl-CoA consumption). So the level of promoter histone acetylation under these conditions is our baseline, or “normal” state. When FA production is decreased (cbf11 or cut6 mutants; inhibition of FA synthase by cerulenin…), stress gene promoters get *hyper*acetylated. We do not have any data on (or claims about) histone underacetylation compared to the baseline. Nevertheless, we now show that overexpression of Cut6/ACC results in decreased resistance to oxidative stress (Fig. 5C), which is compatible with the notion that increased acetyl-CoA consumption would result in insufficient histone acetylation at stress gene promoters during stress.

      Acetyl-CoA levels were measured only in undisturbed cells, and the possibility remains that under oxidative stress there would be changes in acetyl-CoA pools that could explain this apparent contradiction - why did not the authors examine that?

      Under oxidative stress, the Sty1 stress MAPK is activated, leading to a massive Atf1-dependent transcription wave, which is also associated with increased SAGA-dependent H3K9 acetylation (PMID: 21515633). This well-studied cellular response, however, is not the main focus of our study. Rather, we found a novel connection between perturbed lipid metabolism and increased expression of stress genes in cells *not challenged* by oxidative stress (i.e. Sty1-Atf1 are not hyperactivated). This is why we only measured acetyl-CoA concentrations in untreated cells.

      The authors argue that although the global acetyl-CoA levels are not increased, local concentrations might be altered in a way to permit higher H3K9 acetylation levels at selected promoters. Although a formal possibility, this is rather far-fetched as a small and freely diffusible molecule like acetyl-CoA should quickly equilibrate within one cellular compartment. I think that although the overall relationships that the authors have established between oxidative stress, H3K9 acetylation levels with increased expression, and lipid biosynthesis, are compelling, the role of acetyl-CoA concentrations is not clear and should be de-emphasized.

      Interestingly, acetyl-CoA production in the nucleus has been published by several studies (reviewed in PMID: 29174173), suggesting that local acetyl-CoA concentrations (microgradients) within the cell are functionally relevant. We agree that acetyl-CoA is a small molecule which, in theory, should diffuse quickly throughout the nucleocytoplasmic space. However, empirical evidence shows that the lipid synthesis in the cytosol and histone acetylation in the nucleus may not access a uniform nuclear-cytosolic pool of acetyl-CoA (PMID: 28099844, PMID: 28552616). This is related to the fact that the acetyl-CoA sink is large and acetyl-CoA may react with many proteins (i.e. any extra amounts will be consumed rapidly).

      Even though we provide strong evidence that HAT activity is critical for the crosstalk between FA synthesis and stress gene expression, we do agree that we have not conclusively established the role of acetyl-CoA in the process. However, we still feel that it is justified to point out acetyl-CoA is a “possible” mediator molecule for the crosstalk in the Results and Discussion sections.

      Minor comments:

      In many of the bar diagrams, only a borderline statistical significance is indicated (p ~ 0.05) despite seemingly large numerical differences between the means. In the legends it is stated that one-sided Mann-Whitney U tests were used. This is a non-parametric test with low power - would it not have been better to use a t test?

      We do agree that the non-parametric Mann-Whitney U test is rather conservative and, therefore, less sensitive for small sample sizes, such as n = 3. Our reason for using this particular test instead of the parametric t-test is that qPCR fold-change values come from a log-normal distribution, which is incompatible with t-test (requires normal distribution of data). Importantly, using conservative statistical testing does not invalidate our conclusions.

      What do the error bars in the diagram show, SEM? If a non-parametric test is used, a parametric measure of variability is irrelevant.

      The error bars represent standard deviation (SD). We do not see an issue here as, in our opinion, the visual style of numeric data presentation is independent from any chosen statistical testing methods.

      It would be helpful to the reader to indicate directly in the diagram panels what is actually shown, not just "fold change vs ..." In Fig. 1, 2, 4 D and 5 we see mRNA levels, in Fig. 3 chromatin IP.

      Done

      Reviewer #1 (Significance (Required)):

      The paper represents conceptual advances for our understanding of how stress responses, metabolism and transcriptional regulation are linked, although one of the links (acetyl-CoA levels in this case) is tenuous.

      This manuscript belongs in a rich literature on stress responses on the gene expression level, mostly from studies in yeast. Potentially, it adds entirely new information on how cellular stress may be mechanistially linked to stress responses.

      These results are potentially general and of broad interest to the biological community.

      This reviewer is familiar with yeast genetics, stress responses, and quantification of gene expression.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      As more and more metabolic intermediates are found to also serve as co-factors for epigenetic modifications, it has been widely accepted that regulating the levels of these key metabolites can be an effective way to control nutrient related gene expression. Acetyl-CoA is one of those early examples. Increased acetyl-CoA was shown to promote local acetylation at growth genes (Mol Cell 2011 PMID: 21596309), and ACC deletion funnels more Acetyl-CoA towards histone acetylation reactions and causes global hyperacetylation (Ref 17). However, whether those increased metabolite/co-factor can exert signal-specific effects remains elusive. For instance, although increased acetyl-CoA stimulates the SAGA complex enzymatic activity, it is not clear whether it also causes SAGA to be targeted to new sites without external cues to induce new transcription factor binding. Does increased acetyl-CoA cause broad hyperacetylation at all inducible genes which are the primary targets for those HAT complexes?

      In this manuscript, Princová et al. found that deletion of fatty acid synthesis transcriptional factors Cbf11 and Mga2 increases cell survival under H2O2 induced oxidative stress in S. pombe. They further showed that several stress-related genes increased upon Cbf11 deletion, and H3K9 acetylation at their promotor regions were elevated. They argued that FA-TF deletion may indirectly regulate stress-related genes potentially through influencing Acetyl-CoA level, although they failed to detect significant changes of global Acetyl-CoA levels. While it's interesting to see yet another example of metabolite-mediated gene expression regulation, the current manuscript only made incremental advance towards mechanistic principles of how these co-factors finetune specific gene expression program.

      Specific comments:

      1. This work showed convincingly that deletion of CBF11 or MGA2 leads to resistance to oxidative stress. However, it provides little mechanistic insight into how deletion of Cbf11 increased the expression of stress response genes and why some HATs are involved but others not (Figure EV5).

      We respectfully disagree with the notion that we only provide “little mechanistic insight” into the process whereby FA metabolism affects stress gene expression.

      • First, we show that not only deletion of cbf11, but also a very specific manipulation of the rate-limiting FA-producing enzyme (Cut6/ACC; Fig. 4D), or chemical inhibition of FA synthase by cerulenin (new Fig. 4F) all lead to increased stress gene expression. On the other hand, overproduction of Cut6/ACC results in decreased stress gene expression and lower resistance to ox. stress (new Fig. 5B-C). These findings clearly show the specific and tight mutual relationship between FA synthesis and expression of stress genes.

      • Second, we show that the DNA-binding activity of Cbf11 is critical for affecting stress gene expression levels, yet Cbf11 does not act as a stress gene repressor.

      • Third, we show that, compared to e.g. peroxide treatment, stress gene mRNA levels are only moderately increased upon downregulation of FA synthesis. So the situation can be called stress gene “derepression”. At the same time the major stress-response regulators (Sty1-Atf1, Fig. 2A-C; Pap1, new Fig. 2D-E) are required for the derepression, but, importantly, neither of them shows increased activation compared to unstressed WT cells (Fig. 3A-C). These data suggest a qualitative difference between the two phenomena (canonical stress response vs dysregulation of FA synthesis). Furthermore, they hint at an important role of the chromatin environment.

      • Fourth, we show that Gcn5/SAGA and Mst1, but not 4 other HATs, mediate the connection between FA metabolism and stress gene expression (Fig. 5D-E), and we show clear and specific H3K9 hyperacetylation of stress gene promoters in FA metabolism mutants (Fig. 5A), arguing that this is not a general acetylome issue.

      • Fifth, we show that the stress genes affected by changes in FA metabolism show unusually high nucleosome (H3) occupancy in their transcribed regions (even in unperturbed WT cells; Fig. 5A bottom panels), which could dictate the observed specificity in regulation.

      While we agree that our understanding is not yet complete, we have already described many mechanistic aspects of the link between FA metabolism and stress gene expression.

      1. Although in Cbf11 deletion cells, increased resistance to H2O2 is relied upon the Sty1/Atf1 pathway, the authors did not establish a link between lipid synthesis and Atf1 activity because Cbf11 deletion does not affect the phosphorylation of Atf1.

      Sty1 and/or Atf1 show non-zero activity even in normal, healthy, unstressed cells. Importantly, Atf1 is bound to many target promoters even in the absence of stress (Fig. 3B; PMID: 20661279, PMID: 28652406). Moreover, Sty1 is actually needed for orderly cell cycle progression (sty1KO cells are elongated, a result of postponed mitotic entry; e.g. PMID:7501024), which we now mention in the Introduction and Discussion. Our point is that Sty1-Atf1 are not hyperactivated under normal conditions - this only happens during major stress insults. Thus, in unstressed cbf11KO cells, stress gene promoters are hyperacetylated, which may facilitate their (Sty1-Atf1 and Pap1-dependent) transcription, without the need for hyperactivation of the stress response regulators. Such increased transcriptional competence of stress promoters is consistent with our findings that upon peroxide treatment stress gene mRNA levels in cbf11KO exceed those in WT (Fig. 1B). We have amended the corresponding section of the Discussion to more clearly explain our conclusions and hypotheses.

      1. Cbf11 deletion causes elevated H3K9 acetylation at the promotor regions of a number of stress respond genes, the author did not mention whether demonstrate how lipid synthesis defect causes the hyperacetylation at these promoters.

      As discussed in our manuscript, we suggest that following downregulation of FA synthesis, the surplus acetyl-CoA is used by Gcn5 and Mst1 HATs to hyperacetylate stress gene promoters.

      1. As all lipid-metabolism mutants show increased stress response, it would helpful to examine whether H2O2 induction of WT cells influence lipid synthesis, thus establish physiological links between FA synthesis and stress response.

      We now mention in the Discussion section that, curiously, cut6/ACC mRNA levels are downregulated upon peroxide treatment. However, the significance of this finding is unclear as FA metabolism is strongly regulated at the post-translational level (PMID: 12529438). Unfortunately, we are not in a position to measure changes in metabolic fluxes upon stress. In any case, we believe that such experiments would be outside the scope of the current study.

      Beside, fatty acid may be beneficial to fight oxidative stress because they maintain the integrity of cell membrane. What is the potential effect of CBF11 deletion in this aspect? The author may want to discuss it.

      The reviewer suggests that higher production of FA would result in higher resistance to oxidative stress. However, our data do not indicate this - we show that under low FA synthesis the stress resistance is actually higher. Nevertheless, we acknowledge in the Discussion that the scenario suggested by the reviewer can occur, for example, in cancer cells which become more resistant to oxidative stress following increased lipid biosynthesis/storage.

      1. Since H2O2 treatment also causes change in glucose metabolism including upregulation of glucose transporter Ght5 (PMID: 30782292), it would be enlightening to see if there is a crosstalk between the lipid and glucose metabolisms. Does Ght5 expression increase upon H2O2 treatment in CBF11 deletion strain?

      While the topic is interesting, we strongly believe that the relationship between glucose metabolism and stress gene expression is outside the scope of this study.

      According to our data used in Fig. 4A, ght5 expression in cbf11KO at 60 min after 0.74 mM H2O2 treatment is downregulated 3-fold.

      5 Different H2O2 concentration causes different stress response in pombe: Pap1 and Sty1 mediate responses for low and high H2O2, respectively. For fully activated Sty1 response, the concentration of H2O2, needs to reach 1mM (PMID: 17043891). In this study, the H2O2 concentration ranges from 0.5-1.5mM and Pap1 regulated Ctt1 does show increase upon H2O2 treatment. To test if suppressed lipid synthesis facilitates Sty1 dependent activation, it would be helpful to examine the activation of Pap1 (its nuclear translocation) to eliminate other influences.

      We agree with the reviewer. We have now included data on the role of Pap1 in the crosstalk between lipid metabolism and stress gene expression. We show that Pap1 is required for increased expression of gst2 and ctt1 in untreated cbf11KO cells (Fig. 2D). We note that ctt1 is coregulated by both Pap1 and Atf1 (Fig. 2B, D). Also, Pap1 is partially required for H2O2 resistance of cbf11KO cells (Fig. 2E). Importantly, similar to Sty1-Atf, Pap1 is not hyperactivated (no nuclear accumulation) by 10 or 60 min of cerulenin treatment (Fig. 3C), while stress gene expression is upregulated at 60 min in cerulenin (Fig. 4F) and keeps increasing after 120 min (data not shown). These data collectively support our hypothesis that upon decreased FA synthesis, stress gene promoters become more transcription-competent without the requirement for hyperactivation of the corresponding stress gene regulators.

      Reviewer #2 (Significance (Required)):

      see above

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This study examines the intriguing phenomenon that perturbation of fatty acid biosynthesis induces expression of stress-response genes by increased intracellular levels of acetyl-CoA and hyperacetylation of histones at the promoters of these genes. Loss of the CSL transcription factor Cbf11 results in induced expression of a subset of stress-response genes in unperturbed conditions and resistance to H2O2. These stress-response genes are not direct targets of Cbf11, but their upregulation is dependent on the Sty1-Atf1 pathway. Similar effects in upregulation of stress-response genes were observed in the cut6 hypomorph and mga2 deletion strain, however no change in global levels of acetyl-Co-A in the former as well as in the cbf11 deletion was detected. The upregulated stress-response genes appear to be linked to increased H3K9 acetylation in their promoters and dependent on the Gcn5 and Mst1 HATs.

      The authors present good supportive evidence linking fatty acid biosynthesis to epigenetic regulation of stress response genes potentially mediated by intracellular levels of acetyl-CoA. This is an exciting area and the fission yeast model system is ideal to elucidate the molecular mechanisms behind this process. This is a substantial body of work with state-of-the art functional genomics approaches and LC-MS analysis. The data is of high quality and the manuscript is well written and relatively easy to read. Below are my comments for the manuscript.

      It was determined that increased expression of stress-response genes in the cbf11 deletion is dependent on the presence of Sty1, and partially dependent on Atf1. How about Pap1 (or Prr1) - would this transcription factor that is also regulated by Sty1 be involved in the upregulation of the stress-response genes in the cbf11 deletion? Activation of Sty1 and Atf1 by phosphorylation was not observed in unperturbed cbf11 deletion cells which would be expected in the proposed model. This discrepancy was not well explained. Could activation of Sty1/Atf1/Pap1 in unperturbed cbf11 cells be assayed in a different way such as nuclear localization?

      As these concerns were also raised by Reviewer 2, to avoid duplication, we kindly ask you to read our detailed responses above. Briefly, we have now included new data clarifying the role of Pap1 in the increased expression of selected stress genes in cbf11KO cells (or when FA synthesis is chemically inhibited) - comment #5 of Reviewer 2 above. Also, we explain why Sty1-Atf1 and/or Pap1 hyperactivation (i.e. above their activity level in untreated WT) is actually not needed in order for decreased FA synthesis to trigger a mild/moderate increase in stress gene expression - comment #2 of Reviewer 2 above. We have now also clarified this issue in the Discussion section.

      As for the use of alternative methods for measuring the activation status of Sty1-Atf, we have already provided data from multiple independent and very sensitive methods (western blot, ChIP-qPCR; Fig. 3A-B). Also, it is questionable whether microscopy would be more sensitive than our current methods. Moreover, our H2O2-sensitive reporter does not indicate an increasingly oxidative environment inside cbf11KO cells, quite on the contrary (Fig. 1D).

      It would strengthen the model that perturbation of fatty biosynthesis induces expression of stress-response genes and H2O2 resistance if more mutant strains other than cut6 and two of its known regulators were tested. Does the proposed model apply to any deficiency in fatty acid synthesis in general or only those that result in increased levels of acetyl-CoA? For example, would deletion strains of fas1, fas2, lsd90, lcf1, lcf2 or the4 show the same stress response as cut6, mga2, and cbf11 mutants?

      The roles of lsd90, lcf1, lcf2 and the4 have been only poorly characterized so far, making it potentially difficult to interpret any stress-related phenotypes of these mutants. However, the role of the fatty acid synthase Fas1/Fas2 complex in FA production is well established. We have therefore inhibited FAS using cerulenin and found that this treatment also leads to increased stress gene expression (Fig. 5F), without causing Pap1 hyperactivation (Fig. 3C). Importantly, fas1/fas2 are not Cbf11 target genes, and FAS inhibition by cerulenin represents an acute intervention, very different from the long-term effects in cbf11/mga2/cut6 mutants.

      Also, does overexpression of cut6+ confer sensitivity to H2O2?

      Yes, our new data show that ~2-fold overexpression of cut6 both partially abolished the derepression of stress genes in cbf11KO cells (Fig. 5B), and increased sensitivity to H2O2 of WT cells (new Fig. 5C).

      The authors hypothesize that induced expression of stress-response genes in the cbf11 deletion and cut6 hypomorph is due to H3K9 hyperacetylation because of increased acetyl-CoA abundance in the cell. However, LC-MS analysis showed no change in global abundance of acetyl-CoA in the cbf11 deletion and cut6 hypomorph although differential levels of acetyl-CoA in the nucleus relative to the rest of the cell cannot be ruled out. The authors mentioned that ppc1-537 and ssp2 null are known to have lower abundance of acetyl-CoA and the latter could suppress the cbf11 deletion-induced gene expression for two of three genes tested by qPCR. Can ppc1-537 also suppress the cbf11 deletion-induced gene expression? Are ppc1-537 and the ssp2 null sensitive to H2O2?

      The ppc1-537 mutant is sick and has a growth defect, making it difficult to interpret any findings regarding its survival/resistance phenotype (see a similar issue with the cut6-621 mutant in Fig. 4E). Ssp2/AMPK has a pleiotropic role in the cell and its activity is actually controlled by Sty1-Atf1 under some stress conditions (PMID: 28515144) and the ssp2KO is resistant to osmotic stress (PMID: 28600551). All this makes it potentially difficult to derive reliable conclusions about ppc1 and ssp2. However, our current data on cut6 (ts hypomorph, Pcut6MUT, overexpression) and FAS/cerulenin are derived from precisely targeted and specific interventions, and support the proposed connection between FA synthesis and stress gene expression, and are consistent with the suggested role of acetyl-CoA (and its microgradients) in mediating the connection.

      I think Rtt109 is H3K56 specific.

      Indeed, H3K56 is the characterized specificity of Rtt109, and we indicate this explicitly in the manuscript. We wanted to make our HAT screen comprehensive since we could not presume which histone or even non-histone acetylation target(s) is involved in lipid metabolism-mediated stress gene expression. Even though we have observed increased H3K9ac (Gcn5/SAGA target), other modifications are likely involved since Mst1 affects stress gene expression in lipid mutants, but Mst1 is not known to target H3K9.

      Reviewer #3 (Significance (Required)):

      The authors present good supportive evidence linking fatty acid biosynthesis to epigenetic regulation of stress response genes potentially mediated by intracellular levels of acetyl-CoA. This is an exciting area and not all the molecular details have been elucidated in this process. S. pombe is ideal to study this fundamental process and discoveries would be applicable to other eukaryotic study organisms.

      My expertise is in eukaryotic gene regulation, molecular genetics and functional genomics, so I am quite qualified to critically review this paper.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study examines the intriguing phenomenon that perturbation of fatty acid biosynthesis induces expression of stress-response genes by increased intracellular levels of acetyl-CoA and hyperacetylation of histones at the promoters of these genes. Loss of the CSL transcription factor Cbf11 results in induced expression of a subset of stress-response genes in unperturbed conditions and resistance to H2O2. These stress-response genes are not direct targets of Cbf11, but their upregulation is dependent on the Sty1-Atf1 pathway. Similar effects in upregulation of stress-response genes were observed in the cut6 hypomorph and mga2 deletion strain, however no change in global levels of acetyl-Co-A in the former as well as in the cbf11 deletion was detected. The upregulated stress-response genes appear to be linked to increased H3K9 acetylation in their promoters and dependent on the Gcn5 and Mst1 HATs.

      The authors present good supportive evidence linking fatty acid biosynthesis to epigenetic regulation of stress response genes potentially mediated by intracellular levels of acetyl-CoA. This is an exciting area and the fission yeast model system is ideal to elucidate the molecular mechanisms behind this process. This is a substantial body of work with state-of-the art functional genomics approaches and LC-MS analysis. The data is of high quality and the manuscript is well written and relatively easy to read. Below are my comments for the manuscript.

      It was determined that increased expression of stress-response genes in the cbf11 deletion is dependent on the presence of Sty1, and partially dependent on Atf1. How about Pap1 (or Prr1) - would this transcription factor that is also regulated by Sty1 be involved in the upregulation of the stress-response genes in the cbf11 deletion? Activation of Sty1 and Atf1 by phosphorylation was not observed in unperturbed cbf11 deletion cells which would be expected in the proposed model. This discrepancy was not well explained. Could activation of Sty1/Atf1/Pap1 in unperturbed cbf11 cells be assayed in a different way such as nuclear localization?

      It would strengthen the model that perturbation of fatty biosynthesis induces expression of stress-response genes and H2O2 resistance if more mutant strains other than cut6 and two of its known regulators were tested. Does the proposed model apply to any deficiency in fatty acid synthesis in general or only those that result in increased levels of acetyl-CoA? For example, would deletion strains of fas1, fas2, lsd90, lcf1, lcf2 or the4 show the same stress response as cut6, mga2, and cbf11 mutants? Also, does overexpression of cut6+ confer sensitivity to H2O2?

      The authors hypothesize that induced expression of stress-response genes in the cbf11 deletion and cut6 hypomorph is due to H3K9 hyperacetylation because of increased acetyl-CoA abundance in the cell. However, LC-MS analysis showed no change in global abundance of acetyl-CoA in the cbf11 deletion and cut6 hypomorph although differential levels of acetyl-CoA in the nucleus relative to the rest of the cell cannot be ruled out. The authors mentioned that ppc1-537 and ssp2 null are known to have lower abundance of acetyl-CoA and the latter could suppress the cbf11 deletion-induced gene expression for two of three genes tested by qPCR. Can ppc1-537 also suppress the cbf11 deletion-induced gene expression? Are ppc1-537 and the ssp2 null sensitive to H2O2?

      I think Rtt109 is H3K56 specific.

      Significance

      The authors present good supportive evidence linking fatty acid biosynthesis to epigenetic regulation of stress response genes potentially mediated by intracellular levels of acetyl-CoA. This is an exciting area and not all the molecular details have been elucidated in this process. S. pombe is ideal to study this fundamental process and discoveries would be applicable to other eukaryotic study organisms.

      My expertise is in eukaryotic gene regulation, molecular genetics and functional genomics, so I am quite qualified to critically review this paper.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      As more and more metabolic intermediates are found to also serve as co-factors for epigenetic modifications, it has been widely accepted that regulating the levels of these key metabolites can be an effective way to control nutrient related gene expression. Acetyl-CoA is one of those early examples. Increased acetyl-CoA was shown to promote local acetylation at growth genes (Mol Cell 2011 PMID: 21596309), and ACC deletion funnels more Acetyl-CoA towards histone acetylation reactions and causes global hyperacetylation (Ref 17). However, whether those increased metabolite/co-factor can exert signal-specific effects remains elusive. For instance, although increased acetyl-CoA stimulates the SAGA complex enzymatic activity, it is not clear whether it also causes SAGA to be targeted to new sites without external cues to induce new transcription factor binding. Does increased acetyl-CoA cause broad hyperacetylation at all inducible genes which are the primary targets for those HAT complexes?

      In this manuscript, Princová et al. found that deletion of fatty acid synthesis transcriptional factors Cbf11 and Mga2 increases cell survival under H2O2 induced oxidative stress in S. pombe. They further showed that several stress-related genes increased upon Cbf11 deletion, and H3K9 acetylation at their promotor regions were elevated. They argued that FA-TF deletion may indirectly regulate stress-related genes potentially through influencing Acetyl-CoA level, although they failed to detect significant changes of global Acetyl-CoA levels. While it's interesting to see yet another example of metabolite-mediated gene expression regulation, the current manuscript only made incremental advance towards mechanistic principles of how these co-factors finetune specific gene expression program.

      Specific comments:

      1. This work showed convincingly that deletion of CBF11 or MGA2 leads to resistance to oxidative stress. However, it provides little mechanistic insight into how deletion of Cbf11 increased the expression of stress response genes and why some HATs are involved but others not (Figure EV5).
      2. Although in Cbf11 deletion cells, increased resistance to H2O2 is relied upon the Sty1/Atf1 pathway, the authors did not establish a link between lipid synthesis and Atf1 activity because Cbf11 deletion does not affect the phosphorylation of Atf1.
      3. Cbf11 deletion causes elevated H3K9 acetylation at the promotor regions of a number of stress respond genes, the author did not mention whether demonstrate how lipid synthesis defect causes the hyperacetylation at these promoters.
      4. As all lipid-metabolism mutants show increased stress response, it would helpful to examine whether H2O2 induction of WT cells influence lipid synthesis, thus establish physiological links between FA synthesis and stress response. Beside, fatty acid may be beneficial to fight oxidative stress because they maintain the integrity of cell membrane. What is the potential effect of CBF11 deletion in this aspect? The author may want to discuss it.
      5. Since H2O2 treatment also causes change in glucose metabolism including upregulation of glucose transporter Ght5 (PMID: 30782292), it would be enlightening to see if there is a crosstalk between the lipid and glucose metabolisms. Does Ght5 expression increase upon H2O2 treatment in CBF11 deletion strain? 5 Different H2O2 concentration causes different stress response in pombe: Pap1 and Sty1 mediate responses for low and high H2O2, respectively. For fully activated Sty1 response, the concentration of H2O2, needs to reach 1mM (PMID: 17043891). In this study, the H2O2 concentration ranges from 0.5-1.5mM and Pap1 regulated Ctt1 does show increase upon H2O2 treatment. To test if suppressed lipid synthesis facilitates Sty1 dependent activation, it would be helpful to examine the activation of Pap1 (its nuclear translocation) to eliminate other influences.

      Significance

      see above

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This paper demonstrates a link between oxidative stress, lipid biosynthesis, and targeted histone acetylation in fission yeast. In mutant cells with defects in lipid synthesis (cbf11, mga2 lacking transcription factors, and cut6 lacking acetyl-CoA carboxylase), transcripts of a number of genes implicated in resistance to oxidative stress are increased. This is associated with higher levels of H3K9 acetylation and increased tolerance to oxidative stress. These effects are mediated through Sty1, a stress-activated MAP kinase and the transcription factor Atf1.

      It is also shown that H3K9 acetylation levels in the promoter region and just downstream of the transcriptional start site are increased in cbf11 mutants (Fig. 5A).

      By mutational analysis, the authors implicate the acetyl transferases Mst1 and Gcn5 in this transcriptional effect. Other related acetyl transferases, Hat1, Elp3, Mst2, Rtt109 have been ruled out as main contributors to the dysregulation in unstressed cbf11 mutants. That specific acetyl transferases have been shown to be required is a strength of the investigation.

      Major comments:

      The hypothesis is put forward in the manuscript that altered acetyl-CoA levels in cbf1 mutants would underlie the dysregulation of genes induced by oxidative stress. Histone acetyl transferases compete for acetyl-CoA with lipid biosynthesis, and so with increased demand for acetyl-CoA underacetylation in the concerned promoters would result - specifically at H3K9.

      These results do not directly support the hypothesis, on the other hand they are not sufficient to rule it out. Acetyl-CoA levels were measured only in undisturbed cells, and the possibility remains that under oxidative stress there would be changes in acetyl-CoA pools that could explain this apparent contradiction - why did not the authors examine that?

      The authors argue that although the global acetyl-CoA levels are not increased, local concentrations might be altered in a way to permit higher H3K9 acetylation levels at selected promoters. Although a formal possibility, this is rather far-fetched as a small and freely diffusible molecule like acetyl-CoA should quickly equilibrate within one cellular compartment. I think that although the overall relationships that the authors have established between oxidative stress, H3K9 acetylation levels with increased expression, and lipid biosynthesis, are compelling, the role of acetyl-CoA concentrations is not clear and should be de-emphasized.

      Minor comments:

      In many of the bar diagrams, only a borderline statistical significance is indicated (p ~ 0.05) despite seemingly large numerical differences between the means. In the legends it is stated that one-sided Mann-Whitney U tests were used. This is a non-parametric test with low power - would it not have been better to use a t test? What do the error bars in the diagram show, SEM? If a non-parametric test is used, a parametric measure of variability is irrelevant.

      It would be helpful to the reader to indicate directly in the diagram panels what is actually shown, not just "fold change vs ..." In Fig. 1, 2, 4 D and 5 we see mRNA levels, in Fig. 3 chromatin IP.

      Significance

      The paper represents conceptual advances for our understanding of how stress responses, metabolism and transcriptional regulation are linked, although one of the links (acetyl-CoA levels in this case) is tenuous.

      This manuscript belongs in a rich literature on stress responses on the gene expression level, mostly from studies in yeast. Potentially, it adds entirely new information on how cellular stress may be mechanistially linked to stress responses.

      These results are potentially general and of broad interest to the biological community.

      This reviewer is familiar with yeast genetics, stress responses, and quantification of gene expression.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We are grateful to the reviewers for their time and expertise, and we have addressed all points they raised as detailed in our point-to-point response and highlighted the changes in the main manuscript. We have addressed all points raised by the reviewers and elaborated how this was done in a point-by-point reply. There are two new tables and a new supplementary figure. The figures and the text have been reshaped, according to the suggestions.

        We are looking forward to your reply.
      
         Best regards, Yannick Schwab and Anna M Steyer
      

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      Serra Lleti et al. report a new software (CLEMSite) for fully automated FIB-SEM imaging based on locations identified beforehand in LM. The authors have implemented routines for automatically identifying common reference patterns and an automated FIB-SEM quality control. This allows autonomous data acquisition of multiple locations distributed over the entire sample dish. CLEMSite has been developed as a powerful tool for fast and highly efficient screening of morphological variations.

      **Major comments:**

      The performance of CLEMSite has been demonstrated by the authors with two typical biological example applications. The stated performance parameters such as correlation precision and reproducibility are highly convincing and supported by the presented data. The authors give detailed information on their workflow and how on to use CLEMSite, which should allow other researchers to implement this for their own applications. The only comment I have in this regard, and I might have overlooked it, but how will CLEMSite be made available to the scientific community?

      Reply 1.1

      We would like to warmly thank Reviewer #1 for their very supportive feedback. It is important to us to share our work with the community. Our prime intention is to offer CLEMSite as a proof of concept that has been demonstrated on a specific instrument, thus linked to a company (Zeiss). Because we are convinced this code can be adapted to other APIs provided by other vendors, we made it fully available via a Github repository (https://github.com/josemiserra/CLEMSite).

      To make this more visible in the manuscript, we have modified this sentence to the first paragraph of the Results section:

      “To control the FIB-SEM microscope, CLEMSite-EM interfaces commercial software (SmartSEM and ZEISS Atlas 5 from Carl Zeiss Microscopy GmbH) via a specific Application programming interface (API) provided by Zeiss. CLEMSite code is openly accessible and free to download from a Github repository (https://github.com/josemiserra/CLEMSite ).”

      **Minor comments:**

      The author mention that decreasing the z-resolution to 200 nm steps was critical to achieve high throughput. For applications that require higher resolution: is the only disadvantage a longer data acquisition time or are there also other limitations?

      Reply 1.2:

      Reviewer #1 is right, we have designed CLEMSite as a screening tool, where we emphasize the number of cells versus the resolution at which each cell is acquired. By acquiring images every 200 nm, we are gaining speed, but also stability. We have indeed noticed that below 50 nm, on occasions the beginning of the acquisition is not stable enough (the milling has to hit the front of the cross-section at view precisely), and it requires manual intervention to retract/advance the milling. In addition, to gain time in our current workflow, we have opted to not cover the region of interest with a platinum protective layer, which has no consequences when imaging at larger z steps because the overall time spent on one cell is very short. At higher z resolution regimes though, a non-protected block surface is inevitably damaged during the successive numerous mill & view cycles. We have added one sentence in the Methods section to make this point clearer.

      “Note that to gain time in the preparation process for a run, we have not covered the ROI with a platinum protective layer and alternatively we increased the thickness of the gold coating of the full sample. In such cases, only low z-resolution acquisition is possible, as acquiring at a higher resolution would require sputtering of the sample surface.”

      Finally, we may argue that if an experiment requires high-resolution acquisition, the time overhead spent to switch from one cell to the next (a few minutes) is not significant anymore relative to the time spent to acquire one cell (from several days to weeks). In such cases, automation for multi-site acquisitions would lose its relevance.

      I would assume that locating the finer structural details in a much larger data set might also introduce additional challenges in the data analysis pipeline.

      Reply 1.3:

      We fully agree with Reviewer #1. In this proof of concept study though, we are not addressing the image analysis part but assess ultrastructural phenotypes manually using established stereology protocols. At the image resolution that we are using, our analysis is restricted to features such as volumes, surfaces, number of rather large organelles. Finer details, such as microtubules or fine contact sites between organelles would require a higher resolution, and indeed very likely other means to extract the morphometric data. State-of-the-art image analysis of isotropic FIB-SEM datasets is based on computer vision/machine learning. With such tools, the analysis of fine details is indeed accessible with very high accuracy, but at the cost of the throughput, at least for now as already mentioned in the Discussion section of the paper.

      In Table 1 in the supplements, the units are missing for the targeting positions. On page 4, 4th line from the bottom, there is a typo in "reaaching a global targeting...".

      Reply 1.4

      We thank Reviewer#1 for their thorough inspection of the paper. We have corrected it accordingly.

      Reviewer #1 (Significance (Required)):

      With CLEMSite, the authors present a powerful new software tool for the FIB-SEM imaging community. The high level of automation allows high throughput data acquisition with minimal user interaction. To my knowledge, this is the first software that fully automatically recognises reference features and is able to run fully autonomously after points of interest have been selected in FM. This high throughput screening tool for FIB-SEM imaging would make a substantial technical contribution to the field of cellular imaging. My own expertise lies in the field of technical developments for CLEM and super-resolution FM. I am not able to judge the biological content of the manuscript.

      We would like to thank Reviewer #1 for their constructive and encouraging feedback.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Review on "CLEMsite, a software for automated phenotypic screens using light microscopy and FIB-SEM" by Serra Lleti et al. The manuscript describes a toolset to correlate LM data with automated FIB-SEM of selected regions of interest. This allows 3D correlative microscopy of multiple adherent cells from a single resin block. This allows much needed high throughput in CLEM analysis to become quantitative. Two applications on Golgi apparatus morphology are shown.

      **Major questions:**

      -The software has been developed in collaboration between Zeiss/ Fibics in collaboration with academic groups and will only function on Zeiss SEMs that have the proper software. Thus, if I understand correct, it will not be of generic use and a more appropriate title would be 'CLEMsite, a software for automated phenotypic screens using light microscopy and Zeiss FIB-SEM"

      Reply 2.1.

      Reviewer #2 is right about the fact that our work was done on a Zeiss microscope and in CLEMSite’s current version, it would only work with that model, including firmware and software. As already phrased in the manuscript, we would like to stress our work is a proof-of-concept. For example, we wrote in the introduction that CLEMSite is a “software prototype”. We’ve also made clearer the links to Zeiss in the first paragraph of the Results (see also answer to reviewer 1.1)

      CLEMSite is by no means designed to become an integrated part of current or future Zeiss microscopes. On the contrary, we have designed the software as an independent unit. All the parts of the software that are sending commands to the Zeiss API are indeed customized to that brand, but other functions are stand-alone units. In particular, the correlation strategy is independent of the microscope type and can be used generically. Similarly, the principles that we developed for finding the FIB-SEM coincidence point, or for selecting features-rich regions to perform the AFAS function would be valid whichever microscope model would be used.

      For these reasons, we would prefer to avoid mentioning Zeiss already in the title of the manuscript.

      • How is the described approach using FIB-SEM advantageous compared to methods like Serial Block-face EM (SBEM) and array tomography using serial section where larger fields of multiple cells can be imaged? Especially because the axial resolution was set to 200 nm and discussed as essential for the throughput speed.

      Reply 2.2.

      This is a very important point that we tried to bring across in the introduction of the manuscript. Other volume EM methods, such as SBEM and AT, like conventional TEM, require an ultramicrotome to produce thin sections (AT and TEM) or to remove the top layers from the resin block (SBEM). This inevitably requires trimming large specimens in order to accommodate the dimensions of the diamond knife used in the associated microtomes. FIB-SEMs does not have such limitations and selected volumes can be imaged from samples of any size, providing they fit in the chamber of the microscope. In our case, we were screening cells growing on a 1 cm2 surface area, which is already beyond what standard diamond knives can process. We would even argue that larger surfaces are at CLEMSite reach, but we have not tested this.

      • Is the data FAIR available?

      Reply 2.3

      It is one of EMBL’s ambitions to make all data as FAIR as possible. For this study, we saved all the raw images and their corresponding embedded metadata as they came from the original software (ATLAS 5, Fibics for the SEM images and LAS X, Leica microsystems for the confocal images). The images published in this manuscript will be deposited on the EMPIAR data repository upon acceptance. The raw data and unpublished data, due to their size, will be fully available upon request to the authors. Additionally, their data is specifically generated for the correlation workflow, which is stored together with the image information as separated files. To store the information of logs we used text files, for intercommunication between processes, JSON, and XML to store coordinates in a readable format. As far as we know, there is no standard FAIR protocol yet that describes CLEM workflows in microscopy. We made our best possible efforts to archive our data in an understandable folder architecture, with detailed information on how to navigate through it, such that we are confident that our data could be mined by others in the future, thus reaching the goals of the FAIR charter.

      • How is CLEMsite available? Is the code public or for sale?

      Reply 2.4

      It is important to us that our proof-of-concept can be used or adapted by others in the future. For this reason, we are sharing the full code that was developed for CLEMSite - See Reply 1.1 for further details.

      **Other comments:**

      • Can you comment on the flexibility of this method? It is described as a flexible method, but only HeLa cells (quite flat cells) and Golgi apparatus targeting was used. What about different cell types and what about targets with a less obvious EM morphology?

      Reply 2.5

      It is correct that we have demonstrated our workflow only on Hela cells which present a more or less homogeneous topology. Yet our workflow is flexible when it comes to the dimensions of the region of interest and the acquisition field of view, and can accommodate a wide range of cell shapes, as long as they adhere to a culture substrate. Dimensions of ROI and FOV can be adapted in the CLEMSite interface as described in Supplementary Figure 4. Following reviewer 2 question, we realize that this feature may not appear clearly and we have modified the corresponding section of the Result:

      “The dimensions of the image stack, as well as the z resolution are set when initializing the run, via the CLEMSite interface (Supplementary Figure 4). Whilst every cell of one run can be acquired with the same recipe (as defined in ZEISS Atlas 5: sample preparation, total volume to be acquired, slice thickness and FIB currents applied at each step), CLEMSite-EM also offers individual definition of recipes, allowing a per cell adaptation of the shape or volume (Supplementary Fig. 4a).”

      Changing the ROI size would thus accommodate the surface occupancy of a cell (in the plane parallel to the culture substrate) and changing the FOV would accommodate the cell’s height.

      The morphology of the cell as it appears in the EM (SESI) does not alter the targeting strategy, since we are solely relying on the correlation, which means that the position of the target cell is extracted from the light microscopy images and the coordinate system provided by the gridded coverslip. Even if the cells were invisible at the surface of the resin block when inspected in the SEM, CLEMSite would still navigate to the proper region and create an image stack by FIB-SEM imaging.

      • For EM acquisition ZEISS smartSEM with ATLAS was used. LM was recorded with a microscope from a different vendor. Can the software be used regardless of microscope type?

      Reply 2.6

      Yes, the correlation is based on collecting the stage coordinates from the light microscope, and on analyzing the images from the various magnifications and channels. This information can be obtained by most microscope types, but it might involve minor adaptations regarding the specific brand of a microscope (e.g. changes in the coordinate system of the stage used or the naming of the channels).

      • Create less variation in the size of scale bars.

      Reply 2.7

      We have modified all figures to take this comment into account and thank Reviewer #2 for a good suggestion.

      • M&M: High-resolution light microscopy: Why call this 'high resolution'?

      Reply 2.8

      We used this term to differentiate, in the feedback microscopy setup, the first stage where images are acquired at low magnification from the images acquired at high magnification. We agree that the term is misleading, so we decided to update the manuscript and change the term high resolution by higher magnification (the second stage in feedback microscopy).

      Specs given seem randomly chosen: For example objective magnification yes, NA not; excitation wavelength yes, emission not.

      Reply 2.9

      We thank Reviewer #2 for spotting these missing details. We have edited the method section to add the NA and the emission wavelengths.

      Reviewer #2 (Significance (Required)): See above: This depends on the availability of code, as well as the usability in FIB-SEM that is not based on Zeiss.

      Reply 2.10

      We hope our answers have addressed these concerns. When the code is indeed fully available, we can not at this stage presume of the transferability of CLEMSite to microscope from other manufacturers. Yet we would like to stress once more that our main aim is to demonstrate a proof of concept.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      Schwab and coworkers present an automation software for correlative light and electron microscopy (CLEM) to acquire high-resolution SEM volume datasets at room temperature. This automation enables large-scale data collection and morphometric analysis of cell phenotypes. The paper is overall well written, but often assumes a lot of prior knowledge of the workflow, which might not be present in a general audience or for newcomers to the technique. This is also seen in the insufficient labeling and explanation of the figures. They seem a bit like presentation slides, which could be well understood with the help of the presenter/narrator, but alone lack a lot of information (see more specific comments below).

      **Major Comments (in no particular order):**

      • Final accuracy of ~ 5 µm ... is this really sufficient? Given that the size of many mammalian cells is ~10-15 µm, this is still a HUGE error. Of course, there is a tradeoff between throughput and accuracy, the area covered and speed. Nonetheless, this means a serious limitation in terms of the kind of targets / biological questions that can be addressed with this technique! (Especially in the context of "rare events") This should be discussed in more detail. Reply 3.1

      We thank reviewer #3 for their constructive criticism of the work. Indeed, our final accuracy is 5 µm at best, which may at first glance appear as a disappointing value. This accuracy is the consequence of a couple of strategic decisions that we have made in designing the workflow, which will be further explained below. We have chosen to constantly opt for a large field of view that would be larger than the average cell size, thus mitigating the potential 5 µm offset in targeting. In our hands, this yielded satisfying results, yet we agree that a higher targeting precision would allow narrower fields of view and potentially an even increased throughput.

      Our correlation strategy fully relies on the coordinate system built from the gridded pattern embossed at the surface of the culture dishes. The precision of CLEMSite automated targeting thus relies on i) its ability to properly detect the grid edges, both at the LM and at the ME, and ii) on the mesh size of the grid. To ensure a wide range of applications, we decided to design CLEMSite on commercial culture dishes, of which the MatTek gridded culture dishes appeared the most convenient, for each grid square presented a unique alphanumeric ID together with a relatively large and flat surface area to accommodate a large number of cells away from the grid pattern. Whilst such dishes showed a topology that satisfied our first criteria, the grid spacing was 600 µm. A smaller mesh size would have undoubtedly resulted in higher precision in the targeting but at the expense of losing free areas. Other commercial dishes with denser meshes unfortunately would not have ID engraved directly inside every square or we experienced difficulties in reproducibility during the sample preparation process to detach the glass from the resin block.

      We also have excluded the option to design our own grids, which would have created another dependency for potential users from other laboratories.

      Another possibility for targeting would be to register the fluorescence maps to the shapes of the cell as visible in the resin block. Adherent cells can be detected in the SEM if high energies are used to scan the surface of the blocks, and also if the block is not coated with a too thick layer of gold. In our experience, switching between voltages for acquiring such overviews and low voltages for acquiring FIB-SEM stacks is another source of imprecision and doesn’t improve the targeting in very confluent areas. Another interesting idea, as shown in Hoffman et al 2020, would be to scan the embedded samples by X-ray prior the FIB-SEM targeting, but not only this would imply that high-end X-ray machines would be available for such tasks, but would still require landmarks to register the X-ray maps to the SEM overviews. This would potentially yield a higher accuracy, but we have opted for the gridded substrates, judged more accessible to a large number of laboratories.

      We tried to explain such a choice in the discussion, by adding this sentence in the description :

      “Detection of local landmarks imprinted in the culture substrate enables automated correlation and targeting with a 5 µm accuracy. We estimate that this number could still be improved by customizing a gridded substrate with a smaller mesh size, as landmarks would be much closer to the targets. The detection algorithm we developed could be extrapolated to other customized dishes or commercial substrates for cell culture in SEM samples41. An advantage of using local landmarks for the correlation is that they mitigate the impact of sample surface defects or optical aberration across long distances. Alternatively, targeting individual cells with a FIB-SEM has been achieved by mapping the resin embedded cells with microscopic X-ray computed tomography44. We speculate that such tools could be an alternative to a gridded substrate, yet cannot predict its adaptability to large resin blocks such as the ones we used in this study. “

      • Given that the whole point of the paper is "large scale automation", I would have preferred a few more examples/higher n-count. A comment on which type of targets the authors envision/have validated would be nice (also in the context of the limitation in accuracy). Reply 3.2

      To our best knowledge, no one has ever imaged multiple cells automatically. So even 5 in a row is a high number.

      In addition to this, we added an extra paragraph in the discussion.

      “We believe that other research questions could benefit from this type of screening. As an example, the 2021 Human Protein Atlas Image Classification competition61 managed to classify multiple organelles of individual cells in fluorescence microscopy. Such machine learning models could be used to find rare events or particularly interesting phenotypes. In another example, in host-pathogen interactions, early infected cells might start to display a recognizable phenotype in a small subpopulation of cells62. In both cases, those marked cells could be used to establish a FIB-SEM screening to discover new morphological differences at the micrometer level.

      To expand the applicability of these screenings beyond the proof-of-concept here presented, we propose two directions of improvement. First, by acquiring smaller enclosed volumes with isotropic resolution, we could target area-delimited organelles, like centrioles21. In this case, the full cell volume is neglected in favor of a small portion of it, but with higher z resolution. At the software level, that would require improving targeting accuracy by using smaller grids and extending the maps to 3D coordinates. 3D registration against a light microscopy Z-stack would considerably help to constrain the field of view during acquisition, thus reducing the imaging time and keeping the field of view position during tracking. At the instrument level, this would require, first, stabilizing the ion beam before the critical region is acquired, to compensate for the change between high currents for milling and fine currents for sectioning. Second, to make sure that the fine current beam hits exactly the front face of the milled cross-section, and then prevent milling artifacts. Finally, the second direction is to increase the number of samples acquired per session. That would imply ion beams that automatically reheat the Gallium source when it is exhausted (like proposed in Xu et al. 60), with faster algorithms for autofocus and autostigmatism in SEM.”

      • It should be mentioned somewhere that "commercial dishes or coverslips" contain an imprinted grid pattern with numbers and letters to locate specific squares. [Again: probably clear to "aficionados" of the technique but totally unclear to newcomers/outsiders] Reply 3.3

      We have added an explanation of the layout of the coordinate system in the part of the correlation strategy and the methods section “gridded dish with numbers and letters' to explain the correlation and targeting strategy better.

      • "It is important to keep the initial number high in order to compensate for the loss of targets" - what % of targets is lost exactly in the final step (FIB-SEM imaging)? The 10 cells out of 35 (29%!) that were not "of sufficient quality for further downstream analysis", were they lost/discarded because of problems in the automation (e.g. autofocus/tracking failure) or for other reasons (e.g. preservation of the cells during fixation/embedding)? Reply 3.4 In the main text we decide to explain the process of filtering better. We have added a supplementary figure showing different causes for problems of coordinate system detection due to scratches, cracks, or dirt. None of the cells in the study were discarded due to bad preservation, but the system being a proof of concept, we dealt with multiple difficulties that forced us to filter the acquired stacks for getting the ones showing the best quality. We also added supplementary material (Sup. Tables 3 and 4 with explanation) about the possible causes of cell losses during different experiments.

      • "One essential paradigm shift for increasing the acquisition throughput is the decision to decrease the resolution in the z-dimension, thus prioritizing the speed of acquisition and ultimately the total number of cells acquired in one run.". Surely, reducing z-resolution is an obvious way to speed up acquisition times. But this is not tied to the use of this software and obviously comes at a price ... this has been discussed before and is nothing novel. Hence "paradigm shift" might be a bit too strong. I however fully agree with CLEMSite's potential as a screening tool. Could a "high resolution" (isotropic) mode not be implemented, too? [then it would be up to the user to decide what to prioritize - throughput or resolution] Reply 3.5

      We have replaced the word “paradigm shift” with “original strategy”. It is indeed up to the user to decide if higher z resolution or higher speed should be achieved by setting up different recipes.

      Additionally, we direct the reviewer to read Reply 1.2.

      • There is no mentioning of why this specific hardware was used. Are there any limitations that currently restrict the approach to Zeiss machines? Any plans supporting other vendors? Of course, there are always certain benefits with certain instruments. Or just simply no others were available... A comment on which part was performed by/at Zeiss and which in the labs would be useful to understand specific contributions. (Since a conflict of interest statement seems missing). Reply 3.6

      The original plan was to set up a proof-of-principle study developing a program that is fully open source. We created an interface, which could plug any control via proprietary API, by simply adapting commands from the API to our interface. The idea is very similar to what is done in light microscopy open-source controllers, like the Micropilot software (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3086017/).

      That interface would be the place where to modify the software and add the external API and requires only that such API can be used with a .NET framework in C#. The programmer would have to modify only the following file:

      https://github.com/josemiserra/CLEMSite/blob/master/CLEMSiteServer/TestApp/AtlasCom.cs.

      We expect that FIB-SEMs are very similar across companies, at least in basic functionality (get images, get positions, mill execution for trench digging with a recipe file, ...). We thus believe our software could be adapted to other vendors. As an example, we used the Fibics API, but we could also program the same functionality with the Zeiss API to achieve the same goal.

      Zeiss’s contribution to the project was i) providing a system during the initial phase of the project, and allocating time with programmers from FIBICS to help to provide the control API to be used by CLEMSite.

      All experiments were performed at the European Molecular Biology Laboratory or Max-Planck Institute of Experimental Medicine.

      **Figures:**

      Figures should be improved. They often contain too little information to understand the concepts/results discussed and there's lots of white space. The legends should be improved accordingly. In general, a more concise and structured figure design could go a long way of improving the quality of the manuscript. Please find a few suggestions (for the main figures) below (but the same should be applied to the supporting figures):

      Reply 3.7

      We thank the reviewer for the suggestions on the figures in general. We have revisited all the figures and made corresponding changes as highlighted below.

      Fig. 1: While I believe it is clear to me what each scheme is supposed to represent, someone less immersed in this topic (or just entering the field) may have problems navigating the figure. For example: what are all the different letters and numbers? What's the blue box with the trapezoid ("EM targets" - it may become clear later, but here it is not), what are the blue and the red arrowhead, respectively (I suppose EM and focused ion beam?). This should be improved and labeled accordingly.

      We have addressed the queries by explaining the figure more explicitly in the legend (e.g. blue box, blue and red arrowhead). We have added i, ii, iii to separate b into subsections and adjusted the text accordingly.

      Fig. 2: Again, a lot of annotation is missing. E.g. what is the 3rd insert in b exactly (edge-detection? After CNN identification?)? For most of the figure, yellow squares are used to indicate the zoom-in region, why not for b) 1st row? With the "zoo" of scale bars, wouldn't it make sense to either always show the same bar (e.g. 200 µm), or scale the images so things become more comparable? In this regard: a) 2nd column and e) 1st column represent the same FOV. Why are they shown with different magnification/cropping?

      Reply 3.8

      The scale bars have been homogenized when possible, in the case of b the image was zoomed to match a (first image in both cases). A yellow box was added for the zoom in b as suggested. We added descriptions in the figure legend.

      Fig. 3: a) The procedure is well described in the legend, but no motivation is given in the text, why this is necessary. c) There's some floating density in the white space. Is this due to thresholding?

      Already explained in figure legend.

      Reply 3.9

      We have adapted the text in the main manuscript to explain better that the coincidence point is normally found manually and for a routine with as little as possible intervention by an operator this had to be automated. We have also explained figure 3c more in the figure legend.

      “The following steps, usually performed by a trained human operator, are triggered autonomously: localization of the coincidence point, needed to bring the FIB and SEM beams to point at the same position (Fig. 3a); milling of the trench to expose the imaging surface, detection of the trench to ensure a well-positioned imaging field of view (FOV) (Fig. 3b); automated detection of image features in the imaged surface needed to find an optimal location for the initial autofocus and autostigmation (AFAS) (Fig. 3c); and finally the stack acquisition (Fig. 3d).”

      Fig. 4) Again, description/labeling of the figure is poor. E.g. what are the red outlines present in c) row 1-3 (but missing in row 4; why?)? [presumably these are the siRNA spots?] Is there any reason this figure could not be further subdivided into d), e), f) etc? As it stands, a lot of additional descriptors ("second from the left", "two images on the right") are necessary while a simple call to a), b), c) would be much easier...

      Reply 3.10

      We have more precisely described the siRNA spots in the legend more explicitly and have added headings to divide part c into a grid rather than adding letters/numbers to subdivide to make the figure more clear.

      Fig. 5) Additional labeling (a,b,c...) could be helpful here, too. While intuitively I would assume that blue = DAPI and green = GFP, these things should be labeled or described in the legend. Especially in the 3D rendering it is quite unclear what is being portrayed. Is this an overlay of a FIB/SEM segmentation with the confocal 3D-data?

      Reply 3.11

      We have added headings to subdivide the images in b and explanations in the legends to explain the color-dye relation (blue is DAPI).

      **Minor:**

      The first element in the filtered list can thus be stored for the subsequent application of autofocus and autostigmation procedures (AFAS) (Supplementary Fig. 3c). [technically this has been defined before]

      Reply 3.12

      All typos and grammar-related issues have been addressed in the following ways:

      A transformation is computed to register together the LM list and EM landmarks list, ...

      “A transformation is computed to register the positions from the LM and the EM landmarks list, ... “

      “The FOV is changed from a 305 μm by 305 μm, used for the detection of the trench, to a 36.4 μm by 36.4 μm in the exposed cross-section and an image of that cross-section is taken for analysis (Fig. 3c).”

      The sample is positioned at the target coordinates of the first cell, and the Multisite module performs the coincidence point alignment of both the electron and ion beams (Fig. 3a and Supplementary Fig. 2a).

      Reviewer #3 (Significance (Required)):

      **Significance:**

      It is clear that the kind of automation outlined here is necessary to elevate correlative SEM volume imaging to a "high-throughput" technique, which could become valuable for many biological questions. CLEMSite offers a valid technical solution and appears to be a solid implementation of the traditional/manual workflow. However, its presentation needs to be improved before we can support publication.

      Reply 3.13

      We have worked on different aspects of the presentation, rearranged the figures, and extended figure legends and hope this meets the reviewer’s expectations.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Schwab and coworkers present an automation software for correlative light and electron microscopy (CLEM) to acquire high-resolution SEM volume datasets at room temperature. This automation enables large-scale data collection and morphometric analysis of cell phenotypes.

      The paper is overall well written, but often assumes a lot of prior knowledge of the workflow, which might not be present in a general audience or for newcomers to the technique. This is also seen in the insufficient labeling and explanation of the figures. They seem a bit like presentation slides, which could be well understood with the help of the presenter/narrator, but alone lack a lot of information (see more specific comments below).

      Major Comments (in no particular order):

      • Final accuracy of ~ 5 µm ... is this really sufficient? Given that the size of many mammalian cells is ~10-15 µm, this is still a HUGE error. Of course, there is a tradeoff between throughput and accuracy, the area covered and speed. Nonetheless, this means a serious limitation in terms of the kind of targets / biological questions that can be addressed with this technique! (Especially in the context of "rare events") This should be discussed in more detail.

      • Given that the whole point of the paper is "large scale automation", I would have preferred a few more examples/higher n-count. A comment on which type of targets the authors envision/have validated would be nice (also in the context of the limitation in accuracy).

      • It should be mentioned somewhere that "commercial dishes or coverslips" contain an imprinted grid pattern with numbers and letters to locate specific squares. [Again: probably clear to "aficionados" of the technique but totally unclear to newcomers/outsiders]

      • "It is important to keep the initial number high in order to compensate for the loss of targets" - what % of targets is lost exactly in the final step (FIB-SEM imaging)? The 10 cells out of 35 (29%!) that were not "of sufficient quality for further downstream analysis", were they lost/discarded because of problems in the automation (e.g. autofocus/tracking failure) or for other reasons (e.g. preservation of the cells during fixation/embedding)?

      • "One essential paradigm shift for increasing the acquisition throughput is the decision to decrease the resolution in the z-dimension, thus prioritizing the speed of acquisition and ultimately the total number of cells acquired in one run.". Surely, reducing z-resolution is an obvious way to speed up acquisition times. But this is not tied to the use of this software and obviously comes at a price ... this has been discussed before and is nothing novel. Hence "paradigm shift" might be a bit too strong. I however fully agree with CLEMSite's potential as a screening tool. Could a "high resolution" (isotropic) mode not be implemented, too? [then it would be up to the user to decide what to prioritize - throughput or resolution]

      • There is no mentioning of why this specific hardware was used. Are there any limitations that currently restrict the approach to Zeiss machines? Any plans supporting other vendors? Of course, there are always certain benefits with certain instruments. Or just simply no others were available... A comment on which part was performed by/at Zeiss and which in the labs would be useful to understand specific contributions. (Since a conflict of interest statement seems missing).

      Figures:

      Figures should be improved. They often contain too little information to understand the concepts/results discussed and there's lots of white space. The legends should be improved accordingly. In general, a more concise and structured figure design could go a long way of improving the quality of the manuscript. Please find a few suggestions (for the main figures) below (but the same should be applied to the supporting figures):

      Fig. 1: While I believe it is clear to me what each scheme is supposed to represent, someone less immersed in this topic (or just entering the field) may have problems navigating the figure. For example: what are all the different letters and numbers? What's the blue box with the trapezoid ("EM targets" - it may become clear later, but here it is not), what are the blue and the red arrowhead, respectively (I suppose EM and focused ion beam?). This should be improved and labeled accordingly.

      Fig. 2: Again, a lot of annotation is missing. E.g. what is the 3rd insert in b exactly (edge-detection? After CNN identification?)? For most of the figure, yellow squares are used to indicate the zoom-in region, why not for b) 1st row? With the "zoo" of scale bars, wouldn't it make sense to either always show the same bar (e.g. 200 µm), or scale the images so things become more comparable? In this regard: a) 2nd column and e) 1st column represent the same FOV. Why are they shown with different magnification/cropping?

      Fig. 3: a) The procedure is well described in the legend, but no motivation is given in the text, why this is necessary. c) There's some floating density in the white space. Is this due to thresholding?

      Fig. 4) Again, description/labeling of the figure is poor. E.g. what are the red outlines present in c) row 1-3 (but missing in row 4; why?)? [presumably these are the siRNA spots?] Is there any reason this figure could not be further sub-divided into d), e), f) etc? As it stands, a lot of additional descriptors ("second from the left", "two images on the right") are necessary while a simple call to a), b), c) would be much easier...

      Fig. 5) Additional labeling (a,b,c...) could be helpful here, too. While intuitively I would assume that blue = DAPI and green = GFP, these things should be labeled or described in the legend. Especially in the 3D rendering it is quite unclear what is being portrayed. Is this an overlay of a FIB/SEM segmentation with the confocal 3D-data?

      Minor:

      The first element in the filtered list can thus be stored for the subsequent application of autofocus and autostigmation procedures (AFAS) (Supplementary Fig. 3c). [technically this has been defined before]

      Typos/grammar:

      In our case, we had two of such experiments, reaaching a global targeting accuracy (RMSE) of 8 {plus minus} 5 μm. A transformation is computed to register together the LM list and EM landmarks list, ...

      The FOV is magnified from a 305 μm by 305 μm to a 36.4 μm by 36.4 μm surface area and an image of the cross-section is taken (Fig. 3c).

      The sample is positioned at the target coordinates of the first cell, and the Multisite module performs the coincidence point alignment of both the electron and ion beams (Fig. 3a and Supplementary Fig. 3a).

      To preserve the target, the sample is drifted 50 μm in x. [shifted?]

      Significance

      Significance:

      It is clear, that the kind of automation outlined here is necessary to elevate correlative SEM volume imaging to a "high-throughput" technique, which could become valuable for many biological questions. CLEMSite offers a valid technical solution and appears to be a solid implementation of the traditional/manual workflow. However, its presentation needs to be improved before we can support publication.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review on "CLEMsite, a software for automated phenotypic screens using light microscopy and FIB-SEM" by Serra Lleti et al.

      The manuscript describes a toolset to correlate LM data with automated FIB-SEM of selected regions of interest. This allows 3D correlative microscopy of multiple adherent cells ¬from a single resin block. This allows much needed high throughput in CLEM analysis to become quantitative. Two applications on Golgi apparatus morphology are shown.

      Major questions:

      • The software has been developed in collaboration between Zeiss/ Fibics in collaboration with academic groups and will only function on Zeiss SEMs that have the proper software. Thus, if I understand correct, it will not be of generic use and a more appropriate title would be 'CLEMsite, a software for automated phenotypic screens using light microscopy and Zeiss FIB-SEM"
      • How is the described approach using FIB-SEM advantageous compared to methods like Serial Block-face EM (SBEM) and array tomography using serial section where larger fields of multiple cells can be imaged? Especially because the axial resolution was set to 200 nm and discussed as essential for the throughput speed.
      • Is the data FAIR available?
      • How is CLEMsite available? Is the code public or for sale?

      Other comments:

      • Can you comment on the flexibility of this method? It is described as a flexible method, but only HeLa cells (quite flat cells) and Golgi apparatus targeting was used. What about different cell types and what about targets with a less obvious EM morphology?
      • For EM acquisition ZEISS smartSEM with ATLAS was used. LM was recorded with a microscope from a different vendor. Can the software be used regardless of microscope type?
      • Create less variation in the size of scale bars.
      • M&M: High resolution light microscopy: Why call this 'high resolution'? Specs given seem randomly chosen: For example objective magnification yes, NA not; excitation wavelength yes, emission not.

      Significance

      See above: This depends on the availability of code, as well as the usability in FIB-SEM that is not based on Zeiss.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Serra Lleti et al. report a new software (CLEMSite) for fully automated FIB-SEM imaging based on locations identified beforehand in LM. The authors have implemented routines for automatically identifying common reference patterns and an automated FIB-SEM quality control. This allows autonomous data acquisition of multiple locations distributed over the entire sample dish. CLEMSite has been developed as a powerful tool for fast and highly efficient screening of morphological variations.

      Major comments:

      The performance of CLEMSite has been demonstrated by the authors with two typical biological example applications. The stated performance parameters such as correlation precision and reproducibility are highly convincing and supported by the presented data. The authors give detailed information on their workflow and how on to use CLEMSite, which should allow other researchers to implement this for their own applications. The only comment I have in this regard, and I might have overlooked it, but how will CLEMSite be made available to the scientific community?

      Minor comments:

      The author mention that decreasing the z-resolution to 200 nm steps was critical to achieve high throughput. For applications that require higher resolution: is the only disadvantage a longer data acquisition time or are there also other limitations? I would assume that locating the finer structural details in a much larger data set might also introduce additional challenges in the data analysis pipeline.

      In Table 1 in the supplements, the units are missing for the targeting positions.

      On page 4, 4th line from the bottom, there is a typo in "reaaching a global targeting...".

      Significance

      With CLEMSite, the authors present a powerful new software tool for the FIB-SEM imaging community. The high level of automation allows high throughput data acquisition with minimal user interaction. To my knowledge, this is the first software that fully automatically recognises reference features and is able to run fully autonomously after points of interest have been selected in FM. This high throughput screening tool for FIB-SEM imaging would make a substantial technical contribution to the field of cellular imaging.

      My own expertise lies in the field of technical developments for CLEM and super-resolution FM. I am not able to judge the biological content of the manuscript.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This study presents a first structural insight on formin mDia bound to actin filaments in physiological conditions. Based mainly negative stain EM, the authors use 2D and 3D class averaging to describe two main configuration of the formin at the filament barbed end. The two configurations support the previously proposed stair-stepping model, which was based on crystal structures, with an open state where the formin binds two actin monomers and a closed state where three monomers are bound. Because the majority of the structures fall in the first, open state, this supports the existence of this intermediate. The authors also show that the orientation of the free FH2 in this open state is somewhat flexible, as several sub-classes with different angles can be distinguished. Finally, they identify, for the first time, formin densities bound along the length of the filament.

      The data is well presented and I don't have any major issue. The only point is that the information that all the initial structural data comes from negative stain EM comes should be put upfront. One gets the feeling that cryoEM is used throughout until one reads the section on cryoEM. Given that the methodology is now also established for cryoEM, it is regrettable that data was not collected with a 300kV microscope, which may have revealed more details of the conformations, but I understand microscope time is hard to come by, and the authors did a remarkable job from negative-stain EM.

      The finding of formin densities binding along the length of the actin filament is very interesting. Besides the previous cited finding, it also reminds of the observations made in yeast where Bni1 (in S. cerevisiae; PMID 17344480) and For3 (in S. pombe; PMID 16782006) where shown to exhibit retrograde movement with polymerizing actin cables in vivo. This would be interesting to consider in the discussion.

      Reviewer #1 (Significance (Required)):

      This study extends our understanding of the mechanism of formin-mediated actin assembly, by providing a first structural observation in physiological conditions. While confirmatory of previously proposed model, but also excludes an alternative model, and offers novel observations of flexibility and binding along the actin filament length. It will be of great interest to researchers on the actin cytoskeleton.

      My expertise is in the actin cytoskeleton and formins, but I am no expert in EM structural analysis.

      We thank reviewer 1 for the very positive comments and for pointing out the relevance of our study for the actin cytoskeleton field. As advised, we now specify upfront in the abstract and in the introduction that most of the presented results were obtained from negative stain electron microscopy. Following the reviewer’s advice, we have enriched the discussion to highlight the retrograde movements of formins in actin cables observed in vivo.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Maufront et al. have used EM to study the conformation of mDia1 at the barbed end and the core of actin filaments to explain the molecular mechanism of the FH2 dimer processivity at these sites. Based on modelled structural data they tried to describe how the conformational changes in FH2 dimer lead to its partial dissociation, and then association with filaments during the process of translocation coupled to subunit addition at actin filaments barbed ends. This supports a previous study (Otomo et al. 2005, Nature), in which using X-ray crystallography structural data were used to propose a stair-stepping model for Bni1p translocation at the barbed ends during actin polymerization. The model for mDia1 binding to core filaments is also given. Moreover, using EM structure and the previously reported structures of actin (PDB: 5OOE), and actin with formin FH2 dimer (PDB: 1Y64), authors explained the dynamic nature of FH2 dimer at barbed ends of the filaments using the flapping model. But due to the low resolution of their structures ~ 26-29A0, the finer details of actin and the FH2 dimer structure at barbed ends could not be resolved, leaving open questions about the orientation of actin helical twist at this end during elongation. The authors tried several conditions to get high density barbed-end filaments, but that did not collect adequate number of particles, resulting in low number of particles selected for structure modelling purposes. However, to attain more physiologically relevant structure they used cryo-EM, but were successful in capturing only the open conformation structure of FH2 dimer (at low resolution). Thus, due to low resolution of structures the key findings have not added much to what we already know about the mechanism of FH2 dimer translocation during actin polymerization, except that their studies support the stair-stepping model (Otomo et al. 2005, Nature) and not that of "stepping second" model ( Paul and Pollard. 2008, Curr. Bio.). Thus, this manuscript does not merit publication in this journal.

      We thank reviewer 2 for taking the time to read and review our study. However, we respectfully disagree with the statement that our findings “have not added much to what we already know about the mechanism of FH2 dimer translocation during actin polymerization”. As mentioned in our report, collecting EM data for formins in physiological conditions (at the barbed ends of growing filaments), as we do here for the first time, entails limitations on the number of particles one can observe and on the resulting resolution. Despite this rather low resolution, our data allow us to discriminate between two proposed models accounting for the processivity of formin FH2 domains at filament barbed ends. Being able to determine which of two competing models is valid (as the reviewer says we do) does add a lot to what we already know.

      Major comments:

      1. Present study does not provide any new insight about the conformation of the actin dimer at the barbed ends of actin filaments when FH2 domains of formin are bound. This study appears to be more like an extension of previous research (Otomo et al. 2005, Nature), in which the authors used X-ray crystallography data to propose a model for actin filaments elongation by formin bound at the barbed ends.

      As mentioned above, we respectfully disagree with this remark. First, in Otomo et al. 2005, formins are arranged in a crystal into a non-physiological “daisy chain” arrangement around a non-canonical tetramethyl rhodamine-actin filament. Our observations were made in physiological conditions displaying a single formin dimer at the barbed end of a polymerizing filament. Second, the stair stepping model originating from Otomo et al. was only inferred and extrapolated from the crystal structure and not directly observed. Both the open and the closed conformations were speculations, that had never been observed up to now. In our current report we directly visualize these two conformations. Third, the observations of Otomo et al. were obtained using formin Bni1p from yeast, not the mammalian formin mDia1, for which there is little (PDB 1V9B) structural data available describing the structure of a truncated mDia1 in the absence of actin. Finally, in addition to validating the stair-stepping model experimentally, we make unexpected observations that are totally absent from the model derived from Otomo et al. and subsequent studies.

      The low resolution of structures is a major concern.

      As mentioned above, the limited resolution is the price we had to pay for being in physiological conditions, with formins interacting with the barbed ends of growing actin filaments. Nonetheless, this resolution is sufficient to discriminate between the two previously existing models, and to make new observations, beyond these models.

      Given the low resolution of data, how can the authors decide on the number (4) of classes of FH2 domain (in open state) and present them as "continuum of conformations". They stated "details featured in class 4 do not appear as sharp as in class 2". What was the basis of deciding on the sharpness level?

      We agree that this point was unclear, and we thank the reviewer for pointing it out. The choice of the number of sub-classes for the open state is a trade-off between the sharpness (ie signal-to-noise ratio) of the resulting image, which is a direct consequence of the number of particles within each sub-class, and the internal variability within each sub-class. Class 4 might appear more “blurry” because it gathers particles displaying a range of angles. When increasing the number of generated classes in the 2D processing, we observe angular variations of the FH2 domains intermediate to the ones displayed in Figure 3. However, because increasing the number of classes results in averaging less particles per class, the generated classes appeared more noisy or “blurry” and not as “sharp”, as mentioned in the manuscript. Hence, we chose the number of displayed classes so that the signal-to-noise would remain satisfactory and sufficient to be able to determine the relative angle between the two FH2 domains. To make things clearer, “do not appear as sharp” was replaced by “displayed a lower signal-to-noise ratio and thus looked noisier”. The expression “sharp” was replaced by “enough contrast”.

      The authors showed 30Å structure of FH2 domain encircling actin filaments towards their pointed ends, but said nothing about the kind of decoration it could be, a "daisy-chain" or "concentric circle"? Also, they did not mention anything about the orientation of actin helical twist and specific sites of binding. These information would provide new in-depth understanding of how formins binds while diffusing along the filaments.

      The quality is sufficient to distinguish isolated FH2 dimers along the core of actin.

      Accordingly, the FH2 dimers we observed along the core of our actin filaments adopt a conformation similar to that observed at the barbed end, as mentioned in the text (‘concentric circle’). This observation differs from the reported for INF2 which accumulated along filaments and may interact in a ‘daisy-chain‘ manner (Gurel et al, 2014 ; Sharma et al, 2014). From our data, we can thus assume that formins interact with F-actin along the core of filaments similarly to the way they do at the barbed ends, and might translocate in a two-step manner alongside the actin filament. As stated in the manuscript, the actin helical twist could not be deciphered. For docking the crystal structures within our EM envelope, we used the formin-actin contacts described previously in Otomo et al.

      The author stated - "The leading FH2 domain likely provides a first docking intermediate for actin monomers that would help their orientation relative to the barbed end, resulting in a higher actin monomer on-rate". This statement was made on the basis of observing 79% times FH2 in the open state in their data set. This seems like an overstatement because they don't have any direct structural data to support such claim.

      We agree with the reviewer that our statement, taken from the discussion section, is speculative, and we apologize if this was unclear. Our purpose was to propose a plausible mechanism, based on our structural data, since the FH2 domain stands in front of the barbed end in the “open conformation” and since it likely interacts with actin monomers. We have now rephrased our sentence to state more clearly that is a hypothetical mechanism : “We propose that… could provide…”.

      In the Discussion they mentioned "the FH2 dimer would then be "lagging" behind the elongating barbed end if actin twisting back to 180{degree sign} occurs before the addition of actin monomer and this explains the diffusing along the actin filaments". Did authors encounter filaments with two formins bounds to them in their negative stain images? What is their view on this? In current data, they showed structure in which only one FH2 dimer is bound to the pointed ends of actin filaments. Have they tried increasing the concentration of formins to obtain structures with more than one formin is bound towards the pointed ends of actin filaments?

      Following the recommendations from reviewer 2, we have performed an additional analysis and we now show typical examples of filaments observed with a formin along their core, including cases where two formins are observed on the same filament (Supplementary Figure 12). As we now explain in the discussion section, five different mechanisms (including lagging) can be invoked to explain how a formin can be located along the core of the filament. These five mechanisms can all account for the possibility to have more than one formin on the same filament.

      The lagging mechanism, however, is the only one where we would expect that the filaments with a formin along their core are less likely to also have a formin at their barbed end (because the formin at the core spontaneously departed the bare barbed, that was left bare and with a shorter time to load another formin before fixation of the sample). A simple statistical analysis of our data leads to the estimation that 48 ± 7% (n=50) of actin filaments with a formin within their core also display a formin at their barbed ends. This is significantly less than for the global filament population, where 77 ±0.4% (n=10,461) of barbed ends are decorated with formins. This supports the lagging scenario as a likely mechanism putting formins along the core of the filament.

      Regarding the specific suggestion to increase the formin concentration: We did screen different formin concentrations, but with higher concentrations the level of noise due to unbound formins was significantly increased in the image background and impeded a proper analysis. This is why we consistently used 100 nM formins.

      To increase the density of short filaments for sample preparation, the authors used additional actin binding proteins "shown in supplementary Figure 2.C". There is no supplementary Figure 2.C. Moreover, it would be nice if the concentrations of these proteins are mentioned in the text.

      We apologize for this mistake. Supplementary Figure 2.C has now been added and the protein concentrations have been added in the main text.

      Minor comments:

      1. Figure 1 legend needs editing. E is missing in the legend.

      Thanks for noticing this. We have added the missing legend for 1.E. 2. There is no supplementary Figure 2.C.

      We apologize for this mistake. We have now added supplementary Figure 2.C.

      It is recommended that the authors report the number of particle used during 2D and RELION 3D classifications in the figures. This would help in better understanding of the probability of the conformations mentioned in the text.

      It was mentioned in the text. We have now made this information clearer to the reader.

      Reviewer #2 (Significance (Required)):

      This is the first direct study showing the two (open and closed) conformations of mDia1 FH2 domain at the barbed ends of actin filaments using EM and cryoEM. The study supports the proposed molecular mechanism of FH2 processivity at the barbed ends during filaments elongation using stair-stepping model reported earlier (Otomo et al. 2005, Nature). For the first time, FH2 has been shown to fluctuate between various angles with respect to static actin filaments, and on this basis they propose a flapping model (Fig 5). They explained the whole mechanism using structural proof, but the low resolution of data raises a question about their quality sufficiency to propose this mechanism. The overall novelty of this manuscripts is insufficient for the publication in this journal. Audience having understanding of the actin and actin binding proteins will be interested in this study. Additionally, researcher from the field of structural biology (EM and CryoEM) will be interested. I have been working in the field of actin and actin binding proteins for past 4 years. Over 10 years' experience in protein biochemistry, structural biology and molecular biology.

      We do not fully understand why, on one hand, reviewer 2 indicates that “for the first time, FH2 has been shown to fluctuate between various angles…” and that “Audience having understanding of the actin and actin binding proteins will be interested in this study. Additionally, researcher from the field of structural biology (EM and CryoEM) will be interested.”. On another hand, reviewer 2 states that “The overall novelty of this manuscripts is insufficient for the publication in this journal.”, which seems contradictory with the above statements and comments.

      Regarding novelty, we insist on the fact that we have achieved for the first time the direct observation of FH2 formin domains at a resolution sufficient to discriminate between two distinct models at the barbed ends, as well as to observe the presence of formin mDia1 along the core of actin filaments in conditions where nobody has proposed that this could happen.

      In addition, we have not specified any specific journal within the possible ones from “review commons”, up to now.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, Julien et al. use negative stain electron microscopy and cryo-EM to show two conformations of the FH2 domain for the formin mDia1 bound to the barbed end of an actin filament. These conformations support the "stair-stepping" model of FH2 domain movement with an elongating actin filament, as previously postulated by Otomo et al. (reference 1). The two states observe correspond to the "open" (~79%) and closed (~21%). The authors also show the conformational variability of the open state suggesting flexibility in this state. Finally, the authors observe FH2 domains encircling the actin filament at a distance from the barbed end, and suggest that the FH2 can diffuse from the barbed end down the filament.

      Major comments:

      1) Novel insights into formin function derived from this structure would raise impact. Issues that could be addressed include the following. Simply adding some lines to the discussion would not really add impact, but additional experimental/modeling work would.

      We agree that comparing the binding mode of different formins on actin filaments, testing the impact of profilin, and assaying FH2 domains in the absence of FH1, as proposed below, would provide a broad set of interesting additional data. However, without claiming that our results can be generalized to all formins in all conditions, we believe that our findings are novel and should be of interest to a large community. The proposed additional experiments/modeling represent an impressive amount of work, and will be carried out in future investigations. We answer these comments in more details below.

      1. Whether this model really holds true for all FH2 domains. Formin FH2 dimerization and processive filament barbed end elongation are widespread features of formins, which have been evidenced for many organisms from metazoan to plants. Since we could dock the FH2 from yeast formin Bni1p to account for mammalian mDia1, we think the FH2 domain conformations may be conserved enough among species to display similar translocation mechanisms at the barbed ends of actin filaments, using a two-state mechanism. We chose to use the crystal structure from Bni1 formin (PDB 1Y64) because this structure was obtained in the presence of an actin filaments and brings some insights about the formin-actin contacts.

      In order to convince reviewer 3, we superimposed the existing crystal structure of the FH2 mDia1 domain (PDB: 1V9D) with our model and reconstruction and show (Supplementary Figure 12) that the differences are minor. The mDia1 FH2 domains (atomic structures in red, PDB : 1V9D) are aligned with Bni1p FH2 domains (atomic structures in green and blue, PDB : 1Y64) previously fitted into the electron microscopy envelope of a barbed end capped by a formin in the « open state ». The FH2 domains are well aligned with a slight discrepancy in the knob/actin contact regions (blue arrows). This discrepancy most likely results from the absence of actin partners in the crystals obtained with mDia1 FH2 domains. The Bni1p structure thereby most accurately represents the knob/actin contact region. In addition, the folding of the lasso domain around the post domain is resolved in the Bni1p structure. Note here that the Bni1p lasso domains wrap equally well around the Bni1p post domain and the mDia1 post domain (green arrows).

      1. Whether the % time spent in the open and closed states might dictate the vastly different elongation rates mediated by different formins. For example, mDia1 is considered one of the 'faster' elongators (equivalent to actin alone in the absence of profilin), while fission yeast Cdc12 essentially caps filaments in the absence of profilin. We have discussed this aspect thoroughly in the discussion section to conclude that:” Our direct assessment of the open state occupancy rate thus provides important information on the molecular nature of the formin-barbed end conformations which could not be directly inferred from kinetic measurements, with or without mechanical tension, so far. Considering a gating factor of 0.9 and considering that formin mDia1 spends 79% of the time in the open state, we can compute that the on-rate for monomers would be slightly higher (14% higher) for an mDia1-bearing barbed end in the open state, than for a bare barbed end.”

      We agree that repeating our set of EM experiments and analysis with other formins, like fission yeast Cdc12, would be interesting. However, this would take a long time, and falls out of the scope of our paper.

      1. Whether the % time spent in the open and closed states varies if filaments are actively elongating in the presence or absence of profilin. We have chosen not to include profilin in our experiments, and to limit the concentration of G-actin, in order to reduce the background in our EM micrographs. Also, a rapid filament elongation would increase the amount of F-actin per barbed end, while a dense population of short filaments is key to obtain accurate data (as we explain in the discussion, paragraph 1, p9).

      We speculate that, by providing a link between the FH1 domains and the filament barbed end, profilin might very well alter the percentage of time spent in the open state, and mitigate lagging as mentioned in the discussion section. Properly addressing the impact of profilin with our EM experiments is very challenging, for the reasons we have explained. It would require further investigations, beyond the scope of this study.

      1. How this model impacts the interactions of formins with other proteins at the barbed end. For example, capping proteins. We did not include capping proteins (or other additional proteins) because we wanted to avoid increasing the number of particles from diverse nature per field of view, as they constitute a background that is detrimental for the analysis of EM micrographs. We would have add to sort out additional populations in the course of image analysis. We thus only mixed actin and formin in our assays.

      2. Do these results relate to formin function in disease? Because formins regulate actin polymerization, their malfunction is linked to a variety of diseases. We therefore expect our findings to be useful to researchers in the medical field. However, our study remains in the scope of basic research and primarily aims at understanding the mechanisms of formin-assisted actin polymerization.

      2) The observation that formin FH2 domains can bind filament sides has been made several times. In particular, a structural model of the FH2 domain of the INF2 formin along the side of an actin filament (Gurel et al 2014, PMID 24915113). This publication also references other papers showing other formins binding to filament sides. There are two points to this comment:

      1. The model in Gurel et al is that the FH2 domain does not slide down the filament from the barbed end. Rather, the FH2 dimer has an appreciable dissociation rate, enabling it to encircle the filament without having to slide. This FH2 dissociation has been observed for another formin that has been shown to bind filament sides, FMNL1 (called FRL1 in the listed publication), in Harris et al 2006 (PMID 16556604). The authors must explain their reasoning for thinking that mDia1's FH2 can slide down the filament from the barbed end. One possibility is to make observations of this FH2 population in filaments that were not sonicated. What is the average distance of FH2s from the barbed end? We thank the reviewer for pointing our attention to this report from Gurel et al. which we now cite. Following this comment, as well as point 6 of reviewer 2, we now discuss the different mechanisms that could lead to our observation of mDia1 along the core of the filament. We provide a new analysis of our data (discussion section), arguing in favor of the lagging mechanism (i.e. ‘sliding down’ from the barbed end), without excluding the competing scenarios. Briefly, we compute that 48 ± 7% (n=50) of actin filaments with a formin within their core also display a formin at their barbed ends. This is significantly less than for the global filament population, where 77 ±0.4% (n=10,461) of barbed ends are decorated with formins. This supports the lagging scenario, which is the only one where a filament with a formin along its core should be less likely to also have a formin at its barbed end.

      The distance of FH2s from the barbed end would provide additional information. However, it is difficult to estimate, since we often to not see the entire filament, and since we do not know which end is the barbed end.

      1. Interestingly, in some of the works studying formin binding to filament sides, mDia1 was shown to be rather poor in this property. It would be useful to get an idea of what % of the observed FH2s are in the filament core, as opposed to at the barbed end. Along with the additional analysis mentioned in the previous point, we have now estimated that about 8% of actin filaments display a formin within their core. We have added this number in the manuscript (end of the Results section). As a comparison, in our assays, 77% of filament barbed ends bear a formin.

      2. The authors must reference the past works showing FH2 binding to filament sides, particularly the structural work. At present, no mention of prior work on FH2 side binding is mentioned. As advised, we have now added additional references and more particularly Gurel et al, 2014.

      3) My major technical concern in this manuscript is that the authors use the FH1-FH2-DAD domain of mDia1 for the imaging, but use FH2 structure of Bni1p for 3D characterization (Otomo et al.). Even though Bni1p has been used for functional and structural analysis, mDia1 and Bni1p FH2 domains share low sequence homology. In addition, mDia1 only partially complements loss of Bni1 function in vivo (Moseley et al., 2004 PMID 14657240). Can the authors use the partial structural information of the mDia1 FH2 from Shimada et al 2004 (PDB 1V9D, PMID 14992721)? Alternately, the authors could have used FH2 domain of Bni1p for imaging. At the very least, the authors should explain clearly why they used different proteins for imaging and modeling.

      As mentioned above (please see our response to point 1.a), we chose to use the crystal structure from formin Bni1 (PDB 1Y64) because this structure was obtained in the presence of an actin monomers, and it thus brings some insights about the formin-actin contacts. The existing structures obtained from formin mDia1 does not include actin (full length by EM: Maiti et al, 2012; crystal structure of subdomains (without FH1): Otomo et al., 2010 PLoS one). It thus seems relevant, in the context of our investigations, to use a structure where formin-actin contacts could be at least partially inferred.

      Further, we superimposed the existing crystal structure of the FH2 mDia1 domain (PDB: 1V9D) with our model and reconstruction and show that the differences are minor (please see the figure in our response to point 1.a, above).

      4) The open and closed states are observed from negative staining data. However, the authors can only find one of the states (open) by cryo-EM, which decreases the confidence level of the paper's conclusions. It would be useful for the authors do a little more to try to find the closed conformation by cryo-EM.

      Using Cryo-EM we can already recover the most abundant open conformation.

      Unfortunately, as pointed out here, the number of particles obtained was too low to enable high resolution and reveal the two observed conformations. Indeed, considering a density of ~ 5 barbed ends par micrograph, the collection of tens of thousands of images would have been necessary, which was not realistic regarding the access we have to latest generation microscopes.

      5) It is unclear whether there are additional effects of using FH1-FH2-DAD protein (not FH2 only) for the imaging, as it shows long protrusion at the tip of actin barbed end. To avoid those concerns the authors could use only FH2 domain of mDia1. Also the authors have to note that they used Bni1p structure because there are no published structures of mDia1 so far.

      We had indeed tried to use a construct deprived of the flexible FH1 domain but the lower purity of this construct and the presence of aggregates led to the collection of lower quality EM micrographs. As profilin was not included in our assay, FH1 domains were not involved in actin polymerization at the barbed end and thus remain very flexible and unstructured. Consistently, we did not detect any additional electronic density that could result from the FH1 domains.

      We indeed point out (p5) that “We used the crystal structure from yeast Bni1p FH2 domains in interactions with an actin filament, rather than the existing one from mammalian mDia1 formin FH2 dimer in isolation (PDB 1V9D), because actin-formin contacts are described in the Bni1p structure.” Minor comments:

      1) Figure 1: It would be interesting if imaging is provided for mDia1 bound to filaments which it has nucleated. Would it be possible that binding to pre-formed filaments is different to that for mDia1-nucleated filaments?

      This is a good suggestion for further investigations but it extends beyond the scope of this study: as we explain, our attempts to nucleate filaments from mDia1 lead to lower quality micrographs, and the sonication of preformed filaments was our best option. However, we do not expect the translocation mechanism of FH2 to differ, as a function of the nucleation history of the filament, since the formin interacts with a filament whose elongation it has assisted over several subunits.

      2) Supplementary figure 2: Numbers of things in the S2 is unclear and poorly described in both results and methods. In particular, figure S2A, the definitions of the black and gray lines (steady state actin) is not clear. Are they containing 5% pyrene actin? Is that actin in polymerization buffer or in monomer-actin buffer? Is that actin incubated with actin polymerization buffer for a certain time before measurement of fluorescent intensity? In figure S2B, how the authors calculate the monomer actin concentration? The authors should provide the information in either results or methods part.

      We apologize for the lack of information. Since this is a standard assay, we have now added more details in the Methods section (rather than in the Results section).

      All curves shown in figure S2 were obtained with 5% pyrene actin. The gray curve shows the pyrene fluorescence intensity baseline from 1 µM G-actin monomers, obtained in G-buffer. The black curve is the fluorescence intensity at steady-state of 1 µM actin in polymerizing conditions, (after 1 hour of incubation at room temperature, at 5 µM, the sample was diluted without sonication and left for another hour before measuring the fluorescence intensity).

      The monomeric actin concentrations shown in figure S2B are derived from the intensity level of pyrene at any time point during the experiment, using the simple equations we now present in the Methods section.

      3) Supplementary figure 2 C: The figure and legend are missing in the manuscript. Furthermore, the authors describe that they used Gc-globulin to sequester monomeric actin in solution. Is gc-globulin widely used for actin monomer sequestration?

      Thank you for noticing the missing panel which is now back in place. Indeed, Gc globulin is known to sequester G-actin (Van Baelen, H., R. Bouillon, and P. DeMoor. 1980. “Vitamin D binding protein (Gc-globulin) binds actin”. J. Biol. Chem. 255:2270-2272). This is why we have attempted to use it. We could see a slight effect but we did not want to increase the noise within our images with additional proteins that would have made the analysis more complicated.

      CROSS-CONSULTATION COMMENTS Reviewer #1 mentions that the authors identify formin densities bound along the actin filament for the first time. I agree that the imaging of the mDia1 along the actin filament using electron microscopy is novel, but the concept of formin binding has already been found and studied well with other formins (PMID 16556604, PMID 24915113) and even mDia1 has poor binding activity compared to other formins. It was really nice of the authors to show the mDia1 side filament binding, but I don't think it is a striking finding.

      I have no comment for Reviewer #2.

      Reviewer #3 (Significance (Required)):

      If the EM refinements and 3D rendering techniques are conducted rigorously (which this reviewer is unable to judge), the data support an existing theory of how FH2 domains interact with the actin barbed end. Overall, the data will be of interest in formin field. However, as written the paper confirms an existing model, and does not represent new insight. Impact would be raised by providing insights from these findings that impact formin function or disease.

      We have answered this concern above. The existing models were speculative and not based on direct observations. They relied on data obtained in non-physiological conditions.

      Here, we directly observe two distinct conformations in our structural data, and clearly validate one model over the other. This provides a major advancement in our understanding of formin interaction with actin filaments. In addition, we uncovered an unexpected behavior of formin mDia1, which can readily be found along the core of the filament without the aid of additional proteins, and we propose a mechanism based on our data to account for this observation.

      Another main point is that the observation of FH2 domains bound along an actin filament, while interesting, is not novel. Others have found this for other formins, but those papers are not referenced here.

      The direct binding of formins to the sides of actin filaments is thought to be specific to some particular formins (we now cite additional references in our manuscript, to discuss this point). Formin mDia1, which is a ubiquitous and widely studied mammalian formin (perhaps the most studied), has only been described to diffuse along actin filaments when a capping protein dislodges it from the barbed end (Bombardier et al. Nat Com 2015). Here, we show that formin mDia1 can be found encircling the core of actin filaments, in the absence of any capping protein. This behavior is novel and unexpected. It should open new avenues for research on formin mDia1, as well as on other formins.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, Julien et al. use negative stain electron microscopy and cryo-EM to show two conformations of the FH2 domain for the formin mDia1 bound to the barbed end of an actin filament. These conformations support the "stair-stepping" model of FH2 domain movement with an elongating actin filament, as previously postulated by Otomo et al. (reference 1). The two states observe correspond to the "open" (~79%) and closed (~21%). The authors also show the conformational variability of the open state suggesting flexibility in this state. Finally, the authors observe FH2 domains encircling the actin filament at a distance from the barbed end, and suggest that the FH2 can diffuse from the barbed end down the filament.

      Major comments:

      1. Novel insights into formin function derived from this structure would raise impact. Issues that could be addressed include the following. Simply adding some lines to the discussion would not really add impact, but additional experimental/modeling work would.
        • a. Whether this model really holds true for all FH2 domains.
        • b. Whether the % time spent in the open and closed states might dictate the vastly different elongation rates mediated by different formins. For example, mDia1 is considered one of the 'faster' elongators (equivalent to actin alone in the absence of profilin), while fission yeast Cdc12 essentially caps filaments in the absence of profilin.
        • c. Whether the % time spent in the open and closed states varies if filaments are actively elongating in the presence or absence of profilin.
        • d. How this model impacts the interactions of formins with other proteins at the barbed end. For example, capping proteins.
        • e. Do these results relate to formin function in disease?
      2. The observation that formin FH2 domains can bind filament sides has been made several times. In particular, a structural model of the FH2 domain of the INF2 formin along the side of an actin filament (Gurel et al 2014, PMID 24915113). This publication also references other papers showing other formins binding to filament sides. There are two points to this comment:
        • a. The model in Gurel et al is that the FH2 domain does not slide down the filament from the barbed end. Rather, the FH2 dimer has an appreciable dissociation rate, enabling it to encircle the filament without having to slide. This FH2 dissociation has been observed for another formin that has been shown to bind filament sides, FMNL1 (called FRL1 in the listed publication), in Harris et al 2006 (PMID 16556604). The authors must explain their reasoning for thinking that mDia1's FH2 can slide down the filament from the barbed end. One possibility is to make observations of this FH2 population in filaments that were not sonicated. What is the average distance of FH2s from the barbed end?
        • b. Interestingly, in some of the works studying formin binding to filament sides, mDia1 was shown to be rather poor in this property. It would be useful to get an idea of what % of the observed FH2s are in the filament core, as opposed to at the barbed end.
        • c. The authors must reference the past works showing FH2 binding to filament sides, particularly the structural work. At present, no mention of prior work on FH2 side binding is mentioned.
      3. My major technical concern in this manuscript is that the authors use the FH1-FH2-DAD domain of mDia1 for the imaging, but use FH2 structure of Bni1p for 3D characterization (Otomo et al.). Even though Bni1p has been used for functional and structural analysis, mDia1 and Bni1p FH2 domains share low sequence homology. In addition, mDia1 only partially complements loss of Bni1 function in vivo (Moseley et al., 2004 PMID 14657240). Can the authors use the partial structural information of the mDia1 FH2 from Shimada et al 2004 (PDB 1V9D, PMID 14992721)? Alternately, the authors could have used FH2 domain of Bni1p for imaging. At the very least, the authors should explain clearly why they used different proteins for imaging and modeling.
      4. The open and closed states are observed from negative staining data. However, the authors can only find one of the states (open) by cryo-EM, which decreases the confidence level of the paper's conclusions. It would be useful for the authors do a little more to try to find the closed conformation by cryo-EM.
      5. It is unclear whether there are additional effects of using FH1-FH2-DAD protein (not FH2 only) for the imaging, as it shows long protrusion at the tip of actin barbed end. To avoid those concerns the authors could use only FH2 domain of mDia1. Also the authors have to note that they used Bni1p structure because there are no published structures of mDia1 so far.

      Minor comments:

      1. Figure 1: It would be interesting if imaging is provided for mDia1 bound to filaments which it has nucleated. Would it be possible that binding to pre-formed filaments is different to that for mDia1-nucleated filaments?
      2. Supplementary figure 2: Numbers of things in the S2 is unclear and poorly described in both results and methods. In particular, figure S2A, the definitions of the black and gray lines (steady state actin) is not clear. Are they containing 5% pyrene actin? Is that actin in polymerization buffer or in monomer-actin buffer? Is that actin incubated with actin polymerization buffer for a certain time before measurement of fluorescent intensity? In figure S2B, how the authors calculate the monomer actin concentration? The authors should provide the information in either results or methods part.
      3. Supplementary figure 2 C: The figure and legend are missing in the manuscript. Furthermore, the authors describe that they used Gc-globulin to sequester monomeric actin in solution. Is gc-globulin widely used for actin monomer sequestration?

      Referees cross-commenting

      Reviewer #1 mentions that the authors identify formin densities bound along the actin filament for the first time. I agree that the imaging of the mDia1 along the actin filament using electron microscopy is novel, but the concept of formin binding has already been found and studied well with other formins (PMID 16556604, PMID 24915113) and even mDia1 has poor binding activity compared to other formins. It was really nice of the authors to show the mDia1 side filament binding, but I don't think it is a striking finding.

      I have no comment for Reviewer #2.

      Significance

      If the EM refinements and 3D rendering techniques are conducted rigorously (which this reviewer is unable to judge), the data support an existing theory of how FH2 domains interact with the actin barbed end. Overall, the data will be of interest in formin field. However, as written the paper confirms an existing model, and does not represent new insight. Impact would be raised by providing insights from these findings that impact formin function or disease.

      Another main point is that the observation of FH2 domains bound along an actin filament, while interesting, is not novel. Others have found this for other formins, but those papers are not referenced here.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Maufront et al. have used EM to study the conformation of mDia1 at the barbed end and the core of actin filaments to explain the molecular mechanism of the FH2 dimer processivity at these sites. Based on modelled structural data they tried to describe how the conformational changes in FH2 dimer lead to its partial dissociation, and then association with filaments during the process of translocation coupled to subunit addition at actin filaments barbed ends. This supports a previous study (Otomo et al. 2005, Nature), in which using X-ray crystallography structural data were used to propose a stair-stepping model for Bni1p translocation at the barbed ends during actin polymerization. The model for mDia1 binding to core filaments is also given. Moreover, using EM structure and the previously reported structures of actin (PDB: 5OOE), and actin with formin FH2 dimer (PDB: 1Y64), authors explained the dynamic nature of FH2 dimer at barbed ends of the filaments using the flapping model. But due to the low resolution of their structures ~ 26-29A0, the finer details of actin and the FH2 dimer structure at barbed ends could not be resolved, leaving open questions about the orientation of actin helical twist at this end during elongation.

      The authors tried several conditions to get high density barbed-end filaments, but that did not collect adequate number of particles, resulting in low number of particles selected for structure modelling purposes. However, to attain more physiologically relevant structure they used cryo-EM, but were successful in capturing only the open conformation structure of FH2 dimer (at low resolution). Thus, due to low resolution of structures the key findings have not added much to what we already know about the mechanism of FH2 dimer translocation during actin polymerization, except that their studies support the stair-stepping model (Otomo et al. 2005, Nature) and not that of "stepping second" model ( Paul and Pollard. 2008, Curr. Bio.). Thus, this manuscript does not merit publication in this journal.

      Major comments:

      1. Present study does not provide any new insight about the conformation of the actin dimer at the barbed ends of actin filaments when FH2 domains of formin are bound. This study appears to be more like an extension of previous research (Otomo et al. 2005, Nature), in which the authors used X-ray crystallography data to propose a model for actin filaments elongation by formin bound at the barbed ends.
      2. The low resolution of structures is a major concern.
      3. Given the low resolution of data, how can the authors decide on the number (4) of classes of FH2 domain (in open state) and present them as "continuum of conformations". They stated "details featured in class 4 do not appear as sharp as in class 2". What was the basis of deciding on the sharpness level?
      4. The authors showed 30Å structure of FH2 domain encircling actin filaments towards their pointed ends, but said nothing about the kind of decoration it could be, a "daisy-chain" or "concentric circle"? Also, they did not mention anything about the orientation of actin helical twist and specific sites of binding. These information would provide new in-depth understanding of how formins binds while diffusing along the filaments.
      5. The author stated - "The leading FH2 domain likely provides a first docking intermediate for actin monomers that would help their orientation relative to the barbed end, resulting in a higher actin monomer on-rate". This statement was made on the basis of observing 79% times FH2 in the open state in their data set. This seems like an overstatement because they don't have any direct structural data to support such claim.
      6. In the Discussion they mentioned "the FH2 dimer would then be "lagging" behind the elongating barbed end if actin twisting back to 180{degree sign} occurs before the addition of actin monomer and this explains the diffusing along the actin filaments". Did authors encounter filaments with two formins bounds to them in their negative stain images? What is their view on this? In current data, they showed structure in which only one FH2 dimer is bound to the pointed ends of actin filaments. Have they tried increasing the concentration of formins to obtain structures with more than one formin is bound towards the pointed ends of actin filaments?
      7. To increase the density of short filaments for sample preparation, the authors used additional actin binding proteins "shown in supplementary Figure 2.C". There is no supplementary Figure 2.C. Moreover, it would be nice if the concentrations of these proteins are mentioned in the text.

      Minor comments:

      1. Figure 1 legend needs editing. E is missing in the legend.
      2. There is no supplementary Figure 2.C.
      3. It is recommended that the authors report the number of particle used during 2D and RELION 3D classifications in the figures. This would help in better understanding of the probability of the conformations mentioned in the text.

      Significance

      This is the first direct study showing the two (open and closed) conformations of mDia1 FH2 domain at the barbed ends of actin filaments using EM and cryoEM. The study supports the proposed molecular mechanism of FH2 processivity at the barbed ends during filaments elongation using stair-stepping model reported earlier (Otomo et al. 2005, Nature). For the first time, FH2 has been shown to fluctuate between various angles with respect to static actin filaments, and on this basis they propose a flapping model (Fig 5). They explained the whole mechanism using structural proof, but the low resolution of data raises a question about their quality sufficiency to propose this mechanism. The overall novelty of this manuscripts is insufficient for the publication in this journal.

      Audience having understanding of the actin and actin binding proteins will be interested in this study. Additionally, researcher from the field of structural biology (EM and CryoEM) will be interested. I have been working in the field of actin and actin binding proteins for past 4 years. Over 10 years' experience in protein biochemistry, structural biology and molecular biology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study presents a first structural insight on formin mDia bound to actin filaments in physiological conditions. Based mainly negative stain EM, the authors use 2D and 3D class averaging to describe two main configuration of the formin at the filament barbed end. The two configurations support the previously proposed stair-stepping model, which was based on crystal structures, with an open state where the formin binds two actin monomers and a closed state where three monomers are bound. Because the majority of the structures fall in the first, open state, this supports the existence of this intermediate. The authors also show that the orientation of the free FH2 in this open state is somewhat flexible, as several sub-classes with different angles can be distinguished. Finally, they identify, for the first time, formin densities bound along the length of the filament.

      The data is well presented and I don't have any major issue. The only point is that the information that all the initial structural data comes from negative stain EM comes should be put upfront. One gets the feeling that cryoEM is used throughout until one reads the section on cryoEM. Given that the methodology is now also established for cryoEM, it is regrettable that data was not collected with a 300kV microscope, which may have revealed more details of the conformations, but I understand microscope time is hard to come by, and the authors did a remarkable job from negative-stain EM.

      The finding of formin densities binding along the length of the actin filament is very interesting. Besides the previous cited finding, it also reminds of the observations made in yeast where Bni1 (in S. cerevisiae; PMID 17344480) and For3 (in S. pombe; PMID 16782006) where shown to exhibit retrograde movement with polymerizing actin cables in vivo. This would be interesting to consider in the discussion.

      Significance

      This study extends our understanding of the mechanism of formin-mediated actin assembly, by providing a first structural observation in physiological conditions. While confirmatory of previously proposed model, but also excludes an alternative model, and offers novel observations of flexibility and binding along the actin filament length. It will be of great interest to researchers on the actin cytoskeleton.

      My expertise is in the actin cytoskeleton and formins, but I am no expert in EM structural analysis.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      In this section we list all the comments done by the three referees and our corresponding action.

      Regarding Reviewer #1:

      1. On "mechanical control": The authors show changes in circadian power fraction with changes in YAP and with cytoskeletal inhibitors, but there are no properly-controlled experiments that directly perturb mechanics. The authors show a correlation between YAP nuclear/cytoplasmic ratio and circadian power, but YAP N/C alone is not a readout of mechanotrasndcution, per se. The authors have shown two different experiments where cells are cultured on a stiff (30kPa) substrate and soft substrate (300Pa), but they do not shown a direct comparison of YAP nuclear localization and circadian power under these two conditions in the same experiment. Direct, controlled perturbation of mechanical cues is necessary to support the title's use of the phrase "mechanical control."

      We agree with the referee that further mechanical perturbations could strengthen our conclusions. In our original manuscript we directly controlled the mechanical environment by culturing cells on substrates of 300Pa and 30kPa in stiffness. These differences in stiffness were not sufficient to drive changes in circadian power fraction and YAP localisation, as depicted in Fig. 3C (we note that the direct comparison requested by the referee is shown in that figure). We hypothesise that this negative result is due to a very low “rigidity threshold” or to secretion of extracellular matrix that stiffens the initially soft substrate. In any case, we plan to strengthen the “mechanical control” message of our paper with one or more of the below experiments:

      A) We will measure circadian power fraction and YAP localisation in even extremer stiffness/adhesion conditions, using 300 Pa and 30 kPa polyacrylamide gels with a different fibronectin coating protocol, as described in Elósegui-Artola et al., 2017. This allows a much finer control of the concentration of fibronectin coated, so we can reach low enough levels to compromise the cell adhesion to the substrate and cross down the threshold that would lead to cytosolic localisation of YAP. We will perform this experiment in presence of the FUD peptide, which inhibits matrix deposition (Tomasini-Johansson et al., 2001; this peptide has already been tested in our lab).

      B) We will use the approach described in Fig. 2E to compare the circadian power fraction in cells spread in stadium-shaped islands of 2400 um2 and 1200 um2. Oakes et al., 2014 already showed that traction forces exerted by 3T3 fibroblasts depend on the size of the spread area of the cells, so we expect differences in mechanotransduction that should affect YAP localisation and, if our hypothesis is correct, the RevVNP circadian oscillations.

      C) We will abolish the physical connection between the actin cytoskeleton and the nucleus by disrupting the LINC complex via the overexpression of a dominant negative (DN) nesprin-1 KASH domain (Lombardi et al., 2011). The plasmid designed for the inducible overexpression of the DN KASH domain, originally tested in NIH3T3 cells (Mayer et al., 2019), is available in our lab and has been used to prove that uncoupling cytoskeleton and nucleus leads to nuclear YAP decrease in single cells (Kechagia at al., 2022). We will aim to increase the circadian power fraction in low density cells upon the overexpression of the DN KASH domain.

      Elosegui-Artola A, Andreu I, Beedle AEM, Lezamiz A, Uroz M, Kosmalska AJ, Oria R, Kechagia JZ, Rico-Lastres P, Le Roux AL, et al (2017) Force Triggers YAP Nuclear Entry by Regulating Transport across Nuclear Pores. Cell 171: 1397-1410.e14

      Kechagia Z, Sáez P, Gómez-González M, Zamarbide M, Andreu I, Koorman T, Beedle AEM, Derksen PWB, Trepat X, Arroyo M, et al (2022) The laminin-keratin link shields the nucleus from mechanical deformation and signalling Cell Biology

      Lombardi ML, Jaalouk DE, Shanahan CM, Burke B, Roux KJ & Lammerding J (2011) The Interaction between Nesprins and Sun Proteins at the Nuclear Envelope Is Critical for Force Transmission between the Nucleus and Cytoskeleton*. Journal of Biological Chemistry 286: 26743–26753

      Mayer CR, Arsenovic PT, Bathula K, Denis KB & Conway DE (2019) Characterization of 3D Printed Stretching Devices for Imaging Force Transmission in Live-Cells. Cel Mol Bioeng 12: 289–300

      Oakes PW, Banerjee S, Marchetti MC & Gardel ML (2014) Geometry regulates traction stresses in adherent cells. Biophysical Journal 107: 825–833

      Tomasini-Johansson BR, Kaufman NR, Ensenberger MG, Ozeri V, Hanski E & Mosher DF (2001) A 49-Residue Peptide from Adhesin F1 of Streptococcus pyogenes Inhibits Fibronectin Matrix Assembly*. Journal of Biological Chemistry 276: 23430–23439

      2. On "via YAP/TAZ": In addition to above, it is necessary to show that the changes in Circadian power fraction induced by mechanical cues in fact require YAP/TAZ signaling. Thus, an experiment comparing soft (300Pa) substrate with Stiff (30kPa) substrate in the presence or absence of YAP/TAZ is necessary to state that YAP and TAZ are the mechanistic mediators of mechanical cues on the clock.

      We are currently generating via CRISPR-KO and shRNA silencing a YAP1/TAZ double mutant. We plan to use this cell line in those conditions where YAP is prominently nuclear (low density in stiff substrates) with the purpose of rescuing the RevVNP circadian power fraction.

      1. While the TEAD-binding domain mutant experiment is elegant, to claim that TEAD is the transcriptional mediator, it must be demonstrated that this mutant indeed fails to induce TEAD-mediated transcription. This could be simply executed by demonstrating that the CCD mutant expresses reduced CTGF and Cyr61 (for example), compared to the 5SA, under these conditions. Further, endogenous YAP is still active and available to bind to TEAD in this system, which should be discussed.

      We plan to carry out quantitative real-time PCR of CTGF and Cyr61 in all the YAP mutants and the control. Regarding the presence of endogenous YAP, we will clarify in the text that a) the overexpression of the different YAP mutants was done in high-density conditions, where endogenous YAP is significantly less localised in the nucleus, and that b) the levels of the exogenous YAP are much higher (we already have western blots showing this).

      1. In Figure 3a: The cell perimeter needs to be shown either by actin staining or by brightfield images. The manually marking of cell boundaries is insufficient, specifically because the drugs used in this experiment affect the cytoskeleton. It would be very helpful to see this via actin staining or in the least with brightfield images.

      The cell perimeter was drawn based on the cytosolic YAP immunostaining, whose levels are high enough to infer the cell shape (higher resolution images can be attached if necessary). As stated in the manuscript, the YAP nuclear-to-cytosolic ratio is calculated using two adjacent areas of identical size, one inside the nucleus and the other one just outside (see Materials and Methods/Immunostainings), so the exact cell shape is irrelevant for this particular quantification.

      Regarding Reviewer #2:

      Effects on the circadian clock

      1. The authors use the fluorescent reporter created by Nagoshi from sections of the Rev-erbα gene. This reporter is widely used to estimate relative circadian timing in individual cells but it does not provide direct information on the circadian clock activity. In other words, while Reverb rhythmic expression is driven by the clock, it is not known whether less-rhythmic or non-rhythmic expression or change in expression level of Rev-erbα is affecting the core clock. For example, it has been shown that Rev-erbα knock-down cells are rhythmic as long as Rev-erb-beta is present. Thus, one major shortcoming of the current version of the manuscript is the missing dissection between Rev-erbα rhythmicity/expression and the circadian clock. More concretely, it remains unclear whether the change in Rev-erbα expression is a direct effect or caused by a defect clock. Since the authors presume a direct effect of YAP/TAP on Rev-erb expression, the former is likely. If that is the case, the data could be interpreted as that (missing) mechanic stimuli can lead to nuclear YAP/TAZ, which rises the level of Rev-erbα (and maybe interfere with its rhythmic accumulation). Beyond Rev-erbα expression, there may or may not be an effect on the circadian clock (core clock, CCGs). With the current version we do not know since the authors do not look beyond Rev-erbα expression. Thus, the claims on circadian clock or circadian rhythms in their cells is not studied in this version of the manuscript. The current version is still very interesting and provides insights into the Rev-erbα modulation, but additional work would be needed to show links with the core clock machinery. For this the authors could show influence (or at least correlation) of the YAP/TAZ/REVERBA phenotype on the oscillations of core clock genes or clock-controlled genes. Either through the use of alternative (ideally constitutive) reporters (e.g. PER2, BMAL1, fluorescent or LUC), or/and by analyzing RNA/Protein of core clock genes or output genes. This would not be necessary for all experiments, but at least for some were its possible (e.g. experiments with drugs perturbations). Otherwise, any claim like "YAP/TAZ perturbs the circadian clock ..." or "the circadian clock deregulation in nuclear YAP-enriched cells" is potentially flawed and has to be removed/reformulated.

      We agree with the reviewer. In order to understand if the core clock is affected, beyond REV-ERBA, by YAP/TAZ expression and localisation, we plan to perform the two experimental approaches explained below. For both of them we will use high-density cells with and without YAP-5SA overexpression since the other conditions (drugs, micropatterned cells, low density) may not render enough cells for analytical approaches that are not based on fluorescent microscopy (real-time qPCR or luminescence recordings). Also, the potential results obtained with YAP-5SA overexpression will be more informative regarding causality YAP-circadian clock than those using the other conditions described in the manuscript.

      1. We will use NIH3T3 bmal1::luc cells (already generated in our lab with the pABpuro-BluF plasmid; https://www.addgene.org/46824/) and an adapted microscopy-based system to track bioluminescence. We will need to give our cells a synchronisation shock since the single-cell signal with this reporter is too low and noisy to perform single-cell tracking.
      2. We will check during 48 hours, every 4 hours, the mRNA levels of Bmal1, Clock, Cry1, Per2, Yap1 and Rev-erbα via quantitative real-time PCR. As in A), we will need to synchronise our cells prior RNA collection. In case the expression of the other components of the clock are not affected by YAP-5SA overexpression, we will modify the message of our manuscript to emphasize the role of REV-ERBA. As the referee mentions (and we thank them for that comment), finding that the modulation of Rev-erbα is mechano-sensitive and dependent on YAP/TAZ signalling would be still very relevant, given the role of this factor in metabolism, inflammation, mitochondrial activity, or Alzheimer’s disease, as discussed in lines 231-235 in the manuscript.
      1. The authors aim to discard the possibility of paracrine signals by showing no increase in circadian power fraction of cells growing in low density with conditioned medium (Figure 2D). A paracrine signal coming from an oscillatory system is likely to oscillate and in that case, I do not see how growing cells in constant conditional medium can discard the effects of an oscillatory paracrine signal. I believe the elegant experiment shown in Figure 2E more precisely address this issue.

      The reviewer is right in the sense that paracrine coupling of circadian oscillators would require a circadian paracrine signal, like shown in Finger et al., 2021, and that we provide sufficient experimental evidence of a mechanics- rather than paracrine-driven control of the RevVNP circadian oscillations. Specifically, by using micropatterning (Fig. 2E) and gap closure (Fig. 2A) we show that cells under the same paracrine medium are able to display acute differences in RevVNP expression. The experiment with conditioned medium, which is a traditional technique used in some papers in the field like in Noguchi et al., 2013, was performed to rule out the possibility that secreted factors, even if not circadian, could ultimately impact the low-density cells’ circadian clock. We will rephrase the manuscript to stress out this reasoning.

      Finger AM, Jäschke S, del Olmo M, Hurwitz R, Granada AE, Herzel H & Kramer A (2021) Intercellular coupling between peripheral circadian oscillators by TGF-β signaling. Science Advances 7

      Noguchi T, Wang LL & Welsh DK (2013) Fibroblast PER2 circadian rhythmicity depends on cell density. Journal of Biological Rhythms 28: 183–192

      Data analysis methodology:

      1. Single-cell circadian recordings like the ones analyzed here are characterized by noisy amplitude and non-sinusoidal waveforms with fluctuating period (Bieler et al., 2014; Feillet et al., 2014). The authors interpolate, smooth, detrend and normalize their data; operations that are known to introduce spectral artifacts that can mislead the interpretation of the power spectrum. Moreover, the time-series pre-processing operations described by the authors in the methods sections is incomplete and the authors should more explicitly describe all their operations with exact methods applied, filter parameters and time-windows sizes (if applicable). To validate their pre-processing steps the authors could provide their time-series analysis pipeline code and/or provide a few examples of raw versus pre-processed data together with their respective spectrums before and after pre-processing. In addition, the authors could provide their raw trace signal data together with the corresponding post-processed signal data as plain text files.

      In our response to the reviewers, we will address this point exactly as requested by the reviewer. We will rewrite our methods section to explain better our analysis pipeline, clarifying that we do not apply detrending, that we resort rarely to interpolation of missing points, and stating the specifics of the standard low-pass filter we apply. We will then strengthen Supplementary Figure 1 with more examples of raw-data and processed data, and will provide raw trace signal data and the corresponding processed data to illustrate our approach.

      1. The authors rely on Fourier analysis and a reasonable self-made definition of circadian strength named as "circadian power fraction". Using a stationary-based method for noisy non-stationary data can lead to inaccurate spectrum power estimations. As the current version of the manuscript does not provide any alternative/complementary analysis method nor we have any available raw signal data it is unclear if their analysis appropriately represents the circadian power. The authors could consider implementing complementary data-analysis strategies to validate their conclusions. Fortunately, there are multiple suitable data analysis strategies already available that are exactly designed for this kind of data (eg. (Price et al., 2008; Leise et al., 2012; Leise, 2013; Bieler et al., 2014; Mönke et al., 2020). This time-series analysis methods is a crucial step as all main results on this manuscript rely on the authors self-made definition of circadian power. This is particularly important as there is no standardized method in the circadian field to estimate circadian rhythmicity and/or circadian power of single-cell traces.

      We will take this point into consideration by running a complementary analysis of our data with one of the methods recommended by the reviewer. Our choice is pyBOAT, as presented in Mönke et al. (2020), because on first inspection its implementation of the wavelet method appears to be the most suitable for our dataset type. If we find that our time-series are too short for these methods we will use the RAIN algorithm (Thaben and Westermark, 2014) instead.

      Mönke G, Sorgenfrei FA, Schmal C & Granada AE (2020) Optimal time frequency analysis for biological data - pyBOAT Systems Biology

      Thaben PF & Westermark PO (2014) Detecting Rhythms in Time Series with RAIN. J Biol Rhythms 29: 391–400

      1. The authors mainly show circadian power fraction and analyze rhythmicity scores/powers. Is there the a chance that a rise in the basal expression level of Rev-erbα is reducing the rhythmicity score? Or to phrase it otherwise, the absolute amplitude may remain the same, but the relative amplitude may be reduced? Would that affect the FT analysis power scored? To clarify this the authors could provide an analysis of the relative amplitude in addition to the circadian intensity (as in Fig.1C).

      Our analysis pipeline subtracts the mean signal from each cell’s intensity-time trace, and then divides each trace by its standard deviation. This procedure eliminates any bias due to basal expression of Rev-erbα. We will address this point by clarifying the methods section and providing examples in Supplementary Figure 1 of raw data with high-basal levels and low basal levels, showing their pre- and post-processed spectra.

      Minor points by text-line:

      YAP and TAZ should be introduced to the reader during introduction. by set a of proteins. Here the authors probably meant that cells were not reset nor entrained during the experiment. "..expression depends on..". This is a correlation, not proof of causation is shown until this point. This is an overstatement. Using the term "provoked" suggests a causal relationship not shown. Similarly last sentence "This result established.... is caused..". Again, this is an overstatement as only correlation is shown. According to their description the authors are not using any image-preprocessing steps, eg background subtraction or other filters. Is this correct? It is not clear what image metric for the single-cell signals are the authors using, eg. integrated nuclear intensity or mean/median nuclear intensity. I am not familiar with TrackMate but it might be possible to export and share with the readers the image-analysis pipeline used which would clarify any questions about image processing and signal extraction.

      We thank the reviewer for pointing out all these minor points. We will address each one of them to make the paper clearer.

      Regarding Reviewer #3:

      The authors state in lines 163-165: 'This striking anticorrelation reveals that the robustness of the Rev-erbα circadian expression depends on the nucleocytoplasmic transport of YAP and its mechanosensitive regulation'. Although interesting, the data in figure 3 to which this statement refers is, as the authors identify, correlative, rather than causative. I would strongly suggest altering this statement to better reflect the data.

      We will modify the text to eliminate this overstatement.

      It looks to me as though all experiments were carried out in the same clonal reporter 3T3 line. To avoid possible issues with founder effects, I would ask that the authors repeat the initial experiment in figure 1B, and the associated analysis as in 1C-E with a different clonal 3T3 line. Hopefully this will not be very arduous, as the methods suggest that multiple clonal 3T3 reporter lines were made initially. With time to defrost, plate, record and analyse the data, I would hope that this would not take more than six weeks maximum.

      We will perform the experiments regarding the cell density effect on the RevVNP oscillations (Fig. 1) in another clonal 3T3 line as the reviewer suggests. We have already initiated the experimental repeats with the alternative clone.

      I would note that the custom software used for analysis does not appear to be generally available. I would assume that the authors would make this available upon request.

      We will extend the explanation of our method as suggested by Reviewer #2 and make the code available to the community.

      Experiments appear to have been adequately replicated in terms of n. However, the robustness of these findings would be supported though use of a different clonal reporter line, as discussed above.

      We will solve this problem as stated above.

      Statistical analysis is generally appropriate. I would suggest including statistical analysis in figures 3B and S4B to demonstrate that the pharmacological treatments are indeed having a statistically significant effect on the MAL and YAP nuclear/cytoplasmic ratio.

      We will perform the corresponding statistical analysis on those data.

      For Figure 4, it is not stated which statistical tests have been used, with only P values given in table S1. Please state which test has been used.

      We will specify the statistical test used in the figure legend.

      Furthermore, it would be valuable to see if it is possible to perform statistical analysis looking at the populations should in Figure 4A, to either support or refute the statement made in Line 189-90 that 'we overexpressed 5SA-S94A-YAP, a mutant version of YAP unable to interact with TEAD and observed that the cells recovered, to a large extent, both the RevVNP circadian power fraction and the REV-ERBα basal levels displayed by the wild-type high-density population'

      The p-values corresponding to that dataset are represented in Table S1, but we will move them to the figure legend so the extent of the differences between the YAP mutants and the control becomes more noticeable. This applies too to the next comment of the reviewer.

      Additionally, it is a little unclear to me why exact p values are reported in table S1. It seems that they might be better placed in the relevant figure legend.

      Minor comments:

      Although the authors took good care to try to ensure that there was minimal phase synchrony between cells, it would be good to see some analysis to confirm that these efforts were successful. This is of particular concern, given that many things that commonly happen during cell handling, such as temperature change and media change, even with conditioned media, can act to synchronise cells. Hopefully, this information should be available from your existing analysis.

      All our experiments, except for the gap closure ones (which imply an unavoidable medium shock after the removal of the gasket where the cells are cultured to achieve high density) are carried out in a similar way (see Materials and Methods). This approach does not involve the typical shock of serum, dexamethasone, or other hormones, because we want to avoid biochemical signalling that could mask the “pure” effect of mechanics on the pathways that affect the circadian clock. In any case, a certain level of synchrony should not affect the analysis we perform, since this is single cell-based and does not consider the phase but the strength of its circadian frequency. But as requested by the reviewer we will analyze the phase signal and report the results if relevant to the project.

      It would be informative to see both phase and period analysis for the data shown in figure 2C. Do cells at the edge show differences in relative synchrony following the removal of the PDMS barrier and Rev-erba induction? Is there a period difference between cells at the edge and those that remain confluent?

      We agree with the referee that the “shock” received by the cells at the edge should work as a reset of their circadian phase and we have tried to analyse this effect. However, there are technical limitations that make this analysis difficult, mainly the short duration of the experiment and the fact that these cells transition very fast, upon gap closure, from a non-circadian to a circadian behaviour. We will attempt to better report this interesting effect by using the WAVECLOCK (Price et al., 2008) or the pyBOAT method (Mönke et al., 2020), suggested by Reviewer #2, which are designed to analyse non-stationary data.

      Mönke G, Sorgenfrei FA, Schmal C & Granada AE (2020) Optimal time frequency analysis for biological data - pyBOAT Systems Biology

      Price TS, Baggs JE, Curtis AM, Fitzgerald GA & Hogenesch JB (2008) WAVECLOCK: wavelet analysis of circadian oscillation. Bioinformatics 24: 2794–2795

      Figure 2B - the text states that those cells far from the edge oscillate robustly thoughout the experiment, but this is not easy to see from this kymograph due to the dynamic range. Is there another way of presenting this that might make it easier to confirm?

      We will calculate the circadian power fraction of the “bulk” cells as we do for the other conditions described in the manuscript. We can also show examples of individual traces if the average shown in Fig. 2C or the kymograph in Fig. 2B are not clear enough.

      Figure 1D-E - the text provides periodicity for the high-density cells, but not the low density ones. Could you provide periodicity for both populations - do they differ?

      We will represent in more detail the results of the frequency analysis on the low-density cells so the diversity of periods (frequencies) at this condition gets more evident.

      Figure S3 - it is interesting to note the difference in population rhythmicity between the bulk and edge data here, which is not seen so clearly in cells without thymidine. Could the authors comment on this?

      We agree with the referee that there is an obvious difference regarding RevVNP expression (mainly on the edge cells but also in the bulk) between the experiments with and without thymidine. We hypothesise this is due to the pronounced decrease in cell divisions in the presence of thymidine, which considerably slows down the gap closure and impacts the density of the entire cell population. We will comment this effect in the manuscript.

      Line 148 - it is unclear here what is meant by 'the onset of circadian oscillations'. Could you rephrase this for clarity?

      We will change that sentence.

      Line 173 - a few words to highlight that Lats is a kinase and the function of YAP phosphorylation by Lats would aid clarity here. Similarly, explanation of the functional difference between the protein with 4 Serine to alanine mutations and 5 mutations and why both of these mutants were used would be helpful.

      We will clarify this point following the reviewer’s suggestion.

      Line 174 - for accuracy, this should perhaps read 'fibroblast circadian clock', as this work is only in 3T3 cells, and therefore may not apply more generally.

      We will implement this change.

      Line 202 - could you expand to explain the existing limitations of studying cell signalling cascades in synchronised cells? This is not clear to me. Thanks.

      We will discuss the signalling effects caused by 50% serum shocks and other traditional ways to synchronise the cells as requested by the reviewer.

      Figures 1D and 4B - the choice of colour range used in these kymographs is skewed towards the warmer colours, making it quite hard to discern differences between the groups. I would suggest using the cooler colour range for a greater proportion of the data set, to make rhythmicity, or lack of it, clearer to see.

      We will invest further efforts to finding the optimal colour map and range for our datasets.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this study, the authors employ the NIH 3T3 fibroblast cell line to study the effect of cell density and the associated mechanical cues on cellular circadian rhythmicity. For this, they generate a Rev-erbα:VENUS line and, combined with constitutive nuclear mCherry expression, are able to track Rev-erbα: expression in single cells within populations of differing densities. Using overexpression of the transcriptional co-regulator YAP and specific mutants thereof, they suggest a role for YAP and its associated transcription factor family, TEAD, in the regulation of Rev-erbα expression under conditions of differing cell density.

      Major comments:

      • Are the key conclusions convincing? Yes, the major conclusions are supported by the data shown.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The authors state in lines 163-165: 'This striking anticorrelation reveals that the robustness of the Rev-erbα circadian expression depends on the nucleocytoplasmic transport of YAP and its mechanosensitive regulation'. Although interesting, the data in figure 3 to which this statement refers is, as the authors identify, correlative, rather than causative. I would strongly suggest altering this statement to better reflect the data.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      It looks to me as though all experiments were carried out in the same clonal reporter 3T3 line. To avoid possible issues with founder effects, I would ask that the authors repeat the initial experiment in figure 1B, and the associated analysis as in 1C-E with a different clonal 3T3 line. Hopefully this will not be very arduous, as the methods suggest that multiple clonal 3T3 reporter lines were made initially. With time to defrost, plate, record and analyse the data, I would hope that this would not take more than six weeks maximum.

      • Are the data and the methods presented in such a way that they can be reproduced?

      Yes. I would note that the custom software used for analysis does not appear to be generally available. I would assume that the authors would make this available upon request.

      • Are the experiments adequately replicated and statistical analysis adequate?

      Experiments appear to have been adequately replicated in terms of n. However, the robustness of these findings would be supported though use of a different clonal reporter line, as discussed above.

      Statistical analysis is generally appropriate. I would suggest including statistical analysis in figures 3B and S4B to demonstrate that the pharmacological treatments are indeed having a statistically significant effect on the MAL and YAP nuclear/cytoplasmic ratio.

      For Figure 4, it is not stated which statistical tests have been used, with only P values given in table S1. Please state which test has been used.

      Furthermore, it would be valuable to see if it is possible to perform statistical analysis looking at the populations should in Figure 4A, to either support or refute the statement made in Line 189-90 that 'we overexpressed 5SA-S94A-YAP, a mutant version of YAP unable to interact with TEAD and observed that the cells recovered, to a large extent, both the RevVNP circadian power fraction and the REV-ERBα basal levels displayed by the wild-type high-density population'

      Additionally, it is a little unclear to me why exact p values are reported in table S1. It seems that they might be better placed in the relevant figure legend.

      Minor comments:

      • Specific experimental issues that are easily addressable. Although the authors took good care to try to ensure that there was minimal phase synchrony between cells, it would be good to see some analysis to confirm that these efforts were successful. This is of particular concern, given that many things that commonly happen during cell handling, such as temperature change and media change, even with conditioned media, can act to synchronise cells. Hopefully, this information should be available from your existing analysis.

      It would be informative to see both phase and period analysis for the data shown in figure 2C. Do cells at the edge show differences in relative synchrony following the removal of the PDMS barrier and Rev-erba induction? Is there a period difference between cells at the edge and those that remain confluent?

      • Are prior studies referenced appropriately?

      To the best of my knowledge, yes.

      • Are the text and figures clear and accurate?

      Figure 2B - the text states that those cells far from the edge oscillate robustly thoughout the experiment, but this is not easy to see from this kymograph due to the dynamic range. Is there another way of presenting this that might make it easier to confirm?

      Figure 1D-E - the text provides periodicity for the high-density cells, but not the low density ones. Could you provide periodicity for both populations - do they differ?

      Figure S3 - it is interesting to note the difference in population rhythmicity between the bulk and edge data here, which is not seen so clearly in cells without thymidine. Could the authors comment on this?

      Line 148 - it is unclear here what is meant by 'the onset of circadian oscillations'. Could you rephrase this for clarity?

      Line 173 - a few words to highlight that Lats is a kinase and the function of YAP phosphorylation by Lats would aid clarity here. Similarly, explanation of the functional difference between the protein with 4 Serine to alanine mutations and 5 mutations and why both of these mutants were used would be helpful.

      Line 174 - for accuracy, this should perhaps read 'fibroblast circadian clock', as this work is only in 3T3 cells, and therefore may not apply more generally.

      Line 202 - could you expand to explain the existing limitations of studying cell signalling cascades in synchronised cells? This is not clear to me. Thanks.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Figures 1D and 4B - the choice of colour range used in these kymographs is skewed towards the warmer colours, making it quite hard to discern differences between the groups. I would suggest using the cooler colour range for a greater proportion of the data set, to make rhythmicity, or lack of it, clearer to see.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      This work provides a potential mechanism for the modulation of cellular rhythmicity under conditions of varying cell density, a currently relatively understudied area to which the contribution made here will be valuable. However, the work is limited by its use of only one cell type (NIH 3T3) and one reporter (Rev-erb:VENUS), which makes the work difficult to generalise in the context of all cell types and environments that exist in a mammalian context.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      Previous work has identified YAP activity as a mechanism of signalling growth substrate stiffness (Halder et al. 2012, Panciera et al. 2017). It has also been speculated, but not demonstrated, that YAP might influence circadian rhythmicity (Streuli and Meng, 2019). This work provides some initial evidence to support this speculation.

      • State what audience might be interested in and influenced by the reported findings.

      This work would be of genera; interest to those working on mammalian cellular circadian rhythmicity. Additionally, given YAP's status as an oncogene, this work would also be relevant to those considering circadian disruption in cancer.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Mammalian circadian cell biology and biochemistry.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Abenza et al. investigate an important question of how the physical environment affects the properties of the individual circadian clocks. The authors utilize a set of clever experiments, pharmacological manipulations and data analysis techniques to unveil a potential role of YAP/TAZ in the circadian clock.

      Major comments:

      Effects on the circadian clock

      1. The authors use the fluorescent reporter created by Nagoshi from sections of the Rev-erbα gene. This reporter is widely used to estimate relative circadian timing in individual cells but it does not provide direct information on the circadian clock activity. In other words, while Reverb rhythmic expression is driven by the clock, it is not known whether less-rhythmic or non-rhythmic expression or change in expression level of Rev-erbα is affecting the core clock. For example, it has been shown that Rev-erbα knock-down cells are rhythmic as long as Rev-erb-beta is present. Thus, one major shortcoming of the current version of the manuscript is the missing dissection between Rev-erbα rhythmicity/expression and the circadian clock. More concretely, it remains unclear whether the change in Rev-erbα expression is a direct effect or caused by a defect clock. Since the authors presume a direct effect of YAP/TAP on Rev-erb expression, the former is likely. If that is the case, the data could be interpreted as that (missing) mechanic stimuli can lead to nuclear YAP/TAZ, which rises the level of Rev-erbα (and maybe interfere with its rhythmic accumulation). Beyond Rev-erbα expression, there may or may not be an effect on the circadian clock (core clock, CCGs). With the current version we do not know since the authors do not look beyond Rev-erbα expression. Thus, the claims on circadian clock or circadian rhythms in their cells is not studied in this version of the manuscript. The current version is still very interesting and provides insights into the Rev-erbα modulation, but additional work would be needed to show links with the core clock machinery. For this the authors could show influence (or at least correlation) of the YAP/TAZ/REVERBA phenotype on the oscillations of core clock genes or clock-controlled genes. Either through the use of alternative (ideally constitutive) reporters (e.g. PER2, BMAL1, fluorescent or LUC), or/and by analyzing RNA/Protein of core clock genes or output genes. This would not be necessary for all experiments, but at least for some were its possible (e.g. experiments with drugs perturbations). Otherwise, any claim like "YAP/TAZ perturbs the circadian clock ..." or "the circadian clock deregulation in nuclear YAP-enriched cells" is potentially flawed and has to be removed/reformulated.

      2. The authors aim to discard the possibility of paracrine signals by showing no increase in circadian power fraction of cells growing in low density with conditioned medium (Figure 2D). A paracrine signal coming from an oscillatory system is likely to oscillate and in that case, I do not see how growing cells in constant conditional medium can discard the effects of an oscillatory paracrine signal. I believe the elegant experiment shown in Figure 2E more precisely address this issue.

      Data analysis methodology:

      1. Single-cell circadian recordings like the ones analyzed here are characterized by noisy amplitude and non-sinusoidal waveforms with fluctuating period (Bieler et al., 2014; Feillet et al., 2014). The authors interpolate, smooth, detrend and normalize their data; operations that are known to introduce spectral artifacts that can mislead the interpretation of the power spectrum. Moreover, the time-series pre-processing operations described by the authors in the methods sections is incomplete and the authors should more explicitly describe all their operations with exact methods applied, filter parameters and time-windows sizes (if applicable). To validate their pre-processing steps the authors could provide their time-series analysis pipeline code and/or provide a few examples of raw versus pre-processed data together with their respective spectrums before and after pre-processing. In addition, the authors could provide their raw trace signal data together with the corresponding post-processed signal data as plain text files.

      2. The authors rely on Fourier analysis and a reasonable self-made definition of circadian strength named as "circadian power fraction". Using a stationary-based method for noisy non-stationary data can lead to inaccurate spectrum power estimations. As the current version of the manuscript does not provide any alternative/complementary analysis method nor we have any available raw signal data it is unclear if their analysis appropriately represents the circadian power. The authors could consider implementing complementary data-analysis strategies to validate their conclusions. Fortunately, there are multiple suitable data analysis strategies already available that are exactly designed for this kind of data (eg. (Price et al., 2008; Leise et al., 2012; Leise, 2013; Bieler et al., 2014; Mönke et al., 2020). This time-series analysis methods is a crucial step as all main results on this manuscript rely on the authors self-made definition of circadian power. This is particularly important as there is no standardized method in the circadian field to estimate circadian rhythmicity and/or circadian power of single-cell traces.

      3. The authors mainly show circadian power fraction and analyze rhythmicity scores/powers. Is there the a chance that a rise in the basal expression level of Rev-erbα is reducing the rhythmicity score? Or to phrase it otherwise, the absolute amplitude may remain the same, but the relative amplitude may be reduced? Would that affect the FT analysis power scored? To clarify this the authors could provide an analysis of the relative amplitude in addition to the circadian intensity (as in Fig.1C).

      Minor points by text-line:

      1. YAP and TAZ should be introduced to the reader during introduction.
      2. by set a of proteins.
      3. Here the authors probably meant that cells were not reset nor entrained during the experiment.
      4. "..expression depends on..". This is a correlation, not proof of causation is shown until this point. This is an overstatement.
      5. Using the term "provoked" suggests a causal relationship not shown.
      6. Similarly last sentence "This result established.... is caused..". Again, this is an overstatement as only correlation is shown.
      7. According to their description the authors are not using any image-preprocessing steps, eg background subtraction or other filters. Is this correct?
      8. It is not clear what image metric for the single-cell signals are the authors using, eg. integrated nuclear intensity or mean/median nuclear intensity. I am not familiar with TrackMate but it might be possible to export and share with the readers the image-analysis pipeline used which would clarify any questions about image processing and signal extraction.

      References:

      1. Bieler, J, Cannavo, R, Gustafson, K, Gobet, C, Gatfield, D, and Naef, F (2014). Robust synchronization of coupled circadian and cell cycle oscillators in single mammalian cells. Mol Syst Biol 10, 739.

      2. Leise, TL (2013). Wavelet analysis of circadian and ultradian behavioral rhythms. J Circadian Rhythms 11, 5.

      3. Leise, TL, Wang, CW, Gitis, PJ, and Welsh, DK (2012). Persistent Cell-Autonomous Circadian Oscillations in Fibroblasts Revealed by Six-Week Single-Cell Imaging of PER2::LUC Bioluminescence. PLoS One 7, 1-10.

      4. Mönke, G, Sorgenfrei, F, Schmal, C, and Granada, A (2020). Optimal time frequency analysis for biological data - pyBOAT. BioRxiv 179, 985-986.

      5. Price, TS, Baggs, JE, Curtis, AM, FitzGerald, GA, and Hogenesch, JB (2008). WAVECLOCK: wavelet analysis of circadian oscillation. Bioinformatics 24, 2794-2795.

      Significance

      I believe this manuscript is of high significant both for the circadian as well as the mechanobiology fields. Readers from single-cell signalling studies will also be very interested in this work.

      To my knowledge the discussed link has not been studied before at single cell level, which as the authors show can provide multiple new insights.

      I do work with similar single-cell signals, have broad expertise in microscopy, image analysis methods, time series analysis, and the circadian clock mechanisms but very little experience in mechanobiology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Here Abenza & Rossetti et al. show that in 3T3 fibroblasts, the circadian clock depends on cell density, correlates with YAP activity, and further demonstrate that circadian power fraction is suppressed by genetic YAP activation (5SA), but is rescued by expression of 5SA YAP without the tead-binding domain. This is a striking study on an important question; however, the data do not directly support the conclusions and the title of the paper. These conclusions/title should be altered, or supported with additional experiments, as detailed below:

      Major Critiques:

      1. On "mechanical control": The authors show changes in circadian power fraction with changes in YAP and with cytoskeletal inhibitors, but there are no properly-controlled experiments that directly perturb mechanics. The authors show a correlation between YAP nuclear/cytoplasmic ratio and circadian power, but YAP N/C alone is not a readout of mechanotrasndcution, per se. The authors have shown two different experiments where cells are cultured on a stiff (30kPa) substrate and soft substrate (300Pa), but they do not shown a direct comparison of YAP nuclear localization and circadian power under these two conditions in the same experiment. Direct, controlled perturbation of mechanical cues is necessary to support the title's use of the phrase "mechanical control."

      2. On "via YAP/TAZ": In addition to above, it is necessary to show that the changes in Circadian power fraction induced by mechanical cues in fact require YAP/TAZ signaling. Thus, an experiment comparing soft (300Pa) substrate with Stiff (30kPa) substrate in the presence or absence of YAP/TAZ is necessary to state that YAP and TAZ are the mechanistic mediators of mechanical cues on the clock.

      3. While the TEAD-binding domain mutant experiment is elegant, to claim that TEAD is the transcriptional mediator, it must be demonstrated that this mutant indeed fails to induce TEAD-mediated transcription. This could be simply executed by demonstrating that the CCD mutant expresses reduced CTGF and Cyr61 (for example), compared to the 5SA, under these conditions. Further, endogenous YAP is still active and available to bind to TEAD in this system, which should be discussed.

      4. In Figure 3a: The cell perimeter needs to be shown either by actin staining or by brightfield images. The manually marking of cell boundaries is insufficient, specifically because the drugs used in this experiment affect the cytoskeleton. It would be very helpful to see this via actin staining or in the least with brightfield images.

      Significance

      This is an exciting paper that potentially links mechanotransduction to the circadian clock. While my group is not focused on circadian rhythms, and I don't have the background to comment on the measurements or robustness of circadian power, the idea is striking and significant.

      I strongly recommend inclusion of loss-of-function approaches (either genetic or pharmacologic) in addition to the gain-of-function methods employed here to support the necessity of YAP/TAZ signaling. Also, appropriately controlled experiments to show true mechanical effects on the circadian clock are necessary.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): ____ *A significant criticism of the paper is an assumption that readers will be familiar with all of the findings in the author's previous 2016 paper and the PGL-1 papers by Aoki et al. Minimal context is given for each approach. *

      To address this concern, we have added a paragraph in the Introduction section of the revised manuscript.

      *Some conclusions are not well supported and require further analysis, proper controls, and more extensive descriptions of the experiments performed. *

      We have addressed the reviewer’s concerns as detailed below.

      Most importantly, the central conclusion and title of the paper is that composition can buffer the dynamics of individual proteins within liquid-like condensates. In other words, in vitro condensation assays often do not recapitulate LLPS behavior in vivo. That said, the findings in this study would be significantly strengthened and complemented by observing endogenously tagged PGL-3 and PGL-3 mutants in living worms, considering the efficiency of using CRISPR in C. elegans to insert tags and make precise mutations.

      The original manuscript already contained data where we microinjected wild-type PGL-3 and mutant PGL-3 proteins (recombinantly purified) into adult C. elegans gonads to assay how the P granule phase supports diffusion of these proteins.

      In the revised version, we now include additional data which shows “dynamics buffering” in transgenic worms generated using CRISPR/Cas9 technology. Briefly, we used CRISPR/Cas9 to generate transgenic C. elegans which expresses PGL-3-mEGFP or PGL-3(D425-452)-mEGFP from the native pgl-3 locus. In vitro, wild-type PGL-3-mEGFP protein generates liquid-like condensates. On the other hand, the recombinantly purified PGL-3(D425-452)-mEGFP protein generates condensates that are non-dynamic. In contrast to these observations in vitro, both wild-type PGL-3-mEGFP and PGL-3(D425-452)-mEGFP show similar dynamics (half-time of FRAP recovery) within P granules in vivo.

      *To improve readability, the introduction to P granules should be expanded, and include the reasons for looking at the nematode-specific PGL-3 protein among all the other known P granule proteins. A recap of previous findings on PGL-3 phase separation, in vivo and in vitro, is warranted, starting with the significant results of Saha et al 2016. Setting up the investigative questions in the context of recent work on PGL-1 (Aoki, et al) is also necessary. *

      To address this concern, we have added a paragraph in the Introduction section of the revised manuscript.

      The physiological concentration of PGL-3 should be more transparent, including why some experiments in this study are done at physiological concentrations while others are not. Describing why salt concentrations, crowding agents, and protein abundance are similar or different for each experiment is necessary and relevant. For example, after showing in Figure 1 that PGL-3 protein phase separates, the paragraph starting on line 161 says that it was previously shown that PGL-3 doesn't phase separate at physiological concentrations without RNA. One has to go back to Figure 1 to realize it was done differently than Figure 2 and Saha 2016.

      The concentrations of PGL-3 protein and use of crowding agents (if any) have already been specified within figures or figure legends. Salt concentrations used are specified within figure legends or materials and methods section.

      We have added the following paragraph to the materials and methods section of the revised manuscript.

      “Saha et al. 2016 showed that at physiological concentrations (approx. 1 mM), the PGL-3 protein is unable to phase separate into condensates. At these concentrations, mRNA promotes phase separation of PGL-3. To assay for mRNA-dependence of condensate assembly, it is therefore essential to use physiological concentrations of the PGL-3 protein or mutants (e.g. Figure 2). However, these condensates are generally too small to assay rate of internal rearrangement of PGL-3 molecules within condensates using fluorescence recovery after photobleaching experiments. Therefore, to generate large condensates for measuring internal rearrangement of PGL-3 or mutant molecules, we primarily used higher concentrations of these proteins where binding to RNA is not essential for phase separation. However, to mimic the in vivo P granule phase as closely as possible, we generally added constituent proteins in proportion to their in vivo abundance estimated in Saha et al. 2016.”

      The added paragraph in the Introduction section of the revised manuscript may be helpful to the readers. * *

      *Statements in the same paragraph like "in contrast to full-length PGL-3, mRNA does not support phase separation..." should be qualified by stating the concentration observed, with or without salts or other crowding agents. Similarly, line 230 "suggests that interactions involving the disordered C-terminal region of PGL-3 are not essential for the fast dynamics" and should be qualified with "at non-physiological concentrations and with XX crowding agents or salt concentration." It would be more consistent if physiological concentrations were consistent from figure to figure, as extra variables weaken some of the stated conclusions. *

      We thank the reviewer for this suggestion. However, we feel the statements (without full experimental details within main text) help convey the conceptual essence of the findings better. Of course, all these statements contain reference to figures or prior publications which provide relevant details about experimental conditions.

      *The 2010 review reference stating that there are 40 P granule enriched proteins is outdated. More recent reviews put the number much higher. This is relevant because the approach to put PGL-3 in a more physiological environment by including just PGL-1, GLH-1 and mRNA with the condensate assays, out of ~100 P granule enriched proteins, may not be sufficient to conclude "that the influence of complex composition on dynamics is modest" (line 223), or imply that the multicomponent nature of the P granule is reconstituted by adding these components (line 355). *

      We revised the text to indicate that P granules contain approx. 70 proteins and added appropriate references.

      • *

      Based on current information of constitutive P granule components (PGL-1, PGL-3, GLH-1, GLH-2, GLH-3, GLH-4, DEPS-1, MIP-1 and mRNA), (Kawasaki et al, 1998, 2004; Spike et al, 2008a, 2008b; Price et al, 2021; Cipriani et al, 2021; Phillips & Updike, 2022) we reconstituted P granule-like phase in vitro with mRNA, PGL- and GLH- proteins that likely constitute the most abundant components within P granules in vivo (based on concentration estimates in Saha et al. 2016).

      We do appreciate the reviewer’s comment that more components can be added to our in vitro reconstitution in addition to the limited set of components used in our study. However, we feel it is interesting to observe that a limited set of components can support dynamics buffering (the main message of the paper). Further, the complementary in vivo experiments show that the P granule phase can also support dynamics buffering.

      *Figure 1C needs to include PGL-3(370-693) in the analysis. Figure 1E is also incomplete without a comparison of FRAP recovery between PGL-3(1-452) and full PGL-3 as the control.

      *

      Fig. 1c already includes data with PGL-3 (370-693) [top row, central panel]. FRAP recovery data with full-length PGL-3 is already available in Supplementary Fig. 2c, g.

      *Figure 4C is missing an essential control where PGL-3 and S1 FRAP is performed without PGL-1, GLH-1, and mRNA. *

      In the revised version, we have added Supplementary Fig. 5f, where FRAP recovery of the following condensates are plotted together: 1) PGL-3 alone, 2) S1 alone, 3) PGL-3 + PGL-1, GLH-1 and mRNA, 4) S1 + PGL-1, GLH-1 and mRNA.

      *It would also help show sup Fig4A in the main figure to show concentration dependence. *

      We revised Fig. 4 to address the reviewer’s suggestion.

      Consider adding subtitles to supplementary figures.

      We considered the suggestion but felt it may not be essential.

      *M&M should include an explanation for statistical analysis *

      We added a paragraph describing statistical analysis within the Materials and Methods section.

      *CROSS-CONSULTATION COMMENTS I am also in agreement with the comments and critiques of reviewers 2 and 3.

      * Reviewer #1 (Significance (Required)): The paper by Saha and colleagues investigate the in vitro liquid-liquid phase separation propensity of a P granule protein PGL-3 and its structural domains. The findings largely replicate and support the phase-separation properties of a paralogous protein called PGL-1, as recently described by Aoki et al. 2021. Furthermore, they show that the dynamics demonstrated by recombinant PGL-3 may be maintained or buffered by the complex composition of P granules.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      *Jelenic et al. describe the effect of partner proteins on the FRAP dynamics of recombinant PGL-3 protein and variants in in vitro condensates and C elegans p-granules. The study shows that the N terminal a-helical dimerization domains is required for condensate formation and modulate of it alters aggregation and the FRAP dynamics of its condensates. Interestingly, a construct including the entire IDR region (370-693) by itself does not phase separate on its own at these conditions. The K126E K129E mutant (known previously to disrupt dimerization) and the deletion mutant abrogate llps. A mutant construct that shuffles the sequence in the region 423-453 called S1 here reduces the helicity and the condensate FRAP dynamics but recovered in the presence of a few P granule components. Also, the reduced dynamics of partially unfolded PGL-3 condensates are also rescued by the p-granule components to a certain degree of the unfolded PGL3 concentrations. This threshold concentration for recovering the condensate dynamics is further reduced in the helix reducing S1 mutant, which is also dependent on the number and the nature of P granule components.

      Overall, the study aims to probe how "composition can buffer protein dynamics within liquid-like condensates" - yet several underlying aspects of the study do not fully support that conclusion. The introduction does not sufficiently introduce the known structural information of the two dimerization domains in C elegans PGL proteins for which structures are known. The region is discussed as "alpha helical" but really there are two evolutionarily conserved independently folding dimerization domains (referring to the mutants as "reduced alpha helicity" is not helpful - these are mutations that destabilize a folded domain).*

      To address this concern, we have added a paragraph in the Introduction section of the revised manuscript.

      *Additionally, the abstract and introduction ignore the aspects of aggregation (touched on in discussion) - this is likely what the disruption to the helical region in residue 450 region is doing (the helix is not on the dimer interface based on homology / sequence identity to the crystal structure of PGL-1 central dimerization domain. *

      We think elucidating the molecular mechanism of apparent aggregation of PGL-3 (D425-452) could be an interesting direction for future investigation. Here, we focused our analysis predominantly on the mutant S1 since it generates liquid-like condensates with ~20- fold slower dynamics (compared to wild-type) in contrast to non-dynamic condensates/aggregates. Therefore, influence of other P granule components on the dynamics of PGL-3 in liquid-like condensates is easier to address using the mutant S1 rather than PGL-3 (D425-452). We didn’t find evidence that S1 aggregates as we did not detect aggregates of S1 molecules using fluorescence confocal microscopy and the slow dynamics in condensates of S1 does not change significantly over 24 h (Supplementary Fig. 3f).

      However, in the revised version, we now include additional in vivo data with C. elegans expressing the aggregation-prone PGL-3 (D425-452)-mEGFP. Briefly, we used CRISPR/Cas9 to generate transgenic C. elegans which expresses PGL-3-mEGFP or PGL-3(D425-452)-mEGFP from the native pgl-3 locus. In vitro, wild-type PGL-3-mEGFP protein generates liquid-like condensates. On the other hand, the recombinantly purified PGL-3(D425-452)-mEGFP protein generates condensates that are non-dynamic. In contrast to these observations in vitro, both wild-type PGL-3-mEGFP and PGL-3(D425-452)-mEGFP show similar dynamics (half-time of FRAP recovery) within P granules in vivo.

      Finally, the "dynamics buffering" is not really clearly established and could also be explained as small concentrations of aggregated proteins act like clients while increasing the concentration results in aggregation and "cross linking" in the entire droplet - and this concentration is never achieved in the in worm experiments so it is not clear. In other words, the change in FRAP dynamics not observed in worms is perhaps not surprising if small amount of recombinant proteins are incorporated into the granules. *

      *

      Data with the S1 mutant establishes that dynamics buffering can be observed in condensates with different sets of additives both in vitro (Fig. 5a, b) and in vivo (Fig. 4a, b). Further, data with condensates of S1 containing the additives PGL-3 (K126E K129E) or S1 (K126E K129E) demonstrate that dynamics (half-time of FRAP recovery) within S1 condensates, and in turn “dynamics buffering” depend on inter-molecular interactions. With respect to the hypothesis proposed by the reviewer, we did not detect aggregates within S1 condensates using confocal fluorescence microscopy.

      In contrast to S1 condensates, condensates containing partially unfolded PGL-3-mEGFP together with PGL-1, GLH-1 and mRNA showed spatial inhomogeneities in fluorescence signal throughout the condensate (Fig. 4g). We have not tested if areas with higher fluorescence signal represent aggregates. It is a possibility that the partially unfolded PGL-3-mEGFP fluorescence signal becomes more homogeneous if higher concentrations of additives (PGL-1, GLH-1 and mRNA) are used. However, the presented data demonstrate the significant effect of the P granule components (PGL-1, GLH-1 and mRNA) on the FRAP recovery rate of partially unfolded PGL-3-mEGFP in condensates (compare figures Fig. 3e and Fig. 4g).

      However, consistent with dynamics buffering, the P granule phase in vivo supports wild-type dynamics of different PGL-3 constructs over a range of concentrations - PGL-3(D425-452)-mEGFP at physiological concentration (CRISPR transgenic strain, Fig. 4e) or at higher concentrations (microinjected S1 and partially unfolded PGL-3-mEGFP, Fig. 4b).

      • *

      *It is also not clear what the mechanism of the changes is - is the protein driven to fold more properly (despite S1 disruption of its conserved sequence) inside the condensate? Does it still self interact and act as a dimerization domain? Does this change disrupt interactions? *

      We agree with the reviewer that identifying the precise structural changes of the S1 protein within the condensate vs. dilute phase could be an interesting direction for future investigation. However, we have already discussed the issues raised by the reviewer in the original manuscript.

      “Our data is consistent with the model that other regions of S1 molecules cooperate with residues 425-452 (shuffled) to generate stronger inter-molecular interactions. For instance, addition of the mutant S1 (K126E K129E) enhances dynamics of S1 within condensates in contrast to maintaining the slower dynamics observed within condensates of S1 alone. This suggests that the interactions disrupted by the mutations K126E and K129E also contribute to slow S1 dynamics. One possibility is that interactions involving the residues K126 and K129 favor S1 conformations that enhance 425-452 (shuffled)-dependent interactions. Indeed, the mutations K126E K129E have been reported to interfere with interactions among N-termini of PGL-3 molecules (Aoki et al, 2021). While two self-association domains within the α-helical N-terminus of PGL-3 have been mapped (Aoki et al, 2021, 2016), structural insights into those associations are limited. However, PGL-3 shares significant sequence similarity with another protein PGL-1. Crystal structures are available for fragments of the PGL-1 protein that show the two self-association domains at the N-terminus are predominantly α-helical and globular in nature (Aoki et al, 2016, 2021). Therefore, one possibility is that shuffling the sequence 425-452 of PGL-3 or heat-induced unfolding of PGL-3 exposes hydrophobic residues that become available to participate in inter-molecular interactions.”

      What is the real mechanism by which PGL-3 phase separates if not via the disordered domains? *

      *

      We agree with the reviewer that elucidating the detailed mechanism of phase separation of PGL-3 is an interesting direction for future investigation. However, we feel this is not required to support the main message of this manuscript.

      Throughout the manuscript, the term "dynamics" is used to indicate FRAP, but it would be better to define what is meant (diffusion of PGL-3 in condensates) instead of using dynamics a term that could mean many things. Secondly, FRAP cannot directly measure liquidity etc (see recent critiques by McSwiggen elife 2019, etc) so it is better to be cautious in the claims. Finally, discussing "dyanmics buffering" adds more terminology where it is not needed - perhaps say "changes to diffusion of PGL-3 in condensates".

      We feel it is useful to introduce a term that describes our observation. To our knowledge, our observation is novel and therefore requires a new term to describe it.

      However, we do appreciate the concern raised by the reviewer. We used a more generic term “dynamics buffering” in contrast to the more specific “diffusion buffering” since we did not directly estimate diffusion behavior at the ‘single-molecule’ level. However, we already described what we mean by “dynamics buffering” in the text as follows.

      “We used condensates of similar size for our analysis (average ± 1 SD of diameter of condensates are 6.4 ± 1.7 mm (Fig. 5a) and 5.9 ± 0.4 mm (Fig. 5b)). Therefore, dynamics buffering here is likely to represent similar diffusion rates of S1 within condensates.”

      • *

      *The "N-terminus" is not 65% of the protein. One could define this as the N-terminal domain, but again there are two clear folded domains in the first 65% of the protein and this needs to be described better. *

      We revised the text to replace the terms “N-terminus” and “N-terminal domain” to “N-terminal fragment”.

      *The description of "stickers" and the references to tau and hnRNPA1 are confusing as this is a predominantly ordered domain while those are IDRs. *

      • *

      We feel this is important as it aids discussing our work in the context of current literature describing the mechanisms of macromolecular phase separation.

      The suggestion in the discussion that "P granule components support dynamics by participating in intermolecular interactions wth PGL-3-mEGFP molecules" is not well supported because no interaction assays are performed and no mutaitons are made that disrupt these interactions to test this.

      Indeed, we have not conducted interaction assays or mutational analysis to directly test this. However, our detailed analysis with the S1 mutant supports this suggestion.

      While partially unfolded PGL-3-mEGFP molecules lose 30% of a-helicity, the a-helicity of the S1 mutant is reduced by 15% compared to wild-type PGL-3. Data with S1 and partially unfolded PGL-3-mEGFP molecules show that loss of a-helicity correlates with slower diffusion of protein molecules within condensates. Using the mutants PGL-3 (K126E K129E) and S1 (K126E K129E), we show that diffusion rate of S1 molecules within condensates depend on inter-molecular interactions, and presence of other P granule components support faster diffusion rate of S1 molecules within condensates. Therefore, we feel it is safe to speculate that intermolecular interactions with P granule components can support dynamics of a “more unfolded” (compared to S1) version of PGL-3 molecule. * *

      *More detailed analysis of some of the claims: Claim 1: An a-helical region mediates the phase separation of PGL-3, and the C-terminal disordered region by itself does not phase separate. The N-terminal dimerization is essential for LLPS. The C-terminal IDR interactions with mRNA facilitate the LLPS. Comments: The authors show sufficient experimental data using microscopy and FRAP on truncated constructs with the N-terminal and C-terminal regions - but see above regarding how these are described - a proper domain structure with the folded domains shown and the RGG motifs highlighted should be added and integrated throughout the discussion. *

      In the revised version of the manuscript, we described the predicted PGL-3 domains within a paragraph in the introduction: “The interactions that support phase separation of the PGL-3 protein remains unclear. Structural studies on the orthologous PGL-1 protein revealed two dimerization domains. This raises the possibility that PGL-3 also contains similar dimerization domains, and phase separation depends on interactions involving these domains.”

      Our Fig. 1a already includes the schematic representation of PGL-3 with predicted N-terminal and Central Dimerization domains and RGG repeats.

      *They show that the N-terminus is necessary and adequate for LLPS, and the C-terminus by itself does not phase separate. But, how does the N-terminal domains phase separate? This is not explained - what are the interactions? *

      • *

      Also, a di-mutant (K126E K129E) that is known, and also authors use SEC-MALS to show their N-terminal construct is consistent with the published results. Disrupting the n-terminal dimerization prevents phase separation, suggesting the importance of these residues in the N-terminus for self-assembly and LLPS. The Microscopy data backs the claim that the mRNA-mediated LLPS is facilitated by binding with C-terminus. However, the m-RNA binding to IDR is not sufficient for LLPS. Yet, the authors do not explain how higher salt prevents phase separation - again the mechanism of phase separation is unclear. Is it multivalent interaction of the two dimerization domains? A basic model (that is tested) would be important.

      We agree with the reviewer that elucidating the detailed mechanism of phase separation of PGL-3 is an interesting direction for future investigation. However, we feel this is not required to support the main message of this manuscript.

      However, our manuscript already provides some relevant insights as follows.

      “To investigate the underlying mechanism further, we began by testing if the N-terminal α-helical region of PGL-3 can self-associate. Our analysis using size exclusion chromatography followed by multi-angle light scattering (SEC-MALS) showed that this PGL-3 fragment 1-452 forms a dimer (Supplementary Fig. 2f). Mutation of two residues (K126E K129E) have been shown to interfere with interactions among the N-termini of PGL-3 molecules (Aoki et al, 2021). We mutated these two residues within the full-length PGL-3 protein (K126E K129E) (Fig. 1a) and found that this mutant PGL-3 (K126E K129E) protein cannot phase separate even at high protein concentrations up to ~130 µM (Fig. 1b, c). Addition of mRNA does not trigger phase separation of this protein at physiological concentrations either (Fig. 2a, b). Taken together, our data is consistent with a model where association among folded N-termini of PGL-3 molecules is essential for phase separation.”

      A likely possibility is that phase separation of PGL-3 depends on electrostatic inter-molecular interactions among the folded N-terminal fragment of PGL-3 molecules. Therefore, high salt prevents phase separation.

      Are the tags removed to ensure that phase separation is not caused by tags or remaining linker regions? Is the protein purified to be without nucleic acid contamination or other purity metrics?

      Most of the experiments were done with only 5% of total protein tagged with 6x-His-mEGFP. No additional tags were present on the constructs. For recombinant expression and purification, proteins were cloned such that it is possible to remove the 6xHis-mEGFP tag following treatment with TEV protease. Following removal of the 6xHis-mEGFP tag, the residual linker is just two amino acid residues long. We used 100% tagged-protein for our experiments only in very few cases (indicated in the figure legends).

      To demonstrate purity of recombinant proteins, SDS-PAGE gels with all protein constructs used in this study are shown in Supplementary Fig. 1.

      To minimize contamination of nucleic acids, we treated samples with Benzonase during the course of purification.

      To assess the extent of nucleic acid contamination, the ratio of absorbance at 260 nm and 280 nm (A260/A280) was monitored. In exceptional cases with high A260/A280 values, we analyzed samples further by purifying RNA from the sample using RNA purification kit (Qiagen) and found that RNA represented 1% or less of the sample mass.* *

      Claim2: The N-terminal a-helical region modulates the dynamics within condensates. The IDR region has minimal effect on the fast dynamics of PGL-3. Comments: The authors show that the full-length PGL-3 condensates have modest influence of components by comparing the FRAP half times with or without the P granule components, including mRNA. However, have the authors tried this in the presence of mRNAs for the constructs lacking the IDRs as they have several RGG domains and bind with mRNA and are likely to change the dynamics.

      We thank the reviewer for this suggestion. However, this experiment is not essential to support the claim made in the context of homotypic condensates of PGL-3 : “The N-terminal a-helical region modulates the dynamics within condensates. The IDR region has minimal effect on the fast dynamics of PGL-3.”

      *The authors report the importance of the N-terminal a-helical region by making a construct that lacks/disrupts a part of the helices lowers the thermal stability and significantly lowers the dynamics of the condensates. Also unfolding of helices is shown to reduce the dynamics. One primary concern is whether these "rescued" protein dynamics imply protein functionality. *

      An assay of “functionality” e.g. an enzymatic activity of the PGL-3 protein is not available.

      However, we compared the fecundity of C. elegans worms expressing from the native pgl-3 locus, PGL-3-mEGFP or the mutant protein PGL-3(D425-452)-mEGFP, to assay the functionality of P granules in these strains. We found that worms of both genotypes produced similar number of offspring (Fig. 4d). This suggests that deletion of residues 425-452 of PGL-3 does not result in significant loss of function of P granules.

      Are these semi denatured proteins refolded in the presence of P-granule components?

      We feel that identifying the precise structural changes of the semi-denatured PGL-3 proteins within the condensate vs. dilute phase could be an interesting direction for future investigation.

      Finally, it is not clear why the authors chose to disrupt folding of the central dimerization domain?

      The manuscript included a paragraph to describe the rationale.

      “This suggests that interactions involving the disordered C-terminal region of PGL-3 are not essential for the fast dynamics within condensates. Therefore, we addressed the role of the N-terminal α-helical region (1-452) in driving dynamics. In order to avoid engineering mutations that result in significant misfolding of PGL-3 and concomitant loss of its ability to phase separate, we focused our mutational analysis close to the junction of the folded N-terminus and the disordered C-terminus of PGL-3. Surprisingly, we found that a full-length PGL-3 construct (D425-452) that lacks only 27 residues phase separates into condensates that are non-dynamic (Fig. 3a, c). Sequence analysis of the PGL-3 protein predicts that this region 425-452 spans two α-helices (one complete helix and fraction of a second helix) (Supplementary Fig. 3d). We generated a PGL-3 construct (hereafter called ‘S1’) (Fig. 3a) in which the sequence in the region, 425-452, is shuffled while keeping the overall amino acid composition unchanged. We found that S1 phase separates into condensates that are 20- fold less dynamic than with wild-type PGL-3 (Fig. 3d, Supplementary Fig. 3c).”

      Saying that "reduced alpha-helicity of PGL-3 correlates with slower dynamics in condensates" may be factual in these assays but "correlation" should be expanded upon to include mechanism and to me it seems that the statement should read "aggregation of PGL-3 causes slower dynamics in condensates" (both the partially destabilized mutant and the fully unfolded WT show similar effects perhaps to different degrees).

      We feel that identifying the precise structural changes of the semi-denatured PGL-3 proteins within the condensate vs. dilute phase could be an interesting direction for future investigation.

      We did not use the term "aggregation" since we did not detect aggregates of S1 molecules using fluorescence confocal microscopy.

      *CROSS-CONSULTATION COMMENTS I agree with the other reviewer's comments and critiques, I have concerns about the biological relevance and also the biophysical mechanisms. Reflecting on the other reviewers' comments, the papers could provide more depth in one or both of these areas to come to firm conclusions that are either revealing about PGL biology or elucidate a (possible) general biophysical mechanism. *

      In the revised version, we now include additional data which shows “dynamics buffering” in transgenic worms generated using CRISPR/Cas9 technology. Briefly, we used CRISPR/Cas9 to generate transgenic C. elegans which expresses PGL-3-mEGFP or PGL-3(D425-452)-mEGFP from the native pgl-3 locus. In vitro, wild-type PGL-3-mEGFP protein generates liquid-like condensates. On the other hand, the recombinantly purified PGL-3(D425-452)-mEGFP protein generates condensates that are non-dynamic. In contrast to these observations in vitro, both wild-type PGL-3-mEGFP and PGL-3(D425-452)-mEGFP show similar dynamics (half-time of FRAP recovery) within P granules in vivo.

      Reviewer #2 (Significance (Required)): *Hence, although the authors shows how inclusion of other components can alter the one protein component phase separation, this is done with entirely artificial means of destabilizing the fold of one of the domains which likely leads to aggregation. So the true impact of the work is hard to understand because the mutations impact on the basic biophysical properties of the domain (stability, interaction) are not completely characterized and the reason for disrupting this folding is not clear. *

      A major impact of our work is elucidation of a novel “dynamics buffering” property within biomolecular condensates in vitro. Our in vivo data is consistent with this finding.

      • *

      We have chosen two orthogonal ways of perturbing the PGL-3 protein (i.e. mutations and temperature-dependent unfolding) to assay the effect on diffusion rate against different levels of perturbation (e.g. 30% loss of a-helicity in heat-denatured PGL-3-mEGFP vs. 15% loss of a-helicity in the S1 mutant, compared to wild-type PGL-3). Studying the phase separation behavior of these “artificially-generated” constructs provided the understanding that dynamics of PGL-3 in condensates depends on inter-molecular interactions, and slower dynamics generally correlate with stronger inter-molecular interactions. Further, interactions among two or more P granule components can buffer against large change in dynamics / aggregation within the P granule phase. These insights may lay the groundwork for addressing how more “natural” modifications (e.g., post-translational modifications, high local concentration of “sticky” molecules) may influence dynamics within biomolecular condensates in vivo.

      Based on current knowledge of P granule composition, chaperone proteins (e.g. heat-shock family proteins) do not show abundant concentration within P granules. However, it is unclear if chaperone proteins are completely excluded from the P granule phase. Therefore, we speculate that weak interactions among two or more non-chaperone proteins contribute significantly to “dynamics buffering” within the P granule phase in vivo.

      In the discussion section of the manuscript, we had speculated that “dynamics buffering” may potentially explain observations reported in the nucleolus: “Similarly, interactions among components could be a potential mechanism of storage of misfolding-prone proteins in non-aggregated state within the liquid-like nucleolus under stress in vivo (Frottin et al, 2019).”

      Our finding is also relevant in the context of synthetic biology with applications that require steady diffusion rate of macromolecules during biochemical reactions within biomolecular condensates.

      • *

      My field of expertise is protein phase separation and protein structure. * *

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: P granules are liquid condensates found in the developing germlines and embryos of C. elegans. Prior work by the authors and others have established P granules as a tractable model to investigate the basic biophysical properties of liquid condensates. Much of the prior published work focused on specific P granule scaffold proteins, PGL-1 and PGL-3. How attributes of these PGL proteins and the effect of other P granule components affect condensate properties is not fully understood. Here, Jelenic, et al. probe the biophysical properties of PGL-3. Using recombinant protein, they show that an N-terminal, alpha-helical region of PGL-3 is sufficient for liquid condensate formation and that N-terminal assembly is required for this formation. Creation of a scrambled alpha-helical region in PGL-3 and heat treatment affects PGL-3 fluidity. This fluidity can be "rescued" in vivo and in vitro with the inclusion of other P granule factors, including wildtype PGL-3, PGL-1, GLH-1 and mRNA. The authors note an inverse correlation between fluidity and mutant PGL-3 fluorescent intensity. They propose a model that heterotypic compositions of condensates can buffer their fluidity against components with stronger multivalent interactions. *

      MAJOR: 1. PGL-3 is a fantastic model to study the biophysical properties of a liquid condensate. But as the authors address in their discussion, the S1 mutant will likely affect the central domain folding, at its minimum causing exposure of a hydrophobic surface not typically exposed in biology. These helices are found at the terminal portion of the domain determined in the crystal structure and as depicted in the authors' Figure 1A. While the cause of S1's enhanced molecular interactions does not affect the in vitro work presented in this manuscript, it does affect how the conclusions connect to the biological nature of P granules and liquid condensates more generally. *

      We have chosen two orthogonal ways of perturbing the PGL-3 protein (i.e. mutations and temperature-dependent unfolding) to assay the effect on diffusion rate against different levels of perturbation (e.g. 30% loss of a-helicity in heat-denatured PGL-3-mEGFP vs. 15% loss of a-helicity in the S1 mutant, compared to wild-type PGL-3). Studying the phase separation behavior of these “artificial” constructs provided the understanding that dynamics of PGL-3 in condensates depends on inter-molecular interactions, and slower dynamics generally correlate with stronger inter-molecular interactions. Further, interactions among two or more P granule components can buffer against large change in dynamics / aggregation within the P granule phase. These insights may lay the groundwork for addressing how more “natural” modifications (e.g., post-translational modifications, high local concentration of “sticky” molecules) may influence dynamics within biomolecular condensates in vivo.

      Based on current knowledge of P granule composition, chaperone proteins (e.g. heat-shock family proteins) do not show abundant concentration within P granules. However, it is unclear if chaperone proteins are completely excluded from the P granule phase. Therefore, we speculate that weak interactions among two or more non-chaperone proteins contribute significantly to “dynamics buffering” within the P granule phase in vivo.

      In the discussion section of the manuscript, we had speculated that “dynamics buffering” may potentially explain observations reported in the nucleolus: “Similarly, interactions among components could be a potential mechanism of storage of misfolding-prone proteins in non-aggregated state within the liquid-like nucleolus under stress in vivo (Frottin et al, 2019).”

      Our finding is also relevant in the context of synthetic biology with applications that require steady diffusion rate of macromolecules during biochemical reactions within biomolecular condensates.

      • Recombinant PGL-3 experiments added PGL-1, GLH-1 and mRNA simultaneously and measured fluidity. It will be interesting to know which components contribute to fluidity and whether fluidity enhancement of each component is dependent on one another. Addition experiments with each component should be included and/or at least discussed in the main text. *

      Our data with S1-mEGFP or PGL-3-mEGFP (pre-heated at 50°C) proteins microinjected into C. elegans gonads, and the transgenic strain expressing PGL-3(D425-452)-mEGFP from the pgl-3 locus showed that the P granule phase can support fast dynamics of these mutant PGL-3 constructs. Since P granules have a complex composition, one possibility is that fast dynamics of these constructs is supported by interactions involving many P granule components. We found that using only a limited set of P granule components (PGL-1, GLH-1 and mRNA) can buffer dynamics of S1 in condensates in vitro.

      In absence of a systematic analysis investigating the individual role of approx. 70 P granule proteins in buffering S1 dynamics in condensates in vitro, we have claimed in the text that dynamics-buffering of S1 in condensates is supported by interactions among two or more components. However, we do appreciate the reviewer’s comment and feel it would be interesting to investigate the contribution of individual P granule components towards fluidity in future studies. We have discussed this in the ‘Discussion’ section of the manuscript.

      • The biological relevance of PGL-1, GLH-1, and mRNA were not discussed in the main text. How these factors contribute to P granule assembly and function should be mentioned in the Introduction or Results. *

      To address this concern, we have added a paragraph in the Introduction section of the revised manuscript.

      *MINOR: 1. Line 20, "most non-membrane-bound compartments...have complex composition": Are there examples of condensates that do not have complex composition? *

      Not all non-membrane-bound compartments may have been characterized. To accommodate this possibility, we refrained from making a more general statement, but stated “most non-membrane-bound compartments…”.

      • Lines 40-43, RNA interactions driving LLPS: Please include citations from the Parker Lab (e.g. Van Treeck and Parker, Cell. 2018 doi: 10.1016/j.cell.2018.07.023) *

      We added the reference suggested by the reviewer.

      • *

      • Line 60, condensates contain hundreds of different proteins and RNA: Please cite at least a few examples of condensates with their components identified. *

      We added some references following suggestion by the reviewer.

      • Lines 82-84, PGL-3 drives assembly: Please cite Kawasaki, et al. Genetics 2004 for the discovery of PGL-3. *

      We added the reference suggested by the reviewer.

      • Lines 88-89, PGL-3 N-terminal fragment predominantly alpha-helical: The PGL domain structures should be cited here as supporting evidence that these regions are composed primarily of alpha helices (Aoki, et al 2016, 2021) *

      • *

      To address this concern, we have added a paragraph in the Introduction section of the revised manuscript.

      • Lines 158-159, driving forces for phase separation: This statement should be removed or expanded. The authors point regarding the protein concentrations is not clear here but clarified in the Discussion (Lines 691-693). Recommend removing due to its speculative nature. *

      We retained the speculative comment in the results section. We feel that this prepares the readers for the discussion later in the manuscript.

      • Lines 210: Add commas before and after "PGL-1 and GLH-1"*

      We addressed the reviewer’s suggestion.

      • Lines 218-219: add "and" instead of comma between PGL-1 and GLH-1 *

      We addressed the reviewer’s suggestion.

      • Lines 238-239, alpha-helices: The PGL CDD structure should also be referenced here (Aoki, et al 2016). *

      To address this concern, we have added a paragraph in the Introduction section of the revised manuscript.

      • Lines 680-682, MEG proteins: Please cite accordingly. *

      We added the reference suggested by the reviewer.

      • Lines 694-695, heterotypic interactions: Please cite Saha, et al. 2016. *

      We added the reference suggested by the reviewer.

      • Figure 1: Add space between 1 and mM DTT *

      We addressed the reviewer’s suggestion.

      • Figure 2b: Please provide statistics between condensate numbers. *

      We provide statistics between condensate numbers in Fig. 2b.

      • Figure 4A: The region of the germline imaged and analyzed should be mentioned in the caption or the main text. *

      We revised the Figure legend of Fig. 4a to address this issue.

      • Figure 4B,C: Please include statistics between the FRAP curves. *

      We have included statistics comparing FRAP curves in Supplementary Fig. 4a-c.

      • Figure 4D: It will be helpful to compare this curve to Figure S4A in the same graph. Please also include graph statistics. *

      We have revised Fig. 4 to address the reviewer’s suggestion.

      • Figure 5: The data points are difficult to resolve. Recommend use of color.*

      We considered the suggestion, but felt it works better in the original form.

      • Figure 6: This is a very general model that does not highlight the extensive experimental work performed by the authors. Recommend incorporating PGL-3, mutants and P granule factors into this model. *

      We thank the reviewer for appreciating our extensive work. However, we retained the original Fig. 6 for the sake of simplicity.

      • Methods, Line 939, C. elegans section: What worms were used? TH623? Please describe the genotype. *

      We have included a table listing the strains used in the study and their genotype. * CROSS-CONSULTATION COMMENTS While my review was arguably the more favorable of the three, I agree with the other reviewers' comments and evaluation, particularly with Reviewer #1. As written in my review, my primary concern was the biological relevance of the work.*

      Reviewer #3 (Significance (Required)):

      Overall, the in vitro work presented investigating the biophysical properties of this minimal P granule system was thorough and well-analyzed, and the manuscript was clearly written. Additional citations and statistics will improve the manuscript and the strength of the conclusions, respectively. The biological relevance of this study to P granule form and function in vivo, and to condensates in vivo, is debatable. This work will interest those who study condensate biology, the biophysics of protein-protein and protein-RNA interactions, and RNA biochemists more generally.

      A major impact of our work is elucidation of a novel “dynamics buffering” property within biomolecular condensates in vitro. Our in vivo data is consistent with this finding.

      We have chosen two orthogonal ways of perturbing the PGL-3 protein (i.e. mutations and temperature-dependent unfolding) to assay the effect on diffusion rate against different levels of perturbation (e.g. 30% loss of a-helicity in heat-denatured PGL-3-mEGFP vs. 15% loss of a-helicity in the S1 mutant, compared to wild-type PGL-3). Studying the phase separation behavior of these “artificially-generated” constructs provided the understanding that dynamics of PGL-3 in condensates depends on inter-molecular interactions, and slower dynamics generally correlate with stronger inter-molecular interactions. Further, interactions among two or more P granule components can buffer against large change in dynamics / aggregation within the P granule phase. These insights may lay the groundwork for addressing how more “natural” modifications (e.g., post-translational modifications, high local concentration of “sticky” molecules) may influence dynamics within biomolecular condensates in vivo.

      • *

      Based on current knowledge of P granule composition, chaperone proteins (e.g. heat-shock family proteins) do not show abundant concentration within P granules. However, it is unclear if chaperone proteins are completely excluded from the P granule phase. Therefore, we speculate that weak interactions among two or more non-chaperone proteins contribute significantly to “dynamics buffering” within the P granule phase in vivo.

      In the discussion section of the manuscript, we had speculated that “dynamics buffering” may potentially explain observations reported in the nucleolus: “Similarly, interactions among components could be a potential mechanism of storage of misfolding-prone proteins in non-aggregated state within the liquid-like nucleolus under stress in vivo (Frottin et al, 2019).”

      Our finding is also relevant in the context of synthetic biology with applications that require steady diffusion rate of macromolecules during biochemical reactions within biomolecular condensates.

      *I have expertise in P granules, protein/RNA biochemistry, condensate assembly, and C. elegans. *

      References

      Aoki ST, Kershner AM, Bingman CA, Wickens M & Kimble J (2016) PGL germ granule assembly protein is a base-specific, single-stranded RNase. Proceedings of the National Academy of Sciences of the United States of America

      Aoki ST, Lynch TR, Crittenden SL, Bingman CA, Wickens M & Kimble J (2021) C. elegans germ granules require both assembly and localized regulators for mRNA repression. Nat Commun 12: 996

      Cipriani PG, Bay O, Zinno J, Gutwein M, Gan HH, Mayya VK, Chung G, Chen J-X, Fahs H, Guan Y, et al (2021) Novel LOTUS-domain proteins are organizational hubs that recruit C. elegans Vasa to germ granules. Elife 10: e60833

      Frottin F, Schueder F, Tiwary S, Gupta R, Körner R, Schlichthaerle T, Cox J, Jungmann R, Hartl FU & Hipp MS (2019) The nucleolus functions as a phase-separated protein quality control compartment. Science 365: 342–347

      Kawasaki I, Amiri A, Fan Y, Meyer N, Dunkelbarger S, Motohashi T, Karashima T, Bossinger O & Strome S (2004) The PGL family proteins associate with germ granules and function redundantly in Caenorhabditis elegans germline development. Genetics 167: 645–661

      Kawasaki I, Shim YH, Kirchner J, Kaminker J, Wood WB & Strome S (1998) PGL-1, a predicted RNA-binding component of germ granules, is essential for fertility in C. elegans. Cell 94: 635–645

      Phillips CM & Updike DL (2022) Germ granules and gene regulation in the Caenorhabditis elegans germline. Genetics 220: iyab195

      Price IF, Hertz HL, Pastore B, Wagner J & Tang W (2021) Proximity labeling identifies LOTUS domain proteins that promote the formation of perinuclear germ granules in C. elegans. Elife 10: e72276

      Saha S, Weber CA, Nousch M, Adame-Arana O, Hoege C, Hein MY, Osborne Nishimura E, Mahamid J, Jahnel M, Jawerth L, et al (2016) Polar Positioning of Phase-Separated Liquid Compartments in Cells Regulated by an mRNA Competition Mechanism. Cell 166: 1572-1584.e16

      Spike C, Meyer N, Racen E, Orsborn A, Kirchner J, Kuznicki K, Yee C, Bennett K & Strome S (2008a) Genetic analysis of the Caenorhabditis elegans GLH family of P-granule proteins. Genetics 178: 1973–1987

      Spike CA, Bader J, Reinke V & Strome S (2008b) DEPS-1 promotes P-granule assembly and RNA interference in C. elegans germ cells. Development (Cambridge, England) 135: 983–993

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      P granules are liquid condensates found in the developing germlines and embryos of C. elegans. Prior work by the authors and others have established P granules as a tractable model to investigate the basic biophysical properties of liquid condensates. Much of the prior published work focused on specific P granule scaffold proteins, PGL-1 and PGL-3. How attributes of these PGL proteins and the effect of other P granule components affect condensate properties is not fully understood. Here, Jelenic, et al. probe the biophysical properties of PGL-3. Using recombinant protein, they show that an N-terminal, alpha-helical region of PGL-3 is sufficient for liquid condensate formation and that N-terminal assembly is required for this formation. Creation of a scrambled alpha-helical region in PGL-3 and heat treatment affects PGL-3 fluidity. This fluidity can be "rescued" in vivo and in vitro with the inclusion of other P granule factors, including wildtype PGL-3, PGL-1, GLH-1 and mRNA. The authors note an inverse correlation between fluidity and mutant PGL-3 fluorescent intensity. They propose a model that heterotypic compositions of condensates can buffer their fluidity against components with stronger multivalent interactions.

      MAJOR:

      1. PGL-3 is a fantastic model to study the biophysical properties of a liquid condensate. But as the authors address in their discussion, the S1 mutant will likely affect the central domain folding, at its minimum causing exposure of a hydrophobic surface not typically exposed in biology. These helices are found at the terminal portion of the domain determined in the crystal structure and as depicted in the authors' Figure 1A. While the cause of S1's enhanced molecular interactions does not affect the in vitro work presented in this manuscript, it does affect how the conclusions connect to the biological nature of P granules and liquid condensates more generally.
      2. Recombinant PGL-3 experiments added PGL-1, GLH-1 and mRNA simultaneously and measured fluidity. It will be interesting to know which components contribute to fluidity and whether fluidity enhancement of each component is dependent on one another. Addition experiments with each component should be included and/or at least discussed in the main text.
      3. The biological relevance of PGL-1, GLH-1, and mRNA were not discussed in the main text. How these factors contribute to P granule assembly and function should be mentioned in the Introduction or Results.

      MINOR:

      1. Line 20, "most non-membrane-bound compartments...have complex composition": Are there examples of condensates that do not have complex composition?
      2. Lines 40-43, RNA interactions driving LLPS: Please include citations from the Parker Lab (e.g. Van Treeck and Parker, Cell. 2018 doi: 10.1016/j.cell.2018.07.023)
      3. Line 60, condensates contain hundreds of different proteins and RNA: Please cite at least a few examples of condensates with their components identified.
      4. Lines 82-84, PGL-3 drives assembly: Please cite Kawasaki, et al. Genetics 2004 for the discovery of PGL-3.
      5. Lines 88-89, PGL-3 N-terminal fragment predominantly alpha-helical: The PGL domain structures should be cited here as supporting evidence that these regions are composed primarily of alpha helices (Aoki, et al 2016, 2021)
      6. Lines 158-159, driving forces for phase separation: This statement should be removed or expanded. The authors point regarding the protein concentrations is not clear here but clarified in the Discussion (Lines 691-693). Recommend removing due to its speculative nature.
      7. Lines 210: Add commas before and after "PGL-1 and GLH-1"
      8. Lines 218-219: add "and" instead of comma between PGL-1 and GLH-1
      9. Lines 238-239, alpha-helices: The PGL CDD structure should also be referenced here (Aoki, et al 2016).
      10. Lines 680-682, MEG proteins: Please cite accordingly.
      11. Lines 694-695, heterotypic interactions: Please cite Saha, et al. 2016.
      12. Figure 1: Add space between 1 and mM DTT
      13. Figure 2b: Please provide statistics between condensate numbers.
      14. Figure 4A: The region of the germline imaged and analyzed should be mentioned in the caption or the main text.
      15. Figure 4B,C: Please include statistics between the FRAP curves.
      16. Figure 4D: It will be helpful to compare this curve to Figure S4A in the same graph. Please also include graph statistics.
      17. Figure 5: The data points are difficult to resolve. Recommend use of color.
      18. Figure 6: This is a very general model that does not highlight the extensive experimental work performed by the authors. Recommend incorporating PGL-3, mutants and P granule factors into this model.
      19. Methods, Line 939, C. elegans section: What worms were used? TH623? Please describe the genotype.

      CROSS-CONSULTATION COMMENTS

      While my review was arguably the more favorable of the three, I agree with the other reviewers' comments and evaluation, particularly with Reviewer #1. As written in my review, my primary concern was the biological relevance of the work.

      Significance

      Overall, the in vitro work presented investigating the biophysical properties of this minimal P granule system was thorough and well-analyzed, and the manuscript was clearly written. Additional citations and statistics will improve the manuscript and the strength of the conclusions, respectively. The biological relevance of this study to P granule form and function in vivo, and to condensates in vivo, is debatable. This work will interest those who study condensate biology, the biophysics of protein-protein and protein-RNA interactions, and RNA biochemists more generally.

      I have expertise in P granules, protein/RNA biochemistry, condensate assembly, and C. elegans.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Jelenic et al. describe the effect of partner proteins on the FRAP dynamics of recombinant PGL-3 protein and variants in in vitro condensates and C elegans p-granules. The study shows that the N terminal a-helical dimerization domains is required for condensate formation and modulate of it alters aggregation and the FRAP dynamics of its condensates. Interestingly, a construct including the entire IDR region (370-693) by itself does not phase separate on its own at these conditions. The K126E K129E mutant (known previously to disrupt dimerization) and the deletion mutant abrogate llps. A mutant construct that shuffles the sequence in the region 423-453 called S1 here reduces the helicity and the condensate FRAP dynamics but recovered in the presence of a few P granule components. Also, the reduced dynamics of partially unfolded PGL-3 condensates are also rescued by the p-granule components to a certain degree of the unfolded PGL3 concentrations. This threshold concentration for recovering the condensate dynamics is further reduced in the helix reducing S1 mutant, which is also dependent on the number and the nature of P granule components.

      • Overall, the study aims to probe how "composition can buffer protein dynamics within liquid-like condensates" - yet several underlying aspects of the study do not fully support that conclusion. The introduction does not sufficiently introduce the known structural information of the two dimerization domains in C elegans PGL proteins for which structures are known. The region is discussed as "alpha helical" but really there are two evolutionarily conserved independently folding dimerization domains (referring to the mutants as "reduced alpha helicity" is not helpful - these are mutations that destabilize a folded domain). Additionally, the abstract and introduction ignore the aspects of aggregation (touched on in discussion) - this is likely what the disruption to the helical region in residue 450 region is doing (the helix is not on the dimer interface based on homology / sequence identity to the crystal structure of PGL-1 central dimerization domain. Finally, the "dynamics buffering" is not really clearly established and could also be explained as small concentrations of aggregated proteins act like clients while increasing the concentration results in aggregation and "cross linking" in the entire droplet - and this concentration is never achieved in the in worm experiments so it is not clear. In other words, the change in FRAP dynamics not observed in worms is perhaps not surprising if small amount of recombinant proteins are incorporated into the granules. It is also not clear what the mechanism of the changes is - is the protein driven to fold more properly (despite S1 disruption of its conserved sequence) inside the condensate? Does it still self interact and act as a dimerization domain? Does this change disrupt interactions? What is the real mechanism by which PGL-3 phase separates if not via the disordered domains?

      • Throughout the manuscript, the term "dynamics" is used to indicate FRAP, but it would be better to define what is meant (diffusion of PGL-3 in condensates) instead of using dynamics a term that could mean many things. Secondly, FRAP cannot directly measure liquidity etc (see recent critiques by McSwiggen elife 2019, etc) so it is better to be cautious in the claims. Finally, discussing "dyanmics buffering" adds more terminology where it is not needed - perhaps say "changes to diffusion of PGL-3 in condensates".

      • The "N-terminus" is not 65% of the protein. One could define this as the N-terminal domain, but again there are two clear folded domains in the first 65% of the protein and this needs to be described better.

      • The description of "stickers" and the references to tau and hnRNPA1 are confusing as this is a predominantly ordered domain while those are IDRs.

      • The suggestion in the discussion that "P granule components support dynamics by participating in intermolecular interactions wth PGL-3-mEGFP molecules" is not well supported because no interaction assays are performed and no mutaitons are made that disrupt these interactions to test this.

      More detailed analysis of some of the claims:

      Claim 1:

      An a-helical region mediates the phase separation of PGL-3, and the C-terminal disordered region by itself does not phase separate. The N-terminal dimerization is essential for LLPS. The C-terminal IDR interactions with mRNA facilitate the LLPS.

      Comments:

      The authors show sufficient experimental data using microscopy and FRAP on truncated constructs with the N-terminal and C-terminal regions - but see above regarding how these are described - a proper domain structure with the folded domains shown and the RGG motifs highlighted should be added and integrated throughout the discussion. They show that the N-terminus is necessary and adequate for LLPS, and the C-terminus by itself does not phase separate. But, how does the N-terminal domains phase separate? This is not explained - what are the interactions? Are the tags removed to ensure that phase separation is not caused by tags or remaining linker regions? Is the protein purified to be without nucleic acid contamination or other purity metrics? Also, a di-mutant (K126E K129E) that is known, and also authors use SEC-MALS to show their N-terminal construct is consistent with the published results. Disrupting the n-terminal dimerization prevents phase separation, suggesting the importance of these residues in the N-terminus for self-assembly and LLPS. The Microscopy data backs the claim that the mRNA-mediated LLPS is facilitated by binding with C-terminus. However, the m-RNA binding to IDR is not sufficient for LLPS. Yet, the authors do not explain how higher salt prevents phase separation - again the mechanism of phase separation is unclear. Is it multivalent interaction of the two dimerization domains? A basic model (that is tested) would be important.

      Claim 2:

      The N-terminal a-helical region modulates the dynamics within condensates. The IDR region has minimal effect on the fast dynamics of PGL-3.

      Comments:

      The authors show that the full-length PGL-3 condensates have modest influence of components by comparing the FRAP half times with or without the P granule components, including mRNA. However, have the authors tried this in the presence of mRNAs for the constructs lacking the IDRs as they have several RGG domains and bind with mRNA and are likely to change the dynamics. The authors report the importance of the N-terminal a-helical region by making a construct that lacks/disrupts a part of the helices lowers the thermal stability and significantly lowers the dynamics of the condensates. Also unfolding of helices is shown to reduce the dynamics. One primary concern is whether these "rescued" protein dynamics imply protein functionality. Are these semi denatured proteins refolded in the presence of P-granule components? Finally, it is not clear why the authors chose to disrupt folding of the central dimerization domain?

      Saying that "reduced alpha-helicity of PGL-3 correlates with slower dynamics in condensates" may be factual in these assays but "correlation" should be expanded upon to include mechanism and to me it seems that the statement should read "aggregation of PGL-3 causes slower dynamics in condensates" (both the partially destabilized mutant and the fully unfolded WT show similar effects perhaps to different degrees).

      CROSS-CONSULTATION COMMENTS

      I agree with the other reviewer's comments and critiques, I have concerns about the biological relevance and also the biophysical mechanisms. Reflecting on the other reviewers' comments, the papers could provide more depth in one or both of these areas to come to firm conclusions that are either revealing about PGL biology or elucidate a (possible) general biophysical mechanism.

      Significance

      Hence, although the authors shows how inclusion of other components can alter the one protein component phase separation, this is done with entirely artificial means of destabilizing the fold of one of the domains which likely leads to aggregation. So the true impact of the work is hard to understand because the mutations impact on the basic biophysical properties of the domain (stability, interaction) are not completely characterized and the reason for disrupting this folding is not clear.

      My field of expertise is protein phase separation and protein structure.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      • A significant criticism of the paper is an assumption that readers will be familiar with all of the findings in the author's previous 2016 paper and the PGL-1 papers by Aoki et al. Minimal context is given for each approach. Some conclusions are not well supported and require further analysis, proper controls, and more extensive descriptions of the experiments performed. Most importantly, the central conclusion and title of the paper is that composition can buffer the dynamics of individual proteins within liquid-like condensates. In other words, in vitro condensation assays often do not recapitulate LLPS behavior in vivo. That said, the findings in this study would be significantly strengthened and complemented by observing endogenously tagged PGL-3 and PGL-3 mutants in living worms, considering the efficiency of using CRISPR in C. elegans to insert tags and make precise mutations.

      • To improve readability, the introduction to P granules should be expanded, and include the reasons for looking at the nematode-specific PGL-3 protein among all the other known P granule proteins. A recap of previous findings on PGL-3 phase separation, in vivo and in vitro, is warranted, starting with the significant results of Saha et al 2016. Setting up the investigative questions in the context of recent work on PGL-1 (Aoki, et al) is also necessary.

      • The physiological concentration of PGL-3 should be more transparent, including why some experiments in this study are done at physiological concentrations while others are not. Describing why salt concentrations, crowding agents, and protein abundance are similar or different for each experiment is necessary and relevant. For example, after showing in Figure 1 that PGL-3 protein phase separates, the paragraph starting on line 161 says that it was previously shown that PGL-3 doesn't phase separate at physiological concentrations without RNA. One has to go back to Figure 1 to realize it was done differently than Figure 2 and Saha 2016. Statements in the same paragraph like "in contrast to full-length PGL-3, mRNA does not support phase separation..." should be qualified by stating the concentration observed, with or without salts or other crowding agents. Similarly, line 230 "suggests that interactions involving the disordered C-terminal region of PGL-3 are not essential for the fast dynamics" and should be qualified with "at non-physiological concentrations and with XX crowding agents or salt concentration." It would be more consistent if physiological concentrations were consistent from figure to figure, as extra variables weaken some of the stated conclusions.

      • The 2010 review reference stating that there are 40 P granule enriched proteins is outdated. More recent reviews put the number much higher. This is relevant because the approach to put PGL-3 in a more physiological environment by including just PGL-1, GLH-1 and mRNA with the condensate assays, out of ~100 P granule enriched proteins, may not be sufficient to conclude "that the influence of complex composition on dynamics is modest" (line 223), or imply that the multicomponent nature of the P granule is reconstituted by adding these components (line 355).

      • Figure 1C needs to include PGL-3(370-693) in the analysis. Figure 1E is also incomplete without a comparison of FRAP recovery between PGL-3(1-452) and full PGL-3 as the control.

      • Figure 4C is missing an essential control where PGL-3 and S1 FRAP is performed without PGL-1, GLH-1, and mRNA. It would also help show sup Fig4A in the main figure to show concentration dependence.

      • Consider adding subtitles to supplementary figures.

      • M&M should include an explanation for statistical analysis

      CROSS-CONSULTATION COMMENTS

      I am also in agreement with the comments and critiques of reviewers 2 and 3.

      Significance

      The paper by Saha and colleagues investigate the in vitro liquid-liquid phase separation propensity of a P granule protein PGL-3 and its structural domains. The findings largely replicate and support the phase-separation properties of a paralogous protein called PGL-1, as recently described by Aoki et al. 2021. Furthermore, they show that the dynamics demonstrated by recombinant PGL-3 may be maintained or buffered by the complex composition of P granules.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors have assembled an enormous amount of statistical data on the genomes and phylogeny of Arctic algae, including the genomes of four new species that they sequenced for this study. Their main finding is that horizontal gene transfer has led to convergent evolution in distantly related microalgae.

      **Major comments**

      Reviewer #1__: The purpose of the study is not clearly stated in the abstract or the introduction. The authors say (line 93) "Defining the genetic adaptations underpinning these small algal species is crucial as a baseline to understand their response to anthropogenic global change (Notz & Stroeve,2016)." Is this their goal? Or are they just quoting another study? The authors state (line 103) "We extend by sequencing the genomes of four distantly related microalgae...". This is not really a question or a hypothesis. I am sure the authors can provide a more compelling reason to embark on such a labor-intensive study.__

      Reply: We agree that the aim was lost in the details and the Introduction is now focused towards the original goal of the study, which was to investigate convergent evolution in a biogeographically isolated ocean. Additional references on the formation and history of the Arctic Basin have been added to the Introduction to provide context. “An ocean has been present at the pole since the beginning of the Cretaceous. Shaped by tectonic processes (Nikishin et al., 2021) the Arctic Ocean has been a relatively closed basin since the Masstrichtian at the end of the late Cretaceous epoch (ca. 70 million years before present), with episodic sea-ice cover since that time (Niezgodzki et al., 2019). This long history suggests limited gene flow from the global ocean over vast time scales and Arctic marine species including microalgae could well have unique adaptations to cold arctic conditions.” Line 78-83.

      And following this we provide a clear hypothesis “The potential for lineages of ancient Arctic origin and the episodic input of outside species led us to our hypothesis that Arctic microalgae convergently evolved traits or adaptations aiding survival in an ice-influenced ocean. Line 112-117.

      We also discuss both the adaptive and distinct physical environment of the Arctic, and its topographical separation from other ocean regions as dispersal limitation would enhance the Arctic-specific genomic signatures. We now cite the recent paper by Sommeria-Kline et al. (2020), which puts eukaryotic plankton biogeography into a global context (Line 72)

      Reveiwer #1__: The most prominent shared trait that the authors found are genes for ice-binding proteins. However, in view of their importance, little information is given about their different types and possible functions.__

      Reply: We appreciate the comment and have added information on relevant ice binding proteins found in the Arctic Algae. In addition, we discuss how the functional and secretory diversity of IBP would enhance the survivability of pelagic taxa. Lines 534 to 564.

      Although ice binding proteins from multicellular animals and plants are outside the scope of this study, there is a recent review; Bar Doley, Braslavsky and Davies 2016 Annual review of Biochemisty 85: 515-542.

      .

      Reviewer #1__: The HGT of ice-binding proteins is a major focus of this study, but little is said about what previous studies have said about this. What are the previous studies, what are their findings and how do the present findings contribute to this?__

      Reply: We agree that this aspect should have been more visible. We incorporated new data to characterize IBPs drawn from MMETSP transcriptomes, and environmental Tara Ocean metagenomes, as well as our Arctic strains. We note that as we take a PFAM-based approach, the IBPs treated are DUF3494/PF11999 domain, which are type 1 IBPs / algal IBPs (Raymond and Remia 2019). As an example of novelty, we identify the position of IBPs from dinoflagellates, within a larger Arctic Clade that included CCMP2293, CCMP2436 and CCMP2097 and Arctic TARA IBP, rendering this a pan-algal IBD clade.

      In addition, we were able to resolve the position of anomalous F. cylindrus IBP that fell between two Arctic associated clades (A and B, in our Fig 4). This finding is consistent with F. cylindrus originating in the Arctic as previously suggested and subsequently invading the Southern Ocean.

      The recurrent acquisition of multiple diverse IBP isoforms in individual species through HGT events has not been previously reported, and the extent of isoforms in the Arctic was surprising. See for example multiple different IBP forms with separate origins in Pavlovales CCMP2436 (Fig 4). The previous studies are referred to in the context of the phylogeny of the IBD within the results section: Lines 322- 413, and Lines 534-585.

      Reviewer #1: Figure 5 on HGT of ice-binding proteins is difficult to follow. It would be clearer if each panel could be described separately, clearly stating its main finding. I doubt that a reader could look at this figure and explain to a colleague what it shows.

      Reply: We have revised rearranged the figure (now Fig 4) with Arctic A, B, C and D clearly indicated as well as the two Antarctic dominated clades. The upper schematic includes the deepest phylogeny of algal IBDs to date, incorporating all of UniRef, MMETSP and TARA Oceans. The fasta files underlying the tree and the nexus file used are provided the S1 Data Folder, which is an excel folder with information on the analysis of the data. The callout and order of the clades has been revised to facilitate interpretation of the phylogenies more clearly. The entire section has been completely rewritten.

      Reviewer #1: This is also a problem with many of the other figures. For each figure, what is the question being asked and what is its take-home message?

      Reply: We agree that the message was lost and have now focused on our original question in our accepted proposal to JGI. “Is there a convergence among arctic microalgae at the genomic level?”. We found some genome properties were common among the Arctic isolates (more unknown PFAMS and several expanded PFAMs). The importance of ice binding proteins in Arctic Isolates and the widespread inter-algal HGT of this important protein among the Arctic strains. The IBP biogeography and phylogeny strongly indicate that the Arctic microalga have acquired IBP locally and that the Antarctic strains have acquired additional isoforms independently from Antarctic bacteria and fungi (Lines 565-585).

      Reviewer ____#1____: ____The paper has more data than a reader can absorb. It could be strengthened by reducing the number of figures, simplifying them if possible, and more clearly stating the value of the remaining figures.

      Reply. As suggested, we have refocused the paper, removing more speculative statistics based analysis and associated figures. The main conclusions are supported by the 5 main figures. We are now present 5 main figures and 11 supplementary figures (previously 23 downloadable supplementary figures and 40 on-line only figures supporting the support figures). We agree with the reviewer, and we feel the revised version is a more transparent synthesis. Briefly the Figures illustrate the following points. Fig. 1. The multigene tree of available algal genomes and transcriptomes provides a clear framework for judging the divergence of subsequent individual gene and PFAMs phylogenies. Fig. 2 (originally Fig. 3). Indicates the convergence of PFAM domains in the Arctic strains, in contrast to strains from elsewhere. Fig. 3 (originally Figure 4) shows Arctic specific expansions and contraction of PFAM domains, again demonstrating convergent evolution in the Arctic. The figure identifies specific PFAMs that contribute to the within-Arctic convergence. This figure is based on statistical methods independent of Fig 2. Figure 4 is the most extensive IBP phylogeny to date and has been discussed above. Figure 5, which was supplementary in our non-peer reviewed version, shows the biogeographic distribution of IBP, and can be compared to the distributions of the 18S rRNA genes from the four Arctic algae provided as supplementary (S6 Fig.)

      **Minor comments**Reviewer #1

      1. The figure citations are confusing. E.g., what does "Fig.1- Figure supplement 1" refer to? Does this refer to 1 or 2 figures? Apparently, it refers only to Fig. S1, so many readers will be confused when they look at Fig. 1.

      Reply: We apologize for the confusing format; the manuscript had been formatted for the online journal eLife. Our revision follows the more traditional style of PLoS Biology and other Review Commons journals.

      .

      Multiple citations should be in order of publication date, not alphabetical order.

      Reply ; We agree that date of publications is quite standard and recognizes priority of publication. Several on line journals no longer follow this rule and citation order will follow the specific style used by our accepting journal.

      Reviewer #1 (Significance (Required)): It is well known that useful genes tend to be shared among microorganisms. The present study strengthens previous studies in showing that gene transfer is an important process in polar regions.

      Reply: We thank the reviewer for recognizing the importance of our study.


      Reviewer #2 ____(Evidence, reproducibility, and clarity (Required)):

      This manuscript is the result of a large international collaborative effort, including the US Department of Energy Joint Genome Institute. Its focus is comparative genomics of eukaryotic Arctic algae. The primary data described in the ms are four new genome and transcriptome sequences from diverse Arctic algae, represented by a cryptomonad, a haptophyte, a chrysophyte, and a pelagophyte.

      The authors compare these new data to previously published genomic/transcriptomic data from eukaryotic algae with the goal of understanding genome evolution in the Artic. The results of the paper are a series large-scale comparative genomic bioinformatics analyses, including the associated statistical analyses. The key findings center on statistically significant features of Arctic genomes, features that stand out as compared to the genomes of algae that are not primarily found in the Arctic. Together, these findings allow the authors to make various hypotheses and suggestions about genetic adaptations to polar environments.

      By far the most significant finding is that the genomes of Arctic algae are enriched in genes encoding proteins with an ice-binding domain, paralleling findings from Antarctic algae. These genes appear to have spread among Arctic algal genomes via horizontal gene transfer, which raises a series of interesting questions. In my opinion, the major conclusions of this paper are supported by the data. Listed below are a few comments that may improve the ms:

      Reviewer #2.

      1) In today's post-genomics era, everyone seems to be sequencing nuclear genomes. Often what distinguishes high-impact and low-impact genome papers is the number of genomes presented and the quality of the genome assembly. I may have missed it, but reading the main text, the figures/tables, and the supplementary data I was not able to get a sense of the quality of the four genome assemblies from which the main findings are based. I was eventually able to find this information from PhycoCosm (note: some of the links to this site are not working in the ms). My quick scan of the PhycoCosm summary info for the four genomes indicates that the assemblies are highly fragmented, likely because they are based on short-read Illumina sequencing rather than a combination of short and long reads. I think it is important to briefly discuss (and or present) the quality of the assemblies in the ms and to highlight the potential limitations/drawbacks of employing highly fragmented assemblies when carrying out large-scale comparative genomics.

      Reply: We agree and the data concerning the genome quality assemblies has been moved to the main text Table 1. The comparison with other paired related strains is provided in an excel folder designated S2 Data Folder.

      Reviewer #2.

      2) Horizontal gene transfer is undeniably a major driving force in evolution, and one that has shaped genomic architecture across the Tree of Life. I believe the data presented here support a role for HGT in the genome of evolution of Arctic algae, particularly with respect to genes encoding proteins with an ice-binding domain. However, we can all think of numerous instances when authors of genome papers were too quick to point to HGT. Thus, I would urge more caution and balance when presenting the HGT data, including some discussion about factors that could incorrectly lead researchers to conclude a significant role for HGT, such as contamination, gene duplication, mis-assemblies, etc. I'm not suggesting that you change the main conclusions, but just tone down the language in places (e.g., "we reveal remarkable convergence in the coding content ... ").

      Reply: We understand the reviewers concerns and now more clearly outline the pipeline we have used to identify HGTs. This included: filtering each genome to remove all possible contaminant sequences first, considering both contig co-presence of vertical- and horizontally-derived genes, and reciprocal and independent annotations of gene sequences in both genome sequences and MMETSP transcriptomes. Retained genes were subjected to simultaneous BLAST analysis and manually curated phylogenies using decontaminated reference datasets. The most parsimonious explanation for our final IBP domain microbial algal clusters (Fig 4) is HGT. On the side of caution, we removed the entire section that identified potential arctic HGT based primarily on a less targeted broad statistical analysis. The focus is now on 3 genes that have clearly identifiable utility in the Arctic, were found to be enriched in Arctic genomes via a separate analysis and had homologs in the Tara Ocean Polar circle data. In addition, we describe more clearly the role of expansion and enrichment of PFAMs and the high proportion genes without an identifiable PFAMs in the Arctic strains as evidence for Arctic convergence separate from potential HGT.

      Reviewer #2.

      3) The downside of studying protists (as compared to multicellular animals, for instance) is that most are not widely known by the scientific community and even fewer scientists can picture what they actually look like (e.g., Pavlovales sp. CCMP2436). A few more details about the four Arctic algae that make up the focus of this paper might be helpful for the casual reader. My sense is that if at the next departmental meeting I asked my colleagues what a pelagophyte was most would look at me with a blank stare. Moreover, am I right to assume that all four algae are psychrotolerant rather than psychrophilic (Supplement Fig. 1 makes me think otherwise). It might be good to point out the difference in the text.

      Reply: High resolution images of each strain are available on the JGI home page for each alga, given the multiple figures we feel photos would not add information.

      Reviewer #2

      4) I don't think Supp. Table 1 (the Pan-algal dataset) got uploaded correctly during the manuscript submission stage. The first link I click on gives me Supp. Table 2.

      Reply: We apologize for this, the format was incorrect for the file designation and there were lost links. We now more actually refer to these as Data Folders as they are excel folders containing multiple sheets, All supplementary links will be verified again on final submission.

      .

      Reviewer #2 (Significance (Required)):

      By far the most significant finding from this paper is that the genomes of Arctic algae are enriched in genes encoding proteins with an ice-binding domain, paralleling findings from Antarctic algae. These genes appear to have spread among Arctic algal genomes via horizontal gene transfer, which raises a series of interesting questions. This is not the first paper to present these types of ideas, but it is arguably the broadest analysis yet, at least with respect to eukaryotic algae. This work will be of great interest to polar scientists, phycologists, protistologists, and the genomics community. I am genome scientist studying protists, including algae.

      Reply. We thank the reviewer for their insightful comments.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      This manuscript is focused on Arctic microalgae, an important yet understudied community in permanently cold ecosystems. By sequencing the genomes of four phylogenetically diverse and uncharacterized polar algae, the authors seek to elucidate genomic features and protein families that are similar in polar species (and differ from their relatives from temperate environments) This work used high-throughput genomic sequencing and computational analysis to demonstrate significant horizontal gene transfer (HGT) in several gene families, including ice-binding proteins. The authors suggest that this HGT is an effector of environmental adaptation to Arctic environments.

      **Major comments and experiment suggestions:**

      The authors conclude that HGT between arctic species is a driver of polar adaptation. The authors strongly support the claim that HGT is present more frequently in the polar algae examined here. Whether this is adaptive should be further explored though. For instance, ice-binding domains were one PFAM group found at significantly higher frequencies in the polar species - but are all of these species associated with ice? What would be the benefit of IBDs in an alga that is found in the open ocean. Similar with the other domains (Lns 333-335), its not clear whether these are truly adaptive features. ____This is more speculative.

      Reply: We agree that detail was lacking and have considerably expanded our introduction on the character of the Arctic Ocean and have stated the goals and underlying hypothesis. Briefly, all surface water organisms that live in the Arctic encounter ice during the year as the ocean freezes in winter, and surface waters reman around negative 1.7 °C for much of the year. This information has been added to the introduction. We have also expanded the discussion on the multiple effects of different IBPs that would be ecologically beneficial for plankton as well as ice-algae and cite relevant experimental studies and reviews.

      Reviewer #3) ____HGT was a major conclusion of this study, putting this in a wider perspective would strengthen the conclusion, especially in the context of HGT from prokaryotes. Are there insights on whether IBDs are present in Arctic prokaryotes?

      Reply: This is a good question, and we now point out that there were 91 Arctic bacterial and archaeal IBP sequences in our comparative dataset. In contrast to the Antarctic clades, none were closely related to the Arctic strain IBPs (Fig 4). Line 336.

      Reviewer #3) ____The data obtained from the genomic works supports the conclusions stronger that ones from transcriptomes, where what genes/domains are present would depend largely on the sampling conditions. This should be emphasized.

      Reply: The main rational for using transcriptomes was that more of these are available and enabled us to detect convergences and HGT across a broader taxonomic range than would be possible with genome-only data, where we had access to a total of only 21 microalgal genomes. In general transcriptome studies are aimed at identifying responses under different conditions and rely on comparative expression data, usually 2-fold differences in up or down expression under different growth conditions, see for example Freyria et al. 2022 (Communications Biology). Unlike a transcriptome expression study, our data mining detected any (constitutive or regulated) expression in these unicellular haploid cells, we would have detected genes used under any condition that an algal happened to be growing. IBD was not detected in any of the temperate genomes, and only detected in transcriptomes of Arctic and Arctic-Boreal groups. However, we agree that there may be some limitation of transcriptomes only studies and mention this. Lines 522-528.

      Reviewer #3) ____An experiment to determine whether the species are cold extremophiles (psychrophiles) would be useful here to strongly support the data in Figure 1. The authors state that their species can not survive >6C but this is based on experiments done on older studies. Considering the cultures have been maintained as a continuous culture for decades, confirming that they still have psychrophilic characteristic would be useful. This is a straightforward and low cost experiment that requires simply measuring growth rates at several temperatures to define the optimal and confirm that the cells are not viable above 6C.

      Reply: These are interesting points, and the broad “background” statements in the original manuscript would require a separate study,and have been deleted. Temperature tolerance experiments are not so simple for cold adapted algae with slow growth rates. Such experiments require specialized incubators to maintain low temperatures. Temperature experiments have been carried out on the cultures in the context of other studies, see for example, Daugberg et al. 2018, J. Phycol. But this is not within the scope of the present study.

      We now restrict our conclusions to the specific question of convergence among Arctic strains. We apologize for the misunderstanding on the history of the cultures. They have not been in “continuous culture” but are cryopreserved. We now simply indicate that they grow below 6 °C, which is sufficient to assume that they are likely cryophiles, our experience is that they do not grow well or at all at higher temperatures, our efforts have been to maintain the cultures that are otherwise easily lost. We now make no claims about optimality or limits. Here we simply examined genomes and available transcriptomes that were generated from algae growing at 4-6 °C.

      Reviewer #3) ____**Minor comments:**

      Defining the species used here as psychrophiles would put the study in context better. The authors relate their finding to Antarctic species (HGT, ice-binding domains, large genomes) all of which are confirmed psychrophiles.

      Reply: The temperature definition of psychrophiles is surprisingly high (optimal growth below 15 °C) and this definition of psychrophiles is now given in the introduction. The point is really that there are few isolates from cold surface waters that have been well studied. We now add. “A handful of polar algal genomes have been extensively studied, with 4 of these from around Antarctica and classified as psychrophiles (not being able to grow above 15 °C (Feller & Gerday, 2003)”. Lines 103-107.

      Reviewer #3) ____A short rationale on why these species at all would be useful - are they representative of their classes? Do they have psychrophilic characteristics that might make them useful models in the future? Are they widely used now?

      Reply: We appreciate the point as the definition of utility in discovery-based science is an open dialog.

      We agree that the study requires context and have added our rational for selecting the species for genome sequencing to the introduction. “To address questions on genetic adaptations to this ice-influenced environment, we sequenced 4 phylogenetically divergent microalgae, from 4 algal classes belonging to 3 algal phyla: Cryptophyceae (Cryptophyta), Pavlovophyceae (Haptophyta), Chrysophyceae and Pelagophyceae (both in the Ochrophyta) isolated from the ca. 77 °N, where surface ice flow persists through June (Mei et al., 2002). The four isolates were selected as representatives of different water and ice conditions and phylogeny from available strains collected in April and June 1998 during the North Water Polynya study”.

      Reviewer #3) ____Starting algal cultures were maintained in a continuous culture since 1998 and under continuous light since at least 2015, have the authors confirmed that these algae retain their physiological features even after this long time? The accumulation of mutations is a possibility here.

      Reply: We apologize for the misunderstanding of the timeline; the history of the cultures was not given in the manuscript and the inferred history is not quite correct. The 2015 date was the year of publication for the MMETSP data. Our continuous light statement is a record of our standard culture conditions. We now elaborate on the material used in the current study. The cultures were deposited in the Bigelow culture collection (now NCMA) in 2002 and cryopreserved once they had been verified and given a culture designation. We obtained fresh cultures in 2005 and these were used for the MMETSP project. We obtained fresh cultures again in 2011, specifically for the JGI genome project. These algae do not grow fast and most of the DNA was sent to JGI in 2012 for most of the isolates. This history is rather long and not relevant, since one would speculate that over the years the algae would tend to lose the ice associated functionality, e.g. they were not frozen in seawater every year for 4 to 6 months or subject to sudden freshwater exposure, when ice melts. We would encourage other researchers to order the cultures and run experiments. We note that many of the 40 or so algae isolated from the same campaign have been used by others for specific studies and at least 8 are in the MMETSP data set. The presence of 18S rRNA and phylogenetic position of the IBP sequences compared to Tara Arctic circle data confirms long-term Arctic presence of each species and the IBP domains in the Arctic without marked changes over the last 20 years.

      Reviewer #3) ____Ln381 - The culture collection IDs for each sequenced species should be included here

      Reply: we have added the culture IDs throughout.

      Reviewer #3) ____Ln. 389 - Algal cells are harvested and used for nucleic acid extraction, the nucleic acids themselves are not harvested

      Reply: we agree and corrected the wording

      Reviewer #3 (Significance (Required)):

      This study is well places in the current state of research on polar alga and represents a significant and very valuable addition to the current knowledge pool. Algae in general are lagging behind other groups of photosynthetic organisms in the number of sequenced and analyzed genomes, despite algae being one of the main primary producers globally. This is even more strongly felt in polar research, where only 4 species have been sequenced, most of which are restricted to Antarctica. There is a true gap in our knowledge when it comes to Arctic species, and this study fills this gap. As the authors correctly state, we need more knowledge on polar environments and the primary producers that support these important ecosystems in light of current climate change trends.

      Reply: we appreciate the succinct summary of our study and thank the reviewer for insights and suggestions that have improved the manuscript.

      Reviewer field of expertise: Polar algae, stress responses, plant and algal energetics, cell signalling

      Reply: We appreciate the incites and perspective steming from the reviewer's expertise.

      Relevant key references cited in the reply:

      Daugbjerg N, Norlin A, Lovejoy C. Baffinella frigidus gen. et sp. nov. (Baffinellaceae fam. nov., Cryptophyceae) from Baffin Bay: Morphology, pigment profile, phylogeny, and growth rate response to three abiotic factors. Journal of Phycology. 2018;54(5):665-80

      Feller, G. and Gerday, C. (2003) Psychrophilic enzymes: Hot topics in cold adaptation. Nat Rev Microbiol, 1, 200-208.

      Freyria NJ, Kuo A, Chovatia M, Johnson J, Lipzen A, Barry KW, et al. Salinity tolerance mechanisms of an Arctic Pelagophyte using comparative transcriptomic and gene expression analysis. Communications Biology. 2022;5(1). doi: 10.1038/s42003-022-03461-2

      Mei, Z. P., Legendre, L., Gratton, Y., Tremblay, J. E., Leblanc, B., Mundy, C. J., Klein, B., Gosselin, M., Larouche, P., Papakyriakou, T. N., Lovejoy, C. and Von Quillfeldt, C. H. (2002) Physical control of spring-summer phytoplankton dynamics in the North Water, April-July 1998. Deep-Sea Research Part Ii-Topical Studies in Oceanography, 49, 4959-4982.

      Niezgodzki, I., Tyszka, J., Knorr, G. and Lohmann, G. (2019) Was the Arctic Ocean ice free during the latest Cretaceous? The role of CO2 and gateway configurations. Global and Planetary Change, 177, 201-212.

      Nikishin, A. M., Petrov, E. I., Cloetingh, S., Freiman, S. I., Malyshev, N. A., Morozov, A. F., Posamentier, H. W., Verzhbitsky, V. E., Zhukov, N. N. and Startseva, K. (2021) Arctic Ocean Mega Project: Paper 3-Mesozoic to Cenozoic geological evolution. Earth-Science Reviews, 217.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This manuscript is focused on Arctic microalgae, an important yet understudied community in permanently cold ecosystems. By sequencing the genomes of four phylogenetically diverse and uncharacterized polar algae, the authors seek to elucidate genomic features and protein families that are similar in polar species (and differ from their relatives from temperate environments) This work used high-throughput genomic sequencing and computational analysis to demonstrate significant horizontal gene transfer (HGT) in several gene families, including ice-binding proteins. The authors suggest that this HGT is an effector of environmental adaptation to Arctic environments.

      Major comments and experiment suggestions:

      • The authors conclude that HGT between arctic species is a driver of polar adaptation. The authors strongly support the claim that HGT is present more frequently in the polar algae examined here. Whether this is adaptive should be further explored though. For instance, ice-binding domains were one PFAM group found at significantly higher frequencies in the polar species - but are all of these species associated with ice? What would be the benefit of IBDs in an alga that is found in the open ocean. Similar with the other domains (Lns 333-335), its not clear whether these are truly adaptive features. This is more speculative.
      • HGT was a major conclusion of this study, putting this in a wider perspective would strengthen the conclusion, especially in the context of HGT from prokaryotes. Are there insights on whether IBDs are present in Arctic prokaryotes?
      • The data obtained from the genomic works supports the conclusions stronger that ones from transcriptomes, where what genes/domains are present would depend largely on the sampling conditions. This should be emphasized.
      • An experiment to determine whether the species are cold extremophiles (psychrophiles) would be useful here to strongly support the data in Figure 1. The authors state that their species can not survive >6C but this is based on experiments done on older studies. Considering the cultures have been maintained as a continuous culture for decades, confirming that they still have psychrophilic characteristic would be useful. This is a straightforward and low cost experiment that requires simply measuring growth rates at several temperatures to define the optimal and confirm that the cells are not viable above 6C.

      Minor comments:

      • Defining the species used here as psychrophiles would put the study in context better. The authors relate their finding to Antarctic species (HGT, ice-binding domains, large genomes) all of which are confirmed psychrophiles.
      • A short rationale on why these species at all would be useful - are they representative of their classes? Do they have psychrophilic characteristics that might make them useful models in the future? Are they widely used now?
      • Starting algal cultures were maintained in a continuous culture since 1998 and under continuous light since at least 2015, have the authors confirmed that these algae retain their physiological features even after this long time? The accumulation of mutations is a possibility here.
      • Ln381 - The culture collection IDs for each sequenced species should be included here
      • Ln. 389 - Algal cells are harvested and used for nucleic acid extraction, the nucleic acids themselves are not harvested

      Significance

      This study is well places in the current state of research on polar alga and represents a significant and very valuable addition to the current knowledge pool. Algae in general are lagging behind other groups of photosynthetic organisms in the number of sequenced and analyzed genomes, despite algae being one of the main primary producers globally. This is even more strongly felt in polar research, where only 4 species have been sequenced, most of which are restricted to Antarctica. There is a true gap in our knowledge when it comes to Arctic species, and this study fills this gap. As the authors correctly state, we need more knowledge on polar environments and the primary producers that support these important ecosystems in light of current climate change trends.

      Review field of expertise: Polar algae, stress responses, plant and algal energetics, cell signalling

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript is the result of a large international collaborative effort, including the US Department of Energy Joint Genome Institute. Its focus is comparative genomics of eukaryotic Arctic algae. The primary data described in the ms are four new genome and transcriptome sequences from diverse Arctic algae, represented by a cryptomonad, a haptophyte, a chrysophyte, and a pelagophyte.

      The authors compare these new data to previously published genomic/transcriptomic data from eukaryotic algae with the goal of understanding genome evolution in the Artic. The results of the paper are a series large-scale comparative genomic bioinformatics analyses, including the associated statistical analyses. The key findings center on statistically significant features of Arctic genomes, features that stand out as compared to the genomes of algae that are not primarily found in the Arctic. Together, these findings allow the authors to make various hypotheses and suggestions about genetic adaptations to polar environments.

      By far the most significant finding is that the genomes of Arctic algae are enriched in genes encoding proteins with an ice-binding domain, paralleling findings from Antarctic algae. These genes appear to have spread among Arctic algal genomes via horizontal gene transfer, which raises a series of interesting questions. In my opinion, the major conclusions of this paper are supported by the data. Listed below are a few comments that may improve the ms:

      1) In today's post-genomics era, everyone seems to be sequencing nuclear genomes. Often what distinguishes high-impact and low-impact genome papers is the number of genomes presented and the quality of the genome assembly. I may have missed it, but reading the main text, the figures/tables, and the supplementary data I was not able to get a sense of the quality of the four genome assemblies from which the main findings are based. I was eventually able to find this information from PhycoCosm (note: some of the links to this site are not working in the ms). My quick scan of the PhycoCosm summary info for the four genomes indicates that the assemblies are highly fragmented, likely because they are based on short-read Illumina sequencing rather than a combination of short and long reads. I think it is important to briefly discuss (and or present) the quality of the assemblies in the ms and to highlight the potential limitations/drawbacks of employing highly fragmented assemblies when carrying out large-scale comparative genomics.

      2) Horizontal gene transfer is undeniably a major driving force in evolution, and one that has shaped genomic architecture across the Tree of Life. I believe the data presented here support a role for HGT in the genome of evolution of Arctic algae, particularly with respect to genes encoding proteins with an ice-binding domain. However, we can all think of numerous instances when authors of genome papers were too quick to point to HGT. Thus, I would urge more caution and balance when presenting the HGT data, including some discussion about factors that could incorrectly lead researchers to conclude a significant role for HGT, such as contamination, gene duplication, mis-assemblies, etc. I'm not suggesting that you change the main conclusions, but just tone down the language in places (e.g., "we reveal remarkable convergence in the coding content ... ").

      3) The downside of studying protists (as compared to multicellular animals, for instance) is that most are not widely known by the scientific community and even fewer scientists can picture what they actually look like (e.g., Pavlovales sp. CCMP2436). A few more details about the four Arctic algae that make up the focus of this paper might be helpful for the casual reader. My sense is that if at the next departmental meeting I asked my colleagues what a pelagophyte was most would look at me with a blank stare. Moreover, am I right to assume that all four algae are psychrotolerant rather than psychrophilic (Supplement Fig. 1 makes me think otherwise). It might be good to point out the difference in the text.

      4) I don't think Supp. Table 1 (the Pan-algal dataset) got uploaded correctly during the manuscript submission stage. The first link I click on gives me Supp. Table 2.

      Significance

      By far the most significant finding from this paper is that the genomes of Arctic algae are enriched in genes encoding proteins with an ice-binding domain, paralleling findings from Antarctic algae. These genes appear to have spread among Arctic algal genomes via horizontal gene transfer, which raises a series of interesting questions. This is not the first paper to present these types of ideas, but it is arguably the broadest analysis yet, at least with respect to eukaryotic algae. This work will be of great interest to polar scientists, phycologists, protistologists, and the genomics community. I am genome scientist studying protists, including algae.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors have assembled an enormous amount of statistical data on the genomes and phylogeny of Arctic algae, including the genomes of four new species that they sequenced for this study. Their main finding is that horizontal gene transfer has led to convergent evolution in distantly related microalgae.

      Major comments

      1. The purpose of the study is not clearly stated in the abstract or the introduction. The authors say (line 93) "Defining the genetic adaptations underpinning these small algal species is crucial as a baseline to understand their response to anthropogenic global change (Notz & Stroeve,2016)." Is this their goal? Or are they just quoting another study? The authors state (line 103) "We extend <previous findings> by sequencing the genomes of four distantly related microalgae...". This is not really a question or a hypothesis. I am sure the authors can provide a more compelling reason to embark on such a labor-intensive study.
      2. The most prominent shared trait that the authors found are genes for ice-binding proteins. However, in view of their importance, little information is given about their different types and possible functions.
      3. The HGT of ice-binding proteins is a major focus of this study, but little is said about what previous studies have said about this. What are the previous studies, what are their findings and how do the present findings contribute to this?
      4. Figure 5 on HGT of ice-binding proteins is difficult to follow. It would be clearer if each panel could be described separately, clearly stating its main finding. I doubt that a reader could look at this figure and explain to a colleague what it shows.
      5. This is also a problem with many of the other figures. For each figure, what is the question being asked and what is its take-home message?
      6. The paper has more data than a reader can absorb. It could be strengthened by reducing the number of figures, simplifying them if possible, and more clearly stating the value of the remaining figures.

      Minor comments

      1. The figure citations are confusing. E.g., what does "Fig.1- Figure supplement 1" refer to? Does this refer to 1 or 2 figures? Apparently, it refers only to Fig. S1, so many readers will be confused when they look at Fig. 1.
      2. Multiple citations should be in order of publication date, not alphabetical order.

      Significance

      It is well known that useful genes tend to be shared among microorganisms. The present study strengthens previous studies in showing that gene transfer is an important process in polar regions.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We are very grateful about the thorough reading and deep understanding of the work that these 4 reviewers have provided. Although it is evident that they still request an improved profiling of some aspects, it is very encouraging that all four think the work is very interesting, original, insightful and adds a new layer of knowledge to the regulation of DNA damage sensing and repair. It is also very rewarding that the four reviewers estimate that this work will sew connections between different fields and interest a broad readership. This is why we have designed here a very deep revision, tailored to satisfy all the raised concerns except one, and this just for technical reasons.

      Please find below the original reviewers’ comments and our answers to them preceded by the symbol “>”:

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): Ovejero et al. report an increase in lipid droplet (LD) abundance after long (from 120' on) exposure of budding yeast cells to DNA damaging agents zeocin and camptothecin (CPT). Next, they analyze DNA damage signaling in yeast mutants that impair triacylglycerol (TAGs) or sterol (STEs) esterification. They observe a slight anticipation in Rad53/CHK2 phosphorylation (indicative of DDR signaling) in yeast stem mutants, as well as in yeast cells or human cells lines pre-treated with oleate upon zeocin treatment. Yeast stem mutants are sensitive to zeocin and captothecin, but only confer sensitivity to hydroxyurea upon combination with tagD mutations. Authors relate these phenotypes to a somewhat decreases DSB resection in yeh2D mutants (expected to have reduced steryl esters pools) and RPA-foci in steD yeast cells. Next, a reduction in single strand annealing recombination repair events upon zeocin treatment is reported using a genetic reporter in steD mutants and oleate-treated cells. From these data they conclude that inability to process sterols in response to DSBs leads to an exacerbated DDR and prevents DNA repair. Next, it is shown that Flag-tagged Tel1 distinctly interacts with mono-phosphate phosphoinositides, including PI(4)P. An interaction in vivo is also inferred through Proximity Ligation Assays (PLA) using anti-PI(4)P and anti-ATM antibodies in human cell lines, which was moderately downregulated upon treatment with MMS or zeocin. Over-expression of the Osh4/OSBP1 transporter, which consumes PI(4)P, increased the number of Tel1 (nuclear) foci upon zeocin treatment. Conversely Sac1 ablation, in which accumulation of PI(4)P is expected, abrogated nuclear Tel1 foci formation and reduced telomere length (a phenotype related to lack of Tel1 function). From these results authors conclude that Tel1 availability in the nucleus is influenced by PI(4)P availability. Lastly, treatment with an OSBP1 inhibitor led to a cell line and damaging agent -variable reduction of ATM phosphorylation and a mostly non-significant reduction of DNA resection, measured by native BrdU detection, in response to CPT treatment. Overall, authors conclude that i) biding of Tel1/ATM to PI(4)P modulates its functional availability in the nucleus, and that ii) DNA damage elicits the esterification and storage of sterols toward LDs, which contributes to tritate Tel1/ATM away from the nucleus dampening the DDR and affecting long-range resection.

      Major comments: While the conclusion that Tel1/ATM binds PI(4)P and this interaction modulates Tel1/ATM functional availability at the nucleus is convincing, the conclusion that DSBs elicit a change in the metabolism of this lipid to "control" Tel1/ATM function is not demonstrated. The notion that sterol processing occurs in response to DSBs is not sufficiently supported by the data presented, as the increase in LD numbers is observed much after activation of the DDR (Rad53 phosphorylation) in Zeozin-treated yeast cells.

      We are afraid that we have not been clear enough in explaining the kinetics giving rise to our model. As indicated by the reviewer, our work shows, through kinetic studies, that the storage of sterols within LD occurs at later stages than the activation of the DDR by Tel1 and Rad53 phosphorylation. Tel1 foci decline is necessary for subsequent engagement of downstream DNA long-range resection. Since we propose that sterol storage within LD is a means to attenuate Tel1 engagement at DSBs, it is thus logical (and thus compatible with the data we show) that LD number increase occurs simultaneously with Tel1 foci decrease, at late stages of the reactionWe will include this explanation and graph in the revised version of the work.

      In addition, evidence is not provided on the mechanisms by which PI(4)P metabolism would be controlled, which would be expected to be DDR-independent as they are placed upstream of this signaling pathway in the author's model.

      The key mechanism through which, in the end, PI(4)P metabolism will be controlled, is the esterification of sterols within LD. Given that, as clarified above, LD formation in response to DSBs occurs “late” (i.e., after 120 min), it is not excluded that the DDR itself can instruct, through phosphorylation of some effector(s), LD formation. In other words, by ordering LD formation, the DDR would be launching a self-limiting mechanism. In support, we now know, although we do not show in this work, that eliminating key DDR proteins prevents the formation of LD in response to DNA damage. Because of this, we have undertaken an educated-guess approach and chosen critical or rate-limiting enzymes in LD biology either possessing an S/T-Q cluster domain (predicted to be a phosphorylation substrate for the DNA Damage Response kinases (1), and/or retrieved in phospho-proteomic screens as specific DDR targets (2,3). This adds up to 28 proteins in S. cerevisiae and 45 proteins in Homo sapiens. Importantly, the emergent candidates fall into two identical categories in both organisms. To provide initial support for their pertinence, we have generated a point mutant in the putative S/T-Q cluster of one of the yeast candidates. Of high relevance, we find that the concerned mutant is impaired in correctly triggering LD formation in response to DNA damage, and we have now obtained a specific funding to pursue this characterization that, as such, constitutes a different work from the one presented in this manuscript. We hope that the reviewer is now convinced yet that she/he agrees in keeping this information for subsequent manuscript(s).

      The damaging agents used have been suggested to alter the redox metabolism and even lipid peroxidation (Kitanovic 2009, Mizumoto 1993, Krol 2015, Todorova 2015, Ren 2019, Singh 2014). Hence it is possible that PI(4)P changes are not due to DSBs, but an indirect though relevant effect. In absence of direct evidence supporting an active regulation of PI(4)P dynamics in response to DNA breaks, this conclusion remains speculative and this should be noted in the manuscript.

      We fully agree with the reviewer that the used genotoxins are triggering a myriad of effects which could elicit the same phenomenon by indirect means. Yet, we want to stress that the use of camptothecin, which elicits a very robust LD formation phenotype (Figure 1C), is very likely specific, as it is proven as a potent and direct trapper of Top1 onto DNA after having cleaved it. Nevertheless, we propose in the next paragraph two specific experiments to dismiss this problem, please see immediately below.

      Authors conclude that LD is specific to DSB induction. This seems an overstatement as they just reported LD increases in response to two agents that also induce other kinds of DNA damage. To also strengthen the link between DSBs and PI(4)P modulation of Tel1 function, authors should analyze LD numbers, Rad53 phosphorylation and Tel1 nuclear re-localization in response to HO-induced DNA breaks (e.g., using the system employed in Figure 3C).

      We humbly think that enzymatically-induced DNA breaks will both activate Rad53 phosphorylation and Tel1 nuclear concentration, as this has already been established, thus requiring no further exploration. Yet, it is very important to assess the reviewer’s suggestion concerning whether enzymatically-induced DNA breaks also trigger the formation of LD. To this end, we will perform two complementary studies in which, instead of using HO, which cuts only a few times in the genome, we will:

      1. a) exploit the naturally DSB-accumulating mutant rad3-102, which we previously characterized in the past (4), and which we already exploit in this work for recombination analyses (Figure S4A), to evaluate whether it endogenously harbors more LD in comparison with the WT.
      2. b) we have recently created a tool in which gRNAs targeted to different subsets of transposons in the genome can drive Cas9 to create DSB in a dose-dependent manner ((9), under revision in Genetics). We will use this system to monitor the LD formation in response to Cas9-triggered cuts. In addition, on figure 5A, significant differences in GFP-Tel1 foci abundance between WT and steD or yeh2D cells are only observed after 210', way after the slight effect on Rad53 phosphorylation is observed. This is at odds with the conclusion that Tel1 association to STEs modulates DDR signaling.

      We are afraid that we have not been clear enough in explaining the kinetics giving rise to our model. As indicated by the reviewer, our work shows, through kinetic studies, that the storage of sterols within LD occurs at later stages than the activation of the DDR by Tel1 and Rad53 phosphorylation. Tel1 foci decline is necessary for subsequent engagement of downstream DNA long-range resection. Since we propose that sterol storage within LD is a means to attenuate Tel1 engagement at DSBs, it is thus logical (and thus compatible with the data we show) that LD number increase occurs simultaneously with Tel1 foci decrease, at late stages of the reactionWe will include this explanation and graph in the revised version of the work.

      Minor comments:

      Figure S1D and E, experiments should be carried out to include time points in which LD accumulation and cell cycle arrest are observed upon zeocin treatment (i.e., up to 210' as in Figure 1A)

      We will provide cytometry profiles of cells at 210 min. These data exist already in our laboratory.

      How do authors explain increased single strand annealing recombination frequencies in steD and oleate-treated wild type cells (Figure 4A). Should it not be expected that increased STEs also impair recombination induced by endogenous damage?

      Only ste∆ (and not +oleate) indeed manifests an increase in basal recombination frequencies, likely arising from endogenous damage. Although the increase is observed, it is not significant. We agree anyway with the reviewer that, was the experiment to be repeated more times, the increase may be found significantly different. We do not have any honest proposal to explain this.

      Data presented in figure 4B and 4C are not fully convincing. Performing time course experiments might help concluding if the differences observed represent a relevant defect in DSB processing.

      We will perform a Pulsed Field Gel Electrophoresis (PFGE) kinetcis in response to zeocin with or without oleate pre-loading to reinforce the conclusion.

      Is Figure 5B referring to Flag-tagged Tel1 or GFP-tagged Tel1 as stated in the figure legend?

      There is a misunderstanding here, as the mentioned Figure 5B corresponds to P-ATM immunofluorescences in human cells, not to any tagged Tel1 experiment.

      Treatment with the ATM inhibitor AZD0156 increased PI(4)P-ATM PLA signals. From these authors conclude that "association of ATM and PI(4)P inversely correlated with the need for ATM within the nucleus. Do they imply that treatment with ATM-inhibitors reduces the requirement for ATM function in the nucleus? The interpretation of this result should be further elaborated to sustain this conclusion.

      We may have conveyed a wrong notion at this point. We do not imply at all that ATM inhibitors reduce the need for ATM in the nucleus. Instead, we imply that, by reinforcing ATM attachment to Golgi-resident PI(4)P, ATM inhibitors end up titrating ATM away from the nucleus. We will clarify our explanation to avoid misunderstandings.

      An increase of GFP-Tel1 foci upon OSH4 overexpression is described on Figure 7B. These are described as nuclear in the results, but no reference is made in the figure or legend as to how nucleus positions are addressed in these experiments. This should be clarified.

      We systematically combine the tagging of a nucleoplasmic protein (mCherry-Pus1) with the detection of GFP-Tel1 foci, as to unambiguously assess the nuclear position of Tel1 foci. We will include this explanation and the corresponding mCherry-Pus1 channel to clarify this.

      Also, WT controls and quantifications should be included in the experiments shown on Figure 7C.

      These experiments are quantified from the moment we did them, although we did not include such quantifications in the present version for the sake of space. We will do so in the revised version.

      Reviewer #1 (Significance (Required)):

      While the conclusion of lipid metabolism responding to DSBs is not convincing, the observation that Tel1/ATM function is modulated by PI(4)P biding is significant and advances the understanding on the function and regulation of this key kinase in promoting genome integrity maintenance. This is an unanticipated result which is highly novel and has implications for the modulation of Tel1/ATM function through pharmacological manipulation of lipid metabolism. This finding would be of broad interest for scientists working on the response to DNA damage and the maintenance of genome integrity. This reviewer belongs to that group and has limited expertise to evaluate the lipid metabolism genetic manipulation in the manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors show that cytoplasmic PI4P have a regulatory role on ATM response to DNA double strand breaks. The process involves a balance between exchange of PI4P between Golgi and ER in exchange of esterified sterols. The study is of interest, however provides indirect evidences to support their conclusions.

      Major comments : 1). Since the major conclusion relates to PI4P association with ATM in basal conditions to keep ATM outside nucleus and known presence of PI4P, ATM in other organelles of a cell, further experiments such as cell fractionation experimental that show golgi specific interaction would support the main conclusion.

      In continuation of 1st comment, since PI4P in substrate of PI4 phosphoinositol kinases, is there a competition between PI4kinases and ATM for PI4P binding should be addressed through immunoprecipitation studies.

      First of all, we need to specify here that PI4kinases will phosphorylated PI4 to create PI(4)P. Thus, PI(4)P is the product, and not the substrate, of PI4kinases. We therefore do not expect any competition between such kinases and ATM.

      Second, we take good note of the reviewer’s concern that the pool of PI(4)P at the Golgi may be shared, and also that it would be important to reinforce the notion of the relative subcellular localization of ATM under different treatments. To this end, we will perform the following integrative experiment:

      Immunoprecipitation of PI(4)P could theoretically be done using our specific antibody, yet the IP efficiency of a lipid cannot be verified by western blot. Further, there are PI(4)P pools elsewhere in the cell that would mess up with interpretations. We therefore dismiss the use of anti-PI(4)P as a tool to perform immunoprecipitations.

      Instead, to explore the impact of PI(4)P levels on ATM both at the Golgi and within the nucleus, we will split our cultures in two to either immunoprecipitate specific cytoplasmic Trans-Golgi Network-associated proteins (we will test separately TGN46 and GOLPH3); or the nuclear ATM-interacting factor MRE11 from nuclei, then blot for co-immunoprecipitated ATM. The relative co-immunoprecipitated ATM is expected to vary under different treatments to which the cells will be exposed, namely:

      • untreated
      • zeocin, to trigger ATM need in the nucleus
      • OSBP inhibition (+/- zeocin), to stabilize PI(4)P at the Golgi
      • PIK93, an inhibitor of PI4 kinases that prevents PI(4)P synthesis

      2). The authors claim that the ATM retention is the main function of PI4P in Golgi. The authors should rule out the possibility that the phenotype observed on DNA damage response is not due to non availability of PI4P substrate for PI4P kinases, that have recently been shown to participate in genome integrity maintenance.

      We want to explain that we do not intend to say that PI(4)P main function at the Golgi is ATM retention, as PI(4)P is a molecule binding and modulating multiple proteins, as for example the aforementioned GOLPH3. We will first revise our text to correct it, in case we have conveyed this incorrect notion, as it stems from the reviewer’s comment.

      Second, the reviewer evokes the notion that PI(4)P can be the substrate of a second phosphorylation, which could give rise to PI(3,4)P or to PI(4,5)P, which could still undergo remodeling into PI(3)P, for example. Recent work by Dr Michael Sheetz’s lab demonstrated that this set of phosphoinositides serves to drive the nucleation and activation of the ATR-Chk1 branch of the DNA Damage Response upon genotoxic stress, yet was completely inert with respect to the ATM-Chk2 branch (5). To rule out the possibility, as evoked by the reviewer, that the oleate-induced DDR phenomena we describe relate to these other events, we have now explored the response of the ATR-Chk1 branch when comparing the response of zeocin-treated cells that have been pre-loaded or not with oleate. We observe that the ATR-Chk1 branch is unaltered by oleate loading. Thus, we can now propose that the PI(4)P branch exclusively modulates the ATM-Chk2 axis.

      3). Does Oleate treatment influences Rad53 protein levels in addition to its phosphorylation that affect DNA damage response may be addressed.

      Exponential cultures from three different WT, three different ste∆ and three different yeh2∆ strains have now been taken and pre-loaded for 2 hours with 0.05% oleate, then total levels of Rad53 (without induction of DNA damage) assessed. We can now formally say that basal levels of Rad53 protein are not altered by this incubation. We will include this control in the revised manuscript.

      4). Does Yeh2 deletion reduces LDS should be checked.

      We frequently use yeh2∆ cells in our studies. In particular, we have recently published work characterizing the phenotype of this strain with respect to the formation of lipid droplets in the nucleus (6). We are currently exploiting those same sets of data to quantify the total number of LD in order to satisfy the reviewer’s concern.

      5). Figure 4D representation should show % of phospho reduction of initial activation and a better western blot image should be shown that show equal loading of samples.

      We are currently repeating these gels and blots for the sake of clarity, as requested.

      6). In immunoprecipitation experiments, kindly include isotypee IgG controls as well to rule out non-specificity.

      Of course, this important control will be included every time.

      Minor points: 1). Figure S1F do not show oleate treatment as presented in results section.

      We will revise the accurate naming.

      2). A better gel for S4B should be presented with ponceau of the same gel.

      We are currently repeating this gel and associated blot for the sake of clarity, as requested.

      3). Nuclear PI4Ps has also been previously reported, an explanation to the specific interaction of ATM and PI4P in the Golgi should be addressed/discussed.

      We take it that the reviewer is referring here to the recent work by Fáberová et al (7) in which PI(4)P and PI(4,5)P were described as very dynamic in the nucleus, and mostly related then to mRNA transcription, splicing and export. We will reinforce the connection of our phenomenon to the Golgi-associated pool of PI(4)P thanks to the co-immunoprecipitation experiments proposed above, and will timely contextualize these in light of the paper by Fáberová and co-workers in the revised version. Thank you for reminding us of this work.

      Reviewer #2 (Significance (Required)):

      The current work definitely adds a layer in our understanding to ATM regulation and cross-talk between different PIKK family of kinases. ATM localisation in extra nuclear regions of a cell has been described earlier with significant impact on cell physiology such as mitochondria etc., ATM retention at golgi and limiting nuclear ATM levels is significant advance at ATM activity regulation, while signifying non canonical function of PI4P.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, the authors propose that ATM/Tel1 signaling is regulated in a spatiotemporal manner during genotoxic stress both in yeast and mammalian cells. They show that Lipid droplets accumulate in response to genotoxic stress. As a consequence, there is a decrease of exchange of PI4P from the Golgi to ER, thus dampening ATM/Tel1 signaling by sequestering this kinase into the Golgi. The authors combined findings in yeast and mammals showing that this mechanism is conserved throughout eukaryotes. For this purpose, they use a vast number of techniques that support their proposed model.

      Major comments:

      The conclusions were made based on evidence combining yeast genetics, immunofluorescence, DNA end resection analysis and pharmacological interventions. The hypothesis that ATM is kept away from the nucleus by physically interacting with PI4P at the Golgi, thus allowing processive repair is bold and contributes for a better understanding of the choreography of the DDR kinases during DSB repair. However, many of the experiments in yeast and mammals show only mild phenotypes and there is no evidence that this mode of ATM dampening impact cell viability in mammals.

      We agree with the reviewer that the effects associated to the reported phenomenon are indeed mild. This is a fact. We would like to remind that the metabolism of sterols is finely controlled, and at many different levels, in a very complex manner. For example, sterol increases in the cell will immediately be compensated by reduced synthesis, while synthesis inhibition will immediately promote uptake from the medium, and/or release from stores (for example, see (8)). As a natural consequence, the window of manipulation and, more importantly, the strength of the phenotypes we can uncover are small.

      Therefore, I have some comments and suggestions of experiments that I think could improve the quality of the manuscript. I believe that most of these new experiments does not require much time and resources.

      • Does oleate treatment in RPE-1/Huh-7 cells induce loss of viability? An experiment showing loss of viability like MT-assay or decreased cell proliferation would reinforce the importance of the mechanism proposed.

      This experiment was already included in the previous version, yet it may have escaped the attention of the reviewer. We show in Figure S2E that oleate treatment restricts viability in Huh-7 cells alone, and also worsens their tolerance to zeocin. Perhaps we should reconsider moving this result to the main figures so that it does not go unnoticed.

      • In yeast there is evidence that a ste delta strain show sensitivity to zeocin/CPT, but there is no experiment showing the same effect on cells lacking Yeh2. Since both strains share similar phenotypes, it would be interesting to show that increased kinetics of Rad53 signaling leads to sensitivity to genotoxins.

      We have now performed this experiment, we will include the matching information for yeh2∆ cells, which agrees with the predictions.

      • The conclusion that ste delta cells exposed to zeocin leads to unproductive events due to defects in DNA-end resection could be reinforced by a decrease in Rad52 foci. It has been previously shown by the group of Dr. Marcus Smolka, that inhibition of DNA-end resection decreases Rad52 foci (https://doi.org/10.1083/jcb.201607031). Since the authors were able to monitor Rad52-YFP (Figure S1A), it shouldn't consume time and resources.

      The reviewer is right that this experiment should not be time- or resources-consuming. We will evaluate the accumulation of Rad52 foci in response to the concerned genotoxin in ste∆ cells.

      • Since the authors propose that there is a DNA repair defect due to inhibition of long-range DNA-end resection, it would be important to monitor gamma-H2A(X) signal either in yeast or mammals.

      Taking into consideration the reviewer’s suggestion, we have now performed anti-yH2AX immunofluorescence of all the implied conditions (genotoxins +/- oleate pre-load) and will quantify them to answer the concern.

      • How do the authors exclude the possibility that yeast mutants or oleate treatment in yeast/mammalian cells change membrane permeability allowing an increase in genotoxin concentration?

      Although this is a very reasonable criticism, we want to remind the data we present in Figure S4A in which we use the naturally DSB-bearing rad3-102 cells for recombination analyses, showing that, in the absence of any genotoxin, the same phenotype also applies. Yet, we want to reinforce the notion that LD formation in response to DSB can also occur when the breaks are not chemically, but physically, induced. To this end, and also to match a related request by Reviewer 1, we will:

      1. a) exploit the naturally DSB-accumulating mutant rad3-102 (4) to evaluate whether it endogenously harbors more LD in comparison with the WT.
      2. b) we have recently created a tool in which gRNAs targeted to different subsets of transposons in the genome can drive Cas9 to create DSB in a dose-dependent manner ((9), under revision in Genetics). We will use this system to monitor the LD formation in response to Cas9-triggered cuts. In addition, on figure 5A, significant differences in GFP-Tel1 foci abundance between WT and steD or yeh2D cells are only observed after 210', way after the slight effect on Rad53 phosphorylation is observed. This is at odds with the conclusion that Tel1 association to STEs modulates DDR signaling.

      We are afraid that we have not been clear enough in explaining the kinetics giving rise to our model. As indicated by the reviewer, our work shows, through kinetic studies, that the storage of sterols within LD occurs at later stages than the activation of the DDR by Tel1 and Rad53 phosphorylation. Tel1 foci decline is necessary for subsequent engagement of downstream DNA long-range resection. Since we propose that sterol storage within LD is a means to attenuate Tel1 engagement at DSBs, it is thus logical (and thus compatible with the data we show) that LD number increase occurs simultaneously with Tel1 foci decrease, at late stages of the reactionWe will include this explanation and graph in the revised version of the work.

      • It would be interesting to investigate genetic interactions between ste delta (or yeh2delta) and yeast mutants with DNA-end resection problems (exo1delta; sae2delta). For instance, it has been shown that Sae2 antagonizes checkpoint signaling by competing with Rad9 to DSB sites (https://doi.org/10.1073/pnas.1816539115). Also, cells lacking Sae2 show an increase in Rad53 signaling due to increased Tel1 Signaling. Therefore, an epistatic effect between these two pathways would reinforce the hypothesis of the manuscript.

      we will build the double mutant sae2∆ yeh2∆ and assess the potential epistatic behavior they may display with respect to some key phenotypes (Tel1 foci formation, Rad53 phosphorylation…).

      • The authors showed that Tel1-GFP does not accumulate in the nucleus in cells lacking Sac1 (Figure 7C). Tel1 is important to cope with increased DSBs in the absence of Mec1, thus avoiding genomic instability. Cells lacking both Mec1 and Tel1 show a sick phenotype with an exponential increase in gross chromosomal rearrangements and sensitivity to genotoxins. Therefore, does deletion of Mec1 (and Sml1) in sac1 delta phenocopies a mec1tel1 delta? Alternatively, does pharmacological inhibition of ATR in the presence of the OSBP1 inhibitor causes loss of viability or chromosomal aberrations?

      We will delete SAC1 in mec1∆ sml1∆ and compare the fitness, through growth drop assays, with respect to the mutant tel1∆ mec1∆ sml1∆.

      We will expose cells either to OSBP1 inhibitor, ATR inhibitor, or both, and assess the phosphorylation of their downstream common effector H2AX. Additionally, we will assess the effect on cell growth of the combination of ATRi and OSBP1i using synergy matrices. We will determine if the combination of both drugs synergizes or not to impair cell proliferation and reduce cell viability.

      • Finally, it seems strange to me that ATR/Mec1 signaling is not mentioned throughout the entire manuscript. Does PI4P pathway affect only ATM/Tel1? In Figure 2D, an antibody against phospho-CHK1 could be used to monitor ATR signaling. In line with that, I would like to see in the discussion how these new findings are in line with evidence from a 2019 paper showing that phophoinositides PIP2 and PIP3, but not PI4P are important for ATR signaling (DOI: 10.1038/s41467-017-01805-9). They showed that a nuclear pool of PIP2 increases upon DNA damage induction and rapidly accumulates at DNA lesions. This event is important for the recruitment of ATR. Since PI4P is substrate for PIP2 synthesis and there is a nuclear pool of PI4P and PIP2, I think it is important to discuss if the results presented here are in line with these previous findings.

      The reviewer evokes recent work by Dr Michael Sheetz’s lab demonstrating that a different set of phosphoinositides serves to drive the nucleation and activation of the ATR-Chk1 branch of the DNA Damage Response upon genotoxic stress, yet was completely inert with respect to the ATM-Chk2 branch (5). We have now explored, also to satisfy a similar concerned raised by Reviewer 2, the response of the ATR-Chk1 branch when comparing the response of zeocin-treated cells that have been pre-loaded or not with oleate. We observe that the ATR-Chk1 branch is unaltered by oleate loading. Thus, we can now propose that the PI(4)P branch exclusively modulates the ATM-Chk2 axis.

      We will of course give the needed credit to this work and contextualize our findings accordingly.

      Minor comments:

      • Line 124: The correct is Figure S1E, lower panel and not Figure S1F -Lines 127-128: Figure S2A does not show zeocin treatment

      Both minor mistakes will be corrected.

      Reviewer #3 (Significance (Required)):

      Together, these new findings, if corroborated by others, might be important to open new lines of investigation in basic and translational research regarding human diseases as explored in the discussion section. I believe this paper will attract attention not only from the DDR field but also from other areas of research such as nutrient and lipid signaling both in yeast and mammals. I hope I was able to collaborate in this review, since my main expertise is in the area of DNA damage signaling using budding yeast as an organism model.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      This is a very interesting study where Sara et al. demonstrated a link between lipid metabolism with DNA repair response (DDR). In this study, they have proposed ATM as a novel PI4P-effector. The sterol deposition into lipid droplets impacts the Golgi PI4P level due to lipid exchange machinery facilitated by OSBP1, therefore regulating the cytosolic retention of ATM due to PI4P binding. Although how lipid droplets in the cytosol sense the DNA damage and control the initiation of DDR by regulating ATM is still unclear, this study linked lipid biology/PI signaling to DNA damage repair and showed the evolutionary conservation of PI signaling and DNA repair machinery from yeast to humans. The experiments are well designed, nicely controlled, with a high quality of data presentation. With some improvements, this work could be a very interesting study attracting a broad readership.

      In their model, ATM is PI4P-bound and sequestered inside the cytosol under basal conditions. Upon genotoxic stress, activation of OSBP1 removes PI4P and free PI4P-bound ATM for nuclear translocation of DNA repair. This could also be interpreted as genotoxic stress-induced PIP-kinase activity, where PI4P is processed into PIP2 or PIP3, somehow redirecting ATM into the nucleus to initiate its activation for DDR. Those aspects should be discussed and improved.

      Both Reviewers 2 and 3 have somehow evoked a similar concern. More precisely, the work by Dr Michael Sheetz’s lab demonstrating that a different set of phosphoinositides serves to drive the nucleation and activation of the ATR-Chk1 branch of the DNA Damage Response upon genotoxic stress, yet was completely inert with respect to the ATM-Chk2 branch (5). We have now explored, to satisfy all reviewers’ concerns, the response of the ATR-Chk1 branch when comparing the response of zeocin-treated cells that have been pre-loaded or not with oleate. We observe that the ATR-Chk1 branch is unaltered by oleate loading. Thus, we can now propose that the PI(4)P branch exclusively modulates the ATM-Chk2 axis.

      Additionally, we will of course give the needed credit to this work and contextualize our findings accordingly.

      Upon stress, there is nuclear activation of p53-phosphoinositide (PI) signalosomes and PIP-kinases. Also, there is a significant PIP2 pool inside the nucleus with an involvement in DNA damage repair. Those papers and their relevance to the current study need to be discussed. If ATM is a novel PI4P-effector, there is also nuclear PI4P formation or nuclear PI4P accumulation upon stresses based on recent studies; how the ATM interacts with PIPn in the nucleus upon translocation? A know ATM substrate p53 is PIP2/PIP3 bound in the nucleus based on recent studies. Will ATM prefer to interact with other PIPn-bound proteins in the nucleus or PIPn regulate their interaction needs to be discussed.

      These additional notions are in line with the previous paragraph presented by the reviewer, and our answers too. We will provide a constructive overview of all these ideas in the revised version of the manuscript.

      Major points: 1. The PI4P-ATM complex is supported only by PLA and PIP strips. Need more robust biochemical characterization of the interaction: co-IP, lipid binding, and/or in vitro constitution.

      We agree with the need to perform assays in which PI(4)P is embedded in a bilayer, as to confidently assess whether Tel1 can bind it in that context. We have now performed a pilot experiment in which we have confronted purified FLAG-Tel1 to liposomes harboring PI(4)P. Western blot analysis using anti-FLAG antibody shows the encouraging result that FLAG-Tel1 can be found there. As a control, we have performed the same process but in the absence of any liposomes. We observe that a residual fraction of FLAG-Tel1 can nevertheless be found in this control, most probably because the buffer used during the liposome assay makes part of FLAG-Tel1 precipitate.To avoid this type of background and to increase our trust in the results, we propose to perform the liposome assay but on a discontinuous density gradient, so that liposomes will be retrieved in the top layer (and bound FLAG-Tel1 with them, if that is the case), while any precipitated FLAG-Tel1 will be in the bottom fraction (liposome floatation assay). As a further control, we will include the same liposomes but lacking PI(4)P. We expect to be successful in the floatation assays. If we are not, we will repeat the experiment presented above to be confident that the observed increase is reproducible.

      1. The use of drug inhibitors only in the final figure is problematic. KD or KO experiments should be performed to confirm that ATM and the exchanger are the relevant targets.

      We have now used siRNAs against the exchanger protein, OSBP1, with a very high silencing rate success. We have next monitored the activation status of the chromatin-associated ATM target KAP1, in order to monitor the predicted decrease of ATM activity specifically inside the nucleus. Our results confirm the role of OSBP1, by KD experiments as requested by the reviewer, in attenuating ATM nuclear participation.

      1. Poor quality of some WBs (e.g Fig. S1F).

      We have now repeated the Western Blot to detect Rad53-P in response to 20 mM HU in WT versus ste∆cells.

      1. Lack of statistical analyses for some data (e.g. Fig. 1B-E)

      We had already included, in the previous version, the complete statistical analyses corresponding to Figures 1B to E and evoked here by the reviewer. They were indeed included in Figure S1C, and our brief reference to them in the text may have escaped her/his attention. We will make a clear reference to this in the revised version.

      Additional clarification points:

      Figure 1: No representative images were shown for quantifications in Figure 1C, D, E.

      If the reviewer / editor estimates it pertinent, we can of course include them. Yet, they will be very redundant with the images displayed in Figure 1A.

      Line 121: Should be Figure S1E, upper panel. Line 124: Should be Figure S1E, lower panel. Figure 2D-E, please show the quantification of the ratio of pCHK2/CHK2 with an N=3

      We will correct / include the requested changes.

      Figure S2B: needs quantification of NileRed staining to conclude induction in LD formation

      We will quantify the LD as requested.

      Figure 3C, to show the selectivity of ATM-binding toward PI4P, PLA of ATM with other PIPn species should be assessed, such as PI3P, PI4,5P2, and PI3,4,5P3.

      We have provided an overview of the binding preferences of ATM with respect to the full battery of phosphoinositides in the strip-binding assay shown in Figures S5C and 6B. Other than that, we are afraid that PLA studies as the ones we develop in the current manuscript for PI(4)P are not feasible, since no reliable antibodies exist for most of the phosphoinositide species evoked by the reviewer.

      Figure S6A, PI4P level could be assessed by IF staining using PI4P antibody besides using PI4P sensor.

      We will use our PI(4)P antibody to monitor by immunofluorescence the behavior of this molecule in response to either MMS or zeocin, as suggested.

      References

      1. Cheung HC, San Lucas FA, Hicks S, Chang K, Bertuch AA, Ribes-Zamora A. An S/T-Q cluster domain census unveils new putative targets under Tel1/Mec1 control. BMC Genomics. 2012;
      2. Bensimon A, Schmidt A, Ziv Y, Elkon R, Wang SY, Chen DJ, et al. ATM-dependent and -independent dynamics of the nuclear phosphoproteome after DNA damage. Sci Signal. 2010;
      3. BastosdeOliveira FM, Kim D, Cussiol JR, Das J, Jeong MC, Doerfler L, et al. Phosphoproteomics Reveals Distinct Modes of Mec1/ATR Signaling during DNA Replication. Mol Cell. 2015;
      4. Moriel-Carretero M, Aguilera A. A Postincision-Deficient TFIIH Causes Replication Fork Breakage and Uncovers Alternative Rad51- or Pol32-Mediated Restart Mechanisms. Mol Cell. 2010;37(5):690–701.
      5. Wang YH, Hariharan A, Bastianello G, Toyama Y, Shivashankar G V., Foiani M, et al. DNA damage causes rapid accumulation of phosphoinositides for ATR signaling. Nat Commun. 2017;
      6. Kumanski S, Forey R, Cazevieille C, Moriel-Carretero M. Nuclear Lipid Droplet Birth during Replicative Stress. Cells. 2022;11(1390).
      7. Fáberová V, Kalasová I, Krausová A, Hozák P. Super-Resolution Localisation of Nuclear PI(4)P and Identification of Its Interacting Proteome. Cells. 2020;9(5):1–17.
      8. Luo J, Yang H, Song BL. Mechanisms and regulation of cholesterol homeostasis. Nat Rev Mol Cell Biol [Internet]. 2020;21(4):225–45. Available from: http://dx.doi.org/10.1038/s41580-019-0190-7
      9. Coiffard J, Santt O, Kumanski S, Pardo B, Moriel-Carretero M. A CRISPR-Cas9-based system for the dose-dependent study of 4 DNA double strand breaks sensing and repair 5 6. bioRxiv [Internet]. 2021;1–37. Available from: https://doi.org/10.1101/2021.10.21.465387.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      This is a very interesting study where Sara et al. demonstrated a link between lipid metabolism with DNA repair response (DDR). In this study, they have proposed ATM as a novel PI4P-effector. The sterol deposition into lipid droplets impacts the Golgi PI4P level due to lipid exchange machinery facilitated by OSBP1, therefore regulating the cytosolic retention of ATM due to PI4P binding. Although how lipid droplets in the cytosol sense the DNA damage and control the initiation of DDR by regulating ATM is still unclear, this study linked lipid biology/PI signaling to DNA damage repair and showed the evolutionary conservation of PI signaling and DNA repair machinery from yeast to humans. The experiments are well designed, nicely controlled, with a high quality of data presentation. With some improvements, this work could be a very interesting study attracting a broad readership.

      In their model, ATM is PI4P-bound and sequestered inside the cytosol under basal conditions. Upon genotoxic stress, activation of OSBP1 removes PI4P and free PI4P-bound ATM for nuclear translocation of DNA repair. This could also be interpreted as genotoxic stress-induced PIP-kinase activity, where PI4P is processed into PIP2 or PIP3, somehow redirecting ATM into the nucleus to initiate its activation for DDR. Those aspects should be discussed and improved.

      Upon stress, there is nuclear activation of p53-phosphoinositide (PI) signalosomes and PIP-kinases. Also, there is a significant PIP2 pool inside the nucleus with an involvement in DNA damage repair. Those papers and their relevance to the current study need to be discussed. If ATM is a novel PI4P-effector, there is also nuclear PI4P formation or nuclear PI4P accumulation upon stresses based on recent studies; how the ATM interacts with PIPn in the nucleus upon translocation? A know ATM substrate p53 is PIP2/PIP3 bound in the nucleus based on recent studies. Will ATM prefer to interact with other PIPn-bound proteins in the nucleus or PIPn regulate their interaction needs to be discussed.

      Major points:

      1. The PI4P-ATM complex is supported only by PLA and PIP strips. Need more robust biochemical characterization of the interaction: co-IP, lipid binding, and/or in vitro constitution.
      2. The use of drug inhibitors only in the final figure is problematic. KD or KO experiments should be performed to confirm that ATM and the exchanger are the relevant targets.
      3. Poor quality of some WBs (e.g Fig. S1F).
      4. Lack of statistical analyses for some data (e.g. Fig. 1B-E)

      Additional clarification points:

      • Figure 1: No representative images were shown for quantifications in Figure 1C, D, E.

      • Line 121: Should be Figure S1E, upper panel.

      • Line 124: Should be Figure S1E, lower panel.

      • Figure 2D-E, please show the quantification of the ratio of pCHK2/CHK2 with an N=3

      • Figure S2B: needs quantification of NileRed staining to conclude induction in LD formation.

      • Figure 3C, to show the selectivity of ATM-binding toward PI4P, PLA of ATM with other PIPn species should be assessed, such as PI3P, PI4,5P2, and PI3,4,5P3.

      • Figure S6A, PI4P level could be assessed by IF staining using PI4P antibody besides using PI4P sensor.

      Significance

      This is a very interesting study where Sara et al. demonstrated a link between lipid metabolism with DNA repair response (DDR). In this study, they have proposed ATM as a novel PI4P-effector. The sterol deposition into lipid droplets impacts the Golgi PI4P level due to lipid exchange machinery facilitated by OSBP1, therefore regulating the cytosolic retention of ATM due to PI4P binding. Although how lipid droplets in the cytosol sense the DNA damage and control the initiation of DDR by regulating ATM is still unclear, this study linked lipid biology/PI signaling to DNA damage repair and showed the evolutionary conservation of PI signaling and DNA repair machinery from yeast to humans. The experiments are well designed, nicely controlled, with a high quality of data presentation. With some improvements, this work could be a very interesting study attracting a broad readership.

      In their model, ATM is PI4P-bound and sequestered inside the cytosol under basal conditions. Upon genotoxic stress, activation of OSBP1 removes PI4P and free PI4P-bound ATM for nuclear translocation of DNA repair. This could also be interpreted as genotoxic stress-induced PIP-kinase activity, where PI4P is processed into PIP2 or PIP3, somehow redirecting ATM into the nucleus to initiate its activation for DDR. Those aspects should be discussed and improved.

      Upon stress, there is nuclear activation of p53-phosphoinositide (PI) signalosomes and PIP-kinases. Also, there is a significant PIP2 pool inside the nucleus with an involvement in DNA damage repair. Those papers and their relevance to the current study need to be discussed. If ATM is a novel PI4P-effector, there is also nuclear PI4P formation or nuclear PI4P accumulation upon stresses based on recent studies; how the ATM interacts with PIPn in the nucleus upon translocation? A know ATM substrate p53 is PIP2/PIP3 bound in the nucleus based on recent studies. Will ATM prefer to interact with other PIPn-bound proteins in the nucleus or PIPn regulate their interaction needs to be discussed.

      Major points:

      1. The PI4P-ATM complex is supported only by PLA and PIP strips. Need more robust biochemical characterization of the interaction: co-IP, lipid binding, and/or in vitro constitution.
      2. The use of drug inhibitors only in the final figure is problematic. KD or KO experiments should be performed to confirm that ATM and the exchanger are the relevant targets.
      3. Poor quality of some WBs (e.g Fig. S1F).
      4. Lack of statistical analyses for some data (e.g. Fig. 1B-E)

      Additional clarification points:

      • Figure 1: No representative images were shown for quantifications in Figure 1C, D, E.

      • Line 121: Should be Figure S1E, upper panel.

      • Line 124: Should be Figure S1E, lower panel.

      • Figure 2D-E, please show the quantification of the ratio of pCHK2/CHK2 with an N=3

      • Figure S2B: needs quantification of NileRed staining to conclude induction in LD formation.

      • Figure 3C, to show the selectivity of ATM-binding toward PI4P, PLA of ATM with other PIPn species should be assessed, such as PI3P, PI4,5P2, and PI3,4,5P3.

      • Figure S6A, PI4P level could be assessed by IF staining using PI4P antibody besides using PI4P sensor.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, the authors propose that ATM/Tel1 signaling is regulated in a spatiotemporal manner during genotoxic stress both in yeast and mammalian cells. They show that Lipid droplets accumulate in response to genotoxic stress. As a consequence, there is a decrease of exchange of PI4P from the Golgi to ER, thus dampening ATM/Tel1 signaling by sequestering this kinase into the Golgi. The authors combined findings in yeast and mammals showing that this mechanism is conserved throughout eukaryotes. For this purpose, they use a vast number of techniques that support their proposed model.

      Major comments:

      The conclusions were made based on evidence combining yeast genetics, immunofluorescence, DNA end resection analysis and pharmacological interventions. The hypothesis that ATM is kept away from the nucleus by physically interacting with PI4P at the Golgi, thus allowing processive repair is bold and contributes for a better understanding of the choreography of the DDR kinases during DSB repair. However, many of the experiments in yeast and mammals show only mild phenotypes and there is no evidence that this mode of ATM dampening impact cell viability in mammals. Therefore, I have some comments and suggestions of experiments that I think could improve the quality of the manuscript. I believe that most of these new experiments does not require much time and resources.

      • Does oleate treatment in RPE-1/Huh-7 cells induce loss of viability? An experiment showing loss of viability like MT-assay or decreased cell proliferation would reinforce the importance of the mechanism proposed.
      • In yeast there is evidence that a ste delta strain show sensitivity to zeocin/CPT, but there is no experiment showing the same effect on cells lacking Yeh2. Since both strains share similar phenotypes, it would be interesting to show that increased kinetics of Rad53 signaling leads to sensitivity to genotoxins.
      • The conclusion that ste delta cells exposed to zeocin leads to unproductive events due to defects in DNA-end resection could be reinforced by a decrease in Rad52 foci. It has been previously shown by the group of Dr. Marcus Smolka, that inhibition of DNA-end resection decreases Rad52 foci (https://doi.org/10.1083/jcb.201607031). Since the authors were able to monitor Rad52-YFP (Figure S1A), it shouldn't consume time and resources.
      • Since the authors propose that there is a DNA repair defect due to inhibition of long-range DNA-end resection, it would be important to monitor gamma-H2A(X) signal either in yeast or mammals.
      • How do the authors exclude the possibility that yeast mutants or oleate treatment in yeast/mammalian cells change membrane permeability allowing an increase in genotoxin concentration?
      • It would be interesting to investigate genetic interactions between ste delta (or yeh2delta) and yeast mutants with DNA-end resection problems (exo1delta; sae2delta). For instance, it has been shown that Sae2 antagonizes checkpoint signaling by competing with Rad9 to DSB sites (https://doi.org/10.1073/pnas.1816539115). Also, cells lacking Sae2 show an increase in Rad53 signaling due to increased Tel1 Signaling. Therefore, an epistatic effect between these two pathways would reinforce the hypothesis of the manuscript.
      • The authors showed that Tel1-GFP does not accumulate in the nucleus in cells lacking Sac1 (Figure 7C). Tel1 is important to cope with increased DSBs in the absence of Mec1, thus avoiding genomic instability. Cells lacking both Mec1 and Tel1 show a sick phenotype with an exponential increase in gross chromosomal rearrangements and sensitivity to genotoxins. Therefore, does deletion of Mec1 (and Sml1) in sac1 delta phenocopies a mec1tel1 delta? Alternatively, does pharmacological inhibition of ATR in the presence of the OSBP1 inhibitor causes loss of viability or chromosomal aberrations?
      • Finally, it seems strange to me that ATR/Mec1 signaling is not mentioned throughout the entire manuscript. Does PI4P pathway affect only ATM/Tel1? In Figure 2D, an antibody against phospho-CHK1 could be used to monitor ATR signaling. In line with that, I would like to see in the discussion how these new findings are in line with evidence from a 2019 paper showing that phophoinositides PIP2 and PIP3, but not PI4P are important for ATR signaling (DOI: 10.1038/s41467-017-01805-9). They showed that a nuclear pool of PIP2 increases upon DNA damage induction and rapidly accumulates at DNA lesions. This event is important for the recruitment of ATR. Since PI4P is substrate for PIP2 synthesis and there is a nuclear pool of PI4P and PIP2, I think it is important to discuss if the results presented here are in line with these previous findings.

      Minor comments:

      • Line 124: The correct is Figure S1E, lower panel and not Figure S1F
      • Lines 127-128: Figure S2A does not show zeocin treatment

      Significance

      Together, these new findings, if corroborated by others, might be important to open new lines of investigation in basic and translational research regarding human diseases as explored in the discussion section. I believe this paper will attract attention not only from the DDR field but also from other areas of research such as nutrient and lipid signaling both in yeast and mammals. I hope I was able to collaborate in this review, since my main expertise is in the area of DNA damage signaling using budding yeast as an organism model.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors show that cytoplasmic PI4P have a regulatory role on ATM response to DNA double strand breaks. The process involves a balance between exchange of PI4P between Golgi and ER in exchange of esterified sterols. The study is of interest, however provides indirect evidences to support their conclusions.

      Major comments:

      1). Since the major conclusion relates to PI4P association with ATM in basal conditions to keep ATM outside nucleus and known presence of PI4P, ATM in other organelles of a cell, further experiments such as cell fractionation experimental that show golgi specific interaction would support the main conclusion.

      2). In continuation of 1st comment, since PI4P in substrate of PI4 phosphoinositol kinases, is there a competition between PI4kinases and ATM for PI4P binding should be addressed through immunoprecipitation studies.

      3). The authors claim that the ATM retention is the main function of PI4P in Golgi. The authors should rule out the possibility that the phenotype observed on DNA damage response is not due to non availability of PI4P substrate for PI4P kinases, that have recently been shown to participate in genome integrity maintenance.

      4). Does Oleate treatment influences Rad53 protein levels in addition to its phosphorylation that affect DNA damage response may be addressed.

      5). Does Yeh2 deletion reduces LDS should be checked.

      6). Figure 4D representation should show % of phospho reduction of initial activation and a bettier western blot image should be shown that show equal loading of samples.

      7). In ammunoprecipitation experiments, kindly include isotypee IgG controls as well to rule out non-specificity.

      Minor points:

      1). Figure S1F do not show oleate treatment as presented in results section.

      2). A better gel for S4B should be presented with ponceau of the same gel.

      3). Nuclear PI4Ps has also been previously reported, an explanation to the specific interaction of ATM and PI4P in the Golgi should be addressed/discussed.

      Significance

      The current work definitely adds a layer in our understanding to ATM regulation and cross-talk between different PIKK family of kinases. ATM localisation in extra nuclear regions of a cell has been described earlier with significant impact on cell physiology such as mitochondria etc., ATM retention at golgi and limiting nuclear ATM levels is significant advance at ATM activity regulation, while signifying non canonical function of PI4P.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Ovejero et al. report an increase in lipid droplet (LD) abundance after long (from 120' on) exposure of budding yeast cells to DNA damaging agents zeocin and camptothecin (CPT). Next, they analyze DNA damage signaling in yeast mutants that impair triacylglycerol (TAGs) or sterol (STEs) esterification. They observe a slight anticipation in Rad53/CHK2 phosphorylation (indicative of DDR signaling) in yeast stem mutants, as well as in yeast cells or human cells lines pre-treated with oleate upon zeocin treatment. Yeast stem mutants are sensitive to zeocin and captothecin, but only confer sensitivity to hydroxyurea upon combination with tagD mutations. Authors relate these phenotypes to a somewhat decreases DSB resection in yeh2D mutants (expected to have reduced steryl esters pools) and RPA-foci in steD yeast cells. Next, a reduction in single strand annealing recombination repair events upon zeocin treatment is reported using a genetic reporter in steD mutants and oleate-treated cells. From these data they conclude that inability to process sterols in response to DSBs leads to an exacerbated DDR and prevents DNA repair. Next, it is shown that Flag-tagged Tel1 distinctly interacts with mono-phosphate phosphoinositides, including PI(4)P. An interaction in vivo is also inferred through Proximity Ligation Assays (PLA) using anti-PI(4)P and anti-ATM antibodies in human cell lines, which was moderately downregulated upon treatment with MMS or zeocin. Over-expression of the Osh4/OSBP1 transporter, which consumes PI(4)P, increased the number of Tel1 (nuclear) foci upon zeocin treatment. Conversely Sac1 ablation, in which accumulation of PI(4)P is expected, abrogated nuclear Tel1 foci formation and reduced telomere length (a phenotype related to lack of Tel1 function). From these results authors conclude that Tel1 availability in the nucleus is influenced by PI(4)P availability. Lastly, treatment with an OSBP1 inhibitor led to a cell line and damaging agent -variable reduction of ATM phosphorylation and a mostly non-significant reduction of DNA resection, measured by native BrdU detection, in response to CPT treatment. Overall, authors conclude that i) biding of Tel1/ATM to PI(4)P modulates its functional availability in the nucleus, and that ii) DNA damage elicits the esterification and storage of sterols toward LDs, which contributes to tritate Tel1/ATM away from the nucleus dampening the DDR and affecting long-range resection.

      Minor comments:

      • Figure S1D and E, experiments should be carried out to include time points in which LD accumulation and cell cycle arrest are observed upon zeocin treatment (i.e., up to 210' as in Figure 1A)

      • How do authors explain increased single strand annealing recombination frequencies in steD and oleate-treated wild type cells (Figure 4A). Should it not be expected that increased STEs also impair recombination induced by endogenous damage?

      • Data presented in figure 4B and 4C are not fully convincing. Performing time course experiments might help concluding if the differences observed represent a relevant defect in DSB processing.

      • Is Figure 5B referring to Flag-tagged Tel1 or GFP-tagged Tel1 as stated in the figure legend?

      • Treatment with the ATM inhibitor AZD0156 increased PI(4)P-ATM PLA signals. From these authors conclude that "association of ATM and PI(4)P inversely correlated with the need for ATM within the nucleus. Do they imply that treatment with ATM-inhibitors reduces the requirement for ATM function in the nucleus? The interpretation of this result should be further elaborated to sustain this conclusion.

      • An increase of GFP-Tel1 foci upon OSH4 overexpression is described on Figure 7B. These are described as nuclear in the results, but no reference is made in the figure or legend as to how nucleus positions are addressed in these experiments. This should be clarified. Also, WT controls and quantifications should be included in the experiments shown on Figure 7C.

      Major comments:

      • While the conclusion that Tel1/ATM binds PI(4)P and this interaction modulates Tel1/ATM functional availability at the nucleus is convincing, the conclusion that DSBs elicit a change in the metabolism of this lipid to "control" Tel1/ATM function is not demonstrated.

      • The notion that sterol processing occurs in response to DSBs is not sufficiently supported by the data presented, as the increase in LD numbers is observed much after activation of the DDR (Rad53 phosphorylation) in Zeozin-treated yeast cells. In addition, evidence is not provided on the mechanisms by which PI(4)P metabolism would be controlled, which would be expected to be DDR-independent as they are placed upstream of this signaling pathway in the author's model. The damaging agents used have been suggested to alter the redox metabolism and even lipid peroxidation (Kitanovic 2009, Mizumoto 1993, Krol 2015, Todorova 2015, Ren 2019, Singh 2014). Hence it is possible that PI(4)P changes are not due to DSBs, but an indirect though relevant effect.

      • In absence of direct evidence supporting an active regulation of PI(4)P dynamics in response to DNA breaks, this conclusion remains speculative and this should be noted in the manuscript.

      • Authors conclude that LD is specific to DSB induction. This seems an overstatement as they just reported LD increases in response to two agents that also induce other kinds of DNA damage. To also strengthen the link between DSBs and PI(4)P modulation of Tel1 function, authors should analyze LD numbers, Rad53 phosphorylation and Tel1 nuclear re-localization in response to HO-induced DNA breaks (e.g., using the system employed in Figure 3C)

      • In addition, on figure 5A, significant differences in GFP-Tel1 foci abundance between WT and steD or yeh2D cells are only observed after 210', way after the slight effect on Rad53 phosphorylation is observed. This is at odds with the conclusion that Tel1 association to STEs modulates DDR signaling.

      Minor comments:

      • Figure S1D and E, experiments should be carried out to include time points in which LD accumulation and cell cycle arrest are observed upon zeocin treatment (i.e., up to 210' as in Figure 1A)

      • How do authors explain increased single strand annealing recombination frequencies in steD and oleate-treated wild type cells (Figure 4A). Should it not be expected that increased STEs also impair recombination induced by endogenous damage?

      • Data presented in figure 4B and 4C are not fully convincing. Performing time course experiments might help concluding if the differences observed represent a relevant defect in DSB processing.

      • Is Figure 5B referring to Flag-tagged Tel1 or GFP-tagged Tel1 as stated in the figure legend?

      • Treatment with the ATM inhibitor AZD0156 increased PI(4)P-ATM PLA signals. From these authors conclude that "association of ATM and PI(4)P inversely correlated with the need for ATM within the nucleus. Do they imply that treatment with ATM-inhibitors reduces the requirement for ATM function in the nucleus? The interpretation of this result should be further elaborated to sustain this conclusion.

      • An increase of GFP-Tel1 foci upon OSH4 overexpression is described on Figure 7B. These are described as nuclear in the results, but no reference is made in the figure or legend as to how nucleus positions are addressed in these experiments. This should be clarified. Also, WT controls and quantifications should be included in the experiments shown on Figure 7C.

      Significance

      While the conclusion of lipid metabolism responding to DSBs is not convincing, the observation that Tel1/ATM function is modulated by PI(4)P biding is significant and advances the understanding on the function and regulation of this key kinase in promoting genome integrity maintenance. This is an unanticipated result which is highly novel and has implications for the modulation of Tel1/ATM function through pharmacological manipulation of lipid metabolism. This finding would be of broad interest for scientists working on the response to DNA damage and the maintenance of genome integrity. This reviewer belongs to that group and has limited expertise to evaluate the lipid metabolism genetic manipulation in the manuscript.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 Virant and colleagues have devised well-thought-out experimentation and analysis pipelines to obtain unbiased measurements of kinetochore protein counts and distances from the centromeric histone known as Cnp1 in fission yeast. My concerns with this study are mainly regarding the clarity of some of the data analysis strategies and data presentation. The authors should be able to address these concerns without new experimentation.

      We would like to thank reviewer 1 for their valuable comments. We have addressed all comments and highlighted the changed sections in our revised manuscript.

      1. Segmentation of individual centromeres: In general, the authors are to be commended for including a detailed description of their procedures and analysis method. However, it wasn't readily clear to me exactly how they segmented individual centromeres. The lack of a consistent offset between the fluorescence spots corresponding to the protein of interest and Cnp1 in image in Figure 1 makes this issue even more confusing. It will help to display representative segmented individual kinetochores either in the main figure or a supplementary figure.

      We thank reviewer #1 for this comment and highly appreciate that they value our effort for detailed method descriptions. We strongly agree: Correct segmentation and best-possible visualization are crucial for our analyses. We hope that the following clarifications help to understand our work better:

      All images of the manuscript are reconstructed from localization files using Rapidstorm, which linearly interpolates the localizations on a subpixel grid and fills the pixels based on the distance between the localization and the center of the main subpixel bin to avoid discretization errors (Wolter et al., 2012). These images are then overlaid with a Gaussian blur corresponding to their localization uncertainty. In our opinion, this procedure gives the most realistic image impression of the data and results in reconstructed images that as best as possible mimic real fluorescence images. This is in our opinion a very important aspect when presenting SMLM data in reconstructed images as it is crucial that one - when looking at those images - does not over- or under-interpret the SMLM data. We extended the explanation in the manuscript: “For visualization, we aimed to reconstruct SMLM images that neither over- nor under-interpret the resolution of the SMLM data and resemble fluorescence images as closely as possible. Localizations were tracked together using the Kalman tracking filter in Rapidstorm 3.2 with two sigma, and the NeNA value used as sigma. Images were then reconstructed in Rapidstorm 3.2 with a pixel size of 10 nm. Rapidstorm linearly interpolates the localizations and fills neighboring pixels based on the distance between the localization and the center of the main subpixel bin to avoid discretization errors (Wolter et al., 2012). These images were then processed with a Gaussian blur filter based on their NeNA localization uncertainty in the open-source software ImageJ 1.52p (Schindelin et al., 2012). Importantly, images were only used for image representation purposes, all data analysis steps were conducted on the localization data directly (see data analysis).”

      Importantly, individual centromeres were segmented not on the images but on the localization data directly which we visualized in a self-written 3D visualization software that has the functionality to e.g. zoom in and out and to switch or overlay channels. This flexible visualization tool allowed us to make the best-possible informed decisions for the cluster selections and pairing of POI/cnp1CENP-A pairs. We extended our explanation by: For several manual steps, localizations were visualized in a custom software, which allows to zoom in/out flexibly and to switch/overlay between the sad1/POI/ cnp1CENP-A channels. Using this tool, individual localizations could be selected and classified. For channel alignment, localizations belonging to the same fiducial marker in all three channels were grouped together. Cells with visible kinetochore protein clusters in the focal plane were selected and classified as individual region of interests (ROIs) and all clusters were annotated. cnp1CENP-A clusters were paired together with corresponding POI clusters. Whenever there was any doubt whether two clusters belonged to the same kinetochore or whether a cluster represented a single centromere region or several, the clusters were discarded. Two exemplary data sets can be found in the zip-file Supplementary Data 1. The annotation work was quality-checked by cross-checking the annotation of two different persons.” Maybe interesting to add is that we initially and extensively tried several ways to fully automate the annotation. As all tested routines could not reach the quality of manually annotated data and had to a large extend be manually rechecked and corrected, we in the end directly annotated the data manually. We were rigorous in case of doubt: “Whenever there was any doubt whether two clusters belonged to the same kinetochore or whether a cluster represented a single centromere region or several, the clusters were discarded (manuscript page 10, section Visualization and Manual Analytics).”

      Furthermore, the reviewer is right, we didn’t include too much visual data/results into our manuscript. We now showcase some examples for our annotation. In the zip-folder “Supplementary Data 1”, the reviewer will find two csv SMLM data examples of annotated localization data of cells with a mitotic spindle that passed the quality checks of drift control and channel overlay. For the first example, a cell with dam1 as POI, all 6 kinetochores are visible and all were grouped and separated from noise. One can also nicely see (due to the large distance of dam1 to cnp1CENP-A) which of the six belong to which spindle by the spatial orientation of the cnp1CENP-A and dam1 cluster to each other, and both groups of three have a pair that is spatially closer to the wrong pole, important for question 2 below. The second example, with spc7 as the example POI, has only 5 visible clusters. A closer look shows that one of them has unusual dimensions and a high number of localizations, representing most likely two overlapping centromeric regions. Thus, for this cell, only 4 kinetochore pairs were annotated for further analysis. Also, the cluster pairs are overlapping much more, so a direct decision to which spindle they belong to gets difficult.

      Finally, we realized that the example cell used for figure 1 which we chose a long time ago for representative purposes actually is a dataset that did not pass the final quality controls of drift correction and channel overlay. Thus, it is not part of the data that was used for the results of this work. While it is pretty embarrassing that we did not realize earlier, we are really grateful for the reviewers’ question about it. We now replaced it by another example which nicely represents the data that went into the analysis and also represents the biological heterogeneity, not only in offset but also e.g. in shape: In this work, we simplify by ignoring any shape in our current analysis and only use cluster centroid distances. Kinetochore POI cluster shapes are currently investigated in a more detailed follow-up study.

      1. Use the mixture coefficient: The authors use the coefficient λ to create a mixture model for the Bayesian inference of distances. The description provided in the methods section is not sufficient for an average reader to understand how this coefficient is ultimately used (I had to look up the code and then the Stan manual for a superficial understanding of this procedure). It will be very helpful to flesh out this part of the model. Similarly, notation for the model that they use should be included either in the Methods or in supplementary data so that casual readers can get some understanding of the model.

      We added the following sentence to explain the significance of the mixture coefficient:

      “The coefficient then corresponds to the prior probability that the centromere is attached to the first spindle pole.”

      We are not sure whether we understand the last sentence of this comment correctly. We added more explanations and definitions to the Stan code (see kinetochore.stan) to make it easier to understand. However, the section “Distance calculation” is intended to give the reader a full understanding of all relevant parts of the model, a look at the code should therefore not be necessary. The Stan implementation of the model uses non-centered parameterization, which might make it appear more complex than what is described in the text. However, this is an implementation detail that is intended to make the posterior more well-suited for Monte Carlo estimation and does not change the underlying statistical model. It should therefore be of no concern to casual readers. Finally, we prepared a new Supplementary Figure S6 to visualize the model for all readers.

      1. The regional centromeres in fission yeast can incorporate varying levels of Cnp1 depending on its expression level (e.g. see Aravamudhan et al. 2013 Current Biology, Joglekar et al. 2008 JCB). Much of this "extra" Cnp1 is likely to be incorporated at sites distal to the Cnp1 molecules that directly nucleate the kinetochore. Therefore, the centroid of Cnp1 molecules is likely to be "shifted" to some extent from the foundation of the kinetochore. Any shift in the Cnp1 centroid will be important especially when comparing the fission yeast measurements with the budding yeast data. The authors should ascertain whether such a shift can be detected by comparing the budding yeast and fission yeast measurements.

      The reviewer is absolutely right, there is an active discussion in the field to which extend there are cnp1CENP-A molecules at distal sites and thus not part of the platform for the kinetochore structure. While different quantification methods in the literature partly disagree in cnp1CENP-A numbers, we have no indications that our own assessment by PALM imaging is wrong. In this study, we get a very stable (Suppl. Figure S9 b) cnp1CENP-A read-out by PamCherry1, which agrees with our Lando et al. 2012 study (a single color study with a different, much simpler protocol and using a different microscope with different settings and analysis routines, mainly using mEos2 but also some SI data on PamCherry1 for counting cnp1CENP-A molecules). Furthermore, the recombinant fusions in the native locus of cnp1CENP-A are stable and the strains show no signs of growth or phenotypic defects.

      While we therefore can argue that we see a native level and undisturbed distribution of cnp1CENP-A, we nevertheless do not know how much of this cnp1CENP-A is involved in building up the kinetochore. What we believe to know is the following:

      1. With our ChipSeq data in Lando et al. 2012 we explored the distribution and read-out hits of cnp1CENP-A within the outer repeats as well as the inner centromeric region of all three chromosomes (Figure 3 and SI in Lando et al. 2012). While cnp1CENP-A is highly populated within the ~ 10 to 15 kb large inner centromeric regions, there are less detections in the outer regions. Thus, while ChipSeq is not a quantitative method, we believe it’s showing the correct trend with some cnp1CENP-A in the outside regions but most cnp1CENP-A localizing in the inner region. We believe that in overexpression studies (like e.g. done both in Joglekar et al. 2008 & Aravamudhan et al. 2013), this most likely will differ (but we did not experimentally explore this).
      2. We generally would argue that our distances are rather accurate using a symmetry and compaction argument. The about 10kb inner regions are a roughly 3.4 µm long DNA strands at a linear 1-dimensional scale but in vivo are highly compacted. Total kinetochore sizes as seen in EM data for mammalian cells are “approximately 250 nm wide and 80 nm deep, with an electron-opaque inner plate juxtaposed to the centromeric chromatin, a translucent gap layer, and an electron-opaque, chromatin-distal outer plate apparently embedding the plus ends of spindle microtubules” (Musacchio and Desai, Biology 2017, 6(1), 5; 10.3390/biology6010005) and for * pombe also in the 200 nm range (Ding et al., J Cell Biol (1993) 120 (1); 10.1083/jcb.120.1.141). We evaluated the cnp1CENP-A cluster shapes as seen in our SMLM data. The clusters show a major axis length of 218 ± 88 nm and a minor axis length of 110 ± 29 nm on average. Taking into account our NeNA localization precisions, this is in nice agreement with the EM data measuring the lateral extend of the kinetochore structure. All together, we would argue a) that there is no reason or any indication in the literature that cnp1CENP-A not directly involved in kinetochore nucleation preferably gets incorporated on only one of the distal sites and thus would cause an asymmetry. We rather would argue that they are randomly inserted on both sides at low level and thus keep the symmetry needed to determine the center of the cnp1CENP-A cluster involved as the kinetochore platform. We also would argue b) that the structure is highly compacted and thus errors caused by additional cnp1CENP-A molecules will be small in respect to our resolution. We cannot completely exclude that there is such an effect that would increase the measured distances, however, and given all other sources of error (drift correction, channel alignment, cluster selection etc.) this most likely is not the main factor in defining the widths of the posterior distributions as we obtain them (Suppl. Figure S7). This argument is supported by the fact that we do not see any indication of such an effect for cnp1CENP-A-close proteins. We carefully checked fta2, fta7 and cnp20 data and also included some examples for the reviewer (see innerPOIexamples.zip). We hope that the reviewer agrees with us that there is no indication for a systematic asymmetric offset between the POI and cnp1CENP-A clusters. Finally, our distance numbers nicely agree with the distances the Ries group has measured for S. cerevisiae in the co-submitted manuscript. They a) have a point centromere with presumably only one kMT and b) did not use cnp1CENP-A *as their reference, they used spc7KNL1 (spc105).

      Virant and colleagues present a rigorous single-molecule localization-based analysis of the kinetochore protein copy number and organization within the fission yeast kinetochore. Although the fission yeast kinetochore has been extensively studied, the spatial organization of its kinetochore components has remained uncharacterized. This manuscript addresses this deficiency, and in concert with the budding yeast study, highlights the conserved and diverged features of the kinetochore in the two yeast species. Therefore, this manuscript will be of great interest to the kinetochore and cell division field.

      We would like to thank the reviewer again for their very helpful and highly constructive review.

      Reviewer #2

      In this study, Virant and colleagues have applied single molecule localization microscopy to map the positions of proteins in the pombe kinetochore. This has not been reported previously and this study is both well-conducted and the data appear solid. They also use a modification of this technique to assess the stoichiometry of kinetochore proteins. The results that they obtain are broadly in line with several previous studies that use other methodology but not in fission yeast nor to this level of detail. There are some important novel conclusions from this work. I would like the authors to address the following concerns prior to publication:

      We would like to thank Reviewer #2 for their appreciation of our work and their helpful remarks regarding our manuscript which we will answer below.

      1. It is not clear to me why the sad1-Scarlet-I signal in Figure 1C, displays a grid-like pattern? This must be an artefact of image collection or processing. Could the authors explain this pattern since this may affect the ability to find a centroid position of this signal?

      Thanks a lot for this comment. Yes, the grid-like pattern the reviewer observed is an artefact from image processing when compiling the exemplary composite 3-color images. We revisited the raw movie data and changed the procedure to produce the exemplary images to avoid this ugly artifact (which did not influence our data analysis as it is not present in the raw movies used for centroid determinations). Please note: we also changed the example cell for figure 1 due to reasons explained in the answer 1 to reviewer 1.

      1. It is my understanding that the distances reported are based on the positions of the proteins in one dimension, along the spindle axis, consistent with other studies and as illustrated in Figure 1b. This should be clearly stated in the results section.

      The model underlying our Bayesian inference is that cnp1CENP-A-POI and one of the two sad1 spindles are all on one “mitotic” axis, along the kinetochore microtubule. BUT the orientation of this mitotic axis is NOT necessarily parallel to the spindle axis. See Figure 1a, the in red drawn spindle axis is not necessarily parallel to the green drawn microtubules connecting the kinetochores to the spindle poles. (Please note, thanks to this comment, we found that our original sketch of Figure 1a was misleading and corrected for that.). Figure 1b in this respect is a bit misleading as only one spindle pole is shown. The slight difference between the kinetochore and spindle axis cannot be visualized with only half a spindle. For answering the reviewers comment no. 5, we also now plotted the measured offset from and relative position of cnp1CENP-A cluster centroids to the spindle axis, see below.

      1. The distances between proteins in this study are measured during anaphase, whereas the distances measured in cerevisiae previously were in both metaphase and anaphase (Joglekar et al 2009) and in the accompanying manuscript (Cielinski et al) in metaphase. In the comparison of distances, it would be worth describing how the mitotic stage may have affected distances, since Joglekar et al, found significant positional changes in cerevisiae kinetochore proteins from metaphase to anaphase.

      We would like to thank the reviewer for this comment. It sparked a longer discussion to precisely disentangle the cell cycle states. We afterwards went carefully through the literature again and added a column to Table S4 stating for which cell cycle phase(s) the individual works were conducted which we believe is highly useful when comparing the different data.

      Discussing with the Ries group we made sure that we indeed measured the cells in the same state and that we in our manuscript made mistakes in defining it correctly. Important for S. pombe is, that while it is not possible to decide for 100% between metaphase and anaphase A, we can safely exclude anaphase B so that we can state that we did not image anaphase B: Supplementary Figure S10 shows that all spindle distances measured are smaller than one nuclear diameter which is given by 2-3 µm (MacLean 1964, 10.1128/jb.88.5.1459-1466.1964; Tda et al 1981, 10.1242/jcs.52.1.271) plus we ourselves measured a nup132 strain in early G2 phase and obtained 2.4 µm ± 0.19 µm (data not shown)). This is nicely in line with Joglekar et al. 2009 (this paper actually has a very good SI figure on exactly this topic, see Fig. S1) and corrected the manuscript accordingly. Thanks for pointing this out to remove this lapse in definitions.

      __ 4. __It is hard to interpret the POI copy numbers in terms of each kMT. I am assuming that each cluster measured represents a single pombe kinetochore, containing 2-4 kMTs? If we assume that each pombe kinetochore can contain 2, 3, or 4 kMTs, then we might expect to see a trimodal dataset, I am guessing this was not seen in the data? Would it be possible to estimate protein numbers per kMT in Table 2, as done for the Cielinski et al study? I realize this would require an estimate of the number of kMTs per kinetochore. Alternatively, the authors are resolving individual kMTs, in which case this should be made clear.

      Yes, the reviewer is absolutely correct, the clusters are associated with 2-4 kMT (as nicely resolved in Ding et al, J Cell Biol (1993) 120 (1); 10.1083/jcb.120.1.141). Thus, we can assume that 2-4 kinetochore structures are also involved per centromeric region. In our current analysis, we have to work with the average of 2-4 kMTs. The shapes of POI and cnp1CENP-A clusters we have in the SMLM data are definitely diverse, and we plan to extract more data on spatial distribution in the future, perhaps even at the level of individual centromeric regions, but we did not systematically explore shapes in this work. Thus currently, we cannot give a precise answer how individual regional kinetochores look like at the level of a single kinetochore, but we strongly agree with the reviewer that this will be highly interesting to explore further. In this manuscript, therefore, to compare the stoichiometries between the point centromere of S. cerevisiae and the regional centromere of S. pombe, we used ratios as given in Suppl. Table S4. These ratios provided us with comparable results across a wide range of literature since the ratios are only calculated internally for each study and not across studies (which would lead to compatibility issues).

      For this study, we also labeled MT via atb2. Unfortunately, the SMLM experiments were very difficult as atb2 is also present everywhere else in the nucleus, in particular at the dense central MT bundle (see image below, white sad1-mScarlet-I, blue PamCherry1- cnp1CENP-A, and red mEos3.2-A69T-atb2). Thus, we could not resolve such fine details as single fibers for the kMTs: Most kMTs overlapped with the central fiber and due to the dense central MT bundle of atb2, the data of the atb2 channel could not be read-out neither in a quantitative nor complete way and we could not extract which percentage of atb2 molecules we actually successfully recorded in the SMLM data. Thus, especially visualizing fine fibers was difficult and the images obtained do not meet our quality standards – but the exemplary one below is maybe nevertheless informative in a qualitative way. Our current idea would be to use ExM plus SMLM, but this work would be a stand-alone study requiring the set-up and optimization of such a protocol.

      1. The same kMT issue may affect the measurements of distances. Each pombe kinetochore contains multiple kMTs and it is not clear whether these would align perfectly on the spindle axis. Did the authors see anything in their data that would support the notion that individual kMTs are aligned on the axis (as illustrated in Figure 2) or whether they are slightly separated? This is itself a potentially important result.

      It is important to note that we did not measure the distance in projection of the spindle axis (defined as sad1-sad1 centroid axis), see also question 2. We can show in our data that the main microtubule bundle between the spindles is angled to the kinetochore microtubules that connect the centroids of our three-color channels for each centromeric region in a sad1-POI-cnp1 axis. For sad1, we cannot simply determine which one of the two spindles is the correct one, thus we implemented a mixture model for our Bayesian model, see also answers to reviewer 1).

      While in the budding yeast literature it has been measured by EM that there are only small angles of up to 6° between the two axes present (Joglekar et al. 2009, 10.1016/j.cub.2009.02.056, Figure S1), Ding et al. 1993 showed larger deviations for S. pombe. Our data agrees with these findings. In the new Suppl. Figure S12, we plotted the height of all measured cnp1CENP-A centroids, normalized to the spindle lengths to represent the angular distribution between the spindle and kMT axes and in absolute nanometer distances to show that most kinetochores are in direct vicinity to the central bundle and only few show heights larger than 150 nm (also technically important as our focal z-range is ~600 nm).

      1. In all measurements of kinetochore protein intensity (both in this study and previous studies) there seems to be significant variation in the data for individual kinetochores, even for S. cerevisiae, which supposedly has a fixed number of the kMTs. The coefficient of variation is ~ 0.5 in the data shown in Table 2. Could the authors discuss the variability in POI copy numbers since it either reflects an inability to measure protein levels accurately or that there is some flexibility in kinetochore protein stoichiometry (or in this case differing numbers of kMTs per kinetochore - see point above)?

      Regarding the variability in our counting data we indeed expect a mixture of biological and technical nature in line with what the reviewer argues above but intuitively would lean towards the latter, being technically limited by the read-out precision of quantitative PALM imaging using fluorescent proteins, which have finite maturation- and read-out efficiencies and possess limited signal-to-noise contrast. When discussing this comment and how we possibly could support our intuition by evidence, we realized that the main argument to be made is that the FtnA oligomer is a biologically highly defined structure and that the variance we obtain for our FtnA calibration standard indeed also directly can serve as a proxy for our technical variance. Using the results of 21.63 counts ± 10.2 STD for the 24mers and of 7.27 counts ± 2.72 STD for the 8mers, we thus can estimate that the technical variance causes a coefficient of variance of 0.35 to 0.5, thus almost completely explaining the experimentally seen coefficient of ~ 0.5. we added this argument to our manuscript by “The POI protein copy numbers as given in Table 2 show a large coefficient of variation. To assess to which extend this variability reflects a technical inability to measure protein levels accurately or some flexibility in kinetochore protein stoichiometry (e.g. due to differing numbers of kMTs per kinetochore), we can use the data of the FtnA oligomer counting standard: The FtnA oligomer is a biologically highly defined structure. Thus, our FtnA measurements can directly serve as a proxy for the contribution of the technical inaccuracy of our PALM imaging and analysis strategy to the variance. Using the results of 21.63 counts ± 10.2 STD for the 24mers and of 7.27 counts ± 2.72 STD for the 8mers, we can estimate that the technical inaccuracy causes a coefficient of variance of 0.35 to 0.5, thus almost completely explaining the experimentally seen coefficient of ~ 0.5 for our POI data (Table 2). Due to this high technical inaccuracy, we cannot resolve sub-populations of possibly different kinetochore structures (and thus POI copy numbers) on 2-4 kMTs in our current counting data (Supplementary Figure S9).”. Secondly, as the reviewer also pointed out earlier, we have a biological variance of 2-4 kMTs and thus must assume smaller and larger kinetochores on individual centromeres. Due to the high technical variance, we nevertheless cannot resolve sub-populations in our current counting data (please also compare to the data in Supplementary Figure S9).

      Minor points:

      Delete "an" from "...structure at an about 100 nm resolution" (page 3).

      Thanks!

      In Figure 2 the proteins in the schematic are color coded, but it is not clear what the coloured proteins are in all cases. Would it be possible to color code the adjacent text, e.g. Spc7 in orange.

      Thanks for this suggestion, adjusted figure accordingly.

      Also in this figure, the POI copy numbers are indicated by color coding of the data points. However, the points will likely be too small in the final figure for these colors to be clearly visible. Perhaps copy numbers could be indicated in another way or the "mean value" boxes could be larger?

      We tested several options and decided to adjust the widths of the boxes.

      Please define "N" in Table 2. e.g. N = number of kinetochores measured.

      We added this information to the caption: “N number of centromeric regions analyzed.”

      Reviewer #2 (Significance (Required)):

      This manuscript, together with an accompanying one from Cielinski et al., are nice complementary studies that provide the first single molecule localization studies of the yeast kinetochore. Although other labs have used super-resolution methods to study individual kinetochore proteins; both of these new studies map distances between many proteins at the kinetochore and thus are able to produce maps of the overall kinetochore structure. Like the previous study using standard resolution methods (Joglekar et al, 2009. Current Biology 19, 694-699); these studies will likely provide a benchmark for future studies on eukaryotic kinetochore architecture, including those in mammalian systems. Additionally, this work will appeal to super-resolution microscopists.

      My expertise is as a yeast kinetochore cell biologist.

      We would like to thank Reviewer #2 again for their appreciation of our work and their valuable remarks and discussion points which improved our manuscript substantially during the revision phase.

      Additional comments for both reviewers:

      As the co-submitted manuscript from Cieslinski et al. 2021 re-analyzed all their distances and numbers during the revision phase, we updated the comparisons in Suppl. Table S4, S5 and our summary text: 1) their cnn1 distance measurement got corrected and no shows no deviation anymore to our data. Also, their protein copy numbers changed slightly. So we changed the summary from “Additionally, two organism-specific differences surfaced: cnp20CENP-T (cnn1) is located between spindle pole and cnp1CENP-A in our case (at similar distance as fta2CENP-P and fta7CENP-Q), whereas Cieslinski et al. position cnn1 (and mif2) behind cnp1CENP-A. Furthermore, the ratio of cnp20CENP-T to COMA is 1:0.9 in our case and 1:2.1 for S. cerevisiae“ to “Importantly, one substantial organism-specific difference for the inner kinetochore strategy surfaced: The ratio of cnp20CENP-T to COMA is 1:0.9 in our case and 1:2.0 for S. cerevisiae.” 2) Additionally, we were able to add a discussion about their new measurement of ask1 of the dam1 complex. Ask1 is a protein of the DASH ring. Their new distance measurement of S. cerevisiae ask1 fits to the distance we measure for S. pombe dam1 and thus supports our discussion that, for S.pombe, the C-terminus of dam1 localizes at the DASH ring and not at the ndc80 heads like for S. cerevisiae. We added this sentence to the summary: “Furthermore, the S. cerevisiae work measured the position of ask1, a protein of the DASH ring. Their positioning of S. cerevisiae ask1 is consistent with the distance we measured for S. pombe dam1 and thus directly supports our reasoning that the C-terminus of S. pombe dam1 is localized to the DASH ring and not to the ndc80 heads (like for S. cerevisiae).”

      We found that we – very unfortunately - did a calculation mistake ourselves and used the inverse of the correction factor (multiplied with 0.9 instead of dividing with 0.9) when correcting our SMLM localizations to absolute protein counts. Thus, the numbers we gave in Table 2 and the color bar in Figure 2 were wrongly converted. We now corrected for this lapse.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Virant and colleagues have applied single molecule localization microscopy to map the positions of proteins in the pombe kinetochore. This has not been reported previously and this study is both well-conducted and the data appear solid. They also use a modification of this technique to assess the stoichiometry of kinetochore proteins. The results that they obtain are broadly in line with several previous studies that use other methodology but not in fission yeast nor to this level of detail. There are some important novel conclusions from this work. I would like the authors to address the following concerns prior to publication:

      1. It is not clear to me why the sad1-Scarlett-I signal in Figure 1C, displays a grid-like pattern? This must be an artefact of image collection or processing. Could the authors explain this pattern since this may affect the ability to find a centroid position of this signal?
      2. It is my understanding that the distances reported are based on the positions of the proteins in one dimension, along the spindle axis, consistent with other studies and as illustrated in Figure 1b. This should be clearly stated in the results section.
      3. The distances between proteins in this study are measured during anaphase, whereas the distances measured in cerevisiae previously were in both metaphase and anaphase (Joglekar et al 2009) and in the accompanying manuscript (Cielinski et al) in metaphase. In the comparison of distances, it would be worth describing how the mitotic stage may have affected distances, since Joglekar et al, found significant positional changes in cerevisiae kinetochore proteins from metaphase to anaphase.
      4. It is hard to interpret the POI copy numbers in terms of each kMT. I am assuming that each cluster measured represents a single pombe kinetochore, containing 2-4 kMTs? If we assume that each pombe kinetochore can contain 2, 3, or 4 kMTs, then we might expect to see a trimodal dataset, I am guessing this was not seen in the data? Would it be possible to estimate protein numbers per kMT in Table 2, as done for the Cielinski et al study? I realise this would require an estimate of the number of kMTs per kinetochore. Alternatively, the authors are resolving individual kMTs, in which case this should be made clear.
      5. The same kMT issue may affect the measurements of distances. Each pombe kinetochore contains multiple kMTs and it is not clear whether these would align perfectly on the spindle axis. Did the authors see anything in their data that would support the notion that individual kMTs are aligned on the axis (as illustrated in Figure 2) or whether they are slightly separated? This is itself a potentially important result.
      6. In all measurements of kinetochore protein intensity (both in this study and previous studies) there seems to be significant variation in the data for individual kinetochores, even for S. cerevisiae, which supposedly has a fixed number of the kMTs. The coefficient of variation is ~ 0.5 in the data shown in Table 2. Could the authors discuss the variability in POI copy numbers since it either reflects an inability to measure protein levels accurately or that there is some flexibility in kinetochore protein stoichiometry (or in this case differing numbers of kMTs per kinetochore - see point above)?

      Minor points:

      Delete "an" from "...structure at an about 100 nm resolution" (page 3).

      In Figure 2 the proteins in the schematic are color coded, but it is not clear what the coloured proteins are in all cases. Would it be possible to color code the adjacent text, e.g. Spc7 in orange. Also in this figure, the POI copy numbers are indicated by color coding of the data points. However, the points will likely be too small in the final figure for these colors to be clearly visible. Perhaps copy numbers could be indicated in another way or the "mean value" boxes could be larger?

      Please define "N" in Table 2. e.g. N = number of kinetochores measured.

      Significance

      This manuscript, together with an accompanying one from Cielinski et al., are nice complementary studies that provide the first single molecule localization studies of the yeast kinetochore. Although other labs have used super-resolution methods to study individual kinetochore proteins; both of these new studies map distances between many proteins at the kinetochore and thus are able to produce maps of the overall kinetochore structure. Like the previous study using standard resolution methods (Joglekar et al, 2009. Current Biology 19, 694-699); these studies will likely provide a benchmark for future studies on eukaryotic kinetochore architecture, including those in mammalian systems. Additionally, this work will appeal to super-resolution microscopists.

      My expertise is as a yeast kinetochore cell biologist.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Virant and colleagues have devised well-thought-out experimentation and analysis pipelines to obtain unbiased measurements of kinetochore protein counts and distances from the centromeric histone known as Cnp1 in fission yeast. My concerns with this study are mainly regarding the clarity of some of the data analysis strategies and data presentation. The authors should be able to address these concerns without new experimentation.

      1. Segmentation of individual centromeres: In general, the authors are to be commended for including a detailed description of their procedures and analysis method. However, it wasn't readily clear to me exactly how they segmented individual centromeres. The lack of a consistent offset between the fluorescence spots corresponding to the protein of interest and Cnp1 in image in Figure 1 makes this issue even more confusing. It will help to display representative segmented individual kinetochores either in the main figure or a supplementary figure.
      2. Use the mixture coefficient: The authors use the coefficient λ to create a mixture model for the Bayesian inference of distances. The description provided in the methods section is not sufficient for an average reader to understand how this coefficient is ultimately used (I had to look up the code and then the Stan manual for a superficial understanding of this procedure). It will be very helpful to flesh out this part of the model. Similarly, notation for the model that they use should be included either in the Methods or in supplementary data so that casual readers can get some understanding of the model.
      3. The regional centromeres in fission yeast can incorporate varying levels of Cnp1 depending on its expression level (e.g. see Aravamudhan et al. 2013 Current Biology, Joglekar et al. 2008 JCB). Much of this "extra" Cnp1 is likely to be incorporated at sites distal to the Cnp1 molecules that directly nucleate the kinetochore. Therefore, the centroid of Cnp1 molecules is likely to be "shifted" to some extent from the foundation of the kinetochore. Any shift in the Cnp1 centroid will be important especially when comparing the fission yeast measurements with the budding yeast data. The authors should ascertain whether such a shift can be detected by comparing the budding yeast and fission yeast measurements.

      Significance

      Virant and colleagues present a rigorous single-molecule localization-based analysis of the kinetochore protein copy number and organization within the fission yeast kinetochore. Although the fission yeast kinetochore has been extensively studied, the spatial organization of its kinetochore components has remained uncharacterized. This manuscript addresses this deficiency, and in concert with the budding yeast study, highlights the conserved and diverged features of the kinetochore in the two yeast species. Therefore, this manuscript will be of great interest to the kinetochore and cell division field.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Please note we have uploaded a PDF with the point to point reply.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Wang et al. present an evaluation of a new generation of time-of-flight-based mass spectrometer that improves on the fraction of ions factually used for detection of peptide analytes, thus boosting the sensitivity of the Zenotof 7600 system when compared to the same instrument with the duty-cycle-enhancing Zenotrap module disabled and also when compared to the previous generation instrument of the same vendor in some of the comparisons.

      The authors position the MS acquisition technique as particularly suitable in combination with medium (micro-) and high ('analytical') flow and throughput methods where higher flow rates (vs. conventional nanoflow-LCMS) allow rapid sample turnover and high throughput, yet limit the efficiency of electrospray and ion transfer into the MS system, thus being in dire need for enhanced sensitivity of the MS system employed for detection. The competency of such an MS system for very low input materials as e.g. encountered in emerging single-cell proteomic workflows, typically employing nanoflow chromatography, was thus not part of the study.

      Accordingly, a medium- (micro-flow) and very high ('analytical'-flow) throughput LC method were screened on the three MS (parameter) setups using human cell lysate digests typically utilized in such technical evaluations. Well-received, the authors further extended their analysis (for the new instrument) across additional sample types of clinical and extended biological interest and spanning different levels of complexity and dynamic range of contained protein analytes.

      In addition, the authors also performed a controlled ratio 2-species mixture experiment which allows detailed benchmarking of proteome coverage as well as the quality of protein quantification in a known differential comparison for the medium throughput (micro-flow) method.

      The data quite convincingly demonstrate an increased sensitivity of the instrument based on similar identification performance in DIA bottom up proteomics from ca. 3- to 8-fold lower input peptide mass. However, I see a number of shortcomings mainly in the presentation and in part the completeness of the work, with specific comments below.

      Major comments:

      • Are the key conclusions convincing?

        • The concluded 10x sensitivity increase is overstating the observed numbers (x5-x8). In addition, the authors should at least discuss other changes than the Zeno trap incurred in the Zeno SWATH vs non-Zeno-SWATH DIA setups, particularly changes in accumulation times per m/z range, with Zeno Swath accumulating ~42 % longer per cycle spanning the same m/z range (85 vs 60 windows with 11ms per window) in the uflow method set and ~ 18 % longer in the high-flow method set (same window number but 13 ms vs. 11 ms dwell time per window). This should be discussed as one of the optimizations/factors contributing to the increased sensitivity observed in Zeno Swath measurements vs conventional SWATH. On that note, it was unclear to me when and where the 40 variable window SWATH method mentioned in the methods was used and where the settings can be found.
        • Since injected material is a critical parameter here, it would be good if it was mentioned also with the key conclusion on the increased number of confidently quantified peptides in microflow (based on the 2-species controlled quantity experiment).
        • Conclusions 'increasing protein identification numbers through the use of analytical-flow-rate chromatography' does not capture the observed data; the use of analytical-flow-rate does not convey an increase in protein identification numbers but enhanced sensitivity rather enables the maintenance of high protein identification numbers / proteome coverage despite/concurrent with analytical-flow chromatography
        • In titration curve experiments like these, probing proteome coverage from relatively small sample amounts, special care
        • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

        • 'Zeno SWATH increases protein identification in complex samples<br /> 5- to 10-fold when compared to current SWATH acquisition methods on the same instrument' - At no point this is shown, a decrease of required input amounts by 5-8-fold (increase in sensitivity) is shown by the data, not a multiplication of protein identification rates by that factor.

        • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

        • Figure 1f, Supplemental Figure 1b, Figure 3 and Supplemental Figure 3 lack data for the Zeno SWATH method's performance at higher concentration. Given the fact that there is a clear, continuous trend of significant enhancement of proteomic depth in the highest 3 concentrations sampled by the Zeno SWATH method, I lack an assessment of the upper limit of proteome coverage achievable by the new platform when input material is not limited, or at least learn why injecting more is not advisable on the ZenoTOF 7600 system. It is clear that the region of interest is the lower loads where sensitivity gains are most pronounced, but with the strong trend in IDs per ng injected in the sampled range and discrepant range sampled by the non-Zeno method I feel there is a gap in the dataset and the upper ceiling of proteome coverage could be mapped out more thoroughly (At least for human cell lysate and possibly human plasma where trends appear most (log2-)linear).

        • Similarly, unless constrained for technical or practical reasons, I would suggest to find the ceiling for achievable proteome depth in analytical flow (4, 8 ug?)
        • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

        • All these should be re-injections of existing samples on these MS setups and a minor effort provided instrument availability (<1w) and rapid re-analysis via DIA-NN.

        • Are the data and the methods presented in such a way that they can be reproduced?

        • The raw data have not been deposited to a public repository. Reproducibility of the study would benefit significantly by raw data (including search results and spectral libraries with log files of creation) upload/sharing e.g. via ProteomeXchange/PRIDE.

        • If any software versions or firmwares on the hardware are required to perform the measurements on the ZenoTOF on the market today, these versions and prospective release dates should be included or the accessibility of these settings commented on.
        • Are the experiments adequately replicated and statistical analysis adequate?

        • Figures 3 and Supplemental Figure 3 need a clarification in the legend as to the nature and origin of ID numbers (mean? Number of replicates? Add error bars if possible)

        • The usage of DIA-NN for data analysis is somewhat unclear, in particular the in Methods/Spectral libraries "For the analysis of plasma samples, a project-independent public spectral library [29] was used as described previously [15]. The Human UniProt [30] isoform sequence database (UP000005640, 19 October 2021) was used to annotate the library and the processing was performed using the MBR mode in DIA-NN." The authors should address in a revised version whether the identification numbers reported stem from two-pass or single-pass analysis (i.e. when the feature termed Match-between-runs implemented since DIANNv1.8 was enabled and whether all runs, spanning different injection amounts were co-analyzed and data-re-queried for a targeted library containing precursors identified in high load samples in first pass analysis and then queried in low-load samples. In other words, are the low-load IDs independent of high load IDs? If not (i.e. the different loads were co-analyzed with MBR), what proteome coverage to the low sample loads reach bona fide, without the 'guidance' of high-load IDs?

      Side note: Turning this around, could a high-load injection e.g. from a pool of limited-amount samples serve as a guiding element in a MBR-enabled analysis of a large cohort with limited sample amounts available per biological condition?

      Minor comments:

      • Specific experimental issues that are easily addressable.

        • The authors state the impact on dynamic range of identification when comparing ID sets against an external dataset with presumable cellular concentration numbers. I would in addition suggest comparing the dynamic range of the quantititative values observed from the available data which should provide a direct assessment of the dynamic range of quantification of the two methods.
        • Are prior studies referenced appropriately?

        • The statement that conventional DIA methods rely on nanoflow chromatography (p3, paragraph 3) is not accurate as there is previous implementations of data-independent acquisition MS of microflow separations, in part the group's work and referred to later in the text.

      o Vowinckel, J. et al. Cost-effective generation of precise label-free quantitative proteomes in high-throughput by microLC and data-independent acquisition. Sci. Rep. 8, 4346 (2018)

      o Bruderer, R. et al. Analysis of 1508 plasma samples by capillary-flow data-independent acquisition profiles proteomics of weight loss and maintenance. Mol. Cell Proteom. 18, 1242-1254 (2019).

      It is correct that most early implementations of DIA-MS utilized nanoflow separations due to sensitivity and proteome coverage but DIA as such is a chromatography-flow-speed-agnostic principle and the concept to combine microflow LC with DIA not new, yet powerful as demonstrated by the authors and others previously and once again, here. - P.3 paragraph 3 'Moreover, the increased sensitivity of DIA methods has facilitated applications in large-scale proteomics, including system-biology studies in various model organisms, disease states, and species [5-9]' Include Ref 4 where improved sensitivity of DIA was demonstrated (at proteomic breadth..) - Are the text and figures clear and accurate?

      -   Text and Figures need to be edited for typos, language, and clarity/accuracy.
      
      1. Abstract 'Zeno SWATH increases protein identification in complex samples<br /> 5- to 10-fold when compared to current SWATH acquisition methods on the same instrument' - At no point this is shown, a drop of required input amounts by 5-8-fold (increase in sensitivity) is shown by the data, not a multiplication of protein identification rates.
      2. P. 4 paragraph 3: Use terms 'consensus' or 'shared' identifications or similar to refer to the proteins identified in all 3 replicates, rather than 'reproducible' when discussing the reproducibility of peptide and protein quantification (as contrast to reproducibility of identification).
      3. P.3 paragraph 2 'selects and fragments multiple charge ions' -> multiply charged (?)
      4. P. 4 p. 1 'leading to under-detection' please clarify (leading to partial ion usage and limited sensitivity?)
      5. P. 6 paragraph 3 'The gain in identification number of Zeno SWATH versus SWATH is mostly explained by an increased dynamic range: i.e. more low-abundance proteins are detected' - Reformulate/clarify: Is increased dynamic range of identifications against external quantities an explanation or perhaps simply the increased sensitivity with improved duty cycle?
      6. Term 'active gradient' unclear. An inactive gradient is isocratic flow. Omit 'active'. Isocratic/other portions are overhead.
      7. Figure 1 panel a) iteration scheme a-d) is redundant with the rest of the figure; use alternative iter scheme within panel a). Panel a) is further contains illegibly small fonts and should be edited for legibility
      8. Revisit y-axis labels. Example: Fig. 1f) 'Precursors Identificaiton' -> Precursors identified/Precursor identifications. Correct throughout manuscript
      9. ID bar graphs in all Figures: Cumulative IDs shade of grey is not properly visible, suggest alternate color scheme or add black color outline to the bars
      10. Figure 1 e) legend 'along gradient length' -> gradient time / retention time
      11. Figure 1 d) too small, trend lines mentioned in text invisible in graph. Boxplots very small.
      12. There is three different terms used for the high throughput method (analytical-flow, high-flow, and another one.. please align where possible for clarity (i.e. choose 2 names for the 2 methods throughout the manuscript) etc..
      13. Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

        • They authors may consider adding a short explanation of the term 'dynamic range coverage of identification' to contrast this from a direct assessment of dynamic range of quantitative values observed in this study.
        • 2-species controlled experiment: The discrepancy of observed vs true mixing ratios suggests the data were scaled during the analysis which, with these mixture ratios, tends to distort the accuracy (i.e. generates offset of observed from true ratio. That's very likely not a pipetting error on a log scale). In other words, you may want to evaluate the raw quantitative ratios (w/o any normalization/scaling applied) which should be more reflective of true/manual pipetting ratios in light of normalization strategy incompatibily with certain species mix scenarios (compare Supplementary Figure 1 a). Note to the editor(s): This will not affect the clear benefit of Zenotrap usage demonstrated by the 2-species benchmark as is but can be considered a minor yet recommended improvement (thus here).
        • The 2-species controlled experiment can reveal more information than currently extracted and I would recommend to show Zeno Swath and Swath xy scatters, including count-scaled density distributions of the observed ratios, side-by side. This would give deeper understanding of the large impact of the Zeno SWATH method. Also, I believe I haven't seen any instrument to date delivering precise quantification over as broad a dynamic range as surmisable from Fig. 1d) which might be worth wile highlighting.

      Significance

      Wang et al. describe a technical advance in ion usage and sensitivity based on an ion-trap device storing and focusing ions for TOF-based bottom-proteomics measurements. The study demonstrates improved sensitivity relative to previous generation instrumentation and also explores the impact of the specific trap device relative to the general improvements of the remaining MS system. The work outlines a route towards high coverage proteomics at very high throughput and robustness, as desirable in clinical proteomics and prospective personalized medicine approaches. While not all sample types of interest are limited to the amounts where the strongest improvements are seen in the presented data, large scale studies across expansive cohorts will likey be rendered more practical and realistic due to reduced instrument contamination at reduced loads and also further applications beyond those discussed in the manuscript will be rendered feasible on the newer generation instrument.

      The improved ZenoTOF system and SWATH method follows a series of innovations in the mass spectrometry instrumentation, most notably and related the drastic improvement of ion utilization by storage e.g in a trapped ion mobility device earlier in the ion stream where, beyond an accumulation-based boost of sensitivity, ion mobility as a further biophysical properties is assessed in addition to the conventional m/z, as reviewed recently (doi: 10.1016/j.mcpro.2021.100138.). While these developments culminated and have been targeting low-flow, ultra-high sensitivity applications such as single-cell proteomics, the present study takes a different angle towards higher throughput measurements from significantly larger than single cell, but also significantly lower than historically required sample amounts that were prohibitive to a range of applications that are now easier to accomplish thanks to this and related work of the authors and others. The presented research appears of broad relevance and interest to the scientific community interested in protein abundance pattern analysis, in particular in larger (clinical) cohorts. Furthermore, the performance metrics on proteomic depth from human cell lysate digests will likely allow researchers with analytical quests other than those exemplified in the manuscript to extrapolate the ZenoTOF and Zeno SWATH suitability for their respective analytical targets.

      Reviewer Field of expertise/background:

      Quantitative proteomics. DIA mass spectrometry method & algorithm development & heavy usage. Protein Biochemistry. Molecular Biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript Wang et al, benchmark the new ZenoTOF with analytical and micro-flow set up and show impressive numbers of proteins identified and quantified. The paper is well written, and I have only a few minor comments:

      1. Figure 1. many of the panels are hard to read. Especially 1a
      2. Figure 1d. can the human amount not be normalised to log2=0?
      3. Please provide in the legend the bin size for the figure in 1e.
      4. Page 10 top: did SWATH identify more proteins than Zeno SWATH in plasma? There is something wrong as the figure shows something else. Also: That sentence in brackets is confusing.
      5. Typo: page 10: respectively.
      6. Please add raw data to PRIDE or a similar repository.

      Significance

      This is an impressive new technology that has been benchmarked by the Ralser group. It outperforms current state-of-the-art approaches.

      The primary audience is the proteomics community.

      My expertise is proteomics and quantitative mass spectrometry. I am well qualified to review this paper.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      In our work, we quantified the abundance and positions of major kinetochore proteins within the metaphase kinetochore in budding yeast using single-molecule localization microscopy. Based on these measures, we revised the current model of the kinetochore and provided a nanoscale view of the complex.

      We now revised our manuscript according to reviewers’ points. We performed new analyses to quantify the measurement errors and to justify our data analysis workflows. We further exploited the correlation-based analysis and found a correlation between the spreads of kinetochore proteins perpendicular to the spindle axis and their positions along the axis. We also discussed the potential non-centromeric pools and revised our model of the kinetochore. Further information on our analyses was now provided to improve the clarity. Changes to the text were implemented to better reflect our data. Information from relevant works was incorporated to better connect this work to the field.

      We thank the reviewers for their points, which help us show the rigorousness of our analyses, further demonstrate the potential of our work, and improve clarity.

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors have developed a rigorous methodology for using single-molecule imaging of exogenously labeled kinetochore proteins to count and estimate their copy numbers and the average distance from the kinetochore protein Spc105. Although the method is technically sound, its application to the kinetochore raises some crucial questions below. My biggest concern is the effect of non-centromeric pools of the centromeric proteins Cse4, Cep3, and Ctf19 on the estimated copy number per kinetochore. The authors should be able to address most, if not all, questions by presenting a more in-depth data analysis.

      Major points

      1. Accounting for tilt of the yeast spindle relative to the image plane: It is not clear to me how the authors ascertain whether the spindle being imaged is nearly parallel to the image plane. In the companion fission yeast study, spindle poles are used for this purpose, but this study seems to rely only on the labeled kinetochore proteins. The criteria used to select the in-plane spindles should be clearly defined.

      We thank the reviewer for pointing this out. We selected the in-plane spindles based on their average PSF size, which informs the z positions of the center of the kinetochore cluster (for simplicity, now all ’half-spindle’ was changed to ‘kinetochore cluster’). To calibrate the z position of kinetochore clusters, we first measured the width of the kinetochore cluster by fitting a cylindrical distribution. Overall, the kinetochores are likely symmetrically distributed around the spindle axes. Therefore, the height and the width of a kinetochore cluster should be the same. We then calibrated the z positions of the PSF size based on fluorescent bead data. Next, we plugged in the cylindrical distribution to the calibration curve to correlate the mean PSF size and position of the kinetochore cluster. We only took the kinetochore clusters with a mean PSF size

      1. The effects of PSF depth on counting kinetochore proteins: The authors use a well-characterized nuclear pore protein as the reference to estimate kinetochore protein counts per half-spindle. Although this method appears rigorous in principle, I am unsure about the effect of the spatial distribution of kinetochores on the accuracy of the estimated number. Nuclear pore proteins are all localized within an 100 nm away from the focal plane even when the spindle is perfectly parallel to the focal plane. A discussion of this possibility, its effect on the protein count/distance estimates, and any mitigating factors is essential to highlight the caveats associated with the conclusions.

      Based on the cylindrical distribution (see please the reply to point 1) of kinetochore clusters and their positions in z, we calculated the upper and lower boundaries of the distribution of kinetochore proteins in z, given a specific mean PSF size cutoff of a kinetochore cluster. Regardless of how stringent the cutoff is (130 and 135 nm), we made sure the boundaries do not exceed the imaging depth defined by our choice of the PSF size filtering (

      1. Presentation of the cross-correlation analysis: The authors use cross-correlation for an unbiased calculation of the axial separation between a protein of interest and Cse4, but I am curious about the structure of the underlying data, and the intensity image in Figure 1 is not easy to examine. It will be helpful to include more analysis of the underlying data for at least a subset of the proteins (e.g., proteins at short, intermediate, and long distances from Cse4) as supplementary data.

      2. The authors should include X and Y projections of the cross-correlation function.

      3. Do the widths of cross-correlation functions (i.e., their spread perpendicular to the spindle axis) match across all proteins and experiments? This should be an almost invariant characteristic of the measurements, assuming that proteins within each kinetochore tightly cluster around the 25 nm microtubule. This line of thinking makes the large width of the cross-correlation shown in Figure 1 somewhat surprising.

      4. It will also be interesting to test if the correlation between the positions of Spc105 molecules, especially perpendicular to the spindle axis, is comparable to the known separations between adjacent microtubules in the yeast spindle (the authors could use Winey et al. 1995 for serial-section EM of yeast spindles for comparison).

      The reviewer is interested in the spread, or the size of the distribution, of a protein in a kinetochore along and perpendicular to the spindle axis. This is an interesting idea and can be done practically. However, the information can be more easily obtained based on auto-correlation instead of cross-correlation, due to its better signal-to-noise ratio along the dimension perpendicular to the spindle axis. Cross-correlations in that dimension are convoluted with background localizations and different localization precisions of the two channels. These factors are hard to interpret and disentangled. In auto-correlations, although the background is still present, it can be modeled and then removed easily, as now mentioned on page 15 lines 500-516.

      Accordingly, we performed auto-correlation analysis on all the proteins and compared them to simulations representing different sizes. We find that the size of the distribution correlates to the position of the protein along the spindle axis. The results are now included as the new Fig. S5 and discussed on page 6 lines 169-176.

      The cross-correlation analysis was based on only the position of the maximum value, not the projections. To keep the figure concise, we decided not to include the projections. However, the auto-correlation analysis was indeed based on projections, which we now included in Fig. S5.

      Regarding the correlation between the positions of Spc105 molecules, we believe the reviewer actually refers to the correlation between the positions of kinetochores. Auto-/cross-correlations contain the information of the cluster sizes, based on the first peak (as shown in Fig. S5), and the relative distance (if the pattern is periodic). Unfortunately, the positions of kinetochores perpendicular to the spindle axis are not periodically distributed. Therefore, we cannot comment on the separations between adjacent microtubules.

      1. Cse4 count (4 per kinetochore) and the model presented: One of the surprising conclusions of the study is that there are two nucleosomes associated with each microtubule attachment, with Mif2/CENP-C potentially interacting with both nucleosomes. There are two critical issues that the authors must consider.

      (1) Fluorescent protein chimeras of Cse4 and CBF3 and COMA complex members do not exclusively localize to kinetochores. Biochemical studies show that both Cse4 and CBF3 proteins interact with non-centromeric DNA, e.g., see work from the Biggins lab regarding Cse4 over-expression and also from the Henikoff group that used ChIP-seq. I can't think of a similar reference for the CBF3 complex, but the DNA-binding proteins are also likely to interact with other parts of the genome. The non-centromeric protein is visible as a significant background fluorescence in wide-field microscopy, e.g., see Cep3 localization here: https://images.yeastrc.org/imagerepo/viewExperiment.do?id=202308&experimentGroupOffset=3&experimentOffset=0&experimentGroupSize=3

      Similar background fluorescence can be detected for Cse4 and Ctf19. This extra-centromeric localization of Cse4, Cep3, and Ctf19 makes it possible that the protein counts included by the authors are "contaminated" to some extent by the extra-centromeric protein. The authors should discuss this possibility and how it might affect their counts.

      After consideration, we agree with the reviewer that, specifically, a fraction of counted Cse4 molecules should be considered non-centromeric. We agree that the previous data is certainly sufficient to conclude it. The reviewer made a similar suggestion about COMA and CBF3 subcomplexes. In recent years a substantial portion of inner kinetochore components has been reconstituted. In Harrison et al. 2019, the Ctf19 complex structure has been solved. Two copies of the complex were observed. Therefore, the non-centromeric pool of COMA is certainly possible and we now made the adjustments to the text (page 8, lines 219-225) and Fig. 4. Accordingly, we now also modified the abstract (page 1, lines 26-27) and restructured the sections (page 10) to accommodate the different possibility of Cse4 copy numbers. While, fluorescence imaging of CBF3 presents a signal throughout the nuclear region we observed only four copies of Cep3 (part of CBF3). A CBF3 structure also has been resolved by Yan et al. 2018, in which the complex was proposed to exist as a dimer. This translates into four copies of Cep3. Therefore, we find it more suitable to leave all observed Cep3 (CBF3) molecules within a kinetochore model.

      (2) The model drawn in Figure 4 makes explicit assumptions about the positioning of the four Cse4 molecules (or two nucleosomes) in each kinetochore relative to the rest of the kinetochore components. Yet, the data shown do not justify this specific arrangement. Lawrimore et al. 2011 claim that the non-centromeric Cse4 nucleosomes must be randomly distributed in the pericentromeric chromatin to evade detection in biochemical tests. Therefore, the nearest-neighbor analysis suggested above will be valuable for gaining new insights into the relative positioning of the centromeric- and non-centromeric Cse4 nucleosomes. A similar analysis for Cep3 and Ctf19 will also be helpful. If stereotypical positioning of these molecules cannot be detected, then the model should be revised accordingly (alternative models that are also consistent with the data can be included).

      The reviewer has pointed out that Lawrimore et al. 2011 proposed and justified the existence of a non-centromeric Cse4 pool. This arrangement, also potentially along other inner kinetochore components, makes sense and our data did not indicate it otherwise. Therefore, we now revised our model accordingly by applying changes in the main text on page 10 lines 302-305 __as well as in __Fig. 4.

      (3) I suggest one experiment that can help the authors better understand protein organization in one kinetochore. Joglekar et al. 2006 used a dicentric chromosome to isolate single kinetochores on the spindle axis to test the assumption that each kinetochore consists of approximately the same number of molecules of kinetochore proteins. The strains are easy to construct (transform existing strains with a linearized plasmid). Single kinetochores can be seen with a low but reasonable frequency. I leave the decision to perform the experiment to the authors' discretion depending on whether the experiment will be worth the effort in strengthening or enhancing their conclusions.

      We performed the suggested experiment using the strain published in Joglekar et al. 2006 (kindly provided by Prof. Kerry Bloom) with Cse4 additionally tagged with mMaple. However, we always observed several super-resolved Cse4 clusters (likely of several kinetochores) overlapping with Nuf2-GFP diffraction-limited signal, therefore unable to assign a single isolated kinetochore to the lagging centromere.

      1. Information regarding the degree of correction applied to calculate protein count per half-spindle: It will be helpful to include data regarding the degree of correction applied to the expected and measured numbers of NPC protein as supplementary data so that the readers can see the magnitude of this correction relative to the measured counts.

      We would like to clarify that we did not correct the data. Instead, we calibrate the copy number, given that the copy number of Nup188 per NPC is known. We assume the same ratio between localization and copy number applies to both Nup188 and the kinetochore proteins. We now include a new Table S4 listing calibration factors of all experiments shown in Fig. 3.

      Minor points:

      1. McIntosh et al. JCB 2013 used microtubule plus-ends in serial section electron micrographs of yeast spindles to align the centromeric region and found a disk-shaped structure that roughly corresponds to the size of a single nucleosome ~ 80 nm away from the tip of the microtubule and centered the microtubule axis. The authors should refer to this finding in their discussion of the model that they present with two nucleosomes. In my opinion, this is compelling evidence for a nucleosome-like structure serving as the kinetochore foundation.

      We agree with this reviewer's comment. The study, among others, present compelling evidence for a point-centromere. We now included the finding in the discussion on page 10, lines 293-294.

      1. As discussed by the authors, the number of Cse4 molecules per kinetochore has been the subject of some controversy. Biochemical data from the Biggins group and ChIPseq data from the Westermann group (Altunkaya et al. 2016 Current Biology) strongly suggest that Cse4 molecules can only be found centered on the centromeric sequence. The latter reference should be included in the discussion.

      Thank you for pointing this out. Indeed, this is important. We have now added the relevant reference in the discussion on__ page 10 lines 291-292__.

      1. Although microscopy-based methods have estimated anywhere from 1, 2, to 6 Cse4 molecules per kinetochore, these studies generally agree on the stoichiometry between Cse4 and the rest of the kinetochore proteins, e.g., Ndc80 complex proteins are ~ 4-fold more abundant that Cse4, etc. The present study seems to disagree with protein stoichiometry. The authors may find it worthwhile to note this feature of their data.

      We now discuss the stoichiometry difference between our results and others on page 11 lines 322-324.

      1. Omission of the Dam1 complex from this study is disappointing to me personally, but I am sure that the authors have good reasons for this. They should briefly comment on the absence of the Dam1 complex in this study.

      To provide information on the Dam1 complex, we imaged Ask1, a component of the complex. The measured positioning and copy number of the protein are now included in Fig. 2 and Fig. 3 respectively, and described and discussed in respective parts of the manuscript.

      Reviewer #1 (Significance (Required)):

      Cieslinski and colleagues present a single-molecule localization-based study to define the copy numbers and relative organization of kinetochore proteins in budding yeast. These numbers confirm and significantly refine prior measurements of the same aspects of the kinetochore. They also raise new questions and point to new research directions. The measurements also reveal a model of the protein organization of the budding yeast kinetochore in metaphase. For these reasons, the manuscript is of significant interest to the cell division field.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this study, Cielinski and colleagues have applied single molecule localization microscopy to map the positions of proteins in the yeast kinetochore. This has not been reported previously and this study is both well-conducted and the data appear solid. They also use a modification of this technique to assess the stoichiometry of kinetochore proteins. The results that they obtain are broadly in line with several previous studies that use other methodology. There may be an improvement in accuracy using this new approach that has not been obtained previously and there are some important novel conclusions from this work. I would like the authors to address the following concerns prior to publication:

      Major points

      1. One interesting finding is that there is a discrepancy in the length of both the MIND and NDC80 complexes (from crystallographic data) with their relative positions. The authors suggest that the outer complexes could be twisted or rotated in respect of the spindle axis. It would be great if the authors could illustrate this in their model (or discuss it in the text), to demonstrate the required angle of twist/rotation of both complexes to account for the discrepancy. A twisted filament structure to the outer kinetochore does have some implications for its response to tension - a key determinant of kinetochore-microtubule attachment. It also may provide some flexibility to the structure under tension.

      The discussion about this discrepancy has now been incorporated in the main text, page 9 lines 263-267. For clarity, we only partially reflect this in our schematic model (Fig. 4A; the MIND complex) but we already reflected this in the illustrative structural model in Fig. 4B.

      1. For the experiment with cycloheximide, the authors state "Although we observed minor changes in copy numbers, the overall effect of CHX was small." For some proteins, Cse4i for example, there appears to be a significant decrease in intensity (30-40%) after cycloheximide treatment, see Figure S3. While the conclusion that tag maturation does not affect copy number measurements is sound, I suggest modifying this section to reflect the data.

      We now modified the section accordingly by pointing out that Cse4i under CHX measurements led to reduction of the signal. The modification can be found on page 8 lines 207-211.

      1. Page 5. The statement "These data agree reasonably well with previous diffraction-limited dual-color microscopy studies ..." provides readers with little ability to compare the data. I would like to see a supplementary figure comparing these new data with previous studies, especially those of Joglekar et al 2009, see Figure 3 in this paper.

      We thank the reviewer for suggesting such a table. This will allow readers a direct comparison of the data between our study and Joglekar at al. 2009. The comparison can be found in new Table S1 __and __Fig. S4, which are now mentioned on page 5.

      1. In terms of the distances quoted, are they in one dimension (as per Jogelkar et al 2009) or in three? The results section is entitled "...positions of kinetochore proteins along the metaphase spindle axis", which suggests a single dimension. Please make this very clear in the results section. In the discussion, is the statement "we mapped the relative positions of 15 kinetochore proteins along the kinetochore axis", which is not entirely clear. It seems from the methods that this is one dimension "...we determined the average distance between the two proteins along the spindle axis. “I suggest clarifying the results section briefly and clearly to indicate that this is a single dimension being measured and also using consistent wording of the axis measured throughout the text.

      We agree the previous description may not be clear to the viewers. We now changed the text accordingly in the results section, page 5 lines 129-130.

      Minor points:

      Abstract: I would drop "all" from "For all major kinetochore proteins...", since full characterisation was performed on 14 proteins (9 in terms of copy number).

      We now deleted “all” in the abstract as the reviewer suggested__.__

      Page 2: "trough" to through.

      Corrected.

      Page 2 "S. cerevisiae" to italics

      Corrected.

      Methods p11. How do the MKY strains relate to common yeast genetic backgrounds? (e.g. are they S288C?).

      MKY strains are derivative of S288C. The information was now updated in the Methods section and in Table S2.

      Reviewer #2 (Significance (Required)):

      This manuscript, together with an accompanying one from Virat et al., are nice complementary studies that provide the first single molecule localization studies of the yeast kinetochore. Although other labs have used super-resolution methods to study individual kinetochore proteins; both of these new studies map distances between many proteins at the kinetochore and thus are able to produce maps of the overall kinetochore structure. Like the previous study using standard resolution methods (Joglekar et al, 2009. Current Biology 19, 694-699); these studies will likely provide a benchmark for future studies on eukaryotic kinetochore architecture, including those in mammalian systems. Additionally, this work will appeal to super-resolution microscopists.

      My expertise is as a yeast kinetochore cell biologist.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Cielinski and colleagues have applied single molecule localization microscopy to map the positions of proteins in the yeast kinetochore. This has not been reported previously and this study is both well-conducted and the data appear solid. They also use a modification of this technique to assess the stoichiometry of kinetochore proteins. The results that they obtain are broadly in line with several previous studies that use other methodology. There may be an improvement in accuracy using this new approach that has not been obtained previously and there are some important novel conclusions from this work. I would like the authors to address the following concerns prior to publication:

      Major points

      1. One interesting finding is that there is a discrepancy in the length of both the MIND and NDC80 complexes (from crystallographic data) with their relative positions. The authors suggest that the outer complexes could be twisted or rotated in respect of the spindle axis. It would be great if the authors could illustrate this in their model (or discuss it in the text), to demonstrate the required angle of twist/rotation of both complexes to account for the discrepancy. A twisted filament structure to the outer kinetochore does have some implications for its response to tension - a key determinant of kinetochore-microtubule attachment. It also may provide some flexibility to the structure under tension.
      2. For the experiment with cycloheximide, the authors state "Although we observed minor changes in copy numbers, the overall effect of CHX was small." For some proteins, Cse4i for example, there appears to be a significant decrease in intensity (30-40%) after cycloheximide treatment, see Figure S3. While the conclusion that tag maturation does not affect copy number measurements is sound, I suggest modifying this section to reflect the data.
      3. Page 5. The statement "These data agree reasonably well with previous diffraction-limited dual-color microscopy studies ..." provides readers with little ability to compare the data. I would like to see a supplementary figure comparing these new data with previous studies, especially those of Joglekar et al 2009, see Figure 3 in this paper.
      4. In terms of the distances quoted, are they in one dimension (as per Jogelkar et al 2009) or in three? The results section is entitled "...positions of kinetochore proteins along the metaphase spindle axis", which suggests a single dimension. Please make this very clear in the results section. In the discussion, is the statement "we mapped the relative positions of 15 kinetochore proteins along the kinetochore axis", which is not entirely clear. It seems from the methods that this is one dimension "...we determined the average distance between the two proteins along the spindle axis."I suggest clarifying the results section briefly and clearly to indicate that this is a single dimension being measured and also using consistent wording of the axis measured throughout the text.

      Minor points:

      Abstract: I would drop "all" from "For all major kinetochore proteins...", since full characterisation was performed on 14 proteins (9 in terms of copy number).

      Page 2: "trough" to through.

      Page 2 "S. cerevisiae" to italics

      Methods p11. How do the MKY strains relate to common yeast genetic backgrounds? (e.g. are they S288C?).

      Significance

      This manuscript, together with an accompanying one from Virat et al., are nice complementary studies that provide the first single molecule localization studies of the yeast kinetochore. Although other labs have used super-resolution methods to study individual kinetochore proteins; both of these new studies map distances between many proteins at the kinetochore and thus are able to produce maps of the overall kinetochore structure. Like the previous study using standard resolution methods (Joglekar et al, 2009. Current Biology 19, 694-699); these studies will likely provide a benchmark for future studies on eukaryotic kinetochore architecture, including those in mammalian systems. Additionally, this work will appeal to super-resolution microscopists.

      My expertise is as a yeast kinetochore cell biologist.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors have developed a rigorous methodology for using single-molecule imaging of exogenously labeled kinetochore proteins to count and estimate their copy numbers and the average distance from the kinetochore protein Spc105. Although the method is technically sound, its application to the kinetochore raises some crucial questions below. My biggest concern is the effect of non-centromeric pools of the centromeric proteins Cse4, Cep3, and Ctf19 on the estimated copy number per kinetochore. The authors should be able to address most, if not all, questions by presenting a more in-depth data analysis.

      1. Accounting for tilt of the yeast spindle relative to the image plane: It is not clear to me how the authors ascertain whether the spindle being imaged is nearly parallel to the image plane. In the companion fission yeast study, spindle poles are used for this purpose, but this study seems to rely only on the labeled kinetochore proteins. The criteria used to select the in-plane spindles should be clearly defined.
      2. The effects of PSF depth on counting kinetochore proteins: The authors use a well-characterized nuclear pore protein as the reference to estimate kinetochore protein counts per half-spindle. Although this method appears rigorous in principle, I am unsure about the effect of the spatial distribution of kinetochores on the accuracy of the estimated number. Nuclear pore proteins are all localized within an < 100 nm3 volume. Therefore, all proteins within an in-focus nuclear pore will also be in focus. This is not the case with yeast kinetochores, especially in metaphase. A fraction of the kinetochores is likely to be > 100 nm away from the focal plane even when the spindle is perfectly parallel to the focal plane. A discussion of this possibility, its effect on the protein count/distance estimates, and any mitigating factors is essential to highlight the caveats associated with the conclusions.
      3. Presentation of the cross-correlation analysis: The authors use cross-correlation for an unbiased calculation of the axial separation between a protein of interest and Cse4, but I am curious about the structure of the underlying data, and the intensity image in Figure 1 is not easy to examine. It will be helpful to include more analysis of the underlying data for at least a subset of the proteins (e.g., proteins at short, intermediate, and long distances from Cse4) as supplementary data.
        • The authors should include X and Y projections of the cross-correlation function.
        • Do the widths of cross-correlation functions (i.e., their spread perpendicular to the spindle axis) match across all proteins and experiments? This should be an almost invariant characteristic of the measurements, assuming that proteins within each kinetochore tightly cluster around the 25 nm microtubule. This line of thinking makes the large width of the cross-correlation shown in Figure 1 somewhat surprising.
        • It will also be interesting to test if the correlation between the positions of Spc105 molecules, especially perpendicular to the spindle axis, is comparable to the known separations between adjacent microtubules in the yeast spindle (the authors could use Winey et al. 1995 for serial-section EM of yeast spindles for comparison).
      4. Cse4 count (4 per kinetochore) and the model presented: One of the surprising conclusions of the study is that there are two nucleosomes associated with each microtubule attachment, with Mif2/CENP-C potentially interacting with both nucleosomes. There are two critical issues that the authors must consider.
        • (1) Fluorescent protein chimeras of Cse4 and CBF3 and COMA complex members do not exclusively localize to kinetochores. Biochemical studies show that both Cse4 and CBF3 proteins interact with non-centromeric DNA, e.g., see work from the Biggins lab regarding Cse4 over-expression and also from the Henikoff group that used ChIP-seq. I can't think of a similar reference for the CBF3 complex, but the DNA-binding proteins are also likely to interact with other parts of the genome. The non-centromeric protein is visible as a significant background fluorescence in wide-field microscopy, e.g., see Cep3 localization here: https://images.yeastrc.org/imagerepo/viewExperiment.do?id=202308&experimentGroupOffset=3&experimentOffset=0&experimentGroupSize=3 Similar background fluorescence can be detected for Cse4 and Ctf19. This extra-centromeric localization of Cse4, Cep3, and Ctf19 makes it possible that the protein counts included by the authors are "contaminated" to some extent by the extra-centromeric protein. The authors should discuss this possibility and how it might affect their counts.
        • (2) The model drawn in Figure 4 makes explicit assumptions about the positioning of the four Cse4 molecules (or two nucleosomes) in each kinetochore relative to the rest of the kinetochore components. Yet, the data shown do not justify this specific arrangement. Lawrimore et al. 2011 claim that the non-centromeric Cse4 nucleosomes must be randomly distributed in the pericentromeric chromatin to evade detection in biochemical tests. Therefore, the nearest-neighbor analysis suggested above will be valuable for gaining new insights into the relative positioning of the centromeric- and non-centromeric Cse4 nucleosomes. A similar analysis for Cep3 and Ctf19 will also be helpful. If stereotypical positioning of these molecules cannot be detected, then the model should be revised accordingly (alternative models that are also consistent with the data can be included).
        • (3) I suggest one experiment that can help the authors better understand protein organization in one kinetochore. Joglekar et al. 2006 used a dicentric chromosome to isolate single kinetochores on the spindle axis to test the assumption that each kinetochore consists of approximately the same number of molecules of kinetochore proteins. The strains are easy to construct (transform existing strains with a linearized plasmid). Single kinetochores can be seen with a low but reasonable frequency. I leave the decision to perform the experiment to the authors' discretion depending on whether the experiment will be worth the effort in strengthening or enhancing their conclusions.
      5. Information regarding the degree of correction applied to calculate protein count per half-spindle: It will be helpful to include data regarding the degree of correction applied to the expected and measured numbers of NPC protein as supplementary data so that the readers can see the magnitude of this correction relative to the measured counts.

      Minor points:

      1. McIntosh et al. JCB 2013 used microtubule plus-ends in serial section electron micrographs of yeast spindles to align the centromeric region and found a disk-shaped structure that roughly corresponds to the size of a single nucleosome ~ 80 nm away from the tip of the microtubule and centered the microtubule axis. The authors should refer to this finding in their discussion of the model that they present with two nucleosomes. In my opinion, this is compelling evidence for a nucleosome-like structure serving as the kinetochore foundation.
      2. As discussed by the authors, the number of Cse4 molecules per kinetochore has been the subject of some controversy. Biochemical data from the Biggins group and ChIPseq data from the Westermann group (Altunkaya et al. 2016 Current Biology) strongly suggest that Cse4 molecules can only be found centered on the centromeric sequence. The latter reference should be included in the discussion.
      3. Although microscopy-based methods have estimated anywhere from 1, 2, to 6 Cse4 molecules per kinetochore, these studies generally agree on the stoichiometry between Cse4 and the rest of the kinetochore proteins, e.g., Ndc80 complex proteins are ~ 4-fold more abundant that Cse4, etc. The present study seems to disagree with protein stoichiometry. The authors may find it worthwhile to note this feature of their data.
      4. Omission of the Dam1 complex from this study is disappointing to me personally, but I am sure that the authors have good reasons for this. They should briefly comment on the absence of the Dam1 complex in this study.

      Significance

      Cieslinski and colleagues present a single-molecule localization-based study to define the copy numbers and relative organization of kinetochore proteins in budding yeast. These numbers confirm and significantly refine prior measurements of the same aspects of the kinetochore. They also raise new questions and point to new research directions. The measurements also reveal a model of the protein organization of the budding yeast kinetochore in metaphase. For these reasons, the manuscript is of significant interest to the cell division field.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      Thank you for the peer review of our manuscript entitled “Hypoxia causes pancreatic β____-cell dysfunction by activating a transcriptional repressor BHLHE40” (RC-2022-01560). We greatly appreciate the reviewers’ constructive suggestions and your invitation to revise the manuscript. Below, we address the comments point-by-point and provide details of the changes we are planning to or have implemented. We believe that the revision plan will meet with the approval of the editor and reviewers. We also would be happy to respond to any further questions and comments that you may have.

      2. Description of the planned revisions

      Response to comments of reviewer 1

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors examine the role of hypoxia-induced transcriptional repression in mediating loss of β-cell function in type2 diabetes. Transcriptional profiling of mouse and human islets exposed to low oxygen conditions revealed downregulation of β-cell identity and oxidative phosphorylation genes, and upregulation of genes associated with hypoxia. Identification of genes commonly upregulated in Min6 cells, mouse and human islets under hypoxic conditions, revealed induction of two transcriptional repressors BHLHE40 and ATF3. The authors further show that Bhlhe40 deficiency rendered β-cells resistant to hypoxic stress and restored glucose- and KCl-stimulated insulin secretion. This rescue in β-cell function was at least, in part, due to restoration of ATP generation and exocytosis of insulin granules. Furthermore, transcriptional profiling of Min6 cells overexpressing Bhlhe40 indicated down-regulation of key β-cell genes including Mafa. The authors elegantly show that BHLHE40 blocks PDX1 binding to Mafa transcription start site by binding to two E-box sites within the Mafa promoter/enhancer region. Lastly, Cre-mediated β-cell-specific deletion of Bhlhe40 in ob/ob mice restored expression of Mafa and exocytotic genes, accompanied by improvements in ATP generation and insulin secretion.

      Major comments 1. The authors conclude that BHLHE40 regulates insulin secretion at two key steps: ATP generation and exocytosis. However, insulin secretory profiles with glucose and KCl seem to be similar with genetic manipulations of Bhlhe40 both in vivo and ex vivo. As the authors indicate in line 176, this suggests a more prominent role of BHLHE40 in regulating exocytotic events downstream of Ca2+ influx. Further experiments are therefore necessary to adequately address the effects on ATP generation. Given the observation that PGC1____α, a regulator of mitochondrial biogenesis is suppressed by BHLHE40, mitochondrial assessments would be crucial. Additionally, the effect on mitochondrial mass in Fig 3K seem to be marginal and need to be confirmed using additional measurements listed below.

      We appreciate your constructive suggestion. According to your suggestions, we will explore the role of BHLHE40 in mitochondrial function in more detail.

      a. In fig 3F, the authors show no change in KCl stimulated Ca2+ influx. Glucose stimulated Ca2+ influx needs to be examined to confirm regulation of ATP generation.

      We thank the reviewer for pointing this out. We performed the experiment according to the suggestion. Please see section 3.

      b. OXPHOS subunits, TOM20 levels by western blotting

      We thank the reviewer for pointing this out. We will perform Western blotting to check the protein levels of OXPHOS subunits and TOM20 in control (Ctrl) and Bhlhe40 knockdown (KD) MIN6 cells cultured under 20% or 5% O­­­2.

      c. mtDNA content, transcript levels by qRT-PCR

      We will perform qRT-PCR to check the mtDNA content in Ctrl and Bhlhe40 KD MIN6 cells cultured under 20% or 5% O­­­2.

      d. Functional assessments: Changes in mitochondrial membrane potential or oxygen consumption

      We will evaluate mitochondrial membrane potential by MitoTracker Red staining in Ctrl and Bhlhe40 KD MIN6 cells cultured under 20% or 5% O­­­2.

      2. Data presented in Figure 4 and 5 indicates transcriptional repression of Mafa by BHLHE40 as a mechanism of beta-cell dysfunction under hypoxic conditions. However, additional experiments are necessary to confirm that repression of PDX1-Mafa binding specifically is responsible for defects in GSIS -

      a. Fig 5G shows inhibition of PDX1-binding to Mafa with overexpression of Bhlhe40. This needs to be confirmed under hypoxic conditions.

      We thank the reviewer for pointing this out. Hypoxia for 16 to 24 hours decreases the expression levels of Pdx1 in mouse islets and MIN6 cells (Figure 1A and Sato Y., PLoS One 2014). In that condition, it is difficult to assess whether the reduction in PDX1 binding to Mafa enhancer is attributed to inhibition of PDX1 binding or PDX1 downregulation. Therefore, we will aim to determine the hypoxia exposure time during which Mafa expression is downregulated but Pdx1 expression is not affected. If we fail to identify the time, we plan to generate Pdx1-overexpressing MIN6 cell lines and evaluate PDX1 binding to Mafa enhancer in hypoxic conditions.

      Sato Y. et al. Moderate Hypoxia Induces β-Cell Dysfunction with HIF-1–Independent Gene Expression Changes. PLoS One. 2014;9(12):e114868.

      b. Fig 4H and 4I show restoration of insulin secretion normalized to total protein with AAV-Mafa. This needs to be supplemented with insulin content as MAFA has been implicated in regulating insulin gene expression (PMID: 25500951).

      We thank the reviewer for pointing this out and agree with the comment on our original manuscript. We will evaluate insulin content by insulin ELISA assay in samples from AAV-Ctrl and AAV-Mafa-overexpressing MIN6 cells cultured under 20% or 5% O2. If the insulin content is affected by Mafa overexpression, insulin secretion will be adjusted by the intracellular insulin content.

      c. qRT-PCR of exocytosis genes and ATP generation with hypoxia and AAV-Mafa.

      We thank the reviewer for pointing this out. We will evaluate expression of exocytosis genes and ATP generation in AAV-Ctrl and AAV-Mafa-overexpressing MIN6 cells cultured under 20% or 5% O2.

      d. Would mutation of A and C E-box sites restore PDX1 binding to Mafa TF region under hypoxia?

      To address this question, we plan to introduce mutations in A and C sites of the Mafa gene in MIN6 cells by using CRISPR-Cas9 technology and then to examine PDX1 binding to the Mafa gene by ChIP assay under hypoxic conditions.

      3. β-dedifferentiation has been proposed to be involved in loss of insulin secretion in T2D (PMID: 22980982, 16123366). One can speculate that transcriptional repression of Mafa by BHLHE40 is a component of a larger dedifferentiation phenomenon occurring under hypoxia, as other ____β-cell genes were decreased with hypoxia (Fig 1A) and Bhlhe40-OE in Fig 4A. Identifying differences in dedifferentiation and ____β-cell disallowed genes with Bhlhe40 overexpression (RNA seq, qRT-PCR) would therefore potentially reveal a dedifferentiation mechanism.

      We thank the reviewer for pointing this out. Please see section 3.

      4. The authors identify Atf3 as another transcriptional repressor enriched under hypoxia although to a lesser degree than Bhlhe40. The role of ATF3 in hypoxia-induced apoptosis and adaptive UPR has been previously suggested (PMID: 20519332, 20349223). Additionally, hypoxia represses adaptive UPR in models of T2D and drives ____β-cell apoptosis (PMID: 27039902). The authors discuss the role of ATF3 under hypoxia in the discussion (lines 319-324) and addressing these research gaps regarding ATF3 function would be insightful.

      We are grateful for the reviewer’s comment. We will generate Atf3 knockdown MIN6 cell lines and examine the effect of ATF3 on hypoxia-induced apoptosis by PI/AnnexinV staining. If ATF3 is involved in hypoxia-induced apoptosis, we will also measure the mRNA expression levels involved in adaptive UPR.

      Minor comments 1. In Fig 2E, increasing replicates would confirm no induction of Bhlhe40 with Thapsigargin.

      Thank you for pointing out this issue. We will perform additional experiments to confirm the effect of thapsigargin on Bhlhe40 expression.

      2. In Fig 2B, BHLHE40 bands need to be quantified to show time-dependent increase in protein levels.

      Thank you for pointing out this issue. Please see section 3.

      3. In Fig 3C, insulin content needs to be shown with Bhlhe40-OE as in Fig 3B with hypoxia.

      Thank you for pointing out this issue. Please see section 3.

      4. In Fid 4E-F, band intensities need to quantified by densitometry to determine degree of downregulation of MAFA.

      We performed three independent experiments for Figure 4E. Please see section 3. We plan to perform additional experiments to determine the expression levels of MAFA (Figure 4F).

      5. In Fig4H and 6G, insulin content needs to be shown as stated above.

      We thank the reviewer for pointing this out. We will perform additional experiments to check the insulin content in these settings.

      6. In Supplemental Figure 3C, apoptosis induced by hypoxia was assessed by PI staining that detects late apoptosis. No significant changes were observed with Bhlhe40-KD, but additional cell death assessments can be used to confirm that B40 does not affect ____β-cell death.

      We thank the reviewer for mentioning this issue. To address this concern, we plan to investigate the effects of Bhlhe40 KD on the number of Annexin V-positive cells (early apoptosis) and cleaved (activated) caspase 3 expression in Ctrl and Bhlhe40 KD MIN6 cells under hypoxic conditions.

      7. It would be interesting to see the rates of diabetes incidence in Bhlhe40KO: ob/ob mice and if Bhlhe40 deficiency protects against or delays development of diabetes.

      Thank you for mentioning this issue. Please see section 3.

      8. Knockdown efficiency shown in Supplementary figure 3A needs to be estimated by quantifying band intensities.

      We plan to perform additional experiments to quantify the band intensities.

      9. Line 43 should say "...reversed defects in insulin secretion."

      We apologize for our incorrect explanation in the original manuscript. We have corrected this error in the text. Please see section 3.

      Reviewer #1 (Significance (Required)):

      The data presented provides novel mechanistic insights into the role of hypoxia in β-cell dysfunction. Studies in multiple models of type 2 diabetes (T2D) have shown the loss of signature β-cell genes including Ins1, Pdx1, Mafa, Slc2a2 as a result of excess nutrient stimulation and hypoxia; the precise causal mechanisms, however, still remain to be determined (PMID: 22980982, 28270834). A previous paper from the same group demonstrated downregulation of β-cell signature genes with hypoxia by a HIF1α independent mechanism (PMID: 25503986). Data presented in this report extend those observations and reveal a previously unappreciated role for transcriptional repressor BHLHE40 in the downregulation of a key β-cell gene Mafa. As the authors have identified additional transcriptional repressors including ATF3 and differentially expressed genes in both human and rodent β-cells, this paper would be of great value in understanding the effects of hypoxia. Moreover, studies in mouse models of T2D extend the association of BHLHE40 to clinical β-cell dysfunction and diabetes.

      My areas of interest are pancreatic β-cell and mitochondrial physiology. GSE analysis and repression of PGC1α by BHLHE40, as appropriately discussed by the authors, point towards impaired mitochondrial function and ATP generation. Additional experiments would greatly support the role of BHLHE40 in mitochondrial dysfunction under hypoxia.

      We thank the reviewer for his/her valuable comments.

      Response to comments of reviewer 2

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): This study examines the role of BHEH40 in beta-cell function and its role in mediating the changes with 'hypoxia'. Most of the studies use 5% oxygen which is probably close to normal oxygen tension for islets, although it is not great for islet survival once the islets are removed from their normal vasculature. The human islet studies use 2% oxygen which would be actual hypoxia.

      Major comments: Why were different oxygen concentrations used for mouse and human islets? What were the effects of 5% oxygen in human islets? Why was 5% oxygen chosen? 5% is close to normal oxygen tension that islets are exposed to in vivo, whereas 2% is not physiological.

      We apologize for the lack of explanation in the original manuscript. Please see section 3.

      If there was only 1 human donor, are the 2 and 3 RNA-seq technical replicates? If so, they do not show high replicability. Please discuss.

      The reviewer is correct. We analyzed human islets from one donor for RNA-seq (Figure 1A). It would be preferable to obtain additional data derived from different human donor samples. Therefore, we plan to analyze human islets from another donor to show the replicability of increased expression of Bhlhe40 under hypoxic conditions.

      Are the sequencing results from individual mice? Were the same mouse's islets used for normal and 5% oxygen or are they all different animals?

      We apologize for the lack of explanation in the original manuscript. Results of RNA-seq were obtained from different animals. Please see section 3.

      The whole gels with appropriate size markers need to be shown for all Westerns - they are not able to be appropriately reviewed in their current formats.

      We apologize for the mistake. Please see section 3.

      The histology in Figure 1K does not appear to match with the Western blot results in 1B, 1C which show a smaller but still clear band in the control 20% conditions, and in figures 1I and J in ob/ob and db/db controls. Lower power views showing most of the pancreas with a zoom-in shot of an example islet would be more appropriate. The immunofluorescence should be repeated to also include insulin so that beta-cells can be identified.

      We thank the reviewer for mentioning this issue. We will repeat the immunohistochemical analysis according to the reviewer’s comments. We also plan to show the data on insulin staining.

      What was the ____β-cell deletion efficiency of the knockdown mouse?

      We apologize for the lack of explanation in the original manuscript. Please see section 3.

      In the setting of hypoxia, would it be 'clinically' beneficial to have increased insulin secretion and thus metabolic demand? Please discuss.

      We thank the reviewer for mentioning this important issue. Our results show that BHLHE40 controls at least two steps of insulin secretion: exocytosis and ATP generation. A relatively smaller reduction of Ppargc1a might suggest a more prominent role of BHLHE40 in regulating exocytosis rather than ATP generation (oxygen consumption step). Chronic hyperglycemia induces b-cell damage and impairs insulin secretion, a process known as glucotoxicity (Weir GC, Diabetes 2004, 2020). We believe that inactivation of BHLHE40 may help to reduce glucotoxicity by increasing insulin secretion. However, we would like to discuss this topic in more detail once we have investigated the roles of BHLHE40 in ATP generation (as suggested by reviewer 1).

      Weir GC. et al. Five stages of evolving beta-cell dysfunction during progression to diabetes. Diabetes. 2004;53 Suppl 3:S16-21.

      Weir GC. Glucolipotoxicity, β-Cells, and Diabetes: The Emperor Has No Clothes. Diabetes. 2020;69(3):273-278.

      - Are the key conclusions convincing? Hard to assess the data in some cases - see above. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Yes, some of the conclusions are too strongly worded. An example is "However, hyperactivation of HIF in ____β-cells impairs insulin secretion by switching glucose metabolism from aerobic oxidative phosphorylation to anaerobic glycolysis (14-16)," this is too broad a statement. Hyperactivation of HIF in ____β-cells BY VHL DELETION impairs insulin secretion by that mechanism. Other ways of increasing HIF in ____β-cells do not all have this effect. So, the following part "suggesting that activation of HIF underlies ____β-cell dysfunction and glucose intolerance in hypoxia." Is not warranted. It would be fair to say "suggesting that unregulated over-activation of HIF may cause ____β-cell dysfunction.

      We apologize for our incorrect explanation in the original manuscript. Please see section 3.

      The paper is not off to a good start when the author spell Abstract as Abstruct - it suggests a spell-check was not performed. For the last sentence of the abstract, 'and its implication' - what implication?

      We apologize for the typo and the poor description in the original manuscript. Please see section 3.

      Line 64 High glucose conditions generate RELATIVE, not absolute hypoxia in beta-cells. This statement should also be referenced.

      We apologize for our inaccurate explanation in the original manuscript. Please see section 3.

      - Would additional experiments be essential to support the claims of the paper? See above.

      - Are the data and the methods presented in such a way that they can be reproduced? Not enough detail for methods, but what is presented looks OK.

      - Are the experiments adequately replicated and statistical analysis adequate? Unclear, see above.

      - Specific experimental issues that are easily addressable.

      - Are prior studies referenced appropriately? No, only the body of work on VHL, not HIFs.

      - Are the text and figures clear and accurate? See above comments.

      - Do you have suggestions that would help the authors improve the presentation of their data and conclusions? See comments about gels etc above.

      We thank the reviewer for his/her valuable comments. Please see section 3 regarding to references.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      The authors conclude that BHLHE40 regulates insulin secretion at two key steps: ATP generation and exocytosis. However, insulin secretory profiles with glucose and KCl seem to be similar with genetic manipulations of Bhlhe40 both in vivo and ex vivo. As the authors indicate in line 176, this suggests a more prominent role of BHLHE40 in regulating exocytotic events downstream of Ca2+ influx. Further experiments are therefore necessary to adequately address the effects on ATP generation. Given the observation that PGC1____α, a regulator of mitochondrial biogenesis is suppressed by BHLHE40, mitochondrial assessments would be crucial. Additionally, the effect on mitochondrial mass in Fig 3K seem to be marginal and need to be confirmed using additional measurements listed below.

      a. In fig 3F, the authors show no change in KCl stimulated Ca2+ influx. Glucose stimulated Ca2+ influx needs to be examined to confirm regulation of ATP generation.

      We thank the reviewer for pointing this out. In accordance with the reviewer’s comments, we examined the glucose-stimulated Ca2+ influx and found that the influx stimulated by 22mM glucose in Bhlhe40-overexpressing (OE) MIN6 cells was significantly smaller than that in control MIN6 cells (Figure 3, J and K). We have added this information to the manuscript, as follows: “In addition, glucose-stimulated [Ca2+]i levels were significantly attenuated by Bhlhe40 overexpression (Figure 3, J and K). These results indicate that BHLHE40 suppresses glucose-stimulated ATP generation and the increase of [Ca2+]i levels in MIN6 cells” (lines 197 to 200).

      3. β-dedifferentiation has been proposed to be involved in loss of insulin secretion in T2D (PMID: 22980982, 16123366). One can speculate that transcriptional repression of Mafa by BHLHE40 is a component of a larger dedifferentiation phenomenon occurring under hypoxia, as other ____β-cell genes were decreased with hypoxia (Fig 1A) and Bhlhe40-OE in Fig 4A. Identifying differences in dedifferentiation and ____β-cell disallowed genes with Bhlhe40 overexpression (RNA seq, qRT-PCR) would therefore potentially reveal a dedifferentiation mechanism.

      We thank the reviewer for pointing this out. To check whether genes involved in dedifferentiation and the expression of b-cell disallowed genes are controlled by BHLHE40, we examined the expression of these genes in Bhlhe40 OE MIN6 cells and found that it was not increased by BHLHE40. However, because the findings were detected under limited experimental conditions, at this point we cannot conclude that BHLHE40 does not cause dedifferentiation of b-cells and induction of b-cell disallowed genes.

      Minor comments 2. In Fig 2B, BHLHE40 bands need to be quantified to show time-dependent increase in protein levels.

      Thank you for pointing out this issue. We performed three independent experiments and showed statistically significant upregulation of BHLHE40 (Figure 2B).

      3. In Fig 3C, insulin content needs to be shown with Bhlhe40-OE as in Fig 3B with hypoxia.

      Thank you for pointing out this issue. Bhlhe40 OE did not affect the insulin content in MIN6 cells (Supplemental Figure 3F).

      4. In Fid 4E-F, band intensities need to quantified by densitometry to determine degree of downregulation of MAFA.

      We performed three independent experiments for Figure 4E. Bhlhe40 OE led to a 67.4% decrease in the expression of MAFA in MIN6 cells (Figure 4E).

      7. It would be interesting to see the rates of diabetes incidence in Bhlhe40KO: ob/ob mice and if Bhlhe40 deficiency protects against or delays development of diabetes.

      Thank you for pointing out this issue. We found that nonfasting blood glucose concentrations were similar in Ctrl:ob/ob (239.9 ± 19.6 mg/dl; n = 9) and bB40KO:ob/ob mice (215.2 ± 17.1 mg/dl; n = 6) at 12 weeks of age (Supplemental Figure 5F). We have added this information to the revised manuscript, as follows: “In these mice, BHLHE40 deficiency in b-cells had no effect on obesity (Figure 6B), insulin sensitivity (Supplemental Figure 5E), or nonfasting glucose concentrations (Supplemental Figure 5F)” (lines 282 to 285).

      9. Line 43 should say "...reversed defects in insulin secretion."

      We apologize for our incorrect explanation in the original manuscript and have corrected it accordingly.

      Response to comments of reviewer 2

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): This study examines the role of BHEH40 in beta-cell function and its role in mediating the changes with 'hypoxia'. Most of the studies use 5% oxygen which is probably close to normal oxygen tension for islets, although it is not great for islet survival once the islets are removed from their normal vasculature. The human islet studies use 2% oxygen which would be actual hypoxia.

      Major comments: Why were different oxygen concentrations used for mouse and human islets? What were the effects of 5% oxygen in human islets? Why was 5% oxygen chosen? 5% is close to normal oxygen tension that islets are exposed to in vivo, whereas 2% is not physiological.

      We apologize for the lack of explanation in the original manuscript. We previously reported that hypoxic responses occur at 5% to 7% oxygen tension in MIN6 cells and mouse islets and that 3% hypoxia for 24 hours markedly increases MIN6 cell death (Sato Y., J Biol Chem 2011, Sato Y., PLoS One 2014). We also examined whether hypoxic responses occur at the same oxygen tension in human islets. Interestingly, in human islets, exposure to 2% but not 5% oxygen tension induced the upregulation of the SLC2A1 gene without apparent cell death. Another group also reported a hypoxic response in human islets in 2% oxygen (Puri S., Genes Dev. 2013). We have no adequate explanation as to why hypoxic responses occur at different oxygen tensions in mouse and human islets, but because of these findings, we used 5% oxygen in MIN6 cells and mouse islets and 2% oxygen in human islets. We have added this information to the text (lines 98 to 104) and Supplemental Figure 1A.

      Sato, Y. et al. Cellular hypoxia of pancreatic beta-cells due to high levels of oxygen consumption for insulin secretion in vitro. J Biol Chem. 2011;286(14):12524-32.

      Sato, Y. et al. Moderate hypoxia induces β-cell dysfunction with HIF-1-independent gene expression changes. PLoS One. 2014;9(12):e114868.

      Puri S. VHL-mediated disruption of Sox9 activity compromises β-cell identity and results in diabetes mellitus. Genes Dev. 2013;27(23):2563-2575.

      Are the sequencing results from individual mice? Were the same mouse's islets used for normal and 5% oxygen or are they all different animals?

      We apologize for the lack of explanation in the original manuscript. Each sample was derived from different mice. We have clarified this point by describing “n = 3” as “n = 3 mice/group” and have added this information to the legends of Figure 1 and Supplemental Figure 1.

      The whole gels with appropriate size markers need to be shown for all Westerns - they are not able to be appropriately reviewed in their current formats.

      We apologize for the inappropriate presentation of the data. We have included whole gel images with molecular weight markers as supplemental material.

      What was the ____β-cell deletion efficiency of the knockdown mouse?

      We apologize for not including this information. Although we showed the expression levels of Bhlhe40 in the islets of bB40KO mice (original Supplemental Figure 5B), we did not explain the data in the text. The deletion efficiency in the islets was 74.1%. We have added this information to the revised text (lines 274 to 275).

      - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Yes, some of the conclusions are too strongly worded. An example is "However, hyperactivation of HIF in ____β-cells impairs insulin secretion by switching glucose metabolism from aerobic oxidative phosphorylation to anaerobic glycolysis (14-16)," this is too broad a statement. Hyperactivation of HIF in ____β-cells BY VHL DELETION impairs insulin secretion by that mechanism. Other ways of increasing HIF in ____β-cells do not all have this effect. So, the following part "suggesting that activation of HIF underlies ____β-cell dysfunction and glucose intolerance in hypoxia." Is not warranted. It would be fair to say "suggesting that unregulated over-activation of HIF may cause ____β-cell dysfunction.

      We apologize for our incorrect explanation in the original manuscript. We have corrected this accordingly, as follows: “However, hyperactivation of HIF in b-cells by von Hippel-Lindau (VHL) deletion impairs insulin secretion by switching glucose metabolism from aerobic oxidative phosphorylation to anaerobic glycolysis (15-17), suggesting that unregulated overactivation of HIF may cause b-cell dysfunction (12, 14)” (lines 73 to 77).

      The paper is not off to a good start when the author spell Abstract as Abstruct - it suggests a spell-check was not performed. For the last sentence of the abstract, 'and its implication' - what implication?

      We apologize for the typo and the poor description in the original manuscript. The spell check was performed by an editing company, and we did not notice the error. We have changed the last sentence of the Abstract, as follows: “Collectively, this work identifies BHLHE40 as a key hypoxia-induced transcriptional repressor in b-cells that negatively regulates insulin secretion by suppressing MAFA expression” (lines 47 to 49).

      Line 64 High glucose conditions generate RELATIVE, not absolute hypoxia in beta-cells. This statement should also be referenced.

      We apologize for our inaccurate explanation in the original manuscript. We have corrected this accordingly, as follows: “high glucose conditions generate relative hypoxia in b-cells because these cells consume large amounts of oxygen (Sato Y., J Biol Chem 2011; Bensellam M., PLoS One 2012; Bensellam M., J Endocrinol 2018; Ilegems E, Sci Transl Med 2022)” (lines 64 to 65).

      Sato, Y. et al. Cellular hypoxia of pancreatic beta-cells due to high levels of oxygen consumption for insulin secretion in vitro. J Biol Chem. 2011;286(14):12524-32.

      Bensellam, M. et al. Glucose-induced O₂ consumption activates hypoxia inducible factors 1 and 2 in rat insulin-secreting pancreatic beta-cells. PLoS One 2012;7(1):e29807.

      Bensellam M. et al. Mechanisms of β-cell dedifferentiation in diabetes: recent findings and future research directions. J Endocrinol. 2018;236(2):R109-R143

      Ilegems, E. et al. HIF-1α inhibitor PX-478 preserves pancreatic β cell function in diabetes. Sci Transl Med. 2022;14(638):eaba9112.

      - Are prior studies referenced appropriately? No, only the body of work on VHL, not HIFs.

      We apologize for our inadequate references about the involvement of HIFs in hypoxia-induced β-cell dysfunction. We have included the following references in the text (line 77):

      Catrina, S.B. et al. Hypoxia and hypoxia-inducible factors in diabetes and its complications. Diabetologia. 2021;64(4):709-716

      Gulton JE. Hypoxia-inducible factors and diabetes. J Clin Invest. 2020; 130(10):5063-5073

      4. Description of analyses that authors prefer not to carry out

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study examines the role of BHEH40 in beta-cell function and its role in mediating the changes with 'hypoxia'. Most of the studies use 5% oxygen which is probably close to normal oxygen tension for islets, although it is not great for islet survival once the islets are removed from their normal vasculature. The human islet studies use 2% oxygen which would be actual hypoxia.

      Significance

      Major comments:

      Why were different oxygen concentrations used for mouse and human islets? What were the effects of 5% oxygen in human islets? Why was 5% oxygen chosen? 5% is close to normal oxygen tension that islets are exposed to in vivo, whereas 2% is not physiological.

      If there was only 1 human donor, are the 2 and 3 RNA-seq technical replicates? If so, they do not show high replicability. Please discuss.

      Are the sequencing results from individual mice? Were the same mouse's islets used for normal and 5% oxygen or are they all different animals?

      The whole gels with appropriate size markers need to be shown for all Westerns - they are not able to be appropriately reviewed in their current formats.

      The histology in Figure 1K does not appear to match with the Western blot results in 1B, 1C which show a smaller but still clear band in the control 20% conditions, and in figures 1I and J in ob/ob and db/db controls. Lower power views showing most of the pancreas with a zoom-in shot of an example islet would be more appropriate. The immunofluorescence should be repeated to also include insulin so that beta-cells can be identified.

      What was the β-cell deletion efficiency of the knockdown mouse?

      In the setting of hypoxia, would it be 'clinically' beneficial to have increased insulin secretion and thus metabolic demand? Please discuss.

      • Are the key conclusions convincing? Hard to assess the data in some cases - see above.
      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Yes, some of the conclusions are too strongly worded. An example is "However, hyperactivation of HIF in β-cells impairs insulin secretion by switching glucose metabolism from aerobic oxidative phosphorylation to anaerobic glycolysis (14-16)," this is too broad a statement. Hyperactivation of HIF in β-cells BY VHL DELETION impairs insulin secretion by that mechanism. Other ways of increasing HIF in β-cells do not all have this effect. So, the following part "suggesting that activation of HIF underlies β-cell dysfunction and glucose intolerance in hypoxia." Is not warranted. It would be fair to say "suggesting that unregulated over-activation of HIF may cause β-cell dysfunction. The paper is not off to a good start when the author spell Abstract as Abstruct - it suggests a spell-check was not performed. For the last sentence of the abstract, 'and its implication' - what implication? Line 64 High glucose conditions generate RELATIVE, not absolute hypoxia in beta-cells. This statement should also be referenced.
      • Would additional experiments be essential to support the claims of the paper? See above.
      • Are the data and the methods presented in such a way that they can be reproduced? Not enough detail for methods, but what is presented looks OK.
      • Are the experiments adequately replicated and statistical analysis adequate? Unclear, see above.
      • Specific experimental issues that are easily addressable.
      • Are prior studies referenced appropriately? No, only the body of work on VHL, not HIFs.
      • Are the text and figures clear and accurate? See above comments.
      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? See comments about gels etc above.
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors examine the role of hypoxia-induced transcriptional repression in mediating loss of β-cell function in type2 diabetes. Transcriptional profiling of mouse and human islets exposed to low oxygen conditions revealed downregulation of β-cell identity and oxidative phosphorylation genes, and upregulation of genes associated with hypoxia. Identification of genes commonly upregulated in Min6 cells, mouse and human islets under hypoxic conditions, revealed induction of two transcriptional repressors BHLHE40 and ATF3. The authors further show that Bhlhe40 deficiency rendered β-cells resistant to hypoxic stress and restored glucose- and KCl-stimulated insulin secretion. This rescue in β-cell function was at least, in part, due to restoration of ATP generation and exocytosis of insulin granules. Furthermore, transcriptional profiling of Min6 cells overexpressing Bhlhe40 indicated down-regulation of key β-cell genes including Mafa. The authors elegantly show that BHLHE40 blocks PDX1 binding to Mafa transcription start site by binding to two E-box sites within the Mafa promoter/enhancer region. Lastly, Cre-mediated β-cell-specific deletion of Bhlhe40 in ob/ob mice restored expression of Mafa and exocytotic genes, accompanied by improvements in ATP generation and insulin secretion.

      Major comments

      1. The authors conclude that BHLHE40 regulates insulin secretion at two key steps: ATP generation and exocytosis. However, insulin secretory profiles with glucose and KCl seem to be similar with genetic manipulations of Bhlhe40 both in vivo and ex vivo. As the authors indicate in line 176, this suggests a more prominent role of BHLHE40 in regulating exocytotic events downstream of Ca2+ influx. Further experiments are therefore necessary to adequately address the effects on ATP generation. Given the observation that PGC1α, a regulator of mitochondrial biogenesis is suppressed by BHLHE40, mitochondrial assessments would be crucial. Additionally, the effect on mitochondrial mass in Fig 3K seem to be marginal and need to be confirmed using additional measurements listed below.
        • a. In fig 3F, the authors show no change in KCl stimulated Ca2+ influx. Glucose stimulated Ca2+ influx needs to be examined to confirm regulation of ATP generation.
        • b. OXPHOS subunits, TOM20 levels by western blotting
        • c. mtDNA content, transcript levels by qRT-PCR
        • d. Functional assessments: Changes in mitochondrial membrane potential or oxygen consumption
      2. Data presented in Figure 4 and 5 indicates transcriptional repression of Mafa by BHLHE40 as a mechanism of beta-cell dysfunction under hypoxic conditions. However, additional experiments are necessary to confirm that repression of PDX1-Mafa binding specifically is responsible for defects in GSIS -
        • a. Fig 5G shows inhibition of PDX1-binding to Mafa with overexpression of Bhlhe40. This needs to be confirmed under hypoxic conditions.
        • b. Fig 4H and 4I show restoration of insulin secretion normalized to total protein with AAV-Mafa. This needs to be supplemented with insulin content as MAFA has been implicated in regulating insulin gene expression (PMID: 25500951).
        • c. qRT-PCR of exocytosis genes and ATP generation with hypoxia and AAV-Mafa.
        • d. Would mutation of A and C E-box sites restore PDX1 binding to Mafa TF region under hypoxia?
      3. β-dedifferentiation has been proposed to be involved in loss of insulin secretion in T2D (PMID: 22980982, 16123366). One can speculate that transcriptional repression of Mafa by BHLHE40 is a component of a larger dedifferentiation phenomenon occurring under hypoxia, as other β-cell genes were decreased with hypoxia (Fig 1A) and Bhlhe40-OE in Fig 4A. Identifying differences in dedifferentiation and β-cell disallowed genes with Bhlhe40 overexpression (RNA seq, qRT-PCR) would therefore potentially reveal a dedifferentiation mechanism.
      4. The authors identify Atf3 as another transcriptional repressor enriched under hypoxia although to a lesser degree than Bhlhe40. The role of ATF3 in hypoxia-induced apoptosis and adaptive UPR has been previously suggested (PMID: 20519332, 20349223). Additionally, hypoxia represses adaptive UPR in models of T2D and drives β-cell apoptosis (PMID: 27039902). The authors discuss the role of ATF3 under hypoxia in the discussion (lines 319-324) and addressing these research gaps regarding ATF3 function would be insightful.

      Minor comments

      1. In Fig 2E, increasing replicates would confirm no induction of Bhlhe40 with Thapsigargin.
      2. In Fig 2B, BHLHE40 bands need to be quantified to show time-dependent increase in protein levels.
      3. In Fig 3C, insulin content needs to be shown with Bhlhe40-OE as in Fig 3B with hypoxia.
      4. In Fid 4E-F, band intensities need to quantified by densitometry to determine degree of downregulation of MAFA.
      5. In Fig4H and 6G, insulin content needs to be shown as stated above.
      6. In Supplemental Figure 3C, apoptosis induced by hypoxia was assessed by PI staining that detects late apoptosis. No significant changes were observed with Bhlhe40-KD, but additional cell death assessments can be used to confirm that B40 does not affect β-cell death.
      7. It would be interesting to see the rates of diabetes incidence in Bhlhe40KO: ob/ob mice and if Bhlhe40 deficiency protects against or delays development of diabetes.
      8. Knockdown efficiency shown in Supplementary figure 3A needs to be estimated by quantifying band intensities.
      9. Line 43 should say "...reversed defects in insulin secretion."

      Significance

      The data presented provides novel mechanistic insights into the role of hypoxia in β-cell dysfunction. Studies in multiple models of type 2 diabetes (T2D) have shown the loss of signature β-cell genes including Ins1, Pdx1, Mafa, Slc2a2 as a result of excess nutrient stimulation and hypoxia; the precise causal mechanisms, however, still remain to be determined (PMID: 22980982, 28270834). A previous paper from the same group demonstrated downregulation of β-cell signature genes with hypoxia by a HIF1α independent mechanism (PMID: 25503986). Data presented in this report extend those observations and reveal a previously unappreciated role for transcriptional repressor BHLHE40 in the downregulation of a key β-cell gene Mafa. As the authors have identified additional transcriptional repressors including ATF3 and differentially expressed genes in both human and rodent β-cells, this paper would be of great value in understanding the effects of hypoxia. Moreover, studies in mouse models of T2D extend the association of BHLHE40 to clinical β-cell dysfunction and diabetes. My areas of interest are pancreatic β-cell and mitochondrial physiology. GSE analysis and repression of PGC1α by BHLHE40, as appropriately discussed by the authors, point towards impaired mitochondrial function and ATP generation. Additional experiments would greatly support the role of BHLHE40 in mitochondrial dysfunction under hypoxia (as discussed under comments).

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01594

      Corresponding authors: Hidehiko Kawai and Hiroyuki Kamiya

      1. General Statements [optional]

      We would like to extend our gratitude to the Editor and both Reviewers for their constructive and insightful comments to our manuscript. We deeply appreciate the Reviewers’ careful consideration of our work, in result of which we think the paper has greatly improved. Below, we have responded to all points raised by the Reviewers.

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The analysis of mutations in mammalian, including human, genomes has been of interest for many decades. Early DNA sequencing technologies enabled direct identification of mutations in target genes provided that the mutant genes could be readily isolated. This requirement stimulated the development of shuttle vector plasmids that carried a mutation marker gene and could replicate in both mammalian and bacterial cells. These were used in experiments in which the plasmids, treated with a mutagen, would be passaged through mammalian host cells after which the progeny plasmids were introduced into an indicator bacterial strain. Colonies with mutant marker genes could be distinguished by color or survival, the plasmids recovered, and the sequence of the mutant gene determined. The shuttle vector plasmid that became the most widely used contained as the marker the supF amber suppressor tyrosyl tRNA gene positioned in the plasmid such that deletion mutations associated with mammalian cell transfection were selected against. Although various improvements have been introduced since its introduction in the mid-1980s, including bar codes to distinguish independent from sibling mutations (in the early 1990s), the basics of the system have been maintained, and it and variations are still in use. The Kamiya group has made several adjustments to the supF shuttle vectors, including the construction of indicator bacterial strains based on survival of bacteria containing mutant supF genes (the initial system relied on colony color). They have published many studies of mutagenesis by various agents, error prone polymerases, etc. In the current submission they describe a comprehensive approach to identifying mutations in the supF gene that exploits Next Generation Sequencing technology that can identify the full spectrum of mutations including those that escape detection in phenotypic screens. The study is exhaustive and presents a methodical validation of each component of their approach. They report UV induced mutations, the mechanism of which has been well characterized in previous literature. They also describe a category of multiple mutations, which had been observed in the early work with the supF plasmids, and whose relationship to UV photoproducts is most likely indirect.

      *We thank the Reviewer for their very insightful feedback to our manuscript and their positive assessment. We have added some discussion points based on the essential references mentioned in the Reviewer’s comments, which we believe made the explanation of our study more complete. *

      Major comments: This manuscript presents a technical advance on the use of the supF mutation reporter system. The extent of the validation of each component of the system, including the bar code is rigorous. Their data on the nature and location of UV induced mutations are in very good agreement with previous studies with supF and other reporter genes, a further validation of their approach. Their discussion of the mechanism of the UV induced mutations is in accord with prior work from other laboratories. However, their interpretation of the multiple mutations, although reasonable in invoking a role for APOBEC deamination of cytosines (see eLife. 2014; 3: e02001 for another discussion of this issue), overlooks a much earlier study on the same topic that showed that nicks in the vicinity of the marker gene are mutagenic and can induce multiple mutations (Proc Natl Acad Sci 1987 84:4944-8). It would be useful for the authors to consider their data on the multiple mutations in the light of the earlier analysis. Furthermore, a check to verify the covalently closed circular integrity of the plasmid preparations would be an important quality control and could reduce the mutagenesis observed in 0 UV controls.

      We thank the Reviewer for the valuable comments that made our manuscript clearer and more emphatic. We are hereby addressing all of the Reviewer’s concerns. The available data accumulated from previous studies have proved the high sensitivity of the supF assay as a mutagenesis assay, which now has been clearly supported by the results in the current study. We believe that this NGS assay will be able to fulfil the data requirements to tackle many questions related to mutagenesis, thanks to the simplicity and cost-effectiveness of the procedure. However, to meet the experimental objectives, the preparation and analysis of the library are crucially important procedures in the stages of initial setting up of the assay. The covalently closed circular integrity of the vector library is definitely one of the important points we should pay attention to when performing this assay. After the construction of the BC12-library, we have to check the quality of the library by agarose gel electrophoresis. The background mutation frequency and the sequence of the library itself (uploaded as described in the DATA AVAILABILITY section of this manuscript) also needs to be analyzed by NGS before the experiment. We are also routinely constructing the double-stranded shuttle vector from a single-stranded circular DNA with a variety of site-specific damaged oligonucleotides. The treatment with T5 exonuclease followed by purification is absolutely essential to decrease the background mutation frequency. Without the treatment with the exonuclease, cluster mutations may be increased under specific experimental conditions. For this study, we carried out the conventional supF assay using the BC12-library purified after T5 exonuclease treatment. However, in this case the process of purification slightly increased the mutant frequency of the BC12-library to about 2 x10-4 (corresponding to 1x10-6/bp).Therefore, when setting up the essay, we have to consider the background control that we will need for the data analysis. In response to the Reviewer’s comments, we have now added the following paragraph in the DISCUSSION section:

      Page 16, line 25:

      ”5) For the supF assay, spontaneous cluster mutations at TC:GA sites were often observed, and it was well illustrated in an earlier study that a nick in the shuttle vector was a trigger for these asymmetric cluster mutations (54). Therefore, we need to be aware of the quality of each library and how it affects the outcome of each analysis, especially for detection of very low levels of mutations. Depending on the purpose of the experiments, in the preparation of covalently closed circular vector libraries it is essential to eliminate the background level of mutations. In fact, the in vitro construction of the library of double-stranded shuttle vectors from single-stranded circular DNA requires the process of treatment with T5 Exonuclease, which drastically decreases background mutations.”

      Minor points The authors state that only 30% of the base sequence of the supF gene can be "used for dual-antibiotic selection on the indicator E. coli". An earlier review (Mutation Res 220: 61,1989) indicated that within the mature tRNA region single or tandem mutations had been reported at 87% of sites, using the colony color assay. The direct NGS analyses would be indifferent to phenotype, and one would expect the maximum number of mutable sites would be recovered from this approach. It would be helpful for an explicit statement regarding the number of mutant sites to be in the Discussion, as this should strengthen the case for the NGS strategy.

      We thank the Reviewer for the helpful comment. These are important points we should indeed mention. This method will complement previous data, and especially the data from titer plates will provide us with non-biased mutation spectra for the whole analyzed region. We have now explained in detail about the coverage of mutation spectra in the DISSCUSSION section.

      Page 14, line 14:

      The mutation spectra of single or tandem base-substitutions for inactive supF genes identified by using the blue-white colony color assays were comprehensively summarized in an earlier review article, and it was noted that the mutations were detected at 86 sites within a 158-bp region covering the supF gene (54%) and at 74 sites within the 85-bp mature tRNA region (87%), thus demonstrating the great sensitivity of the supF assay system for analysis of mutation spectra (19). However, obtaining reliable datasets by the conventional supF assay requires skill and experience, especially for studies where the mutations of interest are induced with low frequency. The method has been advanced by the construction of indicator bacterial strains with different supF reporter genes which allow selection based on survival of bacteria containing mutant supF genes. However, the fact that the supF phenotypic selection process relies on the structure and function of transfer RNAs that may be differently affected by different mutations means that the improvement of the efficiency of the selection process may cause loss of coverage of the mutation spectra, as it is under our experimental conditions, where the coverage is about 30% (19,20).”

      Page 15, line 4:

      From this point of view, we believe that we can secure a sufficient number of experiments to improve the accuracy of the analysis and to confirm the reproducibility of the experiments. Furthermore, the data from colonies grown on titer plates provides us, at least in principle, and with the exception of large deletions and insertions, with non-biased mutation spectra for the whole analyzed region.

      Supplementary Figure 1 shows the organization of 8 supF reporter plasmids. Were these discussed in the text and employed in the experiments? It was not clear in the text.

      We thank the Reviewer for the helpful comment. It was indeed not clear which vectors we used and why we constructed a series of vectors. Now, we have added the vectors we used for the constructions of the library and each experiment in the RESULTS and MATERIALS AND METHODS sections. Since this is quite important for us and, we believe, the readers, we also added the explanations in the DISCUSSION section, detailing why we have constructed a series of shuttle vectors, as follows:

      Page 19, line 36:

      Mutational signatures identified in cancer cells are emerging as valuable markers for cancer diagnosis and therapeutics. Innumerable physical, chemical and biological mutagens, including anticancer drugs, induce characteristic mutations in genomic DNA via specific mutagenic processes. The mutation spectra obtained here by using the presented advanced method were in good agreement with accumulated data from previous papers where the conventional method had been used, with the advantage that our method provided less-biased mutation spectra data. As described above, the datasets presented here highlighted novel mutational signatures and also cluster mutations with a strand-bias, which could be associated with the processes of replication, transcription, or repair of DNA-damage, including a single strand break (a nick). In this study, eight series of supF shuttle vector plasmids were constructed, as presented in Supplementary Figure S1; however, the analysis was carried out using N12-BC libraries prepared from either pNGS2-K1 (Figures 1-4) or pNGS2-K3 (Figures 5-10). The pNGS2-K1/-A1/-K4/-A4 and pNGS2-K2/-A2/-K3/-A3 vector series contain an M13 intergenic region with opposite orientations relative to the supF gene, which allows us to incorporate specific types of DNA-damage at specific sites in the opposite strand of the vector library. Also, the pNGS2-K1/-A1/-K3/-A3 and pNGS2-K2/-A2/-K4/-A4 vector series contain the SV40 replication origin, which enables bidirectional replication and transcription, at opposite sides of the supF gene. Although this is still preliminary data, it is notable that the spontaneously induced mutations for the different vectors in U2OS cells were not significantly different. Therefore, the here presented mutagenesis assay with NGS, by using these series of libraries, can be applied in many different types of experiments to address both quantitative and qualitative features of mutagenesis. It is possible to design series of libraries containing DNA lesions or sequences suitable for the investigation of specific molecular mechanisms, such as TLS, template switching, and asymmetric cluster mutations.”

      CROSS-CONSULTATION COMMENTS Comment on the issue raised by Reviewer #2 regarding plasmids with unrepaired DNA damage introduced into E. coli after passage through U2OS cells: treatment of the plasmid harvest with Dpn1 eliminates un-replicated plasmid DNA. Also, SV40 T antigen drives run away replication of the plasmids, which contain the SV40 origin of replication. This greatly dilutes plasmids with remaining UV photoproducts.

      Reviewer #1 (Significance (Required)):

      Significance This is a comprehensive description of a technical advance for the analysis of mutations based on the most widely used system for reporting mutations in mammalian, including human, cells. As costs for NGS decline it is likely to become the approach of choice.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the authors developed a novel mutagenesis assay by combining the conventional supF forward mutagenesis assay with NGS technology. The manuscript is well written, providing design, methods, and results of the experimental system in very much details, which this reviewer highly evaluates. However, the manuscript may be too long and could be more concise. In addition, this reviewer is afraid that main figures seem difficult to fit printed pages (especially multi-paneled figures of large size, such as Fig. 5 through 8). The authors should re-organize the figures by reducing size and/or moving partly to supplementary information.

      We thank the Reviewer for the helpful comments to our manuscript. It is true that the multi-paneled figures were too large, and we have now re-analyzed and optimized most of the figures by reducing size, transferring to Supplementary Figures, and separating one figure into two. Although the number of Figures and Supplementary Figures have now increased, we believe that it has become easy to follow for readers and to fit printed pages. *We considered carefully the Reviewer’s remark about the length of the manuscript, but we feel that the text was already as concise as we could make it, and we have already left out some more detailed explanations. *

      1. Some UV-induced DNA damage (typically CPD) is repaired only slowly in human cells, so that the replicated plasmid DNAs recovered from U2OS cells may still contain damage and possibly induce mutations in E. coli after transfection. As the result of high sensitivity of NGS analysis, it is worried that such mutations could be also included in the results. To obtain even more accurate mutational characteristics in mammalian cells, the authors could consider to treat the DNA samples with photolyases before transformation of E. coli. The authors could consider to discuss on this point.

      *We thank the Reviewer for the helpful comment, indeed Dpn I treatment is one of the very important procedures for avoiding analysis bias. We have now expanded the explanation why the libraries have to be treated with Dpn I, as follows: *

      Page 11, line 4:

      the libraries were extracted from the cells, and treated with dam-GmATC-methylated DNA specific restriction enzyme Dpn I to digest un-replicated DNAs that contain UV-photoproducts.”

      1. It is quite intriguing that multiple mutations in a single BC clone tend to occur in the same DNA strand. Is there any trend in a distance between the mutated sites? Considering participation of TLS polymerases in the first round of replication, it may be interesting if multiple DNA lesions occur in relatively close positions so that TLS polymerases elongate the DNA strand without switching back to replicative polymerases.

      We thank the Reviewer for the valuable and insightful suggestions for this assay. We have analyzed the positions of SNSs in multiple-mutations shown in Supplementary Figures S11 and S12. As the reviewer mentioned, we may be able to address the mechanisms of TLS switching in mammalian cells by using this assay. In this study, the obtained non-biased mutation spectra of multiple mutations may not be enough for the static analysis, but our results indicate that multiple mutations were induced at relatively close positions. It would be interesting if we could address the mechanisms of TLS polymerase switching. We believe that the accumulation of large numbers of non-biased mutation spectra will provide us with growing opportunities to address more questions in mutagenesis. We have now added the Supplementary Figures S11 and S12, as well as the following discussion points:

      Page 14, line 6:

      5) The distance between two SNSs in multiple mutations induced by UV irradiation was relatively shorter than the theoretically expected based on the sequence (Supplementary Figures S11 and S12).”

      Page 18, line 27:

      “In addition, the positions of SNSs in the multiple mutations were closer to each other compared to the theoretically expected positions (Supplementary Figures S11 and S12), which may reflect switching events involving TLS polymerases. It should be noted that the presented data for the distance between two SNSs in the multiple mutations was analyzed from the data from selection plates in order to secure a sufficient number of mutations, and therefore, there may be a bias due to hot spots associated with the selection process. However, the results from the limited number of mutations from the titer plates are similar to these from the selection plates. It can be proposed that this assay may also be applied for analysis of TLS polymerases in mammalian cells.”

      1. This reviewer is wondering whether the results of mammalian cells are influenced by transcription-coupled repair in this experimental system. Because the SV40 replication origin functions as bidirectional promoters, the supF region may be transcribed in U2OS cells so that DNA damage on transcribed strands may be removed more efficiently than non-transcribed strands. Please comment on this, if relevant.

      *We thank the Reviewer for the insightful comments. This issue is also very important and interesting, and should be addressed in the mutagenesis research. That is exactly the reason why we presented series of vectors for the assay in this paper. The SV40 replication origin has an effect on the background mutations, which this is also dependent on the experimental conditions. However, this needs to be confirmed by further studies. We hope the idea for these constructions will be helpful for many laboratories. We have now added the following parts in the DISCUSSION section. *

      Page 18, line 36:

      Mutational signatures identified in cancer cells are emerging as valuable markers for cancer diagnosis and therapeutics. Innumerable physical, chemical and biological mutagens, including anticancer drugs, induce characteristic mutations in genomic DNA via specific mutagenic processes. The mutation spectra obtained here by using the presented advanced method were in good agreement with accumulated data from previous papers where the conventional method had been used, with the advantage that our method provided less-biased mutation spectra data. As described above, the datasets presented here highlighted novel mutational signatures and also cluster mutations with a strand-bias, which could be associated with the processes of replication, transcription, or repair of DNA-damage, including a single strand break (a nick). In this study, eight series of supF shuttle vector plasmids were constructed, as presented in Supplementary Figure S1; however, the analysis was carried out using N12-BC libraries prepared from either pNGS2-K1 (Figures 1-4) or pNGS2-K3 (Figures 5-10). The pNGS2-K1/-A1/-K4/-A4 and pNGS2-K2/-A2/-K3/-A3 vector series contain an M13 intergenic region with opposite orientations relative to the supF gene, which allows us to incorporate specific types of DNA-damage at specific sites in the opposite strand of the vector library. Also, the pNGS2-K1/-A1/-K3/-A3 and pNGS2-K2/-A2/-K4/-A4 vector series contain the SV40 replication origin, which enables bidirectional replication and transcription, at opposite sides of the supF gene. Although this is still preliminary data, it is notable that the spontaneously induced mutations for the different vectors in U2OS cells were not significantly different. Therefore, the here presented mutagenesis assay with NGS, by using these series of libraries, can be applied in many different types of experiments to address both quantitative and qualitative features of mutagenesis. It is possible to design series of libraries containing DNA lesions or sequences suitable for the investigation of specific molecular mechanisms, such as TLS, template switching, and asymmetric cluster mutations.”

      1. page 13: Please check whether the description of Fig. 9C is correct (6th line, graph on top; 9th line, bottom graph).

      We thank the Reviewer for carefully checking our manuscript, it was mislabeled in the text. Now, following the Reviewer’s comments, most figures have been changed from the figures in the previous submission. We appreciate the careful review.

      CROSS-CONSULTATION COMMENTS Reviewer #1 gives quite relevant comments as an expert of the mutagenesis field. It would improve this manuscript greatly for the authors to make appropriate modifications according to his/her suggestions.

      Reviewer #2 (Significance (Required)):

      It is quite convincing that this method has a great potential to give much more extensive information on mutational characteristics, most importantly, by eliminating the bias caused by phenotypic selection. Therefore, this work certainly must be worth being published in an appropriate journal.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, the authors developed a novel mutagenesis assay by combining the conventional supF forward mutagenesis assay with NGS technology. The manuscript is well written, providing design, methods, and results of the experimental system in very much details, which this reviewer highly evaluates. However, the manuscript may be too long and could be more concise. In addition, this reviewer is afraid that main figures seem difficult to fit printed pages (especially multi-paneled figures of large size, such as Fig. 5 through 8). The authors should re-organize the figures by reducing size and/or moving partly to supplementary information.

      Specific comments

      1. Some UV-induced DNA damage (typically CPD) is repaired only slowly in human cells, so that the replicated plasmid DNAs recovered from U2OS cells may still contain damage and possibly induce mutations in E. coli after transfection. As the result of high sensitivity of NGS analysis, it is worried that such mutations could be also included in the results. To obtain even more accurate mutational characteristics in mammalian cells, the authors could consider to treat the DNA samples with photolyases before transformation of E. coli. The authors could consider to discuss on this point.

      2. It is quite intriguing that multiple mutations in a single BC clone tend to occur in the same DNA strand. Is there any trend in a distance between the mutated sites? Considering participation of TLS polymerases in the first round of replication, it may be interesting if multiple DNA lesions occur in relatively close positions so that TLS polymerases elongate the DNA strand without switching back to replicative polymerases.

      3. This reviewer is wondering whether the results of mammalian cells are influenced by transcription-coupled repair in this experimental system. Because the SV40 replication origin functions as bidirectional promoters, the supF region may be transcribed in U2OS cells so that DNA damage on transcribed strands may be removed more efficiently than non-transcribed strands. Please comment on this, if relevant.

      4. page 13: Please check whether the description of Fig. 9C is correct (6th line, graph on top; 9th line, bottom graph).

      CROSS-CONSULTATION COMMENTS

      Reviewer #1 gives quite relevant comments as an expert of the mutagenesis field. It would improve this manuscript greatly for the authors to make appropriate modifications according to his/her suggestions.

      Significance

      It is quite convincing that this method has a great potential to give much more extensive information on mutational characteristics, most importantly, by eliminating the bias caused by phenotypic selection. Therefore, this work certainly must be worth being published in an appropriate journal.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The analysis of mutations in mammalian, including human, genomes has been of interest for many decades. Early DNA sequencing technologies enabled direct identification of mutations in target genes provided that the mutant genes could be readily isolated. This requirement stimulated the development of shuttle vector plasmids that carried a mutation marker gene and could replicate in both mammalian and bacterial cells. These were used in experiments in which the plasmids, treated with a mutagen, would be passaged through mammalian host cells after which the progeny plasmids were introduced into an indicator bacterial strain. Colonies with mutant marker genes could be distinguished by color or survival, the plasmids recovered, and the sequence of the mutant gene determined. The shuttle vector plasmid that became the most widely used contained as the marker the supF amber suppressor tyrosyl tRNA gene positioned in the plasmid such that deletion mutations associated with mammalian cell transfection were selected against. Although various improvements have been introduced since its introduction in the mid-1980s, including bar codes to distinguish independent from sibling mutations (in the early 1990s), the basics of the system have been maintained, and it and variations are still in use.

      The Kamiya group has made several adjustments to the supF shuttle vectors, including the construction of indicator bacterial strains based on survival of bacteria containing mutant supF genes (the initial system relied on colony color). They have published many studies of mutagenesis by various agents, error prone polymerases, etc. In the current submission they describe a comprehensive approach to identifying mutations in the supF gene that exploits Next Generation Sequencing technology that can identify the full spectrum of mutations including those that escape detection in phenotypic screens. The study is exhaustive and presents a methodical validation of each component of their approach. They report UV induced mutations, the mechanism of which has been well characterized in previous literature. They also describe a category of multiple mutations, which had been observed in the early work with the supF plasmids, and whose relationship to UV photoproducts is most likely indirect.

      Major comments:

      This manuscript presents a technical advance on the use of the supF mutation reporter system. The extent of the validation of each component of the system, including the bar code is rigorous. Their data on the nature and location of UV induced mutations are in very good agreement with previous studies with supF and other reporter genes, a further validation of their approach. Their discussion of the mechanism of the UV induced mutations is in accord with prior work from other laboratories. However, their interpretation of the multiple mutations, although reasonable in invoking a role for APOBEC deamination of cytosines (see eLife. 2014; 3: e02001 for another discussion of this issue), overlooks a much earlier study on the same topic that showed that nicks in the vicinity of the marker gene are mutagenic and can induce multiple mutations (Proc Natl Acad Sci 1987 84:4944-8). It would be useful for the authors to consider their data on the multiple mutations in the light of the earlier analysis. Furthermore, a check to verify the covalently closed circular integrity of the plasmid preparations would be an important quality control and could reduce the mutagenesis observed in 0 UV controls.

      Minor points:

      The authors state that only 30% of the base sequence of the supF gene can be "used for dual-antibiotic selection on the indicator E. coli". An earlier review (Mutation Res 220: 61,1989) indicated that within the mature tRNA region single or tandem mutations had been reported at 87% of sites, using the colony color assay. The direct NGS analyses would be indifferent to phenotype, and one would expect the maximum number of mutable sites would be recovered from this approach. It would be helpful for an explicit statement regarding the number of mutant sites to be in the Discussion, as this should strengthen the case for the NGS strategy. <br /> Supplementary Figure 1 shows the organization of 8 supF reporter plasmids. Were these discussed in the text and employed in the experiments? It was not clear in the text.

      CROSS-CONSULTATION COMMENTS

      Comment on the issue raised by Reviewer #2 regarding plasmids with unrepaired DNA damage introduced into E. coli after passage through U2OS cells: treatment of the plasmid harvest with Dpn1 eliminates un-replicated plasmid DNA. Also, SV40 T antigen drives run away replication of the plasmids, which contain the SV40 origin of replication. This greatly dilutes plasmids with remaining UV photoproducts.

      Significance

      Significance:

      This is a comprehensive description of a technical advance for the analysis of mutations based on the most widely used system for reporting mutations in mammalian, including human, cells. As costs for NGS decline it is likely to become the approach of choice.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      Review of "Identifying novel regulators of placental development using time series transcriptomic data and network analyses."

      The authors present a detailed bioinformatic assessment of mouse developmental time series of the placenta. They apply current data mining and analysis methods to identify protein-centred networks that are likely enriched to specific cell types of the placenta. They then translate these findings to humans using statistical comparisons of human single-cell sequencing data of the placenta. Lastly, they use knock-down experiments to validate the conserved functional importance of the hub genes in the mouse protein networks in human cells.

      The strengths of this paper are the rigorous data mining methods and the functional translation to humans from mice. There are no critical weaknesses to the article. There is a blend of statistical analysis with anecdotal or hand curation from databases and the literature, but it is unclear if these curated finings are circumstantial or statistically meaningful. In the end, the hypothesis seems to hold in that 4/4 gene knocked down in the human cells gave a migration phenotype.

      Comments, questions, critique:

      1. Given the translational aims of the paper, more introduction/discussion material on the comparative aspects of mice and humans are needed. Are giant cells and EVT the same? What are the cell equivalents that you are discovering? The Soncin et al. paper is cited, but I think underused. This publication contains time series data on mice and humans and could be used as external validation of clusters, networks, and other analyses. Other publications to consider for context are

      2. Cox B, et al. Mol Syst Biol 5: 279.

      3. Silva JF, Serakides R. 2016. Cell Adhes Migr 10: 88-110. (specifically discusses migration difference between the species placentae)

      We thank the reviewer for the comment and valuable resources. We agree that more information about the similarities and differences between the migratory cells needs to be provided. We have added the following details in the introduction of the manuscript:

      “Although there are certain differences between the mouse and human placenta (Hemberger, Hanna, and Dean 2020; Soncin, Natale, and Parast 2015), they do express common genes during gestation, including common regulators and signaling pathways involved in placental development (Cox et al. 2009; Soncin et al. 2018; Soncin, Natale, and Parast 2015; Watson and Cross 2005). For example, Ascl2/ASCL2 and Tfap2c/TFAP2C are required for the trophoblast (TB) cell lineage in both mouse and human models (Guillemot et al. 1994; Kuckenberg, Kubaczka, and Schorle 2012; Varberg et al. 2021). Another example is the HIF signaling pathway, which regulates TB differentiation in both mouse and human placenta (Soncin, Natale, and Parast 2015).”

      “Although the structure of the placenta is not identical between mouse and human, certain mouse placental cell types are thought to be equivalent to human placental cell types (Soncin, Natale, and Parast 2015). For example, parietal TGCs and glycogen TBs have been described as equivalent to human extravillous trophoblasts (EVTs) (Soncin, Natale, and Parast 2015). Mouse TGCs are not as invasive as human EVTs (Soncin, Natale, and Parast 2015), and they have different levels of polyploidy and copy number variation (Morey et al. 2021); however, both EVTs and TGCs are able to degrade extracellular matrix to enable TB migration into the decidua (Silva and Serakides 2016).”

      Added to discussion:

      “These genes were selected primarily based on the network analyses, but also based on expression data from human cells to account for possible differences between mouse and human placental gene expression.”

      As the reviewer suggested, we used the Soncin et al., 2015 data for validation. Only 6,317 of the 11,713 protein-coding genes used for hierarchical clustering were detected in the mouse dataset in Soncin et al., 2015. This issue could be because the Soncin data was generated using microarrays.

      Nevertheless, we still compared our e7.5 and e9.5 hierarchical groups with: (1) Soncin et al. gene clusters in mouse that were downregulated over time, had highest expression from e9.5-12.5, or were upregulated over time; and (2) Soncin et al. gene clusters in human that were best correlated with mouse clusters and were either downregulated over time or upregulated over time. We observed a general consensus that our e7.5-hierarchical group had the highest percent of agreement with Soncin et al. gene groups that are downregulated over time, and our e9.5-hierarchical group had the highest percent of agreement with Soncin et al. gene groups that either have highest expression at e9.5-e12.5 or genes that are upregulated over time. This data is added below, described in the results section 1, and included in Supplementary Table S1.

      Comparison with Soncin et al. mouse data:

      Having expression > 0 (in Soncin et al.) and being in any hierarchical clusters

      E7.5-hierarchical genes (down-regulation trend)

      E9.5-hierarchical genes (up-regulation trend)

      Cluster 2, 3 and 7 (Soncin et al., downregulation trend)

      1009

      800 (79.3%)

      279 (27.7%)

      Cluster 6 (Soncin et al., highest at e9.5 – e12.5)

      120

      51 (42.5%)

      110 (91.7%)

      Cluster 1, 4 and 5 (Soncin et al., upregulation trend)

      1019

      415 (40.7%)

      881 (86.5%)

      Comparison with Soncin et al. human data:

      Having expression > 0 (in Soncin et al.) and being in any hierarchical clusters

      E7.5-hierarchical genes (down-regulation trend)

      E9.5-hierarchical genes (up-regulation trend)

      HS Cluster 5 (Soncin et al., downregulation trend)

      164

      92 (56.1%)

      52 (31.7%)

      HS Cluster 2 and 4 (Soncin et al., upregulation trend)

      111

      44 (39.6%)

      72 (64.9%)

      The following statement was added to the result section:

      “Second, we compared our hierarchical groups with previously published mouse and human placental microarray time course data from Soncin et al., 2015 (Soncin, Natale, and Parast 2015). Despite the technical differences between the datasets, we observed a consensus that our e7.5 hierarchical cluster had the highest percent of overlap with Soncin et al. gene groups that are downregulated over time, and our e9.5 hierarchical cluster had the highest percent of overlap with Soncin et al. gene groups that either have highest expression at e9.5 - e12.5 or genes that are upregulated over time (Supplementary Table S1).”

      Clustering represented in Figure 1B, was this a supervised model? Why only three clusters?) Did you specify that there would be three models and force each gene profile into one of the categories? How robust are the fits? A fitted model might be a better approach as you can specify the ideal models (early high, late high and mid-high), then determine each gene profile that fits each model and only assess those genes with a significant fit to the model. Forcing clustering to the three-model fit likely gives many poorly fitting profiles. While in the end, this works out, it may be due to applying other post hoc methods for gene enrichment, where noise distributes randomly.

      We carried out unsupervised transcript clustering using hierarchical clustering (agglomerative approach using Euclidean distance and complete linkage). The resulting dendrogram was cut at the second highest level to obtain three clusters. We have added additional validation with different numbers of clusters (k = 3, 4 and 5) and quantification of agreement between different clustering methods to show the robustness of the hierarchical clusters. We acknowledge that hierarchical clustering could be sensitive to noise and could result in poorly fitted transcripts in each group; however, it was a necessary first step for us to identify genes relevant to the distinct placental processes at the three timepoints. Acknowledging this disadvantage, we only focused the analyses on genes that are differentially expressed over time and were present in the timepoint hierarchical groups.

      We added the additional analysis as Supplementary Figure S1, and the following statements were added in the results section:

      "First, we used three different algorithms, K-means clustering, self-organizing maps, and spectral clustering, to validate the trends of the expression levels in hierarchical groups, as well as the number of transcript groups (k = 3, 4 and 5). Only with k = 3 did we obtain groups with median expression level trends consistent in all four algorithms (Supplementary Figure S1). Moreover, with k = 3, the maximum percent of agreement (see Materials and Methods) between hierarchical clusters and clusters obtained using each of the different algorithms was 70.34-87.26% (Supplementary Figure S1), while the maximum percent of agreement between hierarchical clusters and clusters obtained from other algorithms decreases to between 55.67-65.72% with k = 4 and 54.81-59.19% with k = 5.”

      We agree model-based clustering could be an alternative approach and have added it to the discussion section:

      “Combining hierarchical clustering with differential expression analysis, we were able to identify gene groups using an unsupervised approach. It has also been shown that for times-series analyses with fewer than eight timepoints, pairwise differential expression analysis combined with additional methods identifies a more robust set of genes (Spies et al. 2019). Alternatively, model-based clustering using RNA-seq profiles (Si et al. 2014) could also be useful for gene group identification. However, it is still important to evaluate the robustness and functional relevance of the fitted models by carrying out additional downstream analyses.”

      Several statements are made about the conservation of importance between mouse and human hub genes. For example, "We predict these highly expressed genes to be generally important for TB function and processes such as cell migration, a term associated with multiple timepoint specific networks (Figure 2A)." While your knock-down assay of migration results shows these hub genes to be necessary to humans, what do they mean to the mouse? You did not use mouse TSC to assess functional importance concurrently. You note a small number of genes as of known importance, "127 hub genes of which 16 have been annotated as having a role in placental development". Were the others knocked out but lack a developmental phenotype or not assessed? Are these functionally redundant in the mouse or not involved in the same processes between the species?

      To assess the possible role of hub genes in mouse development more comprehensively, we extended our search for gene functions on the Mouse Genome Informatics (MGI) database to include not only placenta related GO and MGI phenotype terms (defined as “genes with known roles”), but also embryo related GO and MGI phenotype terms (defined as “genes with possible roles”). We included embryo related terms as “genes with possible roles” because embryonic lethal mouse knockout lines frequently have placentation defects, and because defects in placental development can be associated with the development of other embryonic tissues (Brown and Hay 2016; Perez-Garcia et al. 2018; Woods, Perez-garcia, and Hemberger 2018). This change resulted in an increase in the number of genes with relevant functions in mouse, including several annotated as embryonic lethal or with abnormal embryonic growth (see Supplementary Table S6). With the additional annotations:

      • 6 out of 17 hub genes of e7.5 networks have known/possible roles.
      • 17 out of 28 hub genes of e8.5 networks have known/possible roles.
      • 48 out of 127 hub genes of e9.5 networks have known/possible roles. We also carried out randomization tests to determine if the number of known/possible genes we identified were significant. Randomization tests were carried out with the following procedure: for each timepoint, from the respective timepoint-specific groups, we sampled 10,000 gene sets of the same number as the hub gene numbers. Then we counted the number of known/possible genes in each random set. A p-value is calculated as the number of times a random gene set has ≥ known/possible genes than the observed number, divided by 10,000. We found that the number of genes with known/possible roles at each time point are statistically significant (Supplementary Figure S3). This result indicates that the gene sets we identified are significantly associated with relevant phenotypes in mouse.

      The remaining hub genes are unannotated as related to placental or embryonic functions in the MGI database. Based on that, it is difficult to determine if they lack a relevant phenotype, or if there has not been a detailed assessment of the placenta.

      Added to section 2 of the result section:

      “Briefly, genes annotated under any GO or MGI phenotype terms related to placenta, TB cells, TE and the chorion layer are considered as having a “known” role in the placenta. Genes annotated under terms related to embryo are considered as having a “possible” role in the placenta, because embryonic lethal mouse knockout lines frequently have placentation defects, and because defects in placental development can be associated with the development of other embryonic tissues (Brown and Hay 2016; Perez-Garcia et al. 2018; Woods, Perez-garcia, and Hemberger 2018). Hereafter, such genes are referred to as “known/possible genes”. In the e7.5 networks, there were 17 hub genes in which six genes were known/possible. The number of hub genes that are labelled as known/possible is statistically significant when comparing to random gene sets selected from the e7.5 timepoint-specific group (Supplementary Figure S3). In the e8.5 and e9.5 networks, 17 out of 28 and 48 out of 127 hub genes were known/possible, respectively. Similar to e7.5, the number of hub genes labelled as known/possible in e8.5 networks and e9.5 networks were both statistically significant when comparing to random gene sets selected from the corresponding timepoint-specific groups (Supplementary Figure S3). These results indicate that the gene sets we identified are significantly associated with relevant phenotypes in the mouse.”

      For the four genes that we tested in HTR-8/SVneo cells, we also added more information about the current known role of the gene in mouse.

      Added to the discussion section:

      “We identified hub genes and their immediate neighboring genes which could regulate placental development and confirmed the roles of four novel genes (Mtdh, Siah2, Hnrnpk and Ncor2) in regulating cell migration in the HTR-8/SVneo cell line. These genes were selected primarily based on the network analyses, but also based on expression data from human cells to account for possible differences between mouse and human placental gene expression. Previous studies suggested these four candidates are functionally important in mouse. Mtdh has been suggested to regulate cell proliferation in mouse fetal development (Jeon et al. 2010). The Siah gene family is important for several functions (Qi et al. 2013). Of relevance to the placenta, Siah2 is an important regulator of HIF1α during hypoxia both in vitro and in vivo (Qi et al. 2008). Moreover, while Siah2 null mice exhibited normal phenotypes, combined knockouts of Siah2 and Siah1a showed enhanced lethality rates, suggesting the two genes have overlapping modulating roles (Frew et al. 2003). Hnrnpk-/- mice were embryonic lethal, and Hnrnpk+/- mice had dysfunctions in neonatal survival and development (Gallardo et al. 2015) . Ncor2-/- mice were embryonic lethal before e16.5 due to heart defects (Jepsen et al. 2007). According to the International Mouse Phenotyping Consortium database (Dickinson et al. 2016), Ncor2 null mice also showed abnormal placental morphology at e15.5. However, none of these genes have been studied in TB migration function.”

      In determining conservation between mouse and human networks, were only 1:1 orthologs examined or did you consider more complex 1:many mapping conditions between the two species?

      In this work, we used only one-to-one orthology between mouse and human avoid duplication while sampling in the enrichment tests. We added this detail in the method section. However, as found in Cox et al., 2009, genes with one-to-many orthologs could be highly intriguing and should be investigated in future studies.

      Should the migration assay be normalized to survival/adhesion? If 70,000 cells were seeded but had 50% cell death (or reduced adhesion), then it may appear to be poor migration. Should the migration be evaluated as a ratio of top to bottom cell densities to control for poor adhesion or survival?

      We thank the reviewer for bringing up this important point. Unfortunately, with the method we used we cannot quantify the densities on top, because the cells on top need to be scraped off prior to measuring the cells at the bottom (the two densities cannot be measured separately). To help with this concern, in a separate experiment we instead counted cell numbers 48-hours post-transfection for cells treated with target gene siRNA and cells treated with negative control siRNA to determine if apoptosis or changes in proliferation rate could be leading to changes in the observed migration. From this data, we determined that none of the siRNA knockdowns resulted in a significant change of cell counts (p-value > 0.05). We do note that Siah2 siRNA #1 has some decrease in counts (p-value = 0.081) and Ncor2 siRNA #1 and #2 have some increase in cell counts (p-value = 0.081 and p-value = 0.077) (Supplementary Figure S7). Additional follow up experiments we have performed with our targets of interest, which are out of the scope of this paper, demonstrate that different pathways and processes could be involved in the resulting decrease in migration we observed (we are following up experimentally in more detail for each gene). Proliferation and other assays could also be used to further examine the increase in Ncor2 cell counts that were observed. We have added the cell count results and additional text to the discussion.

      Added to results, section 4:

      “When comparing the number of cells 48 hours post-transfection for cells treated with target gene siRNA to cells treated with negative control siRNA, we determined that none of the target gene siRNA treatments resulted in significant changes in cell counts. We do note that Siah2 siRNA #1 has some decrease in cell counts (p-value = 0.081), and Ncor2 siRNA #1 and Ncor2 siRNA #2 have some increase in cell counts (p-value = 0.081 and p-value = 0.077) compared to negative control treated samples (Supplementary Figure S7). This provides evidence that, in general, the reduction in cell migration capacity was likely not due to the target gene impacting the rate of cell death.”

      To the discussion:

      “Moreover, we observed that cell counts generally were not decreased upon target gene knockdown compared to negative control knockdown. However, more detailed analysis and process specific assays are needed. For example, future studies assessing each gene’s role in cell adhesion, cell-cell fusion, cell proliferation and cell apoptosis can be done to better understand their roles in placental development.”

      Reviewer #1 (Significance (Required)):

      This significantly advances previous publications on this topic by functionally testing the discovered genes.

      This highlights an excellent data mining strategy for a developmental disease using mice and translating to humans.

      The audience is likely developmental biologists and reproductive specialists.

      My expertise is bioinformatics and developmental biology.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors used RNA-seq data from mouse fetal placenta at e7.5, e8.5, and e9.5 to create timepoint-specific gene expression interaction networks to find genes that they predicted would regulate placental development. They confirmed four novel candidate genes and showed that in the transfected human trophoblast HTR-8/SVneo cell line, these four candidates reduced cell migration capacity. Additionally, the authors show that bulk RNA-seq data can be used to infer cell-type composition and when used with single-cell RNA-seq, can be a powerful tool to study the biological processes that involve multiple cell-types.

      Overall, the authors are rigorous in their analyses, their conclusions appear sound, and the work could be an asset to the broader placental biology field. However, although the authors present an approach that future studies might find useful to replicate and their work has produced numerous novel transcripts/genes that warrant further investigation, the approach is not entirely novel, and could be expanded/improved (as suggested by the authors in the discussion), particularly with regard to validation of the genes/networks identified. Major and minor comments are listed below.

      Major comments:

      1) The authors used clustering and differential expression analysis to define sets of timepoint-specific genes. However, it was not clear to me the benefits of this approach. Why would using this approach be better than differential expression analysis alone such as in a typical ANOVA?

      We have added more discussion on this matter to explain our approach. We believe using hierarchical clustering and pairwise differential expression analysis can help identify gene lists with higher confidence. These are the new details we added to the discussion section:

      “Combining hierarchical clustering with differential expression analysis, we were able to identify gene groups using an unsupervised approach. It has also been shown that for times-series analyses with fewer than eight timepoints, pairwise differential expression analysis combined with additional methods identifies a more robust set of genes (Spies et al. 2019). Alternatively, model-based clustering using RNA-seq profiles (Si et al. 2014) could also be useful for gene group identification. However, it is still important to evaluate the robustness and functional relevance of the fitted models by carrying out additional downstream analyses.”

      2) Related to number 1 above, although the authors are interested in timepoint-specific transcripts, the author's methods would filter out possibly interesting transcripts that turn on and off during development. The authors might want to check to see if there are transcripts that are up in e7.5 and then down in e8.5 but then up again in e9.5. Also, the author's methods seem to include transcripts that are not exclusive to one timepoint (i.e. are up in e7.5 and e8.5 but not e9.5). It might be interesting to differentiate transcripts that are exclusive to one timepoint from those that are in more than one timepoint.

      We thank the reviewer for their valuable comment. We agree genes that turn on and off during the time course could be very interesting. In performing this analysis, we found that the number of such genes is rather small (38 genes that are up-regulated at e7.5 compared to e8.5 and up-regulated at e9.5 compared to e8.5). These genes were not enriched for processes that we observed with timepoint-specific gene groups, such as “trophoblast giant cell differentiation” (e7.5-specific genes), “labyrinthine layer development” (e8.5- and e9.5-specific genes), "blood vessel development” (e7.5- and e9.5-specific genes) and “response to nutrient” (e9.5-specific genes) (Supplementary Table S3). They are generally enriched for processes related to cytokine production and regulation of secretion.

      We also agree that it is interesting to differentiate transcripts that are exclusive to one time point from those that are in more than one time point. In the revised manuscript, we added additional analysis for genes that belong to multiple timepoint groups due to different transcripts of the same gene being annotated as timepoint-specific, and genes unique to each timepoint (Added to results section 1):

      “It is possible that timepoint-specific groups share genes that have timepoint-specific transcripts. Indeed, we identified 37 genes shared between e7.5 and e8.5, 5 genes shared between e7.5 and e9.5, and 109 genes shared between e8.5 and e9.5 (Supplementary Table S3). We found that genes only present at one timepoint (timepoint-unique genes) were generally enriched for similar terms as the full group of timepoint-specific genes (Supplementary Table S3). However, terms related to the development of labyrinth layer like “labyrinthine layer morphogenesis” and “labyrinthine layer blood vessel development” were only enriched when using all e8.5-specific genes but not when using e8.5 timepoint-unique genes. Moreover, we found that, unlike genes shared between e9.5 and e7.5, genes shared between e9.5 and e8.5 were enriched for processes such as “blood vessel development” and “insulin receptor signaling pathway”. This observation may indicate that different transcripts of the same genes could be expressed at different timepoints for the continuation of certain biological processes.”

      3) In the network analysis it would be interesting and helpful to the reader to highlight, if any, nodes or terms that were found to be significant (i.e. hubs or genes that have a high centrality metric etc.) in both the STRING and GENIE3 networks or overlap the networks created by the two different algorithms to compare them. This might help readers better rank genes when using these data to decide what genes are most important at each timepoint.

      We observed only one hub gene shared among networks inferred by the two methods (Vegfa in the e9.5 networks). However, hub genes of networks inferred by one method could be nodes in networks inferred by the other method. Hence, we have added lists of such genes in section 2. Interestingly, many of these genes have known roles in placental development. In terms of biological functions shared between the networks at the same timepoints, there were multiple interesting processes such as “positive regulation of cell migration”, “epithelium migration” and “vasculature development”, which we highlighted in Figure 2A.

      In the revised manuscript, we have added the following details in different paragraphs of section 2 of the results:

      “Although the networks inferred by the two methods did not share any hub genes, hub genes identified with one method could be members of the other method’s networks. These hub genes are Mmp9 (e7.5_1_STRING), Frk, Hmox1, and Nr2f2 (e7.5_2_GENIE3) (Table 1). This observation strengthens the potential roles of Frk gene in placental development.”

      “Hub genes identified with one method and present in the other method’s networks are Hsp90aa1, Akt1, and Mapk14 (e8.5_1_STRING), Dvl3 and Msx2 (e8.5_2_GENIE3) (Table 1).”

      “Hub genes identified with one method and present in the other method’s networks include important genes such as Rb1 (Sun et al. 2006), Yap1 (Meinhardt et al. 2020) (e9.5_1_GENIE3) and Vegfa (e9.5_2_STRING) (Table 1). Notably, Vegfa is the only hub gene identified with both of the network inference methods.”

      4) The author's conclusion that network analysis can be used to identify genes more likely associated with specific placental cell types is very likely true, but I think that the conclusion would be more impactful if the authors reported how the method compares to simply taking a list of differentially expressed genes and looking for cell type enrichments using their favorite enrichment software. For example, if a gene is highly connected in a particular network that has been identified as SCT-specific, but that gene isn't considered an SCT "marker" by the placental biology research community, it would be interesting to highlight that it is prevalent in a previously published scRNA-seq dataset or a dataset that has isolated that particular cell type to show the advantages of using networks to find placental cell type specific genes.

      We completely agree with the reviewer’s point and have now added a randomization analysis to compare the enrichment using PlacentaCellEnrich (PCE) with genes in networks and random genes (Supplementary Figure S6). We randomly sampled 10,000 gene sets with the same sizes as the subnetworks from their corresponding hierarchical groups and carried out PCE analysis. These tests showed that the enrichments of cell type-specific genes were only significant with the subnetwork genes but not the random genes. The randomization tests added a valuable highlight that the network genes are highly relevant to cell type-specific genes in the human placenta, and therefore provided more confidence in the gene lists obtained from the network analyses.

      We also further checked the expression of the hub genes in other independent data in order to identify hub genes that are potentially cell type specific markers. For example, we observed that Dvl3 (e8.5_2_GENIE3) and Olr1 (e9.5_3_STRING) have been shown to be differentially expressed in SCT compared to other TB subtypes (human trophoblast stem cells, EVT (Sheridan et al. 2021) or endovascular TB (Gormley et al. 2021)).

      We added the following detail in the results, section 3:

      “Importantly, randomization tests showed that the enrichment of cell type-specific genes were only significant in these subnetworks but not in random gene sets selected from corresponding timepoint hierarchical groups (Supplementary Figure S6), which highlights the biological relevance of the gene network modules.”

      Added to the discussion section:

      “Moreover, hub genes could be used to identify potential novel markers for the cell types corresponding to their subnetworks. For example, hub genes of subnetworks enriched for SCT-specific genes such as Dvl3 (e8.5_2_GENIE3) and Olr1 (e9.5_3_STRING) are not established SCT marker genes, but are in fact differentially expressed in SCT compared to human trophoblast stem cells, EVT (Sheridan et al. 2021) or endovascular TB (Gormley et al. 2021). In general, combining network analysis with existing gene expression data from single cell or pure cell populations will allow identification of novel cell-specific marker genes to help future studies focused on different TB populations.”

      5) While the selection of genes for validation was limited by the model system available for testing, the authors should recognize that the genes/networks identified here should first and foremost be validated in a mouse model (by knockdown/overexpression studies using mouse trophoblast stem cells or by evaluation of placenta/embryo in a KO/transgenic mouse model). Whether or not the data are relevant to human placentation is (at least initially) irrelevant. While we recognize that these are difficult studies requiring significant time and resources, as is, the data and results will have significantly less impact than if even a limited amount of such validation could be performed.

      We thank the reviewer for this valuable comment. Based on this comment and the suggestions from reviewer #1, we have added the following points to the manuscript to discuss the relevance of the genes in the mouse models, and further explain our gene choices:

      To assess the possible role of hub genes in mouse development more comprehensively, we extended our search for gene functions on the Mouse Genome Informatics (MGI) database to include not only placenta related GO and MGI phenotype terms (defined as “genes with known roles”), but also embryo related GO and MGI phenotype terms (defined as “genes with possible roles”). We included embryo related terms as “genes with possible roles” because embryonic lethal mouse knockout lines frequently have placentation defects, and because defects in placental development can be associated with the development of other embryonic tissues (Brown and Hay 2016; Perez-Garcia et al. 2018; Woods, Perez-garcia, and Hemberger 2018). This change resulted in an increase in the number of genes with relevant functions in mouse, including several annotated as embryonic lethal or with abnormal embryonic growth (see Supplementary Table S6). With the additional annotations:

      • 6 out of 17 hub genes of e7.5 networks have known/possible roles.
      • 17 out of 28 hub genes of e8.5 networks have known/possible roles.
      • 48 out of 127 hub genes of e9.5 networks have known/possible roles. We also carried out randomization tests to determine if the number of known/possible genes we identified were significant. Randomization tests were carried out with the following procedure: for each timepoint, from the respective timepoint-specific groups, we sampled 10,000 gene sets of the same number as the hub gene numbers. Then we counted the number of known/possible genes in each random set. A p-value is calculated as the number of times a random gene set has ≥ known/possible genes than the observed number, divided by 10,000. We found that the number of genes with known/possible roles at each time point are statistically significant (Supplementary Figure S3). This result indicates that the gene sets we identified are significantly associated with relevant phenotypes in mouse.

      The remaining hub genes are unannotated as related to placental or embryonic functions in the MGI database. Based on that, it is difficult to determine if they lack a relevant phenotype, or if there has not been a detailed assessment of the placenta.

      Added to section 2 of the result section:

      “Briefly, genes annotated under any GO or MGI phenotype terms related to placenta, TB cells, TE and the chorion layer are considered as having a “known” role in the placenta. Genes annotated under terms related to embryo are considered as having a “possible” role in the placenta, because embryonic lethal mouse knockout lines frequently have placentation defects, and because defects in placental development can be associated with the development of other embryonic tissues (Brown and Hay 2016; Perez-Garcia et al. 2018; Woods, Perez-garcia, and Hemberger 2018). Hereafter, such genes are referred to as “known/possible genes”. In the e7.5 networks, there were 17 hub genes in which six genes were known/possible. The number of hub genes that are labelled as known/possible is statistically significant when comparing to random gene sets selected from the e7.5 timepoint-specific group (Supplementary Figure S3). In the e8.5 and e9.5 networks, 17 out of 28 and 48 out of 127 hub genes were known/possible, respectively. Similar to e7.5, the number of hub genes labelled as known/possible in e8.5 networks and e9.5 networks were both statistically significant when comparing to random gene sets selected from the corresponding timepoint-specific groups (Supplementary Figure S3). These results indicate that the gene sets we identified are significantly associated with relevant phenotypes in the mouse.”

      For the four genes that we tested in HTR-8/SVneo cells, we also added more information about the current known role of the gene in mouse.

      Added to the discussion section:

      “We identified hub genes and their immediate neighboring genes which could regulate placental development and confirmed the roles of four novel genes (Mtdh, Siah2, Hnrnpk and Ncor2) in regulating cell migration in the HTR-8/SVneo cell line. These genes were selected primarily based on the network analyses, but also based on expression data from human cells to account for possible differences between mouse and human placental gene expression. Previous studies suggested these four candidates are functionally important in mouse. Mtdh has been suggested to regulate cell proliferation in mouse fetal development (Jeon et al. 2010). The Siah gene family is important for several functions (Qi et al. 2013). Of relevance to the placenta, Siah2 is an important regulator of HIF1α during hypoxia both in vitro and in vivo (Qi et al. 2008). Moreover, while Siah2 null mice exhibited normal phenotypes, combined knockouts of Siah2 and Siah1a showed enhanced lethality rates, suggesting the two genes have overlapping modulating roles (Frew et al. 2003). Hnrnpk-/- mice were embryonic lethal, and Hnrnpk+/- mice had dysfunctions in neonatal survival and development (Gallardo et al. 2015) . Ncor2-/- mice were embryonic lethal before e16.5 due to heart defects (Jepsen et al. 2007). According to the International Mouse Phenotyping Consortium database (Dickinson et al. 2016), Ncor2 null mice also showed abnormal placental morphology at e15.5. However, none of these genes have been studied in the context of TB migration.”

      Minor comments:

      1) In the GO analysis, why not use a combination of hypergeometric and binomial distribution for enrichment decisions?

      We used hypergeometric tests as in the default setting of ClusterProfiler. GO enrichment with hypergeometric test for differentially expressed genes was also suggested in Rivals et al., 2007 (Rivals et al. 2007). Combination of hypergeometric and binomial tests will be of great use when carrying out enrichment for cis-regulatory domains where there is a higher chance of sampling a gene randomly (McLean et al. 2010).

      We have added this detail in the method section to make the analysis clearer.

      2) In Figure 2B, are there any genes that are both hub nodes (diamonds) and annotated as having placental functions (squares)? If so, it might be good to show that in some way.

      We agree this is necessary and have altered the presentation in Figure 2. In the revised manuscript, we have added an additional list of hub genes as genes with possible roles. The figure now shows hub genes with known placental functions (diamonds), hub genes with possible functions (hexagons) and hub genes without related annotation (rounded squares). Non-hub genes are now not shown to avoid crowdedness.

      3) It might improve the deconvolution analysis to employ more than one method and recent reports have shown that the cell-type signature data is the most important parameter with the main factors influencing performance being biological (such as where the sample was taken) rather than technical (https://doi.org/10.1038/s41467-022-28655-4).

      We agree the conclusion would have been further confirmed if we could employ another deconvolution method. Upon literature search, we found another tool, CAM (N. Wang et al. 2016), that had similar approaches to LinSeed which aims to infer cell proportions without reference. However, the tool has been taken down from Bioconductor and is not currently maintained. As a result, to the best of our knowledge, LinSeed is the only deconvolution tool that is completely reference-free.

      We also tried carrying out the deconvolution analysis with another method, DSA (Zhong et al. 2013), with a limited number of marker genes obtained through literature review. However, when the marker genes are highly correlated in multiple cell types, the models failed to infer meaningful proportions.

      We acknowledge that we need additional single cell RNA-seq data or marker genes obtained from pure cell populations to make more concrete conclusions for the deconvolution analysis. We hope with future studies, there will be more evidence supporting our observations.

      We have added this acknowledgement in the results section:

      “The identification of these cell groups could have resulted from noise introduced by both biological and technical variation, which is challenging to overcome when using a small sample size or analyzing without prior knowledge in the deconvolution analysis.”

      Added to the discussion section:

      “Nevertheless, we acknowledge that our deconvolution analysis and cell type annotations were limited due to the absence of matching scRNA-seq data, data from pure cell populations, or extensive cell marker lists. As these types of information are available, deconvolution analysis can be used to identify species-specific cell types or correcting for confounding effects prior to DEA (Sutton et al. 2022).”

      4) The above report also shows that there are ways to correct for cell-type composition differences in DEA which might be interesting to look when using bulk data from different timepoints in future studies when focusing on different biological processes and not timepoint-specific transcripts.

      We agree correcting for cell proportion prior to differential expression analysis will be interesting for future studies. When single cell RNA-seq data or more extensive marker gene lists are available, deconvolution analysis will be of great use for this purpose.

      We have added this in the discussion section (also mentioned in point #3):

      “Nevertheless, we acknowledge that our deconvolution analysis and cell type annotations were limited due to the absence of matching scRNA-seq data, data from pure cells, or extensive cell marker lists. As these types of information become more available, deconvolution analysis can be used to identify species-specific cell types or correcting for confounding effects prior to DEA (Sutton et al. 2022).”

      5) Could the authors speculate as to possible reason(s) that an siRNA knockdown would give variable results functionally, while the actual gene expression appears to be consistently and sufficiently downregulated? Did the authors evaluate protein levels following siRNA knockdown?

      Following the reviewer’s comment, we have evaluated protein levels for each target gene and each siRNA. For the genes that gave variable results between siRNAs (MTDH and NCOR2), we did not observe a change in their ability to reduce protein levels (Supplementary Figure S7). It is therefore possible that there are off-target effects for one of the siRNAs. We considered this possibility in designing the project, which is why we tested two siRNAs per target gene. Although siRNA off-target effects may be present, visual inspection of the migration experiments indicate that transfection with each of the siRNAs reduces migration capacity. We have added the possibility of off-target effects in the discussion section:

      “We observed that while all siRNAs were able to decrease cell migration capacity, there was variability in the amount of decrease, even when comparing two siRNAs targeting the same gene. This observation did not seem to be associated with differences in transcript or protein knockdown levels and could be due to different off-target effects for different siRNAs.”

      6) As mentioned in the discussion, finding genes that have timepoint dependent isoforms would an interesting and novel addition to the manuscript.

      Protein isoforms would be interesting to study. Here we focused on different mRNA transcripts. We carried out additional GO analysis on the genes unique to each timepoint and genes shared among timepoints. This was also done in response to major comment 2:

      In the revised manuscript, we added additional analysis for genes that belong to multiple timepoint groups due to different transcripts of the same gene being annotated as timepoint-specific, and genes unique to each timepoint (Added to results section 1):

      “It is possible that timepoint-specific groups share genes that have timepoint-specific transcripts. Indeed, we identified 37 genes shared between e7.5 and e8.5, 5 genes shared between e7.5 and e9.5, and 109 genes shared between e8.5 and e9.5 (Supplementary Table S3). We found that genes only present at one timepoint (timepoint-unique genes) were generally enriched for similar terms as the full group of timepoint-specific genes (Supplementary Table S3). However, terms related to the development of labyrinth layer like “labyrinthine layer morphogenesis” and “labyrinthine layer blood vessel development” were only enriched when using all e8.5-specific genes but not when using e8.5 timepoint-unique genes. Moreover, we found that, unlike genes shared between e9.5 and e7.5, genes shared between e9.5 and e8.5 were enriched for processes such as “blood vessel development” and “insulin receptor signaling pathway”. This observation may indicate that different transcripts of the same genes could be expressed at different timepoints for the continuation of certain biological processes.”

      7) Although outside the scope of this manuscript, it might be interesting to look at the effects of knocking down network genes on the networks themselves and in combination with a phenotypic readout such as a migration assay. With numerous knockouts and migration assay readouts, one could possibly find a better method to rank the genes within the networks.

      We agree with this comment. Upon literature search, we realized this approach has been used in previous studies on other biological contexts such as virus entry (A. Wang et al. 2010; A. Wang, Ren, and Li 2011) and cancer cell growth (Paul et al. 2021). Although these studies used different network inference strategies from ours, their in silico gene knockouts proved to be effective for the candidate selection. However, the knockout process (both computationally and experimentally) may not be trivial; therefore, we agree the approach will be useful for future studies.

      CROSS-CONSULTATION COMMENTS

      I mostly agree with the other two reviewers.

      It is not clear to me that additional KD experiments (i.e. ones that might affect fusion, proliferation, apoptosis), as proposed by Reviewer #3, would be that much more informative. There are many differences between mouse and human placentation, and these model systems (HTR8 and BeWo) are not truly representative of either. The additional data mining/computational work would be more useful and enhance data interpretation.

      Reviewer #2 (Significance (Required)):

      The authors use RNA-seq of mouse placenta at e7.5, e8.5, and e9.5 to show that timepoint-specific expression patterns are highly correlated with certain biological processes and point to the existence of certain cell types in the sample. While focused on early post-implantation mouse placental development, the author's methods could be transferrable to other timepoints, species, and organs. Furthermore, with their method they uncover what appears to be several novel, early placental, developmentally important genes and their results might be of interest to those in the field studying placental development.

      Reviewer #3:

      Summary:

      This paper is an analysis of RNA-seq data from the mouse human placenta at embryonic day from 7.5 to 9.5 days. Bioinformatics was used to pinpoint genes networks, and tentatively connect with human cell populations. Wet experiments were performed on the HTR8/SV neo trophoblast cell model.

      The introduction clearly posits the reasons why mouse models were chosen, and presents some examples of genes that are conserved between human and mouse placentas, before presenting the major steps of mouse placental development at the crucial periods analyzed.

      The results are divided into four parts:

      1. Identification of genes that are specific of fetal tissues at the three days studied
      2. A network analysis of the genes using classical bioinformatics tools (String, Genie3) to identify gene modules
      3. A connection with the human placenta at the level of cell-specific expression profile is then analyzed
      4. A in vitro validation on a trophoblast cell model using siRNA to Knockdown genes identified in the in silico part of the paper. Three clustering methods were used to classify the genes according to their profile (at which time point they have the highest level). The function associated are dispatched into three logical physiological events (7.5: proliferation and ectoplacental cone development, 8.5 attachment of the placenta -chorioallantoidian at this stage- , and 9.5: syncytiotrophoblast constitution and labyrinth development, structures essential for growth and exchange).

      Mostly minor comments:

      Quality of the transcriptomics data: 6 replicates per condition (some being pools at E7.5 and 8.5) is a lot, and I congratulate the authors to have make such effort. This says a lot about the technical quality of their results. Nevertheless, there is no comment on the exclusion of two samples in the further analysis based upon the PCA. Could the authors comment upon the reasons why these two samples behave so differently from the others?

      We thank the reviewer for the comment. We reviewed the RNA concentration and quality prior to sequencing, and did not observe that the outliers were of lower quality. After sequencing, quality control metrics (obtained with FastQC), also did not indicate that the two outliers were of poor quality. Based on the PCA, it is also unlikely that two samples were swapped. One possibility is that the tissues obtained for these samples were diseased in some way. However, this is difficult to confirm, so we did not want to speculate about this in the manuscript. We did exclude the two samples to ensure the accuracy of our downstream analyses.

      Rq: at this stage some statistics of the degree of enrichment in keyword should be provided (such as Enrichment Scores, normalized or not, and False Discovery Rates, to be able to evaluate the actual robustness of the genes network identified. In addition, it seems that the authors supervised the 'keywords' and 'ontologies' toward placental function. A more agnostic approach could be very relevant, such as identifying the ontologies associated to for instance the set of genes that are highest at 8.5 days, by comparing them with preliminary datasets accessible via the GSEA platform of the BROAD institute or similar sites such as Webgestalt. This does not mean that the placental-targeted approach is not useful, but to have a more global overview is in my opinion indispensable.

      We agree and this is a good point. We have now added a stringent approach to determine if the placenta-targeted terms are truly relevant to the gene networks. We performed randomization tests using random gene sets sampled from hierarchical groups of the same time point. These tests showed that the selected terms are significant in the networks when compared to gene groups of the same size from the timepoint specific hierarchical groups (Supplementary Figure S3). Moreover, we have added the specific -log10(q-value) of some highlighted enriched terms in the main text, so together with Figure 2A, the degree of enrichment of these terms can be shown in a clearer way.

      We have added this detail in the result section:

      “Compared to e8.5 and e9.5 networks, e7.5 networks had a higher rank or fold change and were significantly enriched for the GO terms “inflammatory response” (e7.5_1_STRING: -log10(q-value) = 22.82 and e7.5_2_GENIE3: -log10(q-value) = 3.95) and “female pregnancy” (e7.5_2_GENIE3: -log10(q-value) = 4.1) (Figure 2A, Supplementary Table S5). The term “morphogenesis of a branching structure”, which can be expected following chorioallantoic attachment around e8.5, was not enriched at e7.5, but was enriched in multiple e8.5 and e9.5 networks (e8.5_1_STRING: -log10(q-value) = 1.73, e8.5_2_GENIE3: -log10(q-value) = 1.72, e9.5_1_STRING: -log10(q-value) = 4.01, e9.5_1_GENIE3: -log10(q-value) = 1.54, e9.5_2_STRING: -log10(q-value) = 14.33, and e9.5_2_GENIE3: -log10(q-value) = 2.2). After chorioallantoic attachment finishes, nutrient transport is being established. Accordingly, we observed the following enrichments: “endothelial cell proliferation” (highest ranked in e9.5_2_STRING: -log10(q-value) = 15.91), “lipid biosynthetic process” (only significant after e7.5, highest ranked in e9.5_3_STRING: -log10(q-value) = 17.63), “cholesterol metabolic process” (only significant after e7.5, highest ranked in e9.5_2_GENIE3: -log10(q-value) = 2.76 and e9.5_3_STRING: -log10(q-value) = 7.79), and “response to insulin” (only significant after e7.5, highest ranked in e9.5_1_GENIE3: -log10(q-value) = 1.67).”

      “Using randomization tests, we observed the majority of these GO terms (10 out of 11 terms) were significantly enriched when using the network genes but not random gene sets (significance level of 0.05; the term “vasculature development” having p-value = 0.0549 and 0.0575 in with subnetwork e9.5_1_GENIE3 and e9.5_3_GENIE3, respectively) (see Materials and Methods, Supplementary Figure S3). This analysis demonstrates that the network genes were highly relevant to the biological functions of interest. Moreover, the observed GO terms strongly aligned with the processes enriched when using the full lists of timepoint-specific genes (Supplementary Table S3), indicating the representative characteristics of the network genes. While the current analysis focuses on the biological processes related to placental development, there are other terms significantly enriched, which can be found in Supplementary Table S5.”

      This is partially done in the part 2 of the results, but it would be relevant to do it on the group of highly expressed genes and not only on the clusters found by the algorithm of sting and genie3.

      We have added GO analysis for timepoint-specific genes and also observed highly relevant processes being enriched (Supplementary Table S3). This additional analysis has also helped strengthen the relevance of the network genes, as the observed terms with network genes aligned well with the terms enriched with the full lists of genes.

      Rq: in the second part of the results, everything is descriptive but no hierarchy is given to facilitate the understanding and to try to generate a few 'take-home messages' for the reader.

      We agree with the comment and have adjusted the writing accordingly. We have added the following statements in section 2 of the result section:

      “In summary, we identified 18 subnetworks across three timepoints for downstream analyses, some of which were enriched, according to GO analysis and randomization tests, for specific terms relating to placental development (Figure 2A).”

      “These results indicate that the gene sets we identified are functionally relevant in the mouse models.”

      “In summary, we have identified hub genes in networks at each timepoint. Analyzing the annotations of hub genes using the MGI database demonstrated that the hub genes are biologically relevant to mouse development and will be strong candidates for future investigation.”

      The network analysis is well presented in Figure 2. I wonder whether the author could add systematically besides the three examples that are given the network analysis for the other enrichment network that are described (the four at e7.5, the 6 at e8.5 and the 8 at e9.5).

      We have added the additional figures in Supplementary Figure S3.

      The deconvolution of the 3rd part of the results to try to connect the mouse results to the human cell situation is interesting. I suspect that given the terms of the mouse placentas used, it would be relevant to focus on 1st trimester human placental cells.

      The reference dataset we used in the PlacentaCellEnrich analysis was from human 1st trimester placenta samples. For the Placenta Ontology analysis, we were limited to the provided database from (Naismith and Cox 2021); however, it will be interesting to revisit the analysis when the database is extended.

      We have specified that the reference data in PlacentaCellEnrich analysis was from human 1st trimester placenta in the methods section:

      “For PlacentaCellEnrich, cell-type specific groups were based on the single-cell transcriptome data of first trimester human maternal-fetal interface from Vento-Tormo et al.”

      As previously mentioned, this is a highly descriptive paragraph, and two or three sentences at the end of each paragraph of the results would be in my opinion indispensable to present the most important observations of the results in an intelligible way. Overall, the data presented by the authors, are not obviously 'raw data', but an effort of interpretation should be done by the authors to underline the importance of their results, and to stress among these results which are the most important, and which are the most relevant for placental development and human health.

      We agree with the comment and have adjusted the writing accordingly. We have added this summary paragraph at the end of section 3 of the result section:

      “In summary, we have demonstrated that the identification of timepoint-specific gene groups and densely connected network modules can be used to infer the cellular composition of bulk RNA-seq samples. We used independent human datasets from different sources to annotate the cell types in each timepoint’s samples. As a result, from the bulk RNA-seq data we were able to observe that at e7.5 and e8.5, there was a high proportion of different TB populations, whereas at e9.5, the placental tissues consisted of multiple cell types such as TB, endothelial and fibroblast cells.”

      In the last part, which is very important in this type of paper, four genes were selected. A choice of highly expressed genes was made (which can in fact be discussed, some transcriptional factors may have a crucial importance with relatively low levels of expression). The efficiency of the siRNA was overall excellent. The authors showed that each of these siRNA is efficient to inhibit cell migration in the HTR8/SVneo model.

      The migration assays are quantified, but there is a inherent limit of the cell model: the authors analyzed only cell migration, but not other very important parameters. One of them is trophoblast fusion, an issue that can be studied in another trophoblast cell model, the BeWo cells, which are induced to fuse under forskolin. It would be highly relevant to test the siRNA identified in this respect, since fusion is a very conspicuous feature of trophoblast cells in mice as well as in humans. Other relevant endpoints such as proliferation markers, apoptosis markers, oxidative stress markers could be studied in the KD cell models. Alternatively, it would have been interesting to evaluate the overall effect of the siRNA by transcriptomics and check whether the modified gene expression leads to specific profiles characteristic of a certain moment of placental development in mice, or proportion of various cells in the human placentas. Without asking for further experiments the authors should mention these limits in their discussion.

      We completely agree with this comment and are investigating each of our candidate genes in more detail in ongoing studies. As we have already learned that each gene is involved in different processes and pathways, we feel that these studies are out of the scope of the current paper. However, we have added this point to our discussion section:

      “However, more detailed analysis and process specific assays are needed. For example, future studies assessing each gene’s role in cell adhesion, cell-cell fusion, cell proliferation and cell apoptosis can be done to better understand their roles in placental development.”

      In sum, I feel that this paper provides an excellent dataset, but that the authors should make an additional effort of redaction to extract the most important conclusions of their paper. This would increase its impact for a wider public.

      Thank you. We have attempted to do so in the revised version.

      Reviewer #3 (Significance (Required)):

      The context is well introduced, but explanatory and synthesis sentences are missing at the end of each paragraph. I am relatively competent in bioinformatics methods, including deconvolution, and rather expert in cell biology. Therefore I feel comfortable to evaluate this paper.

      References:

      Brown, Laura D., and William W. Hay. 2016. “Impact of Placental Insufficiency on Fetal Skeletal Muscle Growth.” Molecular and cellular endocrinology 435: 69. /pmc/articles/PMC5014698/ (August 24, 2022).

      Cox, Brian et al. 2009. “Comparative Systems Biology of Human and Mouse as a Tool to Guide the Modeling of Human Placental Pathology.” Molecular Systems Biology 5: 279. /pmc/articles/PMC2710868/ (July 20, 2022).

      Dickinson, Mary E. et al. 2016. “High-Throughput Discovery of Novel Developmental Phenotypes.” Nature 2016 537:7621 537(7621): 508–14. https://www.nature.com/articles/nature19356 (July 20, 2022).

      Frew, Ian J. et al. 2003. “Generation and Analysis of Siah2 Mutant Mice.” Molecular and Cellular Biology 23(24): 9150. /pmc/articles/PMC309644/ (July 27, 2022).

      Gallardo, Miguel et al. 2015. “HnRNP K Is a Haploinsufficient Tumor Suppressor That Regulates Proliferation and Differentiation Programs in Hematologic Malignancies.” Cancer Cell 28(4): 486–99. http://www.cell.com/article/S1535610815003050/fulltext (August 24, 2022).

      Gormley, Matthew et al. 2021. “RNA Profiling of Laser Microdissected Human Trophoblast Subtypes at Mid-Gestation Reveals a Role for Cannabinoid Signaling in Invasion.” Development (Cambridge, England) 148(20). https://pubmed.ncbi.nlm.nih.gov/34557907/ (August 15, 2022).

      Guillemot, François et al. 1994. “Essential Role of Mash-2 in Extraembryonic Development.” Nature 371(6495): 333–36. https://www.nature.com/articles/371333a0 (December 21, 2021).

      Hemberger, Myriam, Courtney W. Hanna, and Wendy Dean. 2020. “Mechanisms of Early Placental Development in Mouse and Humans.” Nature Reviews Genetics 21(1): 27–43. http://dx.doi.org/10.1038/s41576-019-0169-4.

      Jeon, Hyun Yong et al. 2010. “Expression Patterns of Astrocyte Elevated Gene-1 (AEG-1) during Development of the Mouse Embryo.” Gene expression patterns : GEP 10(7–8): 361. /pmc/articles/PMC3165053/ (July 27, 2022).

      Jepsen, Kristen et al. 2007. “SMRT-Mediated Repression of an H3K27 Demethylase in Progression from Neural Stem Cell to Neuron.” Nature 450(7168): 415–19. https://www.nature.com/articles/nature06270 (July 27, 2022).

      Kuckenberg, Peter, Caroline Kubaczka, and Hubert Schorle. 2012. “The Role of Transcription Factor Tcfap2c/TFAP2C in Trophectoderm Development.” Reproductive BioMedicine Online 25(1): 12–20. http://www.rbmojournal.com/article/S1472648312001010/fulltext (December 21, 2021).

      McLean, Cory Y. et al. 2010. “GREAT Improves Functional Interpretation of Cis-Regulatory Regions.” Nature Biotechnology 28(5): 495–501. http://dx.doi.org/10.1038/nbt.1630.

      Meinhardt, Gudrun et al. 2020. “Pivotal Role of the Transcriptional Co-Activator YAP in Trophoblast Stemness of the Developing Human Placenta.” Proceedings of the National Academy of Sciences of the United States of America 117(24): 13562–70. https://www.ncbi.nlm.nih.gov/geo/ (April 8, 2022).

      Morey, Robert et al. 2021. “Transcriptomic Drivers of Differentiation, Maturation, and Polyploidy in Human Extravillous Trophoblast.” Frontiers in Cell and Developmental Biology 9: 2269.

      Naismith, Kendra, and Brian Cox. 2021. “Human Placental Gene Sets Improve Analysis of Placental Pathologies and Link Trophoblast and Cancer Invasion Genes.” Placenta 112: 9–15.

      Paul, Abhijit et al. 2021. “Exploring Gene Knockout Strategies to Identify Potential Drug Targets Using Genome-Scale Metabolic Models.” Scientific Reports 2021 11:1 11(1): 1–13. https://www.nature.com/articles/s41598-020-80561-1 (July 27, 2022).

      Perez-Garcia, Vicente et al. 2018. “Placentation Defects Are Highly Prevalent in Embryonic Lethal Mouse Mutants.” Nature 555(7697): 463. /pmc/articles/PMC5866719/ (August 11, 2022).

      Qi, Jianfei et al. 2008. “The Ubiquitin Ligase Siah2 Regulates Tumorigenesis and Metastasis by HIF-Dependent and -Independent Pathways.” Proceedings of the National Academy of Sciences of the United States of America 105(43): 16713. /pmc/articles/PMC2575485/ (September 20, 2021).

      Qi, Jianfei, Hyungsoo Kim, Marzia Scortegagna, and Ze’ev A. Ronai. 2013. “Regulators and Effectors of Siah Ubiquitin Ligases.” Cell biochemistry and biophysics 67(1): 15. /pmc/articles/PMC3758783/ (July 27, 2022).

      Rivals, Isabelle, Lé On Personnaz, Lieng Taing, and Marie-Claude Potier. 2007. “Databases and Ontologies Enrichment or Depletion of a GO Category within a Class of Genes: Which Test?” Bioinformatics 23(4): 401–7.

      Sheridan, Megan A. et al. 2021. “Characterization of Primary Models of Human Trophoblast.” Development (Cambridge, England) 148(21). /pmc/articles/PMC8602945/ (August 15, 2022).

      Si, Yaqing, Peng Liu, Pinghua Li, and Thomas P. Brutnell. 2014. “Model-Based Clustering for RNA-Seq Data.” Bioinformatics 30(2): 197–205. https://academic.oup.com/bioinformatics/article/30/2/197/217752 (July 18, 2022).

      Silva, Juneo F., and Rogéria Serakides. 2016. “Intrauterine Trophoblast Migration: A Comparative View of Humans and Rodents.” Cell Adhesion and Migration 10(1–2): 88–110. http://dx.doi.org/10.1080/19336918.2015.1120397.

      Soncin, Francesca et al. 2018. “Comparative Analysis of Mouse and Human Placentae across Gestation Reveals Species-Specific Regulators of Placental Development.” Development (Cambridge) 145(2).

      Soncin, Francesca, David Natale, and Mana M. Parast. 2015. “Signaling Pathways in Mouse and Human Trophoblast Differentiation: A Comparative Review.” Cellular and Molecular Life Sciences 72(7): 1291–1302.

      Spies, Daniel, Peter F. Renz, Tobias A. Beyer, and Constance Ciaudo. 2019. “Comparative Analysis of Differential Gene Expression Tools for RNA Sequencing Time Course Data.” Briefings in Bioinformatics 20(1): 288. /pmc/articles/PMC6357553/ (July 18, 2022).

      Sun, Huifang et al. 2006. “An E2F Binding-Deficient Rb1 Protein Partially Rescues Developmental Defects Associated with Rb1 Nullizygosity.” Molecular and Cellular Biology 26(4): 1527. /pmc/articles/PMC1367194/ (February 6, 2022).

      Sutton, Gavin J. et al. 2022. “Comprehensive Evaluation of Deconvolution Methods for Human Brain Gene Expression.” Nature Communications 2022 13:1 13(1): 1–18. https://www.nature.com/articles/s41467-022-28655-4 (July 27, 2022).

      Varberg, Kaela M. et al. 2021. “ASCL2 Reciprocally Controls Key Trophoblast Lineage Decisions during Hemochorial Placenta Development.” Proceedings of the National Academy of Sciences of the United States of America 118(10). https://www.pnas.org/content/118/10/e2016517118 (December 21, 2021).

      Wang, Anyou, S. Claiborne Johnston, Joyce Chou, and Deborah Dean. 2010. “A Systemic Network for Chlamydia Pneumoniae Entry into Human Cells.” Journal of Bacteriology 192(11): 2809–15. https://journals.asm.org/doi/10.1128/JB.01462-09 (July 27, 2022).

      Wang, Anyou, Li Ren, and Hong Li. 2011. “A Systemic Network Triggered by Human Cytomegalovirus Entry.” Advances in Virology 2011.

      Wang, Niya et al. 2016. “Mathematical Modelling of Transcriptional Heterogeneity Identifies Novel Markers and Subpopulations in Complex Tissues.” Scientific Reports 2016 6:1 6(1): 1–12. https://www.nature.com/articles/srep18909 (July 27, 2022).

      Watson, Erica D., and James C. Cross. 2005. “Development of Structures and Transport Functions in the Mouse Placenta.” Physiology 20(3): 180–93.

      Woods, Laura, Vicente Perez-garcia, and Myriam Hemberger. 2018. “Regulation of Placental Development and Its Impact on Fetal Growth — New Insights From Mouse Models.” Frontiers in Endocrinology 9(September): 1–18.

      Zhong, Yi et al. 2013. “Digital Sorting of Complex Tissues for Cell Type-Specific Gene Expression Profiles.” BMC Bioinformatics 14(1): 1–10. https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-14-89 (July 27, 2022).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This paper is an analysis of RNA-seq data from the mouse human placenta at embryonic day from 7.5 to 9.5 days. Bioinformatics was used to pinpoint genes networks, and tentatively connect with human cell populations. Wet experiments were performed on the HTR8/SV neo trophoblast cell model.

      The introduction clearly posits the reasons why mouse models were chosen, and presents some examples of genes that are conserved between human and mouse placentas, before presenting the major steps of mouse placental development at the crucial periods analyzed.

      The results are divided into four parts:

      1. Identification of genes that are specific of fetal tissues at the three days studied
      2. A network analysis of the genes using classical bioinformatics tools (String, Genie3) to identify gene modules
      3. A connection with the human placenta at the level of cell-specific expression profile is then analyzed
      4. A in vitro validation on a trophoblast cell model using siRNA to Knockdown genes identified in the in silico part of the paper.

      Three clustering methods were used to classify the genes according to their profile (at which time point they have the highest level). The function associated are dispatched into three logical physiological events (7.5: proliferation and ectoplacental cone development, 8.5 attachment of the placenta -chorioallantoidian at this stage- , and 9.5: syncytiotrophoblast constitution and labyrinth development, structures essential for growth and exchange).

      Mostly minor comments:

      Quality of the transcriptomics data: 6 replicates per condition (some being pools at E7.5 and 8.5) is a lot, and I congratulate the authors to have make such effort. This says a lot about the technical quality of their results. Nevertheless, there is no comment on the exclusion of two samples in the further analysis based upon the PCA. Could the authors comment upon the reasons why these two samples behave so differently from the others?

      Rq: at this stage some statistics of the degree of enrichment in keyword should be provided (such as Enrichment Scores, normalized or not, and False Discovery Rates, to be able to evaluate the actual robustness of the genes network identified. In addition, it seems that the authors supervised the 'keywords' and 'ontologies' toward placental function. A more agnostic approach could be very relevant, such as identifying the ontologies associated to for instance the set of genes that are highest at 8.5 days, by comparing them with preliminary datasets accessible via the GSEA platform of the BROAD institute or similar sites such as Webgestalt. This does not mean that the placental-targeted approach is not useful, but to have a more global overview is in my opinion indispensable.

      This is partially done in the part 2 of the results, but it would be relevant to do it on the group of highly expressed genes and not only on the clusters found by the algorithm of sting and genie3. Rq: in the second part of the results, everything is descriptive but no hierarchy is given to facilitate the understanding and to try to generate a few 'take-home messages' for the reader.

      The network analysis is well presented in Figure 2. I wonder whether the author could add systematically besides the three examples that are given the network analysis for the other enrichment network that are described (the four at e7.5, the 6 at e8.5 and the 8 at e9.5).

      The deconvolution of the 3rd part of the results to try to connect the mouse results to the human cell situation is interesting. I suspect that given the terms of the mouse placentas used, it would be relevant to focus on 1st trimester human placental cells.

      As previously mentioned, this is a highly descriptive paragraph, and two or three sentences at the end of each paragraph of the results would be in my opinion indispensable to present the most important observations of the results in an intelligible way. Overall, the data presented by the authors, are not obviously 'raw data', but an effort of interpretation should be done by the authors to underline the importance of their results, and to stress among these results which are the most important, and which are the most relevant for placental development and human health.

      In the last part, which is very important in this type of paper, four genes were selected. A choice of highly expressed genes was made (which can in fact be discussed, some transcriptional factors may have a crucial importance with relatively low levels of expression). The efficiency of the siRNA was overall excellent. The authors showed that each of these siRNA is efficient to inhibit cell migration in the HTR8/SVneo model.

      The migration assays are quantified, but there is a inherent limit of the cell model: the authors analyzed only cell migration, but not other very important parameters. One of them is trophoblast fusion, an issue that can be studied in another trophoblast cell model, the BeWo cells, which are induced to fuse under forskolin. It would be highly relevant to test the siRNA identified in this respect, since fusion is a very conspicuous feature of trophoblast cells in mice as well as in humans. Other relevant endpoints such as proliferation markers, apoptosis markers, oxidative stress markers could be studied in the KD cell models. Alternatively, it would have been interesting to evaluate the overall effect of the siRNA by transcriptomics and check whether the modified gene expression leads to specific profiles characteristic of a certain moment of placental development in mice, or proportion of various cells in the human placentas. Without asking for further experiments the authors should mention these limits in their discussion.

      In sum, I feel that this paper provides an excellent dataset, but that the authors should make an additional effort of redaction to extract the most important conclusions of their paper. This would increase its impact for a wider public.

      Significance

      The context is well introduced, but explanatory and synthesis sentences are missing at the end of each paragraph. I am relatively competent in bioinformatics methods, including deconvolution, and rather expert in cell biology. Therefore I feel comfortable to evaluate this paper.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors used RNA-seq data from mouse fetal placenta at e7.5, e8.5, and e9.5 to create timepoint-specific gene expression interaction networks to find genes that they predicted would regulate placental development. They confirmed four novel candidate genes and showed that in the transfected human trophoblast HTR-8/SVneo cell line, these four candidates reduced cell migration capacity. Additionally, the authors show that bulk RNA-seq data can be used to infer cell-type composition and when used with single-cell RNA-seq, can be a powerful tool to study the biological processes that involve multiple cell-types.

      Overall, the authors are rigorous in their analyses, their conclusions appear sound, and the work could be an asset to the broader placental biology field. However, although the authors present an approach that future studies might find useful to replicate and their work has produced numerous novel transcripts/genes that warrant further investigation, the approach is not entirely novel, and could be expanded/improved (as suggested by the authors in the discussion), particularly with regard to validation of the genes/networks identified. Major and minor comments are listed below.

      Major comments:

      1) The authors used clustering and differential expression analysis to define sets of timepoint-specific genes. However, it was not clear to me the benefits of this approach. Why would using this approach be better than differential expression analysis alone such as in a typical ANOVA?

      2) Related to number 1 above, although the authors are interested in timepoint-specific transcripts, the author's methods would filter out possibly interesting transcripts that turn on and off during development. The authors might want to check to see if there are transcripts that are up in e7.5 and then down in e8.5 but then up again in e9.5. Also, the author's methods seem to include transcripts that are not exclusive to one timepoint (i.e. are up in e7.5 and e8.5 but not e9.5). It might be interesting to differentiate transcripts that are exclusive to one timepoint from those that are in more than one timepoint.

      3) In the network analysis it would be interesting and helpful to the reader to highlight, if any, nodes or terms that were found to be significant (i.e. hubs or genes that have a high centrality metric etc.) in both the STRING and GENIE3 networks or overlap the networks created by the two different algorithms to compare them. This might help readers better rank genes when using these data to decide what genes are most important at each timepoint.

      4) The author's conclusion that network analysis can be used to identify genes more likely associated with specific placental cell types is very likely true, but I think that the conclusion would be more impactful if the authors reported how the method compares to simply taking a list of differentially expressed genes and looking for cell type enrichments using their favorite enrichment software. For example, if a gene is highly connected in a particular network that has been identified as SCT-specific, but that gene isn't considered an SCT "marker" by the placental biology research community, it would be interesting to highlight that it is prevalent in a previously published scRNA-seq dataset or a dataset that has isolated that particular cell type to show the advantages of using networks to find placental cell type specific genes.

      5) While the selection of genes for validation was limited by the model system available for testing, the authors should recognize that the genes/networks identified here should first and foremost be validated in a mouse model (by knockdown/overexpression studies using mouse trophoblast stem cells or by evaluation of placenta/embryo in a KO/transgenic mouse model). Whether or not the data are relevant to human placentation is (at least initially) irrelevant. While we recognize that these are difficult studies requiring significant time and resources, as is, the data and results will have significantly less impact than if even a limited amount of such validation could be performed.

      Minor comments:

      1) In the GO analysis, why not use a combination of hypergeometric and binomial distribution for enrichment decisions?

      2) In Figure 2B, are there any genes that are both hub nodes (diamonds) and annotated as having placental functions (squares)? If so, it might be good to show that in some way.

      3) It might improve the deconvolution analysis to employ more than one method and recent reports have shown that the cell-type signature data is the most important parameter with the main factors influencing performance being biological (such as where the sample was taken) rather than technical (https://doi.org/10.1038/s41467-022-28655-4).

      4) The above report also shows that there are ways to correct for cell-type composition differences in DEA which might be interesting to look when using bulk data from different timepoints in future studies when focusing on different biological processes and not timepoint-specific transcripts.

      5) Could the authors speculate as to possible reason(s) that an siRNA knockdown would give variable results functionally, while the actual gene expression appears to be consistently and sufficiently downregulated? Did the authors evaluate protein levels following siRNA knockdown?

      6) As mentioned in the discussion, finding genes that have timepoint dependent isoforms would an interesting and novel addition to the manuscript.

      7) Although outside the scope of this manuscript, it might be interesting to look at the effects of knocking down network genes on the networks themselves and in combination with a phenotypic readout such as a migration assay. With numerous knockouts and migration assay readouts, one could possibly find a better method to rank the genes within the networks.

      CROSS-CONSULTATION COMMENTS

      I mostly agree with the other two reviewers. It is not clear to me that additional KD experiments (i.e. ones that might affect fusion, proliferation, apoptosis), as proposed by Reviewer #3, would be that much more informative. There are many differences between mouse and human placentation, and these model systems (HTR8 and BeWo) are not truly representative of either. The additional data mining/computational work would be more useful and enhance data interpretation.

      Significance

      The authors use RNA-seq of mouse placenta at e7.5, e8.5, and e9.5 to show that timepoint-specific expression patterns are highly correlated with certain biological processes and point to the existence of certain cell types in the sample. While focused on early post-implantation mouse placental development, the author's methods could be transferrable to other timepoints, species, and organs. Furthermore, with their method they uncover what appears to be several novel, early placental, developmentally important genes and their results might be of interest to those in the field studying placental development.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Review of "Identifying novel regulators of placental development using time series transcriptomic data and network analyses."

      The authors present a detailed bioinformatic assessment of mouse developmental time series of the placenta. They apply current data mining and analysis methods to identify protein-centred networks that are likely enriched to specific cell types of the placenta. They then translate these findings to humans using statistical comparisons of human single-cell sequencing data of the placenta. Lastly, they use knock-down experiments to validate the conserved functional importance of the hub genes in the mouse protein networks in human cells. The strengths of this paper are the rigorous data mining methods and the functional translation to humans from mice. There are no critical weaknesses to the article. There is a blend of statistical analysis with anecdotal or hand curation from databases and the literature, but it is unclear if these curated finings are circumstantial or statistically meaningful. In the end, the hypothesis seems to hold in that 4/4 gene knocked down in the human cells gave a migration phenotype.

      Comments, questions, critique

      1. Given the translational aims of the paper, more introduction/discussion material on the comparative aspects of mice and humans are needed. Are giant cells and EVT the same? What are the cell equivalents that you are discovering? The Soncin et al. paper is cited, but I think underused. This publication contains time series data on mice and humans and could be used as external validation of clusters, networks, and other analyses. Other publications to consider for context are

      a) Cox B, et al. Mol Syst Biol 5: 279.

      b) Silva JF, Serakides R. 2016. Cell Adhes Migr 10: 88-110. (specifically discusses migration difference between the species placenta)

      1. Clustering represented in Figure 1B, was this a supervised model? Why only three clusters?) Did you specify that there would be three models and force each gene profile into one of the categories? How robust are the fits? A fitted model might be a better approach as you can specify the ideal models (early high, late high and mid-high), then determine each gene profile that fits each model and only assess those genes with a significant fit to the model. Forcing clustering to the three-model fit likely gives many poorly fitting profiles. While in the end, this works out, it may be due to applying other post hoc methods for gene enrichment, where noise distributes randomly.

      2. Several statements are made about the conservation of importance between mouse and human hub genes. For example, "We predict these highly expressed genes to be generally important for TB function and processes such as cell migration, a term associated with multiple timepoint specific networks (Figure 2A)." While your knock-down assay of migration results shows these hub genes to be necessary to humans, what do they mean to the mouse? You did not use mouse TSC to assess functional importance concurrently. You note a small number of genes as of known importance, "127 hub genes of which 16 have been annotated as having a role in placental development". Were the others knocked out but lack a developmental phenotype or not assessed? Are these functionally redundant in the mouse or not involved in the same processes between the species?

      3. In determining conservation between mouse and human networks, were only 1:1 orthologs examined or did you consider more complex 1:many mapping conditions between the two species?

      4. Should the migration assay be normalized to survival/adhesion? If 70,000 cells were seeded but had 50% cell death (or reduced adhesion), then it may appear to be poor migration. Should the migration be evaluated as a ratio of top to bottom cell densities to control for poor adhesion or survival?

      Significance

      This significantly advances previous publications on this topic by functionally testing the discovered genes.

      This highlights an excellent data mining strategy for a developmental disease using mice and translating to humans.

      The audience is likely developmental biologists and reproductive specialists.

      My expertise is bioinformatics and developmental biology.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      In this study we reveal that, in both mice and humans, the metabolic benefits of caloric restriction (CR) are sex- and age-dependent. Through a systematic review of the literature, we show that sex differences have been largely overlooked by previous CR research, a finding that Reviewer 1 highlights as “an important point”. Our results have critical implications for understanding the fundamental biology linking diet and health outcomes, as well as translational strategies to leverage the therapeutic benefits of CR in humans.

      We thank the reviewers for their helpful appraisal of our manuscript, which Reviewer 2 highlights as “a very well written paper”. Reviewer 1 emphasised the translational relevance of our findings and commented on the “systematic” nature of our study. They noted that it “was well performed”, ”is a valuable and important contribution to the field”, and “will elicit great interest in the scientific and public readership.” Indeed, the importance of sex as a biological variable is the focus of a September 2022 news feature in Nature (https://www.nature.com/articles/d41586-022-02919-x), underscoring the timeliness and relevance of our findings. Our response to the reviewers comments is outlined below, including the changes we have incorporated in a revised version of our manuscript.

      Reviewer 1 – Major Comments:

      __A) The clinical part is definitely the weak spot in the study. I don't think that the data should be omitted, but the authors should be very careful in interpreting the data. Obvious limitations apply to this part, which need to be more directly addressed in the abstract and discussion. It feels like the data from the small-scale clinical trial is exaggerated. __The clinical study was conducted by Prof Alex Johnstone’s group at the Rowett Institute of Nutrition and Health, University of Aberdeen. Her group are experts in the study of dietary interventions for weight loss. The study was conducted to a high standard and therefore we have the utmost confidence in the conclusions drawn from our analysis of this data.

      As we discuss in response to the reviewer’s other points below, the clinical study was primarily designed to address other outcomes and we analysed the data retrospectively to investigate if sex and age affect CR-induced weight and fat loss. This explains some of the limitations that the reviewer mentions, e.g. the relatively low numbers of younger males, and the focus on overweight and obese subjects. As requested, we have now addressed these limitations as follows:

      1. Updated the abstract to clarify that the data are from overweight and obese subjects.
      2. Updated the results to emphasise that we did a retrospective analysis of CR in overweight and obese subjects (lines 396-398).
      3. Performed an additional ANCOVA analysis to test if baseline adiposity or BMI contribute to the sex differences in body mass, fat mass or fat-free mass (new Supplementary Figure 11); see Reviewer 1 Major Point D below.
      4. Updated the ‘Limitations’ section of the Discussion to highlight the retrospective nature of the human study (lines 746-748).
      5. Updated the Methods to again clarify the retrospective nature of the analysis (lines 884-885). __B) It is important to mention in the abstract and the discussion that the human data came from obese participants. This might well influence the findings from human data. __The human subjects were overweight or obese; this was previously stated in the methods section (line 885) and in the discussion (lines 509-511). To further clarify this, we now also mention it in the Abstract (lines 52-53) and have reiterated it in the Discussion (line 744). Importantly, the fact that humans still show age-dependent sex differences in fat loss, even when overweight and obese, supports our conclusion that this age effect in mice is not simply a consequence of aged mice being fatter than younger mice. We refer to this as the ‘baseline adiposity’ hypothesis (lines 500-518 of the Discussion). In response to point D below, we have also analysed if the loss of fat mass or fat-free mass is influenced by adiposity or BMI at baseline (pre-CR). Our analyses show that neither of these parameters explain the sex differences in loss of fat mass or fat-free mass (see new Supplementary Figure 11).

      __C) It is very important to calculate the % calorie restriction of the human participants achieved throughout the CR study. This is crucial information to compare it to other studies. __We have updated the Methods (lines 906-909) to explain the basis for the weight loss diet, as follows: “Participants had their basal energy requirements determined and each participant was then fed an individualised diet with a caloric content equivalent to 100% of their resting metabolic rate (Table 3). This approach was taken to standardise the diet to account for individual energy requirements and energy restriction.” We have also updated Table 3 to show the caloric intake for males and females. Note that RMR accounts for ~60-70% of total daily energy expenditure (TDEE) in adults (Martin et al., 2022), so the diet in our study would give a daily caloric deficit of around 30-40% from baseline TDEE.

      __D) Since there is quite a wide range in the BMIs of the participants, can the authors also stratify against BMI? __We have done this against both baseline BMI and against baseline fat mass (the latter to further test the ‘baseline adiposity’ hypothesis). We present this data in an updated Supplementary Figure 11. We find that, in males but not in females, baseline BMI or fat mass are significantly associated with the changes in fat mass or fat-free mass: surprisingly, individuals with higher baseline fat mass or BMI show less fat loss and a greater loss of fat-free mass during CR. Importantly, males and females do not significantly differ in the relationships between baseline fat mass (or BMI) and loss of fat mass or fat-free mass. This further supports our conclusion that the sex differences in fat loss are unrelated to differences in baseline adiposity. We report this in lines 409-411 of the Results and lines 513-515 of the Discussion.

      __E) There is no mention of any pre-study registration online of the clinical part (e.g. _gov_). Was this done? __This study was done before pre-registration was a requirement for clinical trials. We retrospectively analysed the study data to investigate if sex and/or age influence the outcomes. In the updated manuscript we now state this on lines 884-885 of the Methods, as well as in the Results (line 396) and Discussion (lines 746-748).

      __F) In the methods section the authors write "Participants were informed that the study was funded by an external commercial sponsor...". This is important information, and this is not mentioned anywhere else in the paper. Can the authors clarify this point? A commercial sponsor would, in my view, qualify for a conflict of interest that needs to be mentioned. __We have updated the Declaration of Interests section to clarify this as follows: “The human weight loss study was funded by a food retailer; however, the company had no role in the data analysis, interpretation or conclusions presented in this paper.”

      __G) How did the authors determine the group sizes for the clinical part? I have some doubts about the sub-group sizes. It would be valuable information if the authors had a statistical analysis plan prior conducting the study. It appears a bit, like the sub-groups were chosen at random, to match findings of the mouse data. Otherwise, there should have been a better allocation within the sub-groups (especially age). __We agree that larger group sizes would have been preferable. This limitation reflects that the study was not originally designed to test age and sex effects on CR outcomes, but instead was analysed retrospectively to investigate the impact of these variables. As mentioned above, we have updated the text of the manuscript to highlight the retrospective nature of the analyses. In the Discussion, under ‘Limitations’, we also highlight the fact that relatively few younger subjects are included in the human study (lines 744-745).

      __H) *There's a big problem with the age stratification of the male participants in the clinical data. If I'm correct there are only 5 males 45 groupings.

      __I) The applied protocol for CR in mice is known to provoke long fasting phases and probably elicits some effects through fasting alone, rather than the caloric deficit. There are some papers out addressing this (e.g. by deCabo, Lamming). The authors should not dismiss this fact and at least address it in their discussion. Also, given this fact, it would be thoughtful to include a database-search - not only regarding CR - but also regarding various types of intermittent fasting protocols in humans and animal studies (similar to what the authors did in the supplemental figure). __We agree on the importance of highlighting recent studies demonstrating that prolonged daily fasting contributes to the outcomes of typical ‘single-ration’ CR protocols. We have added a new paragraph to the Discussion to address this (Lines 710-719).

      Regarding the second point, we feel that including a new literature search that addresses not only CR, but also intermittent fasting, is beyond the scope of the current manuscript. However, this is a very good idea and would be worth addressing in a future standalone review article. We have also updated our source data to include all data from our literature reviews, to help if other researchers wish to analyse according to fasting duration or other variables.

      __J) Did the authors monitor the eating time of the mice? __We have since done this in new cohorts of mice fed using the same CR protocol. We find that the mice consume their food within 2-3 hours, consistent with other CR studies. We have now mentioned this in the Methods section (lines 867-868).

      __K) While CR certainly has a lot of health benefits in rodents and humans, it should be advised to raise the cautious note that it may not be beneficial for everyone in the general population. For some groups of people and in some cases (e.g. infectious diseases, pregnancy) even CR with adequate nutritional intake of micro/macronutrients might be disadvantageous. This should be mentioned clearly, as the topic gets more and more "hyped" in public media and online. __We now highlight this important point in the opening paragraph of the introduction (lines 65-67).

      __L) There is no indication of how the authors dealt with missing data. Statistically this can be very important, especially in cases with a low number of data points. __In the Methods section we previously explained (lines 846-848) that “Mice were excluded from the final analysis only if there were confounding technical issues or pathologies discovered at necropsy.” No data had to be excluded from our human study and we have now stated this in the Methods (lines 897-898). For analyses involving paired or repeated-measures data (e.g. time courses of body mass or blood glucose), if data points were missing or had to be excluded for some mice then we used mixed models for the statistical analysis. We have now updated this information in the ‘Statistical analysis’ section of the Methods (lines 1047-1048). Because of the large numbers of mice used in our studies, analyses remain sufficiently well powered even if some data points were missing or had to be excluded.

      __M) Key data from qPCR should be followed up by western blots or other means. If this was done and there was no effect, the authors should report this. Also, is there any evidence or the possibility to support these findings regarding pck1 and ppara in human samples? __As requested, we will next use Western Blotting to assess the expression of proteins encoded by the transcripts that show sex and/or diet differences within the liver (Fig. 6A). These data will be reported in our fully revised manuscript.

      Regarding effects of CR on PCK1 and PPARA expression in human liver samples, no human CR studies have taken liver biopsies for downstream molecular analysis. Recent studies of the GTEx database confirm that hepatic gene expression in humans is highly sexually dimorphic (Oliva et al., 2020). We checked PCK1 and PPARA in the GTEx database and found that, in the liver, each of these transcripts is expressed more highly in females than in males (https://www.gtexportal.org/home/gene/PCK1 & https://www.gtexportal.org/home/gene/PPARA). While this is the opposite to what we observe in our ad libitum mice (Fig. 6A), it demonstrates that sex differences in these genes’ hepatic expression do occur in humans. The effect of CR on their hepatic expression, and whether this differs between males and females, remains to be addressed.

      N): I think it would be very valuable to analyse the sex-differences in lipolysis directly in fat tissues. The authors concentrated on differences in hepatic mRNA profiles, but there's an obvious possibility and gap in their story. ____We agree that this would be informative. In the Discussion we cite previous research identifying sex differences in adipose lipolysis and lipogenesis and explain how this fits with our findings (lines 567-574). Since submitting our manuscript, we have begun experiments to investigate sex differences in the effects of CR on lipid metabolism and molecular pathways in adipose tissue. However, these analyses are extensive and ongoing, so we feel strongly that attempting to include them in our present paper would not only substantially delay publication, but also overload what is already a very extensive paper. Therefore, we plan to report our findings in future publications.

      __O) Given the relatively low n and sometimes small effect sizes I fear that some of their findings won't be reproduced by other labs. Were the (mouse) data collected all at once in one cohort or did the authors pool data from different cohorts/repeats? __We presume the reviewer means ‘relatively high n’, as most of our mouse analyses used large group sizes. The mouse data were pooled from across multiple cohorts, with ANOVA confirming that the same sex-dependent CR effects were observed within each cohort. This reproducibility across multiple cohorts is a clear strength of our study because it demonstrates the robustness of our findings. Importantly, the sex differences in fat loss, weight loss and glucose homeostasis were still observed in our much-smaller cohort of evening-fed mice (Fig. S5-S6) (n = 5-6), demonstrating that large sample sizes are not needed for other researchers to detect these effects.

      Reviewer 1 – Minor comments:

      __a) The discussion is very extensive, and I suggest compressing the information presented there to make it more easily readable. __We have removed some text that was more speculative, such as the paragraph discussing a possible role for ERalpha. We have also revised wording elsewhere to state things more succinctly. However, given the scope of our study we feel we cannot substantially cut down the Discussion without compromising the interpretation of our findings. We note the Reviewer two’s comment that “This is a very well written paper” and feel that attempting to compress the extensive information in the Discussion would compromise, rather than help, the readability.

      __b) There is some confusion present in the literature regarding the nomenclature of CR/fasting interventions. Recently some reviews have summarized the different forms (e.g. Longo Nature Aging, Hofer Embo Mol Med, ...) and the authors should address this briefly. Especially the applied CR intervention in ____mice overlaps with intermittent fasting. __We have updated the Discussion (lines 710-719) to explain how our single-ration CR protocol also incurs a prolonged intermittent fast, and how this fast per se may contribute to metabolic effects.

      c): The order of the subpanels in Figure 9 (and other figures where B is below A and so on) is confusing. Please rearrange or indicate in a visual way which panels belong to each other.

      We disagree that the order of subpanels is confusing: the panels are clearly labelled, and we find it most logical to have the absolute values shown in the top row (panels A, C and E), with the corresponding graphs of fold changes shown beneath each of these (panels B, D and F). This allows the reader to quickly compare the absolute vs fold-change data for each readout. If we had panels A-C on the top row and D-F on the second row, then the connection between graphs 9C and 9D would be less clear and comparable.

      d): Did the authors also measure cardiovascular (e.g. blood pressure) parameters? There is some evidence out there that there is an age/sex dependency during fasting/CR. This would be a nice add-on to the rather small clinical data here.

      We did measure various cardiovascular parameters for our mice but find, unlike for the metabolic outcomes, these generally don’t show sex or age differences. In our human study we measured blood pressure and heart rate before starting CR and at weeks 3 and 4 post-CR. For this response to reviewers we have summarized these human data in Figure R1. The data show that CR decreases blood pressure and heart rate in males and females (Figs. R1A-E). In the younger age group (We have decided to not include these data in the current study because we feel it is already extensive and is focused on metabolic outcomes. We instead plan to report the cardiovascular outcomes (from both humans and mice) in a separate paper.

      __e) What was the decision basis for stratifying the human data into 45 years? __We used 45 years as the cutoff point because this is the age when, in women, oestrogen levels begin to decline (this point was stated in lines 491-492 of the Discussion, and we now reiterate it in lines 414-415 of the Results).

      __f) The part on aging starting in Figure 7 comes quite surprising and it is not clearly linked to the data before. A suggestion here would be to smooth the transition in the text and the authors could again perform a literature search regarding age-of-onset for CR/fasting interventions in mice and humans. __We have added a sentence to smooth the transition to these studies (lines 363-364). We had previously done a literature search to identify the age of onset of CR interventions in mice and humans. We summarise the findings of this search in lines 452-470 and 484-495 of the Discussion. We have also updated the source data so that it includes the our review of the CR literature, allowing other researchers to interrogate this data.

      g) At the first mention of HOMA and Matsuda indices, the effect direction should be put into physiological context.

      We now mention this in lines 231-232 of the Results.

      h) There is no mention of how the PCA analyses were conducted.

      We have updated the Methods to explain that the PCA analyses were done using R. We have updated the source data to include the outputs from these analyses, as well as the underlying code. These data and code are now available here https://doi.org/10.7488/ds/3758.

      i) Were the mice aged in-house in the authors' facility or bought pre-aged from a vendor? Is it known how they were raised? If bought pre-aged, were female and male animals comparable?

      We bred and aged all mice in house. Males and females were littermates from across several cohorts. Therefore, there are no concerns about lack of comparability resulting from environmental differences.

      j) Very minor note: I think that "focussed" has become very rarely used, even in British English. I don't know about the journal's language standards, but I would switch to the much more common "focused".

      We have updated to ‘focused’ as requested.

      k) Figure 6B/F (PCAs) should indicate the % difference of each dimension.

      We have updated the figures to show the % variance accounted for by each principal component. We have also updated the figure legend to specify this.

      l) Limitations section: Maybe tone down on "world-leading mass spec facility". This sounds like an excuse and this statement is unsupported and doesn't add anything valuable to the section. Other limitations would include the low n, as mentioned above and the mono-centric fashion of the mouse and human experiments.

      We have addressed these points as follows:

      • Toned down the description of our mass spec facility (they are renowned for expertise in steroid hormone analysis, so we our original text was intended to highlight that our facility are not novices for this).
      • Regarding the low n for some of the human groups, we now highlight this on lines 744-745 of the Discussion.
      • We have added a new paragraph to the Discussion (lines 710-719) explaining the limitations of our CR protocol, i.e. that includes elements of both CR and intermittent fasting. Reviewer 2:

      __Point 1: This is a very well written paper. __We thank the reviewer for this kind comment.

      __Point 2: Since the authors fed the animals in the morning, this is likely the reason for energy expenditure to be different in the CR vs ad lib groups. Although the authors do study the effects of night v day feeding and saw no change in the outcomes regarding weight, this fact I think should be mentioned somewhere. Also, figure 4A is expressed a W while all the other graphs are in kJ. I think it would be nice to see it all consistent. __Regarding the first point, we agree that time of feeding can influence when energy expenditure is altered, but most studies show that CR decreases overall energy expenditure regardless of time of feeding. For example, Dionne et al studied the effects of CR on energy expenditure, administering the CR diet during the night phase (Dionne et al., 2016). They found that CR mice have lower energy expenditure in the day but not in the night (Figure 3C in their paper), which is the opposite to our findings (Figure 4C). However, total energy expenditure in their study remains decreased with CR. This goes against the reviewer’s suggestion that feeding the animals in the morning “is likely the reason for energy expenditure to be different in the CR vs ad lib groups”. We have updated our manuscript (Lines 576-581) to clarify this.

      Regarding the second point, we have updated Figure 4A to express the data in kJ (showing the average kJ, per hour, at each time point). The figure legend has been updated to reflect this.

      __Point 3: For all the graphs, can you make the CR groups bold and not filled as it is hard to see the lighter colours. __We have updated the graphs so that the CR groups are represented by solid lines, rather than dashed lines.

      __Point 4: I know many investigators use them, but I am not sure how relevant HOMA-IR and the Matsuda index are in mice since they were specifically designed for humans. __The issue of whether it is ‘correct’ to use HOMA-IR and/or Matsuda index in mice is often debated in the metabolism field. Importantly, we are not using the absolute values for HOMA-IR or Matsuda in the same way that they are used in humans; instead, we are comparing the relative values between groups because these are still physiologically meaningful. We discussed this with Dr Sam Virtue, an expert in mouse metabolic phenotyping (Virtue and Vidal-Puig, 2021), who agrees on their usefulness in this way.

      __Point 5: Something also to note is the fact that all the glucose uptake data is under basal conditions. Just because there are no differences in the basal state does not mean that there are no differences after a meal/during an insulin stimulation. I think that this needs to be discussed and the muscle and fat not completely discounted as a player in the differences seen. __We agree that CR can enhance insulin-stimulated glucose uptake but our OGTT data suggest that it is effects on fasting glucose, rather than insulin-stimulated glucose uptake, that contribute to the sex differences we observe. We have now updated the Discussion (lines 608-613) as follows, “CR enhances insulin-stimulated glucose uptake (82) and it is possible that this effect differs between the sexes. However, our second relevant finding is that, during an OGTT, CR decreases the tAUC but not the iAUC, highlighting decreases in fasting glucose, rather than insulin-stimulated glucose disposal, as the main driver of the improvements in glucose tolerance.”

      References cited in Response to Reviewers:

      Dionne, D.A., Skovso, S., Templeman, N.M., Clee, S.M., and Johnson, J.D. (2016). Caloric Restriction Paradoxically Increases Adiposity in Mice With Genetically Reduced Insulin. Endocrinology 157, 2724-2734. 10.1210/en.2016-1102.

      Martin, A., Fox, D., Murphy, C.A., Hofmann, H., and Koehler, K. (2022). Tissue losses and metabolic adaptations both contribute to the reduction in resting metabolic rate following weight loss. Int. J. Obes. 46, 1168-1175. 10.1038/s41366-022-01090-7.

      Oliva, M., Muñoz-Aguirre, M., Kim-Hellmuth, S., Wucher, V., Gewirtz, A.D.H., Cotter, D.J., Parsana, P., Kasela, S., Balliu, B., Viñuela, A., et al. (2020). The impact of sex on gene expression across human tissues. Science 369, eaba3066. 10.1126/science.aba3066.

      Virtue, S., and Vidal-Puig, A. (2021). GTTs and ITTs in mice: simple tests, complex answers. Nat Metab 3, 883-886. 10.1038/s42255-021-00414-7.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      This study was investigating sex and age in calorie restriction. In part 1 the authors reviewed the literature to see the percentage of papers that still only use males as their primary animal model, of which it is still the predominant sex being investigated although there are countries that have stipulated the importance of both sexes being involved and responses compared. In human studies, both sexes are often used, but not analysed separately to give sex differences.

      In the second section, which was experimental, the authors report that sex does play an important role in the effects of 30% calorie restriction in mice with differences in both metabolic outcomes as well as in adipose tissue - with males responding and females not. Interestingly, this sex effect in mice was not found if the calorie restriction was started later in life, with both sexes responding, suggesting female sex hormones play and important role early in life in the resistance to calorie restriction. This finding was replicated in human studies, with females seemingly resistant to the weight loss effects or CR, especially in the younger age group.

      Comments

      1. This is a very well written paper
      2. Since the authors fed the animals in the morning, this is likely the reason for energy expenditure to be different in the CR vs ad lib groups. Although the authors do study the effects of night v day feeding and saw no change in the outcomes regarding weight, this fact I think should be mentioned somewhere. Also, figure 4A is expressed a W while all the other graphs are in kJ. I think it would be nice to see it all consistent.
      3. For all the graphs, can you make the CR groups bold and not filled as it is hard to see the lighter colours.
      4. I know many investigators use them, but I am not sure how relevant HOMA-IR and the matsuda index are in mice since they were specifically designed for humans.
      5. Something also to note is the fact that all the glucose uptake data is under basal conditions. Just because there are no differences in the basal state does not mean that there are no differences after a meal/during an insulin stimulation. I think that this needs to be discussed and the muscle and fat not completely discounted as a player in the differences seen.

      Significance

      Due to the lack of studies directly investigating sex and calorie restriction, I believe this is a very important manuscript.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In „The effects of caloric restriction on adipose tissue and metabolic health are sex- and age-dependent" the authors systematically studied the effects of sex and age on the response to sustained caloric restriction in mice and humans. The study was well performed and is a valuable addition to the literature. They found overlapping differences between the species when CR-data from mouse experiments and a small-scale clinical trial were stratified for sex and age. It appears that differences in the response to CR in young age in females compared to males depends on the hormonal status and is equalized in aged mice and humans. Since sex has played an underrated role so far in most studies on CR (especially in rodents), as the authors have laid out in their literature search, they make an important point.

      Major comments:

      • The clinical part is definitely the weak spot in the study. I don't think that the data should be omitted, but the authors should be very careful in interpreting the data. Obvious limitations apply to this part, which need to be more directly addressed in the abstract and discussion. It feels like the data from the small-scale clinical trial is exaggerated.
      • It is important to mention in the abstract and the discussion that the human data came from obese participants. This might well influence the findings from human data.
      • It is very important to calculate the % calorie restriction of the human participants achieved throughout the CR study. This is crucial information to compare it to other studies.
      • Since there is quite a wide range in the BMIs of the participants, can the authors also stratify against BMI?
      • There is no mention of any pre-study registration online of the clinical part (e.g. clinicaltrials.gov). Was this done?
      • In the methods section the authors write "Participants were informed that the study was funded by an external commercial sponsor...". This is important information, and this is not mentioned anywhere else in the paper. Can the authors clarify this point? A commercial sponsor would, in my view, qualify for a conflict of interest that needs to be mentioned.
      • How did the authors determine the group sizes for the clinical part? I have some doubts about the sub-group sizes. It would be valuable information if the authors had a statistical analysis plan prior conducting the study. It appears a bit, like the sub-groups were chosen at random, to match findings of the mouse data. Otherwise, there should have been a better allocation within the sub-groups (especially age).
      • There's a big problem with the age stratification of the male participants in the clinical data. If I'm correct there are only 5 males <45 years. Although this looks intriguing, this can easily be a sampling problem.
      • The applied protocol for CR in mice is known to provoke long fasting phases and probably elicits some effects through fasting alone, rather than the caloric deficit. There are some papers out addressing this (e.g. by deCabo, Lamming). The authors should not dismiss this fact and at least address it in their discussion. Also, given this fact, it would be thoughtful to include a database-search - not only regarding CR - but also regarding various types of intermittent fasting protocols in humans and animal studies (similar to what the authors did in the supplemental figure).
      • Did the authors monitor the eating time of the mice?
      • While CR certainly has a lot of health benefits in rodents and humans, it should be advised to raise the cautious note that it may not be beneficial for everyone in the general population. For some groups of people and in some cases (e.g. infectious diseases, pregnancy) even CR with adequate nutritional intake of micro/macronutrients might be disadvanteguous. This should be mentioned clearly, as the topic gets more and more "hyped" in public media and online.
      • There is no indication of how the authors dealt with missing data. Statistically this can be very important, especially in cases with a low number of data points.
      • Key data from qPCR should be followed up by western blots or other means. If this was done and there was no effect, the authors should report this. Also, is there any evidence or the possibility to support these findings regarding pck1 and ppara in human samples?
      • I think it would be very valuable to analyse the sex-differences in lipolysis directly in fat tissues. The authors concentrated on differences in hepatic mRNA profiles, but there's an obvious possibility and gap in their story.
      • Given the relatively low n and sometimes small effect sizes I fear that some of their findings won't be reproduced by other labs. Were the (mouse) data collected all at once in one cohort or did the authors pool data from different cohorts/repeats?

      Minor comments:

      • The discussion is very extensive, and I suggest compressing the information presented there to make it more easily readable.
      • There is some confusion present in the literature regarding the nomenclature of CR/fasting interventions. Recently some reviews have summarized the different forms (e.g. Longo Nature Aging, Hofer Embo Mol Med, ...) and the authors should address this briefly. Especially the applied CR intervention in mice overlaps with intermittent fasting.
      • The order of the subpanels in Figure 9 (and other figures where B is below A and so on) is confusing. Please rearrange or indicate in a visual way which panels belong to each other.
      • Did the authors also measure cardiovascular (e.g. blood pressure) parameters? There is some evidence out there that there is an age/sex dependency during fasting/CR. This would be a nice add-on to the rather small clinical data here.
      • What was the decision basis for stratifying the human data into < and >45 years?
      • The part on aging starting in Figure 7 comes quite surprising and it is not clearly linked to the data before. A suggestion here would be to smooth the transition in the text and the authors could again perform a literature search regarding age-of-onset for CR/fasting interventions in mice and humans.
      • At the first mention of HOMA and Matsuda indices, the effect direction should be put into physiological context.
      • There is no mention of how the PCA analyses were conducted.
      • Were the mice aged in-house in the authors' facility or bought pre-aged from a vendor? Is it known how they were raised? If bought pre-aged, were female and male animals comparable?
      • Very minor note: I think that "focussed" has become very rarely used, even in British English. I don't know about the journal's language standards, but I would switch to the much more common "focused".
      • Figure 6B/F (PCAs) should indicate the % difference of each dimension.
      • Limitations section: Maybe tone down on "world-leading mass spec facility". This sounds like an excuse and this statement is unsupported and doesn't add anything valuable to the section. Other limitations would include the low n, as mentioned above and the mono-centric fashion of the mouse and human experiments.

      Significance

      The authors have analysed in great depth the importance of age and sex on the outcomes of CR on fat tissue, body weight and glucose homeostasis. It is a valuable and important contribution to the field. However, some points need to be clarified and some additional analyses/experiments should be performed. The piece will elicit great interest in the scientific and public readership. The authors have collected important evidence that the majority of CR studies in the literature have a drastic sex-bias in both mice and humans. This may be one reason why translational and mechanistic findings from mice to human application have been haltered in the field of nutritional healthy-aging-promoting interventions.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The Drosophila oocyte is a classical model to study establishment of cell polarity, and it is known for decades how Bicoid and Oskar define the anterior-posterior axis of the embryo. However, Bicoid and Oskar are not conserved so that these findings cannot be generalized. The situation is different for the Par proteins, which have been identified in C. elegans. They are not only conserved but also their mode of of action seems to be preserved. 20 years ago it was a very surprising finding that the Par proteins contribute to establishment of polarity in the Drosophila oocyte. A fascinating and simple mutual inhibition model emerged over the years, in which the same molecular mechanisms establish cell polarity in the C. elegans one-cell embryo and in the Drosophila oocyte: Anteriorly localised Par-3/Bazooka recruits aPKC kinase, which excludes Par-1 by phosphorylation, whereas posteriorly localised Par-1 kinase excludes Par-3/Bazooka by phosphorylation. The manuscript by Milas et al. challenges this model by closely analysing Par localisation in living Drosophila oocytes. The authors provide strong evidence that the kinetics of Par-1 and Bazooka localisation are not consistent with the model.

      Milas et al. first describe a morphological difference between the anterior-lateral and posterior cortex of the oocyte by showing that only the posterior cortex is tightly connected to the overlaying epithelium. This morphological difference correlates with the localisation of Par-1, which is restricted to the posterior, while Bazooka localises only to those regions of the cortex, where there is a gap between the oocyte and the epithelium. This gap expands towards the posterior cortex during stage 10A and encloses it at stage 11. Unexpectedly, Par-1 and Bazooka localisations overlap at the posterior cortex when the gap expands, which contradicts the mutual inhibition model. The authors hypothesised that the close contact of epithelium with the oocyte might influence Par-1/Bazooka localisation. To test this they mechanically detached the epithelium from the oocyte and also ablated groups of epithelial cells. These manipulations resulted in posterior spreading of Bazooka protein within 30-60 minutes. Interestingly, the authors found that in those regions of the posterior cortex, where cells have been ablated, Par-1 and Bazooka colocalise for 30 minutes, which is difficult to reconcile with a model in which Par-1 excludes Bazooka by phosphorylation. The authors also show that Par-1 finally disappeared form the regions where epithelial cells have been ablated. However, aPKC, the kinase that is supposed the exclude Par-1 by phosphorylation, appeared only after Par-1, which argues against the idea that aPKC prevents Par-1 localisation. In summary, the described localisation kinetics are in conflict with the current model, in which direct phosphorylation activities of Par-1 and aPKC orchestrate the mutual exclusive Par domains in the Drosophila oocyte. The data suggest that the mechanisms underlying mutual inhibition are more complex than thought and involve contact with posterior epithelial cells.

      The microscopy used by the authors is state of the art, the data are of high quality and the quantitative analysis is convincing. The results are surprising but conclusive since the experiments were performed and presented in a professional way. This combination makes the manuscript very interesting.

      Major points:

      1. The finding that the posterior cortex is in close contact to the epithelium, while there is a gap between the remaining oocyte cortex and the epithelium is very interesting, and should be quantified and characterised more precisely. When does the gap form and how exactly does it spread posteriorly? Is it possible to distinguish the gap from the attachment zone by using markers for the ECM (e.g. viking-GFP) or adhesion proteins (e.g. Integrin)?
      2. The authors suggest that direct contact between the epithelium and the oocyte is required to exclude Bazooka from the posterior oocyte cortex. The polar cells of the follicular epithelium have almost no contact to the oocyte. One would expect that if only the polar cells are ablated, this would not lead to posterior spreading of Bazooka. Such a control experiment could support the author´s model.

      Minor points:

      1. There are repeatedly double negations which make the text difficult to understand (e.g. "Bazooka exclusion was lost...." (line 104) or "Par-1 does not delocalise from the posterior pole prior to accumulation of Bazooka" (line 163). I see that this follows the logic of the published molecular mechanisms but for the sake of comprehensibility, the authors should try to formulate the results in a positive way (at least in a repeating sentence).
      2. Based on the kinetics of Par-1 localisation the authors the conclude that Par-1 binds to diffusible binding sites at the oocyte cortex, which are modulated by posterior epithelial cells. This is one possible explanation for their results but other interpretations are equally possible. Since the authors provide no further evidence for the existence of Par-1 binding sites their interpretation should be formulated more carefully.
      3. The authors should mention that they use the Par-1 isoform (N1S) which fully rescues the par-1 mutant phenotype (see Doerflinger et. al, Curr Biol, 2006). What is known about the rescuing activity of the Bazooka transgenes that were used in the manuscript?
      4. In principle it is possible that the posterior spreading of Bazooka (after follicle cell detachment or ablation) is caused by premature ooplasmic steaming. However, the movies show that this is not the case. This should be stated in the text.

      Significance

      The Drosophila oocyte is a classical model to address the fundamental biological question of how cell polarity is established. The current model of mutual Par protein inhibition is a critical part of our understanding of cell polarisation, and was proposed to be conserved between flies and worms. In the case of Drosophila this model mainly relies on a combination of genetic and biochemical data. Milas et al. tested this model by using in vivo imaging, and found that the kinetics of Par localisation do not correspond to the existing model. This suggests that central aspects of the proposed mechanisms controlling mutual Par inhibition in the Drosophila oocyte are not conserved or not fully understood. The work makes therefore a surprising and important contribution to the understanding of cell polarity.

      I work for many years on Drosophila oogenesis and my main interest switched from cell polarity to membrane trafficking.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript examines an important and unsolved question concerning the establishment of the polarity axis in the Drosophila oocyte, namely how the follicle cells located at the posterior of the egg chamber trigger a signal to the oocyte for its subsequent polarization. To address this question the authors, center their studies on the localization of PAR proteins which are distributed along the antero-posterior axis. They more specifically focus on the mutual exclusion of Par1 and Bazooka/Par3 (Baz)at the posterior of the oocyte. The signaling event from the posterior follicle cells toward the oocyte is an essential process however it remains unsolved despite numerous screening and genetic manipulation approaches, leaving open the possibility that classical signaling involving a diffusible ligand emitted by follicular cells with its receptor located at the plasma membrane of the oocyte, would not be applied here.

      Here the authors are using original biophysical approaches to address whether signaling between the follicular cells and the oocyte would involve mechanical features.

      The authors focus at the dual exclusion between Baz and Par1 between the stages 10 and 11. To specifically follow these two proteins in the oocyte without being disrupted by their expression in the follicle cells, they used the Gal4/UASp system to express Baz and Par1. They found that Baz accumulate again at the posterior of the oocyte at stage 10B following the loss of contact between the posterior follicle cells (PFCs) and the oocyte whereas Par1 is gradually lost at that position. By using a glass micropipette to aspirate and pull on the PFCs they observed a premature Baz accumulation at the oocyte posterior. Then, to spatially improve the targeted area in the PFCs, the authors use a pulsed UV lazer, and show that PFCs are required to locally maintain posterior exclusion of Baz. Using a similar setup, they show that similarly Par1 is eliminated at the oocyte cell cortex region that had been in contact with ablated PFCs. However, Par1, with a kinetic slower to the one of Baz, is never disappearing before Baz appearance. Although difficult to distinguish, the authors report that the disappearance of Par1 is locally connected by an increase in microtubules (see major points). Finally, upon PFCs ablation, the authors show that the posterior reappearance of Baz is followed by the appearance of aPKC. However, the reappearance of PKC is slower than the removal of Par1, suggesting that in this case Par1 is not removed by PKC.

      The particularly interesting results of this work show that cellular contacts between PFCs and the oocyte are necessary to maintain Baz exclusion and Par1 localization. Furthermore, the ablation results suggest that individual PFCs are required to maintain local posterior exclusion of Baz. Overall it is an interesting observation, and most of the data are presented in a clean organized manner.

      Major comments

      1. The authors concentrate their studies on the distribution of Par3 and Par1 at the posterior part of the oocyte, mainly at stage 10 according to the images in the figures and movies. The involvement of Par3 and Par1 on polarized transport to the posterior pole of the oocyte has been well characterized previously between stages 7 and 9. The results of the authors are very interesting but they do not show that beyond the return of Baz and the disappearance of Par1 at the developmental stage they are looking at, the antero-posterior polarization and more particularly the localization of oskar in the posterior is affected. This is an important point as the authors propose that follicle cell contact maintains main body axis polarity. This would be possible by monitoring the impact of PFC ablation on the maintenance of oskar localization by tracking osk RNA with the MCP-MS2 system, or also by visualizing the staufen protein with a stau-GFP transgene.
      2. The authors use the Jupiter protein fused to the cherry protein to track MTs. This is perfectly fine to highlight the cytoplasm in the oocyte and to outline the cell-cell contacts between the PFCs and the oocyte. However, with Jupiter-cherry the microtubules are not clearly detected in the oocyte in the data presented.This is a problem because the authors want to make an important point with the potential reappearance of microtubules in the oocyte while Par1 has disappeared in the vicinity of the destroyed PFCs. (Fig5). The authors should use another microtubule reporter that allows better detection of microtubules in the oocyte, Jupiter-GFP, EB1-GFP, Ensconsin MT binding domain (EMTB)-RFP.

      Minor comments:

      1. The stage of the oocyte is not always indicated, this is particularly the case with the Fig2 with the pulling experiment with a glass micropipette.
      2. With the Fig 3E, to highlight the fact that the intensity of Baz increases very quickly after the removal of PFCs (1 mn) the authors should include an insert with a shorter time scale. The authors could also comment on the difference in velocity in baz reappearance when the ablation of PFCs includes or not polar cells.
      3. In the discussion line 240, this is not myosin II but myosin V which anchored oskar mRNA at the posterior.
      4. For the suppl figure 5, the n is not mentioned in the legend

      Significance

      Nature and significance of the advance and work in the context of existing literature

      This manuscript examines an important and unsolved question concerning the establishment of the polarity axis in the Drosophila oocyte, namely how the follicle cells located at the posterior of the egg chamber trigger a signal to the oocyte for its subsequent polarization (Gonzalez-Reyes et al ; Nature 1995 ;doi: 10.1038/375654a0) and (Roth et al; 1995; Cell; doi: 10.1016/0092-8674(95)90016-0). To address this question the authors, center their studies on the localization of PAR proteins which are distributed along the antero-posterior axis. They more specifically focus on the mutual exclusion of Par1 and Bazooka/Par3 (Baz) at the posterior of the oocyte. The signaling event from the posterior follicle cells toward the oocyte is an essential process however it remains unsolved despite numerous screening and genetic manipulation approaches, leaving open the possibility that classical signaling involving a diffusible ligand emitted by follicular cells with its receptor located at the plasma membrane of the oocyte, would not be applied here. We still know little about the modalities of this signaling between the follicular cells and the oocyte necessary for the polarization of the latter. We know that the first sign of anteroposterior polarization in the oocyte is posteriorly the recruitment of Par1 and subsequently the elimination of Baz. However, we do not know the nature of this signaling. Furthermore, we do not know whether this signaling must be maintained in order to maintain the polarization of the oocyte and more particularly to maintain the localization of oskar RNA, the posterior determinant of the oocyte, Here the authors are using original biophysical approaches to address whether signaling between the follicular cells and the oocyte would involve mechanical features. Important results of this work show that cell contacts between PFCs and the oocyte are necessary to maintain baz exclusion and Par1 localisation. Furthermore, the ablation results suggest that individual PFCs are required to maintain local posterior exclusion of Bazooka.

      Audience: These results will be of interest to those interested in the relationship between cell signaling and polarization in particular in a developmental context.

      Reviewer's area of expertise: Cell polarity, microtubule-associated transport, oocyte development in Drosophila.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The formation of mutually exclusive domains of Partition defective (Par) proteins works as a foundation for establishment of cell polarity in a variety of cells. The Drosophila oocyte is a well-known model system to study mechanisms of the asymmetric distribution Par proteins. At stage 6/7 of oogenesis, an unknown signal the posterior follicle cells (PFC) induces the recruitment of the Par-1 kinase to the posterior cortex of the oocyte and the concomitant exclusion of aPKC/Par-6 from this region. By contrast, Bazooka (Par-3) remains at the posterior with Par-1 and only disappears from the posterior at early stage 9. Millas et al. investigate the nature of the PFC signal and whether PFC continue to play a role in keeping Bazooka away from the posterior after the original signal is received by the oocyte. They do so by following the distribution of Bazooka and other Par proteins in living oocytes after pulling away or ablating the PFC at various stages of oogenesis.

      Major comments

      1. Quality of live imaging Judging from the appearance of the polar follicle cells and the size of the follicle cells, the authors constantly have an issue with maintaining a steady focal plane during live imaging in most movies (Figure 2 and video1; Figure 3 and video 2; Figure 4 and video 4; Figure 5 and video 5, FigureS5 and video 6). The conclusions of the paper are based on measuring changes in fluorescence intensity at the oocyte posterior over time, and this will be undermined by a varying focal plane. Considering the bullet shape of the oocyte, imaging the posterior at different focal planes could also cause artefacts. Supplementary Fig 3D-E and video 3 (a control experiment) are examples where the focal plane did not drift.
      2. Mechanical contact of PFC with the oocyte cortex causes the posterior exclusion of Bazooka and maintains oocyte polarity By physically pulling PFC away from the oocyte at stage 10b (Figures 1-2) the authors observed that in some oocytes Bazooka re-localises to posterior and concluded that it is a mechanical contact between PFC and the oocyte cortex that keeps Bazooka away from the posterior. Although this is an interesting observation per se, this is after the polarity of the oocyte has been defined (stages 6-9) and the posterior determinant, oskar mRNA has been localised. Could the authors do the same experiment at stages 6-9 to directly address whether the distance between the PFC and the oocyte cortex actually matters, considering that Bazooka remains at the posterior up to early 9 when the PFC and the oocyte are still at close contact?

      The conclusion that the signal between PFC and the oocyte could be mechanical is only one of potential interpretations of the experiment. It still could be a short range/ non-diffusible biochemical signal that is sensitive to the distance between the PFC and the oocyte membrane. The authors do not provide any evidence for or against either interpretation. 3. Figure 5B is supposed to demonstrate that local loss of Par-1 at the posterior causes the re-growth of microtubules from this region. However, the data provided are not convincing. The accumulation of red vesicles at the posterior cortex 150 min post ablation does not look like a specific signal for Jupiter-mCherry-marked microtubules. Similar vesicles start to be visible in the neighbouring follicle cells at the same time.

      Minor comments

      1. In Figure 4A-C, it is not clear what area has been ablated
      2. The authors should provide a simple 1-6 numbering for Video files

      Significance

      The observation that the PFC are required to maintain oocyte polarity at stage 9 is significant, but not very surprising, given the recent observation by Doerflinger et al that the posterior localisation of Par-1 requires continuous myosin activation, demonstrating that the antagonism between anterior and posterior Par proteins is not sufficient to maintain polarity once established. The authors must improve the quality of the live imaging to support this conclusion.

      The conclusion that phosphorylation of Bazooka by Par-1 is not sufficient to exclude Bazooka from the posterior cortex is not novel (see Doerflinger et al 2010, 2022).

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Comments for Butt et al This study examines the potential role of SHARPIN phosphorylation on cell migration. The role of the phosphorylation sites of SHARPIN, especially S146, is clearly shown by the expression of the mutant in various cell lines. However, the argument of the interaction with the Arp2/3 complex is relatively weak; only the FRET analysis in cells is shown. It would be good if the authors could include more mechanistic insights into the role of the phosphorylation of SHARPIN.

      Table 1. the reason for the selection of S131 and S156 is not clear. Why the S165 and T170 were not studied?

      The phosphorylation would be better confirmed by the antibody for the phosphorylation peptides. The possible phospho-mimic mutants, with the substitution to the acidic amino acid residues, would be better to be examined.

      Significance

      The potential role of SHAEPIN in the context of the current understanding of the cell migration machinery would be better to be described. Now the study only contains the mutant expression phenotype.

      Referees cross-commenting

      I agree with other reviewers and do not have other opinions. The three reviews appear to be reasonable.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper, Butt and co-workers examine the role of SHARPIN phosphorylation in Ser146 as it controls diverse readouts related to cell motility. The main findings of the study are: SHARPIN is robustly phosphorylated by PKCalpha in vitro. Mass spectrometry and in silico approaches indicate that some of these sites may be genuinely phosphorylated in cells. Ser146 phosphorylation seems uninvolved in SHARPIN-mediated integrin inactivation but imitation of the non-phosphorylated state using a Ser to Ala mutant impairs SHARPIN interaction with Arp2/3. S146A and V240A/L242A mutations impair lamellipodia formation. S146E mutation has a much less pronounced effect on SHARPIN-Arp2/3 interaction and lamellipodia formation, being close to the effect of the wild type. The authors conclude, based on this, that the active form of SHARPIN is likely phosphorylated in Ser146. In 3D matrix invasion assay, CRISPR-mediated SHARPIN deletion decreased invasiveness in several tumor cell lines. Perhaps the most striking experiment is that the authors use SHARPIN-KO MDA-MB-231 cells reconstituted with GFP-SHARPIN, WT or 146A, in a xenograft model in zebrafish, and they see that only WT-expressing cells form distal clusters of tumor cells. This is an interesting paper that merits publication, but several issues need to be addressed.

      1. Whereas the data are consistent and interesting, some points could be strengthened quantitatively. An example is the quantification of the invasion assays (Fig. 4), which is relatively crude (this reviewer has firsthand experience with these assays).
      2. Does SHARPIN control actin polymerization in response to stimulation? For example, growth factors in the cells used, or PMA.
      3. PKCalpha has been involved in the control of protrusion dynamics through its local effect on myosin II regulatory light chain (Asokan et al, 2014). Is PKCalpha actually phosphorylating SHARPIN in live cells? Are other kinases involved in vitro? This needs to be clearly and explicitly demonstrated.
      4. SHARPIN phosphorylated in Ser146 locally at lamellipodia? This is a hard experiment that requires a phospho-specific antibody, but the subcellular localization of the effect may be critical towards explaining the relative importance and generality of this mechanism.
      5. In the same vein, does SHARPIN S146E localize more readily to protrusions that Ser146A?
      6. The inference that active SHARPIN is phosphorylated in Ser146 needs to be demonstrated formally for the story to be substantiated. One possible manner could be to immunoprecipitate the Arp2/3 complex and show that most of the associated SHARPIN is phosphorylated in this residue by mass spectrometry. Alternatively, the authors could pull down integrins and show that SHARPIN is not phosphorylated in this case, which is also suggested by their data.
      7. The last piece of data is striking, but the xenograft nature of the experiment casts some doubt as to its significance. B16F10 cells similarly treated could be implanted in C57BL/6 mice to make a much stronger case.

      Significance

      The data presented here, if substantiated as indicated in the previous section, would constitute a significant addition to the state of the art. My enthusiasm for the story is curbed by the technical flaws (relatively minor) and conceptual gaps (more significant) stated above. My expertise is cell and molecular biology with focus on cytoskeletal-related cell motility.

      Referees cross-commenting

      I mostly concur with the revisions suggested by the other two reviewers. Reviewer #1 raises a significant point that needs to be addressed, which pertains to the overall levels of overexpression. Other than that, I stand by my review, which has many connection points with those of reviewers 1 and 3. I think this paper has been judged fairly and the experiments requested are not overly complicated. I understand if the authors cannot do the in vivo experiment in mice, and this alone should not be grounds for rejection. However, they need to address the rest of our points.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The SHARPIN protein is involved in multiple cellular pathways, and a number of studies demonstrated that it can interact with many key proteins regulating cellular proliferation, adhesion, motility and other functions that are important in normal cell physiology, but are particularly relevant for cancer metastasis. This study addresses post-translational phosphorylation of SHARPIN in human cells, and identifies two functionally important sites that are specifically involved in the interaction with ARP2/3, as well as in lamellipodia formation and cell motility (invasion). The function of one of these phosphorylation sites, the S146, is further explored in KD, KO and rescue experiments, in several human cancer cell lines, and in zebrafish model, using the directed mutagenesis approach to abolish or mimic the phosphorylation of S146. The authors conclude that Ph-S146 modification of SHARPIN is specifically responsible for interaction with ARP2/3, lamellipodia formation, and cancer cell invasion.

      Major comments:

      • Are the key conclusions convincing?

      Overall, the conclusions are convincing, though the question of wild-type and mutated SHARPIN in rescue experiments should be addressed, given the functional importance of SHARPIN overexpression in cancer. Indeed, throughout the paper, the authors monitor the expression levels of their ectopically expressed SHARPIN, and systematically refer to these molecules as being "overexpressed", without showing their relative levels with regard to normal endogenous levels of SHARPIN in these cell lines, prior to its KD/KO. However, as the typical cancer-related functions of SHARPIN are linked to its expression levels, it is important to understand whether the observed phenomena can be regarded as physiologically relevant, or are the authors operating out of the physiological scale. Given the capacity of HeLa and 293 cell lines to produce very high levels of ectopic proteins, this issue should be systematically controlled. Therefore, in all the experiments with ectopic expression of SHARPIN and its mutants, a western blot should be added to show the relative expression, compared to the endogenous protein.

      There are also some major problems with the statistical analysis (please see below). - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      I would modify the text of the article referring to Fig.2A, where the authors qualify a 20-25% reduction of integrin activity as "significant". It is not clear whether they refer to statistical or functional significance, and the statement is generally misleading. - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      As mentioned previously, I believe that the expression levels of ectopic SHARPIN have to be systematically monitored in all assays, and compared to the normal, endogenous levels. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      These experiments are fully realistic and should not involve additional costs. - Are the data and the methods presented in such a way that they can be reproduced?

      There is some ambivalence about the "n" values in most of the experiments, where it is not clear at all what "n" means (the number of cells? Experimental points?) For example, in Fig.S1D, the legend states that n=1, whereas the panel shows three experimental points per condition, as well as error bars. I would suggest that the authors correct these obvious mistakes and explain more clearly what exactly "n" means, in each case. - Are the experiments adequately replicated and statistical analysis adequate?

      I cannot comment on that because I do not understand what exactly "n" means (please see above). For example, if in Fig.3, "n=4" means that only four cells were analyzed per experimental condition, this is clearly not enough. The statistical analysis is not sufficiently described in the Materials and Methods section, and the reasons for attributing one, two or three stars to the results are not stated, either (normally, this information should be equally present in Legends to Figures). The choice of applying the t-test does not appear evident to me, in experiments that clearly require multiple statistical comparisons, and in general, the statistical analysis does not appear adequate.

      Minor comments:

      • Specific experimental issues that are easily addressable.

      Please provide a clear gel for Fig.1A, especially the right-hand panel. The current results look as one big black spot, with two red frames added for no clear reason. - Are prior studies referenced appropriately?

      I have no problem with the list of references. - Are the text and figures clear and accurate?

      It will be a good idea to proof-read the text. For example, I have noticed a frequent use of the word "lamellApodia" instead of "lamellipodia", as well as other typing errors. - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Molecular weight markers should be added to all western blots, and scale bars to all immunofluorescent images.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Given the functional importance of SHARPIN in cancer, and the fact that it interacts with multiple major regulatory networks, it is important to pinpoint the exact post-translational modification of this protein that is specifically responsible for interaction with ARP2/3, and for the invasion potential of cancer cells. - Place the work in the context of the existing literature (provide references, where appropriate).

      I agree with the description of the field and the place of the current study that is provided in the manuscript, and do not have anything significant to add. - State what audience might be interested in and influenced by the reported findings.

      The study will undoubtedly be interesting to scientists working with cell motility, ARP2/3-dependent lamellipodia formation, and eventually metastasis growth in cancer. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I study the RAC1-WAVE-ARP2/3 regulatory pathway in normal and cancer cells. I did not have any problems with evaluating this work, either academically, or with regard to methodology.

      Referees cross-commenting

      I am very pleased to see that the three reviews have a very similar evaluation of this article, and hope that the raised questions will help the authors to improve the manuscript and successfully publish their work. I do not have any additional comments.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewer for stating that “The detailed analysis uses many state of the art techniques to address the role of ROR1 and is of great interest to a large audience including basic researchers in the field of cancer biology and oncologists in the clinic” and we appreciate the reviewer’s constructive suggestions. We have substantially revised our manuscript and plan to perform new experiments based on these valuable comments.

      1. Description of the planned revisions

      Three main points: (1) The importance of AURKB as a downstream effector of ROR1 [Reviewer #1: major #2] Based on these suggestions, we plan to perform a colony formation assay using AURKB-overexpressing cells with ROR1-knockdown. We will clarify this point in the revised manuscript.

      (2) The link between ROR1 expression and YAP/BRD4 [Reviewer #1: major #5 and Reviewer #3: major #1] Based on the suggestion, we plan to perform the luciferase reporter assay. We will clearly describe this experiment in the revised manuscript.

      (3) Single-cell analysis using other models to validate tumor heterogeneity [Reviewer #2: major #1 and Reviewer #3: major #2] Based on your suggestion, we plan to analyze primary human tumors (public data: for example, GSE155698, CRA001160) and examine PDO#1 xenografts (in-house data). We will clearly state this information in the revised manuscript.

      For the two minor points suggested by Reviewer #2, we plan to (1) reanalyze TCGA data. (2) perform the organoid or colony formation assay to validate that the siRNA model functionally recapitulates the ROR1low vs. ROR1high phenotype.

      Please see the “Authors’ responses to the reviewers' comments” for more details.

      1. Description of the revisions that have already been incorporated in the transferred manuscript

      As suggested by the reviewer, we have substantially revised our manuscript, and the changes are shown in red. • Reviewer #1: major comments #2, #3, #4, and #5; minor comments #1 and #2 • Reviewer #2: major comments #2, #3, and #4; minor comments #2, #3, #4, #8, and #10 • Reviewer #3: minor comments #1 and #2

      Please see the “Authors’ responses to the reviewers' comments” for more details.

      1. Description of analyses that authors prefer not to carry out

      Authors’ responses to the reviewers' comments

      Reviewer #1

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript the authors analyzed the role of ROR1 in pancreatic cancer progression and metastasis. They found that ROR1 expression is specifically increased in an partial EMT cell cluster upon scRNA-Seq of tumor cells derived from an orthotopic mouse PDAC model. Moreover, the ROR1 high population in tumors specifies cells with high proliferation and tumor initiation capacities, increased metastatic propensity and chemoresistance, since knockdown of ROR1 shows reduction of these features in vivo. By comparing transcriptomes from several in vivo models the authors identified that ROR1 acts through AURKB and that its expression is regulated by an upstream enhancer that is bound by YAP/TAZ and BRD4 complexes. With this study the authors identified a new targetable pathway that promotes tumor progression and metastasis in PDAC. The detailed analysis uses many state of the art techniques to address the role of ROR1 and is of great interest to a large audience including basic researchers in the field of cancer biology and oncologists in the clinic. However, some of the findings are a bit preliminary and the drawn conclusions are not sufficiently supported by the experimental data. Moreover, some findings seem a bit out of context and do not really help to bring the story forward. At other instances experimental details are missing to mechanistically demonstrate the role of ROR1. In particular it remains elusive how ROR1 is regulated, i.e. which signaling events are crucial to generate ROR1 high vs. low cells. I listed my specific comments below.

      [Response] We thank the reviewer for stating that “The detailed analysis uses many state of the art techniques to address the role of ROR1 and is of great interest to a large audience including basic researchers in the field of cancer biology and oncologists in the clinic” and we appreciate the reviewer’s constructive suggestions. We have substantially revised our manuscript and plan to perform new experiments based on these valuable comments.

      1. The authors' initial finding is that in the partial EMT cluster ROR1, but also other RTKs (out of 56) are specifically increased. What about the other RTKs? Why was ROR1 chosen to analyze more thoroughly?

      [Response 1] We are thankful for the reviewer’s suggestion to clarify why ROR1 was selected. (1) Seven candidate genes (EPHA4, EPHA7, ERBB4, FGFR1, JAK3, LYN, and ROR1) were chosen as surface markers in the partial EMT cluster. (2) The genes were sorted in order of high expression. (3) ROR1 is reported to promote metastasis in breast cancer (Cui et al, 2013). The induction of metastasis is one of the functions of tumor-initiating cells. FGFR1 is already known to enhance the CSC-like phenotype in non-small cell lung cancer (Ji et al, 2016). (4) The antibody against ROR1 was marketed as available for cell sorting using FACS. Therefore, we focused on ROR1 as a potential new marker for tumor-initiating cells with a partial EMT signature.

      References Cui B, Zhang S, Chen L, Yu J, Widhopf GF 2nd, Fecteau JF, Rassenti LZ, Kipps TJ. Targeting ROR1 inhibits epithelial-mesenchymal transition and metastasis. Cancer Res. 2013 Jun 15;73(12):3649-60. doi: 10.1158/0008-5472.CAN-12-3832. PMID: 23771907; PMCID: PMC3832210. Ji W, Yu Y, Li Z, Wang G, Li F, Xia W, Lu S. FGFR1 promotes the stem cell-like phenotype of FGFR1-amplified non-small cell lung cancer cells through the Hedgehog pathway. Oncotarget. 2016 Mar 22;7(12):15118-34. doi: 10.18632/oncotarget.7701. PMID: 26936993; PMCID: PMC4924774.

      1. The finding of AURKB as crucial target of ROR1 is very weak and needs more in-depth analyses. It is not clear why AURKB was chosen over the other candidates. Is AURKB expression directly regulated by ROR1? Are the two genes directly linked? Can ROR1 deficiency be compensated by AURKB overexpression? Especially the decrease in AURKB protein level in Fig. 4K is not very convincing to account for the different phenotypes in ROR1 high and low cells. Is AURKB and ROR1 expression correlated in TCGA samples (like Fig. 8B)? In Fig. 4L the readout was changed from colony numbers to colony diameter. If AURKB is the crucial player downstream of ROR1, then colony formation efficiency should be affected at first. This needs to be shown. The statement in lines 223,224 that AURKB is a direct downstream target of ROR1 was not shown!

      [Response 2-1: changed] We thank the reviewer for noting this issue. We have performed additional experiments to assess the hypothesis that AURKB is a crucial downstream target of ROR1. ROR1-knockdown not only suppressed AKT phosphorylation (Supplemental Figure 9A) but also decreased c-Myc protein levels and the expression of c-Myc target genes (CDK4, CCND1, CDK2, and CCNE1), leading to a reduction in RB phosphorylation (new Supplemental Figure 9B and 9C). Based on these results, ROR1 regulates c-Myc expression through AKT signaling, leading to the activation of the E2F network (new Supplemental Figure 9D). We added some figures and descriptions to the preliminary revision manuscript (new Supplemental Figure 9B–9D, lines 357–363, lines 649–651).

      [Response 2-2: the planned revisions] We also plan to perform new experiments with a colony formation assay to determine whether ROR1 deficiency is compensated by AURKB overexpression. We agree that this experiment will confirm that AURKB is an important downstream target of ROR1 in PDAC proliferation.

      [Response 2-3] In TCGA-PAAD dataset, AURKB expression was not correlated with ROR1 expression. Since the ROR1high cluster is a minor population in the tumor, a downstream analysis of specific clusters with results from a bulk study such as this TCGA dataset is difficult to perform.

      [Response 2-4: changed] We have added a new graph of organoid formation efficiency (new Figure 4L) and changed some descriptions in the preliminary revision manuscript (line 227).

      1. Fig. 4 A-E: The ROR1 KD was induced in vitro but not continued in vitro. The transient KD has a strong impact on tumor forming capacity, even though recovery of expression is likely within the first days in vivo. This is very interesting and underscores the role of ROR1 in tumor initiation and presumably independent of differences in proliferation. Would the results be different, if the DOX treatment would start with injection of the cells and continued in vivo? Is then tumor initiation not affected and maybe only tumor growth?

      [Response 3: changed] We apologize for the confusing description in the original manuscript. In Fig. 4A–E, we used PDAC cells with stable expression of doxycycline-inducible shROR1. ROR1-knockdown was maintained in vivo by adding doxycycline to the drinking water. Continuous ROR1-knockdown suppressed tumor growth (Fig. 4C–E). Several statements we made were more ambiguous than intended, and we have adjusted the text and the figures for clarity in the preliminary revision manuscript (new Figure 4A and B, lines 203–204).

      1. In Fig. 5 the authors show that ROR1 is highly expressed in tumors after gemcitabine treatment and conclude that the ROR1 high cells are a resistant population. However, this statement is too strong, since gemcitabine treatment could also lead to an upregulation of ROR1 in "low" cells during acquisition of chemoresistence. Together with our knowledge on the role of EMT in driving therapy resistance and therapy-mediated induction of EMT, such a scenario is equally likely. Similarly, the statement in lines 370-372 is not supported by experimental evidence.

      [Response 4: changed] We appreciate the reviewer’s critical comments. As suggested, we have not clearly determined whether (1) the ROR1high cells survived gemcitabine treatment and/or (2) the ROR1low cells increased ROR1 expression upon exposure to this treatment. We have carefully changed some descriptions in the preliminary revision manuscript (lines 241–242, 382–383).

      1. In order to understand how ROR1 is regulated, the authors use ATAC-Seq and cut and run and identified a putative upstream enhancer element (Fig. 7). Although this element increases the activity of the promoter fragment in a reporter construct, the experiments do not help to understand how ROR1 activity is increased specifically in the "high" cells. Are peaks of YAP1 and BRD4 also changed between hi/lo cells? Is YAP OE and KD (BRD4 OE and KD) or the use of the inhibotor JQ1 altering the activity of the reporter constructs (i.e. only of the enhancer-promoter combination but not of the promoter only construct)? This would help to strengthen a direct link between ROR1, YAP and BRD4. Is YAP activity different in ROR1 high vs. low cells?

      [Response 5-1: changed] We thank the reviewer for this important comment. We have shown differences in chromatin accessibility and histone modification of the ROR1 enhancer between ROR1high and ROR1low cells using ATAC-seq and CUT&RUN assays (Fig. 7B). Very few ROR1high/low cells are present in xenograft. We were not successful in experiments examining the binding of YAP and BRD4 to enhancers in ROR1high/low cells because of the technical limitations in the ChIP and CUT&RUN assays. Instead, we used public data to examine YAP and BRD4 occupancy at the ROR1 enhancer region of cell lines with low ROR1 expression. In T-47D and MCF7 cells (breast cancer cells, low ROR1 expression), YAP and BRD4 did not bind to the ROR1 enhancer region (new Figure 8D and 8I). We have added figures and some descriptions to the preliminary revision manuscript (new Figure 8D and 8I, lines 304–309, line 768).

      [Response 5-2: the planned revisions] We plan to perform new experiments with the reporter assay you suggested. We agree that this experiment will help strengthen the direct link between ROR1, YAP and BRD4.

      [Response 5-3] As shown in Figure 8C, GSEA revealed that ROR1high cells in both S2-VP10 xenografts and PDO#1 xenografts expressed higher levels of YAP-regulated genes than ROR1low cells in these xenografts. We have added a description of this result as follows: “Thus, ROR1high cells have higher YAP activity than ROR1low cells.” (lines 304–305).

      1. In Fig. 8A the authors identified 202 antigens that match the H3 monomethylation / acetylation pattern. How was YAP etc. chosen?

      [Response 6] We apologize for the poor description in the original manuscript. We chose YAP and BRD4 based on the following criteria: (1) these antigens are expressed in S2-VP10 cells and PDO#1 and (2) bind to the ROR1 enhancer region (based on an analysis of public data).

      Minor: 1. Fig. 2D,E: What is actually shown here? Is there an overlap between the genes that define ROR1 high vs. low cells in both approaches? The gene list should be provided.

      [Response: changed] We apologize for the poor description in the original manuscript. We have added this information to the preliminary revision manuscript (new Supplemental Table 3).

      1. Fig. 3G: I suggest to include the images of the tumors from the ROR1 low cells in the main figure as well.

      [Response: changed] We appreciate the reviewer’s suggestion. We have moved this information from the supplementary information to the main figure in the preliminary revision manuscript (new Figure 3G, lines 186–189).

      Reviewer #1 (Significance (Required)):

      PDAC is a very aggressive desease with very low 5-year survival rates. Understanding of the pathobiology is of keen interest. The findings of the authors are of high significance and extremely relevant as they provide a mechanism that can also be targeted by specific drug combinations, i.e. standard care gemcitabine with specific ROR1 inhibition. The findings are of great interest to a large audience including basic researchers in the field of cancer biology and oncologists in the clinic.

      [Response] We greatly appreciate the reviewer’s comments.

      Reviewer #2

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this work Yamazaki and colleagues performed single cell RNA sequencing of one xenograft tumor formed by the S2-VP10 PDAC cell line to explore PDAC intratumor heterogeneity. Using this model they identified ROR1 as heterogeneously expressed in neoplastic cells. Using further in vivo and in vitro models they show that ROR1high cells have higher tumor initiation capacity than ROR1low. By histone and ATAC-seq analyses, they identify a ROR1 enhancer upstream the promoter and show that YAP and BRD4 bind to this genomic region and that BRD4 inhibition by JQ1 reduces ROR1 expression and organoid formation. The data, figures and methods are nicely and clearly presented.

      [Response] We thank the reviewer for stating that “The data, figures and methods are nicely and clearly presented”, and we appreciate the reviewer’s constructive suggestions. We have substantially revised our manuscript and plan to perform new experiments based on these valuable comments.

      Major comments

      1. The authors use one xenograft tumor as starting model and all conclusions are derived from the data generated with this model. To support the existence of identifie heterogeneity in the PDAC neoplastic compartment, I would strongly suggest to validate the existence of the partial EMT population and the ROR1 heterogeneity in single cell data bases generated from primary human tumors.

      [Response 1: the planned revisions] We thank the reviewer for the positive suggestion. We plan to perform a new analysis of available public single-cell data from human PDAC tumors. In addition, we also launched a single-cell analysis of PDO#1 xenografts.

      1. In Fig. 3G, it is mentioned that tumors grown from ROR1high cells recapitulate the original PDOx histology thus suggesting that ROR1high cells in the tumor are the actual TICs. ROR1low cells could also grow tumors, just with lower incidence. Are these tumors any different to the ROR1high derived ones? Is it just a lower tumor initiation capacity (TIC) or they can not recapitulate the tumor as the ROR1high cell? Can they also give rise to differentiated progeny cells? This should appear in the main text and not only in the discussion. I would suggest to move panel 3G to supplementary figure.

      [Response 2: changed] We thank the reviewer for noting this issue and apologize for the confusing description in the original manuscript. ROR1low cells generated tumors at a low frequency, and these tumors showed a hierarchical histology mimicking the original tumor. As suggested, we have added this information to the main text (new Figure 3G, lines 186–189).

      1. In line 160 you mention that known CSC markers such as CD44, PROM1 and DCLK1 are not differentially expressed between ROR1 high and low populations. Then, in figure 3H,I you analyze the expression of CD44v6 together with ROR1. I would try to put this information together in the text, or at least in fig. 3 start with something like "we had seen that both ROR1high and low express CD44, however...". In any case, I feel that the experiment with CD44 could be obviated (or at least moved to supplementary), as it brings the question of weather this is also true for DCLK1 or CD133.

      [Response 3: changed] We appreciate and agree with the reviewer's comment on this point. Accordingly, we have moved this figure to the supplementary information and changed the description (new Supplemental Figure 5C and 5D, lines 191–196).

      1. JQ1 has been described to inhibit PDAC growth by downregulation of MYC. To unequivocally link the effect of JQ1 in the downregulation of ROR1 (Fig. 8M) as discussed in the text it would be important to exclude that other mechanisms such as MYC downregulation are taking place. For example, does JQ1 treatment of ROR1low cells also reduce their colony formation capacity (in an experiment such as the one in fig. 3C). Or does ROR1 re-expression in Fig. 8M rescue the JQ1 effect? These or other experiments could help to establish a stronger link between (BRD4/JQ1) and ROR1.

      [Response 4: changed] We thank the reviewer for this important comment. As mentioned in the response to Reviewer #1-major comment #2, we newly found that ROR1 regulates c-Myc expression through AKT signaling, leading to the activation of the E2F network (new Supplemental Figures 9B–9D, lines 357–363).

      Minor comments 1. The data are nicely presented (text and figures) and the conclusions are clear. My suggestion to make the story more "catchy" at the beginning would be, if possible, to start from the observation done in primary human data and then move to the PDX model to explore ROR1 as a TIC marker in PDAC. For this, you could use available public single cell data of human PDAC tumors. If this doesn't work (it is of course possible that by unsupervised analysis you don't get the same clusters as in the PDX with the partial EMT cluster popping up), it would be nice if some primary tumor data came early in the story (currently the first figure showing heterogeneity in primary samples is in supplem fig. 4A).

      [Response: the planned revisions] We thank the reviewer for these excellent comments. As suggested, we plan to perform several new analyses (please see the previous comment for details: Reviewer #2-major comment #1).

      1. It is not clear if the xenografts were subcutaneous or orthotopic. It would be good to include this information in the main text (line 102) and the methods so that the reader knows what is the exact model that has been used.

      [Response: changed] We thank the reviewer for this comment and apologize for the poor description in the original manuscript. As suggested, we have added this information to the preliminary revision manuscript (line 101).

      1. In Fig. 2F and 2G I would highlight the EMT pathway to help the reader.

      [Response: changed] We thank the reviewer for this comment. As suggested, we have changed the relevant figures in the preliminary revision manuscript (new Figure 2F and 2G).

      1. In Supp Fig 4B it would be nice to have an amplified view of the staining as in panel C of the same figure.

      [Response: changed] We thank the reviewer for this comment. As suggested, we have added high-magnification images of the staining in the preliminary revision manuscript (new Supplemental Figure 4A and 4B).

      1. In the same figure (Fig. 4A-D) ROR1 shows an apical staining pattern that doesn't seem to resemble the staining in patient samples. I am not an expert in pathology evaluation but I would recommend a pathologist to give her/his opinion. Possibly, during the PDX process, few cells from the original patient tumor are selected giving a different staining pattern.

      [Response] We appreciate the reviewer's comment on this point. Dr. Ito, a coauthor of this paper, is a pathologist. We have changed some images of staining in patient samples (new Supplemental Figure 4A). We agree that ROR1 shows an apical staining pattern in PDX samples. However, some sites show similar apical staining patterns in patient samples (Patient #2 and Patient #4 in the new Supplemental Figure 4A). We propose that PDX mimics the original patient tissue because it has heterogeneity of ROR1 expression and morphological features indicative of a luminal structure.

      1. In the analyses of TCGA data, be aware that only 150 from the original dataset are actual PDAC tumors. The dataset contains otherwise data from cell lines, PDX, normal tissue, etc that should be removed for a proper analysis (see DOI: 10.3390/cancers11010126)

      [Response: the planned revisions] We thank the reviewer for the careful review of this issue. We are currently reconsidering with the pathologist whether the samples are appropriate based on TCGA data (diagnosis and pathology sections) and the paper you presented. The current data (Figures 3A, 4J, and 8B) were analyzed for samples excluding cell lines, PDX, and normal tissue in the TCGA-PAAD dataset.

      1. Does ROR1 correlate with RFS? This would nicely fit with the concept of TIC and metastasis.

      [Response] We thank the reviewer for noting this issue. Unfortunately, no correlation was observed between ROR1 expression and RFS.

      1. Line 219: ROR1 is not "depleted" in the lines as it is a downregulation model. "ROR1-downregulated" would be more correct.

      [Response: changed] We thank the reviewer for this suggestion and agree with your comment. We have corrected this term accordingly in the preliminary revision manuscript (line 223).

      1. It would be good to have a supplem figure showing that siROR1 cells show reduction organoid formation, to validate that the siRNA model functionally recapitulates the ROR1low vs high phenotype.

      [Response: the planned revisions] We thank the reviewer for this suggestion. We plan to perform a colony formation assay.

      1. Some of the supplemental figures are only referred in the discussion although they appear earlier than other in the main text. This is a bit confusing when going through the figures.

      [Response] We apologize for the poor description in the original manuscript. We have adjusted the order of the supplemental figures in the preliminary revision manuscript.

      CROSS-CONSULTATION COMMENTS I agree with the importance of addressing points 2 (link to AURKB), 4 (selection vs acquisition), 5 (mechanism in high vs low cells) raised by Reviewer 1, and the comments from Reviewer 3. I think that the study of other RTKs (point 1 from Reviewer 1) is not the focus of the story. It would be nice if the authors can comment on why they chose ROR1 but the fact that are other differentially expressed genes does not exclude the validity of the current story. I fell that the in vivo sustained KD experiment (point 3 from Reviewer 1) although interesting, it is not mandatory for a revision of this manuscript in case the adaptation of the animal protocol represents a long process. The experiment provided already in the current version is the best approach to address the role of ROR1 at the early initiation phase.

      [Response] We thank the reviewer for these positive comments. As suggested, we have substantially revised our manuscript.

      Reviewer #2 (Significance (Required)):

      Significance: This is a neat and interesting work with potential implications for the clinical field of pancreatic cancer as the authors identified a new subpopulation with enhanced tumor initiating cell capacity. However, the use of JQ1 for pancreatic cancer has been previously discussed mainly linked to MYC inhibition, but also to stromal reprogramming or DNA damage induction. I missed some discussion in this regard in the discussion section. What is adding the work to the field of JQ1 treatment in PDAC? IN a way, how do the authors foresee that the discovery of ROR1high cells and the regulation of ROR1 by BRD4 and YAP will be beneficial when considering JQ1 in the clinics? Maybe by stratifying patients? Or by following ROR1 upregulation upon initial chemotherapy? These questions are just suggestions. In general, some discussion to put the work into the context of previous works using JQ1 in PDAC would be nice.

      [Response: changed] We thank the reviewer for this comment. As you suggested, we have added a description of the proposed use of JQ1 and BRD4 inhibitors in ROR1high PDAC treatment to the Discussion section (lines 412–416).

      I believe that this work would be interesting not only to the pancreatic cancer community but also to a more general public working on cancer and/or stemmness as it touches several interesting points in that regard that can be applicable to other systems. My own work is focused on pancreatic cancer, patient heterogeneity and stromal interactions. I am not an expert on histone or ATACseq analyses.

      [Response] We greatly appreciate the reviewer’s comments.

      Reviewer #3

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary Yamazaki et al investigate partial EMT in pancreatic cancer and provide data that ROR1 marks pancreatic tumor cells that are capable of initiating tumors. The authors exploit scRNAseq of pancreatic tumor xenografts to identify a cluster of cells showing a partial EMT phenotype. The found 7 RTKs expressed more highly in this partial EMT cluster and focus their attention on ROR1, an 'orphan' receptor that has been implicated in WNT signaling and EMT previously. Validation experiments using ROR1-high vs low cells support that ROR1 expression correlates with EMT, poor outcome in human PDA patients, tumor forming and colony forming capacity. They also show that ROR1 high cells form tumors that recapitulate parental tumor histology. The authors show that ROR1 expression is associated with EF2 transcription factor activity, elevated expression of multiple targets including AURKB. Pharmacologic inhibition of AURKB reduces colony formation and genetic loss of ROR1 combined with chemotherapy (gemcitabine) has potent anti-tumor activity in vivo. The authors show that ROR1 expression is elevated in metastatic lesions and identify a novel enhancer element that putatively drives ROR1 expression in tumor cells. They provide evidence that this element is engaged by YAP/BRD4 and show that BRD4 inhibition reduces tumor cell colony formation. The manuscript is a solid combination of techniques with adequate controls and statistics.

      [Response] We thank the reviewer for stating that “The manuscript is a solid combination of techniques with adequate controls and statistics”, and we appreciate the reviewer’s constructive suggestions. We have substantially revised our manuscript and plan to perform new experiments based on these valuable comments.

      Major Comments: The overall conclusion that ROR1 expression marks a subset of pancreatic cancer cells that have the ability to initiate tumors is supported by the data provided. The correlative data are strong and the demonstration that loss of ROR1 reduces colony formation, reduces metastatic lesions and enhances the efficacy of chemotherapy are compelling. Additionally, the demonstration that ROR1 expression is elevated in metastatic lesions is consistent with many other drivers/markers of EMT in pancreatic cancer.

      The conclusion that ROR1 expression is driven by YAP/BRD4 is interesting and provides important mechanistic depth to the study. However, this conclusion could be strengthened by use of a suitable rescue experiment. For instance does overexpression of ROR1 rescue the effect of BRD4 inhibition or loss of YAP?

      [Response 1: the planned revisions] We thank the reviewer for this comment. We completely agree with the reviewer’s suggestion. However, the suggested examination to determine whether overexpression of ROR1 rescues the effect of BRD4 inhibition or loss of YAP may not be suitable because BRD4 and YAP act as transcriptional coregulators of various target genes. Instead, as mentioned in response to Reviewer #1-major comments 5-2, we plan to perform new experiments using a reporter assay.

      A challenge with the data presented in Figure 1, the scRNA-seq data that lead them to ROR1, is that it is not stated how many tumors are used to generate the scRNA-seq data and the overall number of tumor cells analyzed is relatively low (993). The authors should provide the number of tumors used for the initial scRNA-seq. A general concern with any scRNA-seq data is batch effect, this is mitigated to a degree by the follow on studies that provide functional validation of ROR1 in multiple cell lines.

      [Response 2: changed and the planned revisions] We appreciate the reviewer’s comments. As suggested, we have added this information to the preliminary revision manuscript (line 104). In addition, as mentioned in response to Reviewer #2 major comment #1, we plan to perform a new single-cell analysis of PDO xenografts (in-house data) and human PDAC tumors (available public data).

      The data and methods are provided in an adequate manner. Reproduction of the experiments is likely. The authors use multiple cell lines and tools that are generally available. The authors note a limitation of the study is that only human tumor xenografts were exploited.

      [Response] We thank the reviewer for the positive comment.

      Minor comments: Figure 1E and text page 9. The text identifies MERB3 as a gene that marks the partial EMT cluster, I believe this is a type and the gene should actually be MSRB3.

      [Response: changed] We apologize for the typo. We have corrected this error accordingly (line 114).

      Please provide the dose of gemcitabine in the legend for figure 5

      [Response: changed] We apologize for the poor description in the original manuscript. We have added this information.

      CROSS-CONSULTATION COMMENTS I think the comments from Referee #2 are pretty reasonable - have no additions

      Reviewer #3 (Significance (Required)):

      Intratumor heterogeneity is a major challenge for the treatment of many cancers, including pancreatic cancer. The data provided support that ROR1 marks a subset of cancer cells in pancreatic tumors that have the capacity to drive intratumor heterogeneity. If supported these data have the potential to drive significant impact. Identification of a marker and a targetable pathway that supports tumor initiation in pancreatic cancer has the potential to nominate companion therapies that enhance the efficacy of standard of care approaches. Further, identification of a pathway that drives partial EMT in pancreatic cancer provides a substantial increase in baseline knowledge of intratumor heterogeneity.

      These data would be broadly interesting to scientists interested in the tumor microenvironment, metastasis, therapy resistance and tumor progression. In addition, oncologists focused on drug development and combinatorial therapy will find this manuscript of interest.

      [Response] We greatly appreciate the reviewer’s comments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      Yamazaki et al investigate partial EMT in pancreatic cancer and provide data that ROR1 marks pancreatic tumor cells that are capable of initiating tumors. The authors exploit scRNAseq of pancreatic tumor xenografts to identify a cluster of cells showing a partial EMT phenotype. The found 7 RTKs expressed more highly in this partial EMT cluster and focus their attention on ROR1, an 'orphan' receptor that has been implicated in WNT signaling and EMT previously. Validation experiments using ROR1-high vs low cells support that ROR1 expression correlates with EMT, poor outcome in human PDA patients, tumor forming and colony forming capacity. They also show that ROR1 high cells form tumors that recapitulate parental tumor histology. The authors show that ROR1 expression is associated with EF2 transcription factor activity, elevated expression of multiple targets including AURKB. Pharmacologic inhibition of AURKB reduces colony formation and genetic loss of ROR1 combined with chemotherapy (gemcitabine) has potent anti-tumor activity in vivo. The authors show that ROR1 expression is elevated in metastatic lesions and identify a novel enhancer element that putatively drives ROR1 expression in tumor cells. They provide evidence that this element is engaged by YAP/BRD4 and show that BRD4 inhibition reduces tumor cell colony formation. The manuscript is a solid combination of techniques with adequate controls and statistics.

      Major Comments:

      The overall conclusion that ROR1 expression marks a subset of pancreatic cancer cells that have the ability to initiate tumors is supported by the data provided. The correlative data are strong and the demonstration that loss of ROR1 reduces colony formation, reduces metastatic lesions and enhances the efficacy of chemotherapy are compelling. Additionally, the demonstration that ROR1 expression is elevated in metastatic lesions is consistent with many other drivers/markers of EMT in pancreatic cancer.

      The conclusion that ROR1 expression is driven by YAP/BRD4 is interesting and provides important mechanistic depth to the study. However, this conclusion could be strengthened by use of a suitable rescue experiment. For instance does overexpression of ROR1 rescue the effect of BRD4 inhibition or loss of YAP?

      A challenge with the data presented in Figure 1, the scRNA-seq data that lead them to ROR1, is that it is not stated how many tumors are used to generate the scRNA-seq data and the overall number of tumor cells analyzed is relatively low (993). The authors should provide the number of tumors used for the initial scRNA-seq. A general concern with any scRNA-seq data is batch effect, this is mitigated to a degree by the follow on studies that provide functional validation of ROR1 in multiple cell lines.

      The data and methods are provided in an adequate manner. Reproduction of the experiments is likely. The authors use multiple cell lines and tools that are generally available.

      The authors note a limitation of the study is that only human tumor xenografts were exploited.

      Minor comments:

      Figure 1E and text page 9. The text identifies MERB3 as a gene that marks the partial EMT cluster, I believe this is a type and the gene should actually be MSRB3.

      Please provide the dose of gemcitabine in the legend for figure 5

      Referees cross-commenting

      I think the comments from Referee #2 are pretty reasonable - have no additions

      Significance

      Intratumor heterogeneity is a major challenge for the treatment of many cancers, including pancreatic cancer. The data provided support that ROR1 marks a subset of cancer cells in pancreatic tumors that have the capacity to drive intratumor heterogeneity. If supported these data have the potential to drive significant impact. Identification of a marker and a targetable pathway that supports tumor initiation in pancreatic cancer has the potential to nominate companion therapies that enhance the efficacy of standard of care approaches. Further, identification of a pathway that drives partial EMT in pancreatic cancer provides a substantial increase in baseline knowledge of intratumor heterogeneity.

      These data would be broadly interesting to scientists interested in the tumor microenvironment, metastasis, therapy resistance and tumor progression. In addition, oncologists focused on drug development and combinatorial therapy will find this manuscript of interest.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this work Yamazaki and colleagues performed single cell RNA sequencing of one xenograft tumor formed by the S2-VP10 PDAC cell line to explore PDAC intratumor heterogeneity. Using this model they identified ROR1 as heterogeneously expressed in neoplastic cells. Using further in vivo and in vitro models they show that ROR1high cells have higher tumor initiation capacity than ROR1low. By histone and ATAC-seq analyses, they identify a ROR1 enhancer upstream the promoter and show that YAP and BRD4 bind to this genomic region and that BRD4 inhibition by JQ1 reduces ROR1 expression and organoid formation. The data, figures and methods are nicely and clearly presented.

      Major comments

      1. The authors use one xenograft tumor as starting model and all conclusions are derived from the data generated with this model. To support the existence of identifie heterogeneity in the PDAC neoplastic compartment, I would strongly suggest to validate the existence of the partial EMT population and the ROR1 heterogeneity in single cell data bases generated from primary human tumors.
      2. In Fig. 3G, it is mentioned that tumors grown from ROR1high cells recapitulate the original PDOx histology thus suggesting that ROR1high cells in the tumor are the actual TICs. ROR1low cells could also grow tumors, just with lower incidence. Are these tumors any different to the ROR1high derived ones? Is it just a lower tumor initiation capacity (TIC) or they can not recapitulate the tumor as the ROR1high cell? Can they also give rise to differentiated progeny cells? This should appear in the main text and not only in the discussion. I would suggest to move panel 3G to supplementary figure.
      3. In line 160 you mention that known CSC markers such as CD44, PROM1 and DCLK1 are not differentially expressed between ROR1 high and low populations. Then, in figure 3H,I you analyze the expression of CD44v6 together with ROR1. I would try to put this information together in the text, or at least in fig. 3 start with something like "we had seen that both ROR1high and low express CD44, however...". In any case, I feel that the experiment with CD44 could be obviated (or at least moved to supplementary), as it brings the question of weather this is also true for DCLK1 or CD133.
      4. JQ1 has been described to inhibit PDAC growth by downregulation of MYC. To unequivocally link the effect of JQ1 in the downregulation of ROR1 (Fig. 8M) as discussed in the text it would be important to exclude that other mechanisms such as MYC downregulation are taking place. For example, does JQ1 treatment of ROR1low cells also reduce their colony formation capacity (in an experiment such as the one in fig. 3C). Or does ROR1 re-expression in Fig. 8M rescue the JQ1 effect? These or other experiments could help to establish a stronger link between (BRD4/JQ1) and ROR1.

      Minor comments

      1. The data are nicely presented (text and figures) and the conclusions are clear. My suggestion to make the story more "catchy" at the beginning would be, if possible, to start from the observation done in primary human data and then move to the PDX model to explore ROR1 as a TIC marker in PDAC. For this, you could use available public single cell data of human PDAC tumors. If this doesn't work (it is of course possible that by unsupervised analysis you don't get the same clusters as in the PDX with the partial EMT cluster popping up), it would be nice if some primary tumor data came early in the story (currently the first figure showing heterogeneity in primary samples is in supplem fig. 4A).
      2. It is not clear if the xenografts were subcutaneous or orthotopic. It would be good to include this information in the main text (line 102) and the methods so that the reader knows what is the exact model that has been used.
      3. In Fig. 2F and 2G I would highlight the EMT pathway to help the reader.
      4. In Supp Fig 4B it would be nice to have an amplified view of the staining as in panel C of the same figure.
      5. In the same figure (Fig. 4A-D) ROR1 shows an apical staining pattern that doesn't seem to resemble the staining in patient samples. I am not an expert in pathology evaluation but I would recommend a pathologist to give her/his opinion. Possibly, during the PDX process, few cells from the original patient tumor are selected giving a different staining pattern.
      6. In the analyses of TCGA data, be aware that only 150 from the original dataset are actual PDAC tumors. The dataset contains otherwise data from cell lines, PDX, normal tissue, etc that should be removed for a proper analysis (see DOI: 10.3390/cancers11010126)
      7. Does ROR1 correlate with RFS? This would nicely fit with the concept of TIC and metastasis.
      8. Line 219: ROR1 is not "depleted" in the lines as it is a downregulation model. "ROR1-downregulated" would be more correct.
      9. It would be good to have a supplem figure showing that siROR1 cells show reduction organoid formation, to validate that the siRNA model functionally recapitulates the ROR1low vs high phenotype.
      10. Some of the supplemental figures are only referred in the discussion although they appear earlier than other in the main text. This is a bit confusing when going through the figures.

      Referees cross-commenting

      I agree with the importance of addressing points 2 (link to AURKB), 4 (selection vs acquisition), 5 (mechanism in high vs low cells) raised by Reviewer 1, and the comments from Reviewer 3. I think that the study of other RTKs (point 1 from Reviewer 1) is not the focus of the story. It would be nice if the authors can comment on why they chose ROR1 but the fact that are other differentially expressed genes does not exclude the validity of the current story. I fell that the in vivo sustained KD experiment (point 3 from Reviewer 1) although interesting, it is not mandatory for a revision of this manuscript in case the adaptation of the animal protocol represents a long process. The experiment provided already in the current version is the best approach to address the role of ROR1 at the early initiation phase.

      Significance

      This is a neat and interesting work with potential implications for the clinical field of pancreatic cancer as the authors identified a new subpopulation with enhanced tumor initiating cell capacity. However, the use of JQ1 for pancreatic cancer has been previously discussed mainly linked to MYC inhibition, but also to stromal reprogramming or DNA damage induction. I missed some discussion in this regard in the discussion section. What is adding the work to the field of JQ1 treatment in PDAC? IN a way, how do the authors foresee that the discovery of ROR1high cells and the regulation of ROR1 by BRD4 and YAP will be beneficial when considering JQ1 in the clinics? Maybe by stratifying patients? Or by following ROR1 upregulation upon initial chemotherapy? These questions are just suggestions. In general, some discussion to put the work into the context of previous works using JQ1 in PDAC would be nice.

      I believe that this work would be interesting not only to the pancreatic cancer community but also to a more general public working on cancer and/or stemmness as it touches several interesting points in that regard that can be applicable to other systems.

      My own work is focused on pancreatic cancer, patient heterogeneity and stromal interactions. I am not an expert on histone or ATACseq analyses.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript the authors analyzed the role of ROR1 in pancreatic cancer progression and metastasis. They found that ROR1 expression is specifically increased in an partial EMT cell cluster upon scRNA-Seq of tumor cells derived from an orthotopic mouse PDAC model. Moreover, the ROR1 high population in tumors specifies cells with high proliferation and tumor initiation capacities, increased metastatic propensity and chemoresistance, since knockdown of ROR1 shows reduction of these features in vivo. By comparing transcriptomes from several in vivo models the authors identified that ROR1 acts through AURKB and that its expression is regulated by an upstream enhancer that is bound by YAP/TAZ and BRD4 complexes. With this study the authors identified a new targetable pathway that promotes tumor progression and metastasis in PDAC. The detailed analysis uses many state of the art techniques to address the role of ROR1 and is of great interest to a large audience including basic researchers in the field of cancer biology and oncologists in the clinic. However, some of the findings are a bit preliminary and the drawn conclusions are not sufficiently supported by the experimental data. Moreover, some findings seem a bit out of context and do not really help to bring the story forward. At other instances experimental details are missing to mechanistically demonstrate the role of ROR1. In particular it remains elusive how ROR1 is regulated, i.e. which signaling events are crucial to generate ROR1 high vs. low cells. I listed my specific comments below.

      1. The authors' initial finding is that in the partial EMT cluster ROR1, but also other RTKs (out of 56) are specifically increased. What about the other RTKs? Why was ROR1 chosen to analyze more thoroughly?
      2. The finding of AURKB as crucial target of ROR1 is very weak and needs more in-depth analyses. It is not clear why AURKB was chosen over the other candidates. Is AURKB expression directly regulated by ROR1? Are the two genes directly linked? Can ROR1 deficiency be compensated by AURKB overexpression? Especially the decrease in AURKB protein level in Fig. 4K is not very convincing to account for the different phenotypes in ROR1 high and low cells. Is AURKB and ROR1 expression correlated in TCGA samples (like Fig. 8B)? In Fig. 4L the readout was changed from colony numbers to colony diameter. If AURKB is the crucial player downstream of ROR1, then colony formation efficiency should be affected at first. This needs to be shown. The statement in lines 223,224 that AURKB is a direct downstream target of ROR1 was not shown!
      3. Fig. 4 A-E: The ROR1 KD was induced in vitro but not continued in vitro. The transient KD has a strong impact on tumor forming capacity, even though recovery of expression is likely within the first days in vivo. This is very interesting and underscores the role of ROR1 in tumor initiation and presumably independent of differences in proliferation. Would the results be different, if the DOX treatment would start with injection of the cells and continued in vivo? Is then tumor initiation not affected and maybe only tumor growth?
      4. In Fig. 5 the authors show that ROR1 is highly expressed in tumors after gemcitabine treatment and conclude that the ROR1 high cells are a resistant population. However, this statement is too strong, since gemcitabine treatment could also lead to an upregulation of ROR1 in "low" cells during acquisition of chemoresistence. Together with our knowledge on the role of EMT in driving therapy resistance and therapy-mediated induction of EMT, such a scenario is equally likely. Similarly, the statement in lines 370-372 is not supported by experimental evidence.
      5. In order to understand how ROR1 is regulated, the authors use ATAC-Seq and cut and run and identified a putative upstream enhancer element (Fig. 7). Although this element increases the activity of the promoter fragment in a reporter construct, the experiments do not help to understand how ROR1 activity is increased specifically in the "high" cells. Are peaks of YAP1 and BRD4 also changed between hi/lo cells? Is YAP OE and KD (BRD4 OE and KD) or the use of the inhibotor JQ1 altering the activity of the reporter constructs (i.e. only of the enhancer-promoter combination but not of the promoter only construct)? This would help to strengthen a direct link between ROR1, YAP and BRD4. Is YAP activity different in ROR1 high vs. low cells?
      6. In Fig. 8A the authors identified 202 antigens that match the H3 monomethylation/acetylation pattern. How was YAP etc. chosen?

      Minor:

      1. Fig. 2D,E: What is actually shown here? Is there an overlap between the genes that define ROR1 high vs. low cells in both approaches? The gene list should be provided.
      2. Fig. 3G: I suggest to include the images of the tumors from the ROR1 low cells in the main figure as well

      Significance

      PDAC is a very aggressive desease with very low 5-year survival rates. Understanding of the pathobiology is of keen interest. The findings of the authors are of high significance and extremely relevant as they provide a mechanism that can also be targeted by specific drug combinations, i.e. standard care gemcitabine with specific ROR1 inhibition. The findings are of great interest to a large audience including basic researchers in the field of cancer biology and oncologists in the clinic.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      This is a revision plan, the manuscript has not been modified yet as it is being transferred to a journal.

      *------------------------------------------------------------------------------ Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This study proposes (and uses) an elegant model of bacteria evolution to study how division of labor can emerge through the interaction between non-random mutations (occurring at some specific ``fragile' genomic sites) and genome architecture. The study is very interesting and the results are convincing. My main concerns are about the presentation of the model and results. Although I am confident about the results, some elements should be clarified for a better understanding and for a correct interpretation of the results. Two points in particular (detailed below as major comments) require clarification.

      Major comments:

      • the notion of telomere/centromere is used all throughout the paper but I think it is used in a misleading way. First, it seems that here there is only one telomere (but this is actually a detail of the model). More importantly, as long as I know, it is well known that in S. coelicolor the sequence degenerates more rapidly when getting closer to the telomeres (but telomeres are defined independently from this property). But here, the notion of telomere is precisely directly determined by its mutational instability (respectively, the centromere is defined by its stability). Although this is reasonable given the objective of the model, it forbid the use of sentences like "we observed that the genome of the evolved colony founded had two distinct regions: a telomeric [...] and a centromeric [...]" (line 234) or "When bacteria divide, mutations induced at fragile sites lead to the deletion of the part of the genome distal to them, causing large telometic deletions" (line 239 - this is not a result but a hidden description of the model) as this distinction between the two regions is not an outcome of the simulation but rather given a priori as a coded property of the fragile sites that all lead to deletions on the same -- called telomeric -- side (of course, formally if the genome contains no fragile site, there is no distinction but still). Please clarify this in the main text and in the methods. *

      Authors response (AR, in the following): we agree with the reviewer that the directionality of the deletions determines centromere and telomere in our model (and the reviewer is correct that we only consider one arm of the chromosome). We will explicitly state both in the main text and in the methods that the model does not include any explicit centromeric and telomeric structure, and that the polarity of the genetic information (and thus centromere and telomere) depends on the choice of directionality of the deletions.

      - In most part of the paper (methods, results, figures, sup mat...) antibiotics are considered to have a concentration (or a high/low production) but at least twice in the text (lines 165 and 488) it is said that only the presence/absence of antibiotics is modelled. I was not able to understand how the continuous values are transformed into presence/absence (is there a threshold?) but more importantly, I strongly suspect that this choice has a strong influence on the outcome. For instance, with a diffusion radius equals to 10, it means that an antibiotics producing cell is able to protect 2*\pi*10=~60 replicating cells. Hence, one could conjecture that the fraction of antibiotic-producing mutants should a little more than 2%... which is what is observed by the authors. So (1) please clarify this point (2) discuss (or experiments) the consequences of this choice on the conclusion.

      AR: the reviewer is correct that antibiotics are modelled as presence/absence – this was done for computational efficiency. However, the probability that a bacterium deposits an antibiotic at a site within the deposition radius is a continuous number, as it depends on the number of antibiotic genes and growth genes. We will make this clear in the main text and in the methods.

      Secondly, we show the effect of varying the deposition radius for the evolutionary dynamics in Supplementary Section S17. We will make this clear in the main text. For the area covered by different radius of antibiotic deposition, please see below.

      * Minor comments: - line 262: "We conclude that genome architecture is a key prerequisit for the maintenance of mutation-driven division of labor". Given the model hypotheses you cannot be so affirmative (it is a key prerequisit... in this model!) *

      AR: we will modify the statement as suggested. * *

      - line 286: "cannot" is probably too strong. It has not been observed...

      AR: we will modify the statement as suggested.

      - line 288 and following: you seem to consider that there is "selection for diversity". Given the large number of possible antibiotics and given that cells are "automatically" resistant to the antibiotics they produce, could it be simply drift? There is a clear selection pressure to limit the number of growth-promoting genes but no such pressure exist for antibiotics. Hence their number could simply drift (note that figs 2 and SF1 both use a log scale; random variations due to drift could be hidden by the log. Fig. SF2 does use a log scale and shows a dynamics that---to my eyes---claims for drift rather than for selection of diversity).

      AR: we agree with the reviewer that drift might contribute to the overall antibiotic diversity. This might be especially true for the antibiotic genes residing downstream of the fragile sites, which have low probability of expression in the wild-type (because of the many growth genes) and are deleted in the mutants. Duplications, deletions and modifications of these genes are effectively neutral, and are therefore likley subject to drift. We will include this discussion in the main text. However, bacteria are highly susceptible to the diverse antibiotics produced by other colonies (i.e. those produced – largely – by the mutants). These antibiotics and their diversity drives colony invasion and is thus selective. The overall number and diversity of antibiotics is therefore, at least in part, under selection.

      - line 340: "ends" should be "end" when discussing the model - line 345: "a telomeric region" should be "telomeric regions" when discussing the bacteria - line 359: "S. ambofaciens" should be italic - line 365: same for "Streptomyces"

      AR: we will modify the statement as suggested (and thank the reviewer for carefully reading the text).

      - line 245 states that colonies begin clonally but methods (lines 434-438) don't support this. Colonies don't begin clonally but they begin without antibiotic-producing spores (see also line 618)

      AR: we agree with the reviewer that colonies are not specifically initialised as clonal. We will modify the sentence as: By this process colonies eventually evolve to become functionally differentiated throughout the growth cycle.

      - line 442: "their" should be "its" - line 446: "hotspot for recombination" no, for "deletion" - line 449: please remove brackets around the reference.

      AR: we will modify the statement as suggested.

      - line 458: if I understood it correctly, there is no explicit competition in the model. Competition simply comes from the asynchronous replication. Am I true? Could you clarify that point?

      AR: The reviewer is correct that through asynchronous updating only one focal lattice site is update at a time. However, if a site is empty, the bacteria surrounding it are competing based on their replication rate kreplication. Dividing by the neighbourhood size (eta) simply ensures that a bacterium surrounded by a completely empty neighborhood replicates on average alpha_g times (alpha_g being the max growth rate). We will mention this in the methods.

      - line 490: "the antibiotic deposited is chosen randomly and uniformly among them". This is not fully clear. I suppose the bacteria is still resistant to all the antibiotics it \it{can} produce?

      AR: Yes. This is mentioned in the methods section “Replication”.

      - figure SF1: please use the same scales as in figure 2 such that the two plots can be easily compared

      AR: we will modify the x-axis to include the number of growth cycles.

      - section S3 and figure SF4: What is to be understood from the figure is not clear to me. Seems that WTs win only if generalists produce less AB or replicate slower (?) Is it true?

      AR: The reviewer is correct. In other words: when the artificial generalist has the same replication rate and the same antibiotic production rate as the WT, then the competition experiment ends with a near draw (the generalist still wins, but slowly). This means that the fitness cost associated to division of labor, i.e. to having two cell types doing the same work as one generalist – is small.

      We will include this description in the section.

      The figure is unfortunately complicated by the fact that we do not know a-priori how high the effective antibiotic production rate is (because antibiotics are spatially distributed by the stochastically generated mutants) – and so we had to make a large parameter screen to figure out the parameter values for which the competition experiment made most sense.

      - I found it very difficult to draw conclusion from section S4, S5 and S6. These experiments should be analyzed with the help of mathematical analyses of the equations. Moreover, the understanding of these results are rendered difficult due to the lack of clarity regarding the discrete (or not) nature of the antibiotic production/action/diffusion

      AR: We hope that we have clarified the distinction between antibiotic production rate and antibiotic presence/absence in the lattice.

      The model is not amenable to analytical tractability, which makes it difficult to make exact statements based on the equations that govern it. However, we can check that the model is robust, and identify regions of parameter space where the model behaves in a qualitatively similar way to main text results.

      Sections S4, S5 and S6 are essentially parameter screens to verify that the model reproduces the results reported in the main text for a broad range of parameters. The primary conclusion that can be drawn is that the model is robust to parameter changes.

      Section S4 explores the model robustness to changes in two key parameters of the model: the antibiotic inhibition due to growth genes beta_g and the parameter h_g, which is the number of growth genes that produces half-maximum growth rate. Section S5 further analyses the relation between these parameters, and how they together determine the strength of the trade-off. Section S6, finally, shows that a strong trade-off is not a necessary requirement for evolution of division of labor as the division also depends (in a counterintuitive way) on the parameter alpha_g, the maximum antibiotic production rate.

      We will include and expand these summarizing statements in each section, to make clear what each section achieves.

      - S7 and fig SF9. It is unclear to me why the fraction of mutants decrease along time elapsed in the cycle. Please explain.

      AR: The reason is that not all mutants are born with the same number of antibiotic genes (Fig. 3A). A mutant with fewer antibiotic genes might be susceptible to some of the antibiotics produced by another mutant, and could be killed by these antibiotics. Once a mutant is killed in the inner colony, a wt will replicate to fill the spot, and likely a wt offspring will take that site rather than another mutant. Thus there is a decline in overall mutant population.

      We will include this discussion in Section S7.

      - Figure SF14: what are the tin lines? if they correspond to the five repeats, how can it be that the bold line be the median?

      AR: we realise that the caption should be clearer. Each of the five lines (both bold and thin) in each pane represents the median number of genetic elements over time. The bold line just highlights one randomly chosen simulation (the same for each genetic element), to better guide the eye.

      We will clarify the caption of the figure.

      - S13 and figure SF15: given that AB concentration is ON/OFF, is this result really surprising? This also questions about the accumulation of AB genes in the original model. Although the authors regularly claim that this is due to selection for diversity, drift could also be at play (see above)

      AR: As mentioned above, we agree with the reviewer and we will mention that drift may co-determine antibiotic gene accumulation.

      - S17: for radius 1, 2 and 3, the aliasing is likely to be strong. Hence, the results cannot be interpreted with this sole information. Please give e.g. how many cells are "protected" for each radius (e.g. for r_{alpha}=1, this value can vary between 1 and 9!)

      AR: for radius=1, 2, 3, 5 ,8, 10 the area covered by antibiotic production is respectively 5 ,13, 29, 81, 197, 317. We will include this information in the figure.

      - L742: "matching the antibiotic bitstring with the bitstring of the antibiotic". True and actually elegant but simpler formulation could ease the reading...

      AR: We will change the sentence as follows: “Both antibiotics and antibiotic genes are characterised by a bitstring, which determines their type. Antibiotic resistance in the model is determined by matching these two strings.”

      - lines 746-751 and figure SF21: There again, could it be a consequence of the AB ON/OFF diffusion model?

      AR: we agree with the reviewer that a continuous diffusion model could affect resistance to antibiotics. We expect that the main effect will come from some antibiotics antibiotics having different concentrations. For instance, we could have a situation in which many deleterious antibiotics are produced in small amount, but have a compounding effect on the susceptible bacterium. This finer model of antibiotic production, diffusion and killing was not included in the model to limit the computational load.

      - S18-S19-S20: what should the reader understand from these results? Please better comment the figures.

      AR: we agree that figures in Section S18,19 and 20 could have more descriptive captions. Sections S18, 19 and 20 are parameter screen to check that the model is robust to changes in the mutation rates affecting fragile sites activation and de-novo formation. The primary result of Section S18 is that that division of labor evolves over a broad range of fragile site activation rates and de-novo fragile site formation rates (and even when these parameters are decreased by one order of magnitude).

      Section S19 shows how these combination of parameters result in quantitative changes in genome composition.

      Section S20 shows that the de-novo fragile site formation rate can be zero: as long as the system is initialised genomes that can divide labor, the fragile sites will persist even though no new ones are generated.* *

      • CROSS-CONSULTATION COMMENTS Sorry about the confusion about the computation of the number of cells protected by a single AB-producing cell. Of course it is of the order 10*\pi^2 !!! The global argument still holds but the number of cells protected is of course larger than 60 (note that, due to aliasing at the periphery the exact number of cells in the protected area is difficult to determine). *

      Author response: We hope the clarifications mentioned above answer the reviewer’s comment.

      * Reviewer #1 (Significance (Required)):

      First, an very importantly, I must say that I am no familiar with the biological model (Streptomyces coelicolor). So I am not fully able to judge the biological significance of this research (i.e. whether the way division of labor is achieved here enlights---or not---the biology of this bacteria). However, on the computational side, the model and the results (as they are summarized in the conclusion) are very interesting on their own and deserve publication.

      Remark: a lots of supplementary results are added to the paper that are not not fully explained or analysed. Please, better discuss all these results and their significance. *

      AR: we will extensively check and add detail to the supplementary material, ensuring that results are fully explained (see also response to reviewer 1).*

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript "Evolution of genome fragility enables microbial division of labor" presents a model of genetically-based division of labour in bacterial colonies. It is postulated that two essential processes, growth and the important for elimination of competitors production of antibiotics, are poorly compatible in a single cell. The beneficial for a colony cell specialization is assumed to be determined only by genetic differences that appear via deletions of growth- promoting loci. These deletions and production of various antibiotics are mediated by a rather elaborate genetic architecture, which includes position-sensitive "fragile" sites, mutable antibiotic and growth-promoting genes. The model produces rather predictable results that under sufficiently strong incompatibility between growth and antibiotic production, the long-term evolution results in formation of mosaic of colonies, each specialized in production of its specific set of antibiotics. Such production is facilitated by evolving rapidly mutable genomes that constantly generate non-reproducing antibiotic-pumping cells.

      The model appears very thoroughly developed and analyzed, and all major conclusion are intuitively appealing. Overall, the manuscript reads as a well-written quantitative proof of the principle of genetically-based division of labour between bacterial cells. The only part of the model that I'm a bit sceptical about is the unwarranted complexity of the genetic architecture. Unless the introduction of "fragile" sites and the directional ordering of genes is strongly justified by empirical data, a simpler and more clear assumption about mutational incapacitation of growth genes would suffice to reproduce the predicted phenomenology. So adding such empirical evidence would boost the relevance of the genetical part of the model. In the present form, all observed adaptations are inevitable simply because the expected division of labour will not evolve without each of them due to the design of the model. *

      AR: We agree with the reviewer that a simpler model with a predetermined effect of mutations, such as to incapacitate the growth genes, would suffice to reproduce the phenomenology of the mutation-driven division of labor observed in Streptomyces. Adding the complexity of a genome architecture introduces one more hypothesis: that genome fragility can evolve to organize the division of labor. This hypothesis, supported by the results presented here, can be tested experimentally.

      However, there is already some empirical support for our modelling choices: 1) mutation rates along the genome of Streptomyces are highly heterogeneous, 2) the genetic content is partitioned along the chromosome so that some genes are preferentially located in the mutationally quiet centromere, and others are in the mutationally active (sub)telomeric regions, 3) some cis genetic elements in Steptomyces’ genomes readily recombine to produce large-scale duplications and deletions (which we heavily simplified in the model as deletion-inducing fragile sites).

      We will extend the introduction to include the references for the empirical support to our model.

      * A couple of minor comments...

      217 This is achieved when fewer growth-promoting genes are required to inhibit antibiotic 218 production (i.e. lower βg). Shouldn't it be "larger \beta_g"? *

      AR: yes. Thanks for catching this!

      * Whether in the main text or Supplementary materials, it woud help to add a complete population dynamics equation with all gain and loss terms. *

      AR: we agree with the reviewer that it would be interesting to obtain a comprehensive population dynamics equation that captures the spatial dynamics of replication, mutation, and antibiotic production, causing colony formation and between-colony competition. However, deriving such equation would be a very big effort in itself, and we suspect that it would not be analytically tractable. Because of this, we prefer the “procedural” model description we gave – which also mirrors the model implementation (see github repository at github.com/escolizzi/strepto2).

      * Strikingly, we find the opposite: division of labor evolves when 224 bacteria produce fewer overall antibiotics (lower αa), under shallow trade-off conditions 225 (hgβg = 5; see Suppl. Section S6).

      I don't see why it is"striking". It seems perfectly explicable that a smaller \alpha requires more dedication to antibiotic production, thus favouring specialization. *

      * *AR: we agree that we have not conveyed why we found this result surprising. We have set the trade-off shallow enough (h_g beta_g =5) that the generalist wins when alpha_g =1. In addition, lowering alpha_a makes the benefit of creating a mutant smaller, because a highly specialised mutant with zero growth genes makes fewer antibiotics. A generalist is proportionally less affected. Intuitively, we have compunded two benefits for the generalist.

      But division of labor evolves, outcompeting the generalist – which surprised us.

      We will modify the paragraph to better explain what we expected, and we will tone down the wording, removing the word “strikingly”.

      *Reviewer #2 (Significance (Required)):

      Due to my relative lack of familiarity with the literature on evolution of genetically-based division of labour, I would rather not comment on the degree of innovation of the manuscript.

      The text is well written and is accessible to a wide readership, so it could be recommended to a general biological or evolutionary journal.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: In this manuscript the authors explore the co-evolution of genomic architecture and division of labour in antibiotic production, in a model inspired by the bacterium Streptomyces coelicolor. In the model a genetic trade-off is implemented where the having a large number of growths promoting genes (and thus fast growth) leads to a low production of antibiotics. On the other hand, having fewer growth promoting genes allows for a higher production of antibiotics. This trade-off selects for a division of labour, where one sub population specializes in antibiotic production and another sub population specializes in reproduction. This division of labour is achieved by evolving the genome structure, so that growth promoting genes are clustered together, separated from the rest of the genome by several fragile sites (sites that allow for large deletions). This allows a single mutational event to delete a large number of growth-promoting genes, which creates a cell, lacking growth genes and that thus has a high antibiotic production (cell specializing in antibiotic production). In other words, the genome structure evolves to shape evolvability, so as to allow cells with a high growth rate to rapidly and repeatably evolve/mutate into cells with a high antibiotic production. This creates a division of labour where a part of the population specializes in growth/reproduction and another part specializes in antibiotics production. This model provides a tangible mechanism to explain a similar division of labour observed in S. coelicolor. This mechanism also fits well with the large deletions observed in antibiotic-hyperproducing S. coelicolor cells, which are also repeatably generated during colony growth.

      Major comments: -Line 69, It would be good to give a bit more information here on the (number of) different types of antibiotics produced by S. coelicolor, to help the reader understand some of the modelling choices later on, such as allowing for the evolution of a large number (16 or higher if I understand correctly) of different antibiotics and a cell automatically being resistant to all antibiotics it produces (instead of having separate resistance genes). *

      AR: we agree with the reviewer that adding this information would put the model more in focus. The total number of antibiotics that can be produced by the genus Streptomyces has been estimated to be of the order of 100000 (ten to the fifth, [Watve et al., 2001]). Although we use S. Coelicolor as reference model organism for our computational model, we simulate long-term evolutionary dynamics that diversify the antibiotic repertoire. Each antibiotic is represented by a 16 bits string, meaning that there are 2^16 (= 65536) possible antibiotics in the system – consistent with the number of possible antibiotics in the genus.

      This being said, our model genomes evolve to have many more antibiotic genes than typical Streptomyces. Each species in the genus has up to 30 biosynthetic gene clusters [Genilloud, O. (2014)], a fraction of which make antibiotics. We discuss this discrepancy and propose solutions for this in the Discussion (also see below).

      Regarding the possibility of separating antibiotic resistance from antibiotic synthesis: we (and most literature on the eco-evolutionary dynamics of antibiotic-producing bacteria) simplified antibiotic production as depending on individual “antibiotic biosynthetic genes”. In reality several genes in a cluster must be expressed to synthesize an antibiotic. A typical biosynthetic gene cluster also encodes resistance genes for the cognate antibiotic, to prevent cell suicide [Mak et al., 2014] – hence antibiotic genes providing resistance in the model. This being said, Streptomyces genomes also host resistance genes to antibiotics for which they have no biosynthetic pathway themselves, including efflux pumps that give some nonspecific resistance [Nag et al 2021].

      Modelling antibiotic synthesis in more detail would allow to make a better model of antibiotic evolution, as well as to enrich the social dynamics of the model – because “cheaters” could evolve that are resistant but do not contribute to the antibiotics in the colony. These questions are certainly interesing, but would further complexify the model. They are exciting venues for future model expansions.

      We will include the literature mentioned above in the introduction, and use these references to better motivate the model.

      * -Lines 127-129 It is mentioned here fragile sites in the genome might represent transposable elements or long inverted repeats. Would both of these types of fragile sites behave the same? Has it been shown that both transposable elements and long inverted repeats can lead to large deletions from a linear chromosome? It would be nice to have a bit more background on how fragile sites might work or what they might look like in an empirical context. I am a bit unsure on this, but depending on their exact empirical nature, should fragile sites not also lead to increased rates of gene duplication near themselves? *

      AR: we see that we have not made a clear connection between the introduction, where we introduce the mutational dynamics of Streptomyces, and the methods, where we introduce fragile sites.

      Briefly, both duplications and deletions occur in Streptomyces, as well as circularization of the linear chromosome, conjugation, etc. [Hoff et al 2018,Tidjani et al 2019]. However, the outcome of all these mutations is biased towards deletion [hoff et al 2018, Zhang et al, 2020, Zhang et al, 2022]. There are many mechanisms involved in producing these mutations, forming the mutational hotspots, handling DNA breaks, and in the horizontal transfer of genetic material [Tidjani et al 2019; Lorentzi et al, 2021]. As the reviewer suggests – they do not behave all in the same way. To construct the model, we simplified all these mutational mechanisms into one genetic element, the “fragile site”, and assumed that they are solely responsible for the chromosomal-scale mutations that produce deletions.

      We will add this information to the introduction (see also response to reviewer 2), and refer to it in the methods.

      * -Line 160 As alluded to before, given the introduction provided, two assumptions come about here (lines 160-166) that lack a bit of justification/background/context. First, why does one allow the evolution of such a relatively large number of antibiotics? A bit more empirical in the introduction background would go a long way to making this assumption seem more justified. As far as I can see the genomic architecture leading to division of labour is only demonstrated for values of v that are 6 (i.e. 64 antibiotics) or above. Perhaps it is because I lack empirical background here, but this still seems to be a relatively large antibiotic space. Does the model also work with v=2? Perhaps it would be good to show a simulation with v=2 in supplementary material S16 as well. *

      AR: Hopefully the previous comment on the number of possible antibiotics also clarifies this point.

      We will carry out a simulation with v=2.

      * -Line 166 The assumption is made that if a bacterium produces a certain antibiotic, it is automatically resistant to this antibiotic. Now it could be that this assumption is empirically rooted, in which case it would be good to allude to this empirical justification. I wonder how would the results be impacted if the resistance genes were separated from the antibiotic production genes? (I do not think additional simulations are in any way necessary on this point, but some more context/thoughts on this matter would be helpful, perhaps near lines 306-309) *

      AR: Please see response to major comment on the possibility of separating antibiotic resistance from antibiotic synthesis. We will add the discussion there in the Discussion session.

      * -Figure 1 In the subscript it becomes evident that the probability of large deletions due to fragile sites is much higher (10 fold) than single gene duplications, it seems to me this should be the other way around, single gene duplications and deletions could be much more probable than fragile site induced large deletions. Would the model still produce the same results if the values for mu-d and mu-f were switched around? (Again, I do not think additional simulations are per se required, some justification for this assumption would already be plenty). *

      AR: We chose these parameter values because, empirically, large scale chromosomal rearrangements (deletions) occur more frequently than single gene duplication/deletion in Streptomyces – as they are the primary mechanism for Streptomyces development and division of labor. We now mention this in the caption of Fig. 1.

      Still, would we expect results to be affected if mu_d > mu_f? We do not think so, for the following reason: mu_d and mu_f are per-gene probabilities, so the genomic probability of duplication/deletion and of fragile site activation will depend on the evolved number of genes.

      in Fig. 5 we show that mu_f can be decreased by more than one order of magnitude and results do not change qualitatively. To compensate for a smaller per-gene deletion rate (mu_f), the evolved number of fragile sites per genome becomes larger (Suppl. Section S19, Fig. SF23). A similar compensatory increase of fragile sites could happen if duplications and deletions rate per gene were larger.

      * Minor comments: -Line 36, perhaps replace "must" with " can" as there are other ways to achieve a division of labour that do not hinge on genomic architecture such as those listed in the next sentence. This sentence seems at odds with the next one, which lists ways to achieve cell differentiation that do not per se completely rely on genomic architecture such as gene regulation. Maybe consider moving this sentence to be on line 40 (after "...organized at the genome level remains unclear") *

      AR: we will modify the text as suggested by the reviewer

      * -Line 48, perhaps remove "disposable" as there is no particular reason the somatic tissue is disposable, furthermore it invokes the disposable soma theory of aging which is not relevant here *

      AR: we will remove “disposable”.

      * -Line 147-148 Why these particular relationships, as a reader I do not understand how these functions were constructed and how they might influence the results, a bit more justification might be helpful. Perhaps later on (results/discussion) also address what might happen if you were to use different functions? *

      AR: we agree that these functions could use a little more explanation. The probability of replication is a function that increases with the number of growth genes. We assume that the function saturates, as growth cannot be arbitrarily large even if the genome hosts many growth genes. So we need at least two parameters: one for the maximum growth rate (alpha_g), and another that controls the curvature of the function (h_g). A simple choice is a Hill function, but other saturating functions would likely work just as well (e.g. an exponential function with a form alpha_g*(1-exp(-g/h_g)). Similarly, antibiotic synthesis inhibition from growth genes should tend to zero for larger numbers of growth genes, hence the exponential (but we expect that a hyperbolic form e.g 1/(1+g/beta_g) would work just the same).

      As this discussion is rather technical, we will include it in the methods section.

      * -I am clearly biased on this matter, since I work on evolvability. So, the authors should feel free to ignore this comment. Regardless, I think the authors have shown a wonderful example of the evolution of evolvability. Perhaps it would be nice to add a little bit of an evolvability angle in the discussion. In particular thinking about how fragile sites shape evolvability. *

      AR: we agree with the reviewer that the work is a clear form of evolution of evolvability. We now explicitly mention this in the discussion.

      * -Lines 404-411 It is great to see that the authors consider the wider applicability of their findings. It would be nice to add something here about the broader applicability in bacteria. As a large number of bacteria have circular chromosomes, how would these findings be impacted if circular chromosomes were at play? (I suspect they would largely still work in the same way, but keen to hear what the authors think). Referring to the work of Yona et al. 2012 on transient chromosomal duplications in yeast due to heat stress might also be good here, to show the more general applicability of the authors findings, this is another example where genomic architecture shapes evolvability. Yona AH, Manor YS, Herbst RH, Romano GH, Mitchell A, Kupiec M, Pilpel Y, Dahan O. Chromosomal duplication is a transient evolutionary solution to stress. Proc Natl Acad Sci U S A. 2012 Dec 18;109(51):21010-5. doi: 10.1073/pnas.1211150109. Epub 2012 Nov 29. PMID: 23197825; PMCID: PMC3529009. *

      AR: Bacteria show many forms of targeted mutational dynamics (we do already mention CRISPR and HGT). It recently came to our attention that many bacterial and archea genomes host so-called Diversity-Generating Retroelements (DGR) [Macadangdang et al, 2022]. DGRs accelerate microbial evolution at specific sites and generate functional diversity. We will include this reference in the discussion.

      We thank the reviewer for pointing us to the work on chromosomal duplication in yeast – we will also incorporate this “dramatic” form of duplication in the discussion.

      * -Lines 412 -419 I agree with the authors that in practice the cells specializing in antibiotic production look somewhat like soma, however I would consider not using this term here as strictly speaking the antibiotics producing cells can still reproduce (be it at an extremely low rate, which leads to their loss). *

      AR: We tone down both mentions of soma, as follows: “This gives rise to a division of labor driven by mutation, reminiscent of the division between germ and soma in multicellular eukaryotes.”

      And, in the last sentence, we write: “...mutant cells *effectively* function as soma by enhancing...”

      - Lines 434-438 If I understand correctly authors did not explicitly model the sporulation process (instead selecting random cells from the end of a cycle). I think this is a very good modelling choice that should not be changed; however, I do wonder how the results would be affected if sporulation was more explicitly modelled (for example by adding genes for sporulation, creating a 3 way trade-off between growth, sporulation and antibiotic production). Perhaps something that could be mentioned in the discussion.

      AR: we agree with the reviewer that more complex evolutionary problem could be implemented in the system, e.g. through a gene type required for sporulation. They would likely have interesting outcomes. For instance, some bacteria may decide never to sporulate, while others could enhance their antibiotic resistance by turning into spores. Moreover, including additional functions together with an evolvable gene regulation could better capture the developmental dynamics observed through the life cycle of Streptomyces.

      * I hope this review is of some use and helps the improvement of this manuscript. *

      * Yours sincerely,

      Timo van Eldijk

      Reviewer #3 (Significance (Required)):

      Significance: This study provides a clear conceptual advance by showing and studying how genome structure can evolve to create a division of labor. Thereby mechanistically explaining the division of labor in antibiotic production observed in S. coelicolor. It seems evident to me that whilst this study mainly focuses on S. coelicolor, the mechanism likely plays an important role in microbial evolution in general. Though others have previously theoretically explored such mechanisms, this study provides the first exploration modelled closely after an empirical system and hence provides a significant advance. In a more general sense, the evolution of genome architecture likely governs evolvability not just in microbes but in all life on earth. Therefore, I believe that this paper would be interesting for a general audience interested evolution. It would be of particular interest to those studying microbial evolution. My expertise lies in evolutionary biology, theoretical biology, microbial evolution and palaeontology. *

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript the authors explore the co-evolution of genomic architecture and division of labour in antibiotic production, in a model inspired by the bacterium Streptomyces coelicolor. In the model a genetic trade-off is implemented where the having a large number of growths promoting genes (and thus fast growth) leads to a low production of antibiotics. On the other hand, having fewer growth promoting genes allows for a higher production of antibiotics. This trade-off selects for a division of labour, where one sub population specializes in antibiotic production and another sub population specializes in reproduction. This division of labour is achieved by evolving the genome structure, so that growth promoting genes are clustered together, separated from the rest of the genome by several fragile sites (sites that allow for large deletions). This allows a single mutational event to delete a large number of growth-promoting genes, which creates a cell, lacking growth genes and that thus has a high antibiotic production (cell specializing in antibiotic production). In other words, the genome structure evolves to shape evolvability, so as to allow cells with a high growth rate to rapidly and repeatably evolve/mutate into cells with a high antibiotic production. This creates a division of labour where a part of the population specializes in growth/reproduction and another part specializes in antibiotics production. This model provides a tangible mechanism to explain a similar division of labour observed in S. coelicolor. This mechanism also fits well with the large deletions observed in antibiotic-hyperproducing S. coelicolor cells, which are also repeatably generated during colony growth.

      Major comments:

      • Line 69, It would be good to give a bit more information here on the (number of) different types of antibiotics produced by S. coelicolor, to help the reader understand some of the modelling choices later on, such as allowing for the evolution of a large number (16 or higher if I understand correctly) of different antibiotics and a cell automatically being resistant to all antibiotics it produces (instead of having separate resistance genes).
      • Lines 127-129 It is mentioned here fragile sites in the genome might represent transposable elements or long inverted repeats. Would both of these types of fragile sites behave the same? Has it been shown that both transposable elements and long inverted repeats can lead to large deletions from a linear chromosome? It would be nice to have a bit more background on how fragile sites might work or what they might look like in an empirical context. I am a bit unsure on this, but depending on their exact empirical nature, should fragile sites not also lead to increased rates of gene duplication near themselves?
      • Line 160 As alluded to before, given the introduction provided, two assumptions come about here (lines 160-166) that lack a bit of justification/background/context. First, why does one allow the evolution of such a relatively large number of antibiotics? A bit more empirical in the introduction background would go a long way to making this assumption seem more justified. As far as I can see the genomic architecture leading to division of labour is only demonstrated for values of v that are 6 (i.e. 64 antibiotics) or above. Perhaps it is because I lack empirical background here, but this still seems to be a relatively large antibiotic space. Does the model also work with v=2? Perhaps it would be good to show a simulation with v=2 in supplementary material S16 as well.
      • Line 166 The assumption is made that if a bacterium produces a certain antibiotic, it is automatically resistant to this antibiotic. Now it could be that this assumption is empirically rooted, in which case it would be good to allude to this empirical justification. I wonder how would the results be impacted if the resistance genes were separated from the antibiotic production genes? (I do not think additional simulations are in any way necessary on this point, but some more context/thoughts on this matter would be helpful, perhaps near lines 306-309)
      • Figure 1 In the subscript it becomes evident that the probability of large deletions due to fragile sites is much higher (10 fold) than single gene duplications, it seems to me this should be the other way around, single gene duplications and deletions could be much more probable than fragile site induced large deletions. Would the model still produce the same results if the values for mu-d and mu-f were switched around? (Again, I do not think additional simulations are per se required, some justification for this assumption would already be plenty).

      Minor comments:

      • Line 36, perhaps replace "must" with " can" as there are other ways to achieve a division of labour that do not hinge on genomic architecture such as those listed in the next sentence. This sentence seems at odds with the next one, which lists ways to achieve cell differentiation that do not per se completely rely on genomic architecture such as gene regulation. Maybe consider moving this sentence to be on line 40 (after "...organized at the genome level remains unclear")
      • Line 48, perhaps remove "disposable" as there is no particular reason the somatic tissue is disposable, furthermore it invokes the disposable soma theory of aging which is not relevant here
      • Line 147-148 Why these particular relationships, as a reader I do not understand how these functions were constructed and how they might influence the results, a bit more justification might be helpful. Perhaps later on (results/discussion) also address what might happen if you were to use different functions?
      • I am clearly biased on this matter, since I work on evolvability. So, the authors should feel free to ignore this comment. Regardless, I think the authors have shown a wonderful example of the evolution of evolvability. Perhaps it would be nice to add a little bit of an evolvability angle in the discussion. In particularl thinking about how fragile sites shape evolvability.
      • Lines 404-411 It is great to see that the authors consider the wider applicability of their findings. It would be nice to add something here about the broader applicability in bacteria. As a large number of bacteria have circular chromosomes, how would these findings be impacted if circular chromosomes were at play? (I suspect they would largely still work in the same way, but keen to hear what the authors think). Referring to the work of Yona et al. 2012 on transient chromosomal duplications in yeast due to heat stress might also be good here, to show the more general applicability of the authors findings, this is another example where genomic architecture shapes evolvability. Yona AH, Manor YS, Herbst RH, Romano GH, Mitchell A, Kupiec M, Pilpel Y, Dahan O. Chromosomal duplication is a transient evolutionary solution to stress. Proc Natl Acad Sci U S A. 2012 Dec 18;109(51):21010-5. doi: 10.1073/pnas.1211150109. Epub 2012 Nov 29. PMID: 23197825; PMCID: PMC3529009.
      • Lines 412 -419 I agree with the authors that in practice the cells specializing in antibiotic production look somewhat like soma, however I would consider not using this term here as strictly speaking the antibiotics producing cells can still reproduce (be it at an extremely low rate, which leads to their loss).
      • Lines 434-438 If I understand correctly authors did not explicitly model the sporulation process (instead selecting random cells from the end of a cycle). I think this is a very good modelling choice that should not be changed; however, I do wonder how the results would be affected if sporulation was more explicitly modelled (for example by adding genes for sporulation, creating a 3 way trade-off between growth, sporulation and antibiotic production). Perhaps something that could be mentioned in the discussion.

      I hope this review is of some use and helps the improvement of this manuscript.

      Yours sincerely,

      Timo van Eldijk

      Significance

      This study provides a clear conceptual advance by showing and studying how genome structure can evolve to create a division of labor. Thereby mechanistically explaining the division of labor in antibiotic production observed in S. coelicolor. It seems evident to me that whilst this study mainly focuses on S. coelicolor, the mechanism likely plays an important role in microbial evolution in general. Though others have previously theoretically explored such mechanisms, this study provides the first exploration modelled closely after an empirical system and hence provides a significant advance. In a more general sense, the evolution of genome architecture likely governs evolvability not just in microbes but in all life on earth. Therefore, I believe that this paper would be interesting for a general audience interested evolution. It would be of particular interest to those studying microbial evolution. My expertise lies in evolutionary biology, theoretical biology, microbial evolution and palaeontology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript "Evolution of genome fragility enables microbial division of labor" presents a model of genetically-based division of labour in bacterial colonies. It is postulated that two essential processes, growth and the important for elimination of competitors production of antibiotics, are poorly compatible in a single cell. The beneficial for a colony cell specialization is assumed to be determined only by genetic differences that appear via deletions of growth- promoting loci. These deletions and production of various antibiotics are mediated by a rather elaborate genetic architecture, which includes position-sensitive "fragile" sites, mutable antibiotic and growth-promoting genes. The model produces rather predictable results that under sufficiently strong incompatibility between growth and antibiotic production, the long-term evolution results in formation of mosaic of colonies, each specialized in production of its specific set of antibiotics. Such production is facilitated by evolving rapidly mutable genomes that constantly generate non-reproducing antibiotic-pumping cells.

      The model appears very thoroughly developed and analyzed, and all major conclusion are intuitively appealing. Overall, the manuscript reads as a well-written quantitative proof of the principle of genetically-based division of labour between bacterial cells. The only part of the model that I'm a bit sceptical about is the unwarranted complexity of the genetic architecture. Unless the introduction of "fragile" sites and the directional ordering of genes is strongly justified by empirical data, a simpler and more clear assumption about mutational incapacitation of growth genes would suffice to reproduce the predicted phenomenology. So adding such empirical evidence would boost the relevance of the genetical part of the model. In the present form, all observed adaptations are inevitable simply because the expected division of labour will not evolve without each of them due to the design of the model.

      A couple of minor comments...

      217 This is achieved when fewer growth-promoting genes are required to inhibit antibiotic 218 production (i.e. lower βg). Shouldn't it be "larger \beta_g"?

      Whether in the main text or Supplementary materials, it woud help to add a complete population dynamics equation with all gain and loss terms.

      Strikingly, we find the opposite: division of labor evolves when 224 bacteria produce fewer overall antibiotics (lower αa), under shallow trade-off conditions 225 (hgβg = 5; see Suppl. Section S6).

      I don't see why it is"striking". It seems perfectly explicable that a smaller \alpha requires more dedication to antibiotic production, thus favouring specialization.

      Significance

      Due to my relative lack of familiarity with the literature on evolution of genetically-based division of labour, I would rather not comment on the degree of innovation of the manuscript.

      The text is well written and is accessible to a wide readership, so it could be recommended to a general biological or evolutionary journal.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study proposes (and uses) an elegant model of bacteria evolution to study how division of labor can emerge through the interaction between non-random mutations (occurring at some specific ``fragile' genomic sites) and genome architecture. The study is very interesting and the results are convincing. My main concerns are about the presentation of the model and results. Although I am confident about the results, some elements should be clarified for a better understanding and for a correct interpretation of the results. Two points in particular (detailed below as major comments) require clarification.

      Major comments:

      • the notion of telomere/centromere is used all throughout the paper but I think it is used in a misleading way. First, it seems that here there is only one telomere (but this is actually a detail of the model). More importantly, as long as I know, it is well known that in S. coelicolor the sequence degenerates more rapidly when getting closer to the telomeres (but telomeres are defined independently from this property). But here, the notion of telomere is precisely directly determined by its mutational instability (respectively, the centromere is defined by its stability). Although this is reasonable given the objective of the model, it forbid the use of sentences like "we observed that the genome of the evolved colony founded had two distinct regions: a telomeric [...] and a centromeric [...]" (line 234) or "When bacteria divide, mutations induced at fragile sites lead to the deletion of the part of the genome distal to them, causing large telometic deletions" (line 239 - this is not a result but a hidden description of the model) as this distinction between the two regions is not an outcome of the simulation but rather given a priori as a coded property of the fragile sites that all lead to deletions on the same -- called telomeric -- side (of course, formally if the genome contains no fragile site, there is no distinction but still). Please clarify this in the main text and in the methods.
      • In most part of the paper (methods, results, figures, sup mat...) antibiotics are considered to have a concentration (or a high/low production) but at least twice in the text (lines 165 and 488) it is said that only the presence/absence of antibiotics is modelled. I was not able to understand how the continuous values are transformed into presence/absence (is there a threshold?) but more importantly, I strongly suspect that this choice has a strong influence on the outcome. For instance, with a diffusion radius equals to 10, it means that an antibiotics producing cell is able to protect 2\pi10=~60 replicating cells. Hence, one could conjecture that the fraction of antibiotic-producing mutants should a little more than 2%... which is what is observed by the authors. So (1) please clarify this point (2) discuss (or experiments) the consequences of this choice on the conclusion.

      Minor comments:

      • line 262: "We conclude that genome architecture is a key prerequisit for the maintenance of mutation-driven division of labor". Given the model hypotheses you cannot be so affirmative (it is a key prerequisit... in this model!)
      • line 286: "cannot" is probably too strong. It has not been observed...
      • line 288 and following: you seem to consider that there is "selection for diversity". Given the large number of possible antibiotics and given that cells are "automatically" resistant to the antibiotics they produce, could it be simply drift? There is a clear selection pressure to limit the number of growth-promoting genes but no such pressure exist for antibiotics. Hence their number could simply drift (note that figs 2 and SF1 both use a log scale; random variations due to drift could be hidden by the log. Fig. SF2 does use a log scale and shows a dynamics that---to my eyes---claims for drift rather than for selection of diversity).
      • line 340: "ends" should be "end" when discussing the model
      • line 345: "a telomeric region" should be "telomeric regions" when discussing the bacteria
      • line 359: "S. ambofaciens" should be italic
      • line 365: same for "Streptomyces"
      • line 245 states that colonies begin clonally but methods (lines 434-438) don't support this. Colonies don't begin clonally but they begin without antibiotic-producing spores (see also line 618)
      • line 442: "their" should be "its"
      • line 446: "hotspot for recombination" no, for "deletion"
      • line 449: please remove brackets around the reference.
      • line 458: if I understood it correctly, there is no explicit competition in the model. Competition simply comes from the asynchronous replication. Am I true? Could you clarify that point?
      • line 490: "the antibiotic deposited is chosen randomly and uniformly among them". This is not fully clear. I suppose the bacteria is still resistant to all the antibiotics it \it{can} produce?
      • figure SF1: please use the same scales as in figure 2 such that the two plots can be easily compared
      • section S3 and figure SF4: What is to be understood from the figure is not clear to me. Seems that WTs win only if generalists produce less AB or replicate slower (?) Is it true?
      • I found it very difficult to draw conclusion from section S4, S5 and S6. These experiments should be analyzed with the help of mathematical analyses of the equations. Moreover, the understanding of these results are rendered difficult due to the lack of clarity regarding the discrete (or not) nature of the antibiotic production/action/diffusion
      • S7 and fig SF9. It is unclear to me why the fraction of mutants decrease along time elapsed in the cycle. Please explain.
      • Figure SF14: what are the tin lines? if they correspond to the five repeats, how can it be that the bold line be the median?
      • S13 and figure SF15: given that AB concentration is ON/OFF, is this result really surprising? This also questions about the accumulation of AB genes in the original model. Although the authors regularly claim that this is due to selection for diversity, drift could also be at play (see above)
      • S17: for radius 1, 2 and 3, the aliasing is likely to be strong. Hence, the results cannot be interpreted with this sole information. Please give e.g. how many cells are "protected" for each radius (e.g. for r_{alpha}=1, this value can vary between 1 and 9!)
      • L742: "matching the antibiotic bitstring with the bitstring of the antibiotic". True and actually elegant but simpler formulation could ease the reading...
      • lines 746-751 and figure SF21: There again, could it be a consequence of the AB ON/OFF diffusion model?
      • S18-S19-S20: what should the reader understand from these results? Please better comment the figures.

      Referees cross-commenting

      Sorry about the confusion about the computation of the number of cells protected by a single AB-producing cell. Of course it is of the order 10*\pi^2 !!! The global argument still holds but the number of cells protected is of course larger than 60 (note that, due to aliasing at the periphery the exact number of cells in the protected area is difficult to determine).

      Significance

      First, an very importantly, I must say that I am no familiar with the biological model (Streptomyces coelicolor). So I am not fully able to judge the biological significance of this research (i.e. whether the way division of labor is achieved here enlights---or not---the biology of this bacteria). However, on the computational side, the model and the results (as they are summarized in the conclusion) are very interesting on their own and deserve publication.

      Remark: a lots of supplementary results are added to the paper that are not not fully explained or analysed. Please, better discuss all these results and their significance.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their thoughtful comments and suggestions and were pleased to note the quality of work and the findings were well received. Both reviewers commented that the datasets and findings represent a valuable resource for the field, and that this is a valuable resource paper. The only major concern related to conceptual advance and we provide a clear plan below that we believe will thoroughly address this issue.

      Below, we provide point-by-point responses to each of the reviewer’s comments. These are presented to improve the conceptual advance in section 1 and address all other issues raised in other minor comments in section 2.

      SPECIFIC ISSUES:

      Section 1: Conceptual A____dvance

      One main concern raised by both of the reviewers was that the main biological findings did not represent a major new conceptual advance, which is encapsulated by the comments below:

      Reviewer 1:

      “Major comments:

      The conclusions of the manuscript are convincing. The robust data generated is inherently valuable and is of great interest of the field. However, my impression is that the authors did not utilize the power of their studies. The main message - Prox1 is a key regulator in promoting and maintaining lymphatic cell fate - is well accepted and has been intensively studied. Therefore, the main findings presented in the current manuscript are not conceptually-advancing.

      Additional studies focusing on the function of some of the identified hit genes, such as cdh6, slc7a7, fabp11a in lymphatics - either in fish or in vitro - would significantly improve the novelty of the article. Zebrafish is an ideal experimental model that enable a relatively easy and quick way to address these questions. However, considering the time and expense of those experiments, in vitro studies would be also well appreciated instead of fish.”

      Reviewer 2:

      “While the work presented in this manuscript could be an interesting resource for the researchers in the field, it does not provide significant conceptual advances in the field.”

      “CROSS-CONSULTATION COMMENTS

      I agree with the excellent technical and statistical comments of Rev. 2. Overall, we are in agreement regarding the strength of the datasets as a resource for the field, but with limited conceptual novelty.”

      We appreciate both reviewers’ feedback and take this concern seriously. We believe that the paper would be improved by utilizing the unique and extensive single cell resource to develop a deeper new understanding of the molecular control of lymphatic development. We also believe that the novel and new biology already presented in the paper can be better highlighted by re-writing the paper in key places.

      In revision, we will therefore provide two major improvements to address these comments on conceptual advance.

      Firstly, we will re-write relevant sections of the paper to provide an improved focus on the new biology uncovered and less of a technical focus on the resource generated. Specifically, the key new biological insights our thorough analysis already made in the manuscript that will be better highlighted include:

      1. We define the precise timing of differentiation of lymphatics in vivo. Specified LECs in the cardinal vein are not significantly differentiated from their venous neighbours, rather they become transcriptionally distinct between 40 and 72 hours post fertilisation (hpf), well after the sprouting events by which they emerge from the vein. This was not previously shown in any study.
      2. We show in definitive Prox1 null maternal and zygotic double mutant zebrafish for the first time that the sprouting of LECs from the cardinal vein occurs independently of Prox1 function, separating the initial sprouting and fating events. This is different from earlier findings in mouse (Yang, Garcia-Verdugo et al. 2012) and offers a unique new understanding.
      3. We define the transcriptional program controlled by Prox1 during the maintenance of LEC fate in vivo at a whole transcriptome level. This has never actually been done before in any published literature. This reveals that Prox1 simultaneously up-regulates lymphatic marker genes and down-regulates blood vascular marker genes (which was known for a small number of markers). However, it also demonstrates the surprising finding that none of the change in fate from blood vascular to lymphatic vascular is regulated independently of Prox1 function. This shows that Prox1 is not “a” master regulator but “the only” master regulator of this fate decision in a definitive manner for the first time.
      4. In contrast to how Prox1 maintains lymphatic fate, we also provide a challenging and important analysis of Prox1 function at the earliest stages of lymphatic specification and transdifferentiation from blood vascular to lymphatic vasculature. For the first time in any published literature, we show that the role of Prox1 in vivo in this fate decision is primarily to negatively regulate blood vascular and hematopoietic cell fate and not to positively regulate a lymphatic specification gene network. Importantly, this suggests that lymphatic fate transition begins by blocking what may be a default blood vascular cell fate. This was not previously shown in vivo.
      5. Finally, at a molecular level we have demonstrated that Prox1 regulates chromatin accessibility across the genome. Specifically, mutants show a unique and unexpected chromatin signature, whereby chromatin is opened up at many key lymphatic developmental genes but these genes are not transcribed. This discordance in chromatin state and gene transcription appears to be consistent with ectopic activity of early blood and blood vascular transcription factors. This unique finding indicates that Prox1 function negatively regulates blood and blood vascular transcriptional control across diverse enhancers and regulatory elements, and that Prox1 function determines normal chromatin state changes to regulate cell fate. We believe that we did not do a good job of highlighting these important biological observations for the reviewers. Our revision will better emphasise the biological meaning in our data, rather than emphasising the technical aspects of the work.

      Secondly, we agree with the reviewers that extracting a new biological finding or understanding from the data will improve the impact of this study. A longstanding question in the field of lymphangiogenesis has been what precise role Notch signalling plays in cell fate decisions and in vessel network growth. The literature is very murky on the role of Notch signalling in the specification of lymphatics. For example:

      1. Human in vitro work (Kang, Yoo et al. 2010) showed that increased Notch pathway function repressed expression of key transcription factors Prox1 and CoupTFII and the subsequent induction of LEC fate, but this was not confirmed in vivo.
      2. In mice, Murtomaki et al (2013) reported that Notch signalling negatively regulates VEC to LEC transition via suppression of Prox1 expression at the earliest stages of specification of LECs from the cardinal veins (Murtomaki, Uh et al. 2013).
      3. This work contradicts the rather definitive observation that endothelial deletion of the core Notch effector Rbpj (Tie2:Cre, Rbpj-f/f) has no effect on the expression of Prox1 in the cardinal veins (Srinivasan, Geng et al. 2010).
      4. In zebrafish, it was found that lymphatics don’t form in the absence of Notch signalling (Geudens, Herpers et al. 2010), but in this study we found no evidence of active Notch signalling during LEC specification and sprouting. This was recently explained with the demonstration that it was arterial Notch signalling responsible for the abnormal wiring of zebrafish vessels and loss of lymphatics (Geudens, Coxam et al. 2019). These studies suggest that the role of Notch signalling in zebrafish is not autonomous to the developing veins and lymphatics. Thus, it is currently very unclear if Notch signalling plays a specific role in developing lymphatics or if so, when and how it controls lymphangiogenesis in a cell autonomous manner.

      Upon re-evaluating our data, we examined all of the known Notch ligands, receptors and target genes with single cell resolution. To our surprise we found that:

      • jag2b expression is a specific marker of the fate-shifted LECs in Zprox1a-/- mutants at 4dpf, switching on when LEC fate is not maintained by Prox1.
      • notch1a and notch1b are the key lymphatic expressed receptors for the pathway in zebrafish.
      • The main downstream target expressed was her6, which was expressed in a specific manner in vasculature in the maturing LECs.
      • Strikingly, there was little to no expression of these key pathway components at specification of LEC fate stages, but rather the Notch pathway is active at later stages when LECs differentiate and grow in the embryo. This prompted us to examine a unique notch1b mutant that we have in the lab. We found that this mutant has clear defects in lymphangiogenesis that impact later stages of development but do not impact early specification.

      For our revision, we therefore plan to include one additional large Figure of data. This figure will build of a deeper analysis of Notch signalling in our single cell RNA and ATAC sequencing resource and use our new mutant strain to definitively demonstrate the importance of and timing of Notch signalling in the development of lymphatic vessels. We believe that this will clear up the mystery of when and how Notch controls lymphangiogenesis and will add important new conceptual advance to the paper.

      Section 2: Other minor comments

      Reviewers’ Comments:

      Reviewer #1

      An article in 2017 presented abundant expression of fabp11a in zebrafish and suggested its function in brain vessel integrity (PMID: 28443032). In the current manuscript however, the authors did not find fabp11a expression in the head vasculature. Did the authors not detect expression of fabp11a in brain blood vessel endothelial cells at the investigated stages of the zebrafish development? In this case, how would they discuss this seeming contradiction?

      We thank the reviewer for pointing out this study. In the paper from Zhang et al. (2017), the authors showed blood vessel expression of fabp11a at earlier stages than we have examined in our images here. In particular, the expression in blood vessels in the head was shown at 1.5, 2 and 3 dpf. We have examined our transgenic line only at 5 dpf. At 5 dpf we do see expression in the trunk veins, which is consistent with the Zhang paper, but we have not looked at the cranial blood vessels at early stages.

      In our revision, we will image earlier stage brain blood vessels using our new transgenic line to address this issue and provide additional confidence in our findings.

      Minor comments:

      In Figure 1a, authors show LEC sprouts in the trunk region at 40 hpf. At 3 dpf however, these LECs sprouts are not shown, but parachordal LECs only. Do these LEC sprouts disappear by 3 dpf? Cartoons on later timepoints suggest that LEC sprouts shown at 40 hpf remain in their location and make connection with parachordal LECs, but the panel in its current form is misleading.

      We thank the reviewer for this feedback. We will correct this figure to better indicate these key stages and we will include a full reference at this point of the article to our previous review article Hogan and Schulte-Merker (2017) in which we describe this process in detail and in full (Hogan and Schulte-Merker 2017).

      Although I appreciate that the authors were consistent with the colour coding in the graphs, some combinations should be revised. Although the light blue/dark blue colour combination works well in other places, in Figure 4a, it is hard to distinguish those colours. Use of a higher contrast colour combination would be better.

      We will correct this by using high-contrast colour combinations as requested.

      In Figure 1b, similar colours are used for different purposes. Orange in the upper panel shows 40 hpf cluster, while a very similar colour is used for the VEC_preLEC cluster in the lower panels. Although I recognize the overlay between these clusters, a different colour coding would be more accurate. Maybe, clusters from the upper panel (Stage) should be show individually, just like genes in panel c, to help the reader identifying those clusters at different timepoints.

      We will correct this by selecting different colours for VEC_preLEC cluster and cells collected at the 40hpf time point.

      Reviewer #2:

      Specific Comments:

      In general, the authors need to be more precise and cautious in interpreting the RNA velocity analyses. For instance, in Fig 1b, there are two potential regions which could reflect VEC to LEC transition (the one which is connected to LEC sub-cluster and the other which is located in between LEC and VEC/preLEC sub-clusters.) Which trajectory are the authors referring to? In addition, in Fig 3c, the authors claim that RNA velocity analyses showed that the cells within the mutant cluster, however, since cells located within the edge of the clusters tend to have similar trajectory (for instance, cells in the right edge within the LEC_S1 sub-cluster and those in the top left edge within the LEC_S2 sub-cluster), it is difficult to assess whether the trajectory the authors indicated in the mutant sub-cluster is biologically meaningful and relevant. Finally, in Fig 7a, further analyses are needed to support the authors claim which is solely based on RNA velocity analyses.

      We thank the reviewer for this feedback and will ensure the size of arrows on our Velocity analysis are increased, to facilitate interpretation of the data. Further to this we will include a second trajectory analysis (Street, Risso et al. 2018) in Extended data figures 1, 2 and 7 that we expect will validate our observations made in Figures 1b, 3c and 7a respectively.

      In Fig. 1b, it is not clear whether arterial and venous ECs were excluded from the analyses, if so, the authors need to state how these cell types were identified and excluded. In addition, it would be helpful if the authors show the actual number of cells in each sub-cluster, so the readers could estimate the prevalence of each sub-cluster.

      We agree that this information can be more explicitly described, and will include the number of cells per cluster in the legend of Figure 1b, and all single cell RNA-seq UMAPs that define sub-populations. Furthermore, we will include an extra column in Extended Data Table 1a detailing the number of cells per cluster, expand Extended Figure 1 to describe step-wise sub setting of data. We will do this for all 3 single cell datasets. This information will also be written into the Results and Methods.

      In Fig 2a, the authors claim that the level of gene expression is different between head and trunk region using cropped fluorescence microscopy images. It would be more convincing if the authors show both head and trunk regions in a single image.

      We will address this by using images taken of the entire fish including both head and trunk.

      In Fig. 1c, could the authors include an UMAP image showing the expression level of prox1b? It would be helpful for the readers to compare the expressivity of prox1b over time.

      We will amend Figure 1c by replacing UMAP images of hexa with prox1b (prox3).

      In Fig. 1d, the authors need to explain why the expression of LEC markers diminish at 5dpf.

      We thank the reviewer for pointing this out. The 4 dpf single cell RNA-seq libraries are larger than the other libraries included in our developmental time course. While the normalisation (Stuart, Butler et al. 2019) and integration (He, Brazovskaja et al. 2020) approaches have partially corrected this, we believe the higher expression at 4 dpf can be attributed to library size rather than biology. In our revision we will include an analysis that applies down-sampling to larger libraries, that we believe will reduce the contribution of library size to gene expression patterns reported in the developmental time-course.

      In Fig. 3a, it would be helpful if the authors show arterial ECs as well, so the readers could assess the characteristics of mutant clusters in a more general context.

      We thank the reviewer for this feedback. This information is detailed in Extended Data Figure 2a, which shows UMAPS for all cells in the Zprox1a-/- mutant scRNA-seq dataset. We will expand this figure and include a separate panel with additional UMAP images and dot plots of all endothelial cell types including AEC (arterial endothelial cells), and believe that this will allow readers to better appreciate how different sub-populations of ECs relate to each other.

      In the current Figure 3 we focus exclusively on evaluating the LEC, VEC and mutant sub-populations, allowing the reader to hone in on our key points.

      In Fig. 3a and 3b, the authors state that Zprox1a null cells generate a peculiar VEC cluster (mutant cluster). Does prox1a influence the transcriptomic profile of VECs as well?

      We thank the reviewer for this important question. We will expand the Extended Data Table 2 to include differential expression analyses between Zprox1a-/- mutant and WT AEC (arterial endothelial cells), and VEC (venous endothelial cells). We will include a dot plot in Extended Data Figure 2 that includes cluster specific markers of the mutant cluster with Zprox1a-/- mutant and WT AEC and VEC phenotypes. This demonstrates that the changes in Prox1 mutants are restricted to the cells that normally express Prox1 (i.e. LECs).

      It is not clear how the normalization was done in Fig. 3d.

      We will include this information in the Results section text more clearly upon revision.

      In Fig. 3f, the number of the genes do not match with the extended data table 2b (1034 vs 1107, and 294 vs 326).

      We thank the reviewer for picking up these errors. Figure 3f includes all genes that are considered differentially expressed (Wilcoxin Rank Sum adjusted p value

      In Fig. 3i and 3k, the authors show the quantification of cdh5/kdrl intensity within the thoracic duct. It would be helpful if the authors could correlate the location of the area used for quantification (whether the quantification represents LEC cluster or mutant cluster).

      We thank the reviewer for this suggestion and will add a clear box displaying where measurements were made. We will also amend the text for clarity.

      Can the authors specify the unique characteristics of mutant clusters such as the presence of specific markers?

      We thank the reviewer for this suggestion, and will amend the text for clarity. We will include a dot plot of top cluster specific markers for all clusters (including the mutant specific cluster) in Extended Data Figure 2.

      In Fig. 4g, how prevalent is prox1a/b binding sites and what is the P value?

      This is a great question from the reviewer. The Prox1-motif has been problematic but we have now developed robust approaches to identify predicted Prox1-motifs in our snATAC identified peaks. We have now performed a Prox1 motif analysis and will update Figure 4g to include these results. We will include a quantitative comparison of the frequency of Prox1 motifs in LEC, VEC and AEC specific peaks identified in our ATAC analyses.

      In Fig. 5a and 5b, the authors assume that the mutant cluster in scRNA-seq data and the mutant cluster in snATAC-seq data are the same population. Is there any validation done?

      We thank the reviewer for pointing this out. We will clarify in text that we believe that these are the same cell population for two reasons:

      1. They are the only populations in both the scRNA-seq and snATAC-seq data composed almost entirely of Zprox1a-/- mutant cells.
      2. Furthermore, all other endothelial cell phenotypes (eg. AEC, VEC, LEC, muLEC, Endocardium) are accounted for in both datasets. At the transcriptional level (in our scRNA-seq) the mutant specific cluster co-expresses LEC and VEC markers, suggesting it is a hybrid cell type that sits between LEC and VEC phenotypes. However, at key lymphatic genes chromatin accessibility and gene expression (comparing snATAC and scRNA-seq) become discordant in the mutant specific clusters, which gives us confidence that we are observing a fate shift due to loss of Prox1 in this specific type of cell. This also suggests that Prox1 is required for concordant chromatin accessibility and gene expression.

      In Fig. 5c, figure legend and the extended data table 4a did not match. In Fig 5c, the figure legend says the cut off was set by Wilcoxon Rank Sum, FDRWe thank the reviewer for picking up these errors. As in Figure 3f, Figure 5c includes peaks that are considered differentially accessible (Wilcoxin Rank Sum FDR In Fig. 7d and 7e, it is not clear how the clustering was performed. Based on the image shown in the Fig. 7d/e, three sub-clusters do not seem to clearly separate from one another. It would be helpful if the authors clearly state what was the criteria used for the clustering.

      We thank the reviewer for this suggestion. The reason that the clusters sit close together is because these cell types are not yet differentiated from each other. This can be appreciated by looking at clustering of all endothelial cells in 7a. In response to this comment we will no longer show subsetted and re-clustered data in 7d (we will move this to Extended Data), instead will display 7d and 7e using the same UMAP used in 7a with other endothelial cells (AEC, VEC, Endocardium) coloured light grey. We will also expand our description of clustering in the Results and Methods.

      Overall, the dot plots should be replaced with the violin plots to better reflect potential heterogeneity within sub-clusters.

      We agree that for key points, violin plots could be helpful. We will include violin plots in Extended Data Figures for key data points that include the following: Figures 1c, 3b and 7e. This will ensure that readers have a clear appreciation for heterogeneity within sub-clusters for all key markers that define phenotype in each dataset.

      Other comments from reviewers:

      Reviewer 1:

      Significance:

      The manuscript uses state of the art approaches to characterize Prox1-dependent transcriptional and chromatin accessibility changes that define LEC fate and lymphatic sprouting in zebrafish models.

      The key role of Prox1 in LEC differentiation and maintenance of lymphatic cell fate and lymphatic development is well known based on previous findings. Strength of the current manuscript is the massive dataset generated, which opens the opportunity to identify downstream players of Prox1 in regulating lymphatic fate and expansion. The authors, however, did not utilize this opportunity for elucidating novel conceptual findings about lymphatic endothelial fate, development or function.

      The presented results will be of interest for experts in vascular biology, lymphatic biology, developmental biology and genetics. The generated data may be further used in studies investigating the function of the hit genes highlighted in this manuscript in lymphatic vessels.

      Reviewer 2:

      Grimm and colleagues analysed developmental lymphangiogenesis in zebrafish embryos using single cell transcriptomics. They identified a number of novel targets of Prox1a, the master regulator for the LEC fate. In addition, the authors have identified a novel mutant-specific sub-cluster in Zprox1a mutant embryos, reiterating the importance of prox1a in the specification and differentiation of LECs.

      Significance:

      While the work presented in this manuscript could be an interesting resource for the researchers in the field, it does not provide significant conceptual advances in the field. Moreover, there are some technical issues that needs to be resolved prior to the publication of the manuscript.

      We thank the reviewers for their positive response and feedback.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Grimm and colleagues analyzed developmental lymphangiogenesis in zebrafish embryos using single cell transcriptomics. They identified a number of novel targets of Prox1a, the master regulator for the LEC fate. In addition, the authors have identified a novel mutant specific subclusters in Zprox1a mutant embryos, reiterating the importance of prox1a in the specification and differentiation of LECs.

      Specific Comments:

      1. In general, the authors need to be more precise and cautious in interpreting the RNA velocity analyses. For instance, in Fig 1b, there are two potential regions which could reflect VEC to LEC transition (the one which is connected to LEC subcluster and the other which is located in between LEC and VEC/preLEC subclsuters.) Which trajectory are the authors referring to? In addition, in Fig 3c, the authors claim that RNA velocity analyses showed that the cells within the mutant cluster, however, since cells located within the edge of the clusters tend to have similar trajectory (for instance, cells in the right edge within the LEC_S1 subcluster and those in the top left edge within the LEC_S2 subcluster), it is difficult to assess whether the trajectory the authors indicated in the mutant subcluster is biologically meaningful and relevant. Finally, in Fig 7a, further analyses are needed to support the authors claim which is solely based on RNA velocity analyses.
      2. In Fig. 1b, it is not clear whether arterial and venous ECs were excluded from the analyses, if so, the authors need to state how these cell types were identified and excluded. In addition, it would be helpful if the authors show the actual number of cells in each subcluster, so the readers could estimate the prevalence of each subcluster.
      3. In Fig 2a, the authors claims that the level of gene expression is different between head and trunk region using cropped fluorescence microscopy images. It would be more convincing if the authors show both head and trunk regions in a single image.
      4. In Fig. 1c, could the authors include an UMAP image showing the expression level of prox1b? It would be helpful for the readers to compare the expressivity of prox1b over time.
      5. In Fig. 1d, the authors need to explain why the expression of LEC markers diminish at 5dpf.
      6. In Fig. 3a, it would be helpful if the authors show arterial ECs as well, so the readers could assess the characteristics of mutant clusters in a more general context.
      7. In Fig. 3a and 3b, the authors state that Zprox1a null cells generate a peculiar VEC cluster (mutant cluster). Does prox1a influence the transcriptomic profile of VECs as well?
      8. It is not clear how the normalization was done in Fig. 3d.
      9. In Fig. 3f, the number of the genes do not match with the extended data table 2b (1034 vs1107, and 294 vs 326).
      10. In Fig. 3i and 3k, the authors show the quantification of cdh5/kdrl intensity within the thoracic duct. It would be helpful if the authors could correlate the location of the area used for quantification (whether the quantification represents LEC cluster or mutant cluster).
      11. Can the authors specify the unique characteristics of mutant clusters such as the presence of specific markers?
      12. In Fig. 4g, how prevalent is prox1a/b binding sites and what is the P value?
      13. In Fig. 5a and 5b, the authors assume that the mutant cluster in scRNA-seq data and the mutant cluster in snATAC-seq data are the same population. Is there any validation done?
      14. In Fig. 5c, figure legend and the extended data table 4a did not match. In Fig 5c, the figure legend says the cut off was set by Wilcoxon Rank Sum, FDR<0.05. However, in the extended data table 4a, different cut off was used. Similarly, figure legend for Fig. 5e needs to be revised as well.
      15. In Fig. 7d and 7e, it is not clear how the clustering was performed. Based on the image shown in the Fig. 7d/e, three subclusters do not seem to clearly separate from one another. It would be helpful if the authors clearly state what was the criteria used for the clustering.
      16. Overall, the dot plots should be replaced with the violin plots to better reflect potential heterogeneity within subclusters.

      Significance

      While the work presented in this manuscript could be an interesting resource for the researchers in the field, it does not provide significant conceptual advances in the field. Moreover, there are some technical issues that needs to be resolved prior to the publication of the manuscript.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The authors provide a comprehensive transcriptomic and chromatin accessibility atlas of embryonic lymphangiogenesis in fish using state-of-the-art sc-RNAseq and single cell ATAC sequencing approaches. Furthermore, they present data to prove that Prox1 is a key factor of maintaining LEC identity and promoting lymphatic vascular fate and lymphatic sprouting. Using novel reporter models, they further analyzed the spatial expression pattern of numerous hit proteins in the presence or absence of Prox1 genes.

      The manuscript is well written, clear and reproducible based on the given information.

      Major comments:

      The conclusions of the manuscript are convincing. The robust data generated is inherently valuable and is of great interest of the field. However, my impression is that the authors did not utilize the power of their studies. The main message - Prox1 is a key regulator in promoting and maintaining lymphatic cell fate - is well accepted and has been intensively studied. Therefore, the main findings presented in the current manuscript are not conceptually-advancing.

      Additional studies focusing on the function of some of the identified hit genes, such as cdh6, slc7a7, fabp11a in lymphatics - either in fish or in vitro - would significantly improve the novelty of the article. Zebrafish is an ideal experimental model that enable a relatively easy and quick way to address these questions. However, considering the time and expense of those experiments, in vitro studies would be also well appreciated instead of fish.

      An article in 2017 presented abundant expression of fabp11a in zebrafish and suggested its function in brain vessel integrity (PMID: 28443032). In the current manuscript however, the authors did not find fabp11a expression in the head vasculature. Did the authors not detect expression of fabp11a in brain blood vessel endothelial cells at the investigated stages of the zebrafish development? In this case, how would they discuss this seeming contradiction?

      Minor comments:

      In Figure 1a, authors show LEC sprouts in the trunk region at 40 hpf. At 3 dpf however, these LECs sprouts are not shown, but parachordial LECs only. Do these LEC sprouts disappear by 3 dpf? Cartoons on later timepoints suggest that LEC sprouts shown at 40 hpf remain in their location and make connection with parachordial LECs, but the panel in its current form is misleading.

      Although I appreciate that the authors were consistent with the color coding in the graphs, some combinations should be revised. Although the light blue/dark blue color combination works well in other places, in Figure 4a, it is hard to distinguish those colors. Use of a higher contrast color combination would be better.

      In Figure 1b, similar colors are used for different purposes. Orange in the upper panel shows 40 hpf cluster, while a very similar color is used for the VEC_preLEC cluster in the lower panels. Although I recognize the overlay between these clusters, a different color coding would be more accurate. Maybe, clusters from the upper panel (Stage) should be show individually, just like genes in panel c, to help the reader identifying those clusters at different timepoints.

      Referees cross-commenting

      I agree with the excellent technical and statistical comments of Rev. 2. Overall, we are in agreement regarding the strength of the datasets as a resource for the field, but with limited conceptual novelty.

      Significance

      The manuscript uses state of the art approaches to characterize Prox1-dependent transcriptional and chromatin accessibility changes that define LEC fate and lymphatic sprouting in zebrafish models.

      The key role of Prox1 in LEC differentiation and maintenance of lymphatic cell fate and lymphatic development is well known based on previous findings. Strength of the current manuscript is the massive dataset generated, which opens the opportunity to identify downstream players of Prox1 in regulating lymphatic fate and expansion. The authors, however, did not utilize this opportunity for elucidating novel conceptual findings about lymphatic endothelial fate, development or function.

      The presented results will be of interest for experts in vascular biology, lymphatic biology, developmental biology and genetics. The generated data may be further used in studies investigating the function of the hit genes highlighted in this manuscript in lymphatic vessels.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: GlmS, the glucosamine-6-phosphate synthetase in E. coli and related bacteria, is essential, required for synthesis of both peptidoglycan and LPS. It is regulated at various levels, including positive regulation of GlmS translation by the Hfq-binding sRNA GlmZ. GlmZ activation of translation is regulated, indirectly, by the levels of GlcN6P, the product of GlmS. The components of the sensing and regulatory cascade have previously been defined, via genetics, biochemical and molecular biology studies. GlmZ is cleaved by Rnase E, becoming inactive, when GlcN6P levels are high, dependent upon the binding of GlmZ to RapZ. RapZ has been found to directly sense GlcN6P levels; another regulatory RNA, GlmY, also binds RapZ in the absence of GlcN6P, protecting GlmZ from RapZ-mediated processing. The authors of this manuscript performed cryoEM to study the structure of two important complexes in this sensing cascade, RapZ/GlmZ and RapZ/GlmZ/RNase E-NTD, with the aim of clarifying how the RNA binding protein RapZ causes the cleavage of sRNA GlmZ by RNaseE. Some of the predictions for critical residues in the RapZ/GlmZ binary complex structure were investigated by mutagenesis RapZ to define essential resiudes for GlmZ cleavage; the results are consistent with the structure.

      Major comments:

      • Are the key conclusions convincing? 1) Given that this is basically a structural paper, the major questions would be whether the cryoEM reconstructions are accurate (appear to be consistent with general expectations) and whether there is clear evidence to support the physiological relevance of the structure. The tests of function are of two sorts: a) Effect of RapZ mutants in Fig. 3b-d. These tests show loss of RapZ function with various alleles, likely consistent with model (but as noted below, very difficult for the reader to identify on the structures in 3a). The implication is that these will interfere with GlmZ binding. Possibly direct tests of a couple of these mutants for GlmZ binding (or pull down of GlmZ from in vivo expressed protein) would further support the model. I note that the text says T248A was unaffected in cleavage, but seems to be much reduced in Fig. 3b, even if fusion activity is good.

      Our reply. We have made further tests of the mutations for GlmZ binding. Using electrophoretic mobility shift assays, we observe reduced GlmZ binding affinities for RapZ mutants K170A, H190A, C247A, T248A (figure below). We also tested the activity of RapZ variant with 4 substitutions at the proposed RapZ/NTD interface (right lanes in figure below).

      We followed the recommendation of the reviewer and performed co-purification experiments (“pull-down”) using StrepTactin affinity chromatography and Strep-tagged RapZ variants as baits. Eluates were assessed for RapZ protein content and co-eluting GlmZ and processed GlmZ* sRNAs using Northern blotting. These new results, which have been incorporated in Fig. S7c, show that all tested RapZ variants except for the wild-type protein are not capable to pull-down GlmZ or GlmZ* in cell extracts. This includes the RapZ-T248A variant, which as noted by the referee is nonetheless still capable to decrease full-length GlmZ to some extent, albeit processed GlmZ* is hardly detectable (Fig. 3b, lanes 23, 24). To address this issue further, we purified the RapZ-T248A variant and some additional variants for comparison and performed EMSA. Globally, the EMSAs confirm the co-purification experiments, i.e. they demonstrate strongly reduced GlmZ binding activity for most tested RapZ variants, but also show that the RapZ-T248 variant kept some residual binding activity. This may explain the weak signal for processed GlmZ in the Northern blot (Fig. 3b) as processed GlmZ* likely binds to RapZ for stabilization. Similar effects were previously seen for the RapZquad and the RapZ 1-279 variants in Durica-Mitic et al. 2020 (Fig. 5). Accordingly, we also changed our wording concerning the RapZ-T248A variant in the text. We have not incorporated the EMSA figure into the updated manuscript.

      b) The ternary complex was tested primarily by the BACTH assay of some RapZ mutants (Fig. S11), that show a reduced interaction. This is not a particularly convincing assay for a number of reasons: 1) the effects are relatively modest (2x down, in an assay that is probably not very linear with interaction, 2) some with reduced interaction (S239A, T248A) had good activity (at least all those with full interaction seem to be functional); 3) Ternary complex suggests that RapZ mediates this interaction; this could be tested by deleting glmZ (and maybe glmY as well) from this BACTH strain. 4) the authors suggest that there are also important protein-protein interactions, based on some observed interactions, and support this with similarly difficult to interpret BACTH data from a previous paper for Rnase E-RapZ interaction. Here, too, that is not the most compelling data (is this interaction RNA-independent?).

      Our reply: Previous work already indicated that formation of the ternary complex involves multiple interactions – direct protein-protein contacts but also indirect interactions mediated by sRNA GlmZ. For instance, in vitro pull-down signals (RapZ = prey; RNase E = bait) become reduced but not abolished when RNA-free protein preparations are used (Durica-Mitic et al., 2020; Fig. 2E). BACTH signals are reduced 2-fold when using RNase E and RapZ variants that are strongly impaired in their RNA-binding capabilities, respectively (Durica-Mitic et al., 2020; Fig. 2C). As the BACTH assays and in vitro pull-down approaches yield similar trends, we suggest that BACTH experiments represent a useful approach to clarify the questions under study.

      Point b1: To demonstrate that removal of multiple interactions is required to disrupt the ternary complex, we combined substitutions of residues making contact to the sRNA as well as residues directly contacting RNase E. According to the structure of the ternary complex presented here, residues T161, Y240, N271 and Q273 in RapZ are proposed to contact RNase E directly. Upon substitution of these four latter residues, resulting in the RapZ variant named RapZ-4 subst., the BACTH signal decreases two-fold – similar to what is observed for the RapZ variants that carry Ala substitutions of residues involved in sRNA-binding, such as H190 or R253. Importantly, when the latter two substitutions are introduced into the RapZ-4 subst. variant – either alone or in combination, the BACTH signal is reduced to almost back-ground levels. These results are in agreement with the features of the ternary complex proposed here and also with data obtained previously: They show that protein-protein and protein-RNA contacts must be concomitantly removed to disrupt the complex completely. We integrated the latter data as Fig. S7a in the revised manuscript and discuss the data at the appropriate positions in the text.

      Point b2: In our opinion, the data reporting regulatory activity of the individual RapZ variants (Fig. 3 b-d) correlate well with the BACTH data (Fig. S7a): RapZ variants carrying substitutions of residues I175 and N236 retain regulatory activity and concomitantly a high RNase E interaction potential indistinguishable from the wild-type is observed. In contrast, RapZ variants carrying substitutions affecting sRNA-binding, i.e. H190A, C247A, C247S, T248D, G249W, R253A loose activity completely and concomitantly show a 2-fold decrease in the BACTH signal. The remaining BACTH signal is explained by the remaining (protein-protein) contacts as discussed above (point b1). Therefore, these variants are likely uncapable to present GlmZ in a correct manner to RNase E even though interaction is retained to some degree.

      Only the RapZ mutants with exchanges H171A, S239A and T248A do not follow either of these two scenarios: albeit they exhibit reduced interaction with RNase E according to BACTH, they retain the ability to regulate the chromosomal glmS’-lacZ fusion, at least when produced from a plasmid (Fig. 3d). However, inspection of the GlmZ Northern blot signals (Fig. 3b) reveals that full-length GlmZ is decreased as expected, but that processed GlmZ* becomes either not visible or is much reduced when compared to wild-type RapZ. This explains by a reduced sRNA binding affinity, as pointed out above (point 1a), which also provides a rationale for the decreased BACTH signal.

      Point b3: We agree that deletion of glmZ in the BACTH strain would be an ideal approach to dissect the contributions of protein-protein and sRNA-protein mediated interactions for formation of the ternary complex in vivo. Unfortunately, construction of the strain is not straight-forward. In our hands, the BACTH reporter strain BTH101 is not amenable to chromosomal manipulations by using engineered recombination tools such as the phage lambda-derived Red system. This may be explained by regulatory elements used by the l Red system that depend on cAMP, which cannot be synthesized in this strain.

      __Point b4: __We have addressed this query in the response to point b1.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Possibly the importance of RNAse E-RapZ direct interaction, without further proof that this actually is needed for function.

      __Our reply: __We partially addressed this issue already in our response to point b1. Additionally, we also tested activity of the RapZ-4 subst variant that lacks the residues making direct contact to RNase E in our structure (Fig. 3b-d, last two lanes/columns). The results that are now described in the last paragraph of the results section show that this variant retains regulatory activity. Interestingly, the level of processed GlmZ* is strongly reduced in this case, similar to what is observed with the RapZ-S239A and RapZ-T248A variants discussed above. Therefore, these direct protein-protein contacts might have a role for GlmZ* decay in a manner that remains to be addressed.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. As noted above, further tests of RapZ mutants for RNA binding would be useful; if this has been done previously, needs to be presented.

      Our reply.

      This has been addressed in the response above.

      Are there Rnase E residues that would be predicted by the model to be critical for the RapZ or GlmZ interaction but are not otherwise needed for activity? Would these disrupt either the BACTH results or activity in vivo?

      Our reply.

      Please see response to this point above.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. Yes, they are. They are generally extrapolations from what is already in the paper or in previous studies by these groups.
      • Are the data and the methods presented in such a way that they can be reproduced? Yes.
      • Are the experiments adequately replicated and statistical analysis adequate? Yes.

      Minor comments: - Specific experimental issues that are easily addressable. None noted. - Are prior studies referenced appropriately? Yes, they are. However, the paper could more clearly outline what is already known at the level of interactions of the molecules under study here.

      Our reply. We have changed the text to better introduce information from previous studies: interprotomer contacts, properties of the isolated RapZ domains, conclusions from the truncation analyses, requirements for interaction for RNase E and for sRNA-binding, stabilization of processed GlmZ through RapZ binding (Göpel et al., 2013; Gonzalez et al 2017; Durica-Mitic and Görke, 2019; Durica-Mitic et al., 2020).

      • Are the text and figures clear and accurate?
      • In a number of places, the text and figure order/numbers are not correct. See Fig. S1 (p. 4), S2 (legends vs. figure panels).

      Our reply. We have corrected these in the revised text.

      Better labeling in many figures is needed. Clarify what is shown in Fig. S2d, and make the labels readable. Label the particle types in S3. Use schematics more (as in Fig. 4 and S8) to make it easier for reader to follow structure (for Fig. 2, for instance). It is very difficult to discern RapZ tetramer here. Fig. 3a, it is very difficult to see the residue numbers on the structures. Clearly identify the fructokinase-like domains. Label lanes in Fig. 3b, c, d. Indicate active site for RNase E. in Fig. 4, in schematic at least.

      Our reply. We have also corrected these in the revised text.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? Overall, clarify and highlight better how the structures here fit with what is already known about important sequences/regions of RapZ, GlmZ, and Rnase E, maybe color-coding parts of GlmZ shown to be important for RapZ recognition, etc.

      Our reply. We have added a sequence alignment for RapZ in the supplementary materials section, indicating important residues (Fig. S12).

      Page 12, the second last row. Text after 'In this model...' can be simplified or removed because it is just a hypothesis.

      Our reply. We have simplified the text.

      Our reply:

      We believe that the discussion section should also give room for novel ideas and hypotheses. Therefore, we wish to keep the paragraph.

      Reviewer #1 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. Rnase E is a major essential endonuclease in bacteria such as E. coli. How accessory proteins lead to its recognition and cleavage of regulatory RNAs such as GlmZ is not well understood at the structural level, and these structures provide important insight into that process. In addition, the GlmZ/RapZ regulatory circuit plays an important role in bacterial growth and pathogenesis, and understanding it at this level of detail will certainly open up possibilities for targeting this process in the future.

      • Place the work in the context of the existing literature (provide references, where appropriate). The components that go into the current structures have been studied previously, with publications on RapZ structure, analysis of critical regions within the GlmZ RNA, and demonstration of the domain of Rnase E involved in interactions with RapZ (Durica-Mitic et al, 2020; Khan et al, 2020, Gonzalez et al, 2017, among others), exactly how these fit together has not been known. Other RNA binding proteins that affect degradation have been reported, but are not fully understood, and ways in which the ribonuclease binds complex RNAs is not fully understood either.

      • State what audience might be interested in and influenced by the reported findings. This work should be of broad interested to the field of RNA-based regulation and RNA degradation, with particular interest for those working on these processes in bacteria.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. Our expertise is in RNA-based regulation and microbial genetics; we are not able to critically evaluate the cryoEM analysis itself.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Islam et al present their characterization of the E. coli RapZ-GlmZ-RNase E ternary complex in this manuscript under review. In E. coli, the RNA binding protein RapZ facilitates cleavage of GlmZ sRNA by RNase E when intracellular concentrations of GlcN-6P are high; when GlcN-6P levels are low RapZ is titrated by GlmY sRNA and GlmZ sRNA promotes an increase in the translation and stability of the mRNA encoding GlcN-6P synthase, GlmS. Via Cryo-EM, the authors of this manuscript solve the structure of the binary RapZ:GlmZ (Fig. 2) and ternary RapZ:GlmZ:RNase Y (Fig. 4) complexes. Based on the apparent RapZ-sRNA binding sites in the solved structure of the binary complex, the authors make substitutions in residues suspected to be involved in RNA binding and measure the impact of these substitutions on cleavage of GlmZ and GlmZ-mediated activation of GlmS expression (Fig. 3). The authors find that some of the residues predicted to be involved in RNA binding based on their structural studies are also important for the cleavage of GlmZ, presumably by RNase E. Finally, the authors show via bacterial two-hybrid assays that some residues of RapZ necessary for GlmZ cleavage are also important for its interaction with RNase E (Fig. S11). I would suggest that the authors measure co-immunoprecipitation of GlmZ with tagged-RapZ with or without substitutions in the proposed RNA binding residues to resolve this issue. Alternatively, EMSAs could be performed.

      Our reply. Please see the response above to reviewer 1. We have included results from EMSAs with selected RapZ mutants and for multiple mutations in the BACTH analysis.

      Major comments:

      Overall, the structural studies our impressive and provide considerable insight into the recognition of substrates by RapZ and RNase E. Given the dearth of solved structures of RNAs with their cognate RNA binding proteins, these results are very significant.

      A limitation in this work is the lack of experiments directly testing whether or not the residues of RapZ that appear to be important for its interaction with the GlmZ sRNA in the authors' Cryo-EM structures actually have a significant role in RNA binding. In lieu of measuring GlmZ binding by RapZ, the authors measure GlmZ cleavage in strains expressing RapZ or particular variants harboring substitutions in residues that appear to play a role in sRNA binding (Fig. 3b); however, it is impossible for the authors to determine whether impairment of GlmZ cleavage by RNase E in their assays is due to lack of GlmZ binding to RapZ, extraordinarily tight binding of GlmZ to RapZ, changes in the orientation of GlmZ bound to RapZ, or conformational changes in RapZ that lead to disruption of direct RapZ-RNase E contacts. The lack of this empirical data supporting their structural studies becomes more salient as the authors attempt to test whether RapZ binding of GlmZ is important for its interaction with RNase E via a bacterial two-hybrid assay. Since the authors have not directly examined the importance of particular RapZ residues on GlmZ binding, the authors' interpretation of their results from these assays is very speculative.

      Our reply: Reviewer 1 raised a similar point to which we replied above. The role of candidate residues in RapZ for binding GlmZ has been addressed by more direct assays (Pull-down/EMSA).

      The authors state on page 7 that "the interaction of RapZ:GlmZ with RNase E does not involve conformational rearrangement of either RapZ or GlmZ". However, the arrangement of SLII relative to SLI appears different between the RapZ:GlmZ and RapZ:GlmZ:RNase E structures presented. Additionally, SLII appears entirely bound by RapZ in the binary complex (Fig. 2b), whereas in the structure of the ternary complex, SLII appears less associated with RapZ (Fig. S4b). A supplementary figure showing side-by-side the structure of GlmZ bound to RapZ solved in the presence or absence of RNase E may make clear whether any differences that exist in the conformation of RapZ and GlmZ between the binary and ternary complex structures.

      Our reply: In the revised manuscript, we have included a supplementary figure showing side-by-side comparisons of the structures.

      Minor comments: Figure S1 legend. Change "inactivate" to "inactive" or "inactivated"

      Figure S2 legend. The description for "(d)" is for S2c and the text for "(c)" refers to the image in S2d.

      Figure legend S5a and S9a. If resolution in the key is in angstroms, then it should be indicated.

      Our reply: We have now corrected the above points in the revised text.

      Figure 1. The model appears to indicate that the apo-form of RapZ binds GlmZ and GlmY, whereas the GlcN-6P bound form does not. Moreover, in the discussion, the authors indicate that GlcN-6P interferes with GlmZ binding to RapZ. How does RapZ bind and cleave GlmZ when GlcN-6P levels are high, if GlcN-6P interferes with GlmZ binding? It would be useful for the authors to address this conundrum in their discussion.

      Our reply. We thank the reviewer for pointing out this paradox. Our unpublished work indicates that RapZ may have phosphatase activity for GlcN6P, and we added a comment to this in the discussion section.

      Fig. S3B and C. While panels in Fig. S3B and C seemed well aligned, numbering of lanes would provide additional clarity.

      We will provide lane numbers, accordingly.

      Many bacterial species including Bacillus subtilis, Streptococcus pyogenes, and Clostridium botulinum have RapZ homologs that bear a tyrosine instead of a histidine residue at the position corresponding to H190 in E. coli RapZ. Would you expect this change to reduce GlmZ binding by RapZ or lead to change in RNA specificity based on your structural data? This may be useful to discuss in the manuscript.

      We believe that the is more behind this question. Likely, the referee (by inspecting a RapZ sequence alignment) realized that almost all residues proposed to be involved in binding GlmZ are also conserved in RapZ homologs in Gram-positive bacteria, unless His190 and His171, which are replaced by tyrosines in some of these species. However, no RNA-binding activity has been reported for the Gram-positive RapZ homologs. If true, the question arises what is making the difference here? In principle, this could be due to the lacking histidine residues, which are replaced by tyrosines in Gram-positive RapZs. Alternatively, we consider that the positively charged residues at the far C-terminus (K270, K281, R282, K283), which were identified previously to be required for sRNA binding (Göpel et al., 2013; Durica-Mitic et al., 2020), and which could not be resolved in the current structures, are additionally required to obtain RNA-binding activity.

      Fig. S10. It is confusing to me that the yellow chain in the structure of RNase E is labeled as the DNase I-domain in the apo structure, whereas in the structure with RprA or GlmZ bound, this colored region is labeled as the 5' sensing domain.

      We have changed the figure to make it clearer.

      On page 12, the authors appear to indicate that their structural studies of the RapZ-GlmZ-RNase E ternary complex could be informative with regards to how KH domain proteins in Gram-positive bacteria could present their substrates to RNase E. First of all, these bacteria lack RNase E and instead encode an evolutionarily distinct endoribonuclease (RNase Y). Secondly, I think that it is overreaching to state that these structural studies will inform us on how KH domain proteins such as KhpA/KhpB, which may or may not have a chaperoning function akin to Hfq in Gram-positive bacteria, present substrates to RNase Y. Regardless, if this statement is to remain, the authors should make clear that is RNase Y and not RNase E that they are referring to.

      We have changed the text to make clear that a different RNase is employed in this case.

      Reviewer #2 (Significance (Required)):

      In my opinion, the significance of this work is in the achievement of high-resolution structures of the complexes of the RNA binding protein RapZ and the endoribonuclease RNase Y with RNA substrate bound. There are very few structures solved of RNA binding proteins or RNases with their cognate substrates. This is likely due to the difficult in obtaining high resolution data for the bound RNA that may have a large degree of flexibility or many alternative conformations. More structures like this are needed to advance our understanding of RNA-protein interactions.

      I believe that these findings would not only be of great interest to those that study small regulatory RNAs, such as myself, but also others more generally interested in RNA binding proteins, RNases, or protein-RNA interactions.

      Field of expertise: small regulatory RNAs, RNA chaperones, RNases

      **Referees cross-commenting**

      1. I agree with Reviewer #1 that the results of the bacterial two-hybrid assay would be more informative, if the authors tested the impact of deletion of glmZ on the ability of the wild type and mutant RapZ proteins to interact with RNase Y by this assay.

      As both reviewer #1 and I indicated, I think that it would be useful for the authors to directly assess the effect of key substitutions in RapZ on GlmY binding by a more direct measure of interaction, e.g., CoIP or EMSA.

      I do think that it would be nice at some point for the authors to actually provide evidence that GlcN6P binds to the site that they predict as reviewer 3 suggested but this may be be beyond the scope of this manuscript and may be better addressed in another manuscript in which the authors solve the structure of RapZ with GlcN6P bound. In the meantime, the authors could limit their speculation.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: The biogenesis of the bacterial cell envelope relies on glucosamine-6-phosphate (GlcN6P), which is mediated by GlmZ and the sRNA-binding protein RapZ. GlmZ stimulates translation of the GlcN6P synthetase. When the levels of the GlcN6P are sufficiently high, RapZ will presents GlmZ to the endoribonuclease RNase E for cleavage and thereby silencing synthesis of the GlcN6P synthetase. However, how RapZ recruit RNase E to GlmZ for degradation is still unsolved. This paper reports the cryoEM structure of the binary complex of RapZ: GlmZ and the ternary complex of the RNase E catalytic domain (RNase E-NTD), RapZ and GlmZ. RapZ interacts with SLI and SLII of GlmZ through complementarity in shape and electrostatic charge to the phosphodiester backbone of the sRNA and presents the sRNA by alignning its SSR comprising the cleavage site into the RNase E active center. This paper suggests a general RNase E recognition pathway for complex substrates, which will help to understand the mechanisms that other RNA chaperones such as Hfq might work in an analogous assembly to present base-paired sRNA/mRNA pairs for cleavage. In total, this is an excellent work. I will support the publication of it until these following points are presented.

      Major comments: 1. It was mentioned on Page 5 that "Sulphate and malonate ions were previously seen at these positions in crystal structures of apo RapZ" and pn Page 11 that " Interestingly, the phosphate groups of the RNA backbone occupy positions in RapZ that were previously observed to bind sulphate or malonate ions in the crystal structure of apo-RapZ, suggesting that this pocket could be the binding site for a charged metabolite such as GlcN6P". Is there any following experiments to investigate it further? If possible, I suggest the author to confirm that weather RapZ has the binding activity with GlcN6P or not.

      Binding of GlcN6P by the RapZ-CTD was demonstrated previously by SPR as well as by metabolomics of metabolites copurifying with RapZ (Khan et al., 2020), although evidence that the “sulphate/malonate binding sites” in RapZ also bind GlcN6P is still lacking. Crystallization of RapZ+GlcN6P is not straight forward as bound GlcN6P is apparently hydrolyzed over time.

      "The kinase-like N-terminal domain of RapZ (NTD) makes only a few interactions with the RNA, and the path of the RNA does not encounter the Walker A or B motifs (Figure 2b). It is possible that this domain could act as an allosteric switch, whereby the binding of an as yet unknown ligand triggers quaternary structural changes that affect RapZ functions." Is there any more structural information supporting it? If the domain act as an allosteric switch, is it possible to make some deletion or substitution to test it?

      The properties of the separated NTD and CTD of RapZ were assessed in previous work.

      Is there any results to compare the binding affinity of GlmY and GlmZ with RapZ?

      Affinities were determined previously using complimentary techniques:

      Göpel et al., 2013/EMSA: KD GlmY ~ 30 nM; KD GlmZ ~ 75 nM

      Gonzalez et al., 2017/biolayer interferometry: ~ 50 nM for both GlmY/GlmZ (full-length)

      Minor comments: 1. Page 8, is it "stabilised" or "stabilized", please check it.

      We have changed the spelling to “stabilized”.

      The legends for Figure S2 c and d are reversed.

      This has now been corrected.

      It was suggested to show the RNA molecules in Figure S1a.

      We have changed the figure to include single-stranded RNA substrate.

      Reviewer #3 (Significance (Required)):

      This paper suggests a general RNase E recognition pathway for complex substrates, which will help to understand the mechanisms that other RNA chaperones such as Hfq might work in an analogous assembly to present base-paired sRNA/mRNA pairs for cleavage. In total, this is an excellent work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The biogenesis of the bacterial cell envelope relies on glucosamine-6-phosphate (GlcN6P), which is mediated by GlmZ and the sRNA-binding protein RapZ. GlmZ stimulates translation of the GlcN6P synthetase. When the levels of the GlcN6P are sufficiently high, RapZ will presents GlmZ to the endoribonuclease RNase E for cleavage and thereby silencing synthesis of the GlcN6P synthetase. However, how RapZ recruit RNase E to GlmZ for degradation is still unsolved. This paper reports the cryoEM structure of the binary complex of RapZ: GlmZ and the ternary complex of the RNase E catalytic domain (RNase E-NTD), RapZ and GlmZ. RapZ interacts with SLI and SLII of GlmZ through complementarity in shape and electrostatic charge to the phosphodiester backbone of the sRNA and presents the sRNA by alignning its SSR comprising the cleavage site into the RNase E active center. This paper suggests a general RNase E recognition pathway for complex substrates, which will help to understand the mechanisms that other RNA chaperones such as Hfq might work in an analogous assembly to present base-paired sRNA/mRNA pairs for cleavage. In total, this is an excellent work. I will support the publication of it until these following points are presented.

      Major comments:

      1. It was mentioned on Page 5 that "Sulphate and malonate ions were previously seen at these positions in crystal structures of apo RapZ" and pn Page 11 that " Interestingly, the phosphate groups of the RNA backbone occupy positions in RapZ that were previously observed to bind sulphate or malonate ions in the crystal structure of apo-RapZ, suggesting that this pocket could be the binding site for a charged metabolite such as GlcN6P". Is there any following experiments to investigate it further? If possible, I suggest the author to confirm that weather RapZ has the binding activity with GlcN6P or not.
      2. "The kinase-like N-terminal domain of RapZ (NTD) makes only a few interactions with the RNA, and the path of the RNA does not encounter the Walker A or B motifs (Figure 2b). It is possible that this domain could act as an allosteric switch, whereby the binding of an as yet unknown ligand triggers quaternary structural changes that affect RapZ functions." Is there any more structural information supporting it? If the domain act as an allosteric switch, is it possible to make some deletion or substitution to test it?
      3. Is there any results to compare the binding affinity of GlmY and GlmZ with RapZ?

      Minor comments:

      1. Page 8, is it "stabilised" or "stabilized", please check it.
      2. The legends for Figure S2 c and d are reversed.
      3. It was suggested to show the RNA molecules in Figure S1a.

      Significance

      This paper suggests a general RNase E recognition pathway for complex substrates, which will help to understand the mechanisms that other RNA chaperones such as Hfq might work in an analogous assembly to present base-paired sRNA/mRNA pairs for cleavage. In total, this is an excellent work.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Islam et al present their characterization of the E. coli RapZ-GlmZ-RNase E ternary complex in this manuscript under review. In E. coli, the RNA binding protein RapZ facilitates cleavage of GlmZ sRNA by RNase E when intracellular concentrations of GlcN-6P are high; when GlcN-6P levels are low RapZ is titrated by GlmY sRNA and GlmZ sRNA promotes an increase in the translation and stability of the mRNA encoding GlcN-6P synthase, GlmS. Via Cryo-EM, the authors of this manuscript solve the structure of the binary RapZ:GlmZ (Fig. 2) and ternary RapZ:GlmZ:RNase Y (Fig. 4) complexes. Based on the apparent RapZ-sRNA binding sites in the solved structure of the binary complex, the authors make substitutions in residues suspected to be involved in RNA binding and measure the impact of these substitutions on cleavage of GlmZ and GlmZ-mediated activation of GlmS expression (Fig. 3). The authors find that some of the residues predicted to be involved in RNA binding based on their structural studies are also important for the cleavage of GlmZ, presumably by RNase E. Finally, the authors show via bacterial two-hybrid assays that some residues of RapZ necessary for GlmZ cleavage are also important for its interaction with RNase E (Fig. S11). I would suggest that the authors measure co-immunoprecipitation of GlmZ with tagged-RapZ with or without substitutions in the proposed RNA binding residues to resolve this issue. Alternatively, EMSAs could be performed.

      Major comments:

      Overall, the structural studies our impressive and provide considerable insight into the recognition of substrates by RapZ and RNase E. Given the dearth of solved structures of RNAs with their cognate RNA binding proteins, these results are very significant.

      A limitation in this work is the lack of experiments directly testing whether or not the residues of RapZ that appear to be important for its interaction with the GlmZ sRNA in the authors' Cryo-EM structures actually have a significant role in RNA binding. In lieu of measuring GlmZ binding by RapZ, the authors measure GlmZ cleavage in strains expressing RapZ or particular variants harboring substitutions in residues that appear to play a role in sRNA binding (Fig. 3b); however, it is impossible for the authors to determine whether impairment of GlmZ cleavage by RNase E in their assays is due to lack of GlmZ binding to RapZ, extraordinarily tight binding of GlmZ to RapZ, changes in the orientation of GlmZ bound to RapZ, or conformational changes in RapZ that lead to disruption of direct RapZ-RNase E contacts. The lack of this empirical data supporting their structural studies becomes more salient as the authors attempt to test whether RapZ binding of GlmZ is important for its interaction with RNase E via a bacterial two-hybrid assay. Since the authors have not directly examined the importance of particular RapZ residues on GlmZ binding, the authors' interpretation of their results from these assays is very speculative.

      The authors state on page 7 that "the interaction of RapZ:GlmZ with RNase E does not involve conformational rearrangement of either RapZ or GlmZ". However, the arrangement of SLII relative to SLI appears different between the RapZ:GlmZ and RapZ:GlmZ:RNase E structures presented. Additionally, SLII appears entirely bound by RapZ in the binary complex (Fig. 2b), whereas in the structure of the ternary complex, SLII appears less associated with RapZ (Fig. S4b). A supplementary figure showing side-by-side the structure of GlmZ bound to RapZ solved in the presence or absence of RNase E may make clear whether any differences that exist in the conformation of RapZ and GlmZ between the binary and ternary complex structures.

      Minor comments:

      Figure S1 legend. Change "inactivate" to "inactive" or "inactivated"

      Figure S2 legend. The description for "(d)" is for S2c and the text for "(c)" refers to the image in S2d.

      Figure legend S5a and S9a. If resolution in the key is in angstroms, then it should be indicated.

      Figure 1. The model appears to indicate that the apo-form of RapZ binds GlmZ and GlmY, whereas the GlcN-6P bound form does not. Moreover, in the discussion, the authors indicate that GlcN-6P interferes with GlmZ binding to RapZ. How does RapZ bind and cleave GlmZ when GlcN-6P levels are high, if GlcN-6P interferes with GlmZ binding? It would be useful for the authors to address this conundrum in their discussion.

      Fig. S3B and C. While panels in Fig. S3B and C seemed well aligned, numbering of lanes would provide additional clarity.

      Many bacterial species including Bacillus subtilis, Streptococcus pyogenes, and Clostridium botulinum have RapZ homologs that bear a tyrosine instead of a histidine residue at the position corresponding to H190 in E. coli RapZ. Would you expect this change to reduce GlmZ binding by RapZ or lead to change in RNA specificity based on your structural data? This may be useful to discuss in the manuscript.

      Fig. S10. It is confusing to me that the yellow chain in the structure of RNase E is labeled as the DNase I-domain in the apo structure, whereas in the structure with RprA or GlmZ bound, this colored region is labeled as the 5' sensing domain.

      On page 12, the authors appear to indicate that their structural studies of the RapZ-GlmZ-RNase E ternary complex could be informative with regards to how KH domain proteins in Gram-positive bacteria could present their substrates to RNase E. First of all, these bacteria lack RNase E and instead encode an evolutionarily distinct endoribonuclease (RNase Y). Secondly, I think that it is overreaching to state that these structural studies will inform us on how KH domain proteins such as KhpA/KhpB, which may or may not have a chaperoning function akin to Hfq in Gram-positive bacteria, present substrates to RNase Y. Regardless, if this statement is to remain, the authors should make clear that is RNase Y and not RNase E that they are referring to.

      Significance

      In my opinion, the significance of this work is in the achievement of high-resolution structures of the complexes of the RNA binding protein RapZ and the endoribonuclease RNase Y with RNA substrate bound. There are very few structures solved of RNA binding proteins or RNases with their cognate substrates. This is likely due to the difficult in obtaining high resolution data for the bound RNA that may have a large degree of flexibility or many alternative conformations. More structures like this are needed to advance our understanding of RNA-protein interactions.

      I believe that these findings would not only be of great interest to those that study small regulatory RNAs, such as myself, but also others more generally interested in RNA binding proteins, RNases, or protein-RNA interactions.

      Field of expertise: small regulatory RNAs, RNA chaperones, RNases

      Referees cross-commenting

      1. I agree with Reviewer #1 that the results of the bacterial two-hybrid assay would be more informative, if the authors tested the impact of deletion of glmZ on the ability of the wild type and mutant RapZ proteins to interact with RNase Y by this assay.
      2. As both reviewer #1 and I indicated, I think that it would be useful for the authors to directly assess the effect of key substitutions in RapZ on GlmY binding by a more direct measure of interaction, e.g., CoIP or EMSA.
      3. I do think that it would be nice at some point for the authors to actually provide evidence that GlcN6P binds to the site that they predict as reviewer 3 suggested but this may be be beyond the scope of this manuscript and may be better addressed in another manuscript in which the authors solve the structure of RapZ with GlcN6P bound. In the meantime, the authors could limit their speculation.
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      GlmS, the glucosamine-6-phosphate synthetase in E. coli and related bacteria, is essential, required for synthesis of both peptidoglycan and LPS. It is regulated at various levels, including positive regulation of GlmS translation by the Hfq-binding sRNA GlmZ. GlmZ activation of translation is regulated, indirectly, by the levels of GlcN6P, the product of GlmS. The components of the sensing and regulatory cascade have previously been defined, via genetics, biochemical and molecular biology studies. GlmZ is cleaved by Rnase E, becoming inactive, when GlcN6P levels are high, dependent upon the binding of GlmZ to RapZ. RapZ has been found to directly sense GlcN6P levels; another regulatory RNA, GlmY, also binds RapZ in the absence of GlcN6P, protecting GlmZ from RapZ-mediated processing. The authors of this manuscript performed cryoEM to study the structure of two important complexes in this sensing cascade, RapZ/GlmZ and RapZ/GlmZ/RNase E-NTD, with the aim of clarifying how the RNA binding protein RapZ causes the cleavage of sRNA GlmZ by RNaseE. Some of the predictions for critical residues in the RapZ/GlmZ binary complex structure were investigated by mutagenesis RapZ to define essential resiudes for GlmZ cleavage; the results are consistent with the structure.

      Major comments:

      • Are the key conclusions convincing?
        1. Given that this is basically a structural paper, the major questions would be whether the cryoEM reconstructions are accurate (appear to be consistent with general expectations) and whether there is clear evidence to support the physiological relevance of the structure. The tests of function are of two sorts:
          • a) Effect of RapZ mutants in Fig. 3b-d. These tests show loss of RapZ function with various alleles, likely consistent with model (but as noted below, very difficult for the reader to identify on the structures in 3a). The implication is that these will interfere with GlmZ binding. Possibly direct tests of a couple of these mutants for GlmZ binding (or pull down of GlmZ from in vivo expressed protein) would further support the model. I note that the text says T248A was unaffected in cleavage, but seems to be much reduced in Fig. 3b, even if fusion activity is good.
          • b) The ternary complex was tested primarily by the BACTH assay of some RapZ mutants (Fig. S11), that show a reduced interaction. This is not a particularly convincing assay for a number of reasons: 1) the effects are relatively modest (2x down, in an assay that is probably not very linear with interaction, 2) some with reduced interaction (S239A, T248A) had good activity (at least all those with full interaction seem to be functional); 3) Ternary complex suggests that RapZ mediates this interaction; this could be tested by deleting glmZ (and maybe glmY as well) from this BACTH strain. 4) the authors suggest that there are also important protein-protein interactions, based on some observed interactions, and support this with similarly difficult to interpret BACTH data from a previous paper for Rnase E-RapZ interaction. Here, too, that is not the most compelling data (is this interaction RNA-independent?).
      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Possibly the importance of RNAse E-RapZ direct interaction, without further proof that this actually is needed for function.
      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. As noted above, further tests of RapZ mutants for RNA binding would be useful; if this has been done previously, needs to be presented. Are there Rnase E residues that would be predicted by the model to be critical for the RapZ or GlmZ interaction but are not otherwise needed for activity? Would these disrupt either the BACTH results or activity in vivo?
      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. Yes, they are. They are generally extrapolations from what is already in the paper or in previous studies by these groups.
      • Are the data and the methods presented in such a way that they can be reproduced? Yes.
      • Are the experiments adequately replicated and statistical analysis adequate? Yes.

      Minor comments:

      • Specific experimental issues that are easily addressable. None noted.
      • Are prior studies referenced appropriately? Yes, they are. However, the paper could more clearly outline what is already known at the level of interactions of the molecules under study here.
      • Are the text and figures clear and accurate?
        1. In a number of places, the text and figure order/numbers are not correct. See Fig. S1 (p. 4), S2 (legends vs. figure panels).
        2. Better labeling in many figures is needed. Clarify what is shown in Fig. S2d, and make the labels readable. Label the particle types in S3. Use schematics more (as in Fig. 4 and S8) to make it easier for reader to follow structure (for Fig. 2, for instance). It is very difficult to discern RapZ tetramer here. Fig. 3a, it is very difficult to see the residue numbers on the structures. Clearly identify the fructokinase-like domains. Label lanes in Fig. 3b, c, d. Indicate active site for RNase E. in Fig. 4, in schematic at least.
      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? Overall, clarify and highlight better how the structures here fit with what is already known about important sequences/regions of RapZ, GlmZ, and Rnase E, maybe color-coding parts of GlmZ shown to be important for RapZ recognition, etc.<br /> Page 12, the second last row. Text after 'In this model...' can be simplified or removed because it is just a hypothesis.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Rnase E is a major essential endonuclease in bacteria such as E. coli. How accessory proteins lead to its recognition and cleavage of regulatory RNAs such as GlmZ is not well understood at the structural level, and these structures provide important insight into that process. In addition, the GlmZ/RapZ regulatory circuit plays an important role in bacterial growth and pathogenesis, and understanding it at this level of detail will certainly open up possibilities for targeting this process in the future. <br /> - Place the work in the context of the existing literature (provide references, where appropriate).

      The components that go into the current structures have been studied previously, with publications on RapZ structure, analysis of critical regions within the GlmZ RNA, and demonstration of the domain of Rnase E involved in interactions with RapZ (Durica-Mitic et al, 2020; Khan et al, 2020, Gonzalez et al, 2017, among others), exactly how these fit together has not been known. Other RNA binding proteins that affect degradation have been reported, but are not fully understood, and ways in which the ribonuclease binds complex RNAs is not fully understood either.<br /> - State what audience might be interested in and influenced by the reported findings.

      This work should be of broad interested to the field of RNA-based regulation and RNA degradation, with particular interest for those working on these processes in bacteria.<br /> - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Our expertise is in RNA-based regulation and microbial genetics; we are not able to critically evaluate the cryoEM analysis itself.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Spinal cord injury (SCI) is a damage to the spinal cord, that causes temporary or permanent changes in its function. While in mammals the regeneration process are very limited zebrafish are able to repair the spinal cord. Based on the hypothesis, that the vascular response might affect the regeneration capacity, the paper by Ribeiro et al addresses the structure and injury response of the spinal cord vasculature. As the growth of zebrafish larvae and juveniles depends a lot on the individual response to the environment, the authors first established comparable body measurement parameters (other than age) and observed the natural spinal cord vascularization process, starting from 6mm body length of the animals. Using transgenic lines the authors describe the formation and patterning of endothelial cells and pericytes up to 9mm length, when a more developed vascular network was present. They observe the processes of vascular regeneration after a contusion based SCI model at different time points (days post injury (dpi)) and in correlation with glial and axonal regrowth, also observing BSCB barrier integrity, angiogenesis, pericyte recruitment and the dependence on Vegf signaling.

      The study is interesting and novel, vascular structures in the zebrafish adult spinal cord have not been reported yet and neither has the vascular response to SCI. Currently the study remains very descriptive, although the authors tried to add functional data, by inhibiting Vegf signaling.

      Major points for revision: The authors fail to establish whether there is any relationship between spinal cord regeneration and vessel regeneration. While I do very well understand the challenges and limitations the authors should put more effort into functional analyses.

      For example: the authors address EC proliferation as a marker for angiogenesis, but do not analyse whether or how much EC proliferation is required for revascularization and regeneration. Pharmacological inhibition of proliferation should be possible and used. From a vascular point of view it would also be interesting whether there is a differential influence of tip or stalk cell proliferation.

      Although we agree that it would be interesting to inhibit EC proliferation to assess its role in spinal cord regeneration, the use of proliferation inhibiting drugs would likely have a widespread effect on the lesioned spinal cord, since many cell types proliferate in response to injury. Therefore, a pharmacological approach would not allow us to dissect the specific role of endothelial proliferation.

      The same is true for pericyte recruitment: the role of pericytes for the vascular repair or the spinal cord regeneration is not clear. The authors could use use mutants with impaired pericyte development or e.g. nitroreductase mediated ablation of pericytes.

      These experiments have been performed in larvae by Tsata et al. (2021). Although it would be interesting to repeat in adults, we believe that these experiments are beyond the focus of our study.

      The statements regarding the role of Vegf are too bold. The problem lies in the limitations of assessing the efficiency of Vegf inhibition. The heatshock promotor has been shown to induce transcription for up to 4 hours, depending on the efficiency of heatshock. There are no data on the stability of dnVegfaa protein. Likewise the pharmacological inhibition could be far from complete. A full inhibition of Vegf signaling is expected to stop vessel growth or angiogenesis. While it is a sign of good practice, that the authors combined a genetic model with a pharmacological one, both leave the same unresolved issue. However if we believe a very limites requirement of Vegf-signaling, it would be interesting to look for other signaling pathways, like cxcl, IL, or FGF to regulates regenerative angiogenesis.

      We agree that our data does not allow us assess the level of inhibition of the Vegf pathway. Since we are unable to confirm this at the moment, we will be excluding the Vegf inhibition data and make this a descriptive study.

      Minor issues

      The correlation with spinal cord repair could be stated more clearly throughout the manuscript. For the uninformed reader it is less clear when exactly the spinal cord is functional again.

      We will include in Fig. 3 a plot of the swimming capacity in contusion-injured fish until 90 dpi and will explain in the text how the vascular response correlates with the functional recovery.

      While I find the model in figure 8 very helpful, it gives 5 to 30 days, for the neuronal regeneration. Maybe a more detailed timeline of EC regeneration and remodeling correlating with neuronal repair would help.

      We will update the model in Fig. 8 with a more detailed timeline and a better description of structures important for regeneration (glial bridge, axonal regrowth).

      In line with that in figure 4 it is not clear whether the images of different time points are indeed one individual animal at the different time points or representative animals for the stage (also figure 4 lacks panel labels, in my copy I can see A, K and L, but no other letters).

      We will detail in the figure legend that the images are of different animals that are representative for each stage.

      For understanding the (re)vascularization, the direction of blood flow might be helpful.

      We will perform an additional experiment to characterise the direction of blood flow in uninjured fish. For this we will use juvenile fish with a body size of 7-9 mm, in which we expect to be able to perform live imaging. We will use a lighsheet microscope to image circulating cells in the spinal cord blood vessels in fish with labelled thrombocytes (Tg(-6.0itga2b:EGFP); Lin et al., 2005) and endothelial cells (Tg(kdrl:ras-mCherry)). These transgenic lines are already available in our fish facility. Even though the vascular network has not yet reached its mature stage at these body sizes, we expect to have enough intraspinal vessels to describe the blood flow circuit.

      Especially for the connection between spinal cord regeneration and vessel regeneration. Does blood flow regulate vessel pruning after 14 dpi?

      Although we agree with the reviewer that it would be interesting to understand how blood flow direction is reestablished in repaired vessels and how blood flow levels correlate with vessel remodelling and pruning, this would be difficult to assess in this system. This could be examined using live imaging, but this technique is challenging in adult zebrafish and has only been carried out in more superficial organs than the spinal cord, such as skin (Castranova et al., 2022) and superficial brain structures (Barbosa et al., 2015; Castranova et al., 2021). In addition, SC-injured fish are more sensitive to external conditions and would probably not survive the long-term/repeated anaesthesia required for imaging.

      This analysis could be performed in fixed samples, for example using the the Golgi complex position in relation to the endothelial nuclei as a proxy for blood flow direction (Kwon et al., 2016), however: (1) this would require a new transgenic line (Tg(fli1a: B4GALT1-mCherry)) that would take time to import and establish in the lab; (2) the identification of regressing vessels is not straightforward in fixed samples and is usually studied in very well established vascular models, such as the mouse retina and zebrafish ISVs (Franco et al., 2015).

      For these reasons, we will not address this question by reviewer 1.

      The combined Vegfaa DN and PTK treatment data looks like it could be inhibiting endothelial cell proliferation (Figure7I).However, Supplementary Figure 8B shows endothelial proliferation does not change. Does it mean the number of endothelial cells is same but the volume of endothelial cells decrees?

      We will not be addressing the changes in endothelial density in the presence of dn-vegfaa and PTK787, since we will be removing the figures related to Vegf inhibition.

      There are also some remaining grammatical errors, for example (but NOT limited to) line 133 to 135.

      We will review grammatical errors in the text.

      As a personal interest I think evaluating the role of Notch in the SCI model would also be very interesting, especially with regard to the vasculature, however that might be out of the scope of the manuscript.

      We agree that Notch signalling may be a player during spinal cord revascularisation. However, mutants for dll4 (the Notch ligand involved in angiogenesis) die between 7-14 dpf and cannot be used for this study. In addition, the use of Notch-inhibiting drugs would likely have pleiotropic effects, since the Notch pathway is also involved in other aspects of spinal cord regeneration, namely in the regulation of regenerative neurogenesis (Dias et al., 2012). To our knowledge, tools that allow the endothelial-specific inhibition of the Notch pathway have not been developed, and therefore we will not be able to address this question.

      Reviewer #1 (Significance (Required)):

      The study is partially descriptive, but very novel as the aspects of vascularisation in a spinal cord injury model have not been described before. If the major revisions regarding functionality are addressed fully, I would wholeheartedly recommend publication and expect an interest for a broad audience. The presented images and their analyses are of very high quality, and therefore also enhance the impact of the study.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The study by Ribeiro et al. investigates the formation of new blood vessels after spinal cord injury in adult zebrafish. The authors initially characterize the extend of spinal cord vascularization during the development of juvenile zebrafish and investigate the association of pericytes with the newly forming vasculature. They then injure the spinal cord and describe the subsequent regeneration of blood vessels. They perform assays to analyze the functionality of the newly forming blood vessels and show that initially blood vessels are leaky. Through EdU labelling the authors show that endothelial cells proliferate. Pericytes similarly increased in numbers. Lastly, the authors inhibited VEGF signaling, which only mildly affected vascular regeneration.

      Together, this manuscript describes the re-vascularization of the regenerating spinal cord in adult zebrafish and addresses how blood vessels mature during this process through pericyte recruitment and decrease in leakiness. The manuscript provides some interesting initial insights into spinal cord vascularization, but is mainly descriptive and unfortunately remains superficial in this regard, as specified below:

      1. The authors only refer to "blood vessels" without specifying the type of blood vessels they observe (are these veins, arteries, capillaries)? A wealth of markers and transgenic zebrafish lines are available to better characterize spinal cord vessels. This is not necessary in case the authors solely refer to "blood vessels" as they do, but it greatly limits the insights into spinal cord vascularization. For instance, Wild et al. (2017) showed that new vessels apparently sprout from veins in the spinal cord. Is this also true during regeneration?

      We will perform RNA in situ hybridisation using probes for arterial and venous markers. We will assay the expression of arterial markers (dll4, dlc, flt1 and efnb2a) and venous markers (flt4 and ephb4a) in uninjured spinal cord (to characterise vessel identity in homeostasis) and in 3 and 7 dpi spinal cords (to investigate the identify of angiogenic vessels during regeneration).

      1. The authors state that their characterization revealed a "stereotypic organization of blood vessels". However, the organization does not appear to be stereotypic (as I understand this term as looking the same in each fish) at all. Can the authors compare e.g. 3 or 5 wildtype fish and extract features that all fish share and those that differ between fish? This would greatly enhance our understanding of the vascular variability within the wildtype population.

      We will provide an additional figure comparing the spinal cord vasculature in different fish.

      1. The authors show an interesting metameric organization of the vasculature with regions of high vascularization interspersed with sparsely vascularized areas. Are there any morphological landmarks that would precipitate these differences?

      We will acquire light sheet images of adult spinal cords without removing the vertebrae. This will allow us to determine if the metameric organisation is correlated with the vertebral distribution.

      Can the authors check whether they induce a lesion in a highly or poorly vascularized area? This might greatly influence the degree of re-vascularization.

      We always perform the spinal cord injury in the region between neural arches (Dietrich et al., 2021). Once we determine how the vasculature is organised in relation to the vertebrae, we will be able to determine if the lesions are performed in a region of high or low vascularisation.

      1. The same superficial characterization unfortunately also applies to the cell population the authors refer to as "pericytes". Traditionally, pericytes are characterized as being associated with capillaries and sharing a basement membrane with the endothelium. Is this the case here?

      We will further characterise the association between Tg(pdgfrß:citrine)-positive cells and blood vessels using an anti-laminin antibody (#L9393, Sigma) to label the basement membrane. Preliminary results recently acquired indicate that Tg(pdgfrß:citrine)-positive perivascular cells and endothelial cells are both enveloped by the basement membrane, supporting the identity of Tg(pdgfrß:citrine)-positive cells as pericytes. Moreover, pericytes are generally described as solitary mural cells associated with small diameter blood vessels (the type of distribution we observe for Tg(pdgfrß:citrine)-positive cells), whereas vascular smooth muscle cells (vSMCs) form concentric layers around larger blood vessels (a distribution we do not detect with this transgene) (Hellström et al., 1999). For these reasons we believe that this transgene is labelling pericytes. We will explain more clearly in the text how the morphology, localisation and density of Tg(pdgfrß:citrine)-positive cells suggests these cells are pericytes.

      In addition, pdgfrb is hardly specific for pericytes, as it also labels a multitude of other cell types (refer to e.g. Tsata et al. (2021)).

      The different cell types labelled by the pdgfrb reporter line used in the Tsata et al., 2021 paper were identified not by the use of different cell markers, but by their localisation: perivascular cells (the same cell type that we also detect), myoseptal cells (which we would not expect to detect, since we are only analysing the spinal cord tissue and not the adjacent muscle) and floor plate cells (a reporter distribution that the authors show is lost after 3 dpf and is not present in the adult spinal cord). Moreover, the Tsata et al., 2021 paper also includes a supplementary figure (S1, panel N) showing a restricted perivascular pdgfrb:GFP distribution in the wholemount adult spinal cord, in agreement with our characterisation. By their morphology and density, these perivascular cells are likely pericytes, as argued above.

      It is also not clear why the transgenic pdgfrb line the authors use only labels cells next to blood vessels. Tsata et al. show a much broader labelling. The authors need to validate their transgenic line using in situ hybridization showing where pdgfrb is being expressed endogenously and how this overlaps with the fluorescent protein expression of the pdgfrb transgenic line.

      We will perform ISH for pdgfrb to confirm if the Tg(pdgfrß:citrine) reporter reproduces the endogenous expression in the uninjured spinal cord and at 3 and 7dpi. The 3-7 dpi period is approximately equivalent to the 1-2 days post-lesion in larvae and, if the non-perivascular pdgfrb:GFP cells observed in the larval spinal cord are present in the adult, we expect to detect them by ISH during this phase of regeneration.

      There are also several transgenic lines available that allow for the distinction between smooth muscle cells and pericytes (e.g. Shih,..., Lawson, Development 2021 and Whitesell,..., Childs, Plos ONE 2014). As for the vasculature, this more detailed characterization is not necessary in case the authors refer to the cells as "cells labelled by the pdgfrb transgene and reside next to endothelial cells". However, this would not be reflective of the level of detail currently present in the field.

      As we explain above, the morphology and density of the pdgfrb:Citrine-positive cells suggests that these cells are pericytes and not smooth muscle cells (SMCs). To confirm this we will compare the expression of pdgfrb with markers of SMCs (i.e, 𝛼-smooth muscle actin and desmin) using immunohistochemistry and/or ISH.

      The reviewer also suggests the characterisation of pericyte subtypes using the lines described by Shih et al., 2021. Although this would be interesting, we do not consider it is essential for our study. It would be very demanding to import the reporter lines and it is not certain that these subtypes are present in the spinal cord.

      1. The authors state that "New blood vessels rapidly attracted pericytes, formed through proliferation and possibly migration of existing pericytes". This statement is not supported by the data, as the authors do not perform lineage tracing of pre-existing pericytes. The authors need to specifically label existing pericytes and then follow whether these pre-labelled cells can be found on newly forming blood vessels. Tsata et al. provide some evidence for this in zebrafish larvae, but they also conclude that pdgfrb expressing tenocytes contribute to new mural cells.

      We will reformulate the sentence to clarify that we detect pericyte proliferation, but pdgfrb-lineage tracing would be needed to provide evidence that existing pericytes contribute to the generation of mural cells associated to new blood vessels. However, we will not perform the lineage tracing experiment for the revision, as we are unable to currently import this line.

      1. The findings that new blood vessel growth only marginally depended on VEGFA signaling is striking. However, it might also point towards an inefficient inhibition of VEGFA signaling. In particular, other publications, for instance Cattin et al. 2015 have shown that inhibiting VEGFA signaling prevents new blood vessel growth during peripheral nerve regeneration in mouse. It will therefore be important that the authors demonstrate that their approach leads to successful inhibition of VEGFA signaling. VEGFAB mutants appear to be homozygous viable and important for spinal cord vascularization (Matsuoka et al., 2017). In addition, heterozygous VEGFAA mutants already have some vascular phenotypes, but are also viable. Can the authors combine these mutants with their inhibitor treatments to achieve a greater reduction in VEGFA signaling?

      Since we are unable to confirm the level of inhibition of the Vegf pathway and we are unable to import the suggested lines at the moment, we will be excluding the Vegf inhibition data.

      Reviewer #2 (Significance (Required)):

      Together, this publication is the first to describe to some extend the regenerating vasculature after spinal cord injury in adult zebrafish. However, both the vascular and regeneration fields are much more advanced than what the authors cover. Both blood vessels and perivascular cells can be characterized in much more detail, as outlined above. Also, studies on nerve regeneration and its dependence on the vasculature, e.g. during peripheral nerve regeneration in mouse have been carried out with a wealth of functional data available. Therefore, the impact of the present study in its current form will be limited. I am an expert on zebrafish blood vessel development.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      Ribeiro et al. described vascular development in the spinal cord from larval to adult stages in zebrafish, and found the dependence of vessel length on body-size. Then, the authors depicted the vascular regeneration process after spinal cord injury (SCI), which includes initial vascularization, angiogenesis, pericyte recruitment, and blood-spinal cord barrier establishment. Although the molecules or signaling pathways that drive the re-vascularization remain unidentified, this study illustrates the cellular processes of spinal cord vascular development and regeneration from the descriptive level, which may facilitate further understandings of mechanisms underlying vascular regeneration in the spinal cord.

      Major comments: - Are the key conclusions convincing? The descriptions of spinal cord vascularization during development and vascular regeneration after SCI are convincing. However, inhibition of Vegfaa and Vegfr2 is nearly ineffective. The author might not conclude that the Vegfr2 signaling plays any role.

      Since we are unable to confirm the level of inhibition of the Vegf pathway, we will be excluding the Vegf inhibition data.

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Major comments: 1) In Figure 3, the exact injured site on the spinal cord is not clear. Please include a schematic illustration of full spinal cord to show where is the injured site. Are all the injury experiments in this study done at the same site? If not, is there any site difference regarding the regenerative capability.

      We will include a scheme of the injury site in the spinal cord in Fig.3. All the injury were performed in the same position and this will be clarified in the methods.

      2) Figure 2E showed a segmented pattern of spinal cord vasculature. Is this pattern correlated with the position of vertebra?

      We will acquire light sheet images of adult spinal cords without removing the vertebrae. This will allow us to determine if the metameric organisation is correlated with the vertebral distribution.

      3) In Figure 3, during vascular regeneration after SCI, the author only showed partial regeneration at 30 dpi. Why not show the stage of complete regeneration? At that stage, how about the behaviors of the regenerated animals?

      We will add an additional timepoint (90 dpi) to the characterisation of the revascularisation. Moreover, we will include in Fig. 3 a plot of the swimming capacity in contusion-injured fish until 90 dpi and will explain in the text how the vascular response correlates with the functional recovery.

      4) Only EdU data is not sufficient to conclude that new vessels come from proliferation of remaining endothelial cells. For example, these new vessels might come from transdifferentiation of lymphatic vessels, or immune cells, or glial cells, in the meantime proliferate. This could also explain why the inhibition of Vegfr2 signaling is ineffective on new vessel formation. Cre/loxP-mediated lineage tracings need to be performed to exactly identify where these new vessels originate.

      We will clarify in the text that while the detection of endothelial proliferation suggests existing endothelial cells contribute to new vessels, we cannot exclude that other cell types also give rise to endothelial cells. However, regarding the transdifferentiation of immune and glial cells into endothelial cells, to our knowledge few examples have been described in the literature and generally associated with cancers or in in vitro conditions (Fernandez Pujol et al., 2000; Li et al., 2011; Soda et al., 2011). For this reason we do not expect this rare process to occur during spinal cord repair.

      A cell type that has been associated with transdifferentiation into ECs are lymphatic cells (Das et al., 2022). However, we have analysed the expression of a lymphatic marker (Tg(lyve1b:DsRed)) and were only able to detect very few lyve1b:DsRed-positive cells before or after injury, suggesting that any possible lymphatic contribution would likely be very limited. We plan to include these data in the revised submission.

      5) To confirm the Tg(hsp70l:dn-vegfaa) did work in this study, the authors need a positive control. For example, the effects on vasculogenesis or angiogenesis during embryonic development after heat shock. If the transgene works, the vascular development at early stages should be blocked (Marín-Juez et al., 2016).

      We will be removing the vegf inhibition data, therefore we will not address this question.

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. The suggested experiments are realistic in terms of time and resources.

      • Are the data and the methods presented in such a way that they can be reproduced? In the method, the author should describe how to identify the Tg(hsp70l:dn-vegfaa) in more details, because there is no fluorescence before and after heat shock.

      We will be removing the vegf inhibition data, therefore we will not address this question.

      Are the experiments adequately replicated and statistical analysis adequate? Yes.

      Minor comments: - Specific experimental issues that are easily addressable. In Figure 6, from 30 dpi to 90 dpi, the number of pericytes decreased. Did these pericytes undergo apoptosis from 30 dpi on?

      We have not investigated pericyte apoptosis during vessel remodelling. However, this experiment would require the acquisition of long-term samples (between 60 and 90 dpi) and we would prefer not to address this question.

      Are prior studies referenced appropriately? Yes.

      • Are the text and figures clear and accurate? Please clearly labeled the injured region in Figure 6.

      We will identify more clearly the site of the injury in Fig.6.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? The number of proliferating ECs at 3 dpi is more than those at 5 dpi (Figure 5G). But the number of total EdU+ cells at 3 dpi is less than those at 5 dpi (Figure 5A-D). These data are consistent with Figure S3, which showed ECs were the leading cell type to enter the lesioned site, then were the axons and glial cells at later stages. Please explain and discuss whether the regeneration of other cell types is dependent on the accomplishment of vascular regeneration.

      As the reviewer points out, our data suggest that endothelial cells display an earlier peak of proliferation than spinal cord cells in general and colonise the lesioned tissue before new axons and glial cells. Although these observations could point to a role for ECs in the regeneration of other cell types, we would need to inhibit vascular repair to assess this possibility, which we were unable to do using Vegf inhibition. In our discussion we already mention some possible roles for ECs in stem cell proliferation, neurogenesis and axonal regrowth, but can expand this discussion if necessary.

      Reviewer #3 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Although this study has characterized the development and regeneration of spinal cord vasculature in details, the significance of the advance needs to be improved due to lack of mechanisms. Obviously Vegfa is not essential for the vascular regeneration after SCI. It is better for the authors to identify one or two factors required for this process, in addition to identify cell origins of new vessels. With those, the significance of this study will be improved because the cell origins and required factors will provide potential therapeutic targets after SCI.

      • Place the work in the context of the existing literature (provide references, where appropriate).
      • State what audience might be interested in and influenced by the reported findings. The audience includes people who are interested in vascular development and regeneration, and spinal cord clinicians.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. My field of expertise includes brain vascular regeneration, digestive organ development and regeneration. This study reported spinal cord vascular development and regeneration, which fit my expertise.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      Ribeiro et al. described vascular development in the spinal cord from larval to adult stages in zebrafish, and found the dependence of vessel length on body-size. Then, the authors depicted the vascular regeneration process after spinal cord injury (SCI), which includes initial vascularization, angiogenesis, pericyte recruitment, and blood-spinal cord barrier establishment. Although the molecules or signaling pathways that drive the re-vascularization remain unidentified, this study illustrates the cellular processes of spinal cord vascular development and regeneration from the descriptive level, which may facilitate further understandings of mechanisms underlying vascular regeneration in the spinal cord.

      Major comments:

      • Are the key conclusions convincing?

      The descriptions of spinal cord vascularization during development and vascular regeneration after SCI are convincing. However, inhibition of Vegfaa and Vegfr2 is nearly ineffective. The author might not conclude that the Vegfr2 signaling plays any role. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Major comments:

      1. In Figure 3, the exact injured site on the spinal cord is not clear. Please include a schematic illustration of full spinal cord to show where is the injured site. Are all the injury experiments in this study done at the same site? If not, is there any site difference regarding the regenerative capability.
      2. Figure 2E showed a segmented pattern of spinal cord vasculature. Is this pattern correlated with the position of vertebra?
      3. In Figure 3, during vascular regeneration after SCI, the author only showed partial regeneration at 30 dpi. Why not show the stage of complete regeneration? At that stage, how about the behaviors of the regenerated animals?
      4. Only EdU data is not sufficient to conclude that new vessels come from proliferation of remaining endothelial cells. For example, these new vessels might come from transdifferentiation of lymphatic vessels, or immune cells, or glial cells, in the meantime proliferate. This could also explain why the inhibition of Vegfr2 signaling is ineffective on new vessel formation. Cre/loxP-mediated lineage tracings need to be performed to exactly identify where these new vessels originate.
      5. To confirm the Tg(hsp70l:dn-vegfaa) did work in this study, the authors need a positive control. For example, the effects on vasculogenesis or angiogenesis during embryonic development after heat shock. If the transgene works, the vascular development at early stages should be blocked (Marín-Juez et al., 2016).
      6. Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      The suggested experiments are realistic in terms of time and resources. - Are the data and the methods presented in such a way that they can be reproduced?

      In the method, the author should describe how to identify the Tg(hsp70l:dn-vegfaa) in more details, because there is no fluorescence before and after heat shock. - Are the experiments adequately replicated and statistical analysis adequate?

      Yes.

      Minor comments:

      • Specific experimental issues that are easily addressable.

      In Figure 6, from 30 dpi to 90 dpi, the number of pericytes decreased. Did these pericytes undergo apoptosis from 30 dpi on? - Are prior studies referenced appropriately?

      Yes. - Are the text and figures clear and accurate?

      Please clearly labeled the injured region in Figure 6. - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      The number of proliferating ECs at 3 dpi is more than those at 5 dpi (Figure 5G). But the number of total EdU+ cells at 3 dpi is less than those at 5 dpi (Figure 5A-D). These data are consistent with Figure S3, which showed ECs were the leading cell type to enter the lesioned site, then were the axons and glial cells at later stages. Please explain and discuss whether the regeneration of other cell types is dependent on the accomplishment of vascular regeneration.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Although this study has characterized the development and regeneration of spinal cord vasculature in details, the significance of the advance needs to be improved due to lack of mechanisms. Obviously Vegfa is not essential for the vascular regeneration after SCI. It is better for the authors to identify one or two factors required for this process, in addition to identify cell origins of new vessels. With those, the significance of this study will be improved because the cell origins and required factors will provide potential therapeutic targets after SCI. - Place the work in the context of the existing literature (provide references, where appropriate). - State what audience might be interested in and influenced by the reported findings.

      The audience includes people who are interested in vascular development and regeneration, and spinal cord clinicians. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      My field of expertise includes brain vascular regeneration, digestive organ development and regeneration. This study reported spinal cord vascular development and regeneration, which fit my expertise.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The study by Ribeiro et al. investigates the formation of new blood vessels after spinal cord injury in adult zebrafish. The authors initially characterize the extend of spinal cord vascularization during the development of juvenile zebrafish and investigate the association of pericytes with the newly forming vasculature. They then injure the spinal cord and describe the subsequent regeneration of blood vessels. They perform assays to analyze the functionality of the newly forming blood vessels and show that initially blood vessels are leaky. Through EdU labelling the authors show that endothelial cells proliferate. Pericytes similarly increased in numbers. Lastly, the authors inhibited VEGF signaling, which only mildly affected vascular regeneration.

      Together, this manuscript describes the re-vascularization of the regenerating spinal cord in adult zebrafish and addresses how blood vessels mature during this process through pericyte recruitment and decrease in leakiness. The manuscript provides some interesting initial insights into spinal cord vascularization, but is mainly descriptive and unfortunately remains superficial in this regard, as specified below:

      1. The authors only refer to "blood vessels" without specifying the type of blood vessels they observe (are these veins, arteries, capillaries)? A wealth of markers and transgenic zebrafish lines are available to better characterize spinal cord vessels. This is not necessary in case the authors solely refer to "blood vessels" as they do, but it greatly limits the insights into spinal cord vascularization. For instance, Wild et al. (2017) showed that new vessels apparently sprout from veins in the spinal cord. Is this also true during regeneration?
      2. The authors state that their characterization revealed a "stereotypic organization of blood vessels". However, the organization does not appear to be stereotypic (as I understand this term as looking the same in each fish) at all. Can the authors compare e.g. 3 or 5 wildtype fish and extract features that all fish share and those that differ between fish? This would greatly enhance our understanding of the vascular variability within the wildtype population.
      3. The authors show an interesting metameric organization of the vasculature with regions of high vascularization interspersed with sparsely vascularized areas. Are there any morphological landmarks that would precipitate these differences? Can the authors check whether they induce a lesion in a highly or poorly vascularized area? This might greatly influence the degree of re-vascularization.
      4. The same superficial characterization unfortunately also applies to the cell population the authors refer to as "pericytes". Traditionally, pericytes are characterized as being associated with capillaries and sharing a basement membrane with the endothelium. Is this the case here? In addition, pdgfrb is hardly specific for pericytes, as it also labels a multitude of other cell types (refer to e.g. Tsata et al. (2021)). It is also not clear why the transgenic pdgfrb line the authors use only labels cells next to blood vessels. Tsata et al. show a much broader labelling. The authors need to validate their transgenic line using in situ hybridization showing where pdgfrb is being expressed endogenously and how this overlaps with the fluorescent protein expression of the pdgfrb transgenic line. There are also several transgenic lines available that allow for the distinction between smooth muscle cells and pericytes (e.g. Shih,..., Lawson, Development 2021 and Whitesell,..., Childs, Plos ONE 2014). As for the vasculature, this more detailed characterization is not necessary in case the authors refer to the cells as "cells labelled by the pdgfrb transgene and reside next to endothelial cells". However, this would not be reflective of the level of detail currently present in the field.
      5. The authors state that "New blood vessels rapidly attracted pericytes, formed through proliferation and possibly migration of existing pericytes". This statement is not supported by the data, as the authors do not perform lineage tracing of pre-existing pericytes. The authors need to specifically label existing pericytes and then follow whether these pre-labelled cells can be found on newly forming blood vessels. Tsata et al. provide some evidence for this in zebrafish larvae, but they also conclude that pdgfrb expressing tenocytes contribute to new mural cells.
      6. The findings that new blood vessel growth only marginally depended on VEGFA signaling is striking. However, it might also point towards an inefficient inhibition of VEGFA signaling. In particular, other publications, for instance Cattin et al. 2015 have shown that inhibiting VEGFA signaling prevents new blood vessel growth during peripheral nerve regeneration in mouse. It will therefore be important that the authors demonstrate that their approach leads to successful inhibition of VEGFA signaling. VEGFAB mutants appear to be homozygous viable and important for spinal cord vascularization (Matsuoka et al., 2017). In addition, heterozygous VEGFAA mutants already have some vascular phenotypes, but are also viable. Can the authors combine these mutants with their inhibitor treatments to achieve a greater reduction in VEGFA signaling?

      Significance

      Together, this publication is the first to describe to some extend the regenerating vasculature after spinal cord injury in adult zebrafish. However, both the vascular and regeneration fields are much more advanced than what the authors cover. Both blood vessels and perivascular cells can be characterized in much more detail, as outlined above. Also, studies on nerve regeneration and its dependence on the vasculature, e.g. during peripheral nerve regeneration in mouse have been carried out with a wealth of functional data available. Therefore, the impact of the present study in its current form will be limited. I am an expert on zebrafish blood vessel development.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Spinal cord injury (SCI) is a damage to the spinal cord, that causes temporary or permanent changes in its function. While in mammals the regeneration process are very limited zebrafish are able to repair the spinal cord. Based on the hypothesis, that the vascular response might affect the regeneration capacity, the paper by Ribeiro et al addresses the structure and injury response of the spinal cord vasculature. As the growth of zebrafish larvae and juveniles depends a lot on the individual response to the environment, the authors first established comparable body measurement parameters (other than age) and observed the natural spinal cord vascularization process, starting from 6mm body length of the animals. Using transgenic lines the authors describe the formation and patterning of endothelial cells and pericytes up to 9mm length, when a more developed vascular network was present. They observe the processes of vascular regeneration after a contusion based SCI model at different time points (days post injury (dpi)) and in correlation with glial and axonal regrowth, also observing BSCB barrier integrity, angiogenesis, pericyte recruitment and the dependence on Vegf signaling.

      The study is interesting and novel, vascular structures in the zebrafish adult spinal cord have not been reported yet and neither has the vascular response to SCI. Currrently the study remains very descriptive, although the authors tried to add functional data, by inhibiting Vegf signaling.

      Major points for revision: The authors fail to establish whether there is any relationship between spinal cord regeneration and vessel regeneration. While I do very well understand the challenges and limitations the authors should put more effort into functional analyses.

      For example: the authors address EC proliferation as a marker for angiogenesis, but do not analyse whether or how much EC proliferation is required for revascularization and regeneration. Pharmacological inhibition of proliferation should be possible and used. From a vascular point of view it would also be interesting whether there is a differential influence of tip or stalk cell proliferation.

      The same is true for pericyte recruitment: the role of pericytes for the vascular repair or the spinal cord regeneration is not clear. The authors could use use mutants with impaired pericyte development or e.g. nitroreductase mediated ablation of pericytes.

      The statements regarding the role of Vegf are too bold. The problem lies in the limitations of assessing the efficiency of Vegf inhibition. The heatshock promotor has been shown to induce transcription for up to 4 hours, depending on the efficiency of heatshock. There are no data on the stability of dnVegfaa protein. Likewise the pharmacological inhibition could be far from complete. A full inhibition of Vegf signaling is expected to stop vessel growth or angiogenesis. While it is a sign of good practice, that the authors combined a genetic model with a pharmacological one, both leave the same unresolved issue. However if we believe a very limites requirement of Vegf-signaling, it would be interesting to look for other signaling pathways, like cxcl, IL, or FGF to regulates regenerative angiogenesis.

      Minor issues

      The correlation with spinal cord repair could be stated more clearly throughout the manuscript. For the uninformed reader it is less clear when exactly the spinal cord is functional again. While I find the model in figure 8 very helpful, it gives 5 to 30 days, for the neuronal regeneration. Maybe a more detailed timeline of EC regeneration and remodeling correlating with neuronal repair would help. In line with that in figure 4 it is not clear whether the images of different time points are indeed one individual animal at the different time points or representative animals for the stage (also figure 4 lacks panel labels, in my copy I can see A, K and L, but no other letters).

      For understanding the (re)vascularization, the direction of blood flow might be helpful. Especially for the connection between spinal cord regeneration and vessel regeneration. Does blood flow regulate vessel pruning after 14 dpi?

      The combined Vegfaa DN and PTK treatment data looks like it could be inhibiting endothelial cell proliferation (Figure7I).However, Supplementary Figure 8B shows endothelial proliferation does not change. Does it mean the number of endothelial cells is same but the volume of endothelial cells decrees?

      There are also some remaining grammatical errors, for example (but NOT limited to) line 133 to 135.

      As a personal interest I think evaluating the role of Notch in the SCI model would also be very interesting, especially with regard to the vasculature, however that might be out of the scope of the manuscript.

      Significance

      The study is partially descriptive, but very novel as the aspects of vascularisation in a spinal cord injury model have not been described before. If the major revisions regarding functionality are addressed fully, I would wholeheartedly recommend publication and expect an interest for a broad audience. The presented images and their analyses are of very high quality, and therefore also enhance the impact of the study.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1.

      Reviewer #1 summary:

      In this manuscript by Lu et al., the authors describe some CRISPR screens and protein-protein interaction screens to identify novel regulators of wild-type p53 and mutant p53 function and stability. Besides generating a wealth of data, they discover FBXO42-CCDC6 as positive regulators of the some p53 hot-spot mutants, including R273H mutant p53, but not of all p53 mutants tested and also not of wild-type, indicating selectivity. Furthermore, the found C16orf72(TAPR1) as a negative regulator of p53 stability.

      Mechanistically, the authors claim a direct interaction between FBXO42 and CCDC6 and p53, but the importance of these interactions has not been shown. On the other hand the authors suggest that the FBXO42/CCDC6 regulate p53 via destabilization of USP28, but also the mechanism has not been worked out. For C16orf72, they show that it interacts with USP7, but no relevance of this interaction is shown either.

      Response: We sincerely thank the reviewer for the constructive and thorough review. We have incorporated most of the suggestions into our planned revision, with our major focus on the molecular mechanistic follow-up.

      Reviewer #1, major points.

      1. One very important point for me is that the authors do not show the levels of expression of p53 in the p53-mClover stable cell lines. It is known that overexpressed p53 is usualy more stable than endogenous levels of wt-p53. Therefore, I think it is necessary that the authors show the levels of the p53-mClover fusion proteins in the stably transduced cell lines compared to endogenous p53 levels in the parental RPE1 cells and also compared to the endogenous levels of R273H mutant in the PANC-1 cells.

      Response: We fully agree that the levels of overexpressed p53s are often more than the endogenous ones, due in part to increased expression and stability. In designing the reporter, we first tried to avoid the stabilisation of p53-GFP due to GFP aggregation by using the monomeric mClover-variant. Further, we titrated the WT and R273H clones (similar to our recent work in PMID: 35439056), to select clones with p53 levels closer to endogenous protein, and exhibiting high dynamic response to Nutlin-3a treatment.

      In the revised submission, we will include Western blotting comparing the levels of p53-mClover (WT and R273H) expression to the endogenous p53s in RPE1 (WT) and PANC1 (R273H) cell lines, in the presence or absence of Nutlin-3a.

      Also the functionality of the wild-type p53-mClover fusion is questionable, at least not shown. One would expect that the overexpression of a functional wt-p53 in p53-KO cells will affect the survival of the RPE1 cells. In Figure 5A the authors show that depletion of MDM2 or C16ORF72 is toxic for the RPE1 cells in a p53-dependent manner, indicating that elevated levels of p53 cannot be handled by these cells. So, experiment(s) showing that the wt-p53/mClover fusion is functional is needed.

      Response: We agree that it will be an important point to benchmark the reporter design. The ectopically expressed WTp53 is often observed to have reduced functionality compared to the endogenous WTp53. The WTp53-reporter line behaves similarly to the RPE1 line (p53-proficient), where both chemical (e.g. Nutlin) or genetic perturbation (e.g. depletion of MDM2/C16orf72) would be toxic in a p53-depedent manner. In line with this data, we have observed that the WTp53-reporter line is able to induce a p53 response as demonstrated by induction of p53-target genes such as p21, which is not observed in p53 null RPE cells, albeit the p21 induction is not as dramatic as in RPE1 cells with endogenous WTp53. Together, these data indicate that our WTp53-reporter is functional albeit with a somewhat reduced activity.

      In the revised submission, we will better demonstrate the functionality of the WTp53-mClover fusion by probing WTp53 target (e.g. p21), in the presence and absence of Nutlin. This is also performed as a part of the experiment addressing Point #1 above.

      A second important point is that the 'verification' of the hits from the screens is only done in one cancer cell line, PANC-1, with mutant p53. I would have like to see at least one other cell line with another p53 mutant endogenously expressed that is also regulated by FBXO42/CCDC6.

      Response: we will include validation of the hits (FBXO42, CCDC6) in other 1-2 tumour lines with confirmed R273H endogenous mutation (e.g. MB-MDA-468, etc).

      For many of the p53-mutants, a bimodal expression is observed. In the FBXO42- and CCDC6-depleted cells, the equilibrium shifts towards more negative cells but the levels in the two populations itself don’t change (while for example for USP28 depletion also the right peak shifts further up, Fig S4E). Is there any correlation with the cell cycle and p53 expression? And can the authors exclude that FBXO42 and CCDC6 are involved in cell cycle progression and hereby influence p53 indirectly (by combining PI staining with Clover-p53 for example).

      Response: we have indeed observed that the “bimodal” levels in the reporters of several mutants, which are also observed in other studies probing the endogenous p53 level (PMID: 29653964); while the population equilibrium shifts, the location of each peak (as a proxy of the level of p53s) are more stable.

      Regarding the relation between p53-level and cell cycle stage, indeed, both the authors in the paper above and we have probed this possibility, but were unable to establish a direct connection.

      In the revised submission, we will add flow cytometry analysis of the p53-mClover level, and the cell cycle position using Hoechst 33342 (live-cell permeable DNA staining).

      The authors claim that the FBXO42-CCDC6 axis regulates stability specifically some p53-mutants, including R273H-mutant, in a manner involving USP28. But USP28 regulates all forms of p53, not just some mutants version. How can the authors reconcile this apparent contradiction?

      Response: we thank the reviewer for this critical observation. From our screen (Supplemental Table 1A), we have indeed noticed a pronounced effects (|Z score| >=3) of FBXO42 on R273H and R248Q stability, and a marginal effect on wild-type p53. Similarly, USP28 had pronounced effects on R273H and R248Q and WTp53.

      In the discussion of the paper, we noted that USP28 was shown to regulate p53 levels through distinct mechanisms:

      ‘USP28 was originally implicated as a protective deubiquitinating enzyme counteracting the proteasomal degradation of p53, TP53BP1, CHCK2, and additional proteins68-71. USP28 regulates wild-type p53 via TP53BP1-dependent and -independent mechanisms. Concordantly, our data shows that USP28 and TP53BP1 are strong positive regulators of wild-type p53. However, while USP28 was also a strong hit in the mutant R273H p53 screen, TP53BP1 was not, indicating that the effects we see upon loss of USP28 on R273H p53 are independent of TP53BP1.’

      Together, this indicates that the R273H-mutant is regulated by a FBXO42-CCDC6-USP28 axis while wild-type p53 is regulated mainly via a USP28-TP53BP1 axis. We will attempt to address and discuss it in the revision.

      On a similar note, the authors show that FBXO42 and CCDC6 interact with p53, but not USP28. Do FBXO42 and CCDC6 interact with each other and with USP28? And is the interaction with p53 specific for the R273H version? This part of the mechanism is very poorly defined and the Co-IPs are not very convincing or relevant for the proposed model.

      Response: This comment will be more extensively addressed in the revision. We have indeed observed the interaction between FBXO42 and CCDC6 (via BioID and APMS); however, we failed to recover USP28 as an interactor of either FBXO42 or CCDC6. The interaction between CCDC6/FBXO42 is not specific to R273H; although we were able to IP endogenous R273H with CCDC6 in PANC-1 line, the WTp53 (as in HEK 293 TRex BioID line) was also picked up in the BioID preys of CCDC6/FBXO42. In addition, we have new data to show that FBXO42 directly interacts with WTp53.

      In the revised submission, we will improve the molecular underpinning of the FBXO42-CCDC6-USP28-p53 axis we propose. We will specifically address the following.

      (1.1.) Biochemically, further support that CCDC6 and FBXO42 regulate p53 via regulating USP28 stability: We will address this by established biochemical assays, e.g. cycloheximide-chase/MG132 experiment. While USP28 is an established WTp53 regulator, little is known about the mechanism, and the “upstream” regulation of USP28; we will attempt to fill this gap:

      (1.2.) And to an unbiased systematic approach, how R273H interactome changes upon the loss of CCDC6 or FBXO42.

      We will perform R273H-BioID upon loss of CCDC6 and FBXO42 and USP28.

      (1.3.) Furthermore, we will specifically exam the interaction of USP28-p53R273H with or without the genetic perturbation of FBXO42/CCDC6.

      Through these efforts, we hope to gain further mechanistic insights into this regulatory axis, but hope that the editors and reviewers will agree that a fully annotated mechanistic understanding is probably beyond the scope of this paper.

      Reviewer #1, minor points.

      The mechanisms of p53 regulation may vary greatly in different cell lines. Can the authors discuss why they choose to do the screen with different mutants, rather than with different cell lines expressing these same mutant endogenously?

      Response: While it is certainly very interesting to assess how WT and mutant p53 is regulated in different cell lines, such an approach is confounded by the ‘genetic make-up’ of the respective tested cell lines. For example, TP53BP1 might be a regulator in one cell line but not in another for the simple reason that the later cell line harbors a TP53BP1 deletion or mutation or expression levels. In addition, while working with endogenous p53 mutations certainly has many advantages, comparing different mutants in different cell lines is again very much confounded by the ‘genetic make-up’ of the respective tested cell lines.

      Our focus was slightly different, and we wanted to set out and specifically ask what the difference between p53 hotspot mutations are. Are they all the same or are there differences and importantly, are there differences between mutants and WT p53 and this can only be achieved when working in the same cellular background. In designing the screen, we have thus tried to optimise the inclusion of different hotspot mutants in an isogenic screening system. As such, we first depleted the endogenous WTp53 to minimise its interference and built the current isogenic system in the non-transformed RPE1 (“normal”) line.

      However, as discussed above, we agree that the screen results will be validated in more cell lines carrying respective endogenous mutants.

      Figure 1: Typo in the legends : Nultin ipv Nutlin

      Response: We apologise for the typos. This is addressed in the current submission, along with improved figure legends to improve readability.

      Figure 1b,1c : Show basal and Nutlin-3 induced MDM2 levels and in the overexpression cell lines; if WT-p53 is functional, MDM2 levels should be higher in WT-transduced cells compared to control or mt-p53 expressing cells.

      Response: In the revised submission, we will include Western blotting probing MDM2 levels (antibody permitting); this is a part of the experiment proposed for Points 1 and 2.

      Authors should explain which they name USP7 a negative regulator of p53, since it is supposed to de-ubiquitinate p53?!

      Response: The effects of USP7 on WTp53 have indeed been difficult to elucidate (by Prof. Vogelstein PMID: 15118411, and PMID: 15058298, and seemingly opposite by Prof. Gu Wei, PMID: 15053880, and PMID: 11923872). However, consistent with Prof. Vogelstein group, the inhibition of USP7 (either by inhibitor or genetically via CRISPR in our studies), has resulted in elevated p53 level.

      Figure 2E: the effect of MG132 on p53 seems to be very minimal on this Western blot; it would need quantification to be convincing...Quality of the blot is also not great.The fact that in control cells the levels of p53 R273H are not affected by MG132 treatment fits with Suppl Figure 2E, indicating that the proteasome has no effect on p53 R273H.

      Response: We indeed noticed that while the proteasome pathway is largely implicated in the WTp53 screen, it has much reduced effects on R273H. Interestingly, the treatment of MG 132 also has limited effects using PANC-1 line (with endogenous R273H). We will repeat this experiment and provide quantifications and modify the text accordingly.

      Suppl figure 3b, 3c, 3d:

      Somehow, I have the feeling that the results from the western blots and the FACS do not match fully, although not all the time-points are shown in the various experiments.

      For example, the FACS analysis (3b) suggests that in control-transduced cells after 16hr p53 is still increased. However, that is not clear at all in the Western blot (3c)

      Is Suppl Figure 3d the quantification of 3c experiment? If so, in the blot also the 24 hrs should be shown.

      The blot shown in Suppl Figure 3c suggests that CCDC6 expression increased upon irradiation. Do the authors agree with that? Would that explain why depletion of CCDC6 has more effect upon irradiation?

      Suppl Figure S3E: if I am right, this is essentially the same type of experiment as shown in figure 2e, but analysis of p53-expression by Western blot. In that blot no real effect of MG132 on p53 levels could be seen. But here, in the FACS analysis, MG132 clearly increases the p53-Clover fusion levels; for me again that Western blot and FACS data do not neccesarily match.

      Response: We apologise for the confusion. In the revised submission, we will improve the figure legends for better readability. Furthermore, in anticipation to the multiple cell lines involved in the revision, we will also clarify the cell lines in the figure.

      With regards to the difference between the flow cytometry and WB data, we have generally observed the flow cytometry bimodal shifting to be more sensitive than the WB, e.g. a 50% shift in population (FACS) is reflected by a 15% reduction in WB (which may be partially explained as WB is a measurement across the cell population and FACS determines the p53-GFP levels of every cell and thus the shift of cells between peaks). Similarly, we noticed flow-cytometry based quantification by antibody staining the endogenous p53 yielded similar sensitivity (PMID: 29653964). As such, we will ensure the validation of hits is performed in two modes. For WB experiment, we will do so in two cell lines carrying the endogenous mutants as suggested by Reviewers #1 and 2.

      Figure 3B: In the CCDC6 IP a very small amount of p53 can be found. I don't know how much input lysate compared to amount of lysate for IP is used, but the percentage of p53 found interacting with CCDC6 seems so marginal that is difficult to explain the effect of KO of CCDC6 in PANC1 cells.

      And, the authors called it a 'reciprocal IP' (Suppl Figure 4a) after transfection of V5-tagged CCDC6 into PANC1 cells, but it actually is the same type of IP. Did the authors try to IP p53 and blot for CCDC6? That would be a reciprocal IP.

      Response: We apologise for the confusion. In the revised submission, we will specify the portion of the lysates used for pre-IP (5% lysate) and IP (1 mg). As for the IP, we will also include the true reciprocal IP (IP p53, and blot for CCDC6).

      Figure 3H: how can authors explain that basal levels of USP28 in control and CCDC6-KO cells transfected with control plasmid are more or less the same and not reduced in the CCDC6-KO cells?

      Response: We will provide a better blot and quantification for this observation. In the current Fig 3H, the CCDC6-KO lane is slightly overladed as seen by the H3 loading control.

      Figure 3I: Essentially the whole blot here is of low quality; especially the FBXO42 blot; is deletion of USP28 increasing FBXO42 protein levels, or is it just the quality of the blot? All in all it seems that FBXO42 is very low expressed in the used cell lines.

      Response: We apologise for the confusion. In the revised submission, we will repeat and try to include higher quality WB, with more optimised condition for using the FBXO42 antibody.

      FBXO42 messenger level is readily detected using qRT.

      Figure 4B: I find it a bit surprising that USP7 is also found in the synthetic viability screen, since it has been shown that USP7 has many more essential targets and KO of p53 only partially rescues the development of USP7-KO mouse embryo's.

      Response: We thank the reviewer for this critical observation. While the double p53-USP7 knockout line is viable, we acknowledge that it is amongst the top scored hits due to the large differential viabilities between WT and p53-null lines. In the revised submission, we will further clarify the screen analysis and the associated interpretation.

      Figure 5: the authors nowhere show the efficacy of the guides targeting c16orf72. A Western blot showing the expression and the reduction upon expressing the guide-RNAs is essential.

      Response: We thank the Reviewer for this suggestion. The efficacy of each guide has been verified using ICE (at the genomic level), and in the revised submission, we will include this critical information as part of the Figure S2F.

      Figure 5E: First, here probably parental RPE1 cells have been used, but that is not stated. Second, the authors state 'only a slight increase in p53 levels upon siHUWE1'; I would say none compared to scrambled.

      I know HUWE1 is a very huge protein, but the blot of HUWE1 is not convincing. I seem to be able to conclude that siMDM2 and siUSP7 reduces HUWE1 levels?

      Response: We apologise for the confusion. In the revised submission, we will be specific of the cell line information on the figure, to improve the readability.

      We agree with the reviewers that assessment of large protein by WB is often difficult but given that this band almost completely disappears upon HUWE1 knock-down, strongly argues that we are indeed assessing the endogenous HUWE1. We also agree that it is an interesting observation that the levels of HUWE1 seem to be slightly reduced upon knock-down of MDM2 and USP7. We will repeat this experiments and provide quantitative data for HUWE1 and p53. Of note, in the screen, HUWE1 also scored as a negative regulator of wt-p53 and did not quite reach statistical significance for the p53 mutants.

      Regarding the relationship between C16orf72 and HUWE1, a newly published work (PMID: 35776542) seems to suggest that siHUWE1 has resulted in an increased C16orf72 level (termed HAPSTR1 in the paper), while siC16orf72 seemed to have no effect on HUWE1 level, although the stability of such a large protein by WB is often difficult to conclude.

      Figure 5F, in relation to figure 5D. Here the author overexpress both c16orf72 and USP7, and find an interaction. The implication of that is not clear. If they want to make point of this interaction, they should have looked at endogenous proteins.

      Response: We acknowledge the many concerns associated with coIP with ectopically, and especially overexpressed proteins in large quantity. In the revised submission, we will attempt to perform endogenous-based IP experiment (antibody permitting).

      It is worrying that USP7 apparently was not one of the hits in the Mass-spec experiment of which results are shown in Figure 5D. Also in that experiment c16orf72 was overexpressed, and USP7 is very highly expressed in essentially all cell lines, so do the authors have an explanation?

      Response: We indeed acknowledge this discrepancy. In the revised submission, we will attempt the coIP/IP using endogenous proteins (antibody permitting, or at least using endogenous target for one of the two partners). We also acknowledge that the limitation associated with the APMS for the detection of interactors.

      Suppl. figure 5D is missing

      Response: We apologise for the confusion. The Figure S5D was inconveniently placed at the top of the figure panel due to space limitation. In the revised submission, we will address this as a part of the overall readability improvement.

      Reviewer #1, Significance.

      The topic of the paper is of high interest given the relevance of p53 and its gain-of-function mutants in oncology, and the screens are well executed and clearly presented. In terms of novelty, FBXO42 has been linked to p53-degradation before, and c16orf72 was recently shown to be able to destabilize p53. However, the link between CCDC6 and p53 is novel and of interest, since they are both substrates of USP7 and are both regulators of the cell cycle.

      We think the manuscript has potential to add something to the field, but would benefit greatly from a better understanding of the molecular underpinnings of their newly described mechanisms, as well as the conditions in which the mechanism is active.

      Therefore, it might be advisable to shorten the manuscript, and go more in-depth in finding the mechanisms of regulation.

      Response: We sincerely thank the reviewer for all the constructive critiques. We will incorporate them in to our revision.

      Reviewer #2.

      Reviewer #2 summary:

      The paper describes several genome-wide CRISPR screens designed to identify regulators of p53 stability. The authors use a system in which p53 levels are marked by mClover expression, using RFP expression to normalise for gene expression changes.

      Reviewer #2, major points.

      1. The bimodal distribution of p53 expression levels in some reporter cell lines (G245S, R248Q, R248W and R273H) hampers the implementation of a robust readout and makes correct interpretation of the results challenging. While it is possible that the bimodal distribution indicates dynamic changes in p53 levels within one population, it also seems possible that a subclone of these cells have acquired additional alterations affecting p53 stability, and that the authors are screening a mixed population of two intrinsically different cell populations. This would make it difficult to interpret the results of the screen in these cell lines and may be a challenge when trying to identify something that has not already been highlighted on depmap.

      Response: We thank the reviewer for this critical observation. We strongly believe that this bimodal distribution is actually an inherent property of the p53 mutants in these cells for the following reasons: (1) The observation of the similar bimodal appearance in cell lines harbouring corresponding endogenous mutant p53s (PMID: 29653964) suggest that these two populations are of biological significance. (2) We have established 5-10 clonal lines each from the G245S, R248Q, R248W and R273H p53 reporter line and all of them exhibit a bimodal distribution, making it very unlikely that these populations are all through stochastic outgrowth of sub-populations with spontaneous mutations/alterations. (3) The bimodal distribution is stable over several months to years in culture. If it were a spontaneous mutations giving rise to a clone with higher mutant p53 levels, we would likely expect that over time this clone takes over the population. (4) We observed that such a pool of bimodal cells could be “synchronised” (e.g. by Nutlin, or MDM2 knockout) to one population, and later return to and repopulate the other (e.g. Nutlin washoff, Figure 1B). (5) When we sort out a single cells from the upper or the lower peak, expand them, we obtain again populations of cells with the same bimodal distribution, indicating that this is a dynamic process. Thus, we believe that these two populations were rather intrinsic, such that a cell in the population may assume both states.

      We also acknowledge the difficulties of screening using a bimodal population; however, we took advantage of these “bimodal” mutants and using FACS assessed the state of a single cell in relation to a genetic perturbation. Each guide has an equal chance of entering a cell that belongs to one of the two populations. If a gene knock-out really affects p53 levels, the cells with the respective guides enrich in one and deplete in the other population and the analysis comparing the guide abundances from these two peaks ensures the experiment are being perfectly internally controlled.

      While many of the top scored hits from the resulting screens are known regulators, it is critical that we validate our hits in an independent system, such as the cell lines harbouring endogenous p53 mutations, echoed by both Reviewers #1 and 2.

      The coverage of the sgRNA library (200x) is rather low for a negative selection screen, where a coverage of 500x would be more desirable. The FDR threshold is also rather lenient, a more stringent FDR threshold would seem more appropriate and shorten the list of potential hits.

      Response: We thank the reviewer for this constructive suggestion. A higher coverage, along with a more stringent FDR, will ensure an even stronger confidence for the remaining individual hits. The present reporter-based enrichment screen and the synthetical viability drop-out screen used four guides per gene, and with 200x coverage for each guide.

      In determining the coverage, we tried to reference recent successful screenings and apply earlier titration result for the 200x coverage (e.g. PMID: 26627737, PMID: 33465779, and reviewed in Nat Rev Methods Primers 2, 8 (2022). https://doi.org/10.1038/s43586-021-00093-4). While the threshold of FDR was often arbitrary, we fully agree that a more stringent FDR, which results in shortened hits list, may further boost the confidence of the hits, though also at the cost of losing potential hits due to collateral effects (e.g. guide efficiency).

      We agree with this reviewer that a higher FDR, esp. at the hits that result in p53 stabilization, would make sense as any gene whose loss causes cellular or genotoxic stress, would likely lead at least in part to p53 stabilization. In the revised submission, we will adjust the FDR accordingly.

      Although the study is focused on the regulation of p53 stability, there are no experiments to show that any of the manipulations alter the ubiquitination or degradation (half-life) of p53. The rescue of expression by proteasome inhibition is very modest (Figure 2E), suggesting the loss of expression may not be a reflection of degradation. A role for endogenous FBXO42 and C16orf72 in regulating the ubiquitination and half-life of endogenous p53 should be confirmed

      Response: We thank the reviewer for this suggestion. In the revised submission, we will monitor the ubiquitination status and also degradation (cycloheximide-chase) experiments for R273H cells, with or without the genetic alteration of CCDC6/FBXO42/C16orf72.

      Many p53 mutants are used for the initial screens, but very little validation is carried out to show that the apparent differences in factors regulating their stability persists in cells naturally expressing these mutants. For example, FBXO42 is identified as a protein required to maintain the stability of R273H, 248W and R248Q, but not R175H, G245S and R337H. While the authors show an association of CCDC6 and p53 in PANC1 cells (expressing 273H), it would be important to show a panel of R273H, 248W and R248Q expressing tumor cells and the response of p53 to FBXO42 and CCDC6 depletion, compared to similar experiments in a panel of R175H, G245S and R337H expressing tumor cells. Again, it would be important to show that any changes in protein levels are due to changes in protein stability.

      Response: We thank the reviewer for this suggestion. In the revised submission, we will include validations in more cell lines carrying endogenous mutant p53s, with a focus on the R273H mutant. We will also try to involve a line with an endogenous p53 mutation that does not respond to FBXO42/CCDC6 alteration.

      The potential hits should also be tested in wild type p53 expressing cells to confirm the specificity to mutant p53s.

      Response: In the revised submission, we will include WB for WT lines (e.g. RPE1) upon genetic alteration of CCDC6 and FBXO42. This was already performed for C16orf72 (Figure 6D).

      (6A) The role of C16orf72 in restraining p53 activity has been reported previously, as has the interaction with HUWE1 (including a new publication PMID: 35776542). The authors suggest an interaction between C16orf72 and USP7, although this should be shown with endogenous proteins. The relative importance of USP7 and HUWE1 binding is not explored. (6B) The effect of C16orf72 overexpression in promoting mammary tumors is impressive, although maybe the more interesting question is whether inhibition of C16orf72 expression can limit tumor development in this system.

      Response to 6A: we are excited about the independent observations by other group(s) confirming similar results! As a part of our improvement for mechanistic work-up, in the revised submission, we will attempt to address, whether C16orf72’ regulation of p53 is dependent on USP7 and/or HUWE1, or other known E3s, such as MDM2.

      (1) Whether the interaction of C16orf72 and HUWE1 or USP7 is required for the C16orf72 regulation of p53. Specifically, for example, we will perform epistasis experiments to test USP7’ or HUWE1’ ability to rescue the p53 levels in reporters upon ∆C16orf72. Due to the toxicity/lethality in WTp53 lines induced by the loss of C16orf72, we intend to test using R273H-reporter, or RPE1-line with ∆CDKN1A (p21) that is a synthetic viable rescue for ∆*C16orf72. *

      (2) In the revised submission, we will attempt to perform endogenous-based C16orf72-USP7 IP experiment (antibody permitting).

      6B. The effect of C16orf72 overexpression in promoting mammary tumors is impressive, although maybe the more interesting question is whether inhibition of C16orf72 expression can limit tumor development in this system.

      Response: We are also equally excited about the in vivo result supporting the idea that C16orf72 overexpression in tumour-prone mice (Pik3caH1047R) mice harbouring WTp53 may accelerate tumour formations. In the revised submission, we will further support that this effect is specific to WTp53/C16orf72, by including data of the control cohort with p53-null background (LSL-Pi3kH1047R; p53Flox/Flox).

      In regard to the effects of C16orf72-depletion in controlling tumour growth - we agree that this would be a very exciting avenue. Conditional C16orf72 mice are being made at the moment and these mice will allow us to comprehensively address this question. However, it will take several more month to generate and validate this line, and then another 2 breeding rounds to generate homozygous C16orf72fl/fl; Pik3caH1047R mice. In addition, the long time required to form tumours in the control mice with WTp53 (~250 days), it becomes not feasible for us to test whether the inhibition of C16orf72 could limit the tumour development, given the revision timeline. As such we respectfully believe that this would be beyond the scope of this manuscript.

      Reviewer #2, Minor comments.

      Figure 1b: The nutlin concentration stated in the methods section is wrong. Should be 10 µM instead of 10 nM (correct in figure legend).

      Figure 6b: y-axis label is missing.

      Figure 1e/f Legend: Should be FDR 0.5.

      Response: We apologise for typos. The current submission has incorporated the corrections.

      Figure 1c: Include results for a mutant that is not regulated by MDM2, such as R175H. Otherwise, as a standalone experiment, this figure doesn't add much.

      Response: We thank the reviewer for this suggestion. In the revised submission, we will include R175H/R337H.

      Figure 1h: While an UpSet plot is an elegant way to present unique and overlapping hits between different screens, Venn diagrams might be more 'accessible' to many readers and easier to understand.

      Response: We thank the reviewer for this feedback. The choice of UpSet blot was largely motivated by the different categories involved, which made the area representation and the intersection of the conventional Venn diagram no longer feasible.

      In the revised submission, we will improve our figure legend for the UpSet blot, to improve the readability.

      Might be worth stating that mClover is an eGFP variant and can therefore be targeted by eGFP sgRNAs so that it is easier to understand the following:

      o Page 5, paragraph 1: "We used the TKOv3 sgRNA library, which contains [...] 142 control sgRNAs targeting EGFP, LacZ and luciferase"

      o Page 5, paragraph 2: "As expected, sgRNAs targeting p53 and mClover were the most depleted sgRNAs, [...]

      Response: We thank the reviewer for this suggestion. We believe this will also improve the readability and have incorporated this into our current submission.

      Reviewer #2, Significance.

      Reviewer #2 (Significance (Required)):

      This is an interesting concept and the results could provide a useful resource for groups interested in the regulation of p53. The authors chose to focus on candidate genes that could have been identified by looking for the top 30 p53 co-dependent genes on depmap (C16orf72 is #24 in this list and FBXO42 is #28, most of the other genes ranking above are already known as p53 regulators). While this validates the screen, it would have been interesting if the authors had identified and validated new regulators of p53 that were not apparent from previously published work.

      Response: We thank the reviewer for all the thorough and constructive comments! In relation to the DepMap dataset, we are excited that many of the top hits from our screens are indeed top WTp53-correlators/anti-correlators (e.g. MDM2, USP28)!

      While the DepMap dataset used cell fitness/viability to construct the genetic relation score, this assay may not effectively rule out the many regulators that could otherwise elicit their regulation of p53 via regulating the general cell response to cell cycle, stress, etc. In our screen systems (i.e. protein stability and synthetic viability screens), we attempted to focus on the regulators of p53-stability (post-translational), and further coupled it with the synthetic viability screens to concentrate on hits that have a more direct role in p53 regulation (e.g. MDM2, C16orf72).

      One other difficulty to fully couple our screens to the DepMap dataset is due to the limited cell lines harbouring endogenous mutant p53s, e.g. R337H. This may also contribute to the uniqueness of the identified R337H-reporter specific hits (where cell lines harbouring R337H have not yet been included in the DepMap dataset), e.g. several Aminoacyl tRNA synthetases (SARS, YARS, etc) were identified as R337H unique regulators and subsequently verified using different guides in the reporter line, but could not be obtained via DepMap.

      We largely see this paper as a resource for the p53 field and would like to publish it as soon as possible. In fact, when we started working on C16orf72 or CCDC6/FBXO42, these hits were not known for their ability to regulate p53. We will work up several other hits, but this would be beyond the scope of this paper and the first author’s Ph.D. thesis that needs to be completed under a timeline.

      Reviewer #3.

      Reviewer #3 summary:

      The manuscript by Lu and coworkers performed genome wide CRISPR screens to search for genes that when knocked out, lead to p53 accumulation or degradation. Wt p53 and a panel of p53 hotspot mutants were chosen as reporter for the screen. The approach reassuringly identified many previously described regulators of p53 degradation, and also found a large set of new hits that many appear to be indirectly affecting p53 level.

      A key step of this approach is the follow up functional and mechanistic study of the hits. To this end, the authors chose FBXO42 as a top hit that blocks mutant p53 degradation, and C16orf72 as a top hit that promotes wt/mutant p53 degradation.

      Overall the functional data for FBXO42 is disappointing. FBXO42 knockout has quite modest effect on mutant p53 level (~50% reduction). The knockout also showed some effect on p53 mRNA level (~25% reduction), making the determination of mechanism difficult. It does not appear to be a promising targeting for reducing mutant p53 level and gain of function activity in tumor cells.

      We thank the reviewer for this constructive comment! We will address this in the revision, as proposed in Point #3.

      The C16orf72 finding unfortunately lost some novelty because it was independently identified as a p53 regulator in a recent study using CRISPR screening (PMID: 33660365). However, the repeated identification is reassuring and the current work provides more convincing functional data, showing C16orf72 knockout increase wt p53 level, inhibits cell proliferation specifically in p53+/+ cells, and overexpression of C16orf72 reduce wt p53 level and accelerates progression of a breast tumor mouse model. Their results suggest C16orf72 is a biologically relevant regulator of p53 in cancer development. In order to provide a reasonable amount of new information and set it further apart from the published study, some biochemical analysis looking into the mechanism of C16orf72 will be helpful.

      Reviewer #3 Major and Minor comments:

      Specific comments:

      1. There appears to be a mix up in the figure legend for Fig.1A describing line 1 and 2.

      Response: We sincerely apologise for the mix up in the figure legend! In the current submission, this has been fixed.

      Fig.2. Data for some p53 mutants mentioned in the text cannot be found in the main figure 2D and supplemental figure S3A.

      Response: We apologise for having not included the R175H and R337H mutants in Supplemental Figure S3A. In the revised version, we will include these two mutants.

      Fig.2 E-F. The effects of FBXO42 and CCDC6 KO on endogenous mutant p53 level is small (~50% decrease). Given that mutant p53 accumulates at high levels, whether a 50% decrease has meaningful effect on its gain of function activities is questionable. The knockouts also caused a ~25% decrease in p53 mRNA (FigS3F) which makes the mechanism quite difficult to investigate further.

      Response: We agree with the reviewer that the current data makes it difficult to conclude the mechanism. Given the design of our reporter, we still believe that the regulations could largely be at the post-translational level. In our revised version, we plan to exam the ubiquitination status of p53 upon losses of CCDC6/FBXO42, and also monitor the p53 degradation via cycloheximide chase.

      To further address whether this reduced level of mutp53 has biological impacts, we plan to test it in the tumour cell context. Given the difference in migration capability observed between PANC-1 and PANC-1-∆p53 line (e.g. PMID: 35439056), we plan to also evaluate the migration pattern of PANC-1, with the presence and absence of FBXO42/CCDC6 (controlled by similar FBXO42/CCDC6 loss in PANC-1- ∆p53 background). Furthermore, in tissue culture, although there is only marginal to no difference in cell growth rate between many mutant p53 lines (e.g. PANC-1) and their ∆p53 line, we plan to test whether a reduced serum or nutrient level could exacerbate the difference, and hence further be used to monitor the difference resulted from the loss of FBXO42/CCDC6.

      Fig.3B. The IP experiment using p53 shRNA and control shRNA should be done by IP of p53 followed by CCDC6 western blot. If CCDC6 IP is used as in the figure, then a CCDC6 shRNA knockdown sample should be compared to control shRNA. The current data does not rule out the possibility that CCDC6 antibody can nonspecifically pull down some p53.

      Response: We apologise for the confusion. In the revised version, we will include the proper reciprocal IP, with IP of endogenous p53 (R273H) followed by blotting of CCDC6.

      Fig.3D. The in vitro pull down experiment needs specificity controls such as non affected R175H p53 core domain. The data presented would suggest that MBP-FBXO42c captured more than 1:1 molar ratio of R273H core domain, which is unusual for specific binding unless there is aggregation of p53.

      Response: We thank the reviewer for this constructive comment! In the revised version, we will incorporate this, by repeating the in vitro pull-down assay including a non-p53 control protein.

      To increase the impact of the current study, the authors could provide more mechanism insight on how C16orf72 regulates p53 level, which was also missing in the other published study. For example, addressing whether C16orf72 effect is dependent on MDM2. Does it cooperate with MDM2 to ubiquitinate p53. Does it promote p53 ubiquitination in the absence of MDM2, since it interacts with HUWE1. Does it act by recruiting usp7 to stabilize MDM2.

      Response: we thank the reviewer for this very constructive and thorough comment! In our revised version, we will attempt these assays and incorporate them into the submission.

      Together with our response to Reviewer #2, Point #6, in the revised submission, we will attempt to address if C16orf72 regulation of p53 is dependent on MDM2 or HUWE1.

      (1) Whether the interaction of C16orf72 and HUWE1, or C16orf72 and USP7 is required for the C16orf72 regulation of p53. Specifically, for example, we will perform epistasis experiments to test HUWE1’ or USP7’s ability to rescue the p53 levels in reporters upon the loss of C16orf72 (∆C16orf72). Due to the toxicity/lethality in WTp53 lines induced by the loss of C16orf72, we intend to test using the R273H-reporter, or RPE1-line with ∆CDKN1A (p21) that is a synthetic viable rescue for ∆*C16orf72. *

      (2) Whether C16orf72 dependent upon or cooperate with MDM2 in regulating p53.

      We will first probe whether C16orf72 overexpression increased the p53 ubiquitination, and then decide whether overexpression of C16orf72 has additive effects to MDM2 overexpression in regulating p53 levels.

      We previously observed that overexpressing C16orf72 could not rescue the R273H level resulted from losing MDM2 (using flow-cytometry in R273H-reporter-∆MDM2), and as such, we plan to test the C16orf72-MDM2 relation in the MDM2-proficient context.

      The manuscript is in a form extremely unfriendly to review, text, figures and legends are all split up at multiple locations, the pdf figures are very sluggish to scroll.

      Response: We sincerely apologise for the inconvenience. In the current submission, we have split the submission into three separate files, (1) main text, (2) main figures, and (3) supplemental figures, along with (4) supplemental tables as individual EXCELs. We will also reduce the resolution of a few images, so the overall higher resolution is retained, while still fitting into the file size limit.

      Reviewer #3 (Significance (Required)):

      The work is significant in identifying a functionally relevant regulator of p53 stability.

      Response: we thank the reviewer again for the very constructive feedback!

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Lu and coworkers performed genome wide CRISPR screens to search for genes that when knocked out, lead to p53 accumulation or degradation. Wt p53 and a panel of p53 hotspot mutants were chosen as reporter for the screen. The approach reassuringly identified many previously described regulators of p53 degradation, and also found a large set of new hits that many appear to be indirectly affecting p53 level.

      A key step of this approach is the follow up functional and mechanistic study of the hits. To this end, the authors chose FBXO42 as a top hit that blocks mutant p53 degradation, and C16orf72 as a top hit that promotes wt/mutant p53 degradation.

      Overall the functional data for FBXO42 is disappointing. FBXO42 knockout has quite modest effect on mutant p53 level (~50% reduction). The knockout also showed some effect on p53 mRNA level (~25% reduction), making the determination of mechanism difficult. It does not appear to be a promising targeting for reducing mutant p53 level and gain of function activity in tumor cells.

      The C16orf72 finding unfortunately lost some novelty because it was independently identified as a p53 regulator in a recent study using CRISPR screening (PMID: 33660365). However, the repeated identification is reassuring and the current work provides more convincing functional data, showing C16orf72 knockout increase wt p53 level, inhibits cell proliferation specifically in p53+/+ cells, and overexpression of C16orf72 reduce wt p53 level and accelerates progression of a breast tumor mouse model. Their results suggest C16orf72 is a biologically relevant regulator of p53 in cancer development. In order to provide a reasonable amount of new information and set it further apart from the published study, some biochemical analysis looking into the mechanism of C16orf72 will be helpful.

      Specific comments:

      There appears to be a mix up in the figure legend for Fig.1A describing line 1 and 2.

      Fig.2. Data for some p53 mutants mentioned in the text cannot be found in the main figure 2D and supplemental figure S3A.

      Fig.2 E-F. The effects of FBXO42 and CCDC6 KO on endogenous mutant p53 level is small (~50% decrease). Given that mutant p53 accumulates at high levels, whether a 50% decrease has meaningful effect on its gain of function activities is questionable. The knockouts also caused a ~25% decrease in p53 mRNA (FigS3F) which makes the mechanism quite difficult to investigate further.

      Fig.3B. The IP experiment using p53 shRNA and control shRNA should be done by IP of p53 followed by CCDC6 western blot. If CCDC6 IP is used as in the figure, then a CCDC6 shRNA knockdown sample should be compared to control shRNA. The current data does not rule out the possibility that CCDC6 antibody can nonspecifically pull down some p53.

      Fig.3D. The in vitro pull down experiment needs specificity controls such as non affected R175H p53 core domain. The data presented would suggest that MBP-FBXO42c captured more than 1:1 molar ratio of R273H core domain, which is unusual for specific binding unless there is aggregation of p53.

      To increase the impact of the current study, the authors could provide more mechanism insight on how C16orf72 regulates p53 level, which was also missing in the other published study. For example, addressing whether C16orf72 effect is dependent on MDM2. Does it cooperate with MDM2 to ubiquitinate p53. Does it promote p53 ubiquitination in the absence of MDM2, since it interacts with HUWE1. Does it act by recruiting usp7 to stabilize MDM2.

      The manuscript is in a form extremely unfriendly to review, text, figures and legends are all split up at multiple locations, the pdf figures are very sluggish to scroll.

      Significance

      The work is significant in identifying a functionally relevant regulator of p53 stability.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The paper describes several genome-wide CRISPR screens designed to identify regulators of p53 stability. The authors use a system in which p53 levels are marked by mClover expression, using RFP expression to normalise for gene expression changes.

      1. The bimodal distribution of p53 expression levels in some reporter cell lines (G245S, R248Q, R248W and R273H) hampers the implementation of a robust readout and makes correct interpretation of the results challenging. While it is possible that the bimodal distribution indicates dynamic changes in p53 levels within one population, it also seems possible that a subclone of these cells have acquired additional alterations affecting p53 stability, and that the authors are screening a mixed population of two intrinsically different cell populations. This would make it difficult to interpret the results of the screen in these cell lines and may be a challenge when trying to identify something that has not already been highlighted on depmap.
      2. The coverage of the sgRNA library (200x) is rather low for a negative selection screen, where a coverage of 500x would be more desirable. The FDR threshold is also rather lenient, a more stringent FDR threshold would seem more appropriate and shorten the list of potential hits.
      3. Although the study is focused on the regulation of p53 stability, there are no experiments to show that any of the manipulations alter the ubiquitination or degradation (half-life) of p53. The rescue of expression by proteasome inhibition is very modest (Figure 2E), suggesting the loss of expression may not be a reflection of degradation. A role for endogenous FBXO42 and C16orf72 in regulating the ubiquitination and half-life of endogenous p53 should be confirmed
      4. Many p53 mutants are used for the initial screens, but very little validation is carried out to show that the apparent differences in factors regulating their stability persists in cells naturally expressing these mutants. For example, FBXO42 is identified as a protein required to maintain the stability of R273H, 248W and R248Q, but not R175H, G245S and R337H. While the authors show an association of CCDC6 and p53 in PANC1 cells (expressing 273H), it would be important to show a panel of R273H, 248W and R248Q expressing tumor cells and the response of p53 to FBXO42 and CCDC6 depletion, compared to similar experiments in a panel of R175H, G245S and R337H expressing tumor cells. Again, it would be important to show that any changes in protein levels are due to changes in protein stability.
      5. The potential hits should also be tested in wild type p53 expressing cells to confirm the specificity to mutant p53s.
      6. The role of C16orf72 in restraining p53 activity has been reported previously, as has the interaction with HUWE1 (including a new publication PMID: 35776542). The authors suggest an interaction between C16orf72 and USP7, although this should be shown with endogenous proteins. The relative importance of USP7 and HUWE1 binding is not explored. The effect of C16orf72 overexpression in promoting mammary tumors is impressive, although maybe the more interesting question is whether inhibition of C16orf72 expression can limit tumor development in this system.

      Minor comments

      • Figure 1b: The nutlin concentration stated in the methods section is wrong. Should be 10 µM instead of 10 nM (correct in figure legend).
      • Figure 1c: Include results for a mutant that is not regulated by MDM2, such as R175H. Otherwise, as a standalone experiment, this figure doesn't add much.
      • Figure 1e/f Legend: Should be FDR <0.5 not >0.5.
      • Figure 1h: While an UpSet plot is an elegant way to present unique and overlapping hits between different screens, Venn diagrams might be more 'accessible' to many readers and easier to understand.
      • Might be worth stating that mClover is an eGFP variant and can therefore be targeted by eGFP sgRNAs so that it is easier to understand the following:
        • Page 5, paragraph 1: "We used the TKOv3 sgRNA library, which contains [...] 142 control sgRNAs targeting EGFP, LacZ and luciferase"
        • Page 5, paragraph 2: "As expected, sgRNAs targeting p53 and mClover were the most depleted sgRNAs, [...]
      • Figure 6b: y-axis label is missing

      Significance

      This is an interesting concept and the results could provide a useful resource for groups interested in the regulation of p53. The authors chose to focus on candidate genes that could have been identified by looking for the top 30 p53 co-dependent genes on depmap (C16orf72 is #24 in this list and FBXO42 is #28, most of the other genes ranking above are already known as p53 regulators). While this validates the screen, it would have been interesting if the authors had identified and validated new regulators of p53 that were not apparent from previously published work.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript by Lu et al., the authors describe some CRISPR screens and protein-protein interaction screens to identify novel regulators of wild-type p53 and mutant p53 function and stability. Besides generating a wealth of data, they discover FBXO42-CCDC6 as positive regulators of the some p53 hot-spot mutants, including R273H mutant p53, but not of all p53 mutants tested and also not of wild-type, indicating selectivity. Furthermore, the found C16orf72(TAPR1) as a negative regulator of p53 stability. Mechanistically, the authors claim a direct interaction between FBXO42 and CCDC6 and p53, but the importance of these interactions has not been shown. On the other hand the authors suggest that the FBXO42/CCDC6 regulate p53 via destabilization of USP28, but also the mechanism has not been worked out. For c16orf72, they show that it interacts with USP7, but no relevance of this interaction is shown either.

      Major points

      One very important point for me is that the authors do not show the levels of expression of p53 in the p53-mClover stable cell lines. It is known that overexpressed p53 is usualy more stable than endogenous levels of wt-p53. Therefore, I think it is necessary that the authors show the levels of the p53-mClover fusion proteins in the stably transduced cell lines compared to endogenous p53 levels in the parental RPE1 cells and also compared to the endogenous levels of R273H mutant in the PANC-1 cells.

      Also the functionality of the wild-type p53-mClover fusion is questionable, at least not shown. One would expect that the overexpression of a functional wt-p53 in p53-KO cells will affect the survival of the RPE1 cells. In Figure 5A the authors show that depletion of MDM2 or C16ORF72 is toxic for the RPE1 cells in a p53-dependent manner, indicating that elevated levels of p53 cannot be handled by these cells. So, experiment(s) showing that the wt-p53/mClover fusion is functional is needed.

      A second important point is that the 'verification' of the hits from the screens is only done in one cancer cell line, PANC-1, with mutant p53. I would have like to see at least one other cell line with another p53 mutant endogenously expressed that is also regulated by FBXO42/CCDC6.

      For many of the p53-mutants, a bimodal expression is observed. In the FBXO42- and CCDC6-depleted cells, the equilibrium shifts towards more negative cells but the levels in the two populations itself don't change (while for example for USP28 depletion also the right peak shifts further up, Fig S4E). Is there any correlation with the cell cycle and p53 expression? And can the authors exclude that FBXO42 and CCDC6 are involved in cell cycle progression and hereby influence p53 indirectly (by combining PI staining with Clover-p53 for example).

      • The authors claim that the FBXO42-CCDC6 axis regulates stability specifically some p53-mutants, including R273H-mutant, in a manner involving USP28. But USP28 regulates all forms of p53, not just some mutants version. How can the authors reconcile this apparent contradiction?

      On a similar note, the authors show that FBXO42 and CCDC6 interact with p53, but not USP28. Do FBXO42 and CCDC6 interact with each other and with USP28? And is the interaction with p53 specific for the R273H version? This part of the mechanism is very poorly defined and the Co-IPs are not very convincing or relevant for the proposed model.

      Minor points

      The mechanisms of p53 regulation may vary greatly in different cell lines. Can the authors discuss why they choose to do the screen with different mutants, rather than with different cell lines expressing these same mutant endogenously? .

      Figure 1: Typo in the legends : Nultin ipv Nutlin

      Figure 1b,1c : Show basal and Nutlin-3 induced MDM2 levels and in the overexpression cell lines; if WT-p53 is functional, MDM2 levels should be higher in WT-transduced cells compared to control or mt-p53 expressing cells. Authors should explain which they name USP7 a negative regulator of p53, since it is supposed to de-ubiquitinate p53?!

      Figure 2E: the effect of MG132 on p53 seems to be very minimal on this Western blot; it would need quantification to be convincing...Quality of the blot is also not great. The fact that in control cells the levels of p53 R273H are not affected by MG132 treatment fits with Suppl Figure 2E, indicating that the proteasome has no effect on p53 R273H.

      Suppl figure 3b, 3c, 3d:

      Somehow, I have the feeling that the results from the western blots and the FACS do not match fully, although not all the time-points are shown in the various experiments. For example, the FACS analysis (3b) suggests that in control-transduced cells after 16 hr p53 is still increased. However, that is not clear at all in theWestern blot (3c) Is Suppl Figure 3d the quantification of 3c experiment? If so, in the blot also the 24 hrs should be shown. The blot shown in Suppl Figure 3c suggests that CCDC6 expression increased upon irradiation. Do the authors agree with that? Would that explain why depletion of CCDC6 has more effect upon irradiation? Suppl Figure S3E: if I am right, this is essentially the same type of experiment as shown in figure 2e, but analysis of p53-expression by Western blot. In that blot no real effect of MG132 on p53 levels could be seen. But here, in the FACS analysis, MG132 clearly increases the p53-Clover fusion levels; for me again that Western blot and FACS data do not neccesarily match.

      Figure 3B: In the CCDC6 IP a very small amount of p53 can be found. I don't know how much input lysate compared to amount of lysate for IP is used, but the percentage of p53 found interacting with CCDC6 seems so marginal that is is difficult to explain the effect of KO of CCDC6 in PANC1 cells. And, the authors called it a 'reciprocal IP' (Suppl Figure 4a) after transfection of V5-tagged CCDC6 into PANC1 cells,but it actually is the same type of IP. Did the authors try to IP p53 and blot for CCDC6? That would be a reciprocal IP.

      Figure 3H: how can authors explain that basal levels of USP28 in control and CCDC6-KO cells transfected with control plasmid are more or less the same and not reduced in the CCDC6-KO cells?

      Figure 3I: Essentially the whole blot here is of low quality; especially the FBXO42 blot; is deletion of USP28 increasing FBXO42 protein levels, or is it just the quality of the blot? All in all it seems that FBXO42 is very low expressed in the used cell lines.

      Figure 4B: I find it a bit surprising that USP7 is also found in the synthetic viability screen, since it has been shown that USP7 has many more essential targets and KO of p53 only partially rescues the development of USP7-KO mouse embryo's.

      Figure 5: the authors nowhere show the efficacy of the guides targeting c16orf72. A Western blot showing the expression and the reduction upon expressing the guide-RNAs is essential. Figure 5E: First, here probably parental RPE1 cells have been used, but that is not stated. Second, the authors state 'only a slight increase in p53 levels upon siHUWE1'; I would say none compared to scrambled. I know HUWE1 is a very huge protein, but the blot of HUWE1 is not convincing. I seem to be able to conclude that siMDM2 and siUSP7 reduces HUWE1 levels? Figure 5F, in relation to figure 5D. Here the author overexpress both c16orf72 and USP7, and find an interaction. The implication of that is not clear. If they want to make point of this interaction, they should have looked at endogenous proteins. It is worrying that USP7 apparently was not one of the hits in de Mass-spec experiment of which results are shown in Figure 5D. Also in that experiment c16orf72was overexpressed, and USP7 is very highly expressed in essentially all cell lines, so do the authors have an explanation?

      Suppl. figure 5D is missing

      Referees cross-commenting

      I agree essentially with all comments of Reviewer #2. Especially the major points 3 and 4. The use of more cell lines expressing endogenous mutant p53 is very important. In addition, I can agree with almost all comments of Reviewer #3. The effects especially of FBXO42 ablation are rather minimal, so relevance is questionable.

      Significance

      Nature and Significance

      Compare to existing literature

      The topic of the paper is of high interest given the relevance of p53 and its gain-of-function mutants in oncology, and the screens are well executed and clearly presented. In terms of novelty, FBXO42 has been linked to p53-degradation before, and c16orf72 was recently shown to be able to destabilize p53. However, the link between CCDC6 and p53 is novel and of interest, since they are both substrates of USP7 and are both regulators of the cell cycle.

      We think the manuscript has potential to add something to the field, but would benefit greatly from a better understanding of the molecular underpinnings of their newly described mechanisms, as well as the conditions in which the mechanism is active.

      Therefore, it might be advisable to shorten the manuscript, and go more in-depth in finding the mechanisms of regulation.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2021-01204R

      Corresponding author(s): Alexander, Aulehla

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): *

      *The paper by Miyazawa and colleagues addresses a key question: How is changed metabolic activity sensed and to induce changes in developmental programs. In recent years, there is more and more indication that metabolism is not only a dull workhorse synthesizing the building blocks for new cells and providing chemical energy, but that metabolic activity itself has also a regulatory role. How this precisely works is largely unknown and even also unexplored in higher cells. From early insights obtained in microbes, it seems that certain metabolites - possibly reflecting metabolic activity (i.e. flux) - could be metabolic signals that feedback into cellular regulation. *

      *The current paper takes this idea now to developmental processes, where the authors found that the glycolytic metabolite fructose-1,6-bisphosphate is a flux-dependent signal that interferes with developmental processes. This is a very exciting finding, as it indicates that this metabolite not only has a regulatory function in microbes but also in mouse during mesoderm development. *

      *Answering the question how such a flux-dependent metabolite mechanistically interferes with the developmental processes is an enormously difficult. Compared to other mechanistic studies, where deleting genes, modifying genes, and changing protein expressions will usually do the trick, here, perturbing metabolite levels is extremely challenging, particularly if such perturbations need to be carried out in a way that nothing else is perturbed. Researchers, who are not overly familiar with metabolism, usually underestimate the difficulty with targeted and insightful perturbation of metabolism. *

      *To this end, the authors of this paper need to be congratulated for a very well carried out study with very solid data, and excellent control experiments. The authors open up a new path towards understanding how embryo mesoderm development is regulated by metabolic activity. In particular, they show that that glycolytic flux, FBP and important developmental phenotypes as well as protein localization changes are linked. As normal with a complex metabolism-based story as this one, there is always more that could be done. Yet, the results are highly important to be reported now such that the field as a whole can build on these interesting results and to explore the exciting path further that has been opened by the authors. Thus, I strongly recommend publishing these findings: The data generated by the authors are accompanied by the required control experiments. The conclusions drawn are very solid. I do not have any major concerns but just a number of minor suggestions that the authors could consider in a revised version of the manuscript. *

      *Minor: *

        • At the end of the introduction, the authors stated their original goal. As it is phrased, it is unclear whether this goal has been obtained or not. They might want to consider replacing the last introductory sentence by a sentence stating what the reader can find in this paper.*

      1. We agree with the reviewer and have rephrased accordingly (line 112–117):

      “In this study, our goal was therefore to first determine in vivo sentinel metabolites during mouse embryo PSM development. We then combined genetic, metabolomic and proteomic approaches to investigate how altered glycolytic flux and metabolite levels impact developmental signaling and patterning processes.”

      • Data from Fig 3: If you plot the lactate secretion vs the FBP levels of the controls and the overexpression experiment, would the control and the overexpression data lie on one line (maybe if combined with the data shown in Fig 1A)?*

      2. As the reviewer suggested, it is of great interest to check whether lactate secretion and FBP levels show a similar correlation in control and cytoPfkfb3 embryos, considering that cytoPfkfb3 overexpression lifts the upper limit of glycolytic capacity and FBP levels (revised Figure 3B, 3E). As the reviewer suggested, we plotted FBP levels against lactate secretion and fitted a linear regression line onto control samples (please see the Figure R1 below). The new plot shows that lactate secretion and FBP levels in cytoPfkfb3 embryos lie on the linear regression line derived from wild-type samples, highlighting that a correlation between lactate secretion and FBP levels is maintained even in cytoPfkfb3 embryos. We now included this new plot in the revised Figure S4C and modified the text accordingly (line 474-477):

      “In addition, FBP levels showed a linear correlation with lactate secretion in control explants, and such a correlation was maintained even in cytoPfkfb3 explants (Figure S4C).”

      Figure R1. Correlation between lactate secretion and FBP levels in PSM explants. Linear regression line (a grey line) was derived from the data of control samples cultured in 0.5–25 mM glucose (black circles; from Figure 1A and 3E). The data from cytoPfkfb3 embryos cultured in 2.0–10 mM glucose (from Figure 3B and 3E) are shown as red rectangles.

      • Maybe the authors could attempt an experiment like the following one: Chose the strongest phenotype observed and test a combination of overexpressing cytoPfkfb3 and reducing extracellular glucose level at the same time? *

      3. We agree this suggested experiment is important to show that the phenotype in cytoPfkfb3 embryos is indeed dependent on glycolytic flux and have already addressed this specific point in our manuscript, see results in Figure 4B and 5A in our original manuscript. The results show that the phenotypes in cytoPfkfb3 explants, i.e. reduction in somite formation and downregulation of Msgn mRNA expression occur in a glucose dose-dependent manner. Since in this embryonic context, we show that glucose concentration impacts glycolytic flux (see increased lactate production upon glucose titration in Figure 3B), our findings support the conclusion that the effect of cytoPfkfb3 overexpression is flux-dependent and not due to the overexpression per se. Based on the reviewer's feedback, we have modified the text to clarify and highlight this critical point (line 339–345):

      “Combined, these results show that cytoPfkfb3 overexpression results in reduced segment formation, arrest of the segmentation clock oscillations and downregulation of Wnt signaling, in a glucose-dose dependent manner. As glucose concentration impacts, in turn, glycolytic flux (Figure 1A, 3B), these findings suggest that these phenotypes are flux-dependent and are not a mere result of cytoPfkfb3 overexpression.”

      • Can the proteomics experiments shown in Fig. 6 be repeated with high and low extracellular glucose? High glucose should yield high FBP levels and one would then expect to see the same as with the experiment where at 2 mM glucose 20 mM extracellular FBP were added. Is this the case? *

      4. We agree with the reviewer that based on the findings, one would expect the phenotype, i.e. in this case translocation of proteins, to correlate with FBP levels. Two of our results are of note in this regard.

      First, our data indicates that in order to see the effect on protein localization, high levels of FBP have to be reached. Accordingly, we find that Pfkl becomes depleted from the nuclear-cytoskeletal fraction in cytoPfkfb3 explants when cultured in 10 mM glucose but not (visibly) in 2.0 mM glucose (Figure 7D). Corresponding to this, FBP levels in cytoPfkfb3 explants show a significant increase (about 3-fold) from 2.0 to 10 mM glucose conditions (revised Figure 3E).

      Second, in control samples, FBP levels saturate in high glucose conditions. FBP levels in control samples do not further increase when glucose concentration is increased from 10mM to 25mM, and thus it does not become as high as in cytoPfkfb3 embryos cultured in 10 mM glucose (revised Figure 3E).

      Therefore, in order to reveal the translocation, it requires an experimental strategy that leads to significantly increased FBP levels, such as in cytoPfkfb3 explants with high glucose condition, or alternatively, direct supplementation of FBP.

      As also pointed out by the other reviewers, we are experimentally generating controlled conditions that exceed the physiological range which the embryo is exposed to. Accordingly, our data does not constitute evidence that under physiological conditions an alteration of protein localization in response to change in glycolytic flux and FBP levels occurs, at a smaller scale.

      We regard our approach as a first step to reveal potential mechanisms and so far hidden possible responses to changes in metabolic flux. In order to see minor changes in translocation upon small changes in glycolytic-flux/FBP levels, more quantitative approaches, such as live-imaging of tagged proteins, will need to be developed. We hence decided to include these discussion in our revised manuscript (line 657-666):

      “Of note, the translocation of proteins was observed only when high levels of FBP were reached upon direct FBP supplementation or cytoPfkfb3 overexpression with high glucose (Figure 6, 7). Future studies hence need to investigate whether flux-dependent change in protein localization occurs upon moderate and more physiological changes in glycolytic-flux/FBP levels. To this end, the development of more quantitative approaches, such as live-imaging of tagged enzymes and the development of metabolite biosensors, are needed.”

      • While the authors quantified proteins in different compartments, I was wondering whether they also looked for whole-embryo protein expression changes? *

      5. We have not done protein expression analysis using whole embryos, or other isolated tissues in this study. This is indeed a potentially interesting future experimental comparison.

      • Throughout the manuscript, the authors state the glucose levels or cytoPfkfb3 changes the glycolytic flux. While I tend to agree with this, it is important to note that the authors have not directly measured glycolytic flux, but use the amount of accumulated lactate as a proxy. I think it is important to add this disclaimer at important points in the manuscript, such that readers are aware of this point. *

      6. We fully agree with the reviewer and now have added the following sentence in the first result section to make this point clearer to the reader (line 126-128):

      "Throughout this study, we used quantification of secreted lactate as a proxy for glycolytic flux due to the inability to directly measure flux in embryonic tissues."

      Another aspect for changing FBP levels could be connected on what was found in yeast, where the FBP levels were found to oscillate with the cell cycle (https://pubmed.ncbi.nlm.nih.gov/31885198/). Could this be connected with the pattern formation here?

      7. This is indeed an interesting aspect to discuss; in the absence of experimental evidence connecting the observed pattern formation and cell cycle (though some classic work had suggested its existence) we have decided to omit the discussion of this potential link.

      • Line 606: The mentioned review article also covers yeast. As such, maybe the authors should replace the term "bacteria" with "microbes"? *

      8. We modified our manuscript accordingly.

      Reviewer #1 (Significance (Required)):

      **Referees cross-commenting**

      As I mentioned in my comment, targeted metabolic perturbations are extremely difficult. Perturbing a metabolite level without at the same time perturbing the flux through this pathways is difficult (of not impossible). Also, the opposite is the case.

      I am not sure whether experiments as the one suggested by reviewer 2 (comment 1) will really lead to results from which further conclusions can be drawn. Furthermore, there does not need to be a linear correlation between the extracellular glucose concentration and metabolic flux/FBP levels (as my reviewer colleague implies). Thus, I am not sure whether doing this experiment makes sense, or would lead to strengthened conclusions.

      Reviewer 2 also states "The lack of proven mechanism for the activity of FBP might restrict the real general impact of this work." I agree that we do not know the downstream targets of FBP, but finding them would likely require many years of additional work. Such work will not be initiated if this paper is not published, and it would be a pity if it would be further delayed. I feel that the evidence is strong enough that FBP has an important role and with this paper published, it will motivate others to look for the downstream targets.

      Reviewer 3 makes the point: "Given that FBP levels are highly correlated with extracellular glucose levels (which impact glycolytic flux )(TeSlaa and Teitell, 2014) the authors should elaborate on why progressive increase in extracellular glucose does not affect PSM patterning, in the same way that increasing FBP levels does. " Here, I feel my reviewer colleague might be overlooking that in biochemistry molecular interactions typically reach a saturation at some point. The correlation between extracellular glucose and glycolytic flux has likely only a range where these two measures linearly correlate. Similarily, the correlation between glycolytic flxu and FBP likely also exists only within a certain range, and finally FBP levels and the downstream targets likely also only linearly interact within bounds. Thus, the absence of a correlation at "extremes" does by no mean mean that what the authors propose is incorrect. In fact, it just shows what you expect from biomolecular interactions that there a limits to linear correlations.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      *Summary. *

      *The work described in this paper first searches for potential sentinel metabolites of glycolytic flux, focusing on the process of somitogenesis during mouse embryonic development. By measuring the levels of different metabolites in the presomitic mesoderm (PSM) of E10.5 mouse embryos cultured in the presence of three different glucose concentrations, the authors identify 14 metabolites whose concentration rises with increasing glucose concentration in the culture medium. Among them, they selected fructose 1,6-bisphosphate (FBP) for further analyses, as it showed the highest linear correlation with extracellular glucose concentrations. They then show that addition of FBP to the incubation medium of cultured embryo tails interfere with somitogenesis and tail extension in a concentration-dependent fashion. In addition, they show that this effect is exacerbated when extracellular glucose levels are increased. By analyzing specific targets of Wnt and Fgf signaling, the authors also show that addition of FBP down-regulates both signaling pathways in the PSM. They then use a genetic trick (ubiquitous overexpression of cytoPfkfb3) to increase FBP levels by allosteric activation of Pfk (the enzyme that produces FBP) in developing embryos. When tails from these transgenic embryos were cultured in vitro and exposed to various glucose concentrations somitogenesis was affected in a way resembling the effects of FBP on cultured tails from wild type embryos. The authors then go on to determine the subcellular localization of different proteins in tails incubated in the presence of various FBP concentrations to identify that some enzymes involved in the glycolytic pathway (and they specifically focus on Pfkl and Aldoa) are excluded from nuclear fractions at high FBP concentrations. The authors conclude that FBP functions as a flux-signaling metabolite connecting glycolysis and PSM patterning, potentially through modulating subcellular protein localization. *

      *Major comments *

      *I think that in general the work described in this manuscript has been performed to the highest technical standards. However, I do not think that I can agree with the authors' conclusions (that FBP connects glycolysis with PSM patterning and that subcellular localization of glycolytic enzymes play a role in this process), which in my opinion go way beyond what can be proven by the data provided. *

      *1- Explants incubated with external glucose concentrations up to 25 mM have no obvious defects on somitogenesis or on the segmentation clock as determined by LuVeLu cycling activity. Under these conditions, explants are expected to contain very high FBP levels if this metabolite keeps its linear relationship with external glucose (in this work it was not measured beyond 10 mM glucose in the medium, where FBP concentration was already very high). This contrasts with the phenotypes observed upon exogenous supplementation of FBP, which affects somitogenesis already at 2 mM glucose. These latter results are at odds not only with the lack of phenotypic alterations under high glucose conditions, but also with the observation that exogenous addition of fructose 6-phosphate (F6P), the substrate of Pfk enzymes to generate FBP, does not alter somitogenesis. The authors take the absence of effects by incubation with F6P as a control of the specificity of FBP. However, as F6P is the natural substrate of Pfk, it is possible that supplementation of F6P also leads to an increase of FBP but in a way closer to a physiological condition. Therefore, I find it essential to determine FBP levels in tails incubated in the presence of increasing amounts of F6P, as if it increases FBP levels, similarly to what the authors described for the tails incubated with increasing glucose concentrations, it will have important implications to the interpretation of the work presented in this manuscript. *

      9. We agree with the reviewer and to directly address this central point, we have performed an extended, additional experiment, collecting 375 embryos to quantify FBP levels under five conditions with three biological replicates.

      There are two major results that we highlight here: First, we found that addition of F6P did not lead to increased FBP levels compared to control samples cultured in 10 mM glucose, which is in stark contrast to cytoPfkfb3 embryos cultured in 10 mM glucose (revised Figure 3E). Second, while increasing glucose concentration is mirrored by elevated FBP levels as we reported, we find clear evidence of saturation above a concentration of 10mM glucose: increasing glucose to 25mM does not increase FBP levels further (revised Figure 3E).

      This saturation effect seen in glucose titration, but also the absence of elevated FBP upon F6P addition, might be expected outcomes because, as also the reviewer 1 pointed out in the response, Pfk is commonly considered to be a rate-limiting enzyme in the glycolytic pathway. We now have the direct experimental data supporting this hypothesis and thank the reviewers to have initiated this additional (very involved..) experiment.

      This new data allows us to conclude more firmly on the correlation between FBP levels and phenotype: at high FBP levels, which are seen in cytoPfkfb3 samples, we observe PSM patterning defects. These high levels are not reached even at 25mM glucose or upon F6P addition, due to the saturation at the level of PFK enzymatic step. Hence, while glucose titration does elevate FBP significantly until this saturation, FBP levels are not as high as in cytoPfkfb3 samples. As a correlative finding, we see that only those conditions with very high FBP levels, or the direct addition of high levels of FBP, cause the arrest of segmentation clock activity. At moderately elevated FBP levels, observed in control explants with high glucose or in cytoPfkfb3 explants with low glucose, clock activity continues and we find a quantitative effect at the level of gene expression, i.e. Wnt signaling target downregulation (Figure S3, 5A).

      The new data has been included in the revised manuscript and the text has been adjusted accordingly:

      • (Result Part, line 245–254) "Consistently, we found that cytoPfkfb3 overexpression lifted the upper limit of FBP levels in PSM cells (Figure 3E, S4B, S4C). In control explants, FBP levels did not increase further when glucose concentration was increased from 10 mM to 25 mM. It was also the case when control explants were cultured in 20 mM of F6P (Figure 3E). These results indicate that the Pfk reaction carries a (rate-)limiting role for glycolytic flux and FBP levels, and that cytoPfkfb3 overexpression hinders the flux-regulation function of Pfk."

      • (Discussion Part, line 551–573) “Our findings suggest that flux-regulation at the level of Pfk is critical to keep FBP steady state levels within a range compatible with proper PSM patterning and segmentation. In agreement with such a rate-limiting function for Pfk, we found in glucose titration experiments that FBP levels saturated and did not further increase at glucose levels above 10 mM (Figure 3E). Along similar lines, the supplementation of high concentrations of the Pfk substrate F6P did not result in a significant increase of FBP levels, again compatible with a rate-limiting function at the level of Pfk (Figure 3E). The upper limit of glycolytic flux and FBP levels can be experimentally increased by cytoPfkfb3 overexpression (Figure 3B, 3E). We interpret the data as evidence that cytoPfkfb3 overexpression compromises the flux-control function of Pfk and hence much higher FBP (and secreted lactate) levels are reached. Such a drastic increase in glycolytic flux and FBP levels correlates with a severe PSM patterning phenotype (Figure 4), which resembles the phenotype induced by supplementation of high dose of FBP (Figure 2). Our results in mouse embryos hence provides evidence that flux regulation by Pfk, an evolutionary conserved role present from bacteria to humans, serves to maintain FBP levels below a critical threshold.”

      *The main difference between the experiments involving FBP supplementation and those involving high glucose concentrations or exogenous F6P addition is that in the later two cases increase in FBP would be restricted to the tissue(s) expressing Pfk, whereas upon FBP supplementation this metabolite would hit any tissue, regardless of whether or not it would ever be physiologically exposed to this molecule. In the case of the PSM, this might be relevant because it has been shown that there is a gradient of glycolysis, being high at the caudal tip and becoming lower at more anterior regions of the PSM, most likely mirroring the distribution of Pfk activity. Exogenous administration of FBP would flatten the gradient, which could lead to alterations in PSM patterning, whereas glucose (and eventually F6P) would not as they would increase FBP locally in the area where it is normally activated, keeping the natural gradient. *

      *On the basis of these arguments, to which extent does FBP connect glycolysis and somitogenesis under physiological conditions? *

      10. First, we would like to clarify that while indeed glycolytic activity is graded along the PSM, as other and we reported previously (reported in Bulusu et al., 2017 and Oginuma et al., 2017), the baseline expression of the entire glycolytic machinery (from glucose transport to lactate production) is very high, in all PSM cells. Hence, we see that cells all along the entire PSM have very active glycolysis, the posterior PSM being even more active.

      For this and related reasons, our interpretation about the difference seen between glucose titration/F6P addition on one side, and FBP addition/cytoPfkfb3 addition on the other side, is based on the role of Pfk in controlling either flux levels or dynamics in all PSM cells.

      Hence, while we agree that we generate experimental conditions that allow FBP levels to surpass those found in control embryos, we would like to highlight the fact that even moderate changes in flux does result in very robust functional consequences on gene expression (Figure S3, 5), as we show in this work.

      We can currently not fully address the first point raised, i.e. the role of graded flux/graded metabolite levels, due to the experimental limitations. Such a study requires, for instance, the generation of metabolite biosensor reporter lines in order to be able to monitor these changes dynamically, in space and time.

      *ESSENTIAL ADDITIONAL EXPERIMENT related to point #1: Measure FBP from PSM explants incubated under various exogenous concentrations of F6P. *

      11. We have performed this suggested experiment, which required the collection of n=375 embryos cultured under the various conditions and analysis by LC-MS to quantify metabolites. The outcome was indeed very informative (please refer to our response #9).

      *ANOTHER EXPERIMENT THAT COULD BE INFORMATIVE: measure FBP levels in PSM incubated under different glucose concentrations but instead of using the whole PSM together, dividing the PSM in posterior, medium and anterior parts (similarly to what was done in Oginuma et al, 2017, reference in the manuscript) to see if there is a gradient in FBP activation. *

      12. While in principle we agree that this experiment could be informative, we consider the proposed experiment beyond the scope of this work and technically very challenging (although possible). With a similar motivation, the development of metabolite biosensors is an alternative route that we are pursuing for future studies (for the detail, please refer to our response #10).

      *2- A similar argument could be presented for the results with the cytoPfkfb3 transgenics, as they are based on global artificial overactivation of Pfk, in addition to other possible effects of the ectopic activity of cytoPfkfb3, which were not controlled. Also, while the phenotypic alterations in the PSM in vitro, most particularly in the experiments involving incubation of the tails, are rather strong, the reported effects on somitogenesis in vivo are minor, also questioning the contribution of the in vitro conditions to the final phenotypic effects observed throughout the manuscript. *

      13. First of all, we would like to emphasize that the phenotype seen in cytoPfkfb3 embryos, i.e. the reduction of segmentation and downregulation of Wnt-target gene expression, occurs in a glucose dose dependent manner (Figure 4B and 5A). Hence, it is not the overexpression of cytoPfkfb3 per se that can account for the effects seen. But rather, increased glycolytic flux caused by the combination of transgene expression with high glucose results in functional consequences.

      In addition, ‘other possible effects’ that the reviewer is referring to should be evident in all transgenic embryos, irrespective of glucose dose. To the contrary, transgenic embryos cultured in low glucose conditions appear unaltered to control embryos.

      Second, we agree that we need to distinguish between strong phenotypes, visible at the level of clock arrest, and milder phenotypes, visible at the level of quantitative gene expression changes. It is important to note that the moderate phenotype, i.e. the quantitative gene expression changes seen in posterior PSM, are seen upon the addition of FBP at moderate levels and upon in glucose titration within the physiological concentration range, as well as in cytoPfkfb3 embryos. We take this as evidence that the effects seen in cytoPfkfb3 transgenic embryos reflect a common response also seen under physiological conditions.

      To extend this argument to the in vivo setting, we have performed additional experiments using a genetic mouse model for diabetes. As shown in our previous submission, cytoPfkfb3 transgenic animals do not exhibit a drastic in vivo phenotype when dissected at embryonic day 10.5. One interpretation of this finding is that since the cytoPfkfb3 phenotype is glucose and flux-dependent, the in vivo flux is low, reflecting low glucose concentrations described in vivo. To test the effect of increased flux in cytoPfkfb3 embryos in vivo, we therefore crossed the transgenic mice into a diabetic model called Akita, in which a point mutation in the Insulin2 gene causes high maternal glucose levels (Yoshioka et al., 1997; Wang et al., 1999). Using this experimental setup, we tested whether transgenic embryos in Akita diabetic females would manifest in vivo phenotypes.

      Indeed, we found that cytoPfkfb3 transgenic embryos developing in Akita diabetic females showed significantly increased cases of neural tube closure defects (50% of cytoPfkfb3 embryos) and developmental delay (control: 38 somites vs. cytoPfkfb3: 34 somites at E10.5), defects not seen in transgenic cytoPfkfb3 embryos from control females (please refer to Figure R2 below). This dependency of the in vivo phenotype on maternal glucose conditions again highlights that the defects observed in cytoPfkfb3 embryos are not due to the expression of cytoPfkfb3 per se, but are rather directly linked to increased/unregulated glycolytic flux.

      We included the new in vivo data in the revised Figure S5D-E and modified the text accordingly.

      Figure R2. In vivo phenotype of cytoPfkfb3 embryos grown in diabetic Akita females. (A) The number of somites in control (Ctrl) and cytoPfkfb3 (Tg) E10.5 embryos grown in diabetic Akita females. (B) In situ hybridization of Msgn, Uncx4.1, and Shh mRNAs in Ctrl and Tg E10.5 embryos grown in diabetic Akita females (ss, somite stage; scale bar, 500 µm).

      In conclusion, combining the arguments in the two previous comments, to which extent the results from the addition of FBP or from the transgenic activation of Pfk are not artefactual phenotypes without real physiological relevance?

      14. In our view, two main conclusions, both in vivo and in vitro, can be drawn based on the result we obtained:

      First, we find that a moderate increase in glycolytic flux, within the physiological range, leads to a quantitative and consistent change in gene expression, such as downregulation of Wnt target genes (Figure S3, 5). Such a phenotype was the result of either glucose titration or culturing cytoPfkfb3-transgenic embryos in low glucose concentration.

      In these conditions, while overall PSM patterning is qualitatively normal, we do find consistent changes at quantitative level, i.e. gene expression changes, which are also mirrored by a reduced rate of segmentation (Figure 4B). A detailed analysis of the quantitative changes at the level of segmentation clock dynamics is being carried out and will be presented in a dedicated follow up study.

      Second, we find that a very significant increase in FBP levels, i.e. when cytoPfkfb3 transgenic animals are cultured in high glucose conditions or when samples are cultured in high levels of FBP, PSM patterning is qualitatively altered and segmentation clock ceases to oscillate. In this case, we agree that it is not a physiological condition, as such high levels of flux and FBP are not reached in control samples which have intact flux regulation by Pfk. Nevertheless, such an experimental condition can be insightful, as it very clearly reveals the potential link between glycolysis, clock activity, PSM patterning and the Wnt signaling pathway.

      It is the combination between the moderate and the more severe effects, observed both in vitro, and now also in vivo using the Akita model (see above), that we take as evidence for an intrinsic, physiological link between glycolytic activity, PSM patterning and signaling.

      *3- The authors seem to give a strong functional meaning to the absence of Pfkl and Aldoa from the nuclear fraction in tails incubated with exogenous FBP, suggesting a "moonlighting" function of these enzymes under FBP regulation. In addition to the purely speculative nature of this interpretation (there is no proof for such activity or even an attempt to test it), the data provided is also difficult to interpret for various reasons. *

      15. We fully agree that we do not show a functional role for either the nuclear localization of enzymes or their dynamic change in sub-cellular localization and have tried to express this clearly in the original manuscript:

      • (Result Part, line 382-388) “While we have not been able to address the functional consequence of specific changes in subcellular localization, such as the nuclear depletion of Pfkl or Aldoa when glycolytic flux is increased, these results pave the way for future investigations on the mechanistic underpinning of how metabolic state is linked to cellular signaling and functions.”

      • (Discussion Part, line 575-577): “While future studies will need to reveal if nuclear localization of glycolytic enzymes is linked to their moonlighting functions or metabolic compartmentalization…”

      Based on this comment by the reviewer, we have further emphasised this point in the revised manuscript(line 635-639):

      “While we do not have any direct functional evidence so far for a functional role of nuclear localized glycolytic enzymes, our findings do raise the question whether their subcellular compartmentalization is linked to a non-metabolic, moonlighting function.”

      The protein levels in nuclear fractions are clearly much lower than those in the cytoplasm (this is best seen in the blots of Figure 6D). Does this represent similar subcellular distribution of these enzymes throughout the tissue or the different levels result from the presence of the enzymes in the nucleus of only a subset of the cells? This might be of importance to understand the possible relevance of the subcellular distribution of those enzymes. All the analyses were done on bulk tissue and, therefore, it is not possible to distinguishing between these possibilities. As the authors have antibodies for these enzymes, they could try to perform immunofluorescence analyses, which would provide spatial data.

      16: We agree that a spatially resolved analysis of the subcellular localization of these various enzymes is needed. Unfortunately, the immunofluorescence experiments that we performed did not yield clear, reliable results and hence we can’t provide the answer at this time.

      *In addition to this, it would be important to determine Pfkl and Aldoa subcellular localization in explants incubated with different external concentrations of glucose, which in a way reproduces better possible physiological effects (see point 1), to see if under those conditions high FBP also affects subcellular distribution of those enzymes. *

      17: Please find our response under #4 (attached below), as this important point was also raised by the reviewer 1.

      *(Our response #4) *

      *#4. We agree with the reviewer that based on the findings, one would expect the phenotype, i.e. in this case translocation of proteins, to correlate with FBP levels. Two of our results are of note in this regard. *

      *First, our data indicates that in order to see the effect on protein localization, high levels of FBP have to be reached. Accordingly, we find that Pfkl becomes depleted from the nuclear-cytoskeletal fraction in cytoPfkfb3 explants when cultured in 10 mM glucose but not (visibly) in 2.0 mM glucose (Figure 7D). Corresponding to this, FBP levels in cytoPfkfb3 explants show a significant increase (about 3-fold) from 2.0 to 10 mM glucose conditions (revised Figure 3E). *

      *Second, in control samples, FBP levels saturate in high glucose conditions. FBP levels in control samples do not further increase when glucose concentration is increased from 10mM to 25mM, and thus it does not become as high as in cytoPfkfb3 embryos cultured in 10 mM glucose (revised Figure 3E). *

      *Therefore, in order to reveal the translocation, it requires an experimental strategy that leads to significantly increased FBP levels, such as in cytoPfkfb3 explants with high glucose condition, or alternatively, direct supplementation of FBP. *

      As also pointed out by the other reviewers, we are experimentally generating controlled conditions that exceed the physiological range which the embryo is exposed to. Accordingly, our data does not constitute evidence that under physiological conditions an alteration of protein localization in response to change in glycolytic flux and FBP levels occurs, at a smaller scale.

      We regard our approach as a first step to reveal potential mechanisms and so far hidden possible responses to changes in metabolic flux. In order to see minor changes in translocation upon small changes in glycolytic-flux/FBP levels, more quantitative approaches, such as live-imaging of tagged proteins, will need to be developed. We hence decided to include these discussion in our revised manuscript (line 657-666):

      “Of note, the translocation of proteins was observed only when high levels of FBP were reached upon direct FBP supplementation or cytoPfkfb3 overexpression with high glucose (Figure 6, 7). Future studies hence need to investigate whether flux-dependent change in protein localization occurs upon moderate and more physiological changes in glycolytic-flux/FBP levels. To this end, the development of more quantitative approaches, such as live-imaging of tagged enzymes and the development of metabolite biosensors, are needed.”

      SUGGESTED ADDITIONAL EXPERIMENTS related to point #3:

      *3a- Analysis of subcellular localization of Pfkl and Aldoa by Immunofluorescence. This analysis is not limited by the amount of biological material available, so it could be applied to different experimental conditions. *

      18. We addressed this point in our response #15.

      *3b- Subcellular distribution of Pfkl and Aldoa in explants exposed to different exogenous glucose concentrations. As this involves wild type embryos, it can be done following similar protocols as in figures 6 and 7 of the manuscript. *

      19. We addressed this point in our response #16.

      4- The results from the work presented in this manuscript would indirectly indicate a negative relationship between glycolysis and somitogenesis. This contrasts with previous reports indicating the essential role of aerobic glycolysis for the same process. There is no explanation for this apparent (and important) contradiction (the authors only comment the discrepancy between the data provided in this paper and previous reports in what concerns the relationship between glycolysis and Wnt signalling, although they also do not provide an explanation).

      19. We cannot resolve this discrepancy, but now offer a more detailed discussion, also based on the additional data we obtained.

      First, it is important to point out that we have performed additional experiments to substantiate this part of the work, i.e. a transcriptome analysis with control and cytoPfkfb3 explants cultured in 10 mM glucose. We decided to focus on an early time point, i.e. three-hour after incubation, in order to increase the chance to score the primary response of PSM cells upon changes in glycolytic flux. In addition, our nanostring data in Figure S3 shows that glucose titration can change the expression levels of some Wnt-targets in both directions, i.e. decreasing glucose upregulates their expressions while increasing glucose downregulates their expressions. Again, this analysis was done at short time-scales to score the immediate effect.

      One possible explanation regarding the difference to Oginuma et al. could indeed be the late time point of analysis in their study, i.e. 16-hour after culture. This difference in sampling time, i.e. 3-hour vs. 16-hour after culture, is of particular importance given the dynamic nature of metabolic and signaling responses.

      We have added a sentence to explain this point in more detail (line 608-617):

      “This discrepancy could relate to the time point of analysis: while Oginuma et al. mainly focused on analyzing samples 16-hour after metabolic changes, we chose to score the effects of altered glycolytic flux/FBP levels already after a three-hour incubation, with the goal to capture the primary response of PSM cells. Whether the difference in sampling time underlies the observed difference is yet unknown, but both studies highlight that Wnt signaling is responsive to glycolytic flux, supporting a tight link between metabolism and PSM development.”

      Minor comments.

      *It was not specified the tissue used for the Western blot analyses (was it the PSM alone, the whole tails including somites, etc). This is of relevance to comment #3. *

      20. PSM explants without somites were cultured for one/three-hour and were subjected to subcellular protein fractionation. This information is now included in the revised method section.

      Reviewer #2 (Significance (Required)):

      -The work described in this manuscript identifies FBP as a sentinel metabolite for the glycolytic flux. This, itself has the potential to be important for different processes in which differences in glycolysis makes a difference, although I do not think that this will be relevant for the developmental process on which the authors focused their study (see major comments #1 and 2). Indeed, the lethality of global transgenic cytoPfkfb3 expression (although it was not analyzed if it was during development of in postnatal stages, or the cause of this lethality) but with very minor effects on somitogenesis in vivo supports this conclusion.

      21. Please see our detailed comments also based on the newly added in vivo experiments done with the Akita diabetic mouse model in our responses #9–14.

      *- The potential moonlighting activity of Pfk (connected with specific subcellular localization), is an interesting idea but so far does not go beyond pure speculation. This is prone to the typical double edged effect of stimulating research in that direction but also the potential negative effect of being taken for granted without rigorous proof. *

      22: We have added a statement to highlight the nature of this finding and the requirement for follow up studies both in this and other contexts. Please refer to our response #15 for the details.

      • The importance of metabolism in general and glycolysis in particular for somitogenesis and axial extension has been recently reported (the relevant papers are cited in the manuscript) and therefore the work described in this manuscript extends those studies. Also, the recent observations that metabolic process can influence cell activity beyond their participation on the classical pathways in which they are involved, including processes apparently as distant as epigenetic regulation of gene activity (see for instance Tarazona and Pourquie, 2020, Dev Cell 54, 282-292), is opening new perspectives to the study of the influence of metabolism on physiological and pathological processes (championed by cancer and immunological response). It also provides a link between control mechanisms across large scale phylogeny, from procaryotes to eukaryotes.

      -In principle, the potential audience for this work could be wide, as the interest in understanding the involvement of metabolism in the regulation of physiological and pathological processes has been growing over the last years. However, the lack of proven mechanism for the activity of FBP might restrict the real general impact of this work. In this regard, the suggestion that it might control some type of still unknown moonlighting activity of Pfk is so far totally speculative.

      • I am a developmental biologist with strong focus on mechanisms of somitogenesis and axial extension in vertebrate embryos. There is no part of this work for which I do not feel competent to evaluate.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      *Summary - *

      *In the present manuscript, Miyazawa and colleagues explore the role of glycolytic flux on embryonic development by using presomitic mesoderm (PSM) patterning as a model. *

      *First, the authors examined the steady-state levels of central carbon metabolism metabolites in PSM explants. Explants were cultured in various concentrations of glucose and subjected to gas chromatography mass spectrometry (GC-MS). These experiments allowed the identification of metabolites (such as lactate, 3PG, and FBP) that exhibit a linear correlation with glucose levels and can therefore serve as sentinel metabolites for glycolytic flux in PSM cells. Among the metabolites identified, fructose 1,6-bisphosphate (FBP) showed the strongest linear correlation with glucose levels and was used to inform the design of subsequent experiments. *

      *Second, to elucidate the functional role of FBP on PSM patterning, the authors supplement the media used to culture PSM explants with various concentrations of FBP and: *

      *- analyze the dynamics of Notch signaling (a critical player in mesoderm segmentation during embryogenesis) using real-time imaging of the LuVeLu reporter; *

      *- assess gene expression patterns using in situ hybridization of candidate genes. *

      *The authors find that supplementation with FBP, but not F6P or 3PG, impairs mesoderm segmentation and disrupts the activity of the segmentation clock in the posterior PSM. Furthermore, FBP supplementation led to the reduced expression of FGF- and WNT-target genes Dusp4 and Msgn, respectively. *

      *Third, the authors generate a conditional cytoPfkfb3 transgenic mouse line in which a cytoplasmic form of the Pfkfb3 enzyme is overexpressed. Pfkfb3 can promote glycolysis, and more importantly, leads to increased levels of FBP in a glucose-dependent manner. The authors find that cytoPfkfb3 transgenic PSM explants contain higher levels of FBP and secrete lactate at higher levels when compared to control explants. Importantly, cytoPfkfb3 transgenic PSM explants exhibit impaired somite formation and reduced expression of Msgn (but not Dusp4) in a glucose-dependent manner when compared to control explants. *

      Finally, the authors investigate changes in protein subcellular localization in their pharmacological and genetic models of FBP-driven glycolytic flux activation. This was prompted by previous reports on the changes in subcellular localization of glycolytic enzymes (Hu et al., 2016). To this end, the authors perform proteome-wide cell-fractionation analyses in drug-treated and cytoPfkfb3 transgenic PSM explants and find that certain glycolytic proteins exhibit altered subcellular localization in both cases (albeit in different fractions).

      *Major concerns: *

      *- (Re: Results from Fig. 2 and Fig. S1.) *

      *o Given that FBP levels are highly correlated with extracellular glucose levels (which impact glycolytic flux )(TeSlaa and Teitell, 2014) the authors should elaborate on why progressive increase in extracellular glucose does not affect PSM patterning, in the same way that increasing FBP levels does. This is especially important given the claim that FBP is a sentinel metabolite of glycolytic flux. *

      23. This important point was also addressed by the reviewer 2, so please see our responses that are also listed under #9, #10, #14 (attached below).

      *(Our response #9) *

      *We agree with the reviewer and to directly address this central point, we have performed an extended, additional experiment, collecting 375 embryos to quantify FBP levels under five conditions with three biological replicates. *

      *There are two major results that we highlight here: First, we found that addition of F6P did not lead to increased FBP levels compared to control samples cultured in 10 mM glucose, which is in stark contrast to cytoPfkfb3 embryos cultured in 10 mM glucose (revised Figure 3E). Second, while increasing glucose concentration is mirrored by elevated FBP levels as we reported, we find clear evidence of saturation above a concentration of 10mM glucose: increasing glucose to 25mM does not increase FBP levels further (revised Figure 3E). *

      This saturation effect seen in glucose titration, but also the absence of elevated FBP upon F6P addition, might be expected outcomes because, as also the reviewer 1 pointed out in the response, Pfk is commonly considered to be a rate-limiting enzyme in the glycolytic pathway. We now have the direct experimental data supporting this hypothesis and thank the reviewers to have initiated this additional (very involved..) experiment.

      *This new data allows us to conclude more firmly on the correlation between FBP levels and phenotype: at high FBP levels, which are seen in cytoPfkfb3 samples, we observe PSM patterning defects. These high levels are not reached even at 25mM glucose or upon F6P addition, due to the saturation at the level of PFK enzymatic step. Hence, while glucose titration does elevate FBP significantly until this saturation, FBP levels are not as high as in cytoPfkfb3 samples. As a correlative finding, we see that only those conditions with very high FBP levels, or the direct addition of high levels of FBP, cause the arrest of segmentation clock activity. At moderately elevated FBP levels, observed in control explants with high glucose or in cytoPfkfb3 explants with low glucose, clock activity continues and we find a quantitative effect at the level of gene expression, i.e. Wnt signaling target downregulation (Figure 5A, S3). *

      The new data has been included in the revised manuscript and the text has been adjusted accordingly:

      - (Result Part, line 245–254) "Consistently, we found that cytoPfkfb3 overexpression lifted the upper limit of FBP levels in PSM cells (Figure 3E, S4B, S4C). In control explants, FBP levels did not increase further when glucose concentration was increased from 10 mM to 25 mM. It was also the case when control explants were cultured in 20 mM of F6P (Figure 3E). These results indicate that the Pfk reaction carries a (rate-)limiting role for glycolytic flux and FBP levels, and that cytoPfkfb3 overexpression hinders the flux-regulation function of Pfk."

      - (Discussion Part, line 551–573) “Our findings suggest that flux-regulation at the level of Pfk is critical to keep FBP steady state levels within a range compatible with proper PSM patterning and segmentation. In agreement with such a rate-limiting function for Pfk, we found in glucose titration experiments that FBP levels saturated and did not further increase at glucose levels above 10 mM (Figure 3E). Along similar lines, the supplementation of high concentrations of the Pfk substrate F6P did not result in a significant increase of FBP levels, again compatible with a rate-limiting function at the level of Pfk (Figure 3E). The upper limit of glycolytic flux and FBP levels can be experimentally increased by cytoPfkfb3 overexpression (Figure 3B, 3E). We interpret the data as evidence that cytoPfkfb3 overexpression compromises the flux-control function of Pfk and hence much higher FBP (and secreted lactate) levels are reached. Such a drastic increase in glycolytic flux and FBP levels correlates with a severe PSM patterning phenotype (Figure 4), which resembles the phenotype induced by supplementation of high dose of FBP (Figure 2). Our results in mouse embryos hence provides evidence that flux regulation by Pfk, an evolutionary conserved role present from bacteria to humans, serves to maintain FBP levels below a critical threshold.”

      (Our response #10)

      *#10. First, we would like to clarify that while indeed glycolytic activity is graded along the PSM, as other and we reported previously (reported in Bulusu et al., 2017 and Oginuma et al., 2017), the baseline expression of the entire glycolytic machinery (from glucose transport to lactate production) is very high, in all PSM cells. Hence, we see that cells all along the entire PSM have very active glycolysis, the posterior PSM being even more active. *

      *For this and related reasons, our interpretation about the difference seen between glucose titration/F6P addition on one side, and FBP addition/cytoPfkfb3 addition on the other side, is based on the role of Pfk in controlling either flux levels or dynamics in all PSM cells. *

      Hence, while we agree that we generate experimental conditions that allow FBP levels to surpass those found in control embryos, we would like to highlight the fact that even moderate changes in flux does result in very robust functional consequences on gene expression (Figure S3, 5), as we show in this work.

      *We can currently not fully address the first point raised, i.e. the role of graded flux/graded metabolite levels, due to the experimental limitations. Such a study requires, for instance, the generation of metabolite biosensor reporter lines in order to be able to monitor these changes dynamically, in space and time. *

      (Our response #14)

      *In our view, two main conclusions, both in vivo and in vitro, can be drawn based on the result we obtained: *

      *First, we find that a moderate increase in glycolytic flux, within the physiological range, leads to a quantitative and consistent change in gene expression, such as downregulation of Wnt target genes (Figure S3, 5). Such a phenotype was the result of either glucose titration or culturing cytoPfkfb3-transgenic embryos in low glucose concentration. *

      In these conditions, while overall PSM patterning is qualitatively normal, we do find consistent changes at quantitative level, i.e. gene expression changes, which are also mirrored by a reduced rate of segmentation (Figure 4B). A detailed analysis of the quantitative changes at the level of segmentation clock dynamics is being carried out and will be presented in a dedicated follow up study.

      *Second, we find that a very significant increase in FBP levels, i.e. when cytoPfkfb3 transgenic animals are cultured in high glucose conditions or when samples are cultured in high levels of FBP, PSM patterning is qualitatively altered and segmentation clock ceases to oscillate. In this case, we agree that it is not a physiological condition, as such high levels of flux and FBP are not reached in control samples which have intact flux regulation by Pfk. Nevertheless, such an experimental condition can be insightful, as it very clearly reveals the potential link between glycolysis, clock activity, PSM patterning and the Wnt signaling pathway. *

      *It is the combination between the moderate and the more severe effects, observed both in vitro, and now also in vivo using the Akita model (see above), that we take as evidence for an intrinsic, physiological link between glycolytic activity, PSM patterning and signaling. *

      - (Re: Fig. 2A and Fig. 2B)

      *o The authors should be consistent with the glucose concentrations for the experiments where they assess the dynamics of Notch signaling (Figure 2A) and gene expression (Figure 2B) or otherwise elaborate on why different concentrations are used for these assays. *

      24: We agree that ideally the experimental parameters should be as consistent as possible. In regards to the control glucose concentration used in this study, both 0.5 mM and 2.0 mM glucose were used. It reflects that over the years, minor adjustments in the experimental protocol were made, i.e. we now use 2.0 mM glucose as standard setting for all experiments, while previously, 0.5 mM glucose was used (see Bulusu et al., 2017). This change is based on the observation of a slightly improved culture outcome, in terms of reporter gene expression. We have confirmed that the developmental outcome and also effects seen upon addition of FBP are consistent at 0.5 mM and at 2.0 mM glucose. We made a note in the methods section to explain this point (line 1082-1084):

      “Basal culture condition was 0.5 mM glucose at the beginning of this study but was later switched to 2.0 mM glucose which yields a slightly improved reporter gene expression. No major difference was observed in the effects of FBP between these glucose conditions.”

      *- (Re: Results from pharmacological and genetic models of increased FBP levels) *

      *o The authors state that FBP-driven impairment of mesoderm segmentation is most pronounced in the undifferentiated PSM cells (in the posterior-most end of the explants) and is, therefore, unlikely to be due to a toxic effect that might otherwise affect the whole explant. While this is a reasonable assumption, it does not discount the possibility that the spatial specificity of the effect of FBP could be driven primarily by increased cell death in the posterior end of the explant. Thus, the authors should test whether cell death underlies the mesoderm patterning defects seen in PSM explants subjected to increased FBP levels. *

      25. We have performed immunostaining of active caspase-3 in explants cultured for three-hour in medium containing 0.5 mM glucose and 20 mM FBP and found no difference between control and FBP-treated explants (please refer to the Figure R2 below). This qualitative result does not indicate a major effect via cell death in the tail bud region (i.e. posterior PSM) as the underlying reason for the observed phenotype. We included the new data in the revised Figure S2C and adjusted the text accordingly.

      Figure R3. Immunostaining of active caspase-3 in PSM explants. Explants were cultured for three hours in the presence or absence of 20 mM of FBP. Neural tubes were outlined by white dotted lines.

      *- (Re: Gene expression experiments/analyses) *

      *o This study would benefit greatly from transcriptomic analysis of wt and cytoPfkfb3 transgenic PSM explants (and/or transcriptomic characterization of FBP-treated vs. control PSM explants). The candidate approach used to assess gene expression (through in situ hybridization) may not be sufficient to conclude that cytoPfkfb3 over-expression leads to the downregulation of Wnt signaling (a claim the authors make at the beginning of the manuscript). *

      26. We fully agree with the reviewer’s comment. We have now performed RNA-sequencing (RNAseq) analysis using control and cytoPfkfb3 explants cultured in 10 mM glucose, importantly after three hours of incubation in order to score early effects at transcriptome level (please refer to Figure R4).

      We found clear evidence that many Wnt-target genes (i.e. Axin2, Cdx4, Dact1, Dkk1, Mixl1, Msgn1, Sp5, Sp8, T) were significantly downregulated in cytoPfkfb3 explants, supporting the conclusion that Wnt signaling activity is downregulated in cytoPfkfb3 explants under high glucose condition.

      Furthermore, in order to examine similarities between the effects of cytoPfkfb3 overexpression and FBP supplementation, we also performed RNAseq analysis with explants treated with high dose of FBP or F6P. FBP supplementation resulted in downregulation of Wnt target gene expression (i.e. Dact1, Dkk1, Mixl1, Lef1, Sp5, T, Tbx6), mirroring the effects seen in cytoPfkfb3 samples. Such a response was not detected in F6P-treated explants.

      Combined, these new data significantly strengthen our conclusion that an increase in glycolytic flux and FBP levels leads to downregulation of Wnt signaling activity. The new data is now included in the revised Figure 5C–E and adjusted the texts accordingly.

      Figure R4. Transcriptome analysis of control (Ctrl) and cytoPfkfb3 (TG) PSM explants. PSM explants were cultured for three hours under different culture conditions. (A) Effects of cytoPfkfb3 overexpression on gene expression under 10 mM glucose condition. (B, C) Effects of 20 mM FBP (B) or F6P (C) on gene expression under 2.0 mM glucose condition. Wnt-target genes that were significantly downregulated in cytoPfkfb3 or FBP/F6P-treated explants are highlighted in blue.

      *- (Re: Results related to the neural tube closure defects in cytoPfkfb3 transgenic embryos) *

      *o The section of the manuscript describing the neural tube closure defects in cytoPfkfb3 transgenic embryos is superficial, lacks detail, and distracts from the focus of the study. Perhaps the data and text on neural tube closure defects should be included as supplemental information. *

      27: We agree with the reviewer that in the previous version, this data appeared isolated. It also connects with the point raised by the reviewer 2 about the in vivo significance of our findings. To address both these points, we have now performed additional in vivo experiments using a diabetic mouse model (Akita) to directly test the in vivo consequence of cytoPfkfb3, which interestingly links to the previous findings of neural tube defects. Please see our response #13 for the details (attached below):

      (Our response #13)

      *First of all, we would like to emphasize that the phenotype seen in cytoPfkfb3 embryos, i.e. the reduction of segmentation and downregulation of Wnt-target gene expression, occurs in a glucose dose dependent manner (Figure 4B and 5A). Hence, it is not the overexpression of cytoPfkfb3 per se that can account for the effects seen. But rather, increased glycolytic flux caused by the combination of transgene expression with high glucose results in functional consequences. *

      In addition, ‘other possible effects’ that the reviewer is referring to should be evident in all transgenic embryos, irrespective of glucose dose. To the contrary, transgenic embryos cultured in low glucose conditions appear unaltered to control embryos.

      *Second, we agree that we need to distinguish between strong phenotypes, visible at the level of clock arrest, and milder phenotypes, visible at the level of quantitative gene expression changes. It is important to note that the moderate phenotype, i.e. the quantitative gene expression changes seen in posterior PSM, are seen upon the addition of FBP at moderate levels and upon in glucose titration within the physiological concentration range, as well as in cytoPfkfb3 embryos. We take this as evidence that the effects seen in cytoPfkfb3 transgenic embryos reflect a common response also seen under physiological conditions. *

      *To extend this argument to the in vivo setting, we have performed additional experiments using a genetic mouse model for diabetes. As shown in our previous submission, cytoPfkfb3 transgenic animals do not exhibit a drastic in vivo phenotype when dissected at embryonic day 10.5. One interpretation of this finding is that since the cytoPfkfb3 phenotype is glucose and flux-dependent, the in vivo flux is low, reflecting low glucose concentrations described in vivo. To test the effect of increased flux in cytoPfkfb3 embryos in vivo, we therefore crossed the transgenic mice into a diabetic model called Akita, in which a point mutation in the Insulin2 gene causes high maternal glucose levels (Yoshioka et al., 1997; Wang et al., 1999). Using this experimental setup, we tested whether transgenic embryos in Akita diabetic females would manifest in vivo phenotypes. *

      Indeed, we found that cytoPfkfb3 transgenic embryos developing in Akita diabetic females showed significantly increased cases of neural tube closure defects (50% of cytoPfkfb3 embryos) and developmental delay (control: 38 somites vs. cytoPfkfb3: 34 somites at E10.5), defects not seen in transgenic cytoPfkfb3 embryos from control females (please refer to Figure R2 below). This dependency of the in vivo phenotype on maternal glucose conditions again highlights that the defects observed in cytoPfkfb3 embryos are not due to the expression of cytoPfkfb3 per se, but are rather directly linked to increased/unregulated glycolytic flux.

      We included the new in vivo data in the revised Figure S5D-E and modified the text accordingly.

      *Figure R2. In vivo phenotype of cytoPfkfb3 embryos grown in diabetic Akita females. (A) The number of somites in control (Ctrl) and cytoPfkfb3 (Tg) E10.5 embryos grown in diabetic Akita females. (B) In situ hybridization of Msgn, Uncx4.1, and Shh mRNAs in Ctrl and Tg E10.5 embryos grown in diabetic Akita females (ss, somite stage; scale bar, 500 µm). *

      • (Re: Conclusions of the study)

      o A previous study by Oginuma et al., 2020 provided strong evidence for a mechanism underlying the positive regulation of Wnt signaling by glycolysis (initiated by the elevation of intracellular pH) in the chick embryo tailbud. As mentioned in the discussion, the results of the present study are not consistent with this mode - and this contradiction is not sufficiently resolved. This is a concern, given that the evidence that cytoPfkfb3 inhibits Wnt signaling is sparse (see above).

      28: This important point was also raised by the reviewer 2, please see our response as listed under #19 (attached below).

      (Our response #19)

      *We cannot resolve this discrepancy, but now offer a more detailed discussion, also based on the additional data we obtained. *

      *First, it is important to point out that we have performed additional experiments to substantiate this part of the work, i.e. a transcriptome analysis with control and cytoPfkfb3 explants cultured in 10 mM glucose. We decided to focus on an early time point, i.e. three-hour after incubation, in order to increase the chance to score the primary response of PSM cells upon changes in glycolytic flux. In addition, our nanostring data in Figure S3 shows that glucose titration can change the expression levels of some Wnt-targets in both directions, i.e. decreasing glucose upregulates their expressions while increasing glucose downregulates their expressions. Again, this analysis was done at short time-scales to score the immediate effect. *

      *One possible explanation regarding the difference to Oginuma et al. could indeed be the late time point of analysis in their study, i.e. 16-hour after culture. This difference in sampling time, i.e. 3-hour vs. 16-hour after culture, is of particular importance given the dynamic nature of metabolic and signaling responses. *

      We have added a sentence to explain this point in more detail (line 608-617):

      “This discrepancy could relate to the time point of analysis: while Oginuma et al. mainly focused on analyzing samples 16-hour after metabolic changes, we chose to score the effects of altered glycolytic flux/FBP levels already after a three-hour incubation, with the goal to capture the primary response of PSM cells. Whether the difference in sampling time underlies the observed difference is yet unknown, but both studies highlight that Wnt signaling is responsive to glycolytic flux, supporting a tight link between metabolism and PSM development.”

      *o Another discrepancy lies in the lack of an observable phenotype when culturing mouse PSM explants at very low glucose concentrations (e.g., 0.5 mM in Fig. 2A). Oginuma et al. observed clear disruptions to embryonic elongation and somite formation at a glucose concentration equal to 0.83 mM. Would this be due to species-specific mechanisms? Furthermore, while the authors focus on sentinel metabolites (such as FBP), experiments involving direct manipulation in glycolysis could resolve some of these inconsistencies. *

      29: Indeed species specific differences in the requirement for glucose are to be expected. Our extensive analysis shows that at 0.5mM glucose, segmentation and elongation proceeds (Bulusu et al., 2017).

      Regarding the second point, we have outlined several strategies to directly perturb glycolysis, i.e. glucose titration (mirrored by increase in lactate secretion) and by genetic targeting of the rate-limiting enzyme, Pfk. Glucose titration in wild-type embryos corresponds to the experiment the reviewer suggested, and we again found that higher glucose (i.e. higher flux) leads to down regulation of several Wnt-target genes (Figure S3). Of note, also in cytoPfkfb3 explants the effects are glucose-dose dependent (again mirrored by increase of lactate secretion), clearly indicating that we successfully and directly controlled glycolysis.

      *References - *

        • Hu, Hai, et al. "Phosphoinositide 3-kinase regulates glycolysis through mobilization of aldolase from the actin cytoskeleton." Cell 164.3 (2016): 433-446. *
        • TeSlaa, Tara, and Michael A. Teitell. "Techniques to monitor glycolysis." Methods in enzymology 542 (2014): 91-114. *
        • Oginuma, Masayuki, et al. "Intracellular pH controls WNT downstream of glycolysis in amniote embryos." Nature584.7819 (2020): 98-101. * *Reviewer #3 (Significance (Required)): *

      The experimental results reported in this study enhance our understanding of how cellular metabolic states regulate cellular behaviors during embryonic development. The study provides insight into how PSM elongation is controlled by morphogenetic mechanisms that are modulated by glycolytic flux. One of the strengths of the study is the use of an interdisciplinary approach that includes GC-MS, in vivo imaging and mouse transgenic lines. It should be noted that some of the conclusions of the study diverge from previous papers that examine the role of metabolism in developmental patterning (e.g., Oginuma et al., 2020).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      In the present manuscript, Miyazawa and colleagues explore the role of glycolytic flux on embryonic development by using presomitic mesoderm (PSM) patterning as a model.

      First, the authors examined the steady-state levels of central carbon metabolism metabolites in PSM explants. Explants were cultured in various concentrations of glucose and subjected to gas chromatography mass spectrometry (GC-MS). These experiments allowed the identification of metabolites (such as lactate, 3PG, and FBP) that exhibit a linear correlation with glucose levels and can therefore serve as sentinel metabolites for glycolytic flux in PSM cells. Among the metabolites identified, fructose 1,6-bisphosphate (FBP) showed the strongest linear correlation with glucose levels and was used to inform the design of subsequent experiments.

      Second, to elucidate the functional role of FBP on PSM patterning, the authors supplement the media used to culture PSM explants with various concentrations of FBP and: - analyze the dynamics of Notch signaling (a critical player in mesoderm segmentation during embryogenesis) using real-time imaging of the LuVeLu reporter; - assess gene expression patterns using in situ hybridization of candidate genes. The authors find that supplementation with FBP, but not F6P or 3PG, impairs mesoderm segmentation and disrupts the activity of the segmentation clock in the posterior PSM. Furthermore, FBP supplementation led to the reduced expression of FGF- and WNT-target genes Dusp4 and Msgn, respectively.

      Third, the authors generate a conditional cytoPfkfb3 transgenic mouse line in which a cytoplasmic form of the Pfkfb3 enzyme is overexpressed. Pfkfb3 can promote glycolysis, and more importantly, leads to increased levels of FBP in a glucose-dependent manner. The authors find that cytoPfkfb3 transgenic PSM explants contain higher levels of FBP and secrete lactate at higher levels when compared to control explants. Importantly, cytoPfkfb3 transgenic PSM explants exhibit impaired somite formation and reduced expression of Msgn (but not Dusp4) in a glucose-dependent manner when compared to control explants.

      Finally, the authors investigate changes in protein subcellular localization in their pharmacological and genetic models of FBP-driven glycolytic flux activation. This was prompted by previous reports on the changes in subcellular localization of glycolytic enzymes (Hu et al., 2016). To this end, the authors perform proteome-wide cell-fractionation analyses in drug-treated and cytoPfkfb3 transgenic PSM explants and find that certain glycolytic proteins exhibit altered subcellular localization in both cases (albeit in different fractions).

      Major concerns:

      1. (Re: Results from Fig. 2 and Fig. S1.)
        • Given that FBP levels are highly correlated with extracellular glucose levels (which impact glycolytic flux )(TeSlaa and Teitell, 2014) the authors should elaborate on why progressive increase in extracellular glucose does not affect PSM patterning, in the same way that increasing FBP levels does. This is especially important given the claim that FBP is a sentinel metabolite of glycolytic flux.
      2. (Re: Fig. 2A and Fig. 2B)
        • The authors should be consistent with the glucose concentrations for the experiments where they assess the dynamics of Notch signaling (Figure 2A) and gene expression (Figure 2B) or otherwise elaborate on why different concentrations are used for these assays.
      3. (Re: Results from pharmacological and genetic models of increased FBP levels)
        • The authors state that FBP-driven impairment of mesoderm segmentation is most pronounced in the undifferentiated PSM cells (in the posterior-most end of the explants) and is, therefore, unlikely to be due to a toxic effect that might otherwise affect the whole explant. While this is a reasonable assumption, it does not discount the possibility that the spatial specificity of the effect of FBP could be driven primarily by increased cell death in the posterior end of the explant. Thus, the authors should test whether cell death underlies the mesoderm patterning defects seen in PSM explants subjected to increased FBP levels.
      4. (Re: Gene expression experiments/analyses)
        • This study would benefit greatly from transcriptomic analysis of wt and cytoPfkfb3 transgenic PSM explants (and/or transcriptomic characterization of FBP-treated vs. control PSM explants). The candidate approach used to assess gene expression (through in situ hybridization) may not be sufficient to conclude that cytoPfkfb3 over-expression leads to the downregulation of Wnt signaling (a claim the authors make at the beginning of the manuscript).
      5. (Re: Results related to the neural tube closure defects in cytoPfkfb3 transgenic embryos)
        • The section of the manuscript describing the neural tube closure defects in cytoPfkfb3 transgenic embryos is superficial, lacks detail, and distracts from the focus of the study. Perhaps the data and text on neural tube closure defects should be included as supplemental information.
      6. (Re: Conclusions of the study)
        • A previous study by Oginuma et al., 2020 provided strong evidence for a mechanism underlying the positive regulation of Wnt signaling by glycolysis (initiated by the elevation of intracellular pH) in the chick embryo tailbud. As mentioned in the discussion, the results of the present study are not consistent with this mode - and this contradiction is not sufficiently resolved. This is a concern, given that the evidence that cytoPfkfb3 inhibits Wnt signaling is sparse (see above).
      7. Another discrepancy lies in the lack of an observable phenotype when culturing mouse PSM explants at very low glucose concentrations (e.g., 0.5 mM in Fig. 2A). Oginuma et al. observed clear disruptions to embryonic elongation and somite formation at a glucose concentration equal to 0.83 mM. Would this be due to species-specific mechanisms? Furthermore, while the authors focus on sentinel metabolites (such as FBP), experiments involving direct manipulation in glycolysis could resolve some of these inconsistencies.

      References

      1. Hu, Hai, et al. "Phosphoinositide 3-kinase regulates glycolysis through mobilization of aldolase from the actin cytoskeleton." Cell 164.3 (2016): 433-446.
      2. TeSlaa, Tara, and Michael A. Teitell. "Techniques to monitor glycolysis." Methods in enzymology 542 (2014): 91-114.
      3. Oginuma, Masayuki, et al. "Intracellular pH controls WNT downstream of glycolysis in amniote embryos." Nature584.7819 (2020): 98-101.

      Significance

      The experimental results reported in this study enhance our understanding of how cellular metabolic states regulate cellular behaviors during embryonic development. The study provides insight into how PSM elongation is controlled by morphogenetic mechanisms that are modulated by glycolytic flux. One of the strengths of the study is the use of an interdisciplinary approach that includes GC-MS, in vivo imaging and mouse transgenic lines. It should be noted that some of the conclusions of the study diverge from previous papers that examine the role of metabolism in developmental patterning (e.g., Oginuma et al., 2020).

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The work described in this paper first searches for potential sentinel metabolites of glycolytic flux, focusing on the process of somitogenesis during mouse embryonic development. By measuring the levels of different metabolites in the presomitic mesoderm (PSM) of E10.5 mouse embryos cultured in the presence of three different glucose concentrations, the authors identify 14 metabolites whose concentration rises with increasing glucose concentration in the culture medium. Among them, they selected fructose 1,6-bisphosphate (FBP) for further analyses, as it showed the highest linear correlation with extracellular glucose concentrations. They then show that addition of FBP to the incubation medium of cultured embryo tails interfere with somitogenesis and tail extension in a concentration-dependent fashion. In addition, they show that this effect is exacerbated when extracellular glucose levels are increased. By analyzing specific targets of Wnt and Fgf signaling, the authors also show that addition of FBP down-regulates both signaling pathways in the PSM. They then use a genetic trick (ubiquitous overexpression of cytoPfkfb3) to increase FBP levels by allosteric activation of Pfk (the enzyme that produces FBP) in developing embryos. When tails from these transgenic embryos were cultured in vitro and exposed to various glucose concentrations somitogenesis was affected in a way resembling the effects of FBP on cultured tails from wild type embryos. The authors then go on to determine the subcellular localization of different proteins in tails incubated in the presence of various FBP concentrations to identify that some enzymes involved in the glycolytic pathway (and they specifically focus on Pfkl and Aldoa) are excluded from nuclear fractions at high FBP concentrations. The authors conclude that FBP functions as a flux-signaling metabolite connecting glycolysis and PSM patterning, potentially through modulating subcellular protein localization.

      Major comments

      I think that in general the work described in this manuscript has been performed to the highest technical standards. However, I do not think that I can agree with the authors' conclusions (that FBP connects glycolysis with PSM patterning and that subcellular localization of glycolytic enzymes play a role in this process), which in my opinion go way beyond what can be proven by the data provided.

      1. Explants incubated with external glucose concentrations up to 25 mM have no obvious defects on somitogenesis or on the segmentation clock as determined by LuVeLu cycling activity. Under these conditions, explants are expected to contain very high FBP levels if this metabolite keeps its linear relationship with external glucose (in this work it was not measured beyond 10 mM glucose in the medium, where FBP concentration was already very high). This contrasts with the phenotypes observed upon exogenous supplementation of FBP, which affects somitogenesis already at 2 mM glucose. These latter results are at odds not only with the lack of phenotypic alterations under high glucose conditions, but also with the observation that exogenous addition of fructose 6-phosphate (F6P), the substrate of Pfk enzymes to generate FBP, does not alter somitogenesis. The authors take the absence of effects by incubation with F6P as a control of the specificity of FBP. However, as F6P is the natural substrate of Pfk, it is possible that supplementation of F6P also leads to an increase of FBP but in a way closer to a physiological condition. Therefore, I find it essential to determine FBP levels in tails incubated in the presence of increasing amounts of F6P, as if it increases FBP levels, similarly to what the authors described for the tails incubated with increasing glucose concentrations, it will have important implications to the interpretation of the work presented in this manuscript. The main difference between the experiments involving FBP supplementation and those involving high glucose concentrations or exogenous F6P addition is that in the later two cases increase in FBP would be restricted to the tissue(s) expressing Pfk, whereas upon FBP supplementation this metabolite would hit any tissue, regardless of whether or not it would ever be physiologically exposed to this molecule. In the case of the PSM, this might be relevant because it has been shown that there is a gradient of glycolysis, being high at the caudal tip and becoming lower at more anterior regions of the PSM, most likely mirroring the distribution of Pfk activity. Exogenous administration of FBP would flatten the gradient, which could lead to alterations in PSM patterning, whereas glucose (and eventually F6P) would not as they would increase FBP locally in the area where it is normally activated, keeping the natural gradient. On the basis of these arguments, to which extent does FBP connect glycolysis and somitogenesis under physiological conditions?

      ESSENTIAL ADDITIONAL EXPERIMENT related to point #1: Measure FBP from PSM explants incubated under various exogenous concentrations of F6P.

      ANOTHER EXPERIMENT THAT COULD BE INFORMATIVE: measure FBP levels in PSM incubated under different glucose concentrations but instead of using the whole PSM together, dividing the PSM in posterior, medium and anterior parts (similarly to what was done in Oginuma et al, 2017, reference in the manuscript) to see if there is a gradient in FBP activation. 2. A similar argument could be presented for the results with the cytoPfkfb3 transgenics, as they are based on global artificial overactivation of Pfk, in addition to other possible effects of the ectopic activity of cytoPfkfb3, which were not controlled. Also, while the phenotypic alterations in the PSM in vitro, most particularly in the experiments involving incubation of the tails, are rather strong, the reported effects on somitogenesis in vivo are minor, also questioning the contribution of the in vitro conditions to the final phenotypic effects observed throughout the manuscript.

      In conclusion, combining the arguments in the two previous comments, to which extent the results from the addition of FBP or from the transgenic activation of Pfk are not artefactual phenotypes without real physiological relevance? 3. The authors seem to give a strong functional meaning to the absence of Pfkl and Aldoa from the nuclear fraction in tails incubated with exogenous FBP, suggesting a "moonlighting" function of these enzymes under FBP regulation. In addition to the purely speculative nature of this interpretation (there is no proof for such activity or even an attempt to test it), the data provided is also difficult to interpret for various reasons. The protein levels in nuclear fractions are clearly much lower than those in the cytoplasm (this is best seen in the blots of Figure 6D). Does this represent similar subcellular distribution of these enzymes throughout the tissue or the different levels result from the presence of the enzymes in the nucleus of only a subset of the cells? This might be of importance to understand the possible relevance of the subcellular distribution of those enzymes. All the analyses were done on bulk tissue and, therefore, it is not possible to distinguishing between these possibilities. As the authors have antibodies for these enzymes, they could try to perform immunofluorescence analyses, which would provide spatial data.

      In addition to this, it would be important to determine Pfkl and Aldoa subcellular localization in explants incubated with different external concentrations of glucose, which in a way reproduces better possible physiological effects (see point 1), to see if under those conditions high FBP also affects subcellular distribution of those enzymes.

      SUGGESTED ADDITIONAL EXPERIMENTS related to point #3: 3a- Analysis of subcellular localization of Pfkl and Aldoa by Immunofluorescence. This analysis is not limited by the amount of biological material available, so it could be applied to different experimental conditions.

      3b- Subcellular distribution of Pfkl and Aldoa in explants exposed to different exogenous glucose concentrations. As this involves wild type embryos, it can be done following similar protocols as in figures 6 and 7 of the manuscript. 4. The results from the work presented in this manuscript would indirectly indicate a negative relationship between glycolysis and somitogenesis. This contrasts with previous reports indicating the essential role of aerobic glycolysis for the same process. There is no explanation for this apparent (and important) contradiction (the authors only comment the discrepancy between the data provided in this paper and previous reports in what concerns the relationship between glycolysis and Wnt signalling, although they also do not provide an explanation).

      Minor comments.

      It was not specified the tissue used for the Western blot analyses (was it the PSM alone, the whole tails including somites, etc). This is of relevance to comment #3.

      Significance

      • The work described in this manuscript identifies FBP as a sentinel metabolite for the glycolytic flux. This, itself has the potential to be important for different processes in which differences in glycolysis makes a difference, although I do not think that this will be relevant for the developmental process on which the authors focused their study (see major comments #1 and 2). Indeed, the lethality of global transgenic cytoPfkfb3 expression (although it was not analyzed if it was during development of in postnatal stages, or the cause of this lethality) but with very minor effects on somitogenesis in vivo supports this conclusion.
      • The potential moonlighting activity of Pfk (connected with specific subcellular localization), is an interesting idea but so far does not go beyond pure speculation. This is prone to the typical double edged effect of stimulating research in that direction but also the potential negative effect of being taken for granted without rigorous proof.
      • The importance of metabolism in general and glycolysis in particular for somitogenesis and axial extension has been recently reported (the relevant papers are cited in the manuscript) and therefore the work described in this manuscript extends those studies. Also, the recent observations that metabolic process can influence cell activity beyond their participation on the classical pathways in which they are involved, including processes apparently as distant as epigenetic regulation of gene activity (see for instance Tarazona and Pourquie, 2020, Dev Cell 54, 282-292), is opening new perspectives to the study of the influence of metabolism on physiological and pathological processes (championed by cancer and immunological response). It also provides a link between control mechanisms across large scale phylogeny, from procaryotes to eukaryotes.
      • In principle, the potential audience for this work could be wide, as the interest in understanding the involvement of metabolism in the regulation of physiological and pathological processes has been growing over the last years. However, the lack of proven mechanism for the activity of FBP might restrict the real general impact of this work. In this regard, the suggestion that it might control some type of still unknown moonlighting activity of Pfk is so far totally speculative.
      • I am a developmental biologist with strong focus on mechanisms of somitogenesis and axial extension in vertebrate embryos. There is no part of this work for which I do not feel competent to evaluate.
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The paper by Miyazawa and colleagues addresses a key question: How is changed metabolic activity sensed and to induce changes in developmental programs. In recent years, there is more and more indication that metabolism is not only a dull workhorse synthesizing the building blocks for new cells and providing chemical energy, but that metabolic activity itself has also a regulatory role. How this precisely works is largely unknown and even also unexplored in higher cells. From early insights obtained in microbes, it seems that certain metabolites - possibly reflecting metabolic activity (i.e. flux) - could be metabolic signals that feedback into cellular regulation. The current paper takes this idea now to developmental processes, where the authors found that the glycolytic metabolite fructose-1,6-bisphosphate is a flux-dependent signal that interferes with developmental processes. This is a very exciting finding, as it indicates that this metabolite not only has a regulatory function in microbes but also in mouse during mesoderm development. Answering the question how such a flux-dependent metabolite mechanistically interferes with the developmental processes is an enormously difficult. Compared to other mechanistic studies, where deleting genes, modifying genes, and changing protein expressions will usually do the trick, here, perturbing metabolite levels is extremely challenging, particularly if such perturbations need to be carried out in a way that nothing else is perturbed. Researchers, who are not overly familiar with metabolism, usually underestimate the difficulty with targeted and insightful perturbation of metabolism. To this end, the authors of this paper need to be congratulated for a very well carried out study with very solid data, and excellent control experiments. The authors open up a new path towards understanding how embryo mesoderm development is regulated by metabolic activity. In particular, they show that that glycolytic flux, FBP and important developmental phenotypes as well as protein localization changes are linked. As normal with a complex metabolism-based story as this one, there is always more that could be done. Yet, the results are highly important to be reported now such that the field as a whole can build on these interesting results and to explore the exciting path further that has been opened by the authors. Thus, I strongly recommend publishing these findings: The data generated by the authors are accompanied by the required control experiments. The conclusions drawn are very solid. I do not have any major concerns but just a number of minor suggestions that the authors could consider in a revised version of the manuscript.

      Minor:

      1. At the end of the introduction, the authors stated their original goal. As it is phrased, it is unclear whether this goal has been obtained or not. They might want to consider replacing the last introductory sentence by a sentence stating what the reader can find in this paper.
      2. Data from Fig 3: If you plot the lactate secretion vs the FBP levels of the controls and the overexpression experiment, would the control and the overexpression data lie on one line (maybe if combined with the data shown in Fig 1A)?
      3. Maybe the authors could attempt an experiment like the following one: Chose the strongest phenotype observed and test a combination of overexpressing cytoPfkfb3 and reducing extracellular glucose level at the same time?
      4. Can the proteomics experiments shown in Fig. 6 be repeated with high and low extracellular glucose? High glucose should yield high FBP levels and one would then expect to see the same as with the experiment where at 2 mM glucose 20 mM extracellular FBP were added. Is this the case?
      5. While the authors quantified proteins in different compartments, I was wondering whether they also looked for whole-embryo protein expression changes?
      6. Throughout the manuscript, the authors state the glucose levels or cytoPfkfb3 changes the glycolytic flux. While I tend to agree with this, it is important to note that the authors have not directly measured glycolytic flux, but use the amount of accumulated lactate as a proxy. I think it is important to add this disclaimer at important points in the manuscript, such that readers are aware of this point.
      7. Another aspect for changing FBP levels could be connected on what was found in yeast, where the FBP levels were found to oscillate with the cell cycle (https://pubmed.ncbi.nlm.nih.gov/31885198/). Could this be connected with the pattern formation here?
      8. Line 606: The mentioned review article also covers yeast. As such, maybe the authors should replace the term "bacteria" with "microbes"?

      Significance

      Referees cross-commenting

      As I mentioned in my comment, targeted metabolic perturbations are extremely difficult. Perturbing a metabolite level without at the same time perturbing the flux through this pathways is difficult (of not impossible). Also, the opposite is the case. I am not sure whether experiments as the one suggested by reviewer 2 (comment 1) will really lead to results from which further conclusions can be drawn. Furthermore, there does not need to be a linear correlation between the extracellular glucose concentration and metabolic flux/FBP levels (as my reviewer colleague implies). Thus, I am not sure whether doing this experiment makes sense, or would lead to strengthened conclusions.

      Reviewer 2 also states "The lack of proven mechanism for the activity of FBP might restrict the real general impact of this work." I agree that we do not know the downstream targets of FBP, but finding them would likely require many years of additional work. Such work will not be initiated if this paper is not published, and it would be a pity if it would be further delayed. I feel that the evidence is strong enough that FBP has an important role and with this paper published, it will motivate others to look for the downstream targets.

      Reviewer 3 makes the point: "Given that FBP levels are highly correlated with extracellular glucose levels (which impact glycolytic flux )(TeSlaa and Teitell, 2014) the authors should elaborate on why progressive increase in extracellular glucose does not affect PSM patterning, in the same way that increasing FBP levels does. " Here, I feel my reviewer colleague might be overlooking that in biochemistry molecular interactions typically reach a saturation at some point. The correlation between extracellular glucose and glycolytic flux has likely only a range where these two measures linearly correlate. Similarily, the correlation between glycolytic flxu and FBP likely also exists only within a certain range, and finally FBP levels and the downstream targets likely also only linearly interact within bounds. Thus, the absence of a correlation at "extremes" does by no mean mean that what the authors propose is incorrect. In fact, it just shows what you expect from biomolecular interactions that there a limits to linear correlations.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all reviewers for their input and suggestions.

      In the discussion section, the reviewers agreed on four major points which we have addressed as follows:

      In vivo validation of ROP1. We have now carried out mouse infections using the PRUΔKU80, PRUΔROP1, and complemented lines, showing that PRUΔROP1 is completely avirulent. This matches our in vivo CRISPR screen and in vitro IFNγ restriction assays results, and confirms that ROP1 is an important T. gondii virulence factor. We highlight the discrepancy with Soldati et al 1995, and suggest that this important role of ROP1 was previously overlooked in T. gondii RH due to the “hypervirulence” of this strain in laboratory mice. Clarify discrepancies in Irgb6 recruitment compared to published data: We revisited our image analysis pipeline for this data and corrected an error in the host cell segmentation step which was causing erroneously high calling of Irgb6 recruitment. The recruitment we now measure is now less variable and is consistent with published data, confirming that ROP1 does not affect Irgb6 recruitment or rhoptry bulb protein secretion. Conduct specific assays to measure host cell death or remove claims about this. We have carried out kinetic propidium iodide uptake assays as suggested by Reviewer #3. These have clarified that there is minimal parasite-induced host cell death at low MOI which cannot explain the host cell loss observed in the high-content imaging restriction assays. At an MOI of 0.3, ROP1 has no detectable effect on host cell death, while at a much higher MOI of 3, ROP1 knockout moderately increases cell death of PRU-infected BMDMs. Since cell death in macrophages is reported to result from exposure of parasite-derived PAMPS to host cytosolic sensors, we suggest that this increased host cell death at high MOI is a secondary effect of vacuole disrustion. Alternatively, these findings raise the interesting possibility of a differential phenotype for ROP1 at low versus high parasite burden. Queries regarding the restriction assays in C1QBPflox/flox/-/- MEFs. We included these data as we found statistically significant differences between the C1QBPflox/flox and C1QBP-/- MEFs that were dependent on the presence of ROP1 in the parasites. However, given the concerns raised by all three reviewers regarding the apparent lack of T. gondii restriction in these cells, we have withdrawn all conclusions relating to the putative role of C1QBP in parasite restriction. The data are included now in a supplementary figure only as a reference for other researchers working on the role of C1QBP in innate immunity who may use these previously published cell lines. We have made additional attempts to explore the link between ROP1 and C1QBP that have been unsuccessful, which are now mentioned in the discussion section. Although the co-immunoprecipitation of C1QBP with ROP1 is interesting, given the putative role of C1QBP in regulating autophagy and innate immune responses, further exploration of this potential interaction will require new tools and substantially more work that is beyond the scope of this manuscript. Further responses to comments raised by individual reviewers and details of further revisions are described below.

      Reviewer #1

      The authors identified ROP1 as a significant hit from their in vivo screen. However, they have not done any validation experiments using rop1 KO parasites in mice. Previous studies have shown no virulence defect in mice for rop1 KO in the type I background (PMID: 8719248). The result could be different in the type II strain used here, but this needs to be tested and shown.

      Soldati et al 1995 (PMID: 8719248) demonstrated that there was no virulence defect associated with ROP1 knockout in the RH strain. However, the RH strain is extremely virulent in most laboratory mice strains, which can mask phenotypes observed in a type II strain. We have now carried out in vivo infections of C57BL/6J mice using the type II Prugniaud strain, and have shown a severe virulence defect for PRUΔROP1 parasites which is rescued by complementation. This matches our in vivo CRISPR screen and in vitro IFNγ restriction assays results, and confirms that ROP1 is a virulence factor in vivo.

      The data in Fig. 2 and S3 do support that reduced parasitemia was due to decrease in number of vacuoles rather than their size or host cell death. However, it is important to control for invasion and/or egress differences of rop1 KO parasites in IFN-g activated cells.

      Soldati et al 1995 (PMID: 8719248) demonstrated that ROP1 has no role in invasion in HFFs, therefore it would be very surprising if ROP1 were to have a specific IFNγ-dependent role in invasion in macrophages. There is no involvement of rhoptry bulb-localised proteins in the predominant model of invasion, only rhoptry neck and microneme proteins.

      “Natural” egress after ~48 h would not occur here as the cells are fixed after 24 h. IFNγ-induced “early” egress has been documented in HFFs, A549s, and macrophages in vivo, and is apparent through increased host cell death/lysis (Niedelman et al 2013 PMID: 24042117, Rinkenberger et al 2021 PMID: 34871166, Tomita et al 2009 PMID: 19846885). We have now carried out prodium iodide uptake assays to more accurately quantify parasite-induced host cell death, and find no differences between strains at an MOI of 0.3, the targeted MOI we use in our restriction assays. At a higher MOI of 3, we find a moderate increase in host cell death in PRUΔROP1-infected BMDMs, which we suggest results from increased exposure of parasite-derived PAMPs to cytosolic sensors (Fisch et al 2019 PMID: 31268602, Zhao et al 2009 PMID: 19197351). Alternatively, this increased host cell death could result from increased rates of early egress, or from direct inhibition of programmed cell death pathways.

      It is important and informative to depict absolute parasite/size when making multiple comparisons. For example, the data in Fig. 4E shows C1QBP-/- MEFs can clear both RH and PRU better than the WT. However the authors do not comment on what is the meaning of > 100% parasite numbers in IFN-g treated MEFs with respect to untreated in WT. Since the data are normalized, it is difficult to appreciate what the actual differences are. Additionally, C1QBP-/- MEFs show close to equal survival in control and IFN-g treated condition (approximately 100%). Is it correct to infer that C1QBP has no effect on parasite survival? This should be considered in light of the comment below on colocalisation of C1QBP.

      It is standard practise to show IFNγ-dependent restriction as a percentage of unstimulated cells as this reflects the hypothesis being tested and the statistical tests carried out, as for example in Wang et al 2020 PMID: 33067458, Matta et al 2019 PMID: 31413201, Gay et al 2016 PMID: 27503074, Bando et al 2018 PMID: 30283439, Fleckenstein et al 2012 PMID: 22802726. Absolute numbers are included in the supplementary data for further reference.

      The >100% survival observed in the C1QBPflox/flox and C1QBP-/- MEFs is puzzling. We conclude that these cell lines have largely lost the ability to restrict T. gondii parasites as a result of the immortalisation process and/or passage history, and what little restriction we observe is at the limit of detection in our assay. MEFs are otherwise known to restrict both RH and PRU parasites (Niedelman et al 2012 PMID: 22761577). We included these data as we found statistically significant differences between the C1QBPflox/flox and C1QBP-/- MEFs which were dependent on the presence of ROP1 in the parasites. However, after the concerns raised by all three reviewers we agree that it is better to not to draw conclusions from these assays given the lack of parasite restriction. We will include these data only in the supplementary figures as a reference for other researchers working on the role of C1QBP in innate immunity who may use these previously published cell lines.

      The authors observed good restriction of both RH and PRU in IFN-g activated THP1s without cell death (Fig S1D). It is important to incorporate this information into the main result and discuss their implications in contrast to a previous report from 2019 (Fisch et al, PMID: 31268602).

      As mentioned above, we have carried out propidium iodide uptake assays to address questions regarding host cell death more precisely, which are now included as a main figure in the revised manuscript. While Fisch et al measured host cell death during infection of stimulated THP-1s at an MOI of 3, our restriction assays were carried out at a targeted MOI of 0.3. The Howard lab has shown that host cell death is directly proportional to MOI in BMDMs (Zhao et al 2009 PMID: 19197351), and we also find that at an MOI of 0.3 there is little detectable host cell death in BMDMs. While we are not aware of a similar study in THP-1s, it is likely also the case that at low MOIs there is little detectable cell death.

      The authors should consider conducting C1QBP functional assays to explore potential roles in parasite survival/growth within host cells. For example, it would be informative to measure the extent of autophagy or transcriptomic profile of the KO to deduce or suggest possible mechanisms of restriction.

      All reviewers have raised concerns regarding the restriction assays in the C1QBPflox/flox/C1QBP-/- MEFs, particularly that they do not appear to restrict parasite growth as would be expected. As a result, we have decided to withdraw conclusions from these assays regarding the role of C1QBP. For the same reason, we feel that further functional assays using these cells would be of limited value. As an alternative, we attempted siRNA-mediated knockdown of C1QBP in primary BMDMs using a pool of three commercial siRNAs, but were able to achieve only

      The authors state in their text that "C1QBP localised primarily to the mitochondria (Figure S7A) and therefore did not see any co-localisation with ROP1". The authors should discuss in more detail as this finding seems to contradict the interaction studies. Is there any independent evidence to corroborate the interaction studies and show they are not simply an in vitro artifact?

      We have now added immunofluorescence images of C1QBP and ROP1 in infected MEFs and HFFs to the main figure and discuss this in further detail. We find that C1QBP primary localises to the mitochondria, therefore there is some overlap with ROP1 signal at the PVM in RH as type I strains recruit mitochondria to the vacuole via MAF1B (Pernas et al 2014 PMID: 24781109, Adomako-Ankomah et al 2016 PMID: 26920761). However, type II strains do not recruit host mitochondria therefore we see little overlap with ROP1 in PRU. Furthermore, the precise localisation of C1QBP is a matter of some debate, such that it is unclear whether ROP1 and C1QBP are topologically able to interact. One study reported that C1QBP is exclusively localised to the mitochondrial matrix and therefore would not be able to interact with ROP1 even when the mitochondria are recruited to the vacuole (Muta et al 1997 PMID: 9305894). Others have asserted that there is an additional cytosolic pool of C1QBP which can be recruited to the outer membrane of the mitochondria, thus allowing interaction with ROP1 immediately following rhoptry secretion into the cytosol or at the PVM (Xu et al 2009 PMID: 19164550). From these IFAs we are therefore unable to draw any firm conclusions.

      We attempted proximity biotinylation to validate this potential interaction in cellulo, but C-terminal fusion of TurboID to ROP1 caused mislocalisation of the protein and prevented secretion of ROP1 to the parasitophorous vacuole. Based on the current evidence, we are unable to exclude that the interaction is an in vitro artefact as a result of cell lysis during the immunoprecipitation, and we clearly state this in the discussion. However, given this pulldown data is highly reproducible and technically sound, we believe it is important to include this result in the manuscript.

      The authors state in their text that "Enhanced restriction of Δrop1 parasites is primarily mediated through increased vacuole destruction". Their data is more suggestive of growth restriction. The authors should provide more direct data for destruction of the vacuoles or change the wording to indicate it is due to growth restriction.

      In the absence of specific results for vacuole destruction, which is technically challenging to determine quantitatively, we concluded by a process of elimination that restriction of ΔROP1 parasites at low MOI mostly likely occurs primarily through vacuole destruction, as we did not see strong evidence for vacuole size reduction and (now added in revision) do not see differences in parasite-induced host cell death. Reviewer #1 agrees in an earlier comment that vacuole destruction is the most likely mechanism. We note some subtle indications of strain- and host species-dependent differences, namely that RHΔROP1 parasites in THP-1 macrophages appear to have reduced vacuole size compared to RHΔUPRT (but not the complemented strain), but in all other cases the most likely mechanism consistent with our data is destruction of vacuoles. We have revised our conclusions to reflect that this is an inference from the data rather than a definitive finding.

      The authors should provide high resolution IFA images and decrease their size to an equivalent size of other sub-figures. Empty white space between sub-figures can be minimized. Font size of the figure labels/axis/titles should be matched and increased slightly.

      Thank you for these suggestions. Each IFA image is only 2x2 cm so we feel that decreasing their size further would make them difficult to see.

      Reviewer #2

      The title indicates a positive role for ROP1 in subverting innate immune restriction, but the data indicate that a deficiency in ROP1 causes susceptibility to innate immune restriction. This might seem like a subtle discord, but in the absence of identifying the mechanism of ROP1 subversion of innate immune restriction it seems more appropriate to provide a title that better reflects the findings i.e., that ROP1 deficient parasites are more susceptible to innate immune restriction.

      We recognise this point and have changed the title to reflect both this and new results added in revision: “Toxoplasma gondii virulence factor ROP1 reduces parasite susceptibility to murine and human innate immune restriction”

      The reference strains and complement strains lack UPRT, but from the available information it appears the KO strains have UPRT. If this is the case, it is necessary to rule out that the presence of UPRT doesn't render parasites more susceptible to IFNg mediated killing by performing additional experiments comparing RH∆ku80 with RH∆ku80∆uprt and Pru∆ku80 with Pru∆ku80∆uprt.

      This is correct, the reference and complemented strains lack UPRT while the ROP1 knockout strains have a functional UPRT. For the high-content imaging assay it was necessary to use a reference strain that expressed a fluorophore, as the knockout and complemented strains do, to allow for accurate segmentation of the parasites. We considered it preferable to insert the mCherry fluorophore at the well-established, non-essential UPRT locus rather than integrate it randomly. Complementation at the UPRT locus is widely used in the literature with no impact on virulence phenotypes: Shen et al 2014 (PMID: 24825012) Fig 4F, Wang et al 2020 (PMID: 33067458) Fig 5F, Wang et al 2020 (PMID: 31908049) Fig 5, Fox et al 2019 (PMID: 31266861) Fig 5, Olias et al 2016 (PMID: 27414498) Fig 6. Moreover, the genome-wide CRISPR knockout screen of Wang et al 2020 (PMID: 33067458) has also demonstrated that knockout of UPRT does not affect growth of RH parasites in IFNγ-stimulated vs. naive BMDMs. UPRT has 100% amino acid identity between RH and PRU so no functional differences are expected.

      Although it is understandable why the authors chose MEFs to test the role of C1QBP because MEFs were used for the Co-IP, the MEFs do not appear to be responding to IFNg for parasite growth restriction (-/+ IFNg % survival is at or above 100% for WT MEFs). As such, the authors are potentially blind to the role of C1QBP in the context of IFNg restriction. It would be ideal to repeat these experiments using BMDMs from WT and C1QBP KO mice to assess the potential contribution of C1QBP during IFNg restriction of T. gondii growth and survival.

      We agree that it would be preferable to use ΔC1QBP BMDMs for these experiments. However, knockout of C1QBP is lethal in mice (Yagi et al 2012, PMID: 22904065), therefore it is not possible to obtain primary ΔC1QBP BMDMs. We also attempted siRNA-mediated knockdown of C1QBP in wild-type primary BMDMs using a pool of three different siRNAs, but were only able to achieve

      Much of the data is analyzed with a paired two-sided t-test, but the authors used Bonferoni correction in some cases and Benjamini-Hochberg adjustment in other cases. It would be helpful to either consistently use the same correction or explain in a short section on stats in the methods the rationale for using different corrections.

      We have changed the manuscript to consistently use Benjamini-Hochberg correction for all tests. These correction methods represent different approaches to the multiple-testing problem: Bonferroni correction controls the family-wise error rate, while the Benjamini-Hochberg procedure controls the false-discovery rate. We favour the FDR-based approach as it is less dependent on the somewhat-arbitrary decision as to what constitutes a “family” of tests - for example: should tests in the RH and PRU strains be in the same family; should tests for number of parasites and number of vacuoles/host cells be in the same family; should the tests in BMDMs vs. THP-1s should be the same family. At the alpha level of 0.05, one in twenty significant results are false positives following Benjamini-Hochberg adjustment, as opposed to one in twenty of all tests carried out prior to adjustment. We find this appropriate for our study.

      The IFA images in Figure 2B appear to show considerable redistribution of ROP1 from the rhoptries to other parts of the parasite upon short Trition-X100 treatment of formaldehyde fixed samples. Inclusion of a low concentration of glutaraldehyde might help preserve the normal distribution of ROP1. Alternatively, or additionally, permeabilization with saponin or digitonin could help visualize ROP1 associated with the PVM. Improving the imaging is not critical to support the conclusions of the study, but would nevertheless be an asset.

      The ROP1 immunofluorescence staining within the parasites is likely from protein that is being trafficked through the ER or Golgi. This staining is more prominent with shorter Triton permeabilisation and in the complement lines. We have also observed PVM localisation of ROP1 using permeabilisation with 0.1% saponin for 15 min, but find that it is clearer and more consistent with the short Triton permeabilisation. We have added additional examples to the supplementary figure validating ROP1 knockout and complement.

      Reviewer #3

      The IFNg-dependent T. gondii control data as well as the Irgb6 recruitment data are extremely variable and preclude drawing any solid conclusions (Fig 2c-f, 3d, 4e-f, S3a-d, S8). The authors normalize the data by presenting the ratio of +IFNg/-IFNg in % of the measure they are analyzing. However, this does not "clean up" the data. The underlying cause for this problem are the extremely varied input, the time point analyzed and the nature of the microscopy experiment.

      We have found that parasite IFNγ-restriction assays are highly variable between biological replicates by all methods that we have tried. We thank Reviewer #1 for sharing their similar experience, as we have heard the same from several colleagues. As it becomes more common to plot individual data points rather than summary statistics, this is increasingly apparent in the literature (e.g. Wang et al 2020 Fig 2, Rinkenberger et al 2021). While we could select the three “best” replicates to publish or instead present only summary statistics to obscure the variation present, we do not feel that this would be in the best interest of the field or of open science. We address Reviewer #3’s specific concerns below.

      Presenting the data as a ratio/percentage of IFNγ-stimulated versus naive cells is not an attempt to “clean up” the data, it is standard practice in the field and reflects the hypothesis being tested. For example: Wang et al 2020 PMID: 33067458 (Saeij lab), Matta et al 2019 PMID: 31413201 (Sibley lab), Gay et al 2016 PMID: 27503074 (Hakimi lab), Bando et al 2018 PMID: 30283439 (Yamamoto lab), Fleckenstein et al 2012 PMID: 22802726 (Howard lab).

      - Varied input: looking at the Supplementary tables and calculating the input MOI in the -IFNg samples for the BMDM, for example, they range from 0.1-6. Most of the MOIs are in the 0.1-1 range, but this is still a huge variation and not the MOI of 0.3 the authors are aiming to add. Input host cells within one experiment often differ 5 fold between the T. gondii strains analyzed. Consistency in input is not achieved.

      The inputs of the restriction assays, as estimated by the parasite and host cell numbers in the unstimulated cells, are variable but not so extreme as Reviewer #3 says. Importantly, variability between strains within a biological replicate is far less than between biological replicates, and it is for this reason that we have analysed the results with paired tests. It is in fact a feature of microscopy-based assays that these sources of variability can be identified. We have attempted to be as open as possible with our data in reporting all of these measured parameters.

      MOI is the most appropriate measure of inter- and intra-replicate variability as it accounts for both host cell and vacuole number. We have now added estimated MOI as a column in the supplementary data files (S2B and S3B: MOI = “Vacuole - Minus IFNG Median”/”Host Nuclei - Minus IFNG Median”). Out of 49 data points, there are only three instances in the BMDM restriction assays where MOI >1 (1.26, 1.06, 1.42) and none in THP-1s. Because the MOIs are low, the range of MOI within each replicate is small. Furthermore, for MOI ≤1, we expect there will be little qualitative effect on restriction as the host cells will be infected with either 1 or 0 vacuoles. In our data, we do not find any correlation between MOI and restriction for any individual strain or for all strains combined.

      Regardless of the inter-replicate variability, the results for parasite restriction are statistically significant using an appropriate paired test. Our control parasite line PRUΔGRA12 behaves as expected (Fox et al 2019 PMID: 31266861) and differences between the RH and PRU strain in BMDMs dependent on the polymorphic effector ROP18 are apparent, although not tested here. While there is variation in the magnitude of restriction between replicates, there is a qualitative decrease in parasite survival in terms of total parasite number (Fig 2C and 2E) for ΔROP1 parasites compared to both ΔUPRT and the complemented lines in every replicate for both strains in BMDMs (7/7 replicates for each strain), and in all but one replicate in THP-1s (6/7 for RH, 7/7 for PRU). We consider that this restriction phenotype is highly robust.

      - Time point analyzed: the authors chose to analyze 24h p.i. IFNg-stimulated MEFs, BMDMs and THP-1s. All will have undergone a significant amount of cell death at this time point. The authors even present the varying host cell numbers in Fig S3: the +IFNg cell numbers vs -IFNg cell numbers in BMDMs range from 35-70% for example in RH; THP-1 seem slightly better but still there is a range of 70-120%.

      24 hpi or later is a standard timepoint at which to measure parasite restriction, as at earlier timepoints the dynamic range of the assays are reduced due to lower parasite numbers and therefore lower fluorescence, luminescence, or uracil uptake. For example, the following references all measure parasite restriction at at least 24 hpi: Wang et al 2020 Fig. 2: 24 hpi; Matta et al 2019 Fig. 3: 36-40 hpi; Gay et al 2016 Fig. 8: 36-48 hpi; Bando et al 2018: 24 hpi; Fleckenstein et al 2012 Fig. 4: 48 hpi.

      The Howard lab has shown that host cell death is directly proportional to MOI in murine cells (Zhao et al 2009 PMID: 19197351, Lilue et al 2013 PMID: 24175088). At low MOIs of

      - Nature of the experiment: Due to rampant host cell death at 24h p.i. analyzing the total parasite or vacuole number or even vacuole size is difficult in a microscopy experiment (dead cells will wash away even after fixation as they do not adhere anymore). Most laboratories (Howard, Yamamoto, Coers, Saeij, Steinfeldt etc), who study these mechanisms employ experimental systems that do not rely on washes/perturbations (plaque assays, plate reader T. gondii luciferase growth assays, uracil incorporation etc), or focus on the "number of parasites/vacuole" measure that are less dependent on host cell numbers (Saeij, Coers etc) or choose host cells that do not undergo IFNg-driven cell death (A549 cells, Sibley recent elife; only overexpression of the host factor induces cell death).

      High-content imaging has previously been used to characterise IFNγ-dependent restriction of T. gondii (Gay et al 2016 PMID: 27503074 Fig 8D, Rinkenberger et al 2021 PMID: 34871166, Fisch et al 2021 PMID: 34931666) and for anti-parasitic drug screening (Touquet et al 2018 PMID: 30157171). Certainly different methods have different strengths and weaknesses: for example, fixed imaging-based methods required washing which will remove dead host cells from the analysis, whereas e.g. luciferase or uracil incorporation assays do not; however, with imaging we can measure multiple potential phenotypes simultaneously and identify potential confounding factors that would be unknown in other assays. Acknowledging that every assay has specific weaknesses, we verify the role of ROP1 by orthogonal methods: in vivo CRISPR screen for growth in the mouse peritoneum, in vitro growth in IFNγ-stimulated macrophages, and (now added in revision) in vivo virulence studies. The results of all of these assays concur.

      If ROP1 alters the amount of host cell death induced in macrophages, host cell numbers in the +IFNg samples will vary in dependency of ROP1. Hence, in order to assess ROP1-dependent T. gondii restriction it is imperative to know if ROP1 alters host cell death. This can only be measured with cell death assays (total LDH or better a kinetic cell death assay) and not by looking at how many host cells remain after fixation in a microscopy experiment. To compare ROP1-driven parasite restriction, it is important to compare the same input MOI for each strain (-IFNg), determine a time point p.i. before host cell death has taken over and verify host cell numbers stay within a reasonable variation between strains. Input MOI consistency between T. gondii strains is typically measured by plaque viability assay.

      We have now carried out kinetic propidium iodide uptake assays to determine formally whether ROP1 affects host cell death in BMDMs, which have been added to the revised manuscript. At an MOI of 0.3, as targeted in our restriction assays, there is very little parasite-induced cell death and no significant differences between parasite strains. This is in line with the results from the Howard lab mentioned above (Zhao et al 2009 PMID: 19197351, Lilue et al 2013 PMID: 24175088), and confirms that the majority of host cell loss observed in the high-content imaging assays at an average MOI of 0.3 is not parasite-induced.

      We also carried out propidium iodide uptake assays at an MOI of 3, and found a slight but significant increase in host cell death for PRUΔROP1 in BMDMs of 10-15%. A small increase was also observed for RHΔROP1 compared to the complemented line, but was not significant compared to RHΔUPRT. This differential effect at low and high MOI is interesting, although we cannot rule out that a small phenotype is beyond the limit of detection at low MOI. We conclude that at low MOI host cell death does not have a detectable role in the restriction of ΔROP1 parasites, leaving the most likely mechanism as disruption of vacuoles. At high MOI, vacuole breakage appears to result in host cell death as host cytosolic sensors recognise parasite-derived PAMPs: parasite DNA is sensed by AIM2 in activated THP-1 macrophages leading to apoptosis (Fisch et al 2019 PMID: 31268602), while in murine cells vacuole breakage by the IRGs leads to necrotic cell death (Zhao et al 2009 PMID: 19197351). Increased cell death at high MOI is therefore also consistent with increased disruption of vacuoles.

      Fig 2C - Why are there datapoints with more than 100% T. gondii in +IFNg vs -IFNg samples? In these cases, no IFNg restriction was observed for the WT strain. This data is not reliable, as it has been demonstrated before that BMDMs control RH in an IFNg-dependent fashion.

      This is one replicate out of seven. Although removing this replicate would not affect the statistical significance of the results in terms of total parasite number or number of vacuoles, we do not consider it appropriate to remove data because it does not match our expectations. Published data also show survival of RH parasites in BMDMs at or close to 100% in some replicates as a result of biological and technical variability (e.g. Wang et al 2020 PMID: 33067458, Fig. 2), which is to be expected in primary cells derived from different donor mice. The average survival we determine for RH across replicates in ~75%, which is in line with published data.

      Fig 4C - The description of the IP in the legends and the materials is incomplete. In the materials it only states the loading of the IP fraction. Is the supernatant the post-IP fraction? 202200-HA is not described in the Figure legend. Why is the IP'ed version of C1QBP smaller than the supernatant version?

      We will clarify this in the figure legend and methods. The supernatant is the post-IP fraction. It was necessary to load as much material as possible to detect C1QBP in the post-IP supernatant of the RHΔKU80 and PRUΔKU80-infected samples, but as the protein concentration in the supernatant is far higher than in the IP fraction this appears to have caused slight retardation of protein migration in these lanes. This is only apparent for C1QBP as it is the only protein detected in both supernatant and IP.

      Fig 4E and F - There is almost no IFNg-dependent restriction in the WT MEFs for any of the parasite strains (4E). Some of the data even shows a dramatic increase of parasite load with IFNg versus without IFNg. Hence, no conclusion can be drawn about the function of C1QBP and making a ratio of KO vs WT (Fig 4F) is not justified as there was no IFNg restriction to begin with in WT cells.

      We included these data as we found statistically significant differences between the C1QBPflox/flox and C1QBP-/- MEFs that were dependent on the presence of ROP1 in the parasites. However, after the concerns raised by all three reviewers we agree that it is better to not to draw conclusions from these assays given the lack of parasite restriction that would otherwise be expected in MEFs (e.g. Niedelman et al 2012 PMID: 22761577). We will include these data only in the supplementary figures as a reference for other researchers working on the role of C1QBP in innate immunity who may use these previously published cell lines.

      Fig 3D - Something is wrong with the Irgb6 recruitment data. Many labs have shown that RH recruitment of Irgb6 is at most 10% due to the activity of ROP5 and ROP18. The authors get up to 40% recruitment for RH and again the data ranges from 5-42%, that's a huge variation!

      Recruitment of IRGB6 was determined by the ratio of the median fluorescence intensity in a 6 pixel radius around the parasites versus the median intensity in the rest of the infected cell cytoplasm. After carefully checking each step of this analysis pipeline, we found that the threshold for host cell cytoplasm segmentation based on CellMask staining was set too low and was including areas of empty space. This resulted in erroneously low median intensity in the host cell and so false-positive calling of IRGB6 recruitment to some vacuoles. We have corrected the cut-off threshold for host cell segmentation and reanalysed all the data with this corrected script. We now find average recruitment of ~5% for RHΔUPRT, ~15% for RHΔROP18, and ~60% for PRUΔUPRT (comparable to e.g. Fentress et al 2011, Fleckenstein et al 2012, Niedelman et al 2012, and Etheridge et al 2014), as well as less variability overall.

      Fig S3A and C: The authors conclude that ROP1 does not affect vacuole size. I disagree. The data for THP-1 is significant. However, due to the noisy input with such varied MOI and varied host cell numbers, at the moment, no solid conclusion can be drawn.

      Although there is a significant difference between RHΔUPRT and RHΔROP1, this is not rescued by complementation so we are hesitant to claim this as a phenotype of ROP1. There are no significant differences in the PRU strain. We highlight this in the text of the results section, as it may be an important strain- or host species-dependent mechanism.

      Fig S3B and D: The data clearly show huge variation in host cell numbers depending on IFNg and T. gondii infection. It is well-known that host cell death occurs in these experimental systems. In order to analyze whether ROP1 impacts host cell death a kinetic cell death assay is needed. Assessment of remaining host cell numbers in a 96 well plate microscopy experiment is not a quantitative assessment of host cell death, so the conclusion is not valid.

      Please refer to comments on host cell death above.

      T. gondii "type" is always spelt with a lowercase "t" by convention.

      We have corrected this.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors identify 22 putative T. gondii virulence factors in an in vivo CRISPR screen and aim to analyze ROP1 as one of the hits for its contribution and mechanism in IFNg-driven parasite control mostly in murine and human macrophages. Somewhat disappointingly, most of the data are too variable to draw conclusions. The underlying cause for the variability in the data are the divergent input host cell numbers within the same experiment, variable MOI and most importantly, probably the loss of host cells due to cell death (the infected ones wash away more easily). The authors try to remedy this by pooling and normalizing data, and while there may be a trend to ROP1 counteracting IFNg-driven T. gondii control (Fig 2), the result overall is not convincing. Even if more robust data would confirm the role of ROP1 in innate immune evasion, all mechanistic "tries" the authors undertake in this study show no phenotype (Fig 3 and S6) or are unreliable (Fig 4, see comments below).

      Major concern:

      The IFNg-dependent T. gondii control data as well as the Irgb6 recruitment data are extremely variable and preclude drawing any solid conclusions (Fig 2c-f, 3d, 4e-f, S3a-d, S8). The authors normalize the data by presenting the ratio of +IFNg/-IFNg in % of the measure they are analyzing. However, this does not "clean up" the data. The underlying cause for this problem are the extremely varied input, the time point analyzed and the nature of the microscopy experiment.

      • Varied input: looking at the Supplementary tables and calculating the input MOI in the -IFNg samples for the BMDM, for example, they range from 0.1-6. Most of the MOIs are in the 0.1-1 range, but this is still a huge variation and not the MOI of 0.3 the authors are aiming to add. Input host cells within one experiment often differ 5 fold between the T. gondii strains analyzed. Consistency in input is not achieved.
      • Time point analyzed: the authors chose to analyze 24h p.i. IFNg-stimulated MEFs, BMDMs and THP-1s. All will have undergone a significant amount of cell death at this time point. The authors even present the varying host cell numbers in Fig S3: the +IFNg cell numbers vs -IFNg cell numbers in BMDMs range from 35-70% for example in RH; THP-1 seem slightly better but still there is a range of 70-120%.
      • Nature of the experiment: Due to rampant host cell death at 24h p.i. analyzing the total parasite or vacuole number or even vacuole size is difficult in a microscopy experiment (dead cells will wash away even after fixation as they do not adhere anymore). Most laboratories (Howard, Yamamoto, Coers, Saeij, Steinfeldt etc), who study these mechanisms employ experimental systems that do not rely on washes/perturbations (plaque assays, plate reader T. gondii luciferase growth assays, uracil incorporation etc), or focus on the "number of parasites/vacuole" measure that are less dependent on host cell numbers (Saeij, Coers etc) or choose host cells that do not undergo IFNg-driven cell death (A549 cells, Sibley recent elife; only overexpression of the host factor induces cell death). If ROP1 alters the amount of host cell death induced in macrophages, host cell numbers in the +IFNg samples will vary in dependency of ROP1. Hence, in order to assess ROP1-dependent T. gondii restriction it is imperative to know if ROP1 alters host cell death. This can only be measured with cell death assays (total LDH or better a kinetic cell death assay) and not by looking at how many host cells remain after fixation in a microscopy experiment. To compare ROP1-driven parasite restriction, it is important to compare the same input MOI for each strain (-IFNg), determine a time point p.i. before host cell death has taken over and verify host cell numbers stay within a reasonable variation between strains. Input MOI consistency between T. gondii strains is typically measured by plaque viability assay.

      Other specific major comments:

      Fig 2C - Why are there datapoints with more than 100% T. gondii in +IFNg vs -IFNg samples? In these cases, no IFNg restriction was observed for the WT strain. This data is not reliable, as it has been demonstrated before that BMDMs control RH in an IFNg-dependent fashion.

      Fig 4C - The description of the IP in the legends and the materials is incomplete. In the materials it only states the loading of the IP fraction. Is the supernatant the post-IP fraction? 202200-HA is not described in the Figure legend. Why is the IP'ed version of C1QBP smaller than the supernatant version?

      Fig 4E and F - There is almost no IFNg-dependent restriction in the WT MEFs for any of the parasite strains (4E). Some of the data even shows a dramatic increase of parasite load with IFNg versus without IFNg. Hence, no conclusion can be drawn about the function of C1QBP and making a ratio of KO vs WT (Fig 4F) is not justified as there was no IFNg restriction to begin with in WT cells.

      Fig 3D - Something is wrong with the Irgb6 recruitment data. Many labs have shown that RH recruitment of Irgb6 is at most 10% due to the activity of ROP5 and ROP18. The authors get up to 40% recruitment for RH and again the data ranges from 5-42%, that's a huge variation!

      Fig S3A and C: The authors conclude that ROP1 does not affect vacuole size. I disagree. The data for THP-1 is significant. However, due to the noisy input with such varied MOI and varied host cell numbers, at the moment, no solid conclusion can be drawn.

      Fig S3B and D: The data clearly show huge variation in host cell numbers depending on IFNg and T. gondii infection. It is well-known that host cell death occurs in these experimental systems. In order to analyze whether ROP1 impacts host cell death a kinetic cell death assay is needed. Assessment of remaining host cell numbers in a 96 well plate microscopy experiment is not a quantitative assessment of host cell death, so the conclusion is not valid.

      Minor comments:

      T. gondii "type" is always spelt with a lower case "t" by convention.

      Significance

      T. gondii protein ROP1 was identified by the authors in an in vivo CRISPR screen as a potential effector protein mediating parasite resistance to innate immunity. The function of ROP1 is currently unknown. The data presented in this manuscript does not deliver conclusive evidence that ROP1 counteracts IFNg-driven immunity and a potential mechanism has not been uncovered.

      Referees cross-commenting

      This section contains comments of all reviewers

      Reviewer 1

      I agree with the comments of review #2 in regard to additional experiments needed for validation. I also agree with reviewer #3 about the variable data with Irgb6 recruitment and IFNg control- we have also done these assays and struggled with the variability. The suggestions for how to reduce variability with modified protocols are good - but is our role really to instruct them how to do the experiments? It seems you are asking them to start over, and I am not sure the situation is that bleak. I would be more inclined to allow them to claim only features that are clearly supported within the variability of the current assays. Perhaps that weakens the conclusions they can make and hence ultimate decision - but I feel that this choice (of fixing it or compromising) should be up to the authors.

      Reviewer 3

      Thanks for your points, yes I agree, statements can be made about the role of ROP1 in innate immune defence (i.e. it probably contributes to control of vacuole numbers and size in both mouse and human), but the authors should more carefully place them in the framework of their assays.

      Rev 2, point 1 is very good (title suggestion). Rev 1, point 1, I agree very much (validate ROP1 phenotype in vivo). Rev 2, minor point 1 (explain your stats) and Rev 1, point 3 (depict absolute numbers in comparisons). This is my point also, the authors try to normalise the data and apply varied statistics to smooth over the variability. All reviewers pointed out the problems with the C1QBP data, specifically the >100% data of +IFNg vs -IFNg and additionally the potential for an in vitro artefact (Rev 1).

      My main points remain: - statements on ROP1 and its impact on host cell death without conducting any host cell death assay cannot be made. - the Irgb6 recruitment data is puzzling and contradicts all the many published data on Irgb6 recruitment levels. - C1QBP data is not interpretable with data where IFNg does not restriction WT parasites.

      In the end, maybe suggesting in vivo validation of ROP1, remove impact on host cell death claim and the C1QBP data (or conduct further experiments to understand role)?

      Reviewer 1

      Yes I agree with the points and this slightly reworded final assessment:

      Request in vivo validation of ROP1 Clarify discrepancies in Irgb6 data Remove impact on host cell death claim Soften the conclusions about C1QBP data (or conduct further experiments to understand its role).

      Reviewer 2

      Thanks for leading the discussion R1 and R3. It looks like we are in good agreement. The study has some merit but will require more in vitro and in vivo experiments to support the conclusions and to substantiate the relationship between ROP1 and C1QBP.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Toxoplasma gondii secretes a variety of effector proteins that manipulate many aspects of host cell physiology including several that facilitate evasion of cell autonomous immunity due to activation by interferon gamma. The current study features a new CRISPR gRNA library targeting genes encoding proteins that are released from two types of secretory organelles that have previously been strongly implicated in immune evasion, namely rhoptries and dense granules. The authors use the library to identify parasite genes that are more important to in vivo infection than they are suring in vitro culture. That the screen identified several known immune evasion genes suggests that the experiment worked as intended. Among the somewhat marginal hits, the authors identified ROP1, which they pursued further since the function of this protein has remained unknown despite being the first identified ROP protein. As rigorous component of the study, the authors disrupt and restore expression of ROP1 in two different strains (RH and Pru) and identify ROP1 knockout parasite susceptibility to IFNg stimulation in two different types of macrophages (murine BMDMs and human THP-1). Two functional assays were used to indicate that rhoptry secretion is not impaired in parasites lacking ROP1. Co-immunoprecipitation assays suggest an interaction between ROP1 and a multifunctional host protein, C1QBP, inspiring the hypothesis that ROP1 targets C1QBP to evade cell autonomous immunity. However, host cells lacking C1QBP appear to be more restrictive to growth of WT parasites and neutral to ROP1 deficient parasites, implying that C1QBP contributes to T. gondii survival in a manner that is dependent on ROP1. The authors are appropriately cautious when discussing the extent to which the function of ROP1 is mechanistically linked to C1QBP. Overall, the study provides some new evidence for a role of ROP1 in limiting IFNg dependent clearance of T. gondii and identifies some clues to potential mechanism without being able to nail down this aspect.

      Main comments

      1. The title indicates a positive role for ROP1 in subverting innate immune restriction, but the data indicate that a deficiency in ROP1 causes susceptibility to innate immune restriction. This might seem like a subtle discord, but in the absence of identifying the mechanism of ROP1 subversion of innate immune restriction it seems more appropriate to provide a title the better reflects the findings i.e., that ROP1 deficient parasites are more susceptible to innate immune restriction.
      2. The reference strains and complement strains lack UPRT, but from the available information it appears the KO strains have UPRT. If this is the case, it is necessary to rule out that the presence of UPRT doesn't render parasites more susceptible to IFNg mediated killing by performing additional experiments comparing RH∆ku80 with RH∆ku80∆uprt and Pru∆ku80 with Pru∆ku80∆uprt.
      3. Although it is understandable why the authors chose MEFs to test the role of C1QBP because MEFs were used for the Co-IP, the MEFs do not appear to be responding to IFNg for parasite growth restriction (-/+ IFNg % survival is at or above 100% for WT MEFs). As such, the authors are potentially blind to the role of C1QBP in the context of IFNg restriction. It would be ideal to repeat these experiments using BMDMs from WT and C1QBP KO mice to assess the potential contribution of C1QBP during IFNg restriction of T. gondii growth and survival.

      Minor comments

      1. Much of the data is analyzed with a paired two-sided t-test, but the authors used Bonferonni correction in some cases and Benjamini-Hochberg adjustment in other cases. It would be helpful to either consistently use the same correction or explain in a short section on stats in the methods the rationale for using different corrections.
      2. The IFA images in Figure 2B appear to show considerable redistribution of ROP1 from the rhoptries to other parts of the parasite upon short Trition-X100 treatment of formaldehyde fixed samples. Inclusion of a low concentration of glutaraldehyde might help preserve the normal distribution of ROP1. Alternatively, or additionally, permeabilization with saponin or digitonin could help visualize ROP1 associated with the PVM. Improving the imaging is not critical to support the conclusions of the study, but would nevertheless be an asset.

      Significance

      Immune evasion is critical to the survival of pathogens including Toxoplasma gondii, yet the effector proteins responsible for such evasion remain incompletely identified or understood. This study identifies a role for ROP1 in parasite survival in interferon gamma stimulated host cells, thus addressing a long standing question of this protein's function during infection. The study appears suited to a microbiology or infection themed journal. My areas of expertise are T. gondii pathogenesis, virulence, secretion, invasion, egress, resource acquisition, and persistence.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Butterworth et al identify a novel function of Toxoplasma ROP1 being involved in subverting host IFN-g restriction of parasite growth. The authors executed a CRISPR-Cas9 sgRNA screen in a type II strain parasite targeting a sub-pool of genes characterized with respect to their localization in rhoptries and dense-granules. They adopted an approach of sequentially sequencing the survivors following growth in vitro in human foreskin fibroblasts followed by in vivo expansion in mouse peritoneum. Measurement of relative depletion of sgRNA sequences produced significant hits of factors involved in parasite survival. They identified many parasite proteins previously known to be involved in different aspects of parasite virulence including those involved in countering murine IFN-g mediated anti-toxoplasma response. They then focused on identifying the function of ROP1 in augmenting virulence and survival of the parasite in mice. Lastly, using immunoprecipitation in mouse cells they identified host C1QBP as a protein that may be involved in facilitating ROP1 mediated parasite resistance against the IFN-g response.

      Major Comments:

      1. The authors identified ROP1 as a significant hit from their in vivo screen. However, they have not done any validation experiments using rop1 KO parasites in mice. Previous studies have shown no virulence defect in mice for rop1 KO in the type I background (PMID: 8719248). The result could be different in the type II strain used here, but this needs to be tested and shown.
      2. The data in Fig. 2 and S3 do support that reduced parasitemia was due to decrease in number of vacuoles rather than their size or host cell death. However, it is important to control for invasion and/or egress differences of rop1 KO parasites in IFN-g activated cells.
      3. It is important and informative to depict absolute parasite/size when making multiple comparisons. For example, the data in Fig. 4E shows C1QBP-/- MEFs can clear both RH and PRU better than the WT. However the authors do not comment on what is the meaning of > 100% parasite numbers in IFN-g treated MEFs with respect to untreated in WT. Since the data re normalized, it is difficult o appreciate what the actual differences are. Additionally, C1QBP-/- MEFs show close to equal survival in control and IFN-g treated condition (approximately 100%). Is it correct to infer that C1QBP has no effect on parasite survival? This should be considered in light of the comment below on colocalisation of C1BQ.
      4. The authors observed good restriction of both RH and PRU in IFN-g activated THP1s without cell death (Fig S1D). It is important to incorporate this information into the main result and discuss their implications in contrast to a previous report from 2019 (Fisch et al, PMID: 31268602).
      5. The authors should consider conducting C1QBP functional assays to explore potential roles in parasite survival/growth within host cells. For example, it would be informative to measure the extent of autophagy or transcriptomic profile of the KO to deduce or suggest possible mechanism of restriction.

      Minor Comments:

      1. The authors state in their text that "C1QBP localised primarily to the mitochondria (Figure S7A) and therefore did not see any co-localisation with ROP1". The authors should discuss in more detail as this finding seems to contradict the interaction studies. Is there any independent evidence to corroborate the interaction studies and show they are not simply an in vitro artifact?
      2. The authors state in their text that "Enhanced restriction of Δrop1 parasites is primarily mediated through increased vacuole destruction". Their data is more suggestive of growth restriction. The authors should provide more direct data for destruction of the vacuoles or change the wording to indicate it is due to growth restriction.
      3. The authors should provide high resolution IFA images and decrease their size to an equivalent size of other sub-figures. Empty white space between sub-figures can be minimized. Font size of the figure labels/axis/titles should be matched and increased slightly.

      Significance

      The work has significance with respect to host-parasite interaction in the Toxoplasma field. The report aims to assign a function to the first identified rhoptry protein in Toxoplasma. It employs CRISPR-Cas9 screens in Toxoplasma to study and identify the function of proteins involved in parasite virulence. The study may be of less interest to groups working in the field of host-pathogen interaction, innate immunity, as the genes studied here are not widely conserved nor do they provide obvious parallels to other systems.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): Major comments: 1. The authors mentioned that the heart dysfunction observed upon CHCHD3/6 KD may be mediated via defects in ATP synthase. Then, how does CHCHD3/6 KD affect ATP synthase? Additionally, OPA1 also affects ATP synthase, why does OPA1 KD just reduce fractional shortening (S.T.2) without reducing F-actin staining?

      • Since MICOS is in involved ETC assembly/sorting in cristae, ATPase subunits may not be assembled correctly and thus causing defects mitochondrial morphology and function, which is further supported by reduced mito-GFP staining (see Fig. 4B&H) (see discussion, lines 438-41 in red; see also lines 434-438 and 441-44 that are unchanged). F-actin staining with OPA1 KD: it may not be a strong enough KD to cause reduced F-actin staining, since only strong CHCHD3/6 KD shows reduced F-actin.
      • To address this issue and ask whether OPA1 interacts with CHCHD3/6 in affecting sarcomeric protein levels, we plan to do an interaction experiment with and OPA1 KD with CHD3/6-C1 KD at 21oC (as in Fig. 6B) and probe for reduced FS; and as well at 25oC and probe for reduced F-actin staining (as with beta-Spectrin KD in Fig. 6D and with Sam50 in Fig. 5D).
      1. It has been reported that CHCHD3 KD in HeLa cells causes fragmented mitochondria, so how does CHCHD3/6 KD caused mitochondrial aggregation? What is the mechanism?
      • Thank you for pointing that out. It is actually more appropriate to call this phenotype “fission-fusion defects”. See lines 279-86, 416.
      • To address this issue further, we propose to do co-KD or co-overexpression (OE) of Drp1 with KD of CHCHD3/6 (as above). Specifically, we will use the strong CHCHD3/6-RNAiA to in conjunction with Drp-1 OE (in the presence of mito-GFP) and see if this can rescue the mitochondrial morphology defects, thus concluding that CHCHD3/6 KD is likely to causes aggregation that is normalized with Drp-1 OE. If this is not the case, but a parallel experiment with Drp-1 KD can rescue, this would support the conclusion that CHCHD3/6 causes increased fragmentation that is counteracted with Drp-1 KD. A complementary experiment would be to use a CHCHD3/6 sensitizer (as in Fig. 5&6). These experiments are intended to address the question whether CHCHD3/6 causes primarily fusion or fission defects.
      1. The ultrastructure of mitochondria (especially aggregated mitochondria) in control and CHCHD3/6 KD heart of drosophila should be analyzed by TEM.
      • That’s difficult, since cardiac tissue is so thin. It is not clear of fission vs fusion defects can easily be distinguished. Instead, we propose to do genetic interaction experiments (see above 2.).

      Reviewer #1 (Significance (Required)): The manuscript partially illustrate the relationship between MICOS complex with Hypoplastic left heart syndrome (HLHS), which is interesting to the reader.

      • We thank the reviewer for his/her appreciation of our study

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): This study performed whole genome sequencing (WGS) on a large cohort of hypoplastic left heart syndrome (HLHS) patients and their families to identify candidate. Nine candidate genes with rare, predicted damaging homozygous variants were identified. Of the candidate HLHS gene homologs tested, cardiac-specific knockdown (KD) of the mitochondrial contact site and cristae organization system (MICOS) complex subunit dCHCHD3/6 resulted in drastically compromised heart contractility, diminished levels of sarcomeric actin and myosin, reduced cardiac ATP levels, and mitochondrial fission-fusion defects. These heart defects were similar to those inflicted by cardiac KD of ATP synthase subunits of the electron transport chain (ETC), consistent with the MICOS complex's role in maintaining cristae morphology and ETC complex assembly. Analysis of 183 genomes of HLHS patient-parent trios revealed five additional HLHS probands with rare, predicted damaging variants in CHCHD3 or CHCHD6. Hypothesizing an oligogenic basis for HLHS, the authors tested 60 additional prioritized candidate genes in these cases for genetic interactions with CHCHD3/6 in sensitized fly hearts. Moderate KD of CHCHD3/6 in combination with Cdk12 (activator of RNA polymerase II), RNF149 (E3 ubiquitin ligase), or SPTBN1 (scaffolding protein) caused synergistic heart defects, suggesting the potential involvement of a diverse set of pathways in HLHS. General Comments: The authors performed an elegant series of experiments that implicate variants of dCHCHD3/6 in HLHS patients as contributing to mitochondrial and sarcomeric defects and contractile function defects. Demonstrating in Drosphilia the functional and biochemical implications of knocking out dCHCHD3/6 provides some potentially important insights into the functional and biochemical implications of dCHCHD3/6 variants in HLHS patients. The data is also complemented by hiPSC-CM studies in which knockdown of CHCHD6 and CHCHD3 showed similar alterations in ATP synthase and mitochondrial morphology. The authors nicely show that knock down of the subunit dCHCHD3/6 resulted in drastically compromised heart contractility, diminished levels of sarcomeric actin and myosin, reduced cardiac ATP levels, and mitochondrial fission-fusion defects in the Drosphilia. What is not clear is how these changes mirror the phenotype of HLHS in humans. It would helpful to speculate to a greater extent as to how these changes would manifest as a decreased left ventricular development in HLHS.

      • This is indeed a very important question we comment on in the discussion (see text revision, lines 415-22 in red; see also lines 402-415 and 423-33 that are unchanged). We want to stress that we focus on genetic interactions in heart development, not on convergent endpoint phenotypes between flies and humans. However, our studies do support the idea that mitochondrial defects could contribute to HLHS. We show that MICOS deficiency causes mitochondrial defects manifest in diminished ATP production in addition to diminished sarcomeric actin and myosin causing diminished contractility. Impaired contractility during development has previously been proposed to contribute to defective human cardiac growth (no flow – no growth, Goldberg and Rychik, 2016; Grossfeld et al., 2019), thereby compounding the potentially polygenic effects from damaging gene variants.
      • Why there would a preferential effect on the left ventricle is another interesting question. We speculate that some of the patient-specific variants are in genes preferentially affecting the left ventricle thus preferentially affecting its growth, thus affecting its contractility, then again compounded by impaired blood flow feeding back to diminishing growth. Specific Comments:

        Line 139: Figure 1A does not show echos from the siblings.

      • We apologize that the “(Figure 1A)” was in wrong position (after echocardiograms), causing confusion. We moved it to the previous sentence (line 138). In case the reviewers require that echocardiograms are shown as supplemental data, we can provide these.

        Line155: This table is listed as "Table 1" not Supplemental Table 1.

      • We apologize for mislabeling. This table is now listed as Supplementary Table 1.

        Reviewer #2 (Significance (Required)): This is a highly significant study. The main audience would be pediatric cardiologists and geneticists.

      • We thank the reviewer for his/her appreciation of our study