5,256 Matching Annotations
  1. Jul 2022
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Autophagy of the endoplasmic reticulum (ER-phagy) is a fundamental process that is essential for maintaining cellular homeostasis and quality control. We recently identified a novel mechanism regulating ER-phagy in both plants and animals that is based on the ubiquitin-like protein modifiers ATG8 and UFM1, and the ER-associated protein, C53. Here, we use a combination of evolutionary, biochemical, and physiological experiments to investigate the evolution and regulation of this process. We reveal the dynamic evolution of UFM1 and the ubiquity of C53-mediated autophagy across eukaryotes. Leveraging these results, we then identify an ancestral molecular toggle switch, mediated by shuffled ATG8-interacting motifs (sAIMs), that controls C53-mediated autophagy through competitive binding between UFM1 and ATG8. These findings provide new insights into the evolution of UFM1, reveal a conserved mechanism for the regulation of ER-phagy, and raise new and exciting hypotheses about the diversity and function of the UFMylation pathway. We believe that this work will be of interest to those studying autophagy and cellular stress response but will also serve as an interesting example of the benefits of combining evolutionary analyses with biochemical and cellular experiments.

      Our manuscript has been reviewed by three reviewers through ReviewCommons, whose comments, and our responses, can be found below. Two of the reviewers (Reviewer 1 and 3) were supportive of our work and its significance whereas Reviewer 2 questioned the novelty of our findings.

      Each of the reviewers’ comments can be addressed through a few supporting experiments as well as an improved manuscript which clarifies the novelty and significance of our results. While being supportive of our work, Reviewer 1 requested minor additional experiments to support our mechanistic conclusions and Reviewer 3 suggested that we expand our characterizations of C53 function to additional eukaryotic supergroups. These experiments are straightforward to perform, the materials and protocols to accomplish them are already established, and our overall conclusions are robust to the resulting outcomes.

      In contrast, Reviewer 2 did not suggest any additional experiments but rather challenged the novelty of our results as well as some of our interpretations. In particular, Reviewer 2 was uncertain of how our phylogenomic analyses built upon a previous study, published in 2014, which used comparative genomics to identify ubiquitin-related machinery across eukaryotes. Although it was an oversight to not reference this study (we cited a more recent article showing the same results), we were aware of their conclusions that UFMylation was present in the last eukaryotic common ancestor but absent in Fungi. We now clearly outline, both below and within the manuscript, our key phylogenomic results. These were acquired after implementing more advanced and comprehensive comparative genomic searches which allowed us to identify dynamic patterns in UFMylation evolution and permitted co-evolutionary analyses which were not only important for informing our experimental hypotheses but generated new functional questions. Our phylogenomic analyses are also linked to biochemical and physiological data, providing, for the first time, experimental support for our conclusions regarding UFMylation evolution. Similarly, Reviewer 2 suggested that our mechanistic results were an incremental extension of our previous work. Although our current work does of course build on our initial identification of C53-mediated autophagy, this manuscript provides novel insights into the importance and function of this process by revealing its ubiquity across eukaryotes and by characterizing the mechanistic details of its regulation. Ultimately, we disagree with Reviewer 2 but appreciate that this misunderstanding likely resulted from a lack of context and clarity in our manuscript which we have now resolved.

      As outlined in detail below, we will address the reviewers concerns through additional experiments, analyses, and improvements to the text.

      Thank you for considering our manuscript. We look forward to hearing from you.

      Description of the planned revisions

      We thank the reviewers for carefully evaluating our manuscript and for providing us with an opportunity to respond to their suggestions and criticisms. As you can see below in our pointby-point response, we address each of the points raised by the reviewers through the addition of supporting experiments, analyses, and an improved text. Altogether, we think these additional experiments and textual changes will significantly improve the manuscript. Therefore, we would like to thank all the reviewers and editors for their time and input.

      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript Picchianti et al. provide novel insights into the interaction of C53 with UFM1 and ATG8. Initially, the authors show that protein modification by UFM1 exists in the unicellular organism Chlamydomonas reinhardtii. To that end they demonstrated that pure Chlamydomonas UBA5, UFC1 and UFM1 proteins, can charge UFC1. Then, they showed that C53 interacts with ATG8 and UFM1. Specifically, they found that the sAIM are essential for the interaction with UFM1, while substituting this motif with canonical AIM prevents the binding of UFM1 but not of ATG8. Since binding of C53 to ATG8 recruits the autophagy machinery, the authors suggest that ufmylation of RPL26 releases UFM1 from C53 which allows the binding of ATG8. Overall, the authors demonstrate that C53 that forms a complex with UFL1 connects between protein ufmylation and autophagy by its ability to bind both UBLs. Here the authors revisited the assumption that only multicellular organisms have the UFM1 system. Using bioinformatic tools they show that it exists also in unicellular organism. Also, they show that in some organisms the E3 complex UFL1, UFBP1 and C53 exist but not UBA5, UFC1 or UFM1. This is a very interesting observation that suggests an additional role for this complex. In Fig 1C the authors show that in Chlamydomonas RPL26 undergoes ufmylation. Please use IP against RPL26 and then a blot with anti UFM1. From the current experiment it is not clear how the authors know that this is indeed RPL26 that undergoes ufmylation

      RPL26 is highly conserved across eukaryotes, so by comparing our western blots with previous studies (Walczak et. al., 2019, Wang et al. 2020), we concluded that these bands corresponded to UFMylated RPL26. However, we agree with the reviewer that we need to confirm the identify of RPL26 with additional assays. Since the submission of the manuscript, we tested RPL26 antibodies in Chlamydomonas and showed that they work well. So, we will update our figure with the confirmation westerns.

      In the second part of the manuscript the authors characterize the interaction of C53 with ATG8 and UFM1. This is a continuation of their previous published work (Stephani et al, 2020). Here the reviewer thinks that further data on the binding of these proteins to C53 is required. Specifically, defining the Kd of these interactions using ITC or other biophysical method can contribute to the study.

      We agree with the reviewer. To obtain the KD values, we will perform ITC experiments with C53 wild type, a C53 sAIM mutant and a C53 cAIM variant titrated with ATG8 and UFM1.

      Under normal condition the authors suggest that C53 binds UFM1 and this keeps it inactive. The reviewer thinks that this claim needs further support. Using IP (maybe with crosslinker) the author can show that C53, in normal conditions, bind more UFM1 than ATG8. Also, since the interaction of UFM1 to C53 is noncovalent, it will be nice to show how alternations in UFM1 expression levels can affect the activation of C53.

      We thank the reviewer for this suggestion. Since the submission of the manuscript, we have obtained UFM1 overexpression lines. We will pull on C53 using our C53 antibody and check for ATG8 levels in wild type and UFM1 overexpressing lines under normal and stress conditions. We think this will show how alterations in UFM1 levels can affect C53 activation.

      Finally, the authors suggest that ufmylation of RPL26 allows binding of ATG8 to C53 and this, in turn, leads to C53 activation. Can the authors show that in cells lacking UBA5, under normal condition or with Tunicamycin treatment, ATG8 does not activate C53 due to the fact that UFM1 does not leave C53.

      In Stephani et al., we showed that C53-mediated autophagy requires the UFMylation machinery. In ufl1 and ddrgk1 mutants, C53 becomes insensitive to ER stress. However, to supplement these results, we will perform autophagic flux assays using the native C53 antibody to test autophagic degradation of C53 in a uba5 and ufc1 mutant under normal and tunicamycin stress conditions. The uba5 mutant that we have is a knockdown, so that’s why we will include the ufc1 mutant in our experiments.

      Significance

      This manuscript advances our understanding of the connection of ufmylation to autophagy which is mediated by C53.

      Thank you!

      Referee #2

      Evidence, reproducibility and clarity

      The manuscript from Picchianti et al. seeks to define the role of CDK5RAP3 (hereinafter referred as C53) during autophagy and its interplay with UFMylation. Together with UFL1 and DDRGK1, C53 is a component of a trimeric UFM1 E3 ligase complex that modifies the 60S ribosomal protein RPL26 at the endoplasmic reticulum (ER) surface upon ribosomal stalling (among other proposed functions that are not addressed). Several previous studies have implicated the UFMylation pathway in autophagy or ER-phagy although a non-autophagic fate for UFM1- tagged ribosomal subunits has also been reported. A previous study from the same authors (PMID: 32851973) identified an intrinsically disorder region (IDR) in C53 that is necessary and sufficient for interaction between C53 and autophagy receptor, ATG8. They reported that this IDR comprises four non canonical ATG8 interacting motifs (AIM), named shuffled AIMs (sAIMs) and showed that combinatorial mutagenesis of sAIM1, sAIM2, and sAIM3 abrogates ATG8 binding. A similar effect was observed for plant C53, though an additional canonical AIM (cAIM) in the C53 IDR had to be mutated to completely abolish C53 and ATG8 interaction. The earlier study reported that C53 IDR also interacts with UFM1, and this interaction can be disrupted in vitro by adding increasing concentration of ATG8, suggesting that ATG8 and UFM1 may compete with one another for C53 binding. The present paper attempts to build on this previous work by using phylogenomics to infer a coevolutionary relationship between UFMylation machinery and sAIMs in C53, which the authors argue, constitutes further evidence of the primary importance of a role for UFMylation in ER homeostasis. The manuscript includes a lot of biochemical data using variations of in vitro and in vivo pull-down experiments to define the roles of individual AIMs in mediating the binding of C53 to ATG8 and to UFM1. They also use NMR spectroscopy in an attempt to define the structural basis of the UFM1 and ATG8 binding to C53, concluding that plant C53 interacts with UFM1 mainly through sAIM1, while interaction with ATG8 requires cAIM as well as sAIM1 and sAIM2. Finally, the authors attempt to contextualize these findings by conducting studies on Arabidopsis mutants, showing that replacing sAIMs with cAIMs causes increases sensitivity to ER stress and apparently increases formation of C53 intracellular puncta that may colocalize with ATG8. From these data the authors concluded that the dual-ATG8 and UFM1 binding of C53 IDR regulates C53 recruitment to autophagosomes in response to ER stress. Major Issues: 1) The phylogenomics analysis conclusion that UFM1 is common in unicellular lineages and did not evolve in multicellular eukaryotes is not novel, as another comprehensive analysis of UFM1 phylogeny, published eight years ago - in 2014 - by Grau-Bové et al. (PMID: 25525215), also reported that UFM1, UBA5, UFC1, UFL1 and UFSP2 were likely present in LECA and lost in Fungi. Although the phylogenomic analysis by Picchianti et al. is also extended to DDRGK1 and C53 proteins, and some parasitic and algal lineages, their findings are incremental. Their proposed coevolution of sAIM and UFM1 is based on presence-absence correlation observed within five species (i.e., Albugo candida, Albuco laibachii, Piromyces finnis, Neocallimastix californiae, Anaeromyces robustus). However, this coevolutionary relationship must be further investigated by substantially increasing the taxonomic sampling within the UFM1-lacking group.

      We were aware that previous studies had investigated the distribution of UFMylation proteins across eukaryotes and that these analyses had predicted the presence of UFMylation in LECA and subsequent loss in Fungi. We included a more recent citation noting this (Tsaban et al. 2021) but apologise for not citing Grau-Bové et al. (2014), which we have now included. We must emphasize that our results are not incremental. Although we had made a point of emphasizing the presence of UFM1 in LECA, this was to counter a recent and highly cited paper in the field which claimed that UFMylation evolved in plants and animals (Walczak et al. 2019). Below we note the novel and important results from our phylogenomic analyses: 1. We used improved taxonomic sampling and more advanced comparative genomics methods to identify UFMylation components sensitively and specifically across eukaryotes. This involved the inclusion of additional eukaryotic genomes, phylogenetic annotation of orthologs, and genomic searches to complement proteome predictions. These methods are essential for accurately identifying UFMylation components and yield more robust results than using sequence similarity clustering (Tsaban et al. 2021) or un-curated Pfam HMMER search results (Grau-Bové et al. 2014). 2. By placing our UFMylation reconstructions in a modern phylogenetic context we were not only able to support previous observations which noted the presence of UFM1 in LECA and its loss in Fungi (Grau-Bové et al. 2014) and Plasmodium (Tsaban et al. 2021), but also to identify novel patterns in the evolution of UFMylation. This included the observation of recurrent losses in diverse but trophically-related lineages (such as algae and parasites) and revealed the retention of certain UFMylation components in the absence of UFM1. We identified the frequent coretention of UFL1 and DDRGK1 following UFM1 loss in multiple eukaryotic groups, including Fungi, which were previously thought to be devoid of UFMylation machinery. These previously uncharacterized patterns, suggest that these proteins could have alternative functions and may be functionally associated with life history. These results therefore expand on and add complexity to our understanding of the evolution of UFMylation. 3. By conducting a comprehensive and accurate survey of UFMylation components we were able to use our data to examine co-evolutionary trends between C53 and UFM1, which would have been incomplete and inaccurate using previously curated datasets. As the reviewer noted, only five species were identified that encoded C53 but lacked UFM1. This is not a reflection of insufficient taxon sampling, but rather the strong co-evolution between C53 and UFM1 (i.e., when UFM1 is lost, C53 is almost always lost as well). We attempted to identify additional cases by searching hundreds of fungal and oomycete genomes as well as those from other eukaryotes, but no other species were found. We agree with the reviewer that additional taxa would have made our analyses stronger, but importantly, we do not rely on genomic correlations to infer function. Rather, we use these correlations to generate functional hypotheses which we then tested experimentally. In this way, we do not rely on the strength of our correlations. We have now revised the manuscript to include additional context (including citations) and have improved the clarity of the text to better convey the novelty of our findings.

      2) The manuscript presents an overwhelming amount of biochemical and structural data obtained from a variety of protein binding techniques (i.e., NMR spectroscopy, in vitro GSTpulldown, fluorescence microscopy-based on-bead binding assays, and native massspectrometry). The results are poorly explained and not organized in a logical manner. Moreover, no attempt was made to explain the rationale behind using one technique over the other or how one method complements another to build a stronger conclusion than any individual approach. Given that none of the methods employed report quantitative measurement of binding affinities between C53 IDR and UFM1 or ATG8, it is not clear how the data presented in this manuscript contribute to our understanding of the proposed competition model for UFM1 and ATG8 binding to C53 IDR. To conclude that an interaction is "stronger" or "weaker" it is necessary to measure equilibrium binding constants. Fortunately, there are suitable techniques, including surface plasmon resonance (SPR), microscale thermophoresis (MST), fluorescence anisotropy, or calorimetry that are available to dissect these complex competitive binding interactions and to build models.

      We thank the reviewer for their suggestion. Although we attempted to describe the rationale behind each experiment (please see the line 135-137; on-bead binding assays, line154-157; NMR, 177-181), we agree that the volume of data and variety of techniques warrants additional explanation. We will revise the manuscript to further explain our rationale for using each of the different approaches. As we noted above in our response to reviewer 1, we will also perform relevant ITC binding assays to quantify the interaction between C53, ATG8, and UFM1.

      3) The NMR studies have the potential to dissect the types of dynamic binding inherent in unstructured proteins. However, the abundant NMR data presented combined with the aforementioned binding studies, remarkably, do not seem to significantly advance our understanding of how the system is organized or even how UFM1 and ATG8 bind C53, beyond the rather vague and somewhat circular conclusion stated in the abstract: "...we confirmed the interaction of UFM1 with the C53 sAIMs and found that UFM1 and ATG8 bound the sAIMs in a different mode." Or on line 165 "Altogether these results suggested that ATG8 and UFM1 bind the sAIMs withn C54 IDR, albeit in a different manner".

      We agree that NMR has the potential to dissect the complex binding interactions between UFM1, ATG8, and C53, but disagree with the reviewer’s interpretation that our NMR data fail to achieve this. To sum up, our NMR data: 1. Revealed the structural basis of the interaction of C53-IDR with ATG8 and UFM1 at atomic resolution by showing that UFM1 binds preferentially to sAIM1 in the fast-intermediate exchange [Fig.4 and Fig. S7B], instead ATG8 binds cAIM in the slow-intermediate exchange, and once cAIM is occupied, it binds sAIM1,2 with lower affinity in the fast-intermediate exchange (Fig.4 and Fig.S7D). 2. Determined conformational changes in C53 IDR upon binding of ATG8, but not UFM1 (Fig.S7E), which lead to increased dynamics in distinct regions in C53 IDR. These data could explain how binding of first ATG8 would trigger C53-dependent recruitment of the tripartite complex to autophagosomes. 3. Identified how UFM1 binds to atypical hydrophobic patch in C53 sAIM, similar to what was shown for the UBA5 LIR/UFIM. To sum up, our results shed light on how both UBLs interact with C53, being sAIM1 the highest affinity binding site for UFM1 while ATG8 binds cAIM preferentially before occupying sAIM1,2. To provide more detailed information on the atomic details of the interaction between C53 and the UBLs, we will perform molecular docking studies by using the restraints obtained from the experimental NMR data.

      4) The functional assays performed in Arabidopsis do not support the competitive model between UFM1 and ATG8 for binding to C53 during C53-mediated autophagy. The fluorescence microscopy images do not provide convincing evidence of colocalization between C53 and ATG8. In fact, in contrast to the claims made in the text or the quantification, mCherry-C53 fluorescence does not seem to localize in discrete puncta and its signal does not seem to overlap with ATG8A.

      We disagree with the reviewer’s interpretation of these results although we acknowledge that there is some subtlety in interpreting the co-localization data. Importantly, Arabidopsis has 9 ATG8 isoforms and C53 can bind to most of them with varying affinities (see Stephani et al). Because of this, we do not expect C53 puncta to fully colocalize with ATG8A puncta. Additionally, the C53 puncta are smaller and more subtle than ATG8 puncta, which label the entire autophagosome. To reconcile this, we will quantify the effect by performing colocalization analyses under normal and stress conditions. We will also upload all the raw images as supporting material, so that anyone can independently assess our images.

      Minor Issues: 1. The authors might choose to avoid teleological arguments such as (line 135): "As the phylogenomic analysis suggested that eh sAIMs have been retained to mediate C53-UFM1 interaction..."

      We thank the reviewer for this suggestion and will modify the text accordingly.

      1. The authors refer on multiple occasions to C53 "autoactivation" without defining what they mean by this. Do they propose that C53 UFMylates itself?.

      We refer to C53 activity as the ability to recruit the autophagy machinery and initiate cargo sequestration and degradation in the vacuole. We attempted to explain this in lines 57-61 but we will reword it more clearly, as suggested by the reviewer.

      1. The paper might want to avoid preachy philosophical statements like "Our evolutionary analysis also highlights why we should move beyond yeast and metazoans and instead consider the whole tree of life when using evolutionary arguments to guide biological research." (333- 335). While this is indeed a laudable goal, given the rather limited insights from this study, it is unclear how this paper exemplifies the notion.

      We added this statement as we were intrigued by our evolutionary analyses’ ability to link C53 to UFM1 (an association which took years to identify experimentally) and generate useful functional hypotheses about the interaction between C53 sAIMs and UFM1. As we mentioned above, we also wanted to highlight this point in reference to a recent prominent study in the field which drew conclusions after only considering animals, plants, and fungi (Walczak et al., 2019). We believe this point is important and underappreciated by some cell biologists, but we will modify the text to make it more generic: “This work highlights the utility of using evolutionary analyses and eukaryotic diversity to generate mechanistic hypotheses for cellular processes”.

      Significance

      Overall, while the manuscript contains an abundance of new data, the overall conclusion of the work, stated in the title: "Shuffled ATG8 interacting motifs form an ancestral bridge between UFMylation and C53-mediated autophagy" does not constitute a significant advance beyond other published phylogenomic analysis (below) and the two previous papers by the same authors, including the 2020 paper "A cross-kingdom conserved ER-phagy receptor maintains endoplasmic reticulum homeostasis during stress (PMID: 32851973)" and the 2021 paper "C53 is a cross-kingdom conserved reticulophagy receptor that bridges the gap between selective autophagy and ribosome stalling at the endoplasmic reticulum PMID: 33164651)". While a regulatory interaction between UFMylation and autophagy is of potential importance, the data in this manuscript do not constitute a major advance and fail to provide new mechanistic insight to explain the role of C53 IDR in autophagy and its interplay with UFMylation

      We disagree with the reviewer’s suggestion that our work does not constitute a significant advance. We outlined above in detail the novel insights that were obtained from our phylogenomic analysis which involved using improved methods to reveal a much more dynamic and informative picture of UFMylation evolution than has been described previously. Likewise, this manuscript builds substantially on our previous mechanistic work. In our 2020 paper (which is summarized in the mentioned 2021 review article), we identified C53 as an ER-associated protein that binds ATG8 through sAIMs and interacts with the phagophore after RPL26 UFMylation. This work linked C53 activity to ER-phagy and highlighted its importance in plant and animal stress response. However, key questions remained unanswered prior to our current work such as whether this mechanism is conserved across eukaryotes, especially in unicellular species, how C53 activity is regulated, and how UFM1 and ATG8 interact with C53. Our current manuscript builds on this work with the following key results: 1. We use a combination of phylogenomic and experimental analyses to demonstrate that C53 function is conserved across eukaryotes. 2. We reveal a mechanism whereby UFM1 and ATG8 compete for binding at the sAIMs in the C53 IDR and characterize how each of these ubiquitin-like proteins interacts in an alternative way (see the NMR results described above). 3. We show how the sAIMs are required for the regulation of C53-mediated autophagy and reveal the importance of UFM1-ATG8 competition in preventing C53 autoactivation, which causes unnecessary autophagic degradation and impairs cellular stress responses.

      These insights are fundamental for understanding the mechanisms regulating C53-mediated autophagy which were unknown before this work. We will therefore adjust our manuscript to more clearly and explicitly explain how our data build on previous observations so that the novelty and significance of our results are clearer.

      Referee #3

      Evidence, reproducibility and clarity

      Picchianti and colleagues have investigated a conserved molecular framework that orchestrates ER homeostasis via autophagy. For this, they have carried out phylogenomics and large-scale gene family analyses across eukaryote diversity as well as a barrage of molecular lab work. The amount of work carried out as well as the overall quality of the study is impressive.

      Thank you!

      I have only a few comments that should be very easy to tackle. (1) Maybe I missed it, but please upload all alignments used for phylogenetics and phylogenomics for reproducibility to e.g. Zenodo, Figshare or other suitable OA databases.

      We included the alignments in the supplementary data, but as suggested, we will upload all the source data including the scripts and the alignments to Zenodo.

      (2) "Why these non-canonical motifs were selected during evolution, instead of canonical ATG8 interacting motifs remains unknown" --> Maybe there is no "why" and these were not selected at all. Could be random... drift, non-adaptive constructive neutral evolution. I am not saying that asking "why" in evolutionary biology is wrong. It, however, often does not yield satisfactory answers--or any answer at all.

      The reviewer is completely right that “why” is not the right way to frame an evolutionary question. Thank you for pointing this out. We will revise the text and make sure that we remove these kinds of deterministic statements.

      (3) The authors make a case for UFMylation in LECA and I am fully sympathetic with this. However, getting rid of misfoled/problematic proteins and subcellular entities is something that prokaryotes also to a certain degree must have (and still do) master. Are inclusion bodies or export their only answers (I don't know)? Of course, in eukaryotes with all their intracellular complexity this is likely more of an issue. Given the scope of this manuscript (i.e. shedding light on that ancient framework, deep evolutionary roots in eukaryote evolution etc. etc.) it would be very interesting to read the authors thoughts on this and also pinpoint the prokaryote/eukaryote divide in light of the machinery discussed here.

      Thank you for this suggestion. We did indeed check whether any of the UFMylation machinery were present in prokaryotes and only found homologs of UFSP2. These results are consistent with Grau-Bové et al. (2014) who conducted an equivalent analysis and concluded that UFMylation machinery were derived during eukaryogenesis. We will make reference to this in the revised manuscript.

      Significance

      This study not only impresses with the volume of experiments and data, but also the courage to show conservation of a molecular framework by working with such a range of distantly-related eukaryotes. The results and conclusions from this study should be interesting to anyone working in the broad fields of cellular stress and/or autophagy--both extremely timely topics.

      We thank the reviewer for understanding our take-home message and the advances made. We especially thank the reviewer for understanding the challenge of connecting in silico genomic data with in vivo and in vitro experiments.

      CROSS-CONSULTATION COMMENTS

      Referee #2 The challenge in providing a fair review of this manuscript is to clearly define what contributions are novel, significant advances. It is difficult to tell the way the manuscript is written, as it is unclear how the new data - which are voluminous- actually advance the model already put forth by the same authors in two previous publications. It is also unfortunate that the authors overlooked the 2004 phylogenomics paper. There clearly are some new pieces of information here, but the overall increment in knowledge is rather minimal. Response from Referee #3 I agree that the authors somehow steamroll the reader with a wealth of data. But I think this can be addressed by the authors by requesting a lot more justification and by giving them the opportunity to put the significant advances into their own words. This is, in my opinion, quite doable in course of a revision. Overall I have to say that I am very sympathetic with the crosseukaryote reactivity approach that the authors have taken. It is quite intriguing.

      We thank the reviewers for this useful exchange. We agree that our manuscript was not clear enough to emphasize the novelty of our results which likely resulted from the volume and diversity of the experiments and analyses that were presented. We have now revised the manuscript to improve the context and rationale for the study, the intent and hypotheses behind each experiment, and the novel results and insights obtained in each section.

      Response from Referee #2 I agree that the cross-eukaryote approach is intriguing. Shouldn't we be concerned that the 2004 publication already made two of their key points (ie present in LECA, loss in Fungi). What is the incremental insight from this paper? I'd appreciate an opinion from an evolutionary biologist as to how strongly one can conclude functional co-evolution from such correlative data, especially given the rather small number of supporting examples. Is it also necessary to consider counter-examples- ie species that have sAIMs but no UFM1 (I believe that they found a few such cases)?

      Importantly, we do not conclude functional co-evolution from our correlative data. Instead, we used these correlations to generate hypotheses that we tested with various experiments in different model systems. For example, the apparent correlation between C53 sAIMs and UFM1 prompted us to test whether or not UFM1 and sAIMs interact. Regardless of sample size or statistical significance, phylogenomic analyses can never demonstrate functional links, only correlations, which is why we combined these two approaches. Although only a few species encoded C53 without UFM1, each of these contained C53 cAIMs and lacked sAIMs (Figure 2c). There are species with UFM1 that lack C53 but this makes sense as UFM1 is used in other processes besides ER-phagy. We have revised the text to make our approach and reliance on certain data clearer.

      Response from Referee #3 Well with these deep evolutionary questions this is always a challenge. Where does one stop to sample more homologs for one's analyses (one from each supergroup [which are no longer recognised by the community])? In that sense, the authors are right to make the parsimonious base assumption that if X and Y interact in species A and B (no matter how distant they are related) then X and Y interacted in the last common ancestor of A and B. That being said, if I would have designed this study, I would have sampled more broadly for my in vitro crosseukaryote approach. But also this, I think, could be carried out by the authors in a reasonable timeframe. Specifically, they have now sampled from Amorphea and Archaeplastida, they should add one from TSAR, one Haptista, one Cryptista, and one CRuM. If they synthesised the proteins via a company, they could have the constructs in a few weeks for about 1K Euro - I do not think that this would be an unreasonable request.

      We agree that testing C53 function in additional species would strengthen our understanding of the conservation of this pathway across eukaryotes, as it cannot be assumed that orthologous proteins will function in the same way across all species. To our knowledge there is no other work showing experimentally that the UFMylation pathway is working in a single-celled organism. We focussed our efforts on the unicellular green alga, Chlamydomonas due to its relative experimental tractability. However, testing this was not trivial as it required us to establish expression and purification protocols, isolate Chlamydomonas mutants, optimize physiological stress assays, and perform the experiments.

      Nevertheless, we agree that we could expand our in vitro assays with C53 orthologs from additional species. As suggested by reviewer 3, we will now synthesize 6 more C53 isoforms from two TSAR representatives (the alveolate, Tetrahymena thermophila, and the stramenopile, Phytophthora sojae), as well as a representative from Haptista (Emiliania), Cryptista (Guillardia), Diplomonada (Trypanosoma), and CRuMs (Rigifila). We will test their interaction with human and plant ATG8 and UFM1 proteins. We have also added two species from CRuMs into our phylogenomic analysis.

      The list of experiments that we can do to address the reviewer’s concerns: 1. Repeat experiment in Figure 1C probing with �-RPL26. 2. To calculate KD values, perform ITC experiments with C53 wild-type, C53 sAIM mutant and C53 cAIM variant titrated with ATG8 and UFM1. 3. Perform CoIP experiments using C53 antibody in wild type and UFM1 overexpressing lines and detect for ATG8 association, under normal and stress conditions. 4. We will test autophagic degradation of C53 in uba5 and ufc1 mutants under normal and tunicamycin stress conditions by performing autophagic flux assays using the native C53 antibody 5. Molecular docking studies to see C53’s structural rearrangements leading to ATG8 and UFM1 binding. 6. Figures from co-localization experiments in Figure 5G will be revisited and we will perform additional co-localization analyses such as Pearson coefficient under normal and stress conditions. We will also upload all the raw images as supporting material, so that anyone can independently assess our images. 7. We will upload all the source data for phylogenomic analyses, including scripts and alignments to Zenodo. 8. Test the interaction of 6 newly synthesised C53 isoforms from: (1) an alveolate (tsAr, Ciliate), (2) a stramenopile (tSar, Phaeodactylum), (3) a haptophyte (Emiliania), (4) a cryptophyte (Guillardia), (5) a diplomonad (Trypanosoma) and (6) a CrRuM with human and plant ATG8 and UFM1 proteins.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Picchianti and colleagues have investigated a conserved molecular framework that orchestrates ER homeostasis via autophagy. For this, they have carried out phylogenomics and large-scale gene family analyses across eukaryote diversity as well as a barrage of molecular lab work. The amount of work carried out as well as the overall quality of the study is impressive. I have only a few comments that should be very easy to tackle.

      1. Maybe I missed it, but please upload all alignments used for phylogenetics and phylogenomics for reproducibility to e.g. Zenodo, Figshare or other suitable OA databases.
      2. "Why these non-canonical motifs were selected during evolution, instead of canonical ATG8 interacting motifs remains unknown" --> Maybe there is no "why" and these were not selected at all. Could be random... drift, non-adaptive constructive neutral evolution. I am not saying that asking "why" in evolutionary biology is wrong. It, however, often does not yield satisfactory answers--or any answer at all.
      3. The authors make a case for UFMylation in LECA and I am fully sympathetic with this. However, getting rid of misfoled/problematic proteins and subcellular entities is something that prokaryotes also to a certain degree must have (and still do) master. Are inclusion bodies or export their only answers (I don't know)? Of course, in eukaryotes with all their intracellular complexity this is likely more of an issue. Given the scope of this manuscript (i.e. shedding light on that ancient framework, deep evolutionary roots in eukaryote evolution etc. etc.) it would be very interesting to read the authors thoughts on this and also pinpoint the prokaryote/eukaryote divide in light of the machinery discussed here.

      Referees cross-commenting

      Referee #2

      The challenge in providing a fair review of this manuscript is to clearly define what contributions are novel, significant advances. It is difficult to tell the way the manuscript is written, as it is unclear how the new data - which are voluminous- actually advance the model already put forth by the same authors in two previous publications. It is also unfortunate that the authors overlooked the 2004 phylogenomics paper. There clearly are some new pieces of information here, but the overall increment in knowledge is rather minimal.

      Response from Referee #3

      I agree that the authors somehow steamroll the reader with a wealth of data. But I think this can be addressed by the authors by requesting a lot more justification and by giving them the opportunity to put the significant advances into their own words. This is, in my opinion, quite doable in course of a revision. Overall I have to say that I am very sympathetic with the cross-eukaryote reactivity approach that the authors have taken. It is quite intriguing.

      Response from Referee #2

      I agree that the cross-eukaryote approach is intriguing. Shouldn't we be concerned that the 2004 publication already made two of their key points (ie present in LECA, loss in Fungi). What is the incremental insight from this paper?

      I'd appreciate an opinion from an evolutionary biologist as to how strongly one can conclude functional co-evolution from such correlative data, especially given the rather small number of supporting examples. Is it also necessary to consider counter-examples- ie species that have sAIMs but no UFM1 (I believe that they found a few such cases)?

      Response from Referee #3

      Well with these deep evolutionary questions this is always a challenge. Where does one stop to sample more homologs for one's analyses (one from each supergroup [which are no longer recognised by the community])? In that sense, the authors are right to make the parsimonious base assumption that if X and Y interact in species A and B (no matter how distant they are related) then X and Y interacted in the last common ancestor of A and B. That being said, if I would have designed this study, I would have sampled more broadly for my in vitro cross-eukaryote approach. But also this, I think, could be carried out by the authors in a reasonable timeframe. Specifically, they have now sampled from Amorphea and Archaeplastida, they should add one from TSAR, one Haptista, one Cryptista, and one CRuM. If they synthesised the proteins via a company, they could have the constructs in a few weeks for about 1K Euro - I do not think that this would be an unreasonable request.

      Significance

      This study not only impresses with the volume of experiments and data, but also the courage to show conservation of a molecular framework by working with such a range of distantly-related eukaryotes. The results and conclusions from this study should be interesting to anyone working in the broad fields of cellular stress and/or autophagy--both extremely timely topics.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript from Picchianti et al. seeks to define the role of CDK5RAP3 (hereinafter referred as C53) during autophagy and its interplay with UFMylation. Together with UFL1 and DDRGK1, C53 is a component of a trimeric UFM1 E3 ligase complex that modifies the 60S ribosomal protein RPL26 at the endoplasmic reticulum (ER) surface upon ribosomal stalling (among other proposed functions that are not addressed). Several previous studies have implicated the UFMylation pathway in autophagy or ER-phagy although a non-autophagic fate for UFM1-tagged ribosomal subunits has also been reported.

      A previous study from the same authors (PMID: 32851973) identified an intrinsically disorder region (IDR) in C53 that is necessary and sufficient for interaction between C53 and autophagy receptor, ATG8. They reported that this IDR comprises four non canonical ATG8 interacting motifs (AIM), named shuffled AIMs (sAIMs) and showed that combinatorial mutagenesis of sAIM1, sAIM2, and sAIM3 abrogates ATG8 binding. A similar effect was observed for plant C53, though an additional canonical AIM (cAIM) in the C53 IDR had to be mutated to completely abolish C53 and ATG8 interaction. The earlier study reported that C53 IDR also interacts with UFM1, and this interaction can be disrupted in vitro by adding increasing concentration of ATG8, suggesting that ATG8 and UFM1 may compete with one another for C53 binding.

      The present paper attempts to build on this previous work by using phylogenomics to infer a co-evolutionary relationship between UFMylation machinery and sAIMs in C53, which the authors argue, constitutes further evidence of the primary importance of a role for UFMylation in ER homeostasis. The manuscript includes a lot of biochemical data using variations of in vitro and in vivo pull-down experiments to define the roles of individual AIMs in mediating the binding of C53 to ATG8 and to UFM1. They also use NMR spectroscopy in an attempt to define the structural basis of the UFM1 and ATG8 binding to C53, concluding that plant C53 interacts with UFM1 mainly through sAIM1, while interaction with ATG8 requires cAIM as well as sAIM1 and sAIM2. Finally, the authors attempt to contextualize these findings by conducting studies on Arabidopsis mutants, showing that replacing sAIMs with cAIMs causes increases sensitivity to ER stress and apparently increases formation of C53 intracellular puncta that may colocalize with ATG8.

      From these data the authors concluded that the dual-ATG8 and UFM1 binding of C53 IDR regulates C53 recruitment to autophagosomes in response to ER stress.

      Major Issues:

      1. The phylogenomics analysis conclusion that UFM1 is common in unicellular lineages and did not evolve in multicellular eukaryotes is not novel, as another comprehensive analysis of UFM1 phylogeny, published eight years ago - in 2014 - by Grau-Bové et al. (PMID: 25525215), also reported that UFM1, UBA5, UFC1, UFL1 and UFSP2 were likely present in LECA and lost in Fungi. Although the phylogenomic analysis by Picchianti et al. is also extended to DDRGK1 and C53 proteins, and some parasitic and algal lineages, their findings are incremental. Their proposed coevolution of sAIM and UFM1 is based on presence-absence correlation observed within five species (i.e., Albugo candida, Albuco laibachii, Piromyces finnis, Neocallimastix californiae, Anaeromyces robustus). However, this coevolutionary relationship must be further investigated by substantially increasing the taxonomic sampling within the UFM1-lacking group.
      2. The manuscript presents an overwhelming amount of biochemical and structural data obtained from a variety of protein binding techniques (i.e., NMR spectroscopy, in vitro GST-pulldown, fluorescence microscopy-based on-bead binding assays, and native mass-spectrometry). The results are poorly explained and not organized in a logical manner. Moreover, no attempt was made to explain the rationale behind using one technique over the other or how one method complements another to build a stronger conclusion than any individual approach. Given that none of the methods employed report quantitative measurement of binding affinities between C53 IDR and UFM1 or ATG8, it is not clear how the data presented in this manuscript contribute to our understanding of the proposed competition model for UFM1 and ATG8 binding to C53 IDR. To conclude that an interaction is "stronger" or "weaker" it is necessary to measure equilibrium binding constants. Fortunately, there are suitable techniques, including surface plasmon resonance (SPR), microscale thermophoresis (MST), fluorescence anisotropy, or calorimetry that are available to dissect these complex competitive binding interactions and to build models.
      3. The NMR studies have the potential to dissect the types of dynamic binding inherent in unstructured proteins. However, the abundant NMR data presented combined with the aforementioned binding studies, remarkably, do not seem to significantly advance our understanding of how the system is organized or even how UFM1 and ATG8 bind C53, beyond the rather vague and somewhat circular conclusion stated in the abstract: "...we confirmed the interaction of UFM1 with the C53 sAIMs and found that UFM1 and ATG8 bound the sAIMs in a different mode." Or on line 165 "Altogether these results suggested that ATG8 and UFM1 bbind the sAIMs withn C54 IDR, albeit in a different manner".
      4. The functional assays performed in Arabidopsis do not support the competitive model between UFM1 and ATG8 for binding to C53 during C53-mediated autophagy. The fluorescence microscopy images do not provide convincing evidence of colocalization between C53 and ATG8. In fact, in contrast to the claims made in the text or the quantification, mCherry-C53 fluorescence does not seem to localize in discrete puncta and its signal does not seem to overlap with ATG8A.

      Minor Issues:

      1. The authors might choose to avoid teleological arguments such as (line 135): "As the phylogenomic analysis suggested that eh sAIMs have been retained to mediate C53-UFM1 interaction..."
      2. The authors refer on multiple occasions to C53 "autoactivation" without defining what they mean by this. Do they propose that C53 UFMylates itself?.
      3. The paper might want to avoid preachy philosophical statements like "Our evolutionary analysis also highlights why we should move beyond yeast and metazoans and instead consider the whole tree of life when using evolutionary arguments to guide biological research." (333-335). While this is indeed a laudable goal, given the rather limited insights from this study, it is unclear how this paper exemplifies the notion.

      Referees cross-commenting

      Referee #2

      The challenge in providing a fair review of this manuscript is to clearly define what contributions are novel, significant advances. It is difficult to tell the way the manuscript is written, as it is unclear how the new data - which are voluminous- actually advance the model already put forth by the same authors in two previous publications. It is also unfortunate that the authors overlooked the 2004 phylogenomics paper. There clearly are some new pieces of information here, but the overall increment in knowledge is rather minimal.

      Response from Referee #3

      I agree that the authors somehow steamroll the reader with a wealth of data. But I think this can be addressed by the authors by requesting a lot more justification and by giving them the opportunity to put the significant advances into their own words. This is, in my opinion, quite doable in course of a revision. Overall I have to say that I am very sympathetic with the cross-eukaryote reactivity approach that the authors have taken. It is quite intriguing.

      Response from Referee #2

      I agree that the cross-eukaryote approach is intriguing. Shouldn't we be concerned that the 2004 publication already made two of their key points (ie present in LECA, loss in Fungi). What is the incremental insight from this paper?

      I'd appreciate an opinion from an evolutionary biologist as to how strongly one can conclude functional co-evolution from such correlative data, especially given the rather small number of supporting examples. Is it also necessary to consider counter-examples- ie species that have sAIMs but no UFM1 (I believe that they found a few such cases)?

      Response from Referee #3

      Well with these deep evolutionary questions this is always a challenge. Where does one stop to sample more homologs for one's analyses (one from each supergroup [which are no longer recognised by the community])? In that sense, the authors are right to make the parsimonious base assumption that if X and Y interact in species A and B (no matter how distant they are related) then X and Y interacted in the last common ancestor of A and B. That being said, if I would have designed this study, I would have sampled more broadly for my in vitro cross-eukaryote approach. But also this, I think, could be carried out by the authors in a reasonable timeframe. Specifically, they have now sampled from Amorphea and Archaeplastida, they should add one from TSAR, one Haptista, one Cryptista, and one CRuM. If they synthesised the proteins via a company, they could have the constructs in a few weeks for about 1K Euro - I do not think that this would be an unreasonable request.

      Significance

      Overall, while the manuscript contains an abundance of new data, the overall conclusion of the work, stated in the title: "Shuffled ATG8 interacting motifs form an ancestral bridge between UFMylation and C53-mediated autophagy" does not constitute a significant advance beyond other published phylogenomic analysis (below) and the two previous papers by the same authors, including the 2020 paper "A cross-kingdom conserved ER-phagy receptor maintains endoplasmic reticulum homeostasis during stress (PMID: 32851973)" and the 2021 paper "C53 is a cross-kingdom conserved reticulophagy receptor that bridges the gap between selective autophagy and ribosome stalling at the endoplasmic reticulum PMID: 33164651)". While a regulatory interaction between UFMylation and autophagy is of potential importance, the data in this manuscript do not constitute a major advance and fail to provide new mechanistic insight to explain the role of C53 IDR in autophagy and its interplay with UFMylation

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript Picchianti et al. provide novel insights into the interaction of C53 with UFM1 and ATG8. Initially, the authors show that protein modification by UFM1 exists in the unicellular organism Chlamydomonas reinhardtii. To that end they demonstrated that pure Chlamydomonas UBA5, UFC1 and UFM1 proteins, can charge UFC1. Then, they showed that C53 interacts with ATG8 and UFM1. Specifically, they found that the sAIM are essential for the interaction with UFM1, while substituting this motif with canonical AIM prevents the binding of UFM1 but not of ATG8. Since binding of C53 to ATG8 recruits the autophagy machinery, the authors suggest that ufmylation of RPL26 releases UFM1 from C53 which allows the binding of ATG8. Overall, the authors demonstrate that C53 that forms a complex with UFL1 connects between protein ufmylation and autophagy by its ability to bind both UBLs.

      Here the authors revisited the assumption that only multicellular organisms have the UFM1 system. Using bioinformatic tools they show that it exists also in unicellular organism. Also, they show that in some organisms the E3 complex UFL1, UFBP1 and C53 exist but not UBA5, UFC1 or UFM1. This is a very interesting observation that suggests an additional role for this complex. In Fig 1C the authors show that in Chlamydomonas RPL26 undergoes ufmylation. Please use IP against RPL26 and then a blot with anti UFM1. From the current experiment it is not clear how the authors know that this is indeed RPL26 that undergoes ufmylation

      In the second part of the manuscript the authors characterize the interaction of C53 with ATG8 and UFM1. This is a continuation of their previous published work (Stephani et al, 2020) . Here the reviewer thinks that further data on the binding of these proteins to C53 is required. Specifically, defining the Kd of these interactions using ITC or other biophysical method can contribute to the study.

      Under normal condition the authors suggest that C53 binds UFM1 and this keeps it inactive. The reviewer thinks that this claim needs further support. Using IP (maybe with crosslinker) the author can show that C53, in normal conditions, bind more UFM1 than ATG8. Also, since the interaction of UFM1 to C53 is noncovalent, it will be nice to show how alternations in UFM1 expression levels can affect the activation of C53. Finally, the authors suggest that ufmylation of RPL26 allows binding of ATG8 to C53 and this, in turn, leads to C53 activation. Can the authors show that in cells lacking UBA5, under normal condition or with Tunicamycin treatment, ATG8 does not activate C53 due to the fact that UFM1 does not leave C53.

      Significance

      This manuscript advances our understanding of the connection of ufmylation to autophagy which is mediated by C53.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the reviewers

      Referee #1

      Evidence, reproducibility and clarity

      1. This manuscript constructs a gene expression model with various factors. Specifically, the effect of cell size on gene expression is considered, which is often ignored by previous studies. One interesting finding is that the absolute number of the gene products and the concentration can have different distributions. Some predictions of the models are validated by experimental data on E. coli and yeast. This manuscript uses the mean-field approximation for cell volume, which has good accuracy when the number of stages is large. The usage of the power spectrum has a satisfactory effect on studying the concentration oscillation.

      Response: Thank you for the positive comments.

      1. Overall the paper was very difficult to follow and digest easily because of all the different factors and mechanisms invoked. It is mainly an issue of providing sufficient details for each of the factors and organizing them in a systematic and logical way. Although there is a supplementary appendix, it was hard to keep track of all the elements in the main manuscript. Perhaps something like Fig 1 of the Appendix can be presented in the main body to outline all the ingredients and how they affect each other.

      Response: In the revised manuscript, we moved Supplementary Fig. S1 in the previous version into the main text to outline all the ingredients and how they affect each other (see page 8, Fig. 2). Moreover, we provided many details for each of the biological factors and tried to organize them in a more systematic and logical way (see pages 3-7).

      1. It might be good to provide a more detailed description of the goal (studying gene product number and concentration under different parameters) after introducing the full and the reduced models. A table of symbols would also be helpful.

      Response: In the revised manuscript, we added a table explaning the meaning of all model parameters (see page 4, Table 1). Moreover, we provided a detailed description of the goal of the present paper after introducing the full and reduced models (see page 7).

      1. Some technical details in the Methods section are in fact helpful in understanding the conclusions. They can be moved to the Results section.

      Response: In the revised manuscript, we moved many technical details in Methods and Supplementary Notes to the main text to help the readers better understand the conclusions (see pages 5-10).

      1. One concern is that the central concept of this manuscript, “stage”, is not thoroughly discussed. This concept should have some significant biological meaning, not just be coined for mathematical convenience.

      Response: In the revised manuscript, we explained in detail the biological meaning of the effective cell cycle stages (see page 4). Specifically, recent studies have revealed that in many cell types, the accumulation of some activator to a critical threshold is used to promote mitotic entry and trigger cell division, a strategy known as activator accumulation mechanism. In E. coli, the activator was shown to be FtsZ; in fission yeast, it is believed to be a protein upstream of Cdk1, the central mitotic regulator, such as Cdr2, Cdc25, and Cdc13. Biophysically, the N effective stages can be understood as different levels of the key activator. Moreover, we pointed out that the power law form for the rate of cell cycle progression may come from cooperativity of the key activator that triggers cell division.

      1. Fig. 1(b) is a little strange. For the left panel, the x-axis (stage) is discrete, then the volume (y-axis) should be a step function, not a straight red line.

      Response: In the revised manuscript, we added some red dots in the stage-volume plot to show the dependence of the mean cell volume vk on cell cycle stage k for the mean-field model (see page 3, Fig. 1). Moreover, we emphasized that the joining of these dots by a straight red line is simply a guide to the eye.

      Significance

      1. The main advance is a more complete model of gene expression under more realistic organism growth conditions.

      Response: Thank you for acknowledging the results of the manuscript.

      Referee #2

      Evidence, reproducibility and clarity

      1. Jia et al. introduce a modeling framework to represent stochastic gene expression, with an explicit representation of cell volume growth, cell cycle progression (and its dependency on cell volume) and gene dosage compensation. The model is very elegant and general in that it can represent a variety of situations, simply as a matter of parametrization. Under a simplifying assumption, the authors derive a number of metrics (include stationary distribution of gene product and power spectrum of gene product fluctuation dynamics), for both absolute number and concentration of gene product molecules. They use their model and derivations to examine under which conditions cell can achieve homeostasis in the concentration of the expressed gene product, despite changes in cell volume and gene copy number following replication. They also present and discuss the conditions giving rise to specific features (i.e. bimodality in stationary distribution, peak in power spectrum) and examine these features in experimental data to conclude to infer the underlying homeostasis strategies. The model is rather general and powerful. The simplifying assumption seems reasonable (and the authors investigate to some extent its limitations, i.e. Fig. 2). The conclusions are overall convincing.

      Response: Thank you for the positive comments.

      Major comments 1. My main concern is that the metrics that the authors use to assess concentration homeostasis (i.e. the γ parameter and the presence / absence of peak in power spectrum) do not seem quite appropriate to describe how much variability / fluctuations in concentration are driven by cell cycle effects. Indeed, the γ parameter measures how much the *average* concentration in each cell cycle stage varies throughout the cell cycle. However, this variability should be compared to the total variability due to both cell cycle effects and stochastic bursting dynamics. A given level of cell-cycle dependency (say γ = 0.2) could be very visible if gene expression is weakly noisy (e.g. B low and hni high) and completely invisible is gene expression is highly bursty (large B and small hni). In the latter situation, cell-cycle effects would be meaningless for the cell to minimize. In essence, reusing the authors notations, I think γ/φ1/2 , would be a more relevant metric to observe.

      Response: In the revised manuscript, we showed that the total concentration noise φ can be decomposed as φ = φext + φint, where φext is the extrinsic noise which characterizes the fluctuations between different stages due to cell cycle effects and φint is the intrinsic noise which characterizes the fluctuations within each stage due to stochastic bursty synthesis and degradation of the gene product (see page 11). Based on the above decomposition, we introduced a new metric γ = φext/φ, which characterizes the accuracy of concentration homeostasis. Clearly, the new metric γ reflects the relation contribution of cell cycle effects in the total concentration variability. All discussions about concentration homeostasis are based on the new metric γ in the revised manuscript. Moreover, all figures have been updated by using this new metric.

      1. Similarly, when inspecting the peak in the power spectrum, the weight of the Lorentzian function(s) creating the peak, should be compared to the stationary component (λN , uN in the authors’ notations).

      Response: We cannot quite understand why the weights uk of the Lorentzian functions should be compared to the stationary component uN . In fact, all the weights uk except uN are actually complex numbers and we are not so sure about the meaning of uk/uN . However in the revised manuscript, we emphasized that the power spectrum G(ξ) is normalized so that G(0) = 1 throughout the paper (see page 13). To better understand concentration oscillations and its relation to homeostasis, we depicted both γ and H as a function of B and hni (see Supplementary Fig. S5). As expected, the off-zero peak becomes lower as B increases and as hni decreases since both of them correspond to an increase in concentration fluctuations which counteracts the regularity of oscillations; noise above a certain threshold can even completely destroy oscillations. Furthermore, we found that γ and H have similar dependence on B and hni. This again shows that the occurrence of concentration oscillations is intimately related to the visibility of cell cycle effects in concentration fluctuations.

      A complementary analysis including these two points and a discussion the relative contribution of cell-cycle effects and bursting dynamics in the total variability/fluctuation of concentrations would be important to include.

      Response: In the revised manuscript, we made some complementary analysis and discussion about the relative contribution of cell cycle effects and stochastic birth-death dynamics in the total variability of concentrations (see pages 11-14).

      Minor comments 3. The dashed line on Fig. 3a is defined as κ = √ 2 1−β . First is this empirical or does it come from a derivation? Second, it seems incomplete since it should depend on w. Intuitively, this line should correspond to the value of κ that would best mimic balanced biosynthesis in the case where β 6= 1. In other words, κ should be so that hρB0 /V (t)iprereplication = hκρB0 /V (t)ipostreplication, which yields κ = 2w(1−β) ∗ (w − 1)/w ∗ [2w(β−1) − 1]/[2(1−w)(β−1) − 1]. This indeed simplifies into κ = √ 2 1−β when w = 0.5.

      Response: Thank you for providing such a beautiful derivation. In the revised manuscript, we added this derivation into the main text (see pages 12-13). Moreover, we also made it clear that this relation can also be obtained from the perspective of power spectrum (see page 14).

      1. η is used in the caption of Fig. 2, which is cited on page 4. But it is defined only 2 sections later, on page 6.

      Response: In the revised manuscript, we gave the definition of η in both Table 1 and the caption of Fig. 3 (Fig. 2 in the old version). Please see page 4, Table 1 and page 9, Fig. 3.

      1. w is used in the main text, but only defined in the caption of Fig. 3.

      Response: In the revised manuscript, we gave the definition of w in both Table 1 (see page 4) and on page 7.

      1. w is defined as “the proportion of cell cycle before replication”. Is this in terms of cell cycle stages (i.e. w = N0/N) or actual time?

      Response: In the revised manuscript, we made it clear that w represents the proportion of cell cycle duration before replication, which should be distinguished from the proportion N0/N of cell cycle stages before replication (see page 7). This is because the transition rate between cell cycle stages is an increasing function of cell size, which means that earlier (later) stages have longer (shorter) durations.

      1. Fig. 3 indicates that power spectra are normalized so that G(0) = 1, but G(0) = 10 on the first two graphs.

      Response: Corrected as suggested (see page 12, Fig. 4). Thank you.

      1. Page 11: “bimodality in the concentration distribution is significantly less apparent”. I would suggest rephrasing “bimodality in the concentration distribution is absent” since there should be no reference to “significance” and bimodality is either present or absent (binary), not less apparent.

      Response: Corrected as suggested. Thank you.

      Referees cross-commenting

      1. Regarding the comment from reviewer 3 that ”a direct validity test should use data sets of at least two types (total, nascent RNA, etc)”. I almost made a related comment in my review, but then I held it off: This issue with using nascent RNA data is that their model does not allow an ON state. They assume that gene products are produced in instantaneous bursts, which is a fair assumption if the lifetime of gene products is large compared to the time the gene stays ON. This is ok if the considered ”gene products” are mRNA or proteins, but not nascent RNAs (for which the lifetime is the time to transcribe the gene). I did not make this comment in the end because I think the model is useful regardless. To comply with reviewer 3’s request, maybe the authors could use distributions of mRNA and protein products, but I’m not sure that such data exists (since they need cell-cycle-resolved data).

      Response: It is not possible to validate our model with nascent mRNA data because the model in its present form cannot predict nascent mRNA fluctuations. This is because unlike mature mRNA, nascent mRNA cannot be assumed to decay via first-order kinetics. A detailed response is provided below to the original comment made by Referee 3. Regarding the comment on the use of cell-cycle-resolved data measuring mRNA and protein expression – while we agree it would make an excellent test of our model, we could not find such a dataset in the literature. We point out that our model, in its present form, is interesting as it is, as a detailed biological model of mature mRNA and protein number / concentration fluctuations in growing cells. Its predictions are yet to be fully confirmed and hence may stimulate the development of further experimental single-cell studies.

      Significance

      1. The advance of this paper is essentially technical. The authors present a model that incorporates and unifies previously studied effects (cell volume homeostasis, concentration homeostasis, bursting transcription). There is no major conceptual novelty, but the combination of these different aspects and the derivations that authors present are very valuable and might be applicable to interpret data in various species.

      Response: Thank you for acknowledging the results of the manuscript.

      Referee #3

      Evidence, reproducibility and clarity

      1. The manuscript analyses a phenomenological model of stochastic gene expression. The model couples bursty transcription with cell growth, division and DNA replication. The cell cycle is divided into a large number of stages whose exponential lifetimes depend on the cell volume. It is argued that concentrations of gene products are distributed according to mixed Gamma distributions, whereas the copy numbers follow mixed negative binomial distributions. The number of modes can be different for concentrations and copy numbers, for instance the copy numbers can be unimodal while concentrations are bimodal. The case when the mean concentration does not depend on the cell cycle stage is called perfect homeostasis. It is argued that perfect homeostasis leads to Gamma distribution of the gene product concentration and that deviations from a Gamma distributions result mainly from deviations of the concentration from perfect homeostasis. It is also proposed that concentration homeostasis is difficult to obtain. These qualitative predictions of the model are tested using two data sets, one for E.coli and another for fission yeast.

      Response: Thank you for acknowledging the results of the manuscript.

      Major comments 1. A huge number of states called “cell cycle stages” have exponential life times. On my opinion, this sequence of stages is just a technicality for keeping the model within a discrete Markovian framework. More natural choices are possible, such as piecewise deterministic Markov processes, age structured diffusions, etc. The biological significance (if there is any) of such states should be explained.

      Response: In the revised manuscript, we explained in detail the biological meaning of the effective cell cycle stages (see page 4). Specifically, recent studies have revealed that in many cell types, the accumulation of some activator to a critical threshold is used to promote mitotic entry and trigger cell division, a strategy known as activator accumulation mechanism. In E. coli, the activator was shown to be FtsZ; in fission yeast, it is believed to be a protein upstream of Cdk1, the central mitotic regulator, such as Cdr2, Cdc25, and Cdc13. Biophysically, the N effective stages can be understood as different levels of the key activator. Moreover, we pointed out that the power law form for the rate of cell cycle progression may come from cooperativity of the key activator that triggers cell division.

      1. The timescales of stochastic gene expression are not correctly taken into account. It is considered that during an exponential stage the bursting approximation describes gene expression in terms of Gamma distributions for concentrations and in terms of negative binomial distributions for copy numbers. This approximation is only valid if the lifetime of a stage is much larger than the time needed to generate a burst. For RNA, this condition cannot be fulfilled for a large number of states N and/or for two states promoters with a relatively long ON state. For the protein and/or in the case of translational bursting, the condition is even more difficult to fulfil. I agree with the Reviewer 2 that once the master equation accepted the results make sense. But my criticism is different and concerns the master equation itself. In this equation the burst is considered instantaneous, whereas it needs finite time in reality. Concerning nascent mRNA, ON/OFF etc. I disagree. The notion of instantaneous burst with well defined burst size and burst frequency on a stage has a meaning if the lifetime of this stage (which is not mRNA or protein lifetime) is short. The model validity should be clearly stated.

      Response: Thank you for pointing out this important issue. When we talk about the validity of the model, we should stick to the full model, instead of the mean-field model. This is because once the full model makes sense, the mean-field model must work well when N ? 15, as we have shown in Fig. 3 and Supplementary Fig. S3. Hence our reply is based on the validity of the full model. We will reply to the above comments from the following three aspects. First, we agree with the referee that in our model, we assume that the gene product is produced in instantaneous bursts with the reaction scheme G ρpk (1−p) −−−−−−→ G + kM, k ≥ 1, M d −→ ∅, (1) where the mean burst size scales as V (t) β . Of course, in reality there is a finite time for the bursts to occur. A more general assumption is that within each cell cycle, the gene expression dynamics is characterized by the following three-stage model: G ρ −→ G ∗ , G∗ r −→ G, G∗ sV (t) β −−−−→ G ∗ + M, M u−→ M + P, M v −→ ∅, P d −→ ∅, (2) where the first two reactions describe the switching of the gene between an inactive state G and an active state G∗ the middle two reactions describe transcription and translation, and the last two reactions describe the degradation of the mRNA M and the protein P. Here the synthesis rate of mRNA depends on cell volume via a power law form with power β ∈ [0, 1]. Dosage compensation can be modeled by a decrease in the gene activation rate (for each gene copy) from ρ to κρ/2 upon replication. Previous studies have revealed that the bursting of mRNA and protein has different biophysical origins: transcriptional bursting is due to a gene that is mostly inactive, but transcribes a large number of mRNA when it is active (r ? ρ and s/r is finite), whereas translational bursting is due to rapid synthesis of protein from a single short-lived mRNA molecule (v ? d and u/v is finite). Under the above timescale separation assumptions, both mRNA and protein are produced in a bursty manner with the reaction scheme described by Eq. (1). The burst frequency for mRNA and protein are both ρ before replication and κρ after replication. The mean burst size for mRNA is (s/r)V (t) β and the mean burst size for protein is (su/rv)V (t) β , both of which have a power law dependence on cell volume (see pages 5-6). In Supplementary Figs. S1 and S2, we compare the mRNA and protein distributions for the bursty model with the reaction scheme given by Eq. (1) and the three-stage model with the reaction scheme given by Eq. (2), where both models under consideration have a cell cycle and cell volume description. It can be seen that the distributions for the two models are very close to each other under the above timescale separation assumptions with the bursty model being more accurate as r/ρ and v/d increase. Moreover, we find that the accuracy of the bursty model is insensitive to the value of the number of stages N. Here the values of N are chosen so that the ratio of the average time spent in each stage (T /N, where T ≈ (log 2)/g is the mean cell cycle duration) and the mean burst duration time (1/ρ) ranges from ∼ 0.5 − 2. This shows that the effectiveness of the bursty model does not require that the lifetime of a cell cycle stage is sufficient long. Due to mathematical complexity, we only focus on the bursty model in the present paper. The consistency between the gene product distributions for the two models justifies our bursty assumption. Second, while we assume bursty expression here, our model naturally covers non-bursty expression since the latter can be regarded as a limit of the former. Hence all the conclusions in the present paper are applicable to both bursty and non-bursty expression. In the revised manuscript, we emphasized this point (see page 4 for a detailed explanation). Last but not least, if the lifetime of the gene product is much shorter compared to the lifetime of each cell cycle stage, then the gene expression dynamics will rapidly relax to a quasi-steady state for each stage. In this case, the gene product fluctuations at each stage can be characterized by a gamma distribution in terms of concentrations and by a negative binomial distribution in terms of copy numbers, and hence the distribution of concentrations (copy numbers) for a population of cells is naturally a mixture of N gamma (negative binomial) distributions. However, the powerfulness of our analytical distribution (see page 10, Eq. (8)) is that it serves an accurate approximation when N ? 1 without making any timescale assumptions. The effectiveness of our analytical distributions is validated in Supplementary Fig. S3 for three different cases: (i) the degradation rate d of the gene product is much smaller than the cell cycle frequency f; (ii) d and f are comparable; (iii) d is much larger than f. In the revised manuscript, we also emphasized these points (see page 10).

      1. DNA replication is a stochastic event and does not occur after a fixed number of exponential stages as it is considered in this model. Concerning replication: in the model this occurs after exactly N0 steps. In reality, replication occurs somewhere between the start of S and G2/M. N0 is in fact a random variable. Probably a new mean field assumption is needed here with some justification, but I have seen nothing in the paper.

      Response: We agree with the referee that replication of the whole genome occurs in the S phase, which occupies a considerable portion of the cell cycle and thus cannot be assumed to occur after a fixed number of exponential stages. However, our model is for a single gene and since the replication time of a particular gene is much shorter than the total duration of the S phase, it is reasonable to consider it to be instantaneous. In addition, recent experiments have shown that the time elapsed from birth to replication for a particular gene occupies an approximately proportion of the cell cycle, which is called the stretched cell cycle model. This is also consistent with our assumption that replication of the gene of interest occurs after exactly N0 stages. While replication occurs after a fixed number of stages, nevertheless the time of replication is stochastic since each stage has a random lifetime. In the revised manuscript, we emphasized these points (see pages 4-5).

      1. The results in the Methods were derived heuristically and their relation to the master equation (12) is not explicit (except for the part concerning moments and their power spectrum). Furthermore, one would like to have some estimates of the biases introduced by the mean field approximation. Concerning biases introduced by the mean field approximation: Figure 2 is a numerical simulation, some analytical estimates could be better. As Figure 2 looks rather convincing, I reclassify this as minor comment.

      Response: We agree with the referee that the derivation of moments is rigorous, but the derivation of the analytical distribution given in Methods is not rigorous and cannot be directly obtained from the master equation. In the revised manuscript, we emphasized that the analytical distribution is not exact but it serves as a very good approximation (see pages 10 and 22). We showed that the analytical distribution agrees well with stochastic simulations when the number of cell cycle stages N ≥ 15 (see page 9, Fig. 3 and Supplementary Fig. S3). The logic behind our approximate distribution is that while the gene product may produce complex distribution of concentrations (copy numbers), when the number of cell cycle stages is large, the distribution must be relatively simple within each stage and thus can be well approximated by a simple gamma (negative binomial) distribution (see page 22). Due to the complexity of our model, it is very difficult to provide any analytical estimates on the bias introduced by the mean-field approximation. Often the bias of an approximation can be estimated when the approximation emerges from a systematic method such as van Kampen’s system-size expansion (see Ref. [21]). However, our mean-field model cannot be seen as the zero order term of some expansion and hence it is not possible to calculate the next-order correction which would be needed to estimate the error. However, we have tested very large swathes of parameter space and found that the mean-field approximation always works well when N ≥ 15 which is the physiologically relevant regime for most types of cells (see discussion on P. 7).

      1. The model is not minimal and depends on a huge number of parameters. It is not clear how these parameters were found and if overfitting was avoided. One may have doubts about the identifiability of the parameter N. What difference is between N = 59 and N = 60 (the value of N for the cyanobacterium)?

      Response: In the revised manuscript, we used synthetic data to show that all the model parameters involved in our model (except d and β which can be determined based on a priori knowledge) can be accurately estimated from cell-cycle resolved lineage data of cell volume and gene expression (see Supplementary Note 7). We provided details of the parameter inference method, compared the input parameters with the estimated ones and verify that they are identifiable (see Supplementary Table 1). We did not use real data to test our inference method because we could not find cell-cycle resolved lineage data for mRNA or proteins. As we noted, this is in principle possible via cell-cycle fluorescent markers. We also note that parameter inference for less detailed but similar models have been made in our previous papers — the parameters related to cell volume dynamics have been inferred in E. coli (see Ref. [51]) and fission yeast (see Ref. [52]) using the method of distribution matching, and the parameters related to gene expression dynamics have be estimated in E. coli (see Ref. [40]) using the method of power spectrum matching. Moreover, for our purpose, i.e. to investigate the effect of cell cycle and cell volume on gene expression, we do believe that our model is minimal. We captured cell growth with only one parameter g, the degree of balanced biosynthesis with one parameter β (β = 0 corresponds to the case where the synthesis rate is independent of cell volume and β = 1 corresponds to the case where the synthesis rate scales linearly with cell volume), the variability in cell cycle duration with only one parameter N, gene replication with only one parameter N0, gene dosage compensation with only one parameter κ (κ = 1 corresponds to perfect dosage compensation and κ = 2 corresponds to no dosage compensation), and the variation of size control strategy across the cell cycle with two parameters α0 and α1 (αi → 0 corresponds to timer, αi = 1 corresponds to adder, and αi → ∞ corresponds to sizer). The biological meaning of the cell cycle stages were clarified in the revised manuscript (see page 4). For our purpose, we believe that our model cannot be simpler.

      1. The authors should make clear which cell biology aspects are important, which are less important, and which were neglected in the context of their problem. Thus, in their model, cell cycle acts on gene expression mainly by duplication of burst sources and thus by increase of burst frequency after replication. Another important source of gene expression variability during the cycle, the mitotic transcription repression, is neglected.

      Response: In the revised manuscript, we clarified which cell biology aspects are important for gene expression dynamics (see page 17). Specifically, in our model, cell cycle and cell volume act on gene expression mainly by (i) the dependence of the burst size on cell volume; (ii) the increase in the burst frequency upon replication; (iii) the change in size control strategy upon replication; (iv) the partitioning of molecules at division. Point (iv) strongly affects copy number fluctuations, while it has little influence on concentration fluctuations. In addition, in the revised manuscript, we also elucidated the limitations of our model including mitotic transcription repression and others (see pages 19-20).

      1. The validity test of the model is indirect. It was tested that the concentration distribution deviates from Gamma and that the deviation correlates positively to the lack of accuracy of the concentration homeostasis. However, many models can have this behaviour. A direct validity test should use data sets of at least two types (total, nascent RNA, etc.) allowing direct estimates of some model parameters (such as burst size and frequency using nascent RNA). Concerning parsimony, I think that the authors should test it. Are all the parameters identifiable? Is there any overfitting? They could use parameter uncertainty, comparison of training /testing errors, etc. Some details about the parameter fitting method should be provided.

      Response: Regarding the parameter fitting and identifiability we have provided a detailed response to a previous comment above. However we emphasize that for the generation of Fig. 7, we did not need to estimate all model parameters from data. Hence in the previous version of the manuscript, no such estimation was done — we simply extracted the homeostasis accuracy γ, the height H of the off-zero peak of the power spectrum, and the Hellinger distance D of the concentration distribution from its gamma approximation directly from data. Finally, we point out that our model can be used to predict the dynamics of mature mRNAs, but it cannot be used to describe the dynamics of nascent mRNAs. This is because nascent mRNAs do not decay via a first-order reaction but their removal, i.e. their detachment from the gene which leads to mature mRNA, is better approximated by a reaction with a fixed decay time. This models the elongation time of nascent transcripts which does not suffer from much noise because the RNAP velocity is to a good approximation constant along the gene. See e.g. the following two papers for details: H. Xu, S. O. Skinner, A. M. Sokac, I. Golding, Stochastic kinetics of nascent RNA. Phys. Rev. Lett. 117, 128101 (2016). S. Braichenko, J. Holehouse, R. Grima. Distinguishing between models of mammalian gene expression: telegraph-like models versus mechanistic models. J. R. Soc. Interface 18, 20210510 (2021). Because of the fixed delay, the delay telegraph model (the telegraph model with a delayed degradation reaction) is non-Markovian and very different from the usual Markovian telegraph model which describes the dynamics of mature mRNA within each cell cycle. See e.g. the Supplementary Information of the following paper: X. Fu, et al. Accurate inference of stochastic gene expression from nascent transcript heterogeneity. bioRxiv (2021). Given the mathematical complexity introduced by a fixed delay, using it to describe the dynamics of nascent mRNA within each cell cycle leads to a non-Markovian model that is even more analytically intractable than the present one for mature mRNA. While an interesting research question, this is clearly far removed from the scope of our current manuscript.

      Minor comments 8. The introduction could be more pedagogical. Right now it is just an accumulation of loosely related and sometimes abruptly introduced statements. For instance, we understand that the authors want to oppose their approach to other extant approaches. However, extant approaches should be better reviewed, some of them are aged structured and perfectly suited for analysing cell cycle data. It would be useful for the reader that an example of observation explained by their model and not explained by other models (age structured or not) is discussed in detail. The model of this work does not explain size control, it just assumes that this holds, and does not discuss cell population aspects. A more nuanced positioning of this approach with respect to the literature would be useful for judging its value.

      Response: In the revised manuscript, we rewrote the introduction part to make it more pedagogical (see pages 1-2). In particular, we compared three popular models describing the cell size dynamics and the associated size homeostasis. The advantages and disadvantages of the three models were discussed.

      1. The meaning of N should be discussed from the very start when the model is introduced.

      Response: In the revised manuscript, we explained in detail the biological meaning of the effective cell cycle stages (see page 4). Specifically, recent studies have revealed that in many cell types, the accumulation of some activator to a critical threshold is used to promote mitotic entry and trigger cell division, a strategy known as activator accumulation mechanism. In E. coli, the activator was shown to be FtsZ; in fission yeast, it was believed to be a protein upstream of Cdk1, the central mitotic regulator, such as Cdr2, Cdc25, and Cdc13. Biophysically, the N effective stages can be understood as different levels of the key activator. Moreover, we pointed out that the power law form for the rate of cell cycle progression may come from cooperativity of the key activator that triggers cell division.

      1. The authors call constitutive expression the situation when the mean copy number does not depend on the volume. This choice should be clarified as in general constitutive as opposed to specific, localised or transitory expression refers to non-regulated gene expression. It seems to me that in this context, expression is only partially constitutive (independent on the volume).

      Response: In the present paper, constitutive expression means that the gene product is produced one at a time and is not produced in a bursty manner. It does not mean that the mean copy number does not depend on the volume. In the revised manuscript, we provided a more detailed discussion about how constitutive expression can be viewed as a limit of bursty expression (see page 4).

      1. In figure 1b and for exponential growth the y axis should be log(volume) instead of volume. The mean field approximation is called both “of novel type” (Discussion) and “which has a long history of successful use in statistical physics” (p4). If something is novel, then one should clearly explain why.

      Response: In fact, the y-axis in Fig. 1(b) should be volume instead of log(volume). This is because the x-axis represents the cell cycle stage instead of the real time. Note that for the adder strategy (α0 = α1 = 1), it follows from Eq. (3) on page 7 that the mean cell volume at stage k is vk = v1 + (k − 1)M0/N0, which linearly depends on k. This explains why the red curves in Fig. 1(b) are straight lines instead of exponential curves. In the revised manuscript, we also explained why the mean-field approximation used is novel (see page 7). Specifically, we pointed out that the mean-field approximation is not made for the whole cell cycle, rather we make the approximation for each stage and thus different stages have different mean cell volumes. This type of piecewise mean-field approximation, as far as we know, is novel and has not been used in the study of concentrating fluctuations before.

      1. The word “cyclo-stationarity” is used with not much definition. If this means just stationary distribution of the gene products why not use just “stationarity” instead. What means “cyclo”? A number of properties were called “rare” but it is not clear on what grounds.

      Response: In the revised manuscript, we removed the term “cyclo-stationarity” and simply assumed that the copy number and concentration distributions of the gene product at each cell cycle stage have reached the steady state (see page 8). In addition, for each property that was called “rare”, we explained the reasons in detail (see pages 14 and 17).

      1. I did not find a proof that the copy number distribution has less modes than the concentration distribution.

      Response: In fact, it is very difficult to prove that the concentration distribution has less modes than the copy number distribution. However, we have tested very large swathes of parameter space and found that the number of modes of the concentration distribution is always less than or equal to that of the copy number distribution. In the revised manuscript, we emphasized this point (see page 16).

      Significance

      1. The strength of this work is that it incorporates in a stochastic gene expression model a number of ideas on size control and dosage compensation that were discussed elsewhere from a cell population point of view. However, the proposed model is based on a number artificial choices that are difficult to justify biologically: a huge number of cell cycle discrete states and inappropriate handling of the timescales characterizing stochastic gene expression. Furthermore, the model is not minimal but depends instead on a huge number of parameters. I found the paper difficult to read and in the results presentation is not suitable for biologists that would need more details on the justification of the modelling choices and on the experimental validation of the model.

      Response: All these points have been addressed in previous replies.

      1. For mathematicians, the calculations are rather standard and may seem trivial.

      Response: Our model is complex due to the coupling between gene expression dynamics, cell volume dynamics, and cell cycle events. It is far more complex than standard models of gene expression (see e.g. Refs. [2,84,85]) because of the large amount of biology encapsulated in it and we presented a first analytical- and simulation-based analysis of concentration fluctuations when concentration homeostasis is broken.

      The computations of many quantities in the present paper are non-trivial. First, we showed that the generalized added volumes before and after replication both have an Erlang distribution. Using this property, we computed the mean cell volume in each cell cycle stage which is needed in the mean-field approximation. Furthermore, the computations of the power spectrum of concentration fluctuations are also highly non-trivial. The analytical expression of the power spectrum allows us to precisely determine the onset of concentration homeostasis. While the computations of moments of concentration fluctuations are standard, we used to the moments to construct an analytical concentration distribution which serves as an accurate approximation when N is large. Our concentration distribution is generally valid when concentration homeostasis is broken and goes far beyond recent models for growing cells which require concentration homeostasis and which do not take into account DNA replication, dosage compensation and size control mechanisms that vary with the cell cycle phase (e.g. Ref. [26] ).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript analyses a phenomenological model of stochastic gene expression. The model couples bursty transcription with cell growth, division and DNA replication. The cell cycle is divided into a large number of stages whose exponential lifetimes depend on the cell volume. It is argued that concentrations of gene products are distributed according to mixed Gamma distributions, whereas the copy numbers follow mixed negative binomial distributions. The number of modes can be different for concentrations and copy numbers, for instance the copy numbers can be unimodal while concentrations are bimodal. The case when the mean concentration does not depend on the cell cycle stage is called perfect homeostasis. It is argued that perfect homeostasis leads to Gamma distribution of the gene product concentration and that deviations from a Gamma distributions result mainly from deviations of the concentration from perfect homeostasis. It is also proposed that concentration homeostasis is difficult to obtain. These qualitative predictions of the model are tested using two datasets, one for E.coli and another for fission yeast.

      Major comments:

      The model encompasses a number of artificial choices:

      • A huge number of states called "cell cycle stages" have exponential life times. On my opinion, this sequence of stages is just a technicality for keeping the model within a discrete Markovian framework. More natural choices are possible, such as piecewise deterministic Markov processes, age structured diffusions, etc. The biological significance (if there is any) of such states should be explained.
      • The timescales of stochastic gene expression are not correctly taken into account. It is considered that during an exponential stage the bursting approximation describes gene expression in terms of Gamma distributions for concentrations and in terms of negative binomial distributions for copy numbers. This approximation is only valid if the lifetime of a stage is much larger than the time needed to generate a burst. For RNA, this condition cannot be fulfilled for a large number of states N and/or for two states promoters with a relatively long ON state. For the protein and/or in the case of translational bursting, the condition is even more difficult to fulfil.
      • DNA replication is a stochastic event and does not occur after a fixed number of exponential stages as it is considered in this model. The results in the Methods were derived heuristically and their relation to the master equation (12) is not explicit (except for the part concerning moments and their power spectrum). Furthermore, one would like to have some estimates of the biases introduced by the mean field approximation. The model is not minimal and depends on a huge number of parameters. It is not clear how these parameters were found and if overfitting was avoided. One may have doubts about the identifiability of the parameter N. What difference is between N=59 and N=60 (the value of N for the cyanobacterium)? The authors should make clear which cell biology aspects are important, which are less important, and which were neglected in the context of their problem. Thus, in their model, cell cycle acts on gene expression mainly by duplication of burst sources and thus by increase of burst frequency after replication. Another important source of gene expression variability during the cycle, the mitotic transcription repression, is neglected.<br /> The validity test of the model is indirect. It was tested that the concentration distribution deviates from Gamma and that the deviation correlates positively to the lack of accuracy of the concentration homeostasis. However, many models can have this behaviour. A direct validity test should use datasets of at least two types (total, nascent RNA, etc.) allowing direct estimates of some model parameters (such as burst size and frequency using nascent RNA).

      Minor comments:

      The introduction could be more pedagogical. Right now it is just an accumulation of loosely related and sometimes abruptly introduced statements. For instance, we understand that the authors want to oppose their approach to other extant approaches. However, extant approaches should be better reviewed, some of them are aged structured and perfectly suited for analysing cell cycle data. It would be useful for the reader that an example of observation explained by their model and not explained by other models (age structured or not) is discussed in detail. The model of this work does not explain size control, it just assumes that this holds, and does not discuss cell population aspects. A more nuanced positioning of this approach with respect to the literature would be useful for judging its value.

      The meaning of N should be discussed from the very start when the model is introduced.

      The authors call constitutive expression the situation when the mean copy number does not depend on the volume. This choice should be clarified as in general constitutive as opposed to specific, localised or transitory expression refers to non-regulated gene expression. It seems to me that in this context, expression is only partially constitutive (independent on the volume).

      In figure 1b and for exponential growth the y axis should be log(volume) instead of volume.

      The mean field approximation is called both "of novel type" (Discussion) and "which has a long history of successful use in statistical physics" (p4). If something is novel, then one should clearly explain why.<br /> The word "cyclo-stationarity" is used with not much definition. If this means just stationary distribution of the gene products why not use just "stationarity" instead. What means "cyclo"?

      A number of properties were called "rare" but it is not clear on what grounds.

      I did not find a proof that the copy number distribution has less modes than the concentration distribution.

      Referees cross-commenting

      Part 1

      I agree with the Reviewer 2 that once the master equation accepted the results make sense. But my criticism is different and concerns the master equation itself. In this equation the burst is considered instantaneous, whereas it needs finite time in reality.

      Part 2 (response to Part 2 of Rev2)

      • concerning replication: in the model this occurs after exactly N_o steps. In reality, replication occurs somewhere between the start of S and G2/M. N_o is in fact a random variable. Probably a new mean field assumption is needed here with some justification, but I have seen nothing in the paper
      • concerning biases introduced by the mean field approximation: Figure 2 is a numerical simulation, some analytical estimates could be better. As Figure 2 looks rather convincing, I reclassify this as minor comment.
      • concerning nascent mRNA, ON/OFF etc. I disagree. The notion of instantaneous burst with well defined burst size and burst frequency on a stage has a meaning if the lifetime of this stage (which is not mRNA or protein lifetime) is short. The model validity should be clearly stated.
      • concerning parsimony, I think that the authors should test it. Are all the parameters identifiable? Is there any overfitting? They could use parameter uncertainty, comparison of training /testing errors, etc. Some details about the parameter fitting method should be provided.

      Significance

      The strength of this work is that it incorporates in a stochastic gene expression model a number of ideas on size control and dosage compensation that were discussed elsewhere from a cell population point of view. However, the proposed model is based on a number artificial choices that are difficult to justify biologically: a huge number of cell cycle discrete states and inappropriate handling of the timescales characterizing stochastic gene expression. Furthermore, the model is not minimal but depends instead on a huge number of parameters.

      I found the paper difficult to read and in the results presentation is not suitable for biologists that would need more details on the justification of the modelling choices and on the experimental validation of the model. For mathematicians, the calculations are rather standard and may seem trivial. I am a systems biologist with a background in mathematics and theoretical physics.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Jia et al. introduce a modeling framework to represent stochastic gene expression, with an explicit representation of cell volume growth, cell cycle progression (and its dependency on cell volume) and gene dosage compensation. The model is very elegant and general in that it can represent a variety of situations, simply as a matter of paramterization. Under a simplifying assumption, the authors derive a number of metrics (include stationary distribution of gene product and power spectrum of gene product fluctuation dynamics), for both absolute number and concentration of gene product molecules. They use their model and derivations to examine under which conditions cell can achieve homeostasis in the concentration of the expressed gene product, despite changes in cell volume and gene copy number following replication. They also present and discuss the conditions giving rise to specific features (i.e. bimodality in stationary distribution, peak in power spectrum) and examine these features in experimental data to conclude to infer the underlying homeostasis strategies.

      Major comments:

      The model is rather general and powerful. The simplifying assumption seems reasonable (and the authors investigate to some extent its limitations, i.e. Fig. 2). The conclusions are overall convincing.

      1. My main concern is that the metrics that the authors use to assess concentration homeostasis (i.e. the γ parameter and the presence/absence of peak in power spectrum) do not seem quite appropriate to describe how much variability/fluctuations in concentration are driven by cell cycle effects. Indeed, the γ parameter measures how much the average concentration in each cell cycle stage varies throughout the cell cycle. However, this variability should be compared to the total variability due to both cell cycle effects and stochastic bursting dynamics. A given level of cell-cycle dependency (say γ=0.2) could be very visible if gene expression is weakly noisy (e.g. B low and <n> high) and completely invisible is gene expression is highly bursty (large B and small <n>). In the latter situation, cell-cycle effects would be meaningless for the cell to minimize. In essence, re-using the authors notations, I think γ / ϕ^1/2, would be a more relevant metric to observe.
      2. Similarly, when inspecting the peak in the power spectrum, the weight of the Lorenztian function(s) creating the peak, should be compared to the stationary component (λ_N, u_N in thhe authors' notations).

      A complementary analysis including these two points and a discussion the relative contribution of cell-cycle effects and bursting dynamics in the total variability/fluctuation of concentrations would be important to include.

      Minor comments:

      1. The dashed line on Fig. 3a is defined as κ = sqrt(2)^(1-β). First is this empirical or does it come from a derivation? Second, it seems incomplete since it should depend on ω. Intuitively, this line should correspond to the value of κ that would best mimic balanced biosynthesis in the case where β≠1. In other words, κ should be so that <ρB' / V(t)>_prereplication = <κρB' / V(t)>_postreplication which yields κ = 2^(ω(1-β)) * (ω-1)/ω * [2^(ω(β-1))-1]/[2^((1-ω)(β-1))-1] This indeed simplifies into κ = sqrt(2)^(1-β) when ω=0.5.
      2. η is used in the caption of Fig. 2, which is cited on page 4. But it is defined only 2 sections later, on page 6.
      3. ω is used in the main text, but only defined in the caption of Fig. 3.
      4. ω is defined as "the proportion of cell cycle before replication". Is this in terms of cell cycle stages (i.e. ω=N_0/N) or actual time?
      5. Fig. 3 indicates that power spectra are normalized so that G(0)=1, but G(0)=10 on the first two graphs.
      6. Page 11: "bimodality in the concentration distribution is significantly less apparent". I would suggest rephrasing "bimodality in the concentration distribution is absent" since there should be no reference to "significance" and bimodality is either present or absent (binary), not less apparent.

      Referees cross-commenting

      Part 1.

      I agree with reviewer 1 that a table of symbols would be helpful. On reviewer 3's second Major Comment, I don't think that the "the lifetime of a stage [has to be] much larger than the time needed to generate a burst". From how the authors write and solve the master equation, I don't think that such a separation of timescale is necessary. The authors should indeed clarify this and if reviewer 3 is correct, then that's indeed a major limitation. On reviewer 3's second Major Comment, I don't think that the "the lifetime of a stage [has to be] much larger than the time needed to generate a burst". From how the authors write and solve the master equation, I don't think that such a separation of timescale is necessary. The authors should indeed clarify this and if reviewer 3 is correct, then that's indeed a major limitation. On reviewer 3's comment "DNA replication [...] does not occur after a fixed number of exponential stages", I don't think I agree with this statement. Cell cycle progression relies on an ensemble of biochemical reactions. Representing this as a set of exponential waiting-time distributions with different means is probably amongst the most general and agnostic ways of representing this. Whether these exponential waiting-times only depend on cell volume is another question. This actually links back to reviewer 3's first Major comment and reviewer 1's comment that the concept of "stage" should be better discussed.

      Regarding the need for "estimates of the biases introduced by the mean field approximation" (reviewer 3), I guess that's the goal of figure 2. Maybe reviewer 3 should make more explicit what she/he would like to see.

      Regarding the comment from reviewer 3 that "a direct validity test should use datasets of at least two types (total, nascent RNA, etc)". I almost made a related comment in my review, but then I held it off: This issue with using nascent RNA data is that their model does not allow an ON state. They assume that gene products are produced in instantaneous bursts, which is a fair assumption if the lifetime of gene products is large compared to the time the gene stays ON. This is ok if the considered "gene products" are mRNA or proteins, but not nascent RNAs (for which the lifetime is the time to transcribe the gene). I did not make this comment in the end because I think the model is useful regardless. To comply with reviewer 3's request, maybe the authors could use distributions of mRNA and protein products, but I'm not sure that such data exists (since they need cell-cycle-resolved data).

      I disagree with the statements that "the proposed model is based on a number artificial choices that are difficult to justify biologically" and that "the model is not minimal but depends instead on a huge number of parameters." In my opinion, the model is elegantly simple to capture the mechanisms under study (i.e. the effect of cell cycle and cell volume on stochastic gene expression). It is expressed so that the model captures a broad range of situations (i.e. it reduces to simpler models as a matter of choosing parameter values, e.g. \Beta=0 => transcription independent of cell cycle; \alpha => \infty cell cycle depends only on size ...). I do not think that a series of exponential distributions for cell cycle progression is inappropriate, it is the most agnostic and general way of representing an ensemble of biochemical reactions that would be meaningless to describe explicitly. Instead, only their dependency on cell volume is taken into account (and in a very general way, i.e. parameters 'a' and \alpha). It is fair to ask the authors to clarify the concept of "stage", but I see this model as being as simple as possible, but not simpler, for the authors' purpose.

      Finally, I agree that the paper is probably "not suitable for biologists" but disagree that "for mathematicians, the calculations are rather standard and may seem trivial."

      Part 2. Resp. to reviewer 3 on the master equation (Part 1 of Rev3):

      Ok, I understand better your comment. What you mean by "the time needed to generate a burst" is the time that the gene produces RNAs, not the lifetime of the gene product (which is 1/d). That's true. It is essentially the same ifdea as what I write in my previous comment about nascent RNA data not being well captured by the model. Again, I think this is fine for "gene products" that are somewhat stable (not the case for nascent RNAs, but ok for mRNAs and proteins). This is fine by me as long as the authors explicit better this limitation of their model.

      Part 3. Response to Reviewer 3 (Part 2 of Rev 3)

      • concerning replication: Note that the mean field approximation is on cell volume, not on stage progression ("To simplify this model, [...] we ignore volume fluctuations at each stage but retain fluctuations in the time elapsed between two stages", p3). So the time at which replication occurs is already a random variable in the model. It is the sum of all the exponentially distributed random variables corresponding to stages 1 to N_0. The resulting distribution of replication time from the start of cell cycle is a random variable, which can be anything from very deterministic (N_0 very high) to very variable (N_0 very low).
      • concerning nascent mRNA, ON/OFF etc. : I'm not sure I get your objection, but the best is probably to let the authors respond to your original comment.
      • concerning parsimony: Ok, you're right. The authors should test it.

      Significance

      The advance of this paper is essentially technical. The authors present a model that incorporates and unifies previously studied effects (cell volume homeostasis, concentration homeostasis, bursting transcription). There is no major conceptual novelty, but the combination of these different aspects and the derivations that authors present are very valuable and might be applicable to interpret data in various species.

      The paper is suitable for a physics/mathematics/computational audience. It is rather technical and would not be understood by readers with only a biology background.

      Field of expertise of the reviewer: Gene regulation, single-molecule imaging, stochastic modeling.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a report on "Concentration fluctuations due to size-dependent gene expression and cell-size control mechanisms," by jia, Singh, and Grima. This manuscript constructs a gene expression model with various factors. Specifically, the effect of cell size on gene expression is considered, which is often ignored by previous studies.

      One interesting finding is that the absolute number of the gene products and the concentration can have different distributions. Some predictions of the models are validated by experimental data on E. coli and yeast.

      This manuscript uses the mean-field approximation for cell volume, which has good accuracy when the number of stages is large. The usage of the power spectrum has a satisfactory effect on studying the concentration oscillation. Overall the paper was very difficult to follow and digest easily because of all the different factors and mechanisms invoked. It is mainly an issue of providing sufficient details for each of the factors and organizing them in a systematic and logical way. Although there is a supplementary appendix, it was hard to keep track of all the elements in the main mauscript. Perhaps something like Fig 1 of the Appendix can be presented in the main body to outline all the ingredients and how they affect each other.

      It might be good to provide a more detailed description of the goal (studying gene product number and concentration under different parameters) after introducing the full and the reduced models. A table of symbols would also be helpful.

      Some technical details in the Methods section are in fact helpful in understanding the conclusions. They can be moved to the Results section.

      One concern is that the central concept of this manuscript, "stage", is not thoroughly discussed. This concept should have some significant biological meaning, not just be coined for mathematical convenience.

      Fig. 1(b) is a little strange. For the left panel, the x-axis (stage) is discrete, then the volume (y-axis) should be a step function, not a straight red line.

      Significance

      The main advance is a more complete model of gene expression under more realistic organism growth conditions.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the reviewers

      We wish to thank all three reviewers for their thorough examination of our manuscript and their constructive criticism that allowed us to increase its quality. You will see that, following their recommendations, we have included a good amount of new data in the manuscript. Specifically, we added a new figure with experiments proposed by the reviewers (now Fig. 4), as well as Figs. S3 and S4. In addition, we expanded one paragraph of our Discussion to comment on a very recent article published by Huang et al in Nature Structural and Molecular Biology with conclusions pertaining the interplay of Rpd3 and Gcn5 in PHO5 gene regulation. Below we include the point-by-point response (in blue) with the changes we have implemented to address their specific points. All the additions and changes in the manuscript are made in red.

      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Novačić et al., investigate into a mechanisms of the non-coding transcriptiondriven regulation of the phosphate-responsive PHO5 gene. The authors employ CRSPRi system to discern direct contribution of the antisense non-coding transcription (CUT025) expressed during phosphate -rich conditions to transcriptional repression of the yeast PHO5 gene and therefore challenging previous study from the Svejstrup's lab that proposed a positive role for non-coding transcription in control of PHO5 gene. They propose a model where non-coding transcription represses PHO5 by mediating recruitment of Rpd3 histone deacetylase leading to altered chromatin structure at PHO5 promoter due to reduced recruitment of the RSC chromatin remodelling complex. Overall, the data presented in the manuscript are of a good quality, experiments are well controlled and nicely presented. Manuscript is well written. My specific comments are below: 1. I am somewhat confused by the data presented in Figure 5. While there is similar impact on the chromatin structure seen in rrp6D and air1Dair2D strains (Fig 5C) that corresponds to more "closed" configuration of chromatin , it is not consistent with H3 ChIP data that show higher nucleosome occupancy across PHO5 UAS in rrp6D but loss of nucleosomes in the double mutant (or there is a mistake perhaps while plotting the data?)

      We now realize that the data was plotted confusingly, and we apologize for it. While doing the H3 ChIP experiment we only prepared the +Pi samples for the air1Δ air2Δ double mutant. In the figure we only included this one data point for the double mutant, which could lead to the false conclusion that at other timepoints there are no histones at its PHO5 promoter region. We decide to remove this data point from the figure to avoid the confusion and only keep the air1Δ air2Δ data for the ClaI assay. We believe that this should not be an issue as this data point is not critical for the conclusions we are making.

      1. To further explore direct link between nc transcription, Rpd3 and rrp6 mediated effect, I suggest to test the effect on PHO5 induction upon rpd3 and rrp6 deletions in CRISPRi CUT025 background.

      We performed this experiment and now include it as Fig. S3 in the manuscript. As expected, expressing the CRISPRi system only made difference when Rpd3 was present.

      1. It seems that most noticeable effect of blocking nc transcription by an elegant approach that utilizes CRISPRi system on the phosphatase activity is seen between 0-1.5h of induction. I suggest taking additional time points at 30-45 min.

      We took additional timepoints and the results were incorporated as the new Fig. 5E. The CRISPRi effect resulting in higher acid phosphatase activity was still most noticeable after 1,5 h of induction. This was mostly in line with the fact that the difference in PHO5 mRNA levels was most pronounced after 30 min of induction (Fig. 5D), as the time needed to achieve measurable protein level after induction can lag significantly for secretory proteins, such as acid phosphatase. Secretory proteins are cotranslationally translocated into the ER, after which they traverse the secretory pathway and undergo modifications before being finally exported to the periplasm where their activity can be measured. Consequently, the increase in acid phosphatase activity upon induction is only measurable after at least an hour.

      1. How do authors explain that the effect of the exosome mutations are reversed and phosphatase activity is increased at later time point (20 h, Fig 2A)? I suggest using more distinct colour for dis3 mutants.

      That effect is indeed somewhat surprising. We hypothesize that the effects we are seeing after 20 h reflect the specific conditions of prolonged induction, i.e. keeping the chromatin open or semi-open for a very long period of time, which do not necessarily reflect the early gene induction period that we are using as a read-out of the effect of different mutations on acid phosphatase expression kinetics. We previously noticed a similar effect with chromatin remodeler-related mutants (e.g. rsc2Δ, unpublished result from S. Barbarić group), which speak in favour of the prolonged induction conditions resulting in a chromatin state with its own specialized cofactor requirements. We therefore consider the chromatin state after prolonged induction a topic for another study, however, we now comment on this effect in the manuscript. The dis3 mutants are now shown in more distinct colours.

      1. Figure 5A -label "H3 ChIP"

      The label was added.

      1. Error bars are quite high in Fig 1C, perhaps it is worth repeating the experiment

      Since significant differences in PHO5 mRNA levels can be seen between wt and rrp6Δ mutant cells at 0,75 and 3 h of induction, we feel that the higher error bars at 5 h of induction are not worth repeating the experiment – especially since the values are bound to converge to a similar one after a longer induction period, as demonstrated in Fig. 1D.

      Significance

      significant of interest for general audience

      Referee #2

      Evidence, reproducibility and clarity

      The authors study the PHO5 locus, which is known to a have antisense transcript and that has previously been shown the be important for activation of Pho5 sense transcription. The authors challenge the idea by an extensive analyses. They show the Pho5-AS represses sense transcription, and thus fits in the category as AS repressors instead of activators. They show a correlative data that when antisense goes down and sense goes up. They show that increase antisense levels leads to decrease sense levels. They use mutants of decay pathways to increase the levels antisense transcription. Moreover, they used crispri to repress the antisense transcript. Lastly, they show that histone deacetylation represses Pho5 sense. The data in the manuscript is convincing, and well presented. One thing that needs further clarification is the strategy to increase anti-sense levels by deletion mutants of decay or depletion of decay pathways. While it is clear that this stabilizes the pho5-AS and decrease pho5-sense, it is not clear that this causes an increase in transcription. Perhaps, it is possible that antisense transcript itself has a repressive effect. If one really wanted to increase antisense transcription than the antisense promoter should be increased in strength. On the other the CriprI experiment is very convincing. I am surprised how well the crisprI system works, it is thought to be not so efficient at blocking elongating polymerase and good at blocking initiation.

      We thank the reviewer for this feedback. We performed additional experiments which you will find described below. Based on the results, we would like to keep the point about AS transcription causing the effect.

      Major comments: - Are the key conclusions convincing? Perhaps, the conclusion that increased transcription leads to repression is not completely convincing. The authors use mutants in rrp6, exosome, and nrd1 to increase Pho5-AS transcription elongation. However, I am always under impression that these mutants stabilize the transcript. And the authors acknowledge this in their manuscript. So how do you discriminate between increased stability versus increased elongation? I support the conclusion that inhibition of Pho5-AS leads to increase Pho5-S. However, increase in elongation is not directly demonstrated. While still possible, it is equally possible that a more stable pho5-AS transcript has a repressive an effect on Pho5-AS. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? See above. Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. If the authors want to keep the message that increased transcription of Pho5-AS leads to more repression that may need to consider additional experiments. For example, increasing transcription from the antisense promoter.

      We performed the proposed experiment and now include it in the manuscript as Fig. 4AB. Briefly, we inserted the strong constitutive TEF1 promoter in the antisense configuration downstream of the PHO5 gene ORF, so that it drives AS transcription. The results of this experiment very clearly show the inverse relationship between PHO5 mRNA and AS transcripts levels at +Pi conditions. Importantly, this strong constitutive AS transcription had an even more pronounced effect on PHO5 gene expression than deletion mutant backgrounds (in which, like in wt cells, the AS promoter is presumably weak), and did not allow for full level of PHO5 gene expression to be reached. To verify that the AS RNA itself does not have a regulatory role, but rather the act of its transcription represses the corresponding gene, we performed an additional experiment with appropriate diploid strains. The design of this experiment is standardly used to test whether an AS transcript can work in trans (for example see Nevers et al. 2018 NAR Fig. 6). This experiment is now included as Fig. 4C. Together, the results of these experiments paint a clear picture of AS transcription, and not AS level/stability itself, driving the repression of the PHO5 gene.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. To me this is an optional experiment, but it would benefit the manuscript
      • Are the data and the methods presented in such a way that they can be reproduced? yes - Are the experiments adequately replicated and statistical analysis adequate? yes

      Minor comments: - Specific experimental issues that are easily addressable. - Are prior studies referenced appropriately? yes - Are the text and figures clear and accurate? Yes - Do you have suggestions that would help the authors improve the presentation of their data and conclusions? no

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. The manuscript challenges previous work where it was claimed that Pho5-AS is important for activation of Pho5-S. As such, it is important work. In the field of noncoding the transcription the Pho5-AS fits in a class of AS transcript that has been well described.
      • Place the work in the context of the existing literature (provide references, where appropriate). See above.
      • State what audience might be interested in and influenced by the reported findings. In researchers in field of transcription, chromatin, and more specifically in yeast gene regulation.
      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. Chromatin, transcription, yeast.

      Referee #3

      Evidence, reproducibility and clarity

      Novačić et al present a manuscript entitled "Antisense non-coding transcription represses the PHO5 model gene via remodeling of promoter chromatin structure" which is a locus-specific follow up to previous studies from Soudet and Stutz groups on genome-wide analysis of transcription interference mediated by antisense transcripts in S cerevisiae. Critically, the authors here employ a CRISPRi approach to reduce antisense transcription from reaching the PHO5 promoter and in doing so show that kinetics of PHO5 induction are increased as would be predicted from their previous model. Additionally, they show predicted epistasis between rpd3 and rrp6 on PHO5 expression and gcn5 and rrp6 that are consistent with their model. Comments are relatively minor but should be addressed. Introduction p3. "This mechanism was subsequently explored genome-wide in yeast, which revealed a group of genes that in the absence of Rrp6 accumulate AS RNAs and are silenced in an HDACdependent manner (14)." This sentence appears awkward- perhaps move "in the absence of Rrp6" to after "AS RNAs"?

      Corrected as proposed.

      p3 "Under a high phosphate concentration Pho4 undergoes phosphorylation by the cyclindependent-kinase (Pho80-Pho85)" Since "the" is used, don't use parentheses around Pho80-Pho85

      Corrected as proposed.

      Methods Give amount/concentration of glycine used in quenching formaldehyde for ChIP. Give the exact wash conditions and buffers not "extensively"

      All of those details are now provided in the manuscript. Figure 4C.

      Describe schematic in legend

      It is now described.

      Figure 4D. Indicate time of induction in legend.

      This was lacking for Figs. 4B-C (now 5B-C) so we added it there.

      Figure 5A. air∆ data are missing from later time points?

      Please see our first response to Reviewer 1. We removed the air1Δ air2Δ double mutant data, as we only had one data point for it in this assay.

      Figure 6. Legend needs to indicate what Pi conditions are. Since PHO5 expressed, appears to be low Pi. An issue that needs to be discussed is that rpd3∆ appears to decrease expression of PHO5 AS. Is this simply because of increased PHO5 expression? Does rpd3∆ have any effects on AS in high Pi? This is important to interpret if effects of rrp6 and rpd3 are epistatic or additive.

      We thank the Reviewer for bringing this to our attention. To explore the effect of rpd3Δ on PHO5 AS level, we quantified the PHO5 AS transcript by RT-qPCR with cells grown in (chemically defined) high Pi medium, which we now include in Fig. 7A. We find that rpd3Δ mutation has practically no effect on PHO5 AS transcript level both in the wt and the rrp6Δ mutant background. This result speaks in favor of rrp6Δ and rpd3Δ being epistatic rather than additive.

      Figure 7. Sth1-CHEC data are hard to interpret. Some sort of quantification might be required as effects are not clear from the browser track nor is it clear from browser track that the results are reproducible. Examination of Sth1-AA effects in gcn5∆ background might be more compelling that the effect on RSC is via acetylation. Otherwise it is a bit hard to say as RSC could be functioning in parallel to the acetylation-dependent pathways implicated.

      We agree that the presumption that histone acetylation recruits RSC to the PHO5 gene promoter had to be tested. We therefore include the experiment involving Sth1-AA depletion in the gcn5Δ background as Fig. 8A. This experiment was complicated by the fact that RSC is highly abundant (and at the same time essential for cell viability), but we resolved this by starting to deplete RSC two hours before gene induction. These results position RSC and Gcn5 in the same pathway. In contrast, more complete Sth1 depletion severely impaired viability of the rrp6Δ mutant, making it hard to interpret the effect, so we now include this result as Fig. S4.

      To show the effect of AS transcription on RSC recruitment to the PHO5 promoter more quantitatively, we re-analyzed the Sth1-CHEC data (for two independent biological replicates) and now include the log2 values for the changes in Sth1 binding in the text of the manuscript.

      Significance

      The work is focused and narrower in impact but important because direct tests of locus-specific effects are performed, validating models from previous genomic analyses. **Referees cross-commenting**

      I think the other reviews are very reasonable. I would just suggest to the authors that they think carefully about the reviews and decide what they think is most valuable to improving the work/presentation

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Novačić et al present a manuscript entitled "Antisense non-coding transcription represses the PHO5 model gene via remodeling of promoter chromatin structure" which is a locus-specific follow up to previous studies from Soudet and Stutz groups on genome-wide analysis of transcription interference mediated by antisense transcripts in S cerevisiae. Critically, the authors here employ a CRISPRi approach to reduce antisense transcription from reaching the PHO5 promoter and in doing so show that kinetics of PHO5 induction are increased as would be predicted from their previous model. Additionally, they show predicted epistasis between rpd3 and rrp6 on PHO5 expression and gcn5 and rrp6 that are consistent with their model. Comments are relatively minor but should be addressed.

      Introduction

      p3. "This mechanism was subsequently explored genome-wide in yeast, which revealed a group of genes that in the absence of Rrp6 accumulate AS RNAs and are silenced in an HDAC-dependent manner (14)."

      This sentence appears awkward- perhaps move "in the absence of Rrp6" to after "AS RNAs"?

      p3 "Under a high phosphate concentration Pho4 undergoes phosphorylation by the cyclin-dependent-kinase (Pho80-Pho85)"

      Since "the" is used, don't use parentheses around Pho80-Pho85

      Methods

      Give amount/concentration of glycine used in quenching formaldehyde for ChIP. Give the exact wash conditions and buffers not "extensively"

      Figure 4C. Describe schematic in legend

      Figure 4D. Indicate time of induction in legend.

      Figure 5A. air∆ data are missing from later time points?

      Figure 6. Legend needs to indicate what Pi conditions are. Since PHO5 expressed, appears to be low Pi. An issue that needs to be discussed is that rpd3∆ appears to decrease expression of PHO5 AS. Is this simply because of increased PHO5 expression? Does rpd3∆ have any effects on AS in high Pi? This is important to interpret if effects of rrp6 and rpd3 are epistatic or additive.

      Figure 7. Sth1-CHEC data are hard to interpret. Some sort of quantification might be required as effects are not clear from the browser track nor is it clear from browser track that the results are reproducible. Examination of Sth1-AA effects in gcn5∆ background might be more compelling that the effect on RSC is via acetylation. Otherwise it is a bit hard to say as RSC could be functioning in parallel to the acetylation-dependent pathways implicated.

      Significance

      The work is focused and narrower in impact but important because direct tests of locus-specific effects are performed, validating models from previous genomic analyses.

      Referees cross-commenting

      I think the other reviews are very reasonable. I would just suggest to the authors that they think carefully about the reviews and decide what they think is most valuable to improving the work/presentation

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      The authors study the PHO5 locus, which is known to a have antisense transcript and that has previously been shown the be important for activation of Pho5 sense transcription. The authors challenge the idea by an extensive analyses. They show the Pho5-AS represses sense transcription, and thus fits in the category as AS repressors instead of activators. They show a correlative data that when antisense goes down and sense goes up. They show that increase antisense levels leads to decrease sense levels. They use mutants of decay pathways to increase the levels antisense transcription. Moreover, they used crispri to repress the antisense transcript. Lastly, they show that histone deacetylation represses Pho5 sense.

      The data in the manuscript is convincing, and well presented. One thing that needs further clarification is the strategy to increase anti-sense levels by deletion mutants of decay or depletion of decay pathways. While it is clear that this stabilizes the pho5-AS and decrease pho5-sense, it is not clear that this causes an increase in transcription. Perhaps, it is possible that antisense transcript itself has a repressive effect. If one really wanted to increase antisense transcription than the antisense promoter should be increased in strength. On the other the CriprI experiment is very convincing. I am surprised how well the crisprI system works, it is thought to be not so efficient at blocking elongating polymerase and good at blocking initiation.

      Major comments:

      • Are the key conclusions convincing?

      Perhaps, the conclusion that increased transcription leads to repression is not completely convincing. The authors use mutants in rrp6, exosome, and nrd1 to increase Pho5-AS transcription elongation. However, I am always under impression that these mutants stabilize the transcript. And the authors acknowledge this in their manuscript. So how do you discriminate between increased stability versus increased elongation? I support the conclusion that inhibition of Pho5-AS leads to increase Pho5-S. However, increase in elongation is not directly demonstrated. While still possible, it is equally possible that a more stable pho5-AS transcript has a repressive an effect on Pho5-AS.<br /> - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      See above. - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      If the authors want to keep the message that increased transcription of Pho5-AS leads to more repression that may need to consider additional experiments. For example, increasing transcription from the antisense promoter. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      To me this is an optional experiment, but it would benefit the manuscript - Are the data and the methods presented in such a way that they can be reproduced?

      yes - Are the experiments adequately replicated and statistical analysis adequate?

      yes

      Minor comments:

      • Specific experimental issues that are easily addressable.
      • Are prior studies referenced appropriately?

      yes - Are the text and figures clear and accurate?

      Yes - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      no

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The manuscript challenges previous work where it was claimed that Pho5-AS is important for activation of Pho5-S. As such, it is important work. In the field of noncoding the transcription the Pho5-AS fits in a class of AS transcript that has been well described. - Place the work in the context of the existing literature (provide references, where appropriate).

      See above. - State what audience might be interested in and influenced by the reported findings.

      In researchers in field of transcription, chromatin, and more specifically in yeast gene regulation. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Chromatin, transcription, yeast.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Novačić et al., investigate into a mechanisms of the non-coding transcription-driven regulation of the phosphate-responsive PHO5 gene. The authors employ CRSPRi system to discern direct contribution of the antisense non-coding transcription (CUT025) expressed during phosphate -rich conditions to transcriptional repression of the yeast PHO5 gene and therefore challenging previous study from the Svejstrup's lab that proposed a positive role for non-coding transcription in control of PHO5 gene. They propose a model where non-coding transcription represses PHO5 by mediating recruitment of Rpd3 histone deacetylase leading to altered chromatin structure at PHO5 promoter due to reduced recruitment of the RSC chromatin remodelling complex.

      Overall, the data presented in the manuscript are of a good quality, experiments are well controlled and nicely presented. Manuscript is well written. My specific comments are below:

      1. I am somewhat confused by the data presented in Figure 5. While there is similar impact on the chromatin structure seen in rrp6D and air1Dair2D strains (Fig 5C) that corresponds to more "closed" configuration of chromatin , it is not consistent with H3 ChIP data that show higher nucleosome occupancy across PHO5 UAS in rrp6D but loss of nucleosomes in the double mutant (or there is a mistake perhaps while plotting the data?)
      2. To further explore direct link between nc transcription, Rpd3 and rrp6 mediated effect, I suggest to test the effect on PHO5 induction upon rpd3 and rrp6 deletions in CRISPRi CUT025 background.
      3. It seems that most noticeable effect of blocking nc transcription by an elegant approach that utilizes CRISPRi system on the phosphatase activity is seen between 0-1.5h of induction. I suggest taking additional time points at 30-45 min.
      4. How do authors explain that the effect of the exosome mutations are reversed and phosphatase activity is increased at later time point (20 h, Fig 2A)? I suggest using more distinct colour for dis3 mutants.
      5. Figure 5A -label "H3 ChIP"
      6. Error bars are quite high in Fig 1C, perhaps it is worth repeating the experiment

      Significance

      significant

      of interest for general audience

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Revision Plan

      1. General Statements

      We really appreciate the positive comments and suggestions of the reviewers on our submitted manuscript. We think we will be able to solve the issues inquired by reviewers by adding new data and revising the phrases as detailed below.

      2. Description of the planned revisions

      Reviewer #1:

      Major comments

      Localization analysis of a transiently expressed MAP70 transgene with inactivating phosphosite mutations would be important to see whether the identified conserved phosphosites are relevant for MAP70 interaction with MTs. This experiment could be performed rapidly using transient expression in BY-2 cells.

      We agree on the importance of this analysis. Therefore we are currently preparing fluorescent markers of Nt-MAP70-2-like and its phospho-blocked (Ala) version to coexpress with MT and nuclear markers in BY-2 cells. We estimate that we need three more months to complete this experimsnt.

      The authors propose that PP2 blocks phragmoplast formation by preventing phosphorylation of class II Kinesin-12 proteins. In support, authors show that PP2 treatment correlates with a decrease in KIN12A phosphopeptide count (not fully abolished) and its failure to localize to emerging phragmoplasts in BY-2 cells and Physcomitrium. As class II Kinesis-12 proteins have been previously implicated in phragmoplast assembly this is a fairly reasonable hypothesis, but would benefit from the analysis of transgenic KIN12A variants carrying inactivating (A) or potentially activating (D/E) phosphosite mutations. Is loss of phosphorylation sufficient to prevent phragmoplast localization? Can an activated variant rescue PP2-induced KIN12A localization and cell division defects? As above, using transient expression in BY-2 cells would be a fast approach to tackle these questions.

      We are currently preparing fluorescent markers of phospho-blocked (Ala) and phospho-mimic (Asp) versions of KIN12A (PAKRP1) to coexpress with MT and nuclear markers in BY-2 cells. We will check whether they localize to phragmoplast and also test PP2 effects. We would need three more months to complete these analyses.

      Reviewer #2:

      Major comments

      • The manuscript would strongly benefit from being revised by a native english speaker. There are many unusual or awkward formulation, in particular in the abstract.

      We apologize for unnatural sentences. After adding new data and correcting the manuscript, we will ask a native english speaker to revise it.

      Reviewer #3:

      Major comments

      The major concern is lack of evidence to connect MAP70 and MT disruption upon treatment with PD-180970, in contrast to PP2, which was shown to affect localization of Kinesin-12. I wonder if authors could use taxol to stabilize MTs, then observe the localization of MAP70 with application of PD-180970?

      As we responded to reviewer 1, we are preparing the fluorescent marker of Nt-MAP70-2-like to coexpress with MT and nuclear markers in BY-2 cells. By using this multi-color marker, we will test whether PD-180970 affects the localization of MAP70 on MTs, also using taxol. However, in our experiene, taxol is not a very effective inhibitor and may not work in our transient expression system in BY-2 cells. In that case, we will analyze whether phospho-mimic (Asp) version can prevent MT disruption in the presence of PD-180970 to assess the relation of PD-180970, MAP70 and MT disruption.

      I have another concern on the action of PD-180970. PD-180970 appears to affect ubiquitously indispensable proteins for MTs. If PD-180970 disrupt MT by inhibiting phosphorylation of some MAPs, it must need time for turnover of proteins phosphorylated before PD-180970 was applied. In the proteomics experiment, author treated the cells with the compounds for 8-9 hr. On the other hand, in BY-2 cells, PD-18970 disrupted MTs only 30 min after application of PD-180970. I wonder if proteins were replaced during the 30 min. Could authors examine how long it takes to affect interphase MTs? If PD-180970 disrupts MTs in a 5-10 min like oryzalin, it is unlikely that inhibition of phosphorylation of proteins like MAP70 caused MT disruption. Rather, it may inhibit some proteins that have activity to disrupt microtubules but are usually inactivated by phosphorylation or inhibit something directly without phosphorylation.

      We agree that there is no evidence that PD-180970 disrupts MTs by inhibiting phosphorylation of MAP70. In our live-imaging system, in which reagents are added to liquid cultivation medium, the time from the reagent application to the arrival to each cell varies. Therefore, in order to accurately measure the time required for the inhibitor to take effect, it is necessary to design a new assay system, such as using fluorescent dyes to monitor the reagent's diffusion. In addition, since some reactions mediated by protein phosphorylation occur rapidly, minute-order observations might not be sufficient. Therefore, as an alternative strategy to assess the direct involvement of MAP70 phosphorylation on MT stabilization, we will examine whether PD-180970 induces MT disruption using strains expressing the phospho-blocked (Ala) and phospho-mimic (Asp) versions of MAP70 described above.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1:

      Minor comments

      The authors identified the analogs PD-166326 and PP1 as potent inhibitors of cell division. For completeness, it would be interesting to include a description of these arrest phenotypes and how they compare with that of PD180870 or PP2.

      We have added the effects of all tested compounds on Arabidopsis embryos in Fig. S3C and Table S1. Based on this data and the results of tobacco BY-2 cells, we have compared the effects of PD-166326 and PD180870, and PP1 and PP2 in Results.

      Although there are two more obvious candidates in the phosphoproteome datasets on which the authors focus on, there is very little discussion on whether the other top hits and whether they might be involved in cell division. On a related note, there is no discussion on the specificity of these compounds and the likelihood of phenotypes unrelated to cell division.

      We have added the information of “Similar proteins in Arabidopsis” and “Description and putative functions” for all identified candidates for PD-180970 and PP2 in Table S2 and S3, respectively. With referring this information, we have added the sections to describe the possible contributions of these candidates on MT organization and phragmoplast formation in Results. In addition, we have described the specificity of these compounds and the phenotypes unrelated to cell division in the section for the results of Arabidopsis roots (Fig. S2A).

      1st results section:

      "...developed into the globular stage without causing morphological defects..."

      Should omit the word "causing" or replace with "any/detectable"

      We have omitted the word "causing".

      Reviewer #2:

      Even if the identification of the kinase(s) targeted by these two compounds is missing, the characterisation of at least two downstream effectors of these elusive kinase(s) inhibited by PD-180970 and PP2 is an important step forward. I would recommend to this point make very clear in the writing (e.g. already in the abstract). Upon a superficial reading, the reader could assume that MAP70s and PAKRP1s are the direct molecular targets of these compounds.

      We appreciate the very positive comments. To clarify this point, in addition to the following responses to each suggestion, we have changed the last sentense of the abstract to “These properties make PD-180970 and PP2 useful tools for transiently controlling plant cell division at key manipulation nodes that are conserved in diverse plant species”.

      Major comments

      • I would modify the title to shift the emphasis from the methodology to the biological targets identified.

      We have changed the title to “Identification of novel compounds inhibiting microtubule organization and phragmoplast formation in diverse plant species”.

      • Concerning MAP70s the authors claim that there is little functional data about this family. Yet, a recent paper (https://www.science.org/doi/10.1126/sciadv.abm4974) identifies MAP70-5 as necessary for the proper organisation of CMTs in the endodermis and its ability to actively remodel to accommodate emergence of the lateral root primordium in Arabidopsis thaliana. This could provide a functional context to test several of the predictions that the authors list in the discussion.

      We have referred this paper in Results and Discussion, as “MAP70-5 was reported to increase MT length in vitro and to reorganize cortical MTs to alter the endodermal cell shape for lateral root initiation, suggesting that MAP70-5 mediates dynamic change of MT arrays”.

      Minor comments

      • The narrative would be improved by moving the section "PD-180970 and PP2 do not irreversibly damage viability" before the phosphoproteomic section.

      We have moved the “irreversibly” section to before the “phosphoproteomics” section.

      Reviewer #3:

      Minor comments

      In supplemental data, authors show only 12 or 14 candidates of the target. It is interesting how other MAPs including homologues of MAP70 and Kiesnin-12 in BY-2 cells were scored in the phospho-proteomics assay. I suggest authors show longer lists of proteomics including other MAPs. It would be valuable information for the research community.

      We apologize for not providing the complete dataset. We have added Dataset S1 of total protein sequences that we predicted from published RNA-sea data of BY-2 cells, and all identified proteins of phosphoproteomics assay for PD-180970 and PP2 in Datasets S2 and S3, respectively. We have moved the lists of top candidates to Tables S2 and S3.

      In Abstract, authors should mention that the two compounds reduced phosphorylation level of diverse proteins including MAP70 and Kinesin-12. This is very important results and, otherwise, it may cause misunderstanding of the activity of the compounds. In addition to this, it is better to rephrase the following sentence. "presumably by inhibiting MT-associated proteins (MAP70)" with "presumably by inhibiting phosphorylation of MT-associated proteins (MAP70)."

      To avoid such a misunderstanding, we have changed the descriptions in Abstract to “Phosphoproteomic analysis showed that these compounds reduced phosphorylation level of diverse proteins. In particular, PD-180970 inhibited phosphorylation of the conserved serine residues in MT-associated proteins (MAP70). PP2 significantly reduced the phosphorylation of class II Kinesin-12, and impaired its localization at the phragmoplast emerging site”. Due to this change, the suggested sentence was eliminated. Also in Discussion, we have mentioned the reduction of phosphorylation of various proteins by stating, "we found that PD-180970 and PP2 reduced the phosphorylation levels of diverse proteins. These parts may be further modified depending on the results of the phospho-blocked (Ala) and phospho-mimic (Asp) analyses.

      Page7 line 1st. it would be better to insert "of MAP70 family" after "in the conserved MT-binding domain" because the MT binding domains are unique to the MAP70 family. I could not understand why this is " (2nd line) consistent with PD-18970 severely disrupting all the tested MT structure". At current stage, there is no evidence that dephosphorylation of MAP70 caused the microtubule disruption. I suggest authors remove the sentence (", which was~MT structures").

      We agreed on both points and have corrected them as the reviewer suggested.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript by Kimata and colleagues describes identification of compounds that inhibit microtubule organization and cell division in plants. In this manuscript, authors screened two chemical libraries and successfully found two compounds, PD-180970 and PP2", as potent inhibitors of cell division. Because the two compounds act as kinase inhibitor in animal cells, authors analyzed their effects on phosphorylation by using BY-2 suspension cells. Among the affected proteins, authors focused on two microtubule-related proteins, MAP70 and Kinesin-12. Authors showed that PD-180970 disrupt all structures of microtubules in BY-2 cells. All members of Arabidopsis MAP70 shared the target residues, although Arabidopsis map70 mutants grew normally. PP2 abolished the localization of Kinesin-12 to phragmoplasts, and inhibit the cell plate formation. Both compounds worked in other plant species including moss. I think highly of the unique screening system using Arabidopsis zygote and BY-2 cells developed by the authors. Although the direct targets and specificity of the compounds are still to be determined, I think the two compounds should become powerful tools in plant cell biology in future. Generally, the manuscript is well written, and the data are of high quality. I have, however, several suggestions as below.

      Major

      The major concern is lack of evidence to connect MAP70 and MT disruption upon treatment with PD-180970, in contrast to PP2, which was shown to affect localization of Kinesin-12. I wonder if authors could use taxol to stabilize MTs, then observe the localization of MAP70 with application of PD-180970?

      I have another concern on the action of PD-180970. PD-180970 appears to affect ubiquitously indispensable proteins for MTs. If PD-180970 disrupt MT by inhibiting phosphorylation of some MAPs, it must need time for turnover of proteins phosphorylated before PD-180970 was applied. In the proteomics experiment, author treated the cells with the compounds for 8-9 hr. On the other hand, in BY-2 cells, PD-18970 disrupted MTs only 30 min after application of PD-180970. I wonder if proteins were replaced during the 30 min. Could authors examine how long it takes to affect interphase MTs? If PD-180970 disrupts MTs in a 5-10 min like oryzalin, it is unlikely that inhibition of phosphorylation of proteins like MAP70 caused MT disruption. Rather, it may inhibit some proteins that have activity to disrupt microtubules but are usually inactivated by phosphorylation or inhibit something directly without phosphorylation.

      Minor

      In supplemental data, authors show only 12 or 14 candidates of the target. It is interesting how other MAPs including homologues of MAP70 and Kiesnin-12 in BY-2 cells were scored in the phospho-proteomics assay. I suggest authors show longer lists of proteomics including other MAPs. It would be valuable information for the research community.

      In Abstract, authors should mention that the two compounds reduced phosphorylation level of diverse proteins including MAP70 and Kinesin-12. This is very important results and, otherwise, it may cause misunderstanding of the activity of the compounds. In addition to this, it is better to rephrase the following sentence. "presumably by inhibiting MT-associated proteins (MAP70)" with "presumably by inhibiting phosphorylation of MT-associated proteins (MAP70)."

      Page7 line 1st. it would be better to insert "of MAP70 family" after "in the conserved MT-binding domain" because the MT binding domains are unique to the MAP70 family. I could not understand why this is " (2nd line) consistent with PD-18970 severely disrupting all the tested MT structure". At current stage, there is no evidence that dephosphorylation of MAP70 caused the microtubule disruption. I suggest authors remove the sentence (", which was~MT structures").

      Significance

      Redundancy of genes prevent researchers from exploring the genetic mechanisms of cell division. Time-specific manipulation of plant cell division by optogenetics or pharmacology has not been established. Identification of compounds that can specifically affect cell division is desired for further investigation of plat cell division. Although the direct targets and specificity of the compounds are still to be determined, I think the screening system and the two compounds identified by the authors should become powerful tools in plant cell biology in future. This work will influence not only plant biologists but also broad readership including cell/developmental biologists and chemical biologists.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Kimata et al. report on the identification and characterisation of two compounds inhibiting cell division in plants. Motivated by the need to circumvent genes function redundancy to study cell division in plants, the authors screened 170 biologically active compounds for inhibitors of the first highly stereotypical division of the Arabidopsis zygote. They identify two compounds PD-180970 and PP2 that very potently block this division. Monitoring the effect of these compounds on microtubules dynamics and using high resolution imaging on BY-2 cell cultures, they conclude that PD-180970 inhibits MT organization while PP2 inhibits phragmoplast formation. These two compounds have reversible effects and are active in several plants species ( Arabidopsis, Tobacco, Cucumber and Physcomitrella). Both compounds targets kinases in animal cells. To shed light on the molecular mechanism perturbed by these compounds, the authors performed a phospho-proteomic profiling of synchronised BY-2 cells upon treatment by either of this compounds, controlled by inactive analogs. They identify two proteins which phosphorylation is severely reduced by the compounds. PD-180970 reduces the phosphorylation of members of the MAP70 at three conserved S residues, two mapping in the microtubule binding domains. PP2 reduces phosphorylation of PAKRP1/KIN12A and PAKRP1L//KIN12B a pair of phragmoplasts-associated kinesins. The authors show that PP2 disrupts the phragmoplast-localisation of the both kinesins, phenocopying the effects of the double mutant pakrp1/pakrp1l and thus providing a likely molecular mechanisms for the effects of PP2.

      Overall the manuscript is solid, the experiments well executed and controlled and the results precisely. The conclusions are supported by the data and the manuscript is clearly structured.

      Even if the identification of the kinase(s) targeted by these two compounds is missing, the characterisation of at least two downstream effectors of these elusive kinase(s) inhibited by PD-180970 and PP2 is an important step forward. I would recommend to this point make very clear in the writing (e.g. already in the abstract). Upon a superficial reading, the reader could assume that MAP70s and PAKRP1s are the direct molecular targets of these compounds.

      Major comments:

      • I would modify the title to shift the emphasis from the methodology to the biological targets identified.
      • Concerning MAP70s the authors claim that there is little functional data about this family. Yet, a recent paper (https://www.science.org/doi/10.1126/sciadv.abm4974) identifies MAP70-5 as necessary for the proper organisation of CMTs in the endodermis and its ability to actively remodel to accommodate emergence of the lateral root primordium in Arabidopsis thaliana. This could provide a functional context to test several of the predictions that the authors list in the discussion.
      • The manuscript would strongly benefit from being revised by a native english speaker. There are many unusual or awkward formulation, in particular in the abstract.

      Minor comments:

      • The narrative would be improved by moving the section "PD-180970 and PP2 do not irreversibly damage viability" before the phosphoproteomic section.

      Significance

      Plant cell biologists interested in cell division and microtubules will find this pre-print enticing. The compounds identified will reveal useful tools to analyse cell division in plants and the manuscript provides a significant technical advance. Although my expertise does not lay in the field of chemical inhibitors of cell division, there are to my knowledge, no compounds that selectively inhibit phragmoplast growth like PP2. The manuscript paves the way for further studies such as genetic suppressor screens to identify the plant kinase(s) targeted by these compounds.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Kimata and colleagues describe the identification of two reversibly acting compounds that affect specific stages of cell division through a chemical screen in in vitro cultured Arabidopsis zygotes. They further characterize the effects of these compounds on cell division using advanced imaging techniques, and demonstrate that they perturb cell cycle progression in multiple plant species and systems, thus acting on conserved pathways. Finally, using phosphoproteomics and transgenic approaches in cultured plant cells the authors identified potential indirect targets of these compounds. The work described is very thorough and well presented, with conclusions supported by the data. In addition it provides information on two compounds that can be broadly applied in plant molecular research and demonstrate the feasibility of an in vitro ovule culture system for chemical screening in plants.

      Major comments

      Localization analysis of a transiently expressed MAP70 transgene with inactivating phosphosite mutations would be important to see whether the identified conserved phosphosites are relevant for MAP70 interaction with MTs. This experiment could be performed rapidly using transient expression in BY-2 cells.

      The authors propose that PP2 blocks phragmoplast formation by preventing phosphorylation of class II Kinesin-12 proteins. In support, authors show that PP2 treatment correlates with a decrease in KIN12A phosphopeptide count (not fully abolished) and its failure to localize to emerging phragmoplasts in BY-2 cells and Physcomitrium. As class II Kinesis-12 proteins have been previously implicated in phragmoplast assembly this is a fairly reasonable hypothesis, but would benefit from the analysis of transgenic KIN12A variants carrying inactivating (A) or potentially activating (D/E) phosphosite mutations. Is loss of phosphorylation sufficient to prevent phragmoplast localization? Can an activated variant rescue PP2-induced KIN12A localization and cell division defects? As above, using transient expression in BY-2 cells would be a fast approach to tackle these questions.

      Minor comments

      The authors identified the analogs PD-166326 and PP1 as potent inhibitors of cell division. For completeness, it would be interesting to include a description of these arrest phenotypes and how they compare with that of PD180870 or PP2.

      Although there are two more obvious candidates in the phosphoproteome datasets on which the authors focus on, there is very little discussion on whether the other top hits and whether they might be involved in cell division. On a related note, there is no discussion on the specificity of these compounds and the likelihood of phenotypes unrelated to cell division.

      1st results section: "...developed into the globular stage without causing morphological defects..." Should omit the word "causing" or replace with "any/detectable"

      Significance

      The work described in this manuscript identified and characterized two compounds that affect specific processes important for cell division in plant cells. Furthermore, the authors demonstrate the feasibility of an in vitro ovule culture system for chemical screening of specific processes in early plant embryos. The identified compounds are effective in multiple tissues in phylogenetically distant land plants and their effects are reversible. These compounds can be useful to, for example, manipulate the microtubule cytoskeleton, cell cycle, or ploidy in different plant models and different contexts, and more specifically to cell biologists studying the molecular mechanisms of cell division.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      REVIEWER #1

      __Summary: __Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      In virtue of the classical cancer stem cells (CSC) marker ALDH1A1 and SMAD Response Element (SRE) promoter, the authors engineered CSC-derived extracellular vesicles (EVs). By performing the second sortase (SrtA)-based proximity labeling, the authors detected the immune cells that specifically interacted with the CSC-EVs and demonstrated that CSC-EVs preferentially target MHC-II- macrophages and PD-1+ T cells.


      Major comments: - Are the key conclusions convincing? No. CD63 is accepted as the exosome marker, but cannot represent the whole population of EVs. Especially, we do have the information on the percentage of CD63+ EVs among the total population derived by CSCs. However, it seems impossible to estimate the total population derived by CSCs. It is the inherent flaw of the strategy, which limits the accuracy of the labeling. One possible method is to label CD81+ and CD9+ EVs, together with CD63+ EVs, to study the immune cells interacting with CSC-EVs in vitro and in vivo.

      We would like to thank this Reviewer for the time spent reading our manuscript and for highlighting the fact that other EVs markers could have been used to track EVs. We would like to point out that the Sortase-A experimental strategy is independent from any assumptions on EV markers since SrtA is fused to a commonly-used transmembrane domain (from PDGFR, see Hamilton et al. Adv Biosys 2020). Nonetheless, we followed up on the Reviewer suggestion and performed additional experiments to assess the overlap between this generic membrane marker, CD63 and CD81. To this end, we have performed multicolor nano-flow cytometry and stained EVs for SrtA (via its flag peptide) and CD63 or CD81 (new suppl. Fig. S2B-C). We used Flag staining to detect the PDGFR transmembrane domain as a generic membrane marker (__new suppl. Fig. S1C __and ref. 50). We observed that Flag staining colocalized with both CD63+ and CD81+ EVs, indicating not only that the use of a general-purpose transmembrane domain transcends classical EV biomarkers, but also that CD63 and CD81 label largely overlapping EV subpopulations, as previously reported (Jeppesen DK et al. Cell 2019). Accordingly, SrtA- and CD63-GFP-based strategies yielded very similar results (Fig. 2 and 4).

      Compared with the normal cancer cells, cancer stem cells are a very small population. It is reasonable to consider that the CSC-EVs is also a small population among total EVs. Therefore, it is quite questionable to compare the interaction of normal cancer cells-derived EVs and CSC-EVs with immune cells.

      We fully agree with this reasoning, and it is exactly because of this contrast that our observations of the specific behavior of CSC-EVs are very relevant for CSC biology. Our experimental design includes proper control groups and is based on validated approaches (Hamilton et al. Adv Biosys 2020).

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Yes. The authors stated "such EV-mediated intercellular communication between CSC and these immune cells contributed to the observed spatial interactions and niche sharing." Not enough evidence supported the statement.

      We have removed these claims from the text.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. As mentioned before, if the authors could perform the labeling CD81+ and CD9+ CSC-EVs and study the interaction with immune cells, the conclusion may be more convincing.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. The suggested experiments are time-consuming.

      We appreciate the Reviewer acknowledging that repeating the CD63-GFP experiments in the manuscript using CD81-GFP and CD9-GFP fusion reporters is very time-consuming. Nonetheless, we have provided proof that the SrtA approach labels both CD63+ and CD81+ EVs (new suppl. Fig. S2B-C), which, together with the fact that both CD63-GFP- and SrtA-based approaches yielded very similar results (Fig. 2 and 4), strongly indicates that repeating these experiments with additional reporters will add limited value to already sound conclusions.

      • Are the data and the methods presented in such a way that they can be reproduced? Yes.

      • Are the experiments adequately replicated and statistical analysis adequate? Yes.

      Minor comments: - Specific experimental issues that are easily addressable. Yes

      • Are prior studies referenced appropriately? The references related SrtA-mediated labeling were not sufficiently referenced.

      The full characterization of SrtA-based strategy was cited in reference 50 (Hamilton et al. Adv Biosys 2020). We would be happy to include any reference this Reviewer thinks is missing.

      • Are the text and figures clear and accurate? Yes

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? No

      Significance: - Describe the nature and significance of the advance (e.g., conceptual, technical, clinical) for the field.

      Cancer cells are heterogeneous. It is natural to believe that EVs are heterogeneous due to their different origin. Considering the important role of cancer stem cells during tumor development and treatment resistance acquisition, it is important to understand the function of CSC-EVs in the tumor microenvironment. However, considering the methodology is questionable, I am not sure the conclusions are convincing. For Figure 3, there are many pieces of literature on this topic and showing the data that macrophages in CSCs niches are good for the maintenance of CSC. So, it is not novel.

      We thank the Reviewer to point out the importance of understanding the function of CSC-EVs in the tumor microenvironment. We hope we have addressed the methodology issues raised by this Reviewer. Although recent students outline the relationship between CSC and macrophages biology, very little is known about the role of EVs in this interaction. The novelty of our work stems from the use of advanced genetic engineering approaches that allow us to demonstrate directly in vivo, without any in vitro manipulation of CSC-EVs, that CSC-EVs come in contact with macrophages (and other specific immune subsets).

      • Place the work in the context of the existing literature (provide references, where appropriate).

      • State what audience might be interested in and influenced by the reported findings. Cancer stem cells or extracellular vesicles are timely topics and would be interesting to people in the cancer and EV fields.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate., EVs biology, with no special focus on CSCs.

      REVIEWER #2

      __Summary: __In this work Dr. Pucci and colleagues use flowcytometry and in vivo approaches to define the potential role of EVs originating from cancer stem cells to mediate intercellular communication with cells of immune origin in the cancer microenvironment. The work is interesting, the team used specific promoters to drive the expression of EV markers specifically in cancer stem cells. Another interesting approach is the use of sortase to label neighbour cells in the cancer microenvironment.

      Overall the reviewer has the impression the work is quite superficial and the conclusions cannot be claimed by the results presented in the paper for the following reasons:

      1. Cancer stem cells are a relatively small fraction when compared to the entire cancer cell population, therefore it is possible that the EV released tend to accumulate in macrophages because those are cells competent for specialized internalization and clearance of EV (e.g. PMID: 30745143). Second accumulation in macrophages does not mean any kind of signaling, it may just be that the EVs are degraded.

      We thank the Reviewer for understanding and appreciating our work. We would like to point out that the novelty of our work is the identification of a specific macrophage subset (MHC-II– macrophages) that is mainly targeted by CSC-EVs (Fig.2). We observed a selective enrichment of these macrophage subset when tracking CSC-EVs, which argues against passive uptake, as the Reviewer seems to suggest. Moreover, these class-II negative macrophages were also found at increased frequency within the CSC niche (Fig.4), further suggesting an active process. This information is not trivial and cannot be inferred from the literature.

      We are not claiming any signaling mechanism, which will be the focus of future work, including the role of CSC-EVs on maintaining MHC-II– immunosuppressive macrophage populations. We have amended the main text to clarify these points.

      The sortase experiment is very interesting, however key controls are missing. For example, a thorough in vitro characterization of the system is needed: a. No clear description of the vectors used is provided (how is the labelling fluorescent protein released by the cells? How far can the protein diffuse?); b. The sortase's labelling efficiency is not characterized. c. Which proteins are targeted by sortase in the acceptor cells? There is any protein that can specifically be labelled by sortase on the cell surface of acceptor cells? This is not explained or validated. d. The authors claim that the labelling is provided by EVs harboring sortase on their surface, however also the plasma membrane of the cells may efficiently label cells. This should be explored and discussed. Which is the enzymatic sortase activity present on the EVs? How the authors can exclude that the red fluorescent protein is simply internalized by the neighbor cells? This should also be evaluated.

      We thank the Reviewer for the interest in the SrtA-based approach, which we have thoroughly described and validated in Hamilton et al. Adv Biosys 2020 (ref. 50). We apologize if we did not reference that work properly. In that publication, we characterized SrtA labelling efficiency under various conditions and with different substrates, we mentioned which proteins can be targeted by SrtA on the surface of cells (that is, any protein with an N-terminal glycine, such as MHC-I, VE-Cadherin, CD19, integrins, …). We have clarified the above details in the new version of the manuscript.

      We apologize for not including a schematic of the lentiviral vectors used, that we have now added (new suppl. Fig. S1). These schematics show that the red fluorescent protein is released from cells because it is fused to a signal sequence. In order to control for internalization of the red fluorescent protein by neighboring cells, we have used a control group in which CSC do not express SrtA while the bulk of tumor cells (including CSC) still secrete the red fluorescent protein. The values for SrtA activity are calculated by subtracting baseline internalization of red fluorescent protein by each individual immune subset. We have amended the Methods section to clarify this.

      We are glad to hear that the Reviewer fully understood how the SrtA-based approach works. As this Reviewer mentions, it is not possible to discriminate between CSC and CSC-EVs since SrtA is present on both. This is a limitation of current EV technology in general. Although we were careful in wording our results and conclusions, we have revised the manuscript to take this into further consideration. The manuscript now claims that the SrtA approach unveils short-range interactions between CSC, CSC-EVs and immune cells due to their proximity.

      Method section should be expanded, map of vectors provided and possibly deposited.

      We apologize for not including a schematic of the lentiviral vectors used, that we have now added (new suppl. Fig. 1). We have expanded description of the Methods. We will promptly deposit the lentiviral transfer plasmids employed in this work with Addgene upon publication.

      __Minors: __The paper should be expanded, and experiments better described.

      We have expanded the paper and the description of the experiments, as requested.

      CROSS-CONSULTATION COMMENTS I find the comments of Reviewer 1 important to be addressed. I might have under-estimated the amount of time necessary to revise the work. I still believe the material and method section is insufficient.

      We thank the Reviewer for acknowledging that major revision of our work is very time-consuming. We hope to have addressed both Reviewer 1 and 2 comments.

      Significance: The paper present technological innovation that can result of interest for the large audience of EV enthusiasts. The scientific advancement is limited since the conclusion: These results suggest that combination therapies targeting CSC, tumor macrophages and PD1 may synergize, is known and the work presented does not really support it since there is no evidence the EVs have any signaling role. Perhaps the authors should work more on tightening their result to a cell biology perspective of cancer niche interaction.

      We thank the Reviewer for understanding the technological innovation. We will remove the statement on combination therapy and refocus on a cell biology perspective.

      REVIEWER #3

      In this paper, the authors used genetically engineered CSC-derived EVs to perform sortase-mediated in vivo proximity labeling and interrogate interactions of these vesicles with immune cells in the TME. The authors show that these EVs mediate intra-tumoral recruitment of immune cells, MHC class II(-) macrophages and PD1+ T cells, to the CSC niche and define EV-mediated special interactions of these immune cells within this niche. The manuscript is timely and novel, as it introduces a new experimental platform for identification, characterization and monitoring of CSC-derived EVs within the TME. Much has been recently learned about tumor cell-derived "tEVs", while almost nothing is known about CSC-derived tEVsCSC. Here, using genetic engineering, the authors have created specifically labeled fluorescent (GFP) tEVsCSC and studied interactions of these vesicles with immune cells in the TME. Two different HNSCC mouse models, MOC2 (carcinogenesis-dependent) and mEER (Ras dependent), were used. CSC populations were identified as cells with the brightest GFP fluorescence (~5%). These cells also expressed the known stem cell markers and formed oospheres in vitro. The authors then show that tEVcsc preferentially targeted MHC II (-) macrophages, which avidly uptake these EVs. tEVcsc also showed preferential tropism towards PD1+ T cells. Further, the authors demonstrate that location-dependent labeling indicates the presence and "clustering" of MHC II (-) macrophages and PD1+ T cells in the same niche within the TME. The generation of genetically modified labeled fluorescent tEVcsc and tEVs and in vitro as well as in vivo analyses of their interactions with immune cells in the TME were technically demanding. These studies were expertly performed, and the results are convincing. The data presentation is adequate, but the figure legends are sparce, and the text is densely narrated and somewhat difficult to read. Some more clarity in Results and more explicitly documented correlative data would clarify and enhance the message the authors convey.

      We thank the Reviewer for fully understanding the novelty of our work. We agree that the description of results and figures can be improved. We have now clarified the narration of results and figure legends so they can be better understood.

      The Discussion is rationally written, but the comments in Abstract and in conclusions about combination therapies and targeting CSC, tumor macrophages and PD1 to lower HNSCC recurrence are not appropriate. There is nothing in this manuscript about immunotherapy and these comments should be deleted.

      We agree with the Reviewer and we have removed these statements from the text.

      Overall, this is an interesting, timely and novel manuscript using genetically modified, fluorescently labeled EVs to explore their interactions with immune cells in the TME of HNSS. Technical and experimental approaches are complex, but appear to be well done, providing an experimental model for probing cellular interactions in the TME at a single-cell level. I recommend acceptance after modifications as suggested above

      Significance: Significance is high, as it advances our understanding of the interactive role of CSC-derived EVs with macrophages and T cells in the tumor microenvironment.

      We thank the Reviewer for appreciating the significance of our work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this paper, the authors used genetically engineered CSC-derived EVs to perform sortase-mediated in vivo proximity labeling and interrogate interactions of these vesicles with immune cells in the TME. The authors show that these EVs mediate intra-tumoral recruitment of immune cells, MHC class II(-) macrophages and PD1+ T cells, to the CSC niche and define EV-mediated special interactions of these immune cells within this niche.

      The manuscript is timely and novel, as it introduces a new experimental platform for identification, characterization and monitoring of CSC-derived EVs within the TME. Much has been recently learned about tumor cell-derived "tEVs", while almost nothing is known about CSC-derived tEVsCSC. Here, using genetic engineering, the authors have created specifically labeled fluorescent (GFP) tEVsCSC and studied interactions of these vesicles with immune cells in the TME. Two different HNSCC mouse models, MOC2 (carcinogenesis-dependent) and mEER (Ras dependent), were used. CSC populations were identified as cells with the brightest GFP fluorescence (~5%). These cells also expressed the known stem cell markers and formed oospheres in vitro. The authors then show that tEVcsc preferentially targeted MHC II (-) macrophages, which avidly uptake these EVs. tEVcsc also showed preferential tropism towards PD1+ T cells. Further, the authors demonstrate that location-dependent labeling indicates the presence and "clustering" of MHC II (-) macrophages and PD1+ T cells in the same niche within the TME.

      The generation of genetically modified labeled fluorescent tEVcsc and tEVs and in vitro as well as in vivo analyses of their interactions with immune cells in the TME were technically demanding. These studies were expertly performed, and the results are convincing. The data presentation is adequate, but the figure legends are sparce, and the text is densely narrated and somewhat difficult to read. Some more clarity in Results and more explicitly documented correlative data would clarify and enhance the message the authors convey. The Discussion is rationally written, but the comments in Abstract and in conclusions about combination therapies and targeting CSC, tumor macrophages and PD1 to lower HNSCC recurrence are not appropriate. There is nothing in this manuscript about immunotherapy and these comments should be deleted.

      Overall, This is an interesting, timely and novel manuscript using genetically modified, fluorescently labeled EVs to explore their interactions with immune cells in the TME of HNSS. Technical and experimental approaches are complex, but appear to be well done, providing an experimental model for probing cellular interactions in the TME at a single-cell level. I recommend acceptance after modifications as suggested above

      Significance

      Significance is high, as it advances our understanding of the interactive role of CSC-derived EVs with macrophages and T cells in the tumor microenvironment.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this work Dr. Pucci and colleagues use flowcytometry and in vivo approaches to define the potential role of EVs originating from cancer stem cells to mediate intercellular communication with cells of immune origin in the cancer microenvironment. The work is interesting, the team used specific promoters to drive the expression of EV markers specifically in cancer stem cells. Another interesting approach is the use of sortase to label neighbour cells in the cancer microenvironment.

      Overall the reviewer has the impression the work is quite superficial and the conclusions cannot be claimed by the results presented in the paper for the following reasons: 1. Cancer stem cells are a relatively small fraction when compared to the entire cancer cell population, therefore it is possible that the EV released tend to accumulate in macrophages because those are cells competent for specialized internalization and clearance of EV (e.g. PMID: 30745143). Second accumulation in macrophages does not mean any kind of signaling, it may just be that the EVs are degraded.

      1. The sortase experiment is very interesting, however key controls are missing. For example, a thorough in vitro characterization of the system is needed:
        • a. No clear description of the vectors used is provided (how is the labelling fluorescent protein released by the cells? How far can the protein diffuse?);
        • b. The sortase's labelling efficiency is not characterized.
        • c. Which proteins are targetd by sortase in the acceptor cells? There is any protein that can specifically be labelled by sortase on the cell surface of acceptor cells? This is not explained or validated.
        • d. The authors claim that the labelling is provided by EVs harboring sortase on their surface, however also the plasma membrane of the cells may efficiently label cells. This should be explored and discussed. Which is the enzymatic sortase activity present on the EVs? How the authors can exclude that the red fluorescent protein is simply internalized by the neighbour cells? This should also be evaluated.
      2. Method section should be expanded, map of vectors provided and possibly deposited.

      Minors:

      The paper should be expanded, and experiments better described.

      Referees cross-commenting

      I find the comments of Reviewer 1 important to be addressed. I might have under-estimated the amount of time necessary to revise the work.

      I still believe the material and method section is insufficient.

      Significance

      The paper present technological innovation that can result of interest for the large audience of EV enthusiasts. The scientific advancement is limited since the conclusion: These results suggest that combination therapies targeting CSC, tumor macrophages and PD1 may synergize, is known and the work presented does not really support it since there is no evidence the EVs have any signaling role. Perhaps the authors should work more on tightening their result to a cell biology perspective of cancer niche interaction.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      In virtue of the classical cancer stem cells (CSC) marker ALDH1A1 and SMAD Response Element (SRE) promoter, the authors engineered CSC-derived extracellular vesicles (EVs). By performing the second sortase (SrtA)-based proximity labeling, the authors detected the immune cells that specifically interacted with the CSC-EVs and demonstrated that CSC-EVs preferentially target MHC-II- macrophages and PD-1+ T cells.

      Major comments:

      • Are the key conclusions convincing?

      No.

      CD63 is accepted as the exosome marker, but cannot represent the whole population of EVs. Especially, we do have the information on the percentage of CD63+ EVs among the total population derived by CSCs. However, it seems impossible to estimate the total population derived by CSCs. It is the inherent flaw of the strategy, which limits the accuracy of the labeling. One possible method is to label CD81+ and CD9+ EVs, together with CD63+ EVs, to study the immune cells interacting with CSC-EVs in vitro and in vivo. Compared with the normal cancer cells, cancer stem cells are a very small population. It is reasonable to consider that the CSC-EVs is also a small population among total EVs. Therefore, it is quite questionable to compare the interaction of normal cancer cells-derived EVs and CSC-EVs with immune cells. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Yes. The authors stated "such EV-mediated intercellular communication between CSC and these immune cells contributed to the observed spatial interactions and niche sharing." Not enough evidence supported the statement. - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      As mentioned before, if the authors could perform the labeling CD81+ and CD9+ CSC-EVs and study the interaction with immune cells, the conclusion may be more convincing. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      The suggested experiments are time-consuming. - Are the data and the methods presented in such a way that they can be reproduced?

      Yes. - Are the experiments adequately replicated and statistical analysis adequate?

      Yes.

      Minor comments:

      • Specific experimental issues that are easily addressable.

      Yes - Are prior studies referenced appropriately?

      The references related SrtA-mediated labeling were not sufficiently referenced. - Are the text and figures clear and accurate?

      Yes - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      No

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Cancer cells are heterogeneous. It is natural to believe that EVs are heterogeneous due to their different origin. Considering the important role of cancer stem cells during tumor development and treatment resistance acquisition, it is important to understand the function of CSC-EVs in the tumor microenvironment. However, considering the methodology is questionable, I am not sure the conclusions are convincing.

      For Figure 3, there are many pieces of literature on this topic and showing the data that macrophages in CSCs niches are good for the maintenance of CSC. So, it is not novel.

      • Place the work in the context of the existing literature (provide references, where appropriate).
      • State what audience might be interested in and influenced by the reported findings.

      Cancer stem cells or extracellular vesicles are timely topics and would be interesting to people in the cancer and EV fields. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.,

      EVs biology, with no special focus on CSCs.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Answers to reviewers’ comments

      (Reviewers comments are in italics. Text modifications in the manuscript file are in blue.)

      Overall, we acknowledge referee’s careful reading of the paper and comments that we think have helped further improvement of the manuscript.

      On the attached pages are our detailed point by point responses to the referees’ comments along with a description of how the manuscript was modified in accordance.

      New data included:

      In response to the comments and suggestions of both reviewers 1 and 3, we conducted new experiments to test genetic interactions between different actors of the BMP and activin pathways. These new results confirm and complement the analyses described in the original manuscript. Furthermore, as suggested by reviewer 2, we have further studied the phenotypes of hiPSC-CM, by analyzing gene expression profiles and by analyzing the morphological changes induced as a result of PAX9 knockdown.

      NB: The title has been slightly modified, to highlight the conserved features of the genetic architecture of cardiac performance revealed in the study

      __Former title: __Genetic architecture of natural variation of cardiac performance in flies.

      __Novel title: __Genetic architecture of natural variation of cardiac performance: From flies to humans.

      Reviewer 1

      1. 1. The authors utilized the RNAi-mediated knockdown approach in their functional validation studies. It is not clear how each genetic variation (SNP) affects its associated genes. Could some of the SNPs activate the candidate gene expression? For the 4 candidate genes that failed to show cardiac defects, could the overexpression of these 4 genes alter cardiac performance? Answer 1- Of course, we cannot predict direction of the effect of the variants on the function of the genes. In this context, loss-of-function experiments are subjected to a risk of false negatives. It is indeed possible that in the case of a lack of effect of the loss of function, a gain of function could reveal an effect. But gain-of-function experiments are difficult to control, and often subjected to non-specific effects because it is complicated to control the level of over-expression compared to endogenous expression. This did not seem suitable for an extensive analysis of a large number of genes. We therefore chose to test only for loss of function.

      In addition, our approach to testing heart-specific RNAi aims to assess the quality of the association results by comparing RNAi for genes identified by GWAS to randomly selected genes. It is not intended to describe precisely the involvement of each gene individually.

      (See also answer to reviewer 2 comment n°2 and the modifications to the manuscript that have been made and which address these criticism)

      * 2. babo is the type I activin receptor, not type 2. *

      Answer 2- Thank you, we have corrected this error.

      • The authors show BMP and activin pathway genetically interacts to affect cardiac performance. But it is interesting to find that these interactions are in a trait-dependent manner. For example, it seems that babo and dpp epistatically interact to regulate FS, while they additively regulate HP and DI. The authors need to discuss the complex genetic interaction further. *

      Answer 3- See reply to reviewer 3, comment N°2 below.

      4*. Both snoo and sog are identified from GWAS. How about babo and dpp? Are there any identified SNPs associated with babo and dpp? *

      Answer 4- Considering GWAS for mean phenotypes, there is no variant in dpp that are within the 100 best ranked SNPs nor within the variants identified using fast epistasis. But given the size of the DGRP population we are far from being exhaustive, as we do not reach saturation. It is therefore difficult to comment on these ‘negative’ results. However, we do identify one variant in babo using fast epistasis (see figure 2B and Table S3).

      5. It is unclear why the mad KD behaves oppositely to dpp mutant, although both proteins are involved in BMP pathway. In Figure S5, the mad KD shows reduced FS and HP, but dpp LOF mutant shows increased FS and HP (Figure S4). Can the authors perform RNAi to knockdown dpp specific in the heart to reexamine the role of dpp in the regulation of cardiac function. The whole body LOF mutant dpp-d14 might not target cardiac tissue directly to control heart performance like mad KD.

      Answer 5- (see also answer to reviewer 3 comment n°2) We did perform heart specific dpp RNAi experiments together with other tests for interactions using new allelic combinations of activin and BMP pathways and therefore can compare heart specific knock down to heterozygotes for amorphic mutations for both dpp and mad.

      Regarding dpp, congruent effects on HP, DI, SI, ESD and EDD were observed between mutant and RNAi, while RNAi had opposite effects on FS compared to heterozygotes dppd14 mutants (decreased and increased FS compared to control, respectively). In the case of mad, heterozygous mutants had no effect on FS, EDD and ESD, but similarly to dpp mutants it increased SI, DI and HP. mad RNAi uniquely decreased HP, DI and SI and increased AI. However, similarly to dpp RNAi, it induced a decrease of FS.

      Thus, systemic versus heart specific knockdown of genes induce specific effects, suggesting cardiac non-autonomous interactions. This complex picture of TGFb involvement is now discussed in the result section (see below, Reviewer 3, major comment 2).

      6*. The authors selected two novel genes to study the conversed regulation in both flies and human iPSC cells. Besides testing these novel genes, the authors should also verify whether the conserved pathways, like TGF-beta, regulate heart performance in human iPSC cells similar to the flies. *

      Answer 6- We focused on poxm/Pax9 and sr/Egr2 because none of these TFs were known to have cardiac function in fly nor in mammals. Our paralleled analyses in fly and hiPS-CM illustrates how the description of the genetic architecture of cardiac traits in flies can accelerate discovery in mammals.

      There is extensive literature describing the involvement of TGF B /BMP and Activin pathways in heart development and diseases in humans, hence the choice not to focus on these pathways in iPS-CM.

      Reviewer 2:

        • It will be interesting to compare this fly GWAS to human heart disease GWAS data (for example, cardiomyopathy, arrhythmia, heart failure) from patients. Such cross comparison could make the data set more valuable. * Answer 1- We actually did make this comparison (Table 2, Table S11) and we agree it significantly validates our approach. This identified a set of orthologous genes associated with cardiac traits both in Drosophila and humans, supporting the conservation of the genetic architecture of cardiac performance traits, from arthropods to mammals.
      1. RNAi is the only experimental approach in this manuscript to validate the functional significance from data analyses. Authors may consider using genetic mutations such as deficiency lines or P-element lines to offer an alternative approach. This is simply a suggestion to improve the rigor and reproducibility, not absolutely required. *

      Answer 2- In an attempt to provide a consistent analysis of loss of gene function, our strategy was to concentrate our analysis on the effects of heart specific knock down. This allows us to compare -in a global way- the effects of the knock down of genes identified by GWAS to those of randomly selected genes.

      Our objective was to provide a global view of the heart specific effects of the identified genes, and not to characterize precisely the involvement of each of them, using a combination of mutant alleles, RNAi and gain of function. Given the experimental burden of analyzing cardiac function, such a strategy would have indeed required us to concentrate only a very small number of genes.

      We however recognize that this strategy has limitations:

      • Some variants may lead to gain-of-function effects of genes, and our strategy is not able to test for these effects.

      • Some variants may come from non-cell-autonomous effects, which would not be replicated by our targeted RNAi strategy in the heart.

      Therefore, the false negative rate of our experiments is difficult to estimate.

      We have tried to put this into perspective and to highlight the limitations of our analysis in the results section describing RNAi validation of GWAS results.

      “To assess in an extensive way whether mutations in genes harboring SNPs associated with variation in cardiac traits contributed to these phenotypes ….. (…)

      …… These results therefore supported our association results. It is important to emphasize that our approach is limited to testing the effect of tissue-specific gene knock down. Since some of the variants may lead to increased gene function and/or expression, this can lead to a false negative rate that is difficult to estimate. In addition, some of the associated variants may influence heart function by non cell-autonomous mechanisms, which would not be replicated by cardiac specific RNAi knock down.”

      *In order to validate the roles of predicted TF binding sites, the best approach would be introducing point mutations using CRISPR/Cas9 within the binding motif then testing out molecular and physiological outcomes. Rather authors chose to test indirectly to knock down those TFs. If so, authors need to at least acknowledge the potential caveats of such approach and the limitation in related data interpretation. *

      Answer 3- The reviewer is right, the definitive proof of the involvement of a potential TF binding site on the regulation of a gene located in cis requires to mutate the binding site and to analyze the effect on the expression of the corresponding gene. But this may not be sufficient to definitely demonstrate that the potential TF is indeed a regulator of that gene (the binding motif may be target of yet another TF): definitive proof may require motifs/TF DNA binding domain swaps. This would have been out of the scope of the present study. In addition, the effects on heart performance of mutating one TFBS at a time (among several dozens) may be too weak to allow their characterization with available tools and approaches.

      We acknowledge however that our approach provides an indirect validation of transcription factors binding sites predictions. This was, in our opinion, the most efficient way to evaluate the potential effect of predicted transcription factors.

      We clarify this in the result section:

      “We did not test individually the effects on cardiac performance of mutations in predicted TFBSs located near the SNPs because any individual effect would probably be too small to be detectable by the available methods. Rather, we tested the potential involvement of their cognate TFs by cardiac specific RNAi mediated KD”

      • hiPSC-CM data is somewhat limited by only showing the HR and AP duration data. It is recommended to include some immunocytochemistry data to show the morphology, sarcomere structure of these hiPSC-CMs. Gene expression data generated by qPCR or RNA-seq in particular focusing CM structure and function genes would be helpful too.*

      Answer 4- As suggested by referee 2, we have now performed gene expression analysis and immunostaining of PAX9 KD which gave the strongest phenotype in iPSC-CM (Figure 4 J-M). This unraveled increased expression of Na+ and K+ channels, which is in line with APD shortening phenotype, as well as down regulation of CASQ2, consistent with calcium transient shortening. Expression analysis also revealed increased sarcomeric genes and NPPA/B expression, which was consistent with increased CM size as quantified by the area of TNNT2 staining per nuclei.

      These new data are described at the end of the result section:

      “APD shortening for PAX9 KD was coincident with increased expression of Na+ and K+ ion channels (SCN5A, KCNH2 and KNCQ1) (Figure 4J), supporting the APD shortening phenotype. In this context, the AP kinetics also correlated with shorter calcium transient duration (Figure S8A-D and H-K), including faster upstroke and downstroke calcium kinetics and increased beat rate (peak frequency) (Figure S8E-G and L, M), consistent with decreased expression of Calsequestrin 2 isoform (CASQ2) associated with PAX9 KD (Figure 4J). Finally, assessment of the PAX9 KD effect on sarcomeric content revealed an increase in sarcomeric gene expression (Figure 4K), and an upregulation of genes associated with an hypertrophic response (NPPA, NPPB and NPR1 (Battistoni Et al Circulating biomarkers with preventive, diagnostic and prognostic implications in cardiovascular diseases, Int J Cardiol, 2012, vol. 157) which was coincident with increased CM size as quantified by the area of TNNT2 staining per cardiac nuclei (Figure 4 L, M).

      Collectively, these data illustrate conserved functions for poxm/PAX9 and sr/EGR2 in setting the cardiac rhythm and identify PAX9 as a novel and key regulator of cardiac performance at the cellular level, via the integrated regulation of expression of genes controlling electrophysiology, calcium handling and sarcomeric functions in hiPSC-CMs.”

      Reviewer 3

      Major Comments:

      1- There is an assumption in the use of RNAi knockdown to validate the genes identified in the quantitative analysis, and that is that natural variants are themselves hypomorphic. It is possible that among the variants identified some are hypermorphic, or among the transcription factor binding sites that variants lead to increased factor binding. While RNAi knockdown is an excellent choice to begin validation, I do not think the authors can rule out that a gene not functionally validated by their RNAi tests does not have a role in cardiac function.

      Answer 1. Please see our answers to reviewer 1 comment n°1 and reviewer 2 comment n°2.

      * 2- After performing RNAi knockdown to validate genes identified by GWAS the authors focus on the TGFbeta signaling pathway for downstream analysis. To do so they examine heterozygotes for sog, a repressor of BMP signaling, and snoo, an activator of Activin pathway. The data from the snoo/sog heterozygote is compelling in its disruption of heart phenotypes, and the authors conclude a "coordinated action of activin and BMP." snoo, however, also works as a transcriptional repressor in the BMP pathway, so it's possible that the effects the authors are seeing here could be confined to an increase in BMP signaling. Unlike snoo and sog, mutations in babo and dpp are both expected to have negative effects on Activin and BMP signaling, respectively. The babo/dpp interaction is not as quantitatively convincing as the snoo/sog data, despite the integral roles both babo and dpp play in their respective pathways. If both pathways are connected, why do snoo/sog heterozygotes affect SI phenotypes, while babo/dpp heterozygotes affect fractional shortening? I think the authors data suggest an interesting potential interaction between these pathways, which could be confirmed by examining further mutant combinations, knockdowns or increased expression transgenes, but falls short of a "confirmed synergystic genetic interaction." It does, however, underscore the value of the data in the paper for opening up new avenues for future study. *

      Answer 2 (and reviewer 1 comments 3 and 5).

      These comments led us to reconsider the analysis of the phenotypes associated with loss of function of the TGFb pathway, and to analyze other pathway components combinations.

      We acknowledge reviewer 3 criticisms on snoo/sog experiments, which are difficult to interpret given the broad action snoo may have on both BMP and activin pathways. We have addressed this in the result section.

      We have also analyzed other allelic combinations of BMP and activin pathways components, which strengthen the analysis performed on dpp/babo. Indeed, we tested babo/tkv heterozygotes (respectively specific activin and BMP receptors) and found significant genetic interactions for ESD and EDD. Albeit non-significant, babo/tkv double heterozygotes display a tendency to non-additive effects on FS (p= 0,054). mad/smox heterozygotes (respectively specific downstream TFs of BMP and activin pathways) display interactions (non-additive effects) on HP, SI, DI, ESD and EDD. These new results (Supplemental Figure 4) are thus supporting the hypothesis of genetic interactions between the pathways, but also reveal, as suggested by reviewer 3, a complex relationship between both pathways since interactions are revealed for specific traits in each of the mutant combinations analyzed.

      The phenotypes related to the individual loss of function of each of the actors of these pathways (dpp, tkv and mad for BMP; babo and smox for activin) are however very similar. When they have an effect, heterozygous amorphic alleles of these genes display increased phenotypes related to rhythmicity (HP, DI, SI, AI) and FS, but decreased cardiac diameters (ESD and EDD).

      Finally, as pointed out by reviewer 1, the picture is certainly even more complex since the phenotypes of RNAi mediated heart specific loss of function are not always similar to those of systemic loss of function. Indeed, mad RNAi causes a reduction of HP, DI, SI and FS (Figure S5) whereas heterozygotes for mad12 have either no or opposite effect on these phenotypes, and mad RNAi causes a significative increase in AI whereas mad12 has no effect (Figure S4). The discrepancy between tissue specific RNAi and heterozygous background was also found in the case of dpp, but specifically for the FS. Indeed, as suggested by reviewer 1 we have analyzed the loss of function of dpp by heart-specific RNAi. dpp RNAi results in a reduction of the FS (like mad RNAi) whereas the loss of function in the whole-body results in an increase of the FS.

      We therefore re-wrote the whole corresponding section of the results and modified Figure S4 to include babo/tkv; smox/mad and dppRNAi data.

      “We further focused on the TGFb pathway, since members of both BMP and activin pathways were identified in our analyses. We tested different members of the TGFb pathway for cardiac phenotypes using cardiac specific RNAi knockdown (Figure 2C), and confirmed the involvement of the activin agonist snoo (Ski orthologue) and the BMP antagonist sog (chordin orthologue). Notably, Activin and BMP pathways are usually antagonistic (Figure 2D). Their joint identification in our GWAS suggest that they act in a coordinated fashion to regulate heart function. Alternatively, it may simply reflect their involvement in different aspects of cardiac development and/or functional maturation. In order to discriminate between these two hypotheses, we tested if different components of these pathways interacted genetically. Single heterozygotes for loss of function alleles show dosage-dependent effects of snoo and sog on several phenotypes, providing an independent confirmation of their involvement in several cardiac traits (Figure S4). Importantly, compared to each single heterozygotes, snooBSC234/ sogU2 double heterozygotes flies showed non additive SI phenotypes (two-way ANOVA p val: 2,1 10-7) suggesting a genetic interaction (Figure 2E and Figure S4A). It is worth noting however that snoo is also a transcriptional repressor of the BMP pathway (PMID: 16951053). The effect observed in snooBSC234/ sogU2 double heterozygotes can therefore alternatively arise as a consequence of an increased BMP signaling without affecting the activin pathway. We thus tested other allelic combinations for loss of function alleles of BMP and activin pathways. babo/tkv heterozygotes (respectively activin and BMP type 1 receptors) displayed non additive ESD and EDD phenotypes (Figure S4C). Synergistic interaction of BMP and activin pathways was also suggested by the analysis of fractional shortening in loss of function mutants for babo and dpp, the BMP ligand (Figure S4B). Of note, babo/tkv double heterozygotes also displayed a tendency to non-additive effects on FS albeit non-significant (two-way anova p= 0,054). In addition, mad/smox heterozygotes (specifc downstream TFs of BMP and activin pathways) displayed non-additive effects on several traits, including phenotypes related to rhythmicity (HP, SI, DI) and contractility (ESD and EDD) (Figure S4D). Altogether, cardiac performance in response to allelic combinations of activin and BMP supported a coordinated action of both pathways in the establishment and/or maintenance of cardiac activity. This was further supported by the observation that simple heterozygotes for the tested loss of function alleles displayed similar trends with respect to cardiac performance, irrespective of the pathway considered (dpp, tkv and mad for BMP; babo and smox for activin). Indeed, they displayed either no effect or increased fractional shortening and rhythmicity phenotypes (HP, DI, SI, AI), and decreased cardiac diameters (ESD and EDD). This suggests coordinated activity of both pathways. Importantly, the genetic interactions were tested using amorphic alleles that lead to systemic loss of function. The observed phenotypes may thus not unravel cardiac specific effects of the pathways. In support of this, mad cardiac specific RNAi knock down was tested (see below, Figure S5) and lead to a decreased HP, DI, SI and FS whereas heterozygotes for mad12 have either no (FS) or opposite (HP, DI, SI) effect on these phenotypes (Figure S4D). Inversely, mad RNAi caused a significant increase in AI whereas mad12 had no effect. However, heart specific dpp RNAi knock down (Figure S4E) lead to similar phenotypic trends compared to dppd14 (increased HP, DI, SI, decreased EDD and ESD) with the notable exception of FS which was reduced following cardiac specific KD (Figure S4E), but increased in dppd14heterozygotes (Figure S4B). Taken together, these data point to a complex picture of TGFb pathway activity in regulating cardiac performance, involving both the activin and the BMP pathways as well as gene specific effects with both systemic and tissue-specific contributions.”

      *Minor Comments: *

      * There is an enormous amount of data in this paper, but there are places where things are summarized a little too briefly. For example, there are no definitions given at the beginning of the Results section for traits like "Heart Period" or "Systolic Interval," which would make this work significantly more accessible for other Drosophila researchers. (They do touch on this when they explain later in the paper that certain variants are "associated with quantitative traits linked to heart size and contractility" but more background earlier would be helpful.) When we consider heart performance traits, what is the baseline from known mutants? In other words, where is the line between variation and defect? *

      Answers:

      • We have detailed the description of the traits analyzed at the beginning of the result section. We hope this improves the ease of reading in the direction suggested by the reviewer. “7 cardiac traits were analyzed across the whole population (Dataset S1 and Table 1). As illustrated in Figure 1A, we analyzed phenotypes related to the rhythmicity of cardiac function: the systolic interval (SI) is the time elapsed between the beginning and the end of one contraction, the diastolic interval (DI) is the time elapsed between two contractions and the heart period (HP) is the duration of a total cycle (contraction + relaxation (DI+SI)). The arrhythmia index (AI, std-dev(HP)/mean (HP)) is used to evaluate the variability of the cardiac rhythm. In addition, 3 traits related to contractility were measured. The diameters of the heart in diastole (End Diastolic Diameter, EDD), in systole (End Systolic Diameter, ESD), and the Fractional Shortening (FS), which measures the contraction efficacy (EDD-ESD/EDD).“

      • With respect to the baseline of cardiac performance, there is no simple answer. The baseline is influenced by the genetic background and the experimental conditions. This is the reason why any analysis of mutants or RNAi is conducted in comparison with its own control, analyzed at the same time. Concerning the DGRP lines, no baseline can be defined, since the objective is to measure the diversity of cardiac performance traits within a natural population.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Saha et al. have conducted a robust analysis of genes involved in cardiac function in the fruit fly by analyzing heart traits amongst the sequenced inbred lines of the Drosophila Genetic Reference panel. Using seven quantitative traits they identified hundreds of genes associated with natural variation of heart function in the young adult fly. Among the most highly represented groups of genes in their study are transcription factors, and subsequent analysis of SNPs demonstrated that natural variations were frequently found in the vicinity of transcription factor binding sites. Moreover, these transcription factors had already been shown to be associated with variations in heart function. This analysis underscored the importance of transcriptional regulatory networks in heart function. The authors used a heart-specific Gal4 line to drive RNAi knockdown of multiple candidate genes from their quantitative analysis, and showed that knockdown frequently led to cardiac defects. This analysis revealed an interaction between the Activin and BMP signaling pathways in heart activity, a surprising finding given that previous data had shown these pathways to be antagonistic. The authors go on to identify additional genes involved with within-line variation, these genes were also enriched for transcriptional regulators. Finally, the authors identify the human orthologs of their GWAS-associated genes and demonstrate that the genes Stripe and pox meso are associated with increased heart rate.

      Major Comments:

      There is an assumption in the use of RNAi knockdown to validate the genes identified in the quantitative analysis, and that is that natural variants are themselves hypomorphic. It is possible that among the variants identified some are hypermorphic, or among the transcription factor binding sites that variants lead to increased factor binding. While RNAi knockdown is an excellent choice to begin validation, I do not think the authors can rule out that a gene not functionally validated by their RNAi tests does not have a role in cardiac function.

      After performing RNAi knockdown to validate genes identified by GWAS the authors focus on the TGFbeta signaling pathway for downstream analysis. To do so they examine heterozygotes for sog, a repressor of BMP signaling, and snoo, an activator of Activin pathway. The data from the snoo/sog heterozygote is compelling in its disruption of heart phenotypes, and the authors conclude a "coordinated action of activin and BMP." snoo, however, also works as a transcriptional repressor in the BMP pathway, so it's possible that the effects the authors are seeing here could be confined to an increase in BMP signaling. Unlike snoo and sog, mutations in babo and dpp are both expected to have negative effects on Activin and BMP signaling, respectively. The babo/dpp interaction is not as quantitatively convincing as the snoo/sog data, despite the integral roles both babo and dpp play in their respective pathways. If both pathways are connected, why do snoo/sog heterozygotes affect SI phenotypes, while babo/dpp heterozygotes affect fractional shortening? I think the authors data suggest an interesting potential interaction between these pathways, which could be confirmed by examining further mutant combinations, knockdowns or increased expression transgenes, but falls short of a "confirmed synergystic genetic interaction." It does, however, underscore the value of the data in the paper for opening up new avenues for future study.

      Minor Comments:

      There is an enormous amount of data in this paper, but there are places where things are summarized a little too briefly. For example, there are no definitions given at the beginning of the Results section for traits like "Heart Period" or "Systolic Interval," which would make this work significantly more accessible for other Drosophila researchers. (They do touch on this when they explain later in the paper that certain variants are "associated with quantitative traits linked to heart size and contractility" but more background earlier would be helpful.) When we consider heart performance traits, what is the baseline from known mutants? In other words, where is the line between variation and defect?

      Significance

      This novel and thorough study is characterized by meticulous data analysis and demonstrated functional significance. The lists of genes represent a wealth of information to begin to understand the etiology of cardiac performance variations, to understand the networks regulating cardiac function and to identify potential human-disease related alleles. It complements GWAS studies done on human populations (such as Arking et al., 2014). It is a fantastic example of the strength of the DGRP and GWAS as tools to understand complex traits. This data will be an important resource for researchers studying cardiac performance in any organism.

      The reviewer is a Drosophila geneticist with expertise in mesodermal development and gene regulatory networks.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This is a well-written, very comprehensive manuscript that aims to take advantage of the Drosophila Genetic Reference Panel (DGRP) to identify GWAS associated with natural variation of cardiac traits. The authors carefully analyzed the genetic architecture of these natural variation of cardiac performance in the fly, validated a few in both the live fly heart and human iPSC-CMs. Through these efforts, the authors suggested the value of this fly "cardiac performance GWAS" resource and future impact on identifying new gene regulators and pathways critical for heart development and function. This study is very unique and valuable, representing the first effort of such kind in the field. The authors fully leveraged the powerfulness of model organism such as fly to offer a refreshing view on how genetic variants and their interactions potentially contribute to differences in heart performance. Quantitative genetics, functional annotations, and network analyses are thorough and rigorous. The results could be valuable to the community and trigger many follow-up studies. Below are some suggestions for the authors to consider to further strengthen their manuscript:

      1. It will be interesting to compare this fly GWAS to human heart disease GWAS data (for example, cardiomyopathy, arrhythmia, heart failure) from patients. Such cross comparison could make the data set more valuable.
      2. RNAi is the only experimental approach in this manuscript to validate the functional significance from data analyses. Authors may consider using genetic mutations such as deficiency lines or P-element lines to offer an alternative approach. This is simply a suggestion to improve the rigor and reproducibility, not absolutely required.
      3. In order to validate the roles of predicted TF binding sites, the best approach would be introducing point mutations using CRISPR/Cas9 within the binding motif then testing out molecular and physiological outcomes. Rather authors chose to test indirectly to knock down those TFs. If so, authors need to at least acknowledge the potential caveats of such approach and the limitation in related data interpretation.
      4. hiPSC-CM data is somewhat limited by only showing the HR and AP duration data. It is recommended to include some immunocytochemistry data to show the morphology, sarcomere structure of these hiPSC-CMs. Gene expression data generated by qPCR or RNA-seq in particular focusing CM structure and function genes would be helpful too.

      Significance

      This study is very unique and valuable, representing the first effort of such kind in the field. The authors fully leveraged the powerfulness of model organism such as fly to offer a refreshing view on how genetic variants and their interactions potentially contribute to differences in heart performance. The results could be valuable to the community and trigger many follow-up studies.

      This reviewer was trained as a fly geneticist during PhD, then a stem cell biologist using hiPSC-CM during postdoc. As a PI, this reviewer has ample experience dealing with genomics, genetics data related to CV biology or disease.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Saha et al. investigated the natural variation and new genetic mechanisms underlying cardiac performance using a collection of inbred Drosophila Genetic Reference Panel (DGRP). Through GWAS analysis, the authors identified more than 500 unique variants associated with 7 cardiac performance traits. These variants are mapped to 332 genes, and located mostly in the 1Kb upstream regions of the TSS. The authors have also functionally verified 42 candidate genes and the knockdown of 38 of them results in cardiac performance defects, including genes in TGF-beta pathway. Finally, the authors examined two novel genes, poxm/PAX9 and sr/EGR2, in both flies and human iPSC-derived cardiomyocytes, which revealed conserved mechanisms for cardiac function across species. I only have minor concerns, as listed below.

      1. The authors utilized the RNAi-mediated knockdown approach in their functional validation studies. It is not clear how each genetic variation (SNP) affects its associated genes. Could some of the SNPs activate the candidate gene expression? For the 4 candidate genes that failed to show cardiac defects, could the overexpression of these 4 genes alter cardiac performance?
      2. babo is the type I activin receptor, not type 2.
      3. The authors show BMP and activin pathway genetically interacts to affect cardiac performance. But it is interesting to find that these interactions are in a trait-dependent manner. For example, it seems that babo and dpp epistatically interact to regulate FS, while they additively regulate HP and DI. The authors need to discuss the complex genetic interaction further.
      4. Both snoo and sog are identified from GWAS. How about babo and dpp? Are there any identified SNPs associated with babo and dpp?
      5. It is unclear why the mad KD behaves oppositely to dpp mutant, although both proteins are involved in BMP pathway. In Figure S5, the mad KD shows reduced FS and HP, but dpp LOF mutant shows increased FS and HP (Figure S4). Can the authors perform RNAi to knockdown dpp specific in the heart to reexamine the role of dpp in the regulation of cardiac function. The whole body LOF mutant dpp-d14 might not target cardiac tissue directly to control heart performance like mad KD.
      6. The authors selected two novel genes to study the conversed regulation in both flies and human iPSC cells. Besides testing these novel genes, the authors should also verify whether the conserved pathways, like TGF-beta, regulate heart performance in human iPSC cells similar to the flies.

      Significance

      Overall, the manuscript is well written and the experiments are well-thought-out. The newly identified mechanisms in this study will provide important insights into the genetic architecture of the complex cardiac performance traits.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      **Note:** This preprint has been reviewed by subject experts for *Review Commons*. Content has not been altered except for formatting.


      Reply to the reviewers

      **General Statements [optional]**

      Please find bellow the preliminary revision plan for our manuscript entitled “Recognition of copyback defective interfering rabies virus genomes by RIG-I triggers the antiviral response against vaccine strains” by Wahiba Aouadi et al. (RC-2022-01386). Reviewer’s’ comments/questions and suggestions are represented in blue in the text.

      **Description of the planned revisions**

      We thank the Reviewer#1 for underlining that identification of rabies virus 5’ copy-back DI genomes as “presumably bound to RIG-I is a useful advancement”, her/his interest in “observed difference between the responses to the two strains of virus” (THA strain and the vaccine SAD strain), and for emphasizing that “identification of the rabies viral RNAs that activate RIG-I is a significant finding for the rabies specialists”.

      Reviewer#1: In more details for the Evidence, reproducibility and clarity (Required) Much of the studies relied on weak methodologies. For example, in Fig 1, reporter assays were used, instead of measuring IFN mRNA levels; it is also not clear what is the nature of the promoter driving the reporter. Is it ISRE, which responds to IFN or is it the IFNb promoter, which responds to transcription factors activated by RIG-I? It is also not clear what is the nature of the RNA that was transfected. Is it total RNA from infected cells or is it purified viral RNA? No matter what, these results are quite predictable from the literature.

      Regarding the Reviewer #1 comment on type-I IFN cell report results “relying “on weak methodologies”, we would like to recall that to provide pieces of evidence that RIG-I-specific RNA ligands are produced during the infection with rabies virus we used several previously validated technics: - i) Fig.1A: transfected into ISRE-reporter cell line (ISRE, which responds to IFN) that is a classical validated tool to efficiently detect ISRE-activation even upon transfection of low quantities of immunoactive RNA ligands (PMID: 28768856, PMID: 27011352, PMID: 24098125, PMID: 29996094, PMID: 31761719, PMID: 23595062). - ii) Fig1B: Cell overexpressing LGP2 approach that has been previously developed and validated (Sanchez et al., 2019). LGP2 overexpressing cells provide a possibility to functionally distinguish between RIG-I and MDA5-driven activation of type-I IFN signaling. As noted in the corresponding figure legend in this experiment, the IFN-b promoter-reporter assay was used (“which responds to transcription factors activated by RIG-I”). - iii) Fig.1C -similar to Fig1A experiments performed in ISRE-reported cell lines partially depleted in either RIG-I or MDA5 (siRNA-based approach) to complement the Fig. 1B with additional functional validation using siRNAs.

      We apologize that we haven’t provided a detailed explanation for the origin of transfected RNA used all through Fig. 1. In the revised Fig1 we will correct the figure legend to explain the origin of total RNA used in experiments: Total RNA purified from SK.N.SH cells infected with THA or SAD for all experimental approaches presented in the figure. Moreover, as suggested by Reviewer#1 for Fig.1 we will add experiments measuring IFN-b mRNA by RT-qPCR.

      Referee #1

      Evidence, reproducibility and clarity

      A lot of effort was devoted to distinguish between RIG-I and MDA5 as the receptor of rabies viral RNA producing conflicting results from the binding assays and the reporter assays.

      This comment of Reviewer#1 is not clear to us. We have the feeling that our results do not show any conflict when analyzing the results represented in Fig. 2-3. They demonstrate that RIG-I and not MDA5 works as the key cytosolic sensor upon infection with rabies virus. Further, the apparent conflict observed by the Reviewer#1 about the fact that we failed to detect any specific RABV RNA ligands upon infection with THA strain (Fig.3A) while significant enrichment of immunoactive RNA ligands on RIG-I (Fig.2C) were observed can be easily commented and explained. We proposed in the revised version of our manuscript to discuss the possibility and to provide the results showing that enrichment in 5’PPP endogenous RNA ligands on RIG-I upon infection with THA RABV could explain the results observed on Fig.2C /Fig.3A. Indeed, in our recently accepted for publication study, we observed that a large spectrum of RNA virus infections leads to the mobilization of endogenous RNA ligands (transcripts of RNA Polymerase III) on RIG-I (https://www.cell.com/iscience/fulltext/S2589-0042(22)00871-9). Furthermore, we observed that upon infection Polymerase III transcripts can activate RIG-I signaling pathways even in the absence of RIG-I-specific viral RNA ligands. To address this possibility in the revised manuscript, we propose to perform additional analysis of our RNAseq results to demonstrate enrichment of endogenous RNA ligands on RIG-I in rabies virus-infected cells.

      Significance

      Conceptually, the paper does not add much to the literature. As pointed out by the authors, RIG-I-specific partners had been identified before for many RNA viruses including other rhabdoviruses.

      We additionally underline that although there is a slowly growing number of studies characterizing RLR-specific RNA ligands directly from infected cells with a slowly growing number of characterized viruses, to our knowledge our study provides the first characterization of RLRspecific RNA ligands in Rabies virus-infected cells and that the amount of these ligands differs between wild type viruses and vaccine strains. Furthermore, none of the previously published studies on Rabies virus used similar experimental approaches. We believe that only stepwise characterization of RLR-specific RNA ligands for different RNA virus families is fully original regarding rabies virus and will further provide a wider and more fundamental vision on the distribution of RIG-I and MDA5 specificities for sensing RNA viruses.

      Referee #3

      Evidence, reproducibility and clarity

      We thank Reviewer#3 for stressing that our “study is highly significant for understanding virus sensing mechanisms and to inform understanding of vaccine actions.” For the Reviewer#3 specific comments:

      The signaling analyses is focused on ISRE/promoter induction, which is several steps downstream from RIG-I. An more comprehensive signaling analysis is required to define the RLR pathway engagement, including examination of RIG-I binding to MAVS, IRF3 activation induced by viral RNA and recovered RIG-I or MDA5 ligands, and induction of IRF3-target gene expression (such as RSAD, IFI44, IFIT1, IFIT2) and interferon-stimulated gene (ISG) expression such as Mx1, Mx1, OAS, etc.

      We thank Reviewer#3 for his comments and also appreciate that additional characterization of type-I IFN signaling pathway activation by RABV RNA will deeper our research results. We will add additional experimental results to answer the comments suggested by the Reviewer#3 for each Figure, as presented below:

      Figure 1. RLR activation readout here relies exclusively on promoter/reporter assay. Assessment of endogenous IRF3, IRF3-target gene expression, and ISG expression needs to be included. Also, what are the dynamics of RLR signaling activation during infection over a time course? This is important to know and to associate with the accumulation of the cb RNAs.

      We will perform additional transfection of total RNA purified from SK.N.SH cells infected with THA or SAD to HEK293T (or other relevant cells) to detect by WB analysis the phosphorylation of IRF3. As suggested by the Reviewer#3 we will also perform gene expression analysis targeting RSAD, IFI44, IFIT1, IFIT2. Additionally, kinetics of the SK.N.SH cells infection with THA or SAD strains of RABV will be studied to detect the accumulation of 5’cbDI genomes during the infection as suggested at the second part of the comment by the Reviewer#3.

      Figure 2. The RLR-bound RNA signaling analysis is incomplete. The authors need to include analysis of IRF3 and gene expression as noted above. Also, the authors should assess RLRbound RNAs collected over a time course of infection, thus enabling an understanding of the temporal dynamics of RLR ligand and biological activity of this virus-host interaction.

      In order to reply to this comment we will provide additional characterization of type-I IFN signaling in ST-RLR cells infected with THA and SAD, comparing to the mock-infected cells. For this, we will perform western blot analysis of IRF3P in total protein lysates and carry additional analysis of our NGS data to visualize ISG expression profiles in the same conditions (THA, SAD, and mock). Unfortunately, it will be experimentally difficult to assess RLR-bound RNAs collected over a time course of infection. However, as our NGS analysis demonstrated accumulation of 5’cb DI RNA as specific RNA ligands of RIG-I, we can follow the kinetics of accumulation of these 5’cb DI RNAs in SK.N.SH and ST-RLR cells as described above in response to the Fig.1 comment of the Reviewer#3.

      Figure 3. These are strong data sets and are convincing. For panel C, one can see several RIGI-bound peaks. The authors should provide more information on the length of these peaks, please include in Table 1. Also for MDA5 there also are peaks but the histogram is saturated. The peaks and valleys need to be delineated, ideally in a large table. The needs to be confirmation of these motifs or RNAs as actually binding to RIG-I and MDA5. This binding activity needs to be shown in gel-shift assay or other suitable approach of direct RIG-I binding of specific RNAs produced in vitro corresponding to mapped regions shown in the figure 3. Also, a more careful analysis of MDA5-assocaited RNA needs to be conducted to ascertain if it has immune stimulatory/signaling activity. By assess IRF3 activation this activity might be identified.

      Based on the Reviewer#3 suggestions for the Fig.3C we will additionally summarize in Supplementary Table 4 RNA reads that are represented as enriched on RIG-I for the 5’ part of the RABV genome. Indeed, the full-length genome binding to MDA5 was observed for RNA- reads importantly in SAD-infected cells. However, we believe that how encapsidated full-length viral genome can still be detected by MDA5 in virus-infected cells needs to be addressed in a separate study. Additional experiments for detecting the IRF3 activation in ST-RLR cells will be performed as described above.

      Figure 4: VERY important: Do these RNAs bind to RIG-I in vitro, and do they activate IRF3 when transfected into cells, what is the role of 5'ppp in this activity?? These data are needed to make the strong conclusions stated by the authors.

      We are grateful to Reviewer#3 suggestions for Fig.4. We will address whether the detected RABV 5’cb DI RNAs are specific RIG-I ligands. We will synthetize and transfect these RNA molecules and study how efficiently they activate type-I IFN signaling (by IFN-b and ISRE reporter approaches as well as by gene expression assay analysis as suggested in Fig.1 by Reviewer#3). We will also address IRF3P efficiency upon cell transfection with DI-2170 and DI-1668. As controls, we will use previously described RIG-I/MDA5-specific RNA ligands and treat RNA transcripts with calf intestine alkaline phosphatase (CIP) to remove 5’ppp groups.

      **Description of the revisions that have already been incorporated in the transferred manuscript**

      No revisions have already been incorporated in the transferred manuscript.

      **Description of analyses that authors prefer not to carry out**

      As described above to answer to the Reviewer#3 suggestion, how encapsidated full-length viral genome can still be detected by MDA5 in virus-infected cells needs to be addressed in a separate study.

      Referee #2

      Evidence, reproducibility and clarity

      We thank the Reviewer#2 for underlining that our study “shed light on the RLR recognition of RABV RNAs upon infection” and that our study “clarify the mechanism of cellular immunity differences between RABV pathogenic strain and vaccine attenuated strain. Reviewer#2 suggested to “verify whether the difference in this mechanism is caused by the difference in the viral genome, whether the N gene or L gene of the two can be exchanged by reverse genetics, and then infect the cells to verify whether the 5'cb DI genomes can be generated just as this paper.”

      We agree with Reviewer#2 that applying reverse genetics for RABV genome by exchanging N and L genes could provide a more in-depth characterization of 5’cb DI generation and pathogenicity of RABV. However, these additional experiments cannot be provided within the scope of this paper and will take time for the revision process. We believe, that this question needs to be addressed in a separate study by exchanging either N and L genes using reverse genetics.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Bourhy and colleagues present their study focused on defining how rabies virus (RV) vaccine strains trigger RIG-I innate immune signaling. Triggering RIG-I or MDA5 leads to innate immune activation, and in vaccinology this is an important component of immune adjuvant actions that serve to overall enhance vaccine immunity. The group applied in vitro infection and RNA analyses including assessment of RLR-dependent signaling by RNAs recovered from infected cells, and RNAseq of RLR-associated RNA from virus-infected cells, showing that RIG-I binds to copyback (cb) RNAs of defective interfering (DI) genomes produced during replication by the RV vaccine strain SAD but not by THA strain which likely does not produce the cb RNAs. The study extends previous work showing that RIG-I senses RV RNA to now show That RIG-I binds to the cb RNAs. Data on MDA5 is included to show that MDa5 is bound across the RV negative strand but the RNA recovered from MDA5 in infected cells does not stimulate innate immune signaling.

      Specific comments:

      The signaling analyses is focused on ISRE/promoter induction, which is several steps downstream from RIG-I. An more comprehensive signaling analysis is required to define the RLR pathway engagement, including examination of RIG-I binding to MAVS, IRF3 activation induced by viral RNA and recovered RIG-I or MDA5 ligands, and induction of IRF3-target gene expression (such as RSAD, IFI44, IFIT1, IFIT2) and interferon-stimulated gene (ISG) expression such as Mx1, Mx1, OAS, etc.

      Figure 1. RLR activation readout here relies exclusively on promoter/reporter assay. Assessment of endogenous IRF3 , IRF3-target gene expression, and ISG expression needs to be included. Also what are the dynamics of RLR signaling activation during infection over a time course? This is important to know and to associate with the accumulation of the cb RNAs.

      Figure 2. The RLR-bound RNA signaling analysis is incomplete. The authors need to include analysis of IRF3 and gene expression as noted above. Also, The authors should assess RLR-bound RNAs collected over a time course of infection, thus enabling an understanding of the temporal dynamics of RLR ligand and biological activity of this virus-host interaction.

      Figure 3. These are strong data sets and are convincing. For panel C, one can see several RIG-I-bound peaks. The authors should provide more information on the length of these peaks, please include in Table 1. Also for MDA5 there also are peaks but the histogram is saturated. The peaks and valleys need to be delineated, ideally in a large table. The needs to be confirmation of these motifs or RNAs as actually binding to RIG-I and MDA5. This binding activity needs to be shown in gel-shift assay or other suitable approach of direct RIG-I binding of specific RNAs produced in vitro corresponding to mapped regions shown in the figure 3. Also, a more careful analysis of MDA5-assocaited RNA needs to be conducted to ascertain if it has immune stimulatory/signaling activity. By assess IRF3 activation this activity might be identified.

      Figure 4: VERY important: Do these RNAs bind to RIG-I in vitro, and do they activate IRF3 when transfected into cells, what is role of 5'ppp in this activity?? These data are needed to make the strong conclusions stated by the authors.

      Review cross-commenting:

      I agree completely with the comments provided by other two reviewers.

      Significance

      The study is highly significant for understanding virus sensing mechanisms and to inform understanding of vaccine actions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript entitled by"Recognition of copy-back defective interfering rabies virus genomes by RIGI triggers the antiviral response against vaccine strains",which clarify the mechanism of cellular immunity differences between RABV pathogenic strain and vaccine attenuated strain in vitro. This paper using next-generation sequencing (NGS) combined with bioinformatics tools, it was found that the RABV vaccine attenuated strain replication in vitro induces a high release of 5' copy-back defective interfering genomes, which enhances a strong antiviral response. However, RABV pathogenic strain replication in vitro is characterized by the absence of defective interfering genomes thus induces a weak RLR-mediated innate immunity antiviral response. This paper demonstrated that IFN response induced by RLR RABV RNA recognition was principally mediated by RIG-I. 5'cb DI viral genomes that enhance RIG-I detection and therefore strongly stimulate the IFN response were exclusively produced by the RABV vaccine strain. To verify whether the difference in this mechanism is caused by the difference in the viral genome, whether the N gene or L gene of the two can be exchanged by reverse genetics, and then infect the cells to verify whether the 5'cb DI genomes can be generated just as this paper.

      Review cross-commenting:

      I agree completely with the comments provided by other two reviewers.

      Significance

      This paper shed light on the RLR recognition of RABV RNAs upon infection.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Rabies virus, a rhabdovirus, is a major human pathogen and mammalian cells respond to infection by this virus by triggering type I IFN synthesis. Here, the authors report that the cytoplasmic antiviral sensor RIG-I recognizes viral RNA to initiate signaling. Moreover, the vaccine strain SAD activates RIG-I more effectively than the pathogenic strain, THA. During SAD replication, two major 5' copy-back defective interfering genomes were generated and they bound to RIG-I to activate it.

      Much of the studies relied on weak methodologies. For example, in Fig 1, reporter assays were used, instead of measuring IFN mRNA levels; it is also not clear what is the nature of the promoter driving the reporter. Is it ISRE, which responds to IFN or is it the IFNb promoter, which responds to transcription factors activated by RIG-I? It is also not clear what is the nature of the RNA that was transfected. Is it total RNA from infected cells or is it purified viral RNA? No matter what, these results are quite predictable from the literature. A lot of effort was devoted to distinguish between RIG-I and MDA5 as the receptor of rabies viral RNA producing conflicting results from the binding assays and the reporter assays. A major weakness of these experiments is in the use of convoluted cell lines which added to the weakness of the reporter assays as outlined above. Identification of the DI viral sequences that presumably bound to RIG-I is a useful advancement.

      Significance

      Conceptually, the paper does not add much to the literature. As pointed out by the authors, RIG-I-specific partners had been identified before for many RNA viruses including other rhabdoviruses. The observed difference between the responses to the two strains of virus is interesting but multiple strains need to be tested to make a meaningful interpretation of the data. Nonetheless, identification of the rabies viral RNAs that activate RIG-I is a significant finding for the rabies specialists.

      This reviewer's expertise is in antiviral innate immune response and the type I IFN system.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      This reviewer did not leave any comments

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Major points:

      1. The authors claim the Alyref and Gabpb1 were repressed by H3K9me3 in SCNT embryos, however, they didn't provide the direct evidence of H3K9me3 modification in Alyref and Gabpb1 promoter or enhancer.
      2. The authors should provide direct evidence that how Alyref and Gabpb1 regulate pluripotency, cell viability and apoptotic related genes which are essential for morula arrest in knock out embryos.

      Minor points:

      1. Figure1A, which developmental stage of SCNT or IVF embryos are used for RNA-seq?
      2. In Figure S2B, FPKM values of Alyref in SCNT embryos with TSA+VC-1 is inconsistent low. More repetitions are recommended.
      3. The samples of single embryo RNA-seq are less. More repetitions are recommended. 4.They lack the data of embryos transfer and offsprings experiment of SCNT embryos by the addition of Alyref and Gabpb1 through mRNA injection.

      Significance

      The manuscript by Ihashi et al aims to find specific genes responsible for the arrest of SCNT embryos. The authors identified Alyref and Gabpb1 by siRNA screening and verified morulae arrest phnotype in Alyref and Gabpb1 KO IVF embryos, and single embryo RNA-seq revealed that Alyref is needed for the formation of inner cell mass. The preimplantation development of cloned embryos was aided by the addition of Alyref and Gabpb1 by mRNA injection. Overall, this is an interesting study that demonstrate incomplete activation of Alyref1 and Gabpb1 will lead to preimplantation arrest of SCNT embryos. However, this is a very preliminary study that some important issues are need to address, thus, this manuscript is far from a publishable form.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The authors picked up candidate genes which might be important for SCNT embryo development based on RNA-seq data (basically genes downregulated in SCNT embryos compared to normal ones) and performed siRNA-knockdown on the candidate 15 genes. Two genes, Alyref and Gabpb1, were required for normal development. KO of each gene resulted in embryonic lethality, confirming the KD data. RNA-seq showed that Alyref KO affected lineage specification while Gabpb1 KO resulted in apoptosis. Finally the authors injected mRNA for Alyref and Gabpb1 into SCNT embryos and observed improved development.

      Major comments:

      1. The data provided are overall convincing and support the authors' conclusion. However, as the authors may understand, the paper lacks mechanistic insights into how Alyref and Gabpb1 work in early embryos. Furthermore, it is still not very clear why Alyref and Gabpb1 are downregulated in SCNT embryos. It seems that the authors speculate that somatic H3K9me3 might regulate the expression of those genes, but it is still possible that H3K9me3 just inhibits the expression of upstream regulator of Alyref and Gabpb1. The paper lacks experiments to address these possibilities.

      2. Fig. 7: Given the current status of the SCNT field, it would be important to show the birth rate of SCNT embryos upon mRNA injection.

      3. Fig. 2C-D: It would be important to know when the differential expression of Alyref or Gabpb1 protein can be seen to understand the relationship between SCNT RNA-seq data and KO embryo phenotype.

      Minor comments:

      1. State clearly in the manuscript which stage of embryos was used for RNA-seq analysis or other assays.

      2. Fig 1A: "DEGs" might be confusing. Do the authors refer to downregulated genes as DEGs?

      3. L113: State clearly what the "transcriptional activators" in this study are. Also, "Repression mix" and "Activation mix" in Fig1C are not easily understandable.

      4. Fig6A: Are the pathways indicated top significant ones? If not, the IPA result should be indicated in an unbiased manner. Also, is it meaningful to show -log10(q-value) = ~1?

      5. S6A: The stage at which the Pou5f1 signal was measured is not clearly indicated; Pou5f1 expression becomes high in blastocysts, so comparison between 72 hpi morula would be appropriate.

      6. L217, Fig 6F: Is this indeed based on unsupervised hierarchical clustering?

      7. "heterozygous mutant mice" should be used rather than "hetero mice".

      Significance

      The study is constructed based on the previous finding that Kdm4d overexpression significantly improved SCNT embryo development. However, it is still important to know why the removal of H3K9me3 can exert such an effect. The authors tackled this question and suggested that two genes (Alyref and Gabpb1) expressed upon H3K9me3 removal play important roles for SCNT embryo development. Unfortunately, while it is now widely accepted that Dux expression is important for SCNT development in the field, the authors did not test nor discuss how Dux is involved in the authors' findings. In addition, there would be many other things to be done in this study (described in major comments) to contribute to the SCNT field.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to previous review

      Reviewer____ #1 (Evidence, reproducibility and clarity (Required)):

      The authors use newly available probes to show that the phosphoinositide PI(3,4)P2 plays a previously undescribed role in FcgammaR-mediated phagocytosis. Using RAW macrophages, they show that PI(3,4)P2 is enriched at the plasma membrane and also at phagocytic cups internalizing IgG-opsonized sheep red blood cells. Pharmacological inhibition using wortmannin, and also expression of membrane-targeted INPP4B phosphatase showed that PI(3,4)P2 production depends on PI3 kinase activity. Further experiments using selective inhibitors showed that PI(3,4)5P2 is mainly derived by dephosphorylation of PI(3,4,5)P3, likely by multiple phosphatases such as SHIP1 or OCRL. Depletion of PI(3,4)P2 at the plasma membrane by INPP4B also resulted in strongly decreased internalization of red blood cells, although their attachment to macrophages seemed unaltered, pointing to defects in particle engulfment. The authors then tested the potential role of lamellipodin, one of the few known PI(3,4)P2 specific effectors. Lamellipodin was found to be enriched at phagocytic cups, and this enrichment was shown to be dependent on the presence of PI(3,4)P2, by targeting of INPP4B to the plasma membrane. Macrophages depleted of lamellipodin by shRNA treatment showed reduced phagocytic efficiency and also aberrant phagocytic cup formation. As VASP is a known binding partner of lamellipodin and involved in actin polymerization, the authors next tested its potential involvement. Overexpression experiments showed that VASP colocalizes with lamellipodin at phagocytic cups. Sequestering of VASP at mitochondria through a respective construct containing the VASP binding site of ActA, together with a mitochondrial targeting sequence, showed that this also results in incompletely formed phagocytic cups and reduced phagocytic efficiency. Similar effects were observed upon expression of a lamellipodin construct with mutated binding sites for VASP. Collectively, the authors propose that PI(3,4)P2 is localized produced at phagocytic cups through the sequential activity of PI3 kinase and PI5 phosphatase, that it recruits lamellipodin and its binding partner VASP, and that this cascade is necessary for proper phagocytic cup formation and closure and thus phagocytic capacity of cells. This is an interesting study that uncovers a novel role for PI(3,4)P2 in phagocytic cup formation and closure. It is very well controlled, and the claims of the study are supported by the presented data. Statistical analysis is sound.

      Major comments: 1) The localization of VASP at phagocytic cups is only shown by overexpression of constructs. Endogenous staining of VASP should support this finding.

      We agree with the reviewer that localization of the endogenous VASP would strengthen our conclusions. We have therefore performed the suggested experiments and in the revised manuscript include a new panel (E) in the revised Figure 6 showing immunostaining of endogenous VASP during phagocytosis. The result confirms the localization of the GFP-chimeric protein.

      2) It is unclear whether the roles of PI(3,4)P2, lamellipodin, and VASP are restricted to FcgammaR-mediated phagocytosis. Their potential involvement in CR3-mediated phagocytosis should be discussed or addressed in a basic set of experiments.

      In the revised manuscript we have extended our original observations to analyze also CR3-mediated phagocytosis, as recommended by the reviewer. A new supplemental figure (Supplemental Figure 10) now documents that PtdIns(3,4)P2 is also accumulated and Lamellipodin and VASP are recruited to the phagocytic cup during CR3-mediated phagocytosis. These results imply that the role of this lipid and its effectors extend to other modes of phagocytosis. These new observations are discussed on Page 14 of the revised manuscript.

      Minor comments: 1) A very recent study (Körber and Faix, EJCB, 2022) describes the role of VASP in macroendocytosis in Dictyostelium. Specifically, VASP is found to be important for proper cup closure. The results are of direct importance to the current study and should be cited accordingly.

      Thank you for bringing this study to our attention. We now discuss the findings of Körber & Faix on Page 9 of the revised text.

      2) direct labelling of the figures would be helpful in assessing the manuscript

      To facilitate re-assessment of the paper, we have added the Figure numbers directly to the individual figures in the manuscript as suggested.

      Reviewer #1 (Significance (Required)): This study highlights the role of an underappreciated phospholipid in phagocytosis. It also describes for the first time a role for lamellipodin in formation of phagocytic cups and confirms the recent finding that also VASP is necessary for phagocytic cup closure. The paper should be of interest to researchers working on host-pathogen interaction, regulation of the actin cytoskeleton, and also to the general cell biological community

      Reviewer´s expertise: Actin regulation Microtubule-based transport Adhesion, migration, invasion Phagocytosis

      We thank the reviewers for his/her comments and suggestions that have clearly improved the manuscript.

      Reviewer____ #2 (Evidence, reproducibility and clarity (Required)):

      Review of Montano-Rendon et al: 'PI(3,4)P2, Lamellipodin and VASP coordinate cytoskeletal remodeling during phagocytic cup formation in macrophages'

      The authors employ biosensors for PI(3,4)P2 in RAW 264.7 macrophages to identify localized pools of PIP2 that were sensitive to INPP4B and wortmannin (Fig1). The biosensors for PIP2 are enriched on the forming phagocytic cup (Fig2, movies) in these macrophage cells. Inhibitors for PI3K blocked the recruitment of this biosensor to the membrane.

      Overall, the data are clear with the exceptions noted below. Krause et al (Dev Cell 2004) published a manuscript looking at PIP2, Lpd, and VASP in non-macrophage cells (fibroblasts, HeLa, etc...) where the influence of PI(3,4)P2 and these proteins was found to regulate actin and lamellipodial membrane extensions. This study also implicated Lpd protein coordinated actin networks in the docking of pathogens such as Vaccinia virus and EPEC bacteria. Given the additional reports of these proteins participating in dorsal ruffling (Michael et al Curr Bio 2010) and invasion (Carmona et al Oncogene 2016), it comes as no surprise that they participate in phagophore formation and phagocytosis. These studies are referenced, but having this in mind does diminish the novelty of implicating Lpd and VASP in the phagocytic process, though it seems to be the first time this machinery was directly implicated in macrophage cells.

      We would like to point out that docking of viruses or dorsal ruffling are very different biological processes from phagocytosis and that the common involvement of Lamellipodin in these very disparate processes does not, in our view, detract from the novelty of our studies.

      Specific Comments Although the images and movies graphically demonstrate a PI(3,4)P2 enrichment on phagocytic structures, the authors could provide some additional images that include fluorescently tagged phagocytic cargo such as the erythrocytes used. The addition of a fluorescent marker or phase image would be especially beneficial in the experiments where a lack of cPHx-biosensor recruitment is seen to the docked phagocytic cargo.

      We thank the reviewer for this suggestion. In the revised manuscript figures now include micrographs of the fluorescently labelled particles or phase-contrast images where appropriate.

      Otherwise, readers are left with the impression that perturbations such as INPP4B compromise docking and phagocytic cup formation altogether (Fig 2C)- which is perhaps the authors point? Make this clear?

      We apologize for the ambiguity of the former version of the manuscript. Initially, we noticed that particle engulfment -which is what we believe the reviewer means by “cup formation”- was the main defect in INPP4B-CaaX expressing cells. However, since the reviewer raised the possibility, we have gone back and re-analyzed the data and found that cells expressing the INPP4B-CaaX also have a small (~35%) decrease in particle engagement/binding (Reviewer uses the term “docking”). This suggests that the plasmalemmal pool of PtdIns(3,4)P2 in resting cells supports the actin dynamics at the cell surface which allows the RAW cells to survey their immediate environment and thereby contact more potential prey. This new finding is included in and discussed the revised manuscript. We thank the reviewer for prompting us to consider this alternative mechanism.

      There has already been an implication for PI3K in the phagocytic process, perhaps verifying that initial formation/membrane extension stages of phagocytosis are impacted by targeting the D-4 position of PIP2 would be of interest?

      PtdIns(4,5)P2 is well known to be essential for actin polymerization and is increased transiently at the sites of phagocytosis (Botelho et al., 2000 J Cell Biol; Scott et al., 2005 J Cell Biol; Fairn et al., 2009 J Cell Biol.). It is not clear whether the reviewer is curious about the possible consequences of converting PtdIns(4,5)P2 to PtdIns5P prior to activation of PI3K. Whether PtdIns5P itself has biological activity is a subject of debate and, to our knowledge, its existence has not been documented at sites of phagocytosis. It is also unclear whether PtdIns5P would serve as an effective substrate for PI3K and, if so whether the putative product, PtdIns(3,5)P2 that is normally found in endomembranes, would be functionally relevant.

      Depletion of PI(3,4)P2 through the expression of the INPP4B phosphatase demonstrated a reduction in phagocytic uptake of red blood cells (Fig4). The readout for this assay relied upon what appears to be differential labeling of phagocytosed red blood cells, though there are examples of cargo that is supposedly inside the macrophages labeled in green? Perhaps the authors can reconcile this and make the methods more clear for this approach?

      Thank you for this comment; we apologize if the original text was unclear in this regard. In the revised manuscript a detailed description of the staining protocol we used to distinguish inside from outside particles is now included in the Methods section. It is also worth pointing out that the green-only SRBCs in the INPP4B-CaaX panel in Figure 3 indicate targets that were fully internalized by those RAW macrophages not expressing BFP-INPP4B-CAAX (see image below)

      Fig4 demonstrates the PI(3,4)P2 dependent recruitment of Lamellipodin (Lpd) to the phagocytic cup, which is clear. Lpd is found to be necessary for effective phagocytic uptake in Fig 5. There is no blotting/qPCR data for the verification of Lpd knockdown shown?

      RAW macrophages and other macrophage cell lines are rather refractory to transfection, resulting in only a minor (10-20%) fraction of the cells expressing transfected constructs. For this reason, immunoblotting or qPCR analyses of the entire population yield misleading results, not reflective of the comparatively small transfected sub-population of cells. To overcome this limitation, we co-transfected the shRNA-containing plasmid with a smaller amount of a plasmid containing a fluorescent protein used to identify transfectants visually (a 5:1 ratio of shRNA:EGFP). By using a 5:1 ratio of the plasmids we ensured that cells expressing the fluorescent protein had a high likelihood of also expressing the shRNA. In this manner, the Lpd-depleted cells could be scored separately from the untransfected, wild-type cells following immunostaining (Supplemental Figure 5). Note that some immunostaining persisted in the Lpd-silenced cells, in all likelihood because some of the antibody binding is nonspecific, as is commonly seen in immunostaining. Nevertheless, the data indicate that substantial silencing of Lpd is achieved when transfecting the shRNA.

      The authors demonstrate a co-localization of Lpd/VASP proteins at the phagocytic cup of these macrophages in Fig 6 and sequester VASP protein to the mitochondria with some ActA derived fusion proteins to functionally block phagocytosis. The functional interaction of Lpd/VASP is further explored with experiments utilizing Ena/VASP mutants in Fig7, demonstrating a dependence on this interaction to promote phagocytic uptake.

      Reviewer #2 (Significance (Required)): see above

      Reviewer____ #3 (Evidence, reproducibility and clarity (Required)):

      In this study, Montano-Rendon and colleagues address the role of phosphatidylinositol (3,4)-bisphosphate in phagocytosis by RAW macrophages. Using small molecule inhibitors, they show that dephosphorylation of PI(3,4,5)P3 is the main source of PI(3,4)P2 in phagocytosis. Using an elegant approach based on overexpression of a PI(3,4)P2-specific phosphatase, they show that the selective depletion of PI(3,4)P2 impairs phagosome formation. Moreover, they identify two PI(3,4)P2 interacting proteins involved in phagocytosis: lamellipodin and VASP. They show that shRNA silencing of lamellipodin arrests phagocytosis, as well as mistargeting of VASP to mitochondria by a fusion protein. Overall, this is a high-quality study, well designed and written. I hence support publication, and only have a few relatively minor comments that the authors should consider as I believe it would improve the quality of the manuscript.

      The role of PI(3,4)P2 in the actin organisation in phagocytosis has been shown previously in various studies, see for example PMID: 16418223, 27806292 and review 32296634. In these studies, different mechanisms have been proposed of how PI(3,4)P2 affects the cytoskeleton and phagocytic process. It would be good to discuss how the findings with lamellipodin and VASP relate to these previously described mechanisms.

      We now include and discuss the references recommended by the reviewer to highlight that the importance of PtdIns(3,4)P2 extends to dendritic cells and HL60 neutrophils.

      In figure 6, a role for VASP in phagocytosis is shown by mistargeting it to mitochondria using a fusion protein consisting of a VASP binding region and a mitochondrial targeting motif. While this is an elegant approach, I wonder why not simply shRNA is used, similar to lamellipodin?

      We decided to use this approach because macrophages (including RAW cells) express other members of the Ena/VASP family of proteins such as EVL (Coppolino et al., 2001 J Cell Sci) that could potentially substitute for VASP; simultaneously silencing multiple, distinct members of the Ena/VASP family poses an experimental challenge. Moreover, in our experience introducing siRNA into RAW cells, even when using electroporation, is often insufficient to generate robust silencing of certain genes (e.g. Levin-Konigsberg, et al., 2019 Nature Cell Biology). Thus, we took advantage of the robust, more globally effective ActA-based molecular tool. To demonstrate its effectiveness, we now include a new Supplemental figure (Supplemental Figure 6, reproduced below) using immunostaining that shows how virtually all of the endogenous VASP is sequestered to the surface of mitochondria when the MITO-FP4 is expressed.

      Supplemental Figure 6. MITO-FP4 targets endogenous VASP to the Mitochondria

      In figure 3A: How was the inside-outside staining performed? I cannot find this information in the Methods.

      We apologize for the omission. The inside/outside staining protocol is now detailed in the Methods section of the manuscript.

      Figures are overall good quality. However, in figure 1,2, and 4 individual cells are shown in the graphs, whereas figures 3, 5, 6 and 7, and the supplementary figures only show averages with bar graphs. Please change these graphs to all show individual cells, as this will allow to see the variation among cells.

      Thank you for the suggestion. The graphs have been modified to violin plots to show the variation and distribution of results amongst the individual cells and experiments.

      Reviewer #3 (Significance (Required)):

      see above

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Montano-Rendon and colleagues address the role of phosphatidylinositol (3,4)-bisphosphate in phagocytosis by RAW macrophages. Using small molecule inhibitors, they show that dephosphorylation of PI(3,4,5)P3 is the main source of PI(3,4)P2 in phagocytosis. Using an elegant approach based on overexpression of a PI(3,4)P2-specific phosphatase, they show that the selective depletion of PI(3,4)P2 impairs phagosome formation. Moreover, they identify two PI(3,4)P2 interacting proteins involved in phagocytosis: lamellipodin and VASP. They show that shRNA silencing of lamellipodin arrests phagocytosis, as well as mistargeting of VASP to mitochondria by a fusion protein. Overall, this is a high-quality study, well designed and written. I hence support publication, and only have a few relatively minor comments that the authors should consider as I believe it would improve the quality of the manuscript.

      The role of PI(3,4)P2 in the actin organisation in phagocytosis has been shown previously in various studies, see for example PMID: 16418223, 27806292 and review 32296634. In these studies, different mechanisms have been proposed of how PI(3,4)P2 affects the cytoskeleton and phagocytic process. It would be good to discuss how the findings with lamellipodin and VASP relate to these previously described mechanisms.

      In figure 6, a role for VASP in phagocytosis is shown by mistargeting it to mitochondria using a fusion protein consisting of a VASP binding region and a mitochondrial targeting motif. While this is an elegant approach, I wonder why not simply shRNA is used, similar to lamellipodin? In figure 3A: How was the inside-outside staining performed? I cannot find this information in the Methods.

      Figures are overall good quality. However, in figure 1,2, and 4 individual cells are shown in the graphs, whereas figures 3, 5, 6 and 7, and the supplementary figures only show averages with bar graphs. Please change these graphs to all show individual cells, as this will allow to see the variation among cells.

      Significance

      see above

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review of Montano-Rendon et al: 'PI(3,4)P2, Lamellipodin and VASP coordinate cytoskeletal remodeling during phagocytic cup formation in macrophages'

      The authors employ biosensors for PI(3,4)P2 in RAW 264.7 macrophages to identify localized pools of PIP2 that were sensitive to INPP4B and wortmannin (Fig1). The biosensors for PIP2 are enriched on the forming phagocytic cup (Fig2, movies) in these macrophage cells. Inhibitors for PI3K blocked the recruitment of this biosensor to the membrane.

      Overall, the data are clear with the exceptions noted below. Krause et al (Dev Cell 2004) published a manuscript looking at PIP2, Lpd, and VASP in non-macrophage cells (fibroblasts, HeLa, etc...) where the influence of PI(3,4)P2 and these proteins was found to regulate actin and lamellipodial membrane extensions. This study also implicated Lpd protein coordinated actin networks in the docking of pathogens such as Vaccinia virus and EPEC bacteria. Given the additional reports of these proteins participating in dorsal ruffling (Michael et al Curr Bio 2010) and invasion (Carmona et al Oncogene 2016), it comes as no surprise that they participate in phagophore formation and phagocytosis. These studies are referenced, but having this in mind does diminish the novelty of implicating Lpd and VASP in the phagocytic process, though it seems to be the first time this machinery was directly implicated in macrophage cells.

      Specific Comments

      Although the images and movies graphically demonstrate a PI(3,4)P2 enrichment on phagocytic structures , the authors could provide some additional images that include fluorescently tagged phagocytic cargo such as the erythrocytes used. The addition of a fluorescent marker or phase image would be especially beneficial in the experiments where a lack of cPHx-biosensor recruitment is seen to the docked phagocytic cargo. Otherwise, readers are left with the impression that perturbations such as INPP4B compromise docking and phagocytic cup formation altogether (Fig 2C)- which is perhaps the authors point? Make this clear? There has already been an implication for PI3K in the phagocytic process, perhaps verifying that initial formation/membrane extension stages of phagocytosis are impacted by targeting the D-4 position of PIP2 would be of interest?

      Depletion of PI(3,4)P2 through the expression of the INPP4B phosphatase demonstrated a reduction in phagocytic uptake of red blood cells (Fig4). The readout for this assay relied upon what appears to be differential labeling of phagocytosed red blood cells, though there are examples of cargo that is supposedly inside the macrophages labeled in green? Perhaps the authors can reconcile this and make the methods more clear for this approach?

      Fig4 demonstrates the PI(3,4)P2 dependent recruitment of Lamellipodin (Lpd) to the phagocytic cup, which is clear. Lpd is found to be necessary for effective phagocytic uptake in Fig 5. There is no blotting/qPCR data for the verification of Lpd knockdown shown? The authors demonstrate a co-localization of Lpd/VASP proteins at the phagocytic cup of these macrophages in Fig 6 and sequester VASP protein to the mitochondria with some ActA derived fusion proteins to functionally block phagocytosis. The functional interaction of Lpd/VASP is further explored with experiments utilizing Ena/VASP mutants in Fig7, demonstrating a dependence on this interaction to promote phagocytic uptake.

      Significance

      see above

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors use newly available probes to show that the phosphoinositide PI(3,4)P2 plays a previously undescribed role in FcgammaR-mediated phagocytosis. Using RAW macrophages, they show that PI(3,4)P2 is enriched at the plasma membrane and also at phagocytic cups internalizing IgG-opsonized sheep red blood cells. Pharmacological inhibition using wortmannin, and also expression of membrane-targeted INPP4B phosphatase showed that PI(3,4)P2 production depends on PI3 kinase activity. Further experiments using selective inhibitors showed that PI(3,4)5P2 is mainly derived by dephosphorylation of PI(3,4,5)P3, likely by multiple phosphatases such as SHIP1 or OCRL. Depletion of PI(3,4)P2 at the plasma membrane by INPP4B also resulted in strongly decreased internalization of red blood cells, although their attachment to macrophages seemed unaltered, pointing to defects in particle engulfment.

      The authors then tested the potential role of lamellipodin, one of the few known PI(3,4)P2 specific effectors. Lamellipodin was found to be enriched at phagocytic cups, and this enrichment was shown to be dependent on the presence of PI(3,4)P2, by targeting of INPP4B to the plasma membrane. Macrophages depleted of lamellipodin by shRNA treatment showed reduced phagocytic efficiency and also aberrant phagocytic cup formation. As VASP is a known binding partner of lamellipodin and involved in actin polymerization, the authors next tested its potential involvement. Overexpression experiments showed that VASP colocalizes with lamellipodin at phagocytic cups. Sequestering of VASP at mitochondria through a respective construct containing the VASP binding site of ActA, together with a mitochondrial targeting sequence, showed that this also results in incompletely formed phagocytic cups and reduced phagocytic efficiency. Similar effects were observed upon expression of a lamellipodin construct with mutated binding sites for VASP.

      Collectively, the authors propose that PI(3,4)P2 is localized produced at phagocytic cups through the sequential activity of PI3 kinase and PI5 phosphatase, that it recruits lamellipodin and its binding partner VASP, and that this cascade is necessary for proper phagocytic cup formation and closure and thus phagocytic capacity of cells. This is an interesting study that uncovers a novel role for PI(3,4)P2 in phagocytic cup formation and closure. It is very well controlled, and the claims of the study are supported by the presented data. Statistical analysis is sound.

      Major comments:

      1. The localization of VASP at phagocytic cups is only shown by overexpression of constructs. Endogenous staining of VASP should support this finding.
      2. It is unclear whether the roles of PI(3,4)P2, lamellipodin, and VASP are restricted to FcgammaR-mediated phagocytosis. Their potential involvement in CR3-mediated phagocytosis should be discussed or addressed in a basic set of experiments.

      Minor comments:

      1. A very recent study (Körber and Faix, EJCB, 2022) describes the role of VASP in macroendocytosis in Dictyostelium. Specifically, VASP is found to be important for proper cup closure. The results are of direct importance to the current study and should be cited accordingly.
      2. direct labelling of the figures would be helpful in assessing the manuscript

      Significance

      This study highlights the role of an underappreciated phospholipid in phagocytosis. It also describes for the first time a role for lamellipodin in formation of phagocytic cups and confirms the recent finding that also VASP is necessary for phagocytic cup closure. The paper should be of interest to researchers working on host-pathogen interaction, regulation of the actin cytoskeleton, and also to the general cell biological community

      Reviewer´s expertise: Actin regulation Microtubule-based transport Adhesion, migration, invasion Phagocytosis

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We would like to thank the reviewers for their insightful and useful comments about our manuscript. Based on these comments and as outlined in our revision plan, we plan to strengthen our findings by performing new experiments and quantitative analyses. This particularly applies to our nanoscale (dSTORM) imaging dataset which was discussed by multiple reviewers.

      We also appreciate the reviewers’ overall positive evaluation of the significance of our labeling method for the axon initial segment studies. With regards to this, we would like to highlight that this manuscript particularly addresses the labeling of “difficult-to-label” neuronal proteins, such as large ion channels and transmembrane proteins. Although we and another group have recently reported click labeling of neurofilament light chain (PMID: 35031604) and AMPAR regulatory proteins (PMID: 34795271) in primary neurons, both of these proteins have a small size between ~30-68 kDa and compared to larger ion channels/transmembrane proteins are “easier” to express in primary neurons. The novelty in the current manuscript is that we successfully applied this method for the labeling of large and spatially restricted AIS components, such as NF186 and Nav1.6 (186 and 260 kDa, respectively). As some of the reviewers also pointed out, the size and complexity of these proteins makes labeling of the AIS rather challenging. We also used our approach to study the localization of epilepsy-causing Nav1.6 variants and could exclude the retention in the cytoplasm as a possible cause of their loss of function. Finally, we improved the efficiency of genetic code expansion in primary neurons by developing AAV-based viral vectors. Although AAVs are routinely used for gene delivery to neurons, AAVs for click-based labeling need to encode multiple components of the orthogonal translational machinery for genetic code expansion. By trying different promoters and gene combinations, we developed several variants that enable high efficiency of the genetic code expansion in neurons. On their own, these findings will facilitate further genetic code expansion and click chemistry studies, beyond the labeling of the axon initial segment.

      2. Description of the planned revisions

      Reviewer #2

      • On lines 107 and 108, the sentence "The C-terminal HA-tag allowed us to detect the full-length NF-186 protein by immunostaining it with an anti-HA antibody" would have a better place just after lines 104-105 " [...] we modified the previously described plasmid (Zhang et al., 1998) by moving the hemagglutinin (HA) tag from the N terminus to the C terminus".

      We will modify the text as the reviewer suggested.

      • Fig.2b: the AnkG staining looks substantially longer than that showed in c. However, the results on AIS length show no significant changes in between the groups. This is visually misleading, the authors should choose a picture for the WT construct that is representative of the data.

      We thank the reviewer for bringing this up. We will replace the panel in Fig.2b with a more representative image of NF186 WT construct in the revised version of the manuscript.

      • Line 238: what is the rationale behind choosing these cells? For example, have they been used in other studies for similar purposes? If so, please provide the reference.

      We initially probed neuroblastoma ND7/23 which are commonly used for the electrophysiological recordings of recombinant Nav1.6 (PMID: 30615093, 22623668, 25874799, 27375106). Although we were able to record Na+ currents in those cells, only a small portion of channels was detected on the cell surface by microscopy (Suppl. Fig. 5a). As we discuss in the manuscript (lines 237-240), we then switched to N1E-115-1 cells in which we obtained a higher level of expression of the recombinant NaV1.6 channels on the cell surface (Suppl. Fig. 5b). These cells have also been previously used for the electrophysiological studies of voltage-gated sodium channels, including Nav1.6 (PMID: 8822380, 24077057). We will modify the text and include these references in the revised manuscript.

      • Figure 3c, the authors omitted the comparison with the WT construct this time, as opposed to the neurofascin experiments. What is the reason?

      As shown by others (PMID: 31900387) and us in this manuscript, one of the main issues with the expression of the recombinant NF186 in neurons was that overexpression led to mislocalization of NF186 in neuronal soma and processes. This was particularly true for WT construct and certain amber mutants (e.g. K809TAG). Based on previous reports (PMID: 31900387), we then tested a weak human neuron-specific enolase promoter. This reduced expression level and improved localization of NF186. However, since we still observed some neurons with mislocalized NF186 WT even with the enolase promoter, we found it important to quantitively compare the AIS length of WT construct and amber mutants to surrounding untransfected cells. On the other hand, since we did not have overexpression and mislocalization problem with Nav1.6 WT construct (all observed neurons have signal localizing in the AIS), we measured only the AIS length of the amber mutants. However, to avoid any confusion, we will also measure the AIS size of the neurons expressing Nav1.6 WT construct and compare it to surrounding cells and amber mutants. For this, we will need to perform new experiments and acquire new images. We will include the data in the revised manuscript.

      • Fig. 4: why did the authors chose these cells for electrophysiology experiments and not neurons? Explain the rationale in the text or, alternatively, cite similar studies using the same tool.

      Due to the branched neuronal processes which cause the space clamp problem in voltage clamp experiments with neurons, round and none-branching cells are frequently used to examine the biophysical properties of ion channels, including Nav1.6. By far, most of studies investigating the biophysical properties of NaV1.6 channels were performed in neuroblastoma cells e.g. ND7/23 and N1E-115-1 cells (PMID: 25874799; 25242737). We tested these two types of cells and found that N1E-115-1 cells supported higher expression level of the recombinant NaV1.6 channels on the cell surface than the ND7/23 cells (Suppl. Fig 5). Hence, N1E-115-1 were more suitable to get robust and reliable recordings (as we also discuss above in the response to reviewer’s comment). We will clarify this in the revised manuscript.

      • Fig.4, biophysical properties: did the authors find differences in passive properties? Measures of resting potential, membrane resistance and cell capacitance should be reported.

      Passive properties such as resting membrane potential and membrane resistance are important functional features in neurons measured in current clamp experiments, but not applicable for ND7/23 and N1E-115-1 cells used in our voltage clamp experiments. To measure the Na+ current mediated by WT or mutant NaV1.6 channels expressed in N1E-115-1 cells, the endogenous Na+ channels were blocked by tetrodotoxin and the endogenous K+ channels were blocked by tetraethylammonium chloride, CsCl and CsF in extracellular and intracellular solutions. Under these conditions, resting potential and membrane resistance are not relevant for experiments. Cell capacitance reflects the size of the cell surface area, which can affect the number of channels expressed on the cell surface. To eliminate the effect of different cell sizes, Na+ current densities normalized by cell capacitances were used in our experiments. We will report on these values in the revised manuscript.

      • Fig 4, STORM images. The periodic distribution of the dots should be enhanced with some sort of arrows or lines, for the non-specialist audience.

      Based on the comments from multiple reviewers, we plan to obtain additional dSTORM images of the neurons expressing recombinant Nav1.6 WT or amber mutants. We also intend to improve the visualization of these results by updating/modifying existing figures and including quantitative data.

      • Line 374: rat or mouse primary neurons?

      We are here referring to both, rat and mouse neurons. The images shown in Fig. 06 and Suppl. Fig. 08 were obtained from rat cortical neurons expressing Nav1.6 or fluorescent reporter. However, we were also able to successfully transduce mouse neurons with AAV92A carrying orthogonal translational machinery (data not shown). We will clarify this in the revised manuscript.

      **Referees cross-commenting**

      I fully agree with the following remarks from Reviewers #3, #4 and #5. This is a point that I have raised in my report too. The authors need better images to show the periodicity we visualization, and a quantification would be of great benefit to support the claim with numbers (and how these compare to similar studies in the literature):

      R3: 2. For the dSTORM analysis of the tagged Nav1.6 protein, I also cannot tell there is periodic organization from the image directly. Some analysis is needed there. R4: 2."As there was no obvious difference in the nanoscale organization of the NaV1.6WT 317 -HA or NaV1.6TAG 318 -HA channels (Fig. 4. e-g), these experiments confirmed that the NaV1.6 overexpression, TCO*A319 Lys incorporation, and click labeling did not affect the nanoscale periodic organization of the sodium channels in the AIS." It is clearly noticeable that for WT, the spot density is more compared to the other two mutants. Why is that so? Using cluster analysis, one can quantify spot density and discuss nanoscale organization quantitatively. The author should quantify the periodicity and compare it among different variants and with previous reports. R5: 3. The authors claim that there was no obvious difference in the nanoscale organization of the NaV1.6WT 317 -HA or NaV1.6TAG 318 -HA channels (Fig. 4. e-g), but it is hard to conclude this without any quantification and statistical analysis. Sodium channels have been shown to be associated with the membrane-associated periodic skeleton structures in neurons and average autocorrelation analysis has been developed to quantify the degree of periodicity of such structural organizations (Han et al. PNAS 114(32)E6678-E6685, 2017). The authors should use this approach to quantify and compare the average autocorrelation amplitudes.

      As we outlined in our responses to the individual reviewers’ comments below, we will address these questions by performing new experiments and quantifications.

      I also agree with these comments from Reviewers #3 and #5:

      R3: 4. It is unclear, for all the presented data, whether all the cells are collected from a single biological replicate or from multiple replicates. At least 2-3 replicates are needed to see the reproducibility in terms of labeling efficiency, and other related conclusions. R5: 1. The authors should indicate how many replicates were performed and how many cells were analyzed for each experiment.

      We thank the reviewers for bringing this up. By mistake, we omitted this important information. We will include it in the revised manuscript, but we would like to highlight here that each experiment was repeated at least 3 times.

      Reviewer #3

      1. There is some patch-like background from the 488 channel from the click reaction, some of which have very as strong signal as the staining on the neurons. What is the potential cause for this? With immunostaining on HA, the background doesn't affect too much on the image data interpretation. However, the major goal of this method development is to use it in live-cell without immunostaining. Without another reference, the high background might cause issues in data interpretation. Can the author also suggest way to avoid or lower this in the discussion?

      We thank the reviewer for bringing this up. We have occasionally observed patch-like background in what appears to be the cell debris. Such dead cells do not have an intact cell membrane and therefore can absorb cell-impermeable ATTO488-tetrazine dye during click labeling. This kind of background is also present in the control neurons transfected with the WT Nav1.6, which suggests that it originates from the UAA and tetrazine-dye accumulations. Additionally, since these patches are not visible with the immunostaining, they do not contain our protein of interest, which further confirms that they contain only dye and UAA accumulations. Depending on the neuron prep/quality before and after transfections, the presence of these patches is more or less obvious. However, despite the background we did not have problems identifying AIS during live cell imaging. Especially when overall neuronal health is optimal after transfections, AIS can easily be distinguished from patches that are positioned outside of labeled neurons. We will investigate this further and discuss it in the revised manuscript.

      1. For the dSTORM analysis of the tagged Nav1.6 protein, I also cannot tell there is periodic organization from the image directly. Some analysis is needed there.

      We will address this in the revised manuscript by performing additional experiments and quantifications. We also wrote a detailed answer below, in the response to the other reviewers.

      1. The authors use the AIS length as a parameter to evaluate the function of the clickable mutant of NF186, and using patch clamp for functional validation of the clickable mutant of Nav1.6. In both cases, the comparison is done between the mutant and the WT construct, but both in transfected cell and exogenously expressed. It's also worth comparing with untransfected cells as the true native situation.

      We agree with the reviewer that it is important to compare transfected cells with untransfected cells. As the reviewer points out, we have already performed some of these comparisons. When it comes to the NF186, we used the AIS length as a parameter to estimate if the expression of clickable mutant affected the AIS structure. As we show in the Fig. 02, we co-immunostained neurons transfected with NF186-HA WT or TAG constructs. We used HA antibody to detect neurons expressing NF186, while the ankG was used as a marker of the AIS length. To check if the AIS length of transfected cells is affected, we compared the length of transfected cells (expressing NF186, HA+) to surrounding untransfected cells (HA-). When it comes to the Nav1.6, we also compared the AIS length of cells expressing Nav1.6 (HA+) to surrounding untransfected cells (HA-). Similarly to the experiments with NF186, this allowed us to check if the expression of the recombinant Nav1.6 affect the AIS structure. What is missing is the comparison with untransfected conditions (i.e. neurons that are simply stained with ankG). We assume that is what the reviewer is referring to? We will also include these data in the revised manuscript. Furthermore, since we introduced a labeling modification in NaV1.6, we wanted to check if such modification would affect its function. To do so, as routinely done in the field (PMID: 25874799), we rendered the WT and TAG channels TTX-resistant and recorded only recombinant Na+ currents in neuroblastoma cells in the presence of TTX. Perhaps we misunderstand the reviewer’s comment, but in this regard measurements of untransfected cells are not relevant since they would not allow us to compare WT and TAG mutants.

      1. It is unclear, for all the presented data, whether all the cells are collected from a single biological replicate or from multiple replicates. At least 2-3 replicates are needed to see the reproducibility in terms of labeling efficiency, and other related conclusions.

      We thank the reviewer for the observation. By mistake, we omitted this important information. We will include in the revised version of the manuscript. We would like to highlight here that each experiment was repeated at least 3 times.

      Reviewer #4

      1."Confocal microscopy revealed that the hNSE promoter lowered the WT and clickable NF186-HA expression levels and consequently improved the localization of these proteins." Is the lower expression level a measure of localization improvement? How does the author conclude that the localization has improved?

      Previous report (PMID: 31900387) suggested that the overexpression of the recombinant WT NF186 can affect its trafficking, leading to the NF186 mislocalization. We observed the same in our experiments with CMV NF186 (in particular for NF186 WT). Hence, based on the PMID: 31900387 we probed weak neuron specific enolase promoter. Since the WT was the most problematic in terms of the ectopic expression, we checked if AIS localization was improved with enolase promoter for this construct. To this aim, we counted number of neurons that with mislocalized signal or with the signal in the AIS for both, CMV and enolase promoter. We could observe that number of neurons with mislocalized signal was lower for enolase promoter. Since there were more neurons with the AIS-specific signal when NF186 was expressed from enolase promoter compared to CMV, we concluded that enolase promoter lowered expression and improved localization of the NF186. Therefore, we used enolase promoter for click labeling of NF186 amber mutants. We will include the results of this analysis in the revised version of the manuscript.

      2."As there was no obvious difference in the nanoscale organization of the NaV1.6WT 317 -HA or NaV1.6TAG 318 -HA channels (Fig. 4. e-g), these experiments confirmed that the NaV1.6 overexpression, TCO*A319 Lys incorporation, and click labeling did not affect the nanoscale periodic organization of the sodium channels in the AIS." It is clearly noticeable that for WT, the spot density is more compared to the other two mutants. Why is that so? Using cluster analysis, one can quantify spot density and discuss nanoscale organization quantitatively. The author should quantify the periodicity and compare it among different variants and with previous reports.

      We thank the reviewer for these suggestions. We will address these remarks by performing additional new experiments and quantifications. The difference in the level of the expression of the recombinant Nav1.6 might explain differences in the spot density for WT vs. TAG clickable mutants. However, as the reviewer suggested quantitative analysis is needed to address these concerns. We also intend to quantify the periodicity and compare it among different variants and with previous reports. It is just important to note that in the current version of the manuscript we looked at the nanoscale organization of the subset of Nav1.6 channels. The reason being that we used anti-HA antibody which will only detect our recombinant protein which got incorporated into the AIS and not the endogenous Nav1.6.

      Minor comments

      1."Although NF186K809TAG 158 -HA (Supplementary Fig. 4) showed bright click labeling, we excluded it from the analysis due to its frequent ectopic expression along the distal axon." How frequently is this bright click labeling observed for this mutation? Is it not observed for other mutations at all? The authors should state this point clearly with some statistics.

      We are not sure what is the exact question from the reviewer. If we understand it correctly, the reviewer is asking us to quantify how frequent was the ectopic expression of this amber mutant compared to other mutants? And not the click labeling (as written in their original comment), since click labeling was observed for all the mutants independently of their ectopic expression?

      2."Immunostaining with anti-HA antibody revealed that the expression of NaV1.6WT 239 -HA on the membrane of the N1E-115-1 cells was higher than on the ND7/23 cells (Supplementary Fig. 5a-c). However, click labeling of both NaV1.6K1425TAG 240 -HA and NaV1.6K1546TAG 241 -HA with ATTO488-tz was not successful (Supplementary fig. 5d) indicating insufficient expression of the clickable constructs." Is this due to insufficient expression level or accessibility? The author should make this statement clear.

      We thank the reviewer for bringing this up. We will clarify this in the revised version of the manuscript. We believe that the click labeling of the K1546TAG mutant in N1E-115-1 cells is absent due to the insufficient expression of the channels on the membrane, since this mutant was successfully labeled in the primary neurons that represent more native environment and where Nav1.6 form high-density clusters. K1425TAG mutant is not labeled due to the insufficient expression on the membrane in N1E-115-1 cells as well. However, since this mutant is also poorly labeled in primary neurons, we can speculate that K1425TAG position might be less accessible for the tetrazine-dye compared to K1546TAG. To further support our claim that due to the insufficient expression click labeling is low/absent in neuronal cells, we can use NF186 as an additional example. When NF186 was expressed from strong CMV promoter, we observed click labeling for all the mutants in ND7/23 cells (Suppl. Fig.01). However, when CMV was replaced with neuron specific enolase promoter, the expression was of NF186 was substantially lower in ND7/23 cells and click labeling was absent (data not shown). We will clarify this in the revised manuscript.

      1. Authors should clearly state the drift correction procedure of 3D STORM data. What are the localization precision and photon count for 3D STORM experiments?

      We processed 3D dSTORM data in NIS-elements AR software. We used the automatic drift correction from the NIS-elements software that is based on the autocorrelation. We will provide further and updated information in the revised manuscript, including the localization precision and photon count for the new dSTORM images.

      1. "Click labeling of NaV1.6 channels in living primary neurons" What kind of primary neurons have been used for click labeling of NaV1.6 channels? Is there any specific reason why authors have chosen cortical neurons for labeling NF186? Does this labeling strategy depend on primary neuron type?

      For the establishment and click labeling of Nav1.6 we used primary rat cortical neurons (Fig. 03, Fig. 06). The same neuronal type has been used for click labeling of NF186 (Fig. 02). We established labeling of the AIS components in cortical neurons because we use those routinely in the laboratory. However, this labeling strategy does not depend on the neuronal type. As we show in Fig. 05, to study localization of the loss-of-function pathogenic Nav1.6 variants we used mouse hippocampal neurons. The reason for this is that in previous study the same neuronal type was used to characterize these two mutations (lines 361-362). This demonstrates nicely that method can be easily transferred to any neuronal type. Furthermore, we were also able to label Nav1.6 and NF186 in mouse cortical neurons (data are not shown in the manuscript). We will clarify this in the revised manuscript.

      Reviewer #5

      1.Throughout the manuscript, only one representative image containing one AIG is shown for each condition without statistics and quantifications, so the conclusions are not sufficiently convincing. For example, in Fig. 1b, c, e; Fig. 2b,c,d,e; Fig. 3b,c,d,e ; Fig. 5c; and Supplementary Fig.1-6, the authors should quantify the average fluorescence intensities both for HA immunostaining and ATTO488-tz labeling in different conditions, as well as the labeling ratios (fluorescence intensity ratios between ATTO488 and AF647/AF555) . Without statistics and quantifications, it is unclear whether there is any significant difference between the constructs with different TAG positions, or between different transfection methods (e.g., lipofectamine 2000 vs 3000).

      We agree with the reviewer that the quantitative analysis is important and we will provide more quantitative data in the revised manuscript. At the same time, we are a bit confused by this comment which seems to refer to missing quantifications in one of the schemes (Fig. 1) and overlooks existing quantifications (e.g. quantitative analysis of the data set from Fig. 5c is shown in Fig. 5d). However, as suggested by the reviewer and to strengthen our data, in addition to the quantifications already provided in the manuscript (e.g. Fig. 2d: AIS length of NF186TAG constructs; Fig. 3f: AIS length of Nav1.6 TAG constructs; Fig. 5d: click-labeling intensity of LOF mutants), we intend to quantify the differences between labeling ratios of different mutants and transfection methods. When it comes to the different transfection methods, some data is already provided in the manuscript (e.g. we counted number of transfected versus transduced neurons) but we intend to clarify and expand on this in the revised manuscript.

      1. The only quantification done was for the average AIS length, but the statistical tests should be performed between different conditions and the corresponding P values should be provided. It seems that the transfected neurons generally have a longer AIS length than the transfected neurons (Fig. 2d and 3f). Could the authors provide an explanation for this?

      We are a bit confused by the first part of this comment. We measured the AIS lengths of NF186 WT or NF186 TAG as well as Nav1.6 TAG and compared it to the AIS lengths of surrounding untransfected cells (Fig. 2d and Fig.03f). In addition, we compared the AIS lengths of the NF186 WT and TAG to each other, and Nav1.6 TAG to each other. To analyze the differences, we performed statistical tests and provided the corresponding p values in the figure legends (Fig. 02 and 03). Further details on the statistical analysis are provided in supplementary tables (Suppl. table 01 and 02). Regarding the 2nd question, we have also noticed that the AIS lengths of transfected neurons appear longer than those of untransfected cells. This seems to be more pronounced in the case of NF186 which is expressed at the higher level compared to the Nav1.6. The appearance of slightly longer AIS is most likely the consequence of the fact that recombinant constructs are overexpressed in the neurons that express endogenous NF186 and Nav1.6. However, this difference in the AIS length is not significant to the controls. We will discuss this further in the revised manuscript.

      1. The authors claim that there was no obvious difference in the nanoscale organization of the NaV1.6WT 317 -HA or NaV1.6TAG 318 -HA channels (Fig. 4. e-g), but it is hard to conclude this without any quantification and statistical analysis. Sodium channels have been shown to be associated with the membrane-associated periodic skeleton structures in neurons and average autocorrelation analysis has been developed to quantify the degree of periodicity of such structural organizations (Han et al. PNAS 114(32)E6678-E6685, 2017). The authors should use this approach to quantify and compare the average autocorrelation amplitudes.

      We are thankful to the reviewer for suggestions on how to quantify the periodicity of recombinant sodium channels and how to more accurately compare WT and TAG mutants at the nanoscale level. We will perform additional experiments and analysis in order to address the concerns of this and other reviewers.

      1. The authors should also obtain dSTORM images for the click labeled neurons to demonstrate if the click labeling method would provide sufficient labeling efficiency for dSTORM, compared to immunostaining (HA and Ankyrin G immunostaining).

      We would like to thank the reviewer for this suggestion. We have already shown in our previous work that STED can be performed with click labeled neurons (PMID: 35031604). When it comes to this manuscript and AIS labeling, we have already obtained preliminary dSTORM images of click-labeled NF186. Since the expression of Nav1.6 is lower compared to NF186, the labeling is also less bright and dSTORM is a bit more challenging. To try to overcome this issue, in addition to dSTORM of click-labeled Nav1.6, we are planning to try click-PAINT (PMID: 27804198). Click-PAINT has been used for super-resolution imaging of less abundant targets in cells and could possibly allow super-resolution imaging of Nav1.6. We will report on these new experiments in the revised version of the manuscript.

      1. It seems that the click labeling has a off-target/background labeling in the soma of the neuron (see Fig. 3c,d. Could the authors quantify and determine the sources of such off-target labeling?

      We thank the reviewer for pointing this out. We will clarify this in the revised manuscript, but by looking at the other examples from our dataset it appears to us that this background is present in WT constructs as well. In the current version of the manuscript, this is not clear since the WT image that is shown in the Fig. 03b is a single plane confocal image. Therefore, we will replace it in the revised manuscript with a z-stack in which the presence of the background is more obvious (due to the maximum intensity projection). In addition, we will conduct additional control experiments to clarify this.

      Minor comments:

      1. The authors should indicate how many replicates were performed and how many cells were analyzed for each experiment.

      We thank the reviewer for bringing this up. By mistake, we omitted this important information. We will include this information in the revised manuscript, but we would like to highlight here that each experiment was repeated at least 3 times.

      1. The display range (i.e., intensity scale bar) was indicated only for a small portion of the fluorescence images. It is better to be consistent and show the display range for all images presented.

      We will include intensity scale bars in all the images in the revised version of the manuscript.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Not applicable.

      4. Description of analyses that authors prefer not to carry out

      Reviewer #3, comment #5. One application presented in this manuscript is to evaluate the effect of epilepsy-causing mutations of Nav1.6. By comparing the intensity of ATTO488, the result suggests that there is no significant impact of these mutations on membrane tracking. I am wondering if the author should study the membrane tracking by also looking at the diffusion in live-cell with the labeling method. The comparison of the intensity only can be achieved by just immunostaining. It doesn't really demonstrate the benefit of live-cell labeling and imaging with the presented method.

      Generally speaking, one of the advantages of click labeling is its compatibility with live cell labeling. As the reviewer also points out, this is especially useful for live-cell imaging but is not limited to it. In addition, click labeling allows selective labeling of membrane population of Nav1.6 in living neurons. We took advantage of this and used cell-impermeable dyes to label unnatural amino acids incorporated into extracellular part of Nav1.6 (Scheme 03a). On the contrary, HA tag that allows immunodetection of recombinant Nav1.6 is added to the intracellular C terminus. Hence, by anti-HA immunostaining total (intra- and extracellular) epilepsy-causing Nav1.6 channel population will be detected. That is why in this case live-cell click labeling was advantageous compared to the conventional immunostaining. We will clarify this in the revised manuscript. In addition, we would like to note that when we started the experiments with the epilepsy-causing mutations, we wanted to a) check if they are present on the membrane and b) depending on the outcome of those experiments follow the trafficking of these LOF Nav1.6 mutants. Since patch clamp recordings of pathogenic Nav1.6 showed loss of Na+ currents, we at first assumed that they are not properly expressed on the membrane. However, our click labeling showed that the pathogenic channels were detected at the AIS membrane despite the loss of Na+ currents. This was also somewhat surprising to us and we would love to investigate this further. We also appreciate the reviewer’s suggestion in this regard and we hope to be able to use all the advantages of our labeling approach in our follow-up studies. However, keeping in mind time and resources limitations, live-cell trafficking study might be beyond the scope of this revision.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #5

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, Nevena Stajković et al. present a method for live labeling of the proteins localized at the axon initial segment (AIS) of cultured neurons using unnatural amino acids (UAAs) carrying strained alkenes and click chemistry. Using this method, the authors showed the successful labeling of two AIS-localized proteins, the 186 kDa isoform of neurofascin (NF186) and the 260 kDa voltage-gated sodium channel (NaV1.6). The authors also showed the transduction of neurons using adeno-associated viruses (AAVs) had higher efficiency than transfection by lipofectamine in delivering the vectors expressing required components for the click labeling.

      Major comments:

      1. Throughout the manuscript, only one representative image containing one AIG is shown for each condition without statistics and quantifications, so the conclusions are not sufficiently convincing. For example, in Fig. 1b, c, e; Fig. 2b,c,d,e; Fig. 3b,c,d,e ; Fig. 5c; and Supplementary Fig.1-6, the authors should quantify the average fluorescence intensities both for HA immunostaining and ATTO488-tz labeling in different conditions, as well as the labeling ratios (fluorescence intensity ratios between ATTO488 and AF647/AF555) . Without statistics and quantifications, it is unclear whether there is any significant difference between the constructs with different TAG positions, or between different transfection methods (e.g., lipofectamine 2000 vs 3000).
      2. The only quantification done was for the average AIS length, but the statistical tests should be preformed between different conditions and the corresponding P values should be provided. It seems that the transfected neurons generally have a longer AIS length than the transfected neurons (Fig. 2d and 3f). Could the authors provide an explanation for this?
      3. The authors claim that there was no obvious difference in the nanoscale organization of the NaV1.6WT 317 -HA or NaV1.6TAG 318 -HA channels (Fig. 4. e-g), but it is hard to conclude this without any quantification and statistical analysis. Sodium channels have been shown to be associated with the membrane-associated periodic skeleton structures in neurons and average autocorrelation analysis has been developed to quantify the degree of periodicity of such structural organizations (Han et al. PNAS 114(32)E6678-E6685, 2017). The authors should use this approach to quantify and compare the average autocorrelation amplitudes.
      4. The authors should also obtain dSTORM images for the click labeled neurons to demonstrate if the click labeling method would provide sufficient labeling efficiency for dSTORM, compared to immunostaining (HA and Ankyrin G immunostaining).
      5. It seems that the click labeling has a off-target/background labeling in the soma of the neuron ( see Fig. 3c,d. Could the authors quantify and determine the sources of such off-target labeling?

      Minor comments:

      1. The authors should indicate how many replicates were performed and how many cells were analyzed for each experiment.
      2. The display range (i.e., intensity scale bar) was indicated only for a small portion of the fluorescence images. It is better to be consistent and show the display range for all images presented.

      Significance

      Unnatural amino acid (UAA)-based minimal tags for live-cell protein labeling in mammalian cells were invented about ten years ago (Lang et al., 2012b, Lang et al., 2012a, Nikic et al., 2014, Plass et al., 2012, Uttamapinant et al., 2015), and these authors recently introduced this labeling method to label live cultured neurons (Arsić et al., 2022). Therefore, it is unclear whether the method present in this manuscript has any significant advance compared to the Arsić et al. paper, given that the major difference between the two papers is that in the current manuscript, AIS localized proteins were labeled, whereas in the Arsić et al. paper, neurofilaments were labeled in the neurons. Therefore, the method presented in the current manuscript does not provide much novelty or technical advance compared to what has been described in the Arsić et al. paper.

      My expertis is super-resolution flurescence imaging, cell labeling methods, and neurobiology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      The manuscript demonstrates a novel method of labeling two large components of the initial axon segment, neurofascin (NF186) and Nav1.6 using unnatural amino acids and click chemistry in live cells. They have applied their method for epilepsy causing two Nav1.6 variants without affecting their functionality. Since these proteins are larger in size, selecting the labeling sites and transfection efficiency become critical factors. They have targeted different lysine sites and shown the best performing labeling site. Also, they have developed a viral vector to improve transfection efficiency.

      The experiments are well designed, and the manuscript is nicely written. In my opinion, the manuscript can be accepted, but the author should address the following comments.

      Major comments

      1. "Confocal microscopy revealed that the hNSE promoter lowered the WT and clickable NF186-HA expression levels and consequently improved the localization of these proteins." Is the lower expression level a measure of localization improvement? How does the author conclude that the localization has improved?
      2. "As there was no obvious difference in the nanoscale organization of the NaV1.6WT 317 -HA or NaV1.6TAG 318 -HA channels (Fig. 4. e-g), these experiments confirmed that the NaV1.6 overexpression, TCO*A319 Lys incorporation, and click labeling did not affect the nanoscale periodic organization of the sodium channels in the AIS." It is clearly noticeable that for WT, the spot density is more compared to the other two mutants. Why is that so? Using cluster analysis, one can quantify spot density and discuss nanoscale organization quantitatively. The author should quantify the periodicity and compare it among different variants and with previous reports.

      Minor comments

      1. "Although NF186K809TAG 158 -HA (Supplementary Fig. 4) showed bright click labeling, we excluded it from the analysis due to its frequent ectopic expression along the distal axon." How frequently is this bright click labeling observed for this mutation? Is it not observed for other mutations at all? The authors should state this point clearly with some statistics.
      2. "Immunostaining with anti-HA antibody revealed that the expression of NaV1.6WT 239 -HA on the membrane of the N1E-115-1 cells was higher than on the ND7/23 cells (Supplementary Fig. 5a-c). However, click labeling of both NaV1.6K1425TAG 240 -HA and NaV1.6K1546TAG 241 -HA with ATTO488-tz was not successful (Supplementary fig. 5d) indicating insufficient expression of the clickable constructs." Is this due to insufficient expression level or accessibility? The author should make this statement clear.
      3. Authors should clearly state the drift correction procedure of 3D STORM data. What are the localization precision and photon count for 3D STORM experiments?
      4. "Click labeling of NaV1.6 channels in living primary neurons" What kind of primary neurons have been used for click labeling of NaV1.6 channels? Is there any specific reason why authors have chosen cortical neurons for labeling NF186? Does this labeling strategy depend on primary neuron type?

      Significance

      Although the use of unnatural amino acids and click chemistry for labelling has been shown before from the same group, labelling large proteins, especially ion channels, without affecting their function is always challenging because of the accessibility of the labelling site as well as poor transfection efficiency. Here, they have selected two such large essential proteins: NF186 and Nav1.6, which are associated with epilepsy, and developed a method for fluorophore labelling with minimal perturbation. Other approaches namely using fluorescent proteins, biotin-streptavidin chemistry and halo-tag have been reported before to label these proteins, but these have a strong impact on their mislocalisation and perturbing their functionality. Therefore, this method will be of great importance in the field of studying these proteins.

      Expertise: Live-cell confocal and multi-photon microscopy imaging, Super-resolution microscopy imaging, Live-cell labelling, and Amyloid aggregations

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Stajkovic et al the describes step-wise generation and validation of the fluorescent labeling of NF186 and Nav1.6 in primary neurons by non-natural amino acid and click chemistry. For each protein of interest, the authors started by generating constructs carrying amber codon at different positions, and then selected for the best construct(s) by judging (1) the labeling efficiency, (2) whether the particular labeling position affect the function of the protein, and (3) whether the labeled protein shows any mislocalization. During the trouble shooting process, the authors also introduced adeno-associated viral (AAV) vectors for more efficiently delivering constructs into the cells. The method described in the manuscript could become a reference for researchers who aim to label similar neuronal proteins.

      Specific comments:

      1. There is some patch-like background from the 488 channel from the click reaction, some of which have very as strong signal as the staining on the neurons. What is the potential cause for this? With immunostaining on HA, the background doesn't affect too much on the image data interpretation. However, the major goal of this method development is to use it in live-cell without immunostaining. Without another reference, the high background might cause issues in data interpretation. Can the author also suggest way to avoid or lower this in the discussion?
      2. For the dSTORM analysis of the tagged Nav1.6 protein, I also cannot tell there is periodic organization from the image directly. Some analysis is needed there.
      3. The authors use the AIS length as a parameter to evaluate the function of the clickable mutant of NF186, and using patch clamp for functional validation of the clickable mutant of Nav1.6. In both cases, the comparison is done between the mutant and the WT construct, but both in transfected cell and exogenously expressed. It's also worth comparing with untransfected cells as the true native situation.
      4. It is unclear, for all the presented data, whether all the cells are collected from a single biological replicate or from multiple replicates. At least 2-3 replicates are needed to see the reproducibility in terms of labeling efficiency, and other related conclusions.
      5. One application presented in this manuscript is to evaluate the effect of epilepsy-causing mutations of Nav1.6. By comparing the intensity of ATTO488, the result suggests that there is no significant impact of these mutations on membrane tracking. I am wondering if the author should study the membrane tracking by also looking at the diffusion in live-cell with the labeling method. The comparison of the intensity only can be achieved by just immunostaining. It doesn't really demonstrate the benefit of live-cell labeling and imaging with the presented method.

      Significance

      The data itself is mostly convincing, however, I do not see much novelty from this manuscript. Both the labeling method using non-natural amino acid and click chemistry and AAV delivery are established. However, I can see that for research groups who specifically interested in studying these two proteins or proteins closed related, the results from this manuscript could be of direct help.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This study proposes a novel tool for AIS live and fixed labelling based on biorthogonal click chemistry. Stajkovic and colleagues used this method to specifically label the AIS proteins NF186 and Nav1.6 of mouse and rat neurons, and did a thorough process of optimization to get convincing results. The authors considered different promoters, transfection strategies and the use of AAVs to get the most efficient labelling strategy for both proteins. They have also gone through a strong validation process based on transfection efficiency, quality of staining, potential effects on AIS length and nanostructure, and electrophysiological properties. Finally, Stajkovic and colleagues used this tool to study how two epilepsy-causing Nav1.6 mutant variants affect AIS function, providing interesting data to the understanding of this pathology. In summary, this method convincingly overcomes some well-described issues associated with pre-existing AIS live cell labelling tools by being minimally "invasive" to the proteins of interest. Besides the scientific content, another strong point of this article is the clarity of the manuscript and the figures: the presence of schematics (i.e. Fig. 1) and the detailed description of experiments and results will help non-specialist readers to follow the study. I strongly recommend this article for journal publication.

      Major comments:

      I have no major comments

      Minor comments:

      I have some minor comments:

      • On lines 107 and 108, the sentence "The C-terminal HA-tag allowed us to detect the full-length NF-186 protein by immunostaining it with an anti-HA antibody" would have a better place just after lines 104-105 " [...] we modified the previously described plasmid (Zhang et al., 1998) by moving the hemagglutinin (HA) tag from the N terminus to the C terminus".
      • Fig.2b: the AnkG staining looks substantially longer than that showed in c. However, the results on AIS length show no significant changes in between the groups. This is visually misleading, the authors should choose a picture for the WT construct that is representative of the data.
      • Line 238: what is the rationale behind choosing these cells? For example, have they been used in other studies for similar purposes? If so, please provide the reference.
      • Figure 3c, the authors omitted the comparison with the WT construct this time, as opposed to the neurofascin experiments. What is the reason?
      • Fig. 4: why did the authors chose these cells for electrophysiology experiments and not neurons? Explain the rationale in the text or, alternatively, cite similar studies using the same tool.
      • Fig.4, biophysical properties: did the authors find differences in passive properties? Measures of resting potential, membrane resistance and cell capacitance should be reported.
      • Fig 4, STORM images. The periodic distribution of the dots should be enhanced with some sort of arrows or lines, for the non-specialist audience.
      • Line 374: rat or mouse primary neurons?

      Referees cross-commenting

      I fully agree with the following remarks from Reviewers #3, #4 and #5. This is a point that I have raised in my report too. The authors need better images to show the periodicity visualization, and a quantification would be of great benefit to support the claim with numbers (and how these compare to similar studies in the literature):

      R3: 2. For the dSTORM analysis of the tagged Nav1.6 protein, I also cannot tell there is periodic organization from the image directly. Some analysis is needed there. R4: 2."As there was no obvious difference in the nanoscale organization of the NaV1.6WT 317 -HA or NaV1.6TAG 318 -HA channels (Fig. 4. e-g), these experiments confirmed that the NaV1.6 overexpression, TCO*A319 Lys incorporation, and click labeling did not affect the nanoscale periodic organization of the sodium channels in the AIS." It is clearly noticeable that for WT, the spot density is more compared to the other two mutants. Why is that so? Using cluster analysis, one can quantify spot density and discuss nanoscale organization quantitatively. The author should quantify the periodicity and compare it among different variants and with previous reports. R5: 3. The authors claim that there was no obvious difference in the nanoscale organization of the NaV1.6WT 317 -HA or NaV1.6TAG 318 -HA channels (Fig. 4. e-g), but it is hard to conclude this without any quantification and statistical analysis. Sodium channels have been shown to be associated with the membrane-associated periodic skeleton structures in neurons and average autocorrelation analysis has been developed to quantify the degree of periodicity of such structural organizations (Han et al. PNAS 114(32)E6678-E6685, 2017). The authors should use this approach to quantify and compare the average autocorrelation amplitudes.

      I also agree with these comments from Reviewers #3 and #5:

      R3: 4. It is unclear, for all the presented data, whether all the cells are collected from a single biological replicate or from multiple replicates. At least 2-3 replicates are needed to see the reproducibility in terms of labeling efficiency, and other related conclusions. R5: 1. The authors should indicate how many replicates were performed and how many cells were analyzed for each experiment.

      Significance

      The proposed tool in this article represents a big step forward in the field of AIS live cell imaging. As stated by the authors in the introduction, previous studies have described methods based on tagging fluorescent proteins to the protein of interest or labelling the extracellular part of proteins with antibodies. The same studies reported several issues: the interference with important domains of the protein due to the size and the position of the tag in the case of fluorescent proteins (Dumitrescu et al., 2016, PMID: 27932952; Dzhashiashvili et al., 2007, PMID: 17548513), or the failure to report plasticity changes in the AIS in the case of antibodies (Dumitrescu et al., 2016, PMID: 27932952). This tool can be useful for research teams aiming to understand, for example, the live development of the AIS or understanding the trafficking of its proteins. The authors have applied this method to two transmembrane proteins (NF186 and Nav1.6), but as they state in their discussion, it will be useful to tag other candidates, including cytoplasmic proteins. One of the main problems of immunocytochemistry is to find the right antibody to detect your protein. Sometimes, absence of proof is not proof of absence: just because the protein is not detected via immunostaining does not mean that the protein is not expressed there. This tool offers an alternative to these challenging scenarios.

      My expertise keywords: axon initial segment, neuronal polarity, axon biology, super resolution microscopy.

    6. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      AIS organization is highly complex and unique and AIS labeling in living cells has been problematic. In this study, authors comprehensively searched and optimized live cell markers for AIS.The main advantage of the labeling approach is that the small fluorescent dye is directly attached to the proteins of interest. Therefore, the protein of interest is modified in a minimally invasive way.

      Significance

      I was first wondering whether tool development is enough important to be published as such but here, the development and testing of the constructs, whether they affect cell functionality, is very comprehensive. This all makes this tool very useful for other researchers. I can take the method directly to use and I don't need to do all testing by myself. I feel that this tool development takes field forward.

      Own expertise: I have personally gone through the AIS live cell labeling problems and therefore I can easily appreciate the work done here.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We are sincerely grateful to the reviewers for several key comments that led us to correct some mistakes and better appreciate how to put our findings in the context of recently published data. These changes undoubtedly improved the manuscript.

      Many other reviewer comments seem to equate chaperone binding with a functional chaperone role in de novo folding. These are not the same. Cytosolic chaperones presumably “sample” nearly every protein that is synthesized by cytoplasmic ribosomes. This does not mean that every such protein would misfold if even one of those chaperones failed to bind it. If we want to understand what chaperone mutations might cause human disease due to septin misfolding, for example, it will not be enough to catalog all the chaperones that bind septins. We have already done that. What will help is to understand which chaperones make functional contributions to septin folding and complex assembly. Our study is the first to experimentally address chaperone roles in de novo septin folding, period. We take responsibility for not being sufficiently clear about the goals of our work, and, to emphasize these points, we added one sentence to the Introduction and revised another.

      Another consistent criticism was that the use of the E. coli system, both in vivo and in vitro, limited our ability to gain insight into the folding of septins in eukaryotic cells and led to a “tessellated view”. For example, reviewers claimed that our model about translation elongation rates for Cdc12 were “based mainly on the E. coli system and bioinformatics analysis”. We disagree with this interpretation. Key evidence in support of our model come from published data in yeast, specifically the much higher density of ribosomes on Cdc12 and the accumulation of ribosomes on the Pro-rich cluster near the Cdc12 N terminus. These are precisely the kinds of “more stringent analysis” in “authentic yeast” (to use Reviewers’ language) that we would have wanted to do to test our model, had they not already been done by others. Without specific suggestions, we struggle to imagine what other kinds of experiments the Reviewers have in mind, apart from a eukaryotic version of a reconstituted cell-free translation system, which Reviewer #1 admits “would be substantially difficult” and “time consuming”. While we are intrigued by the reconstituted eukaryotic cell-free translation system that was published last year (which we mentioned on lines 994-995) and look forward to exploring it in future studies, it is not commercially available and we agree that the amount of effort required to prepare it ourselves is unrealistic for the current study. Most importantly, we do not find in the critiques provided any specific reason why our E. coli-based systems experiments are intrinsically less “stringent” or “rigorous”.

      Accordingly, we think that, together with the results of multiple new experiments (detailed below), the extensive re-writing and re-ordering that we have done in the revised manuscript will be enough to better emphasize the importance and rigor of our findings and thus to address all of the Reviewers’ specific concerns.

      Reviewer 1 thought that our manuscript “does not even provide new information, since the involvement of CCT and the Hsp70 system is not novel” and thought that “the key finding of this manuscript is how chaperones are involved in the de novo folding of septins, which is not conceptually new because of previous findings, including those of the authors”. Reviewer #3 also stated that “the function of Tric/CCT in septin folding and assembly is well documented”.

      We were quite surprised at this reaction, since we dedicated a significant portion of the original manuscript (lines 68-76 and 319-322) to explicitly discussing the only other paper in the literature that specifically addresses the question of whether or not CCT is required for de novo septin folding. As a reminder, that paper explicitly stated that “it is unlikely that CCT is required to fold septins de novo” and “septins probably do not need CCT for biogenesis or folding”. With regard to involvement of the Hsp70 system, the only existing evidence in the literature on this subject is the aggregation of some septins in ssb1∆ ssb2∆ cells. Like the CCT study, that study did not distinguish whether this was a result of problems during septin synthesis and before septin complex assembly, or, alternatively, whether pre-folded and assembled septins were subject to disassembly, misfolding, and aggregation. Our experiments specifically test the fate of newly-synthesized septins prior to assembly in living cells. Our previous findings documented physical interactions between wild-type septins and multiple chaperones but did not address whether these interactions had any functional relevance. We previously reported functional effects of interactions between chaperones and MUTANT septins but, again, these studies did not address functional chaperone requirements for WILD-TYPE septins. While we did our best to highlight these points in the original document without devoting excessive amounts of text, we accept responsibility for not making these points sufficiently clear and to address this issue we added additional text, including the text quoted above, to the Introduction.

      While Reviewer #3 commented that the manuscript “is overall well presented”, Reviewer 1 thought that the manuscript was “complicated to read” with “no logical connections, just a list of many results” and mentioned that part of the difficulty was “that it contains many negative results”.

      In addition to reorganizing the manuscript, as suggested by the reviewers, we added more text at the beginning and end of nearly every section to even more explicitly state the logical connections between results. In our opinion, negative results of properly controlled experiments are valuable to the research community, and we do not understand what it is about negative results that makes them difficult to read about. Many of the extra experiments we performed were in anticipation of being asked to perform them by reviewers, some of which generated negative results. We are reluctant to remove negative results unless there is a more compelling reason. For example, to address another reviewer concern, we did remove the negative results with the Ydj1–Ssa2 compensatory mutants.

      Reviewer #2: “4) Figure 2: The labeling on the protein structure makes it seem like the exact region for Ydj1 and Hsp70 was experimentally identified, when it hasn’t.”

      We acknowledge that the first sentence of the figure legend (“the colored ribbon follows the color scheme in the sequences at right for overlapping β-aggregation, Ydj1 and Hsp70-binding sites”) could be misinterpreted, since only in the second sentence does it say “Sequence alignments show predicted binding sites”. We corrected this mistake, and added the text “Predicted chaperone binding sites” as the first words in the legend to this figure.

      Reviewer #2: “8) The authors confusingly jump back and forth between different Septins and different chaperone (Ssa1-4, Ydj1, Sis1, Hsp104). We would ask the authors to re-arrange the manuscript, collating all the yeast work in one section and bacterial work in another.”

      We re-arranged the manuscript and put all the yeast work in one section and all the bacterial work in another, with the exception of the studies of individually purified Cdc3 and Cdc12, which we put in between the yeast studies of the kinetics of de novo assembly and the yeast studies of post-translational assembly. Our reasoning is that the studies with the purified proteins demonstrate challenges with maintaining native conformations in the absence of chaperones and other septins, which flows naturally into the yeast studies asking about the ability of “excess” septins to maintain oligomerization-competent conformations in the absence of other septins and when we experimentally eliminate specific chaperones. All of the work actually manipulating E. coli genes/proteins is now together.

      Reviewer #3: “1. The co-translational binding of CCT to nascent polypeptide chains has been studied (Stein et al., Mol Cell 2019). While the authors indicate that septin subunits are engaged co-translationally, they do not comment which ones are interacting with CCT and at which state of translation. This information is crucial and should also be mentioned in the discussion section.”

      We are grateful to the Reviewer for bringing up this point, which we had overlooked. We hadn’t noticed that, in the end, only Cdc3 met the CCT confidence threshold to be included in the supplemental data of the Stein et al. paper. All septins co-purified with CCT in an earlier Dekker et al proteomic study, so we strongly suspect that the failure of the other septins to meet the confidence threshold in the Stein et al paper reflects the sensitivity of that assay, rather than a significant difference in how septin GTPase domains interact with CCT. We also hadn’t appreciated that according to that study, the main sites in the Cdc3 GTPase domain bound by CCT and Ssb are the same. Hence our statement that Ssb bound to septins “earlier” during translation, and CCT bound “later” was wrong. Instead, the overlapping Ssb and CCT site in Cdc3 turns out to be remarkably consistent with a conclusion from Stein et al paper, that CCT binds Rossmann-fold proteins like septins at sites where “early” beta strands have been translated and expose a chaperone-binding surface that later becomes buried by an alpha helix. We corrected our mistake in the text and in our model figure and added: (1) a new supplemental figure with predicted septin structures and a sequence alignment indicating where CCT and Ssb bound; and (2) text discussing the confidence thresholds for “calling” septin-CCT interaction, the Rossmann-fold binding, and how we interpret Ssb and CCT binding to the same site.

      Reviewer #3 “3. Figure 3: It is recommended to also follow Cdc10-GFP and Cdc12-GFP fluorescence. This will on the one hand generalize the presented findings and provide a direct link to other parts of the study (e.g. crosslinking analysis of Cdc10).

      We carried out the requested experiment for Cdc12, using Cdc12-mCherry rather than Cdc12-GFP because of the formation of non-native foci that we observed with Cdc12-GFP. We also attempted to analyze Cdc10, using an existing GAL1/10-promoter-driven Cdc10-mCherry plasmid that we’d made a few years ago, but it did not behave as expected, with high expression even in the absence of galactose (not shown), which prevented us from performing the requested experiment. We have a Cdc10-GFP plasmid with the inducible MET15 promoter, but this promoter does not provide sufficiently low levels of expression in repressive conditions, so there would be too much expression at the beginning of the experiment for us to accurately follow accumulation thereafter. Instead, we tried the only other plasmid we had with the GAL1/10-promoter controlling a tagged septin: Cdc11-GFP. Above a certain threshold of expression, Cdc11-GFP formed unexpected cortical foci, but we were still able to perform the analysis and found a clear delay in septin ring signal in cct4 cells, providing the requested generalization to other septins, if not Cdc10.

      Reviewer #3 “5. Figure 4C: The finding that only ssb1 but not ssb2 knockouts have an effect on joining of free Cdc12-mCherry subunits into septin rings is puzzling. Similarly, Ssb1 largely acts co-translationally, while in this assay post-translational septin ring assembly is monitored. The authors need to comment on these two points.”

      We did not examine ssb2 knockouts, so we do not know to what the Reviewer is referring in the first point. If the Reviewer means that they are puzzled by the fact that we saw a phenotype in cells in which only SSB1 was deleted and SSB2 remained, we offer two explanations. As can be seen in the Saccharomyces Genome Database entry for SSB1 (https://yeastgenome.org/locus/S000002388/phenotype), there are at least a dozen known phenotypes associated with deletion of SSB1 in cells with wild-type SSB2. We even showed a very clear septin misfolding/mislocalization phenotype in Supplemental Figure 4D. Thus while our findings are new and provide novel insights into Ssb function, they are not unprecedented. The Reviewer is correct that most Ssb is ribosome-bound and thus Ssb1 “largely acts co-translationally” but ~25% of Ssb is not ribosome-associated (PMID: 1394434). Furthermore, the lack of a strong phenotype for ssb1∆ cells in our new kinetics-of-folding experiment (see below), plus the realization that Ssb and CCT both bind the same site in Cdc3, leads us to a new model: Ssb acts both co- and post-translationally in septin folding, but only the post-translational function is associated with a phenotype in ssb1∆ cells, because in that assay we drastically overexpress a tagged septin and thereby exceed the Ssb chaperone capacity that remains when we delete SSB1. This logic also explains the first ssb1∆ phenotype we saw, when overexpressing Cdc10(D182N)-GFP. In the kinetics-of-folding assay, on the other hand, tagged septin expression is much lower and reducing the amount of total Ssb by ~50% (via SSB1 deletion) likely does not compromise Ssb function in folding the tagged septin. We therefore removed our statement that “Ssb dysfunction leaves nascent septins in non-native conformations that are aggregation-prone and unrecognizable to CCT”, revised our model figure accordingly, and added new text and citations to explain our new model.

      Reviewer #3 “Additionally, they should test whether the appearance of septin ring fluorescence is slowed down in ssb1 mutants (as shown for cct4-1 mutant cells in Figure 3B).”

      We agree that slower septin folding in ssb1∆ cells is a prediction of our model, and we performed the requested experiment and include the results in our revised manuscript. The new data show that the appearance of septin ring fluorescence is not delayed in ssb1∆ mutants, which is easily explained by the ability of Ssb2 to chaperone the folding of the low levels of tagged septin that we express in these kinds of experiments (see above).

      Reviewer #3: “7. Figure 5G: The data is not convincing. This reviewer cannot detect a specific Cdc12 band accumulating in presence of GroEL/ES.”

      We re-ran the reactions again with fresh reagents and this time ran the gel longer to reduce excess signal from free fluorescent puromycin and the bright Cdc10 bands. We now see a very clear band for full-length Cdc12 in the reaction with added GroEL/ES, fully consistent with our mass spectrometry results. We updated the figure with the new results.

      Reviewer #3: “Furthermore, the activity tests done for the chaperonin system are confusing (Supplemental Figure 7). The ATPase rate (slope!) of GroEL/GroES seems higher as compared to GroEL but according to the authors it should be opposite.”

      In our assays, the ATPase activity is so fast that for our “time 0” timepoint, much of it has already occurred by the time the reaction can be physically stopped and measured. In other words, the handling time is such that we can’t visualize what happened in the earliest stages of the reaction, where the rates could accurately be estimated as slopes. This is obvious from the fact that at time 0, the absorbance for the “GroEL alone” reaction is already more than twice the absorbance for GroEL+ES. We added clarifying text to the figure legend.

      Reviewer #3: “The refolding assay using Rhodanese as substrate is also confusing: What is the activity of native Rhodanese? The aggregated Rhodanese sample seems to have substantial activity that is not too different from a GroEL/ES-treated one. From the presented data it is not clear to the reviewer to which extend GroEL/ES prevents aggregation and supports folding of denatured Rhodanese.”

      We thank the Reviewer for bringing this to our attention, because made we mistakenly left out the values for native Rhodanese with the reporter. With regard to the aggregated Rhodanese, we failed to note that this sample contains urea. When the urea absorbance is subtracted, it is clear that the GroEL/ES-treated sample has higher activity. Furthermore, some native enzyme is likely still active within the aggregated sample, explaining the “substantial activity” that the Reviewer correctly notes. We corrected the figure and added clarifying text to the figure legend.

      Reviewer #3: “the study goes astray following aspects that does not seem relevant to this reviewer (e.g. the role of N-terminal proline residues for Cdc12 translation, Fig. 5E/F).”

      We acknowledge that we did a poor job of introducing the N-terminal Pro-rich cluster in Cdc12 with relation to our model of slow Cdc12 translation. Instead, we have revised and reorganized the manuscript to set up these experiments as a direct test of our model: if ribosome collisions on the body of the ORF drive mRNA decay, then decreasing the spacing of those ribosomes should exacerbate the problem, and eliminating the Pro-rich cluster (where published yeast data already show ribosomes accumulate) is the most logical way to test the prediction. Far from being irrelevant, the results fit the prediction perfectly and thus support the model. We expect that this change will highlight the importance of these experiments for the reader.

      Reviewer #2: “1) Fig. 1 Is the folding of Cdc3 being measured in cells lacking chaperones mentioned towards the end of the paper or are the authors referring to the lack of yeast proteins?”

      We are unclear as to what the Reviewer is asking here. The title of Figure 1 states that these are “purified yeast septins” and the figure legend further emphasizes this fact. Additionally, the Coomassie-stained gel in Figure 1A shows a single band, corresponding to purified 6xHis-Cdc3. The proteins were purified from wild-type E. coli cells, so all E. coli chaperones were present when Cdc3 initially folded, but chaperones and all other proteins were removed during the purification and prior to the analysis. We do not know what change to make.

      Reviewer #2 asked “How do the authors account for the septin defect in Ssa4 delete cells in unstressed conditions where Ssa4 would be very low already? According to the authors previous work, Ssa2 and 3 should be able to compensate.”

      We explicitly addressed this point in the original manuscript (lines 893-898). Again, we think here the Reviewer is equating chaperone binding with chaperone function. According to our previous work, Ssa2 and Ssa3 are able to bind septins, but this does not mean that they can fold septins the same way as Ssa4. We cite several papers that discuss the distinct functional roles for the different Ssa proteins. We do not think that additional clarification of this point would strengthen the manuscript.

      Reviewer #3: “6. Figure 5B: It is unclear why Cdc3 is observed in the pulldown of His-tagged Cdc12 (37˚C), although no Cdc12 was isolated under these conditions. How is that possible?”

      That is not possible. As we indicate in the figure legend and with the red asterisk, the only band appearing in that lane is a non-specific band that cross-reacts with the anti-Cdc3 and/or anti-Cdc11 antibodies. This is why it is also present in the “No septins” control lanes. We made the asterisk larger to help accentuate this point.

      Reviewer #3: “Furthermore, the authors observe a specific effect on Cdc12-Cdc11 assembly in the E. coli groEL mutant. How do they rationalize this specific effect as Cdc12-Cdc3 assembly remained unchanged? This observation also seems in conflict with the suggestion of the authors that Cdc12 preferentially recruits Cdc11 before interacting with Cdc3 (page 45, lane 1024).”

      Cdc11 was not expressed in the groEL mutants because no Cdc11 gene was present in those cells, as explained in the body text and indicated in the labeling above the lanes in Figure 5A. The band near the size of Cdc11 is a non-septin protein that bound to the beads in the groEL-mutant cells, as is shown in the immunoblot using anti-Cdc11 antibodies in Figure 5B. Thus there is no conflict to rationalize.

      Reviewer #1: “The only evidence that CCT binds to septin is the list of LC-MS/MS. Western blotting would provide more solid data.” and “2) The cross-linking experiments appears not to have been successful. Why are the Ssas, Ydjs etc not detected here? “

      First, CCT subunits are relatively low-abundance, expressed at 5- to 50-fold lower levels than other chaperone families in the yeast cytosol (see PMID: 23420633). To the Reviewer’s second point, we did in fact detect other chaperones in our crosslinking mass spectrometry experiments, including Ydj1, multiple Ssa and Ssb chaperones, Hsp104, etc., as can be seen in Table S1. However, they were also detected in negative control experiments. This is not surprising, given that these chaperones are among the most common “contaminants” of affinity-based purification schemes (see the CRAPome database at https://reprint-apms.org/). It was for this reason we had to perform so many negative control experiments, which likely produced some false negative results, as some “real” interactions were likely discarded when the same chaperone showed up in our controls. We added a figure panel with a Venn diagram of overlap between experimental and control samples, and text pointing out this caveat of our approach.

      Second, in this experiment we attempted to identify proteins that transiently interact with a specific region of Cdc10 that will later become buried in a septin-septin oligomerization interface. Due to the transient nature of the interaction, we do not expect to detect high levels of crosslinked chaperones. Mass spectrometry is significantly more sensitive than immunoblotting, so there is no guarantee that we would be able to detect a band even if the crosslinking works as desired. Indeed, the crosslinked bands we saw by immunoblot for GroEL were quite faint (see Figure 2F), despite the fact that GroEL and the T7-promoter-driven Cdc10 were among the most abundant proteins in those E. coli cells.

      Third, there is no commercially available, verified antibody recognizing yeast Cct3 for which to perform the requested immunoblot experiment. Since both the N and C termini of CCT subunits project into the folding chamber, it is unwise to use a standard epitope tagging approach, as the tags may compromise function. Indeed, for purification purposes others inserted an affinity tag in an internal loop in Cct3 (PMID: 16762366). We have a yeast strain with Cct6 tagged in an analogous way, but to perform the requested immunoblot experiment with Cct3 would require creating or obtaining the Cct3-tagged strain, deleting NAM1/UPF1, and introducing our Bpa tRNA/synthetase and GST-6xHis-Cdc10 plasmids. Given the sensitivity of detection concerns stated above, we doubt this would help.

      In summary, we prefer not to attempt the requested immunoblot experiments.

      Reviewer #1: “-Fig. 3B ant related Figures: The experiment to see if GFP-tagged septin accumulates in the bud neck is important, but only the graphs after the analysis are shown. The authors should provide the readers with representative examples from imaging data.”

      We are confused, because the images at the bottom of Figure 3A already show what the Reviewer requests. As stated in the figure legend, these are representative examples of the imaging data from a middle timepoint of one of the experiments. It would be nearly impossible (for space reasons) to provide representative images for all of the timepoints for all of the genotypes for all of the experiments. Since in our new experiments we introduce new tagged septins (Cdc11-GFP and Cdc12-mCherry), we also now include representative images of cells expressing these proteins, as well.

      Reviewer #2: “3) If the authors had evidence of chaperone interaction from their previous study, why did they not simply do IPs with fragments of the septins/chaperones?”

      We are unclear why the Reviewer is suggesting IPs after referring to our previous study. IPs are a poor choice for transient interactions, which is why we mostly avoided them in previous studies, and instead used a novel approach (BiFC) to “trap” chaperone–septin interactions. Moreover, we seek to identify chaperones that bind wild-type septins at future septin-septin interfaces on the path towards the native conformation. Fragments of septin proteins would likely misfold and would therefore likely attract chaperones that wouldn’t normally bind the full-length septin. Indeed, our previous studies demonstrated that even a single non-conservative amino acid substitution was sufficient to alter chaperone-septin binding. Thus IPs with fragments of septins or chaperones would be highly unlikely to yield informative results for the questions we seek to answer. We strongly prefer not to attempt these suggested experiments.

      Reviewer #2: “5) While differences between Ssa paralogs are highly interesting, using deletions of Ssas is not useful, given that yeast compensate by overexpressing other paralogs. The yeast GFP Septin assays should be repeated in yeast lacking all Ssas and expressing one paralog on a constitutive promoter (See numerous papers by Sharma and Masison).”

      We disagree that ssa deletions are “not useful”, since if the overexpressed paralogs cannot fulfill the same function as the deleted SSA, then we will see a phenotype. Which we do. Furthermore, we had already obtained and thoroughly tested a strain like the ones mentioned by the reviewer (ECY487, a.k.a. JN516, from Betty Craig’s lab, with ssa2∆ ssa3∆ ssa4∆ and SSA1, which is constitutively expressed, PMID: 8754838), but we found that, as published, it divides slightly more slowly even under the most permissive of conditions. The requested strain cannot be analyzed using our method, because slow accumulation of ring fluorescence could be attributed to other defects unrelated to septin folding. Thus we strongly prefer not to attempt the suggested experiments.

      Reviewer #2: “7) The authors need to clarify the experiment with the Ydj1 D36N and Ssa2 R169H. In Reidy et al, they never fully biochemically test this system and it was never examined for Ssa2-Ydj1. The authors would need to do some fundamental experiments to demonstrate the validity and functionality of this double mutant in yeast.”

      Given that this experiment was unable to generate meaningful data, since the mutations affected the kinetics of induction of the GAL1/10 promoter, we do not think the requested biochemical experiments would add any value to the study. Instead, we removed these studies from the manuscript.

      Reviewer #3: “4. Figure 3B: The difference between wt and cct4-1 cells in appearance of septin ring fluorescence is observed at one timepoint. Since this experiment is considered highly relevant, the authors are asked to include another timepoint to bolster the conclusion that Cdc3-GFP folding and thus septin ring assembly is delayed in the CCT mutant.”

      We carried out new experiments with cct4-1 cells using Cdc12-mCherry and Cdc11-GFP with more timepoints than in our original cct4-1 experiments with Cdc3-GFP. Since these experiments provide the same kinds of results, but at multiple timepoints, we do not see the value in repeating the Cdc3-GFP experiment.

      Reviewer #3: “If Ssb1 functions to maintain Cdc12 in an assembly competent state preventing misfolding, one would expect either enhanced degradation or aggregation of Cdc12-mCherry in ssb1 mutant cells. Did the authors check for such scenario? Septin aggregation has been shown in a ssb1 ssb2 double deletion strain (Willmund et al., 2013), yet the data shown here predict that aggregation might already occur in single ssb1 mutants.”

      We already examined septin aggregation in single ssb1 mutants and showed these data (Supplementary Figure 4D). Indeed, this phenotype was the rationale for testing post-translational septin assembly in ssb1 single mutants. We have seen no evidence of septin degradation in any context (as we mentioned on line 889), so we would not expect it here. While we added new text and a very new citation showing that many “misfolded” conformations of wild-type E. coli proteins avoid aggregation and degradation, we do not think that the suggested experiments would add enough value to the current study to justify the effort, time and expense.

      Reviewer #3: “Fig. 3C: The figure showing septin ring fluorescence does not include error bars. This is crucial, also because the difference between wt and ssa4 mutant cells is not large.”

      There are, in fact, error bars included in the figure, as can be most clearly seen for the final timepoint for the ssa4∆ cells. For most of the other timepoints the error bars are smaller than the data point symbols (the circles and squares). We do not think that adjusting the size or opacity of the symbols to better show the error bars will be sufficiently valuable to justify the effort.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the presented work the authors studied the folding and assembly of septin subunits and the role of molecular chaperones in this process. They used a variety of diverse in vitro and in vivo assays to study the interaction between septin subunits and chaperones and to characterize their conformational states. The manuscript includes a huge amount of work that is overall well presented. Yet, the diverse approaches also lead to a tessellated view. While the combined study of septin folding in E. coli and S. cerevisiae cells has advantages, a more stringent analysis in one organism (authentic yeast) would increase rigor. The manuscript also suffers from overinterpretation of data in parts (e.g. translation rate of Cdc12 and how this might be affected by folding and chaperones) and occasionally the study goes astray following aspects that does not seem relevant to this reviewer (e.g. the role of N-terminal proline residues for Cdc12 translation, Fig. 5E/F). The paper will therefore substantially benefit from streamlining and major rewriting. Concerning the roles of chaperones: the function of Tric/CCT in septin folding and assembly is well documented, the involvement of other chaperones (e.g. Hsp70: Ssb2 or Ssa4) remains less clear and was not fully explored. Considering these limitations, a major revision of the study will be necessary.

      Major points:

      1. The co-translational binding of CCT to nascent polypeptide chains has been studied (Stein et al., Mol Cell 2019). While the authors indicate that septin subunits are engaged co-translationally, they do not comment which ones are interacting with CCT and at which state of translation. This information is crucial and should also be mentioned in the discussion section.
      2. The evidence for direct interaction between Cdc10 and CCT is rather weak as it is based on CCT absence in a mass spec list of proteins from a control experiment.
      3. Figure 3: It is recommended to also follow Cdc10-GFP and Cdc12-GFP fluorescence. This will on the one hand generalize the presented findings and provide a direct link to other parts of the study (e.g. crosslinking analysis of Cdc10).
      4. Figure 3B: The difference between wt and cct4-1 cells in appearance of septin ring fluorescence is observed at one timepoint. Since this experiment is considered highly relevant, the authors are asked to include another timepoint to bolster the conclusion that Cdc3-GFP folding and thus septin ring assembly is delayed in the CCT mutant.
      5. Figure 4C: The finding that only ssb1 but not ssb2 knockouts have an effect on joining of free Cdc12-mCherry subunits into septin rings is puzzling. Similarly, Ssb1 largely acts co-translationally, while in this assay post-translational septin ring assembly is monitored. The authors need to comment on these two points. Additionally, they should test whether the appearance of septin ring fluorescence is slowed down in ssb1 mutants (as shown for cct4-1 mutant cells in Figure 3B). If Ssb1 functions to maintain Cdc12 in an assembly competent state preventing misfolding, one would expect either enhanced degradation or aggregation of Cdc12-mCherry in ssb1 mutant cells. Did the authors check for such scenario? Septin aggregation has been shown in a ssb1 ssb2 double deletion strain (Willmund et al., 2013), yet the data shown here predict that aggregation might already occur in single ssb1 mutants.
      6. Figure 5B: It is unclear why Cdc3 is observed in the pulldown of His-tagged Cdc12 (37{degree sign}C), although no Cdc12 was isolated under these conditions. How is that possible? Is the appearance of Cdc3 reflecting non-specific binding to the used resin? Furthermore, the authors observe a specific effect on Cdc12-Cdc11 assembly in the E. coli groEL mutant. How do they rationalize this specific effect as Cdc12-Cdc3 assembly remained unchanged? This observation also seems in conflict with the suggestion of the authors that Cdc12 preferentially recruits Cdc11 before interacting with Cdc3 (page 45, lane 1024).
      7. Figure 5G: The data is not convincing. This reviewer cannot detect a specific Cdc12 band accumulating in presence of GroEL/ES. Furthermore, the activity tests done for the chaperonin system are confusing (Supplemental Figure 7). The ATPase rate (slope!) of GroEL/GroES seems higher as compared to GroEL but according to the authors it should be opposite. The refolding assay using Rhodanese as substrate is also confusing: What is the activity of native Rhodanese? The aggregated Rhodanese sample seems to have substantial activity that is not too different from a GroEL/ES-treated one. From the presented data it is not clear to the reviewer to which extend GroEL/ES prevents aggregation and supports folding of denatured Rhodanese.

      Minor points:

      Fig. 3C: The figure showing septin ring fluorescence does not include error bars. This is crucial, also because the difference between wt and ssa4 mutant cells is not large.

      Review Cross-commenting:

      I also concur with the comments of the other reviewers. The manuscript is in need of extensive revision.

      Significance

      see statement above

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, the authors set out to understand the requirements for yeast Septin folding. They purify Cdc3 from bacteria homo-oligomerized. This was followed by very nice HDX-MS analysis demonstrating that the NTE is flexible and may be intrinsically disordered. Follow up experiments using cross-linking mass spectrometry identified interaction of Cdc10 with Cct3. Finally the authors queried the impact of both yeast and e.coli chaperones on septin folding and demonstrated a dependence on yeast Ssa4 and e. coli GroEL.

      Major comments:

      1. Fig. 1 Is the folding of Cdc3 being measured in cells lacking chaperones mentioned towards the end of the paper or are the authors referring to the lack of yeast proteins?
      2. The cross-linking experiments appears not to have been successful. Why are the Ssas, Ydjs etc not detected here? The authors only pull out one interactor which is not validated going forward.
      3. If the authors had evidence of chaperone interaction from their previous study, why did they not simply do IPs with fragments of the septins/chaperones?
      4. Figure 2: The labeling on the protein structure makes it seem like the exact region for Ydj1 and Hsp70 was experimentally identified, when it hasn't.
      5. While differences between Ssa paralogs are highly interesting, using deletions of Ssas is not useful, given that yeast compensate by overexpressing other paralogs. The yeast GFP Septin assays should be repeated in yeast lacking all Ssas and expressing one paralog on a constitutive promoter (See numerous papers by Sharma and Masison).
      6. How do the authors account for the septin defect in Ssa4 delete cells in unstressed conditions where Ssa4 would be very low already? According to the authors previous work, Ssa2 and 3 should be able to compensate.
      7. The authors need to clarify the experiment with the Ydj1 D36N and Ssa2 R169H. In Reidy et al, they never fully biochemically test this system and it was never examined for Ssa2-Ydj1. The authors would need to do some fundamental experiments to demonstrate the validity and functionality of this double mutant in yeast.
      8. The authors confusingly jump back and forth between different Septins and different chaperone (Ssa1-4, Ydj1, Sis1, Hsp104). We would ask the authors to re-arrange the manuscript, collating all the yeast work in one section and bacterial work in another.

      Review Cross-commenting:

      I agree with the comments made by the other two reviewers.

      Significance

      While the final figure 6 is nice, the data only hint at this model, much more work in vivo and in vitro to solidify this mechanism is needed. Overall, while this work has some merit, the experimental path seems a bit disorganized is highly preliminary-it is unclear what the impactful findings are.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The cellular folding of polymer-forming septins is an important but poorly understood issue. The authors and other groups previously found that cytoplasmic chaperones interact with septins and are involved in oligomerization, but the details have not been analyzed. Therefore, the authors conducted a series of experiments using the budding yeast septin system as a model using many methodologies. Specifically, the experiments include purified yeast septins, in vivo cross-linking, live-cell imaging of yeast, chaperones-deficient strains of E. coli, and a reconstituted cell-free translation system in E. coli. As a result of such experiments, the authors not only gained much insight into the role of chaperones in the de novo folding of septins but also found that chaperonins are co-translationally involved in the folding of septins. Furthermore, they even argue that the translation elongation rates are associated with the co-translational folding, based mainly on the E. coli system and bioinformatics analysis.

      Major comments:

      First, I should say that this paper is complicated to read. There are no logical connections, just a list of many results, making it difficult to grasp what is important. It is a complex combination of budding yeast and E. coli systems, but the E. coli results should not be immediately extended to cases in eukaryotes. Another reason for the difficulty in reading the manuscript is that it contains many negative results.

      The key conclusion in this manuscript seems to be the effect and the role of chaperones in the folding coupled with translation, as shown in the model diagram in Figure 6. However, the experiments related to translation are not justified because all the translation-related results are derived from the data in the E. coli system. Since the goal of this manuscript is to gain insight into the folding of septins in eukaryotic cells, I do not agree with drawing conclusions about septin folding based on the E. coli experiments.

      If the E. coli experiments are excluded, the key results in this manuscript, in my view, are very few, only Fig. 3B and partly Fig. 4D, in which the formation of the septin ring is slowed using a CCT mutant strain. If the point that translation speed is involved is not convincing, then this manuscript does not even provide new information, since the involvement of CCT and the Hsp70 system is not novel. If the model depicted in Fig. 6 is to be justified, the authors need to carefully study the eukaryotic translation system. However, it would be practically difficult because it would take long time and a reconstructed cell-free translation system used in this manuscript is not easy. However, it would be substantially difficult because it would be time consuming and not easy to use a eukaryotic version of a reconstituted cell-free translation system used in this manuscript.

      Minor comments:

      • The only evidence that CCT binds to septin is the list of LC-MS/MS. Western blotting would provide more solid data.
      • Fig. 3B ant related Figures: The experiment to see if GFP-tagged septin accumulates in the bud neck is important, but only the graphs after the analysis are shown. The authors should provide the readers with representative examples from imaging data.

      Review Cross-Commenting:

      I totally agree with both reviewers. I believe the manuscript needs a major revision with significant restructuring. Furthermore, I think it would be better to at least consider the possibility of splitting it up.

      Significance

      The key finding of this manuscript is how chaperones are involved in the de novo folding of septins, which is not conceptually new because of previous findings, including those of the authors. Many methodologies are used, but each is not a novel methodology and is not new. Finally, I am working on the mechanisms of action of chaperones, especially chaperonins, the intracellular dynamics of proteins, and proteomics from both biochemical and cell biology perspectives.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General response to the reviewer

      We thank all reviewers for their constructive comments on our manuscript. We were very pleased to see that the reviewers found our study ‘…represent new insight in the field’ (rev#1) and ‘…contains important and exciting novel findings’ (rev#2), and ‘…gives a more detailed perspective on how Src proteins (Src42A in Drosophila) control epithelial stability and the contraction of specific surfaces of epithelial cells’ (rev#3). The reviewers raised a number of specific points that we partially addressed already in a preliminary revision of the manuscript. Some more points will require some additional experiments that we will incorporate in a fully revised version of the manuscript.

      Reviewer #1

      (Evidence, reproducibility and clarity (Required)): Highest priority: 1) The Src42A knockdown and germline clone experiments both cause defects in cellularization (Fig. 2B and 9A), which could result in differences in the state of the blastoderm epithelium (cell size, cell number, structural integrity, organization, etc.) between the experimental and control conditions. In addition, Src42A knockdown appears to affect the size and shape of the egg (Fig. 9A and 9C). The manuscript would be strengthened if the authors included data to demonstrate that the initial structure of the epithelium is mostly normal (quantifications of cell size, number, etc.) in the Src42A RNAi condition, as this would bolster the argument that germband extension, rather than due to indirect effects resulting from the cellularization defects. The authors may have relevant data to do this on-hand, for example using data associated with figures 1, 3, 6, and 9.

      Response:

      The cellularization phenotype of src42A knockdown embryos has a penetrance of about 50% and exhibits a variable expressivity. We attempted to characterize this phenotype in detail, but failed to identify any dramatic differences in cellularization of the src42A knockdown embryos compared to wild type. The localization of E-cadherin, in turn is not affected, but occasionally, nuclei are dropping out of the blastoderm before cellularization is accomplished. This can result in patches of irregular cellularization, but the blastoderm epithelium in stage 6 embryos did not display major defects in overall structure. We will present additional data on the cellularization phenotypes in the fully revised manuscript. As the referee suggested, we will analyze our data to determine potential effects on the cell size, cell number and overall organization of the blastoderm before germband extension. We plan to present these data as an additional Suppl. Mat. Figure in the full revision.

      Lower priority:

      5) Figure 8 - in my opinion, using a FRAP or photoconversion approach would be a more convincing demonstration of differences in E-cadherin residency times / turnover rate than time-lapse imaging of E-cadherin:GFP alone. Authors should decide whether this improvement is worth the investment.

      Response:

      We thank the reviewer for this comment. While we believe that the data presented in Fig. 8 demonstrates a significant difference in the E-cadherin residence time based on E-cadherin-GFP fluorescence intensity, we agree with the referee that FRAP analyses would provide additional evidence to support our conclusion. For the full revision, we will therefore attempt to perform FRAP-experiments on src42A knockdown embryos expressing E-cadherin-GFP and compare the recovery time to the wild type.

      Reviewer #1 (Significance (Required)):

      The manuscript by Backer et al. examines the function of Src42A in germband extension during Drosophila gastrulation. Prior studies in the field have shown that Src family kinases play an important role in the early embryo, including cellularization (Thomas and Wieschaus 2004), anterior midgut differentiation (Desprat et al. 2008), and germband extension (Sun et al. 2017; Tamada et al. 2021). In this study, the authors showed that Src42A was enriched at adherens junctions and was moderately enriched along junctions with myosin-II. They then showed that maternal Src42A depletion exhibits phenotypes, starting with cellularization and including a defect in germband extension. The authors focus on defects in germband extension and found that Src42A was required for timely rearrangement of junctions and that the Src42A RNAi phenotype is enhanced by Abl RNAi. Finally the authors show that E-cadherin turnover is affect by Src42A depletion.

      Overall, this study provided a higher resolution description of how Src42A regulates the behavior of junctions during germband extension. I thought the authors conclusions were well supported by the data and represent new insight in the field.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: Chandran et al. investigate the role of Src42A in axis elongation during Drosophila gastrulation. Using maternal RNAi and CRISPR/Cas9-induced germline mosaics, they revealed that Src42A is required to contract junctions at anterior/posterior cell interfaces during cell intercalations. Using time-lapse imaging and image analysis, they further revealed the role of Src42A in E-Cad dynamics at cell junctions during this process.

      By analyzing double knockdown embryos for Src42A and Abl, they further showed that Src42A might act in parallel to Abl kinase in regulating cell intercalations. The authors proposed that Src42A is involved in two processes, one affecting tension generated by myosin II and the other acting as a signaling factor at tricellular junctions in controlling E-Cad residence time. Overall, the data are clear and nicely quantified. However, some data do not convincingly support the conclusion, and statistical analyses are missing for an experiment or two. Methods for several quantifications also need improvement in writing. Also, several figures (Figures 6-8) do not match the citation in the text and need to be corrected.

      Page and line numbers were not indicated in the manuscript. For my comments, I numbered pages starting from the title page (Title, page 1; Abstract, page 2, Introduction, pages 3-6; Results, pages 7-14; Discussion, pages 15-18; M&M, 19-23; Figure legends, 28-30) and restarted line numbers for each page. For Figures 6-8 that do not match the citation in the text, I still managed to look at the potentially right panels. All the figure numbers I mention here are as cited in the text. My detailed comments are listed below.

      Response:

      We apologize for the lack of organization of the manuscript and the figure numbering. In the revised version we have added page numbers, line numbers and we corrected the figure numbers.

      Major comments: 1. b-Cat/E-Cad signals at the D/V and A/P junctions in Src42Ai (Figs. 5-6). These data are critical for their major conclusion and should be demonstrated more convincingly.

      In Fig. 5A, the authors said, "When the AP border was cut, the detached tAJs moved slower in Src42Ai embryos compared to control (Fig. 5A)". However, even control tAJs do not seem to move that much in the top panels, and I found the images not very convincing.

      Response:

      We thank the referee for commenting on the lack of clarity in the presentation of the data. The overall movement within the first 10 seconds after the laser cut (determined by movement of adjacent D/V tAJs from each other) was about 2 µm in the wildtype, while in the mutant it was 1 µm. Despite this 50% difference, it may be difficult to appreciate this difference from looking at Fig. 5A in our original submission. The yellow lines in Fig 5A only showed the region of the cut, but did not indicate the movement of the tAJ from each other, which may have led to a distraction from the actual movement. We will change the annotation and the marks within the figure to visualize the movement much more clearly in the full revision. In the fully revised manuscript, we will also add movies from the experiments including marks of the tricellular junctions to follow the displacement as part of the Supplemental Material.

      Based on the genetic interaction between Src42A and Abl using RNAi (Fig. 7), the authors argue that Src42A and Abl may act in parallel. However, the efficiency of Abl RNAi has not been tested. It can be done by RT-PCR or Abl antibody staining. Also, the effect of Abl RNAi alone on germband extension should be tested and compared with Src42A & Abl double RNAi embryos. I expect the experiments can be done within a few weeks without difficulty.

      Response:

      We agree with the referee that it is important to determine the level of depletion in Abl RNAi embryos in order to interpret the genetic relationship between Abl and Src42A. In the full revision of the manuscript, we will follow the advice of the referee and analyze the knockdown, preferably by antibody labeling with an anti-Abl antibody. We will also generate single knockdowns of abl in embryos and determine their effect on germband extension compared to wildtype and src42/abl double knockdown.

      Minor comments:

      Fig. 2 - Fig. 2B: Higher magnification images of the defective cytoplasm can be shown as insets.

      Response:

      We will add some higher magnification images of the cellularization phenotype in the full revision of the manuscript. In addition, as mentioned in the response to reviewer #1, we will provide a more detailed analysis of the cellularization in src42Ai embryos in the fully revised manuscript.

      • Fig. 2E: A simple quantification of the penetrance of cuticle defects in Src42A mutants and RNAi will be helpful, as shown in Fig. S3.

      Response:

      In the full revision, we will add the quantification of the occurrence of the different classes of cuticle phenotypes.

      Fig. 9 - Fig. 9A: Magnified views of the cytoplasmic clearing can be added as insets.

      Response: As described in our response to the comments made by referee #1, we will add a more detailed analysis of the cellularization phenotype in the full revision.

      Page 14, lines 9-10: More explicit description of the phenotype rather than just "stronger compared to Src42Ai" will be helpful.

      Response:

      In the full revision, we will add a more detailed description of the phenotype and re-analyze and present data on the hatching rate, stage of lethality and cuticle phenotypes.

      Reviewer #2 (Significance (Required)): This work revealed the role of Src42A in regulating germband extension. A previous study suggested the roles of Src42A and Src64 in this developmental process using a partial loss of both proteins (Tamada et al., 2021). Using different approaches, the authors demonstrated a role of Src42A in regulating E-Cad dynamic at cell junctions during Drosophila axis elongation. Most of the analyses were done with maternal knockdown using RNAi, but they successfully generated germline clones for the first time and confirmed the RNAi phenotypes. Overall, this work contains important and exciting novel findings. This work will be of general interest to cell and developmental biologists, particularly researchers studying epithelial morphogenesis and junctional dynamics. I have expertise in Drosophila genetics, epithelial morphogenesis, imaging, and quantitative image analysis.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Chandran et al. report on the function of Src42A during cell intercalation in the early Drosophila gastrula. They create a Src42A-specific antibody (there are two Src genes in the fly genome) and examine the localization of Src42A and observe a planar-polarized distribution at cell interfaces. They then measure cell-contractile dynamics and show that T1 contraction is slower after Src42A disruption. The authors then argue that Src42A functions in a parallel pathway to the Abl protein, and that E-cadherin dynamics (turnover) is altered in Src42A disrupted embryos. Src function at these stages has been studied previously (though not to the degree that this study does), and in some respects the manuscript feels a little preliminary (please label figures with figure number!), but after editing this should be a polished study that merits publication in a developmentally-focused journal.

      1) Does the argument that Src42A has two functions fully make sense? Myosin II function is known to affect E-cadherin stability (and vice versa), so it seems that Src42A could affect both MyoII and Ecad by either decreasing Myosin II function/engagement at junctions or by destabilizing Ecad.

      Response:

      We thank the referee for raising an important point that we may not have discussed appropriately in our initial submission. We agree that the reciprocal relationship between actomyosin and E-cadherin might not be reflected equivocally in our manuscript. As the referee points out, Src42A could affect both MyoII planar localization and E-cadherin dynamics through the same pathway. Previous studies showed that Src is involved in translating the planar polarized distribution of the Toll-2 receptor by recruiting Pi3-Kinase activity to the Toll-2 receptor complex resulting in planar polarized distribution of MyoII at the A/P interfaces. These data, however do not address the possibility that a well-known Src target, the E-cadherin/ß-Catenin complex, which is extensively remodeled in germband extension contributes to the delay in germband extension. The observed defects in both studies can be attributed to both a defect in abnormal planar polarization of MyoII and the abnormal dynamics of the E-cadherin/ß-catenin complex. In either of these cases, we suggest that Src42A phosphorylates distinct substrates, the Toll-2 intracellular domain in the MyoII planar polarity pathway and the E-cad/ß-Cat complex controlling E-cad dynamics. Given the relationship between MyoII and E-cadherin, however, it is not possible to decide whether these two effects are independent functions of Src42A or are consequences of each other. Since we cannot resolve a possible epistatic relationship between these potential two activities of Src42A, we decided to extend the discussion on this topic by taking both possible scenarios into account and discussing them appropriately. We will add this discussion in the full revision of the manuscript.

      ) One obvious question that arises is the nature of cleavage defects that are mentioned that happen previously to intercalation. For example, is E-cad normal prior to intercalation initiating? How specific are the observed defects to GBE?

      Response:

      please see response to referee #1

      3) Pg. 10, "the shrinking junction along the AP axis strongly reduces its length with an average of 1.25 minute" - what is this measurement? How much is "strongly"?

      Response:

      We thank the referee for pointing out our inappropriate qualitative statement of the experimental data, which was indeed misleading. The measurement of the shrinking junction was based upon the time it takes for the AP interface junction between two adjacent vertices on the DV axis to shrink into a single 4-cell vertex. The time for this contraction was on average 1 minute 25 seconds. The data in Fig.4 A’,C show that after 2 minutes in the control embryo 100% of the observed AP junctions have collapsed and the extension of the new DV junction along AP axis has begun. At the same timepoint of 2 minutes in the src42A knockdown, we show in Fig. 4B’,D that the shrinking of the AP junction interface has still not been completed in 60% of the cases.

      In the full revision, we will remove the qualitative statement and replace it with a correct description of the measurements taken and will refer to the data described in Fig. 4 A-D.

      4) Also pg. 10, "the AP junction was not markedly reduced after 1 minute" - what is the criteria for this statement? X%? 1 minute is very specific, it feels like how much of a reduction/non-reduction should also be specific.

      Response:

      please see response to point 3.

      Reviewer #3 (Significance (Required)):

      This study gives a more detailed perspective on how Src proteins (Src42A in Drosophila) control epithelial stability and the contraction of specific surfaces of epithelial cells.

      Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #2 and #3 noted that the manuscript was somewhat unorganized with regard to lacking the numbering of pages, lines and figures. We also noted that in the submission process the figures were not presented in the correct order. In the preliminary revision of the manuscript, we fixed these problems to facilitate the evaluation of our transferred manuscript by editorial boards.

      In addition, we also addressed issues that the referees mentioned by editing the text according to their comments. We also addressed problems regarding the presentation of the figures and statistical analyses of the data. The following changes were made:

      1. We added page numbers and line numbers.
      2. We added figure numbers to the figure panels.
      3. We corrected ordering of figures in the transferred manuscript.
      4. We addressed the following comments by statistical analyses, editing the text and the figures:

        Regarding comments from Reviewer #1:

      Highest Priority:

      2) There is a discrepancy in the staging of embryos used between some of the analyses, which make it hard to interpret some of the data. For example, characterization of the knockdowns in Fig. 1A and B are based on stages 10 and 15, whereas the majority of the paper is focused on earlier stages 6 - 8 during germband extension (e.g., Fig. 1D). The analysis for Fig. 1B would be more meaningful if it was done on the same stages used for subsequent phenotypic analysis so they can be directly compared.

      Response:

      We thank the referee for pointing out an apparent misunderstanding caused by the description of Fig. 1A,B. The data presented in Fig.1A and 1B do not show RNAi knockdown experiments, but show a comparison between embryos that are heterozygous or homozygous for the loss-of-function allele src42A26-1. These data were intended to demonstrate that zygotic mutants still maintain levels of maternal Src42A protein up until late stages of development. Data for embryos at an earlier stage (stage 5) were shown in the Supplementary Fig. S1E, where no difference in protein levels of Src42A can be observed between heterozygous and homozygous zygotic src42A26-1 embryos.

      At the beginning of the results sections 1 and 2 of the preliminary revised manuscript, we added a sentence to address the referee’s concern that earlier stages exhibit no difference in protein levels and will refer to Fig. S1E. We also more explicitly spelled that out that the experiment (referring to Fig.1A,B and S1) was intended to look at zygotic mutants and to demonstrate that our novel Src42A antibody was able to detect the reduction of maternal Src42A protein in mid- to late-stage homozygous zygotic embryos.

      3) There is incongruence between figures in terms of which junctional pools (bAJs vs. tAJs) of beta-catenin and E-cadherin are quantified that makes it difficult to draw comparisons between analyses. For example, pTyr levels are examined for both bAJs and tAJs in Figure 3, however, only tAJs are considered in Fig. 8. Similarly, in some cases planar cell polarity is considered (e.g., comparison of levels at AP vs DV bAJs in Fig. 6 and 9), and in other cases (e.g. Fig. 8) it is not.

      Response:

      We thank the referee for commenting on the different readouts for different pools of cell junctions in our experiments. In our study we considered effects on src42A on both, bAJs and tAJs by RNAi knockdown of src42A. We decided to present the data for bAJ and tAJ in separate figures for clarity and structure. For example, the data for the effect of src42A knockdown on the planar polarized distribution on bAJs of E-cadherin were presented in Fig.6, while the effect on E-cadherin residence time in tAJs were presented in Fig.8. The analysis pTyr levels considered both pools in order to determine whether src42A knockdown leads to an overall reduction of pTyr levels or to a reduction in a specific junctional pool. From our data we conclude that pTyr levels show a similar reduction in both, the bAJ and the tAJ junctions.

      In order to address the reviewer’s comment, we have linked the figures more stringently with the results text of the preliminary revision. We only referred to the reduction in PTyr levels in Fig. 3 to point out that both junctional pools are affected by reduced PTyr in src42i embryos. Furthermore, we referred to the individual figure panels when addressing junctional pools and explain the rationale to focus on particular pools (bAJs or tAJ) in the experiments in detail. For Fig. 6 we point out in the preliminary revised manuscript that we focus the analyses on the known planar polarized distribution of beta-catenin and E-Cadherin.

      Lower priority: 1) Introduction, 2nd paragraph - The modes of cell behaviors described to drive cell intercalation leaves out another clear example in the literature - Sun et al., 2017 - which describes a basolateral cell protrusion-based mechanism. While the authors cite this paper later, leaving it out when summarizing the state of the field misrepresents the current knowledge of the range of mechanisms responsible.

      Response:

      We thank the referee for this remark. In the preliminary revision, we have added to the introduction that the cell behaviors associated with germband elongation include apical and basolateral rearrangements of the cells indicating that basolateral protrusions also contribute to the set of mechanisms that drive germ band elongation.

      2) 'defective cytoplasm' - this term is confusing, and could perhaps be replaced with 'cellularization defect', or something similar.

      Response:

      We agree that the term we applied for the cellularization defect may be misleading. The observation, we intended to describe with the term was a defect in the cytoplasmic clearing which occurs in the last syncytial division and the beginning of the cell formation process. We changed the description of this observation according now refer to the defect in the preliminary revised manuscript as ‘cytoplasmic clearing defect’.

      3) Tests of statistical significance are not uniformly applied across the figures. For instance, Figures 3G + H indicate statistical significance, but Fig. 3D + E do not. Performing statistical tests throughout the paper, or clearly articulating a rationale when they are not used, would strengthen the manuscript. Specifically, the authors should consider this for Fig. 3D + E, and Fig. 7D + E, to support their arguments that rates of germband extension are different between conditions.

      Response:

      We agree with the reviewer and have provided statistical analysis for the data displayed in Fig. 3D,E and Fig. 7D,E in the preliminary revision of the manuscript.

      4) Page 12 - "We found that Src42A showed a distinct localization at the tAJs (Fig. 1B)": Figure 1B shows a quantification of levels at bAJs, not tAJs.

      Response:

      In the preliminary version of the revised manuscript, we added a quantification of the localization of Src42A at the tAJs as a part of Suppl Fig. S4. In Fig. S4A-C we show that Src42A is enriched in comparison to the bAJs.

      Regarding comments from reviewer #2:

      Major Comments:

      In Fig. 6A, b-Cat signals look fuzzier and dispersed and have more background signals in the control, compared to the Src42Ai background. Also, b-Cat signals in the control image do not seem to show enrichment at the D/V border, as shown in Tamada et al., 2012.

      Response:

      We agree with the referee that the image in Fig. 6A for the control is fuzzier and looks dispersed. This is due to the fixation method that we used. In this experiment we did not apply heat fixation, but used formaldehyde fixation in which b-catenin protein, in addition to the junctional pool, is also maintained in the cytoplasm creating the fuzzy cytoplasmic staining. We chose to do this in order to be able to co-immunolabel the embryos with b-catenin and E-cadherin antibodies; the latter staining is not working with the heat fixation applied in the Tamada et al. 2012 paper. Despite the slightly lower quality of the staining, the quantification of the data clearly indicated an effect of src42A knockdown on the planar polarized distribution of E-cad/b-cat complex does show an enrichment. In the preliminary revision added a note to the figure legend to indicate the fact that the fixation procedure was not optimized for b-catenin junctional staining. In the preliminary revision we also added a quantification of live imaging data recording E-cadherin-GFP in wild-type and src42Ai embryos. We present these additional data in Fig. S5 in the preliminary revision of the manuscript. These data are consistent with the results in Fig. 6 from the immunolabeling and support our conclusion that E-cadherin AP/DV ratio is increased in Src42A knockdown embryos.

      In Fig. 6B, C, it is not clear how the intensity was measured and how normalization was done. Was the same method used for these quantifications as "Protein levels at bicellular and tricellular AJs" on pages 21-22? Methods should be written more explicitly with enough details.

      Response:

      We thank the referee for pointing out the lack of detail in explaining how the quantification was done. In the preliminary revision of the manuscript, we extended a paragraph entitled ‘Protein levels at bicellular and tricellular junctions’ in the methods section that will serve this purpose and describe the methods that were applied for each quantification and the method as to how the data were normalized.

      Does each sample (experimental repeat) for the D/V border in Fig. 6B match the one right below for the A/P border in Fig. 6C? It should be clearly mentioned in the figure legend. The ratio of the DV intensity to AP intensity will better show the compromised planar polarity of the b-Cat/E-Cad complex.

      Response:

      We thank the reviewer for pointing out a lack of clarity in our presentation. The experimental repeats for each measurement do indeed match, i.e. the measurement of the DV border matches the same adjacent 4-cell pair in the same embryo and in total 5 distinct embryos were analyzed for each experiment. In the preliminary revision of the manuscript, we explain this detail of the experimental design in the figure legend. In the preliminary revision, we also determined the ratios of DV/AP cell interfaces for b-Cat and E-Cad and added this quantification as panel 6C and 6E for a clearer presentation of the data.

      Minor notes: Page 4, missing comma after "For example"

      Response: The text was edited accordingly.

      Page 4, "inevitable" does not make sense in this context

      Response: We eliminated ‘inevitable’ and replaced it with ‘critical’ to better indicate the importance of Canoe protein for germband elongation.

      Page 7, lines 6-7 - The localization of Src42A in control should be described in more detail and more clearly here.

      Response: In the preliminary revised manuscript, we extended our description of the distribution of Src42A in more detail pointing out its dynamics and differential distribution at distinct plasma membrane domains.

      Supplemental Fig S1 - Fig. S1D: Based on the head structure and the segmental grooves, the embryo shown here is close to late stage 13/early stage 14, not stage 15. - Fig S1E: It will be helpful if the predicted protein band and non-specific bands are indicated by arrows/arrowheads in the figure.

      Response:

      We thank the referee for their careful observation of the embryonic stage. We agree that the embryo was actually a younger stage. In the preliminary revision, we replaced the images with an example of an older stage. We will also add clear annotations as arrows to clearly mark the specific protein bands in Fig. S1E.

      Page 7, lines 21-22 - "Src42A was slightly enriched at the AP interface" - To argue that, quantification should be provided.

      Response:

      We thank the referee for pointing out a qualitative statement that we made with regard to the distribution of Src42A at the AP cell interfaces. In the preliminary revision of the manuscript, we present an additional quantification of the imaging data of Src42A immunolabeling. In Figure S4A-C, we now present a quantification of the enrichment of Src42A at the tricellular junctions. In addition, the new Fig. S4D,E shows a quantification of the planar polarized distribution of Src42A at the AP cell interfaces.

      Figure 1 - Fig. 1B: Src42A levels should be compared between control (Src42A/+) and Src42A/Src42A for each stage. It currently shows a comparison between Src42A/Src42A of stages 10 and 15.

      Response:

      We thank the referee for the comment. As indicated in our response to referee #1, the point of this analysis was to (1) provide evidence for the specificity of our new anti-Src42A antibody and (2) to demonstrate the presence of substantial material contribution of Src42A protein in zygotic mutant. We do not see the advantage to provide a detailed developmental Western-blot analysis, but we provide data in Suppl. Mat Fig S1E showing that the level of Src42A is unimpaired in stage 6 zygotic src42A[26-1] homozygous mutant embryos.

      • Fig. 1B: The figure legend says, "dotted line represents mean value and error bars," but there are no dotted lines shown in the figure. Also, what p-value is for ****? It should be mentioned in the figure legend. It also says Src42A levels were normalized against E-Cad intensity here (stages 10 and 15). They have shown that E-Cad levels are affected in Src42A RNAi during gastrulation (Fig. 6). Is E-Cad not affected in Src42A26-1 zygotic mutants at stages 10 and 15?

      Response:

      We thank the referee for pointing out inaccuracies in the presentation and the description of Fig.1B. In the preliminary revision, we emphasized the marks on the graph and provide p-values throughout. Regarding the E-Cadherin levels: E-cadherin levels were altered in src42A RNAi knockdown embryos, but not in zygotic mutants, even at later developmental stages.

      Page 8, line 14 - "Embryos expressing TRiP04138 showed reduced hatching rates with variable penetrance and expressivity depending on the maternal Gal4 driver used (Fig. 2B)" - Fig. 2B doesn't seem to be a right citation for this sentence.

      Response:

      We agree with the referee and in the preliminary revised manuscript we corrected the reference to the conclusion drawn from Figure 2A’, which does show the relationship of hatching rate to the various maternal Gal4 drivers.

      • Fig. 2C: It will be helpful to indicate two other non-specific bands in the figure with arrows/arrowheads with a description in the figure legend.

      Response:

      In the preliminary revision, we added an arrow to mark the band specific for Src42A and asterisks to mark unspecific bands in Fig 2C.

      Page 9, line 9 - This is the first time that the fast and the slow phases of germband extension are mentioned. As these two phases are used to compare the Src42A and Src42A Abl double RNAi phenotypes, they should be introduced and explained better earlier, perhaps in Introduction.

      Response:

      We thank the referee for pointing out that the two phases of germband extension were not introduced. We added a sentence to introduce and define the distinct phases of extension movements in the preliminary revision.

      Fig. 3 - Fig. 3A: It will be helpful to mark the starting and the ending points of germband elongation with different markers (arrows vs. arrowheads or filled vs. empty arrowheads).

      Response:

      In the preliminary revision, we added distinct markers to indicate the start and endpoints of germband elongation to make this figure easier to read.

      • Fig. 3C figure legend: R2 is wrongly mentioned in Fig. 3D, E. Also, R2 (coefficient of determination) needs to be defined either in the figure legend or Materials & Methods.

      Response:

      We thank the referee for pointing this misleading reference to us. In the preliminary revision we corrected the reference to R2 in Fig,3D,E and will describe the definition of R2 in the figure legend.

      • Fig. 3D, E: statistical analysis is missing.

      Response:

      In the preliminary revision, we included a statistical analysis of the data (see ref #1). We changed the figure to indicate the data sets that were analyzed and added the p-values to the figure legend.

      • Fig. 3G and H should be cited in the text.

      Response:

      In the preliminary revision, we added references to Fig 3G,H in the result section to the annotation of Fig.3F).

      • Fig. 3F: It should be mentioned that the heat map is shown for pY20 signals in the figure legend, with an intensity scale bar in the figure.

      Response:

      In the preliminary revision, we added an intensity scale bar to the figure panel and mentioned the relationship to the PY20 signal.

      Fig. 7A: Arrows can be added to mark the delayed germband extension.

      Response:

      In the preliminary revision, we added arrows to mark the anterior and posterior extent of the germband.

      Fig. 8A: It should be mentioned that the heat map is shown for E-Cad signals in the figure legend, with an intensity scale bar in the figure.

      Response:

      In the preliminary revision, we added an intensity scale to the heat map and mention the relationship to the E-cadherin signal in the figure legend.

      Fig. S3G: An arrowhead can be added to the gel image to indicate the band described in the legend.

      Response:

      In the preliminary revision, we added an arrow to help annotating the Src42A-specific bands on the Western blot.

      • Fig. 9B: Arrow/arrowheads can be added to show the absence of the signals in the nurse cells.

      Response:

      In the preliminary revision, we added markers to help recognizing the reduced signal in the nurse cells and the oocyte.

      • Fig. 9C: Indicate the ending point of the germband extension by arrows.

      Response: In the preliminary revision, we added arrows to mark the anterior and posterior extent of the germband.

      Regarding comments from reviewer #3:

      Minor notes: Page 4, missing comma after "For example"

      Response: The text was edited accordingly.

      Page 4, "inevitable" does not make sense in this context Response:

      In the preliminary revision, we eliminated ‘inevitable’ and replaced it with ‘critical’ to better indicate the importance of Canoe protein for germband elongation.

      Description of analyses that authors prefer not to carry out

      Referee #1 point2 and Referee#2 minor comment figure 1. Both referees suggest that figure 1 AB should include earlier developmental stages according to the stages looked at in the RNAi knockdown experiment.

      Response:

      The referees’ comments are likely based on a misunderstanding. The data that the reviewer are referring to present analyses of the zygotic phenotype of embryos homozygous for the src42A26-1 loss of function allele. They are not related to the maternal RNAi knockdown experiments, but were meant to demonstrate the existence and extent of a maternal pool of Src42A protein, that persists even to late stages in development. The maternal knockdown mutants are analyzed in detail at the appropriate stages in Fig. 2.

      As described in our response above, we don’t feel that a detailed developmental stage Western analysis of wildtype and src42A26-1 embryos would provide significant additional insights. As mentioned in our response above, data for an earlier developmental stage (before germband elongation, as requested by the referees, were provided in Suppl. Fig. S1E.

      Referee #1 Point 6) Figure 8E - showing images of multiple tAJs, rather than z-slices of a single vertex, would better support the claim here, as the assertion is that Src42a levels are different between control and sdk RNAi conditions, and not that it varies in the z-dimension.

      Response:

      The image series of Fig. 8E shows one representative example of multiple tAJs that have been imaged for this experiment (n=6 for wild type and n=10 for sdk RNAi). We think that the presentation of Z-slices for this experiment is important as the protein distribution needs to be considered for a larger area along the apical-lateral cell interface. In addition the quantification of the data for multiple tAJs was presented in Fig. 8F,G as a graph. We would therefore rather not change this figure in the revised manuscript.

      Referee #3 suggests that anti MyoII staining should accompany the analysis of tension measurements in the germband.

      As this analysis has already been performed by Tamada et al. 2021, we decided not to reproduce these data, but rather extend the analysis towards tension measurements, which support the findings by Tamada et al. 2021 on a functional level. We do not see the added value of adding MyoII labeling.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Chandran et al. report on the function of Src42A during cell intercalation in the early Drosophila gastrula. They create a Src42A-specific antibody (there are two Src genes in the fly genome) and examine the localization of Src42A and observe a planar-polarized distribution at cell interfaces. They then measure cell-contractile dynamics and show that T1 contraction is slower after Src42A disruption. The authors then argue that Src42A functions in a parallel pathway to the Abl protein, and that E-cadherin dynamics (turnover) is altered in Src42A disrupted embryos. Src function at these stages has been studied previously (though not to the degree that this study does), and in some respects the manuscript feels a little preliminary (please label figures with figure number!), but after editing this should be a polished study that merits publication in a developmentally-focused journal.

      1. Does the argument that Src42A has two functions fully make sense? Myosin II function is known to affect E-cadherin stability (and vice versa), so it seems that Src42A could affect both MyoII and Ecad by either decreasing Myosin II function/engagement at junctions or by destabilizing Ecad.
      2. One obvious question that arises is the nature of cleavage defects that are mentioned that happen previously to intercalation. For example, is E-cad normal prior to intercalation initiating? How specific are the observed defects to GBE?
      3. Pg. 10, "the shrinking junction along the AP axis strongly reduces its length with an average of 1.25 minute" - what is this measurement? How much is "strongly"?
      4. Also pg. 10, "the AP junction was not markedly reduced after 1 minute" - what is the criteria for this statement? X%? 1 minute is very specific, it feels like how much of a reduction/non-reduction should also be specific.
      5. It seemed odd to mention altered myosin levels but then skip over a measurement of myosin in favor of an indirect measurement such as interface recoil. Again (point 1), it seems that changes in Myosin II recruitment could cause changes in Ecad turnover.

      Minor notes:

      Page 4, missing comma after "For example"

      Page 4, "inevitable" does not make sense in this context

      Significance

      This study gives a more detailed perspective on how Src proteins (Src42A in Drosophila) control epithelial stability and the contraction of specific surfaces of epithelial cells.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Chandran et al. investigate the role of Src42A in axis elongation during Drosophila gastrulation. Using maternal RNAi and CRISPR/Cas9-induced germline mosaics, they revealed that Src42A is required to contract junctions at anterior/posterior cell interfaces during cell intercalations. Using time-lapse imaging and image analysis, they further revealed the role of Src42A in E-Cad dynamics at cell junctions during this process.

      By analyzing double knockdown embryos for Src42A and Abl, they further showed that Src42A might act in parallel to Abl kinase in regulating cell intercalations. The authors proposed that Src42A is involved in two processes, one affecting tension generated by myosin II and the other acting as a signaling factor at tricellular junctions in controlling E-Cad residence time. Overall, the data are clear and nicely quantified. However, some data do not convincingly support the conclusion, and statistical analyses are missing for an experiment or two. Methods for several quantifications also need improvement in writing. Also, several figures (Figures 6-8) do not match the citation in the text and need to be corrected.

      Page and line numbers were not indicated in the manuscript. For my comments, I numbered pages starting from the title page (Title, page 1; Abstract, page 2, Introduction, pages 3-6; Results, pages 7-14; Discussion, pages 15-18; M&M, 19-23; Figure legends, 28-30) and restarted line numbers for each page. For Figures 6-8 that do not match the citation in the text, I still managed to look at the potentially right panels. All the figure numbers I mention here are as cited in the text. My detailed comments are listed below.

      Major comments:

      1. b-Cat/E-Cad signals at the D/V and A/P junctions in Src42Ai (Figs. 5-6). These data are critical for their major conclusion and should be demonstrated more convincingly.

      In Fig. 5A, the authors said, "When the AP border was cut, the detached tAJs moved slower in Src42Ai embryos compared to control (Fig. 5A)". However, even control tAJs do not seem to move that much in the top panels, and I found the images not very convincing.

      In Fig. 6A, b-Cat signals look fuzzier and dispersed and have more background signals in the control, compared to the Src42Ai background. Also, b-Cat signals in the control image do not seem to show enrichment at the D/V border, as shown in Tamada et al., 2012.

      In Fig. 6B, C, it is not clear how the intensity was measured and how normalization was done. Was the same method used for these quantifications as "Protein levels at bicellular and tricellular AJs" on pages 21-22? Methods should be written more explicitly with enough details.

      Does each sample (experimental repeat) for the D/V border in Fig. 6B match the one right below for the A/P border in Fig. 6C? It should be clearly mentioned in the figure legend. The ratio of the DV intensity to AP intensity will better show the compromised planar polarity of the b-Cat/E-Cad complex. 2. Based on the genetic interaction between Src42A and Abl using RNAi (Fig. 7), the authors argue that Src42A and Abl may act in parallel. However, the efficiency of Abl RNAi has not been tested. It can be done by RT-PCR or Abl antibody staining. Also, the effect of Abl RNAi alone on germband extension should be tested and compared with Src42A & Abl double RNAi embryos. I expect the experiments can be done within a few weeks without difficulty.

      Minor comments:

      Page 2, line 14 - The abbreviation for tAJs was not introduced before.

      Page 7, line 6 - A reference should be cited for the Src42A26-1 allele.

      Figure 1 - Fig. 1B: Src42A levels should be compared between control (Src42A/+) and Src42A/Src42A for each stage. It currently shows a comparison between Src42A/Src42A of stages 10 and 15. - Fig. 1B: The figure legend says, "dotted line represents mean value and error bars," but there are no dotted lines shown in the figure. Also, what p-value is for ****? It should be mentioned in the figure legend. It also says Src42A levels were normalized against E-Cad intensity here (stages 10 and 15). They have shown that E-Cad levels are affected in Src42A RNAi during gastrulation (Fig. 6). Is E-Cad not affected in Src42A26-1 zygotic mutants at stages 10 and 15?

      Page 7, lines 6-7 - The localization of Src42A in control should be described in more detail and more clearly here.

      Supplemental Fig S1

      • Fig. S1D: Based on the head structure and the segmental grooves, the embryo shown here is close to late stage 13/early stage 14, not stage 15.
      • Fig S1E: It will be helpful if the predicted protein band and non-specific bands are indicated by arrows/arrowheads in the figure.

      Page 7, lines 21-22

      • "Src42A was slightly enriched at the AP interface" - To argue that, quantification should be provided.

      Page 8, line 14

      • "Embryos expressing TRiP04138 showed reduced hatching rates with variable penetrance and expressivity depending on the maternal Gal4 driver used (Fig. 2B)" - Fig. 2B doesn't seem to be a right citation for this sentence.

      Fig. 2

      • Fig. 2B: Higher magnification images of the defective cytoplasm can be shown as insets.
      • Fig. 2C: It will be helpful to indicate two other non-specific bands in the figure with arrows/arrowheads with a description in the figure legend.
      • Fig. 2E: A simple quantification of the penetrance of cuticle defects in Src42A mutants and RNAi will be helpful, as shown in Fig. S3.

      Page 9, line 9

      • This is the first time that the fast and the slow phases of germband extension are mentioned. As these two phases are used to compare the Src42A and Src42A Abl double RNAi phenotypes, they should be introduced and explained better earlier, perhaps in Introduction.

      Fig. 3

      • Fig. 3A: It will be helpful to mark the starting and the ending points of germband elongation with different markers (arrows vs. arrowheads or filled vs. empty arrowheads).
      • Fig. 3G and H should be cited in the text.
      • Fig. 3C figure legend: R2 is wrongly mentioned in Fig. 3D, E. Also, R2 (coefficient of determination) needs to be defined either in the figure legend or Materials & Methods.
      • Fig. 3D, E: statistical analysis is missing.
      • Fig. 3F: It should be mentioned that the heat map is shown for pY20 signals in the figure legend, with an intensity scale bar in the figure.

      Fig. 7A: Arrows can be added to mark the delayed germband extension.

      Fig. 8A: It should be mentioned that the heat map is shown for E-Cad signals in the figure legend, with an intensity scale bar in the figure.

      Fig. S3G: An arrowhead can be added to the gel image to indicate the band described in the legend.

      Fig. 9

      • Fig. 9A: Magnified views of the cytoplasmic clearing can be added as insets.
      • Fig. 9B: Arrow/arrowheads can be added to show the absence of the signals in the nurse cells.
      • Fig. 9C: Indicate the ending point of the germband extension by arrows.

      Page 14, lines 9-10: More explicit description of the phenotype rather than just "stronger compared to Src42Ai" will be helpful.

      Significance

      This work revealed the role of Src42A in regulating germband extension. A previous study suggested the roles of Src42A and Src64 in this developmental process using a partial loss of both proteins (Tamada et al., 2021). Using different approaches, the authors demonstrated a role of Src42A in regulating E-Cad dynamic at cell junctions during Drosophila axis elongation. Most of the analyses were done with maternal knockdown using RNAi, but they successfully generated germline clones for the first time and confirmed the RNAi phenotypes. Overall, this work contains important and exciting novel findings.

      This work will be of general interest to cell and developmental biologists, particularly researchers studying epithelial morphogenesis and junctional dynamics.

      I have expertise in Drosophila genetics, epithelial morphogenesis, imaging, and quantitative image analysis.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Highest priority:

      1. The Src42A knockdown and germline clone experiments both cause defects in cellularization (Fig. 2B and 9A), which could result in differences in the state of the blastoderm epithelium (cell size, cell number, structural integrity, organization, etc.) between the experimental and control conditions. In addition, Src42A knockdown appears to affect the size and shape of the egg (Fig. 9A and 9C). The manuscript would be strengthened if the authors included data to demonstrate that the initial structure of the epithelium is mostly normal (quantifications of cell size, number, etc.) in the Src42A RNAi condition, as this would bolster the argument that germband extension, rather than due to indirect effects resulting from the cellularization defects. The authors may have relevant data to do this on-hand, for example using data associated with figures 1, 3, 6, and 9.
      2. There is a discrepancy in the staging of embryos used between some of the analyses, which make it hard to interpret some of the data. For example, characterization of the knockdowns in Fig. 1A and B are based on stages 10 and 15, whereas the majority of the paper is focused on earlier stages 6 - 8 during germband extension (e.g., Fig. 1D). The analysis for Fig. 1B would be more meaningful if it was done on the same stages used for subsequent phenotypic analysis so they can be directly compared.
      3. There is incongruence between figures in terms of which junctional pools (bAJs vs. tAJs) of beta-catenin and E-cadherin are quantified that makes it difficult to draw comparisons between analyses. For example pTyr levels are examined for both bAJs and tAJs in Figure 3, however, only tAJs are considered in Fig. 8. Similarly, in some cases planar cell polarity is considered (e.g., comparison of levels at AP vs DV bAJs in Fig. 6 and 9), and in other cases (e.g. Fig. 8) it is not.

      Lower priority:

      1. Introduction, 2nd paragraph - The modes of cell behaviours described to drive cell intercalation leaves out another clear example in the literature - Sun et al., 2017 - which describes a basolateral cell protrusion-based mechanism. While the authors cite this paper later, leaving it out when summarizing the state of the field misrepresents the current knowledge of the range of mechanisms responsible.
      2. 'defective cytoplasm' - this term is confusing, and could perhaps be replaced with 'cellularization defect', or something similar.
      3. Tests of statistical significance are not uniformly applied across the figures. For instance, Figures 3G + H indicate statistical significance, but Fig. 3D + E do not. Performing statistical tests throughout the paper, or clearly articulating a rationale when they are not used, would strengthen the manuscript. Specifically, the authors should consider this for Fig. 3D + E, and Fig. 7D + E, to support their arguments that rates of germband extension are different between conditions.
      4. Page 12 - "We found that Src42A showed a distinct localization at the tAJs (Fig. 1B)": Figure 1B shows a quantification of levels at bAJs, not tAJs.
      5. Figure 8 - in my opinion, using a FRAP or photoconversion approach would be a more convincing demonstration of differences in E-cadherin residency times / turnover rate than time-lapse imaging of E-cadherin:GFP alone. Authors should decide whether this improvement is worth the investment.
      6. Figure 8E - showing images of multiple tAJs, rather than z-slices of a single vertex, would better support the claim here, as the assertion is that Src42a levels are different between control and sdk RNAi conditions, and not that it varies in the z-dimension.

      Significance

      The manuscript by Backer et al. examines the function of Src42A in germband extension during Drosophila gastrulation. Prior studies in the field have shown that Src family kinases play an important role in the early embryo, including cellularization (Thomas and Wieschaus 2004), anterior midgut differentiation (Desprat et al. 2008), and germband extension (Sun et al. 2017; Tamada et al. 2021). In this study, the authors showed that Src42A was enriched at adherens junctions and was moderately enriched along junctions with myosin-II. They then showed that maternal Src42A depletion exhibits phenotypes, starting with cellularization and including a defect in germband extension. The authors focus on defects in germband extension and found that Src42A was required for timely rearrangement of junctions and that the Src42A RNAi phenotype is enhanced by Abl RNAi. Finally the authors show that E-cadherin turnover is affect by Src42A depletion.

      Overall, this study provided a higher resolution description of how Src42A regulates the behavior of junctions during germband extension. I thought the authors conclusions were well supported by the data and represent new insight in the field.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Uveal melanoma is the most common primary intraocular malignancy in adults. About 50% of patients develop metastases, being the liver the most common place for them. Despite over 50 years of study, little progress has been done in efficacious treatments. In this report, the authors aimed at a better understanding of the mechanistic drivers for cancer aggressiveness and poor overall survival at the metastatic stage. To accomplish this, the authors performed a series of elegant genomics and transcriptomics analyses and identified molecular aberrations in miR16, which has been previously associated with other malignancies. The authors demonstrated that high level of miR16 sponges inversely correlate with poor overall survival. As a reference and validation, they are using the TCGA data analysis. Lastly, the authors generated a signature for survival prediction based on 4 genes, which was confirmed using an independent study.

      The methodology was very elegant. The appropriate analyses were done.

      Significance

      This manuscript provides incremental knowledge to the field. In the last 5 years there are many manuscripts addressing different transcription factors or miRNA molecules and their role in different cancers. Uveal melanoma is an orphan disease with high unmet need. Prognostication is highly valuable, however; it is the treatment where we need the most attention.

      The authors did elegant studies to demonstrate the relevance of miR16. This is not part of the standard of care, but the prognostic tool of the selected 4 genes, could be very helpful. I wish they could have included a sentence on the impact in the field.

      The response to the following questions can make the manuscript more robust: Description of the TCGA - how many of the primary tumors had clinically detectable metastases? This is important as you are describing a potential companion diagnostic testing to predict OS. We need additional information on the UM cells, which can be found in literature. It is necessary for the audience to understand which of these cell lines come from patients that die from metastases, which ones had additional malignancies. There are UM cell lines - commercially available ? for which the primary and metastatic line were developed from the same patient. Those are very helpful, especially if one of the objectives is demonstrating poor OS due to metastatic disease. That was not clear from the manuscript. Levels of miR16 - What were the copy numbers in healthy patients? Do we need them relative to 501 Mel? or should we compare to a system that is not dysregulated? Why were the DROSHA KO be lung cancer cells? HCT116? Why choosing this cell line?

      Overall, I consider this a good manuscript and with some tweaking it can be better.

      Referees cross-commenting

      Based on the different reviews and the added note, we all agree the manuscript needs work and is not ready yet to be accepted. We all agreed that needs to be re-structured as we all mentioned about key missing pieces in the writing. It will be of help to the authors to go back to the guidelines to know the max words and extend their manuscript.

      The timing needed is might not be too relevant, as we all agreed it needs work.

      Thank you to my reviewers/colleagues as they pointed out things very comprehensively.

      Agreed with their comments.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This is an extremely short report that identified a potentially new mechanism as to how miR-16 may be involved in uveal melanoma (UM). In this report the authors used previous data, that identified that miR-16 is involved in UM, to gain a more comprehensive understanding of the mechanism involved. miR-16 was selected as a candidate due to its location in chromosome 3, which UM patients of the have chromosome 3 monosomy. The group evaluated the transcriptome following miR-16 overexpression in a single cell line, and as others often find with miRNAs, there was a cohort of up- and down-regulated RNAs. Figure 1 is a summary of the workflow, indicating the number of up- and down-regulated RNAs with validations performed. In figure 2 the authors identified 2 cohorts based on the previous expression data, one of which defines a more at-risk group. While the date presented in Figures 1 and 2 is interesting overall, the conclusions are mostly drawn from a supplemental figure. The data in this manuscript needs to be revaluated and the most critical included in the main figures. If done soe, and rewritten appropriately this study could have a bigger impact. As is, this study would be of low to moderate interest. There are also some instances, early on, where conclusions are overstated. Overall the text is lacking in description to conduct a thorough review, the authors fail to provide an introduction to put the study in the context of the field, and they provide a one-sentence discussion. Some of these issues are defined below, but due to the lack of description of the studies, and overstatements made, this is not a thorough review.

      Major issues:

      The main text and Figure 1 inappropriately referred to upregulated RNAs as miR-16 sponges. At this point in the manuscript, these are nothing more that upregulated RNAs.

      The text is often vague and lacks discretion that is essential for the Reviewer to understand the study. Some sentences are fragments and are not clear. Some (but not all) examples of difficulties encountered while attempting the review are indicated below as well.

      Figure 1E legend. "in function of the experimental workflow detailed". "In function" does not make sense in this sentence. It is not clear what the authors are referring to. Similarly, "In function of their expression...." In function aging is inappropriate and the sentence is not clear. Also "MRE for miRNA response element". Perhaps the authors mean "MRE = miRNA response element." Also, the text "10arbouring" is included int this legend. In brief, Figure 1E is extremely difficult to evaluate with the poor text in the legend.

      Multiple figures have odd wording to indicate biological replicates. This needs to be clarified in better, complete sentences.

      For Figure S2, Fold induction is indicated as a %. This is not appropriate. It is either a fold change or a change in %, but not both.

      How is the experiment done in Figure S3A a "kinetic" experiment? There is no kinetic analysis here.

      Details of the critical biotinylating study are completely lacking. And since this is the most critical experiment that gets to the main point of the study, this should be well defined and part of the main figures in the manuscript. It should not be in the supplemental information. All of Figure S2 should be in the main part of the manuscript.

      All axis in Figure S2B are not labeled appropriately. Enrichment is used, however not all RNAs are enriched. Perhaps fold change would be a better and more accurate name. Those on the left side (in blue) are depleted, not enriched.

      For Figure S3C, the authors should change "blue ones" to "solid blue bars indicate". Same for S3E

      For Figure S3D, "logo" is inappropriate and not the correct term. This is a consensus sequence, not a logo.

      How did the authors make the conclusion that the upregulated RNAs are targets of miR-16 if they do not have a canonical miR-16 binding sites? They could easily be indirect RNAs that are elevated post miR-16 exposure. The authors do not validate that the cohort of RNAs upregulated are indeed miR-16 targets. Thus, the overstatement of "sponge" RNAs (Figure 1H) or even "target" RNAs (Figure 1F) without appropriate validation is overstated. Simply doing the biotinylating study is still not enough to conclude direct interactions of these RNAs with miR-16. These can be false positives, that are not well controlled for due to poor selection of a control RNA.

      Figure 1G is not large enough to see, nor is the inclusion of it clear in the text of the manuscript.

      Figure 1H, referring to upregulated RNAs, post miR-16 expression as sponges is inappropriate unless they have all been validated as miR-16 sponges. These could merely be RNAs that are indirectly upregulated following miR-16 transfection, and their upregulating following miR-16 overexpression has been validated. However their miR-16 sponging activity has not been validated. Similarly for Figure 2A.

      Figure 2 is poorly defined. This needs clarification and the font should be increased. There are also "..." in the figure legend which is inappropriate. Many things are not defined such as "CN" which the reviewer is assuming means copy number. Also the colors and description for "Yes/Dead/Male" are not clear. What are these? How are they relevant?

      For Figure 2B legend, what is meant by miR-16 "in function"?

      The authors should show the level of upregulation of miR-16 following transfection for all experiments where miR-16 is transfected.

      For all figures where qRT-PCR was conducted, what are RNAs normalized to. This should be indicated on the axis and/or in the figure legend. While in the methods section, this should also be present in the main body of text (ie. figures).

      For Sup Figure 2A the authors indicate that RNA levels were compared to 501Mel. They should show the 501mel levels in the same graph. They also state that the absolutely copy number was determined from Norther Blot. As the authors likely know, quantification using qRT-PCR is much more quantitative than Northern. They should conduct qRT-PCR for the main cell line they are comparing to. The Northern is also not shown and the reference provided for it is for a Nature Review article, not for a study that shows a Northern blot.

      It is not clear what the control RNA was for all the studies. Specifically for the biotinylated studies, the authors should use another miRNA, not a non-specific control. Because a control miRNA will also binds AGO and other miRNA-associated factors, non-specific binding due to these factors could be better controlled for. The non-specific RNA will not account for these factors.

      Sup Fig 4 is missing details. What orientation is the consensus sequence shown in relative to the miRNA (5'-3' or 3'-5')? Other details are missing as well, this is just one example of many issues.

      For Sup Fig 5A the CT values should be included. That gives a better direct comparison than a graph of something that is indicated as not determined. You cannot graph something that is not determined.

      For Sup Fig 5C the font cannot be read it is too small.

      For Sup Fig 5D, again, how much miR-16 is present when overexpressed. Would this amount be physiologically achievable?

      The title is poor and not descriptive enough for the study. It reads more like the title for a review article.

      Methods for siRNAs indicated kinetic as well. Not clear what kinetic data were acquired during this study.

      Significance

      Referees cross-commenting

      I am in full agreement with the additional comments made by Reviewer #1. I however disagree with Reviewer #3 that the study was well conducted and "elegant". Based on multiple issues (many cooperated) between R#1 and R#2 I do not feel that this study is acceptable and will take an extraordinary amount of time to be acceptable for publication.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The article presents interesting data regarding the role of miR-16 in the development and progression of uveal melanoma. The authors propose the analysis of miR-16 activity as a marker for uveal melanoma progression instead of miR-16 expression. To analyze the activity of miR-16 they propose a risk model developed around a 4 genes signature that was shown to be able to stratify uveal melanoma patients into low and high-risk groups. Unfortunately, the current version of the manuscript is hard to understand and to carefully evaluate due to a lack of structure and clear presentation of the experimental design, methods, results, and discussions.

      Major Comments:

      • The manuscript needs to be restructured into clear sections including a detailed introduction, materials and methods, results, discussions, and conclusion. The present format is hard to follow with information and results being spread across multiple documents and parts of the manuscript.
      • The authors mention that the level of miR-16 reached after transfection is higher than that in the physio-pathological level, but there is no data to support this. I recommend adding a quantification of miR-16 levels achieved after transfection as part of the S2.
      • The way of using references is confusing as it is not clear what was done in the results section and what work is cited from the literature. The results section should focus strictly on presenting the results achieved by the experiments while delineating clearly the work that was done in other articles.
      • The discussion section needs to be extended to better present the role of specific investigated genes and proteins like PYGB and PTP4A3 in the development and progression of uveal melanoma.
      • The experiments analyzing the sequestration of miR-16 at non-canonical sites are performed using the cell line HCT116 WT and DROSHA KO. HCT116 is a human colon adenocarcinoma cell line, a tumor with a completely different histology. These experiments should be performed also on a human uveal melanoma cell line in order to ensure consistency of the results.

      Minor Comments:

      • The manuscript is lacking consistency regarding the usage of abbreviations. These should be defined the first time when used in the text.
      • The figure S2 needs to be adjusted. It is hard to understand in S2B how the statistical analysis was performed. I recommend representing each line with the two conditions side by side to increase clarity.
      • When presenting the miR-16 interactome the data are spread in three different sources Figures S2, 1D-E, and Table S1 which makes it difficult to follow the images. I recommend presenting these data in the same figure.
      • The manuscript requires English grammar and style editing. There are several words misspelled and phrases with a complicated syntax that makes it difficult to understand.
      • The method of presenting the data in Figures 1 F and H needs to be reorganized. The current version makes it very hard to understand how the gene expression changed after miR-16 exposure.

      Significance

      The article presents important results regarding the role of miR-16 in uveal melanoma by an innovative approach analyzing the activity of miR-16 instead of its expression. The authors focus on the relationship between miR-16 sponges and targets and through a set of elegantly designed experiments they identify a set of 4 genes whose expression can be used as a risk predictor model in uveal melanoma. The audience of this article can be represented by both clinicians and researchers that could take advantage of the results presented. Also, the approach of the article can open new directions for other cancers and miRNAs.

      Referees cross-commenting

      Would go with a revision of the manuscript according to the comments

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2021-01016

      Corresponding author(s): Dennis Klug

      1. General Statements [optional]

      Dear editor, dear reviewers,

      thank you very much for the quick review of our manuscript as well as for the constructive criticism and the interesting discussion of our results. Reading the comments, we realized that we may have put too much emphasis on the in vivo microscopy of sporozoites and their interaction with the salivary gland. We believe that the generated mosquito lines can be used to address different scientific questions, the in vivo microscopy of host-pathogen interactions being only one of them. Because of this imbalance, and to address some of the reviewers' comments, we have partially rewritten the manuscript (particularly the introduction). At the same time, we have implemented additional data on the inducibility of the promoters used, as well as on the functionality of hGrx1-roGFP2 in the salivary glands. Furthermore, we created an additional figure to better present the expression patterns of trio and saglin promoters within the median lobe, and we expanded the section on in vivo microscopy of sporozoites. We hope that these results further highlight the significance of our study. Accordingly, we have also changed the title of the manuscript to „A toolbox of engineered mosquito lines to study salivary gland biology and malaria transmission” to indicate the broad applicability of the generated mosquito lines and we have included an additional co-author, Raquel Mela-Lopez, who conducted the redox analysis. We hope that these changes will adequately answer the questions of the reviewers and address any concerns they may have had. We look forward to hearing from you.

      With our kind regards,

      Dennis Klug

      Katharina Arnold

      Raquel Mela-Lopez

      Eric Marois

      Stéphanie Blandin

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Summary**

      This manuscript reports the generation and characterization of transgenic lines in the African malaria mosquito Anopheles coluzzii that express fluorescent proteins in the salivary glands, and their potential use for in vivo imaging of Plasmodium sporozoites. The authors tested three salivary gland-specific promoters from the genes encoding anopheline antiplatelet protein (AAPP), the triple functional domain protein (TRIO) and saglin (SAG), to drive expression of DsRed and roGFP2 fluorescent reporters. The authors also generated a SAG knockout line where SAG open reading frame was replaced by GFP. The reporter expression pattern revealed lobe-specific activity of the promoters within the salivary glands, restricted either to the distal lobes (aapp) or the middle lobe (trio and sag). One of the lines, expressing hGrx1-roGFP2 under control of aapp promoter, displayed abnormal morphology of the salivary glands, while other lines looked normal. The data show that expression of fluorescent reporters does not impair Plasmodium berghei development in the mosquito, with oocyst densities and salivary gland sporozoite numbers not different from wild type mosquitoes. Salivary gland reporter lines were crossed with a pigmentation deficient yellow(-) mosquito line to provide proof of concept of in vivo imaging of GFP-expressing P. berghei sporozoites in live infected mosquitoes.

      **Major comments**

      Overall the manuscript is very well written with a clear narrative. The data are very well presented. The generation of the transgenic mosquito lines is elegant and state-of-the art, and the new reporter lines are thoroughly characterized.

      This is a nice piece of work that is suitable for publication, although the in vivo imaging of sporozoites is somewhat preliminary and would benefit from additional experiments to increase the study impact.

      We would like to thank the reviewer for his/her appreciation of our manuscript. In the revised version, we have included additional experiments on in vivo imaging of sporozoites, which allowed us to quantify moving and non-moving sporozoites imaged under the cuticle of live mosquitoes. Although this is still a proof of concept, we believe that these new data provide novel interesting data and will better illustrate potential applications.

      The reporter mosquito lines express fluorescent salivary gland lobes, yet the authors only provide imaging of parasites outside the glands. It would be relevant to provide images of the parasite inside the fluorescent glands.

      We have now included images showing sporozoites inside the salivary glands in vivo in Figure 8C and discuss possible ways to further improve resolution and efficiency of the imaging procedure in lines 563-586.

      The advantage of the pigmentation-deficient line over simple reporter lines is not clear, essentially due to the background GFP fluorescent in figure 5C. Imaging of GFP-expressing parasites should be performed in mosquitoes after excision of the GFP cassette under control of the 3xP3 promoter. This would probably allow to document the value of the reporter lines more convincingly.

      Indeed, by incorporating two Lox sites in the transgenesis cassette, we designed the yellow(-)KI line to permit removal of the fluorescent cassette and completely exclude expression of the transgenesis reporter EGFP. Still, EGFP expression in the yellow(-)KI adults is restricted to the eye and ovary, as we show now in Figure 7 supplement 1D. In contrast, no EGFP fluorescence was observed in the thorax area (Figure 7 supplement 1D). Therefore, we believe that the benefit of removing the fluorescence cassette for this study is limited. Moreover, the generation of such a line would take at least 3-4 months before experiments could be performed. Nevertheless, we agree with the reviewer that removal of the fluorescence cassette would be instrumental for follow-up studies. To draw the reader's attention to this issue, we now discuss background fluorescence in lines 378-387.

      Along the same line, it is unclear if the DsRed spillover signal in the GFP channel is inherent to the high expression level or to a non-optimal microscope setting. This is a limitation for the use of the reporter lines to image GFP-expressing parasites.

      We have discussed this problem with the head of the imaging platform at our institute, and we believe that it is not a problem that occurs due to incorrect settings. Rather, it seems to be due to the significant expression differences of the two fluorescence reporters used. We agree with the reviewer that this is a limitation and discuss the problem now in lines 416-412 and 565-567.

      The authors should fully exploit the SAG(-) line, which is knockout for saglin and provides a unique opportunity to determine the role of this protein during invasion of the salivary glands. This would considerably augment the impact of the study. In this regard, line 131 and Fig S3E: why is there persistence of a PCR band for non-excised in the sag(-)EX DNA?

      We definitely share the reviewer's enthusiasm about saglin and its role in parasite development in mosquitoes. We have thoroughly characterized the phenotype of sag(-) lines with respect to fitness and Plasmodium infection. These results are described in a spearate manuscript currently in peer review and available as a preprint on bioRxiv (https://doi.org/10.1101/2022.04.25.489337). Furthermore, in the revised manuscript, we have included additional data on the transcriptional activity of the saglin promoter with respect to the onset of expression and blood meal inducibility (Figure 2). In addition, we have included a completely new Figure 3 to highlight the spatial differences in transcriptional activity of the saglin promoter compared with the trio promoter. These new data are commented in lines 206-276.

      There might be a misunderstanding in the interpretation of the genotyping PCR. The PCR shown in Figure 1 – figure supplement 3, displays PCR products for different genomic DNAs (sag(-)EX, sag(-)KI and wild type) using the same primer pair. „Excised“ refers to sag(-)EX while „non excised“ refers to sag(-)KI and „control“ to wild type. Primers were chosen in a way to yield a PCR product as long as the transgene has integrated, only the shift in size between „excised“ and „non excised“ indicates the loss of the 3xP3-lox fragment. We have now changed the labeling of the respective gel in Figure 1 – figure supplement 3 to make this clearer.

      Did the authors search for alternative integration of the construct to explain the trioDsRed variability?

      We validated trio-DsRed cassette insertion in the X1 locus by PCR. The only way to rule out an additional integration of the transgene would be whole genome sequencing, which we did not perform. Still, we believe that the observed expression patterns are due to locus-specific effects of the X1 locus. Indeed, several lines of evidence point in this direction: (1) transgenesis was realized using the phage Φ31 integrase that promotes site-specific integration (attP is 38bp long and very unlikely to occur as such in the mosquito genome) and for which we never detected insertion in other sites in the genome for other constructs inserted in X1 and other docking lines; (2) additional unlinked insertions would have been easily detected during the first backcrosses to WT mosquitoes we perform in order to isolate the transgenic line and homozygotise it; (3) we have often observed variegated expression patterns for other transgenes located in the X1 locus in the past, leading us to believe that this locus is subjected to variegation influencing the expression of the inserted promoters. Usually, the variation we observe is simpler (e.g. strong and weak expression of the fluorescent reporter placed under the control of the 3xP3 promoter in the same tissues where it is normally expressed), but some promoters are more sensitive to nearby genomic environment than others, which we believe is the case for trio. Finally, should there be additional insertions of the transgenesis cassette in the genome, they should all be linked to the X1 locus as we would otherwise have detected them in the first crosses as mentioned above, which is unlikely. Thus, although very unlikely, we cannot exclude a single additional and linked insertion possibly explaining the high/low DsRed patterns, but variegation would still be required to explain other patterns. We have mentioned this alternative explanation in the manuscript in lines 522-524.

      Line 254-255. Does the abnormal morphology of SG from aapp-hGrx1-roGFP2 result in reduced sporozoite transmission?

      This is an interesting question. For future experiments, it could indeed be important to test if the transmission of sporozoites by the generated salivary gland reporter lines is not impaired. However, the quantification of the number of sporozoites in aapp-hGrx1-roGFP2 expressing salivary glands did not reveal any significant differences from the wild type (Figure 5 – figure supplement 1B) and would definitely be sufficient to infect mice. As we have no evidence for reduced invasion of sporozoites in the salivary glands of aapp-hGrx1-roGFP2 and of the DsRed reporter lines, no good reason to believe that the expression of fluorescent proteins would interfere with parasite transmission, and as we produced these lines as tools to follow sporozoite interaction with salivary glands, we have not performed transmission experiments.

      Of note, we have now included images of highly infected salivary glands of all reporter lines in Figure 5 – figure supplement 1D to confirm that expression of the respective fluorescence reporter does not interfere with sporozoite invasion. Also we have not observed that sporozoites do not invade salivary gland areas displaying high levels of hGrx1-roGFP2.

      **Minor comments**

      -Line 51: sporogony rather than schizogony

      Schizogony was replaced with sporogony.

      -Line 56: sporozoites are not really deformable as they keep their shape during motility

      This sentence was removed.

      -In the result section, it is not clearly explained where constructs were integrated.

      We have now included the sentence „...with an attP site on chromosome 2L...“ (line 173) and the respective reference (PMID: 25869647) to give more information about the integration site.

      Line 106 and 434-435: for the non-expert reader, it is not clear what X1 refers to, strain or locus for integration?

      X1 refers to both, the locus and the docking line. We have rephrased the beginning of the result section (previously line 106) to give more information about the integration site as mentioned above.

      -Line 112-115: the rational of integrating GFP instead of SAG is not clearly explained here, but become clearer in the discussion (line

      We have slightly rephrased the sentence to better explain the reasoning for this procedure (lines 182-184).

      -Line 140: FigS2A instead of S3A

      This mistake was corrected in the revised manuscript.

      -Perhaps mention that GFP reporters (SG) might be useful to image RFP-expressing parasites.

      We have now included an image of the aapp-hGrx1-roGFP2 line infected with a mCherry expressing P. berghei strain in Fig. 7D.

      -Line 236: the authors cannot exclude integration of an additional copy (as mentioned in the discussion line 367-368).

      As discussed above, we removed „..as a single copy...“ and introduced the possibility of an additional integration linked to X1 (lines 522-524).

      -Line 257-258. The title of this section should be modified as SG invasion was not captured.

      The title was rephrased. It reads now „Salivary gland reporter lines as a tool to investigate sporozoite interactions with salivary glands” (line 356-357).

      -Line 287: remove "considerable number" since there is no quantification.

      This was removed. In addition, we included new data in this section of the manuscript and rephrased the results accordingly (lines 406-427).

      -Line 400-402: Klug and Frischknecht have shown that motility precedes egress from oocysts (PMID 28115054), so the statement should be modified.

      Thank you for this suggestion. The passage was modified accordingly.

      -Line 404: remove "significant number" since there is no quantification.

      This section was rephrased and the phrase "significant number" was removed (lines 406-427).

      -Line 497: typo "transgenesis"

      The typo was correct in the revised manuscript.

      -FigS1: add sag-DsRed in the title

      Thank you for spotting this inconsistency, we corrected this mistake (line 1134).

      -Stats: Mann Whitney is adequate for analysis in fig 2C but not 2B, where ANOVA should be used (more than 2 groups).

      We have performed now an one-way-ANOVA test and adapted figure and figure legend accordingly.

      Reviewer #1 (Significance (Required)):

      This work describes a technical advance that will mainly benefit researchers interested in vector-Plasmodium interactions. Invasion of salivary glands by Plasmodium sporozoites is an essential step for transmission of the malaria parasite, yet remains poorly understood as it is not easily accessible to experimentation. The development of transgenic mosquitoes expressing fluorescent salivary glands and with decreased pigmentation provides novel tools to allow for the first time in vivo imaging in live mosquitos of the interactions between sporozoites and salivary glands.

      Reviewer's expertise: malaria, Plasmodium berghei, genetic manipulation, host-parasite interactions

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The first achievements of the Klug et al. study are the (i) genetical engineering of the Anopheles coluzzii mosquitoes reared in insectarium, that stably express distinct fluorescent reporters (DsRed and hGrx1-roGFP2 and EGFP) under the putative "promoters" of genes reported to encode proteins expressed differentially in the pluri-lobal salivary glands(Sg) of anthropophilic blood-feeding adult females, (ii) the analysis of the promoter activity - based on the selected fluorescent reporter - with a primary focus on the salivary gland/Sg (including at the Sg lobe level) of the adult female but also considering the preimaginal developmental time with larvae and pupa samples. Of note, some data confirm the already reported time-dependent and blood meal-dependent promoter activity for the related Anopheles species. The last part presents preliminary dataset on live imaging of Plasmodium berghei sporozoites with the aim of highlighting the usefulness of these A. coluzzii transgenic

      lines to better understand how the rodent Plasmodium sporozoites first colonize and then settle as packed cells in Sg acinar host cells.

      **Major comments**

      The two first objectives presented by the authors have been convincingly achieved with (i) the challenging production of four different lines expressing different single or double reporters chosen by the authors (and appropriately presented in the result text and figure sections), (ii) the careful analysis of the spatiotemporal expression of the DsRed reporter under two "promoters" studied and with regards to the blood feeding event parameter. However, if the reason why the authors have put so much effort in the production of their transgenic mosquitoes is (and as mentioned) to provide a significant improved setting enabling the behavioral analysis of sporozoites upon colonization and survival in the Sg, it seems this part is kind of limited. Likely in relation with this perception is the fact I found the introductory section often confusing and not enough direct to the points: in particular distinguishing the rationale from the necessity to produce appropriate models, and clarifying what is/are the added value(s) offered by these new transgenic lines models when compared to what exist (in Anopheles stephensi) with specific evidence that argue for this knowledge gain. At this stage, it is unfortunately not clear to me, what is the bonus of imaging the Plasmodium fluorescent sporozoites in hosts with fluorescent salivary gland lobes if one can not monitor key events of the Sg-sporozoite interaction that were not reachable without the fluorescent mosquito lines. Furthermore, it should be better explained why the rodent Plasmodium species has been chosen rather Plasmodium falciparum (or other human species) for which A. coluzzii is a natural host; may be just mentioning that this study would serve as a proof of concept but bringing real biological insights would be fine.

      We would like to thank the reviewer for his/her evaluation of our manuscript, which has helped us clarify our manuscript on several points. Our goal here was a proof of concept demonstrating potential applications for the fluorescent salivary gland reporter lines and for the low pigmented yellow(-) line we generated. In vivo imaging of sporozoites in salivary glands is one possible application that we intended to use as proof-of-concept, but we tailored the manuscript too restrictively with this aim in mind and neglected other applications as well as characterization of the biology of salivary glands in general. To improve this, we have included further data on the blood inducibility of the promoters tested (Figure 2), the functionality of roGFP2 in the salivary glands (Figure 5), and the use of the generated lines in the examination and definition of expression patterns of salivary gland proteins in vivo (Figure 6). Accordingly, we have adjusted the entire manuscript to adequately describe all the results presented. We have also rephrased major parts of the abstract and the introduction to better describe the impact of salivary gland biology on the transmission of pathogens, and to explain the anatomy of salivary glands in more detail.

      We agree with the reviewer that it would be desirable to show direct salivary gland-sporozoite interactions in vivo. Still we believe that having mosquito lines expressing a fluorescent marker in the salivary gland as well as weakly pigmented mosquitoes are a first step to make this visualization possible, although we cannot provide a lot of quantitative data about this interaction yet.

      1- The three genes and gene products selected by the authors should definitively be more systematically explained, which means for example the authors need to introduce the different mosquito species and the parasite-mosquito host pairs they are then referring to for the promoter/encoded proteins of their interest. In the same vein, I did not find any information as to the choice of the mosquito species (A. Coluzzii) for the current work. I was curious to know what is the advantage since better knowledge was available with Anopheles stephensi with respect to (i) Saglin and its promotor activity, (ii) aap driven dsRed expression (lines already existing) and (iii) sporozoite-gland interaction.

      We have largely reworded the introduction to clarify the rationale for selecting these three promoters while providing a better understanding of salivary gland biology in general.

      The choice of the mosquito species depends, in our opinion, strongly on the perspective and on the experiments to be performed. We agree with the reviewer that the malaria mosquito A. stephensi is a widely used model, based on its robustness in breeding and its high susceptibility to P. berghei and P. falciparum infections. However, in these cases, both vector-parasite pairs are to some extend artificial. Indeed, although it is also a vector of P. falciparum in some regions, A. stephensi mostly transmits P. vivax that cannot be cultured in vitro. Thus research efforts on this vector-parasite pair is limited. Also, due to the emerging number of observed differences between Anopheles species and their susceptibility to Plasmodium infection and transmission, more research has recently been conducted on African mosquito species. This effect is also reinforced by the fact that P. falciparum, unlike all other Plasmodium species infecting humans, causes the most deaths, making control strategies for species from the A. gambiae complex such as A. coluzzii particularly important. As a result, the number of available genetic tools in A. coluzzi/gambiae has overpaced A. stephensi. These include mosquito lines with germline-specific expression of Cas9 for site-directed transgenesis, lines expressing Cre for lox-mediated recombination, and several docking lines. Such tools are, as far as we know, not available in A. stephensi and were essential in reaching our objectives. Docking lines are of particular interest because they allow reliable integration into a characterized locus, which is an advantage over random transposon-mediated integration. Random insertion sites have generally not been characterized in the past, which can cause problems since integrations regularly occur in coding sequences. Docking lines also enable comparison of different transgenes as they are all integrated in the same genetic environment, which does not ensure some expression variation as illustrated in our manuscript. For all these reasons, we have thus chosen to work with A. coluzzii.

      Concerning the use of the murine malaria parasite P. berghei instead of the human one P. falciparum, there are two reasons that motivated our choice. (1) For in vivo imaging of sporozoites, we needed a parasite line that is strongly fluorescent at this stage, and there is no such line existing for P. falciparum. Actually, there is no fluorescent P. falciparum line able to efficiently infect A. coluzzii reported thus far, as reporter genes have all been inserted in the Pfs47 locus that is required by P. falciparum for A. coluzzii colonization. (2) Imaging P. falciparum infected mosquitoes, especially with sporozoites in their salivary glands, requires to have access to a confocal microscope in a biosafety level 3 laboratory. Hence our objective here was indeed to provide a proof of principle of in vivo imaging of sporozoites in the vicinity or inside salivary glands using our engineered mosquitoes, and to provide a first analysis of this process using P. berghei as a model of infection. Nevertheless, we agree with the reviewer that the goal should be to work as close as possible to the human pathogen.

      Despite the wide range of topics that this study touches on, we want to try and keep the manuscript as concise as possible. Therefore, we have not discussed the advantages and disadvantages of the different vector-parasite pairs and ask the reviewer to indulge us in this.

      2- To help clarifying the added value of the present study, introducing the species names of the mosquito and the Plasmodium that serve as a model would be appreciated.

      We have included now the name of the used Plasmodium species in line 361. At this position we also give now more details about the transgene this line is carrying. We mention the used mosquito species A. coluzzii now at different positions in the manuscript (e.g. lines 52, 162 and 177).

      3- Since a focus is the salivary gland of the blood feeding female Anopheles sp., a rapid description of the glands with different lobes and subdomains the results and figure 1 nicely refer to, would help in the introduction.

      We explain now the anatomy of female and male mosquito salivary glands in the introduction (lines 119-123). The different lobes are now also indicated in the salivary gland images shown in several figures including Figure 1.

      4- That description could logically introduce the few proteins actually identified with lobe specific or cell domain specific expression (apical versus basal side, intracellular or surface expose, vacuole, duct...) profiles. The context with regards to sporozoite biology would then easily validate the "promoter choice". As a minor remark, I miss the reason why the authors wrote " the astonishing degree of order of the structures (referring to the packing of sporozoites within the Sg acinars) raise the question whether sporozoite can recognize each other". Please clarify since packing/accumulation can be passive due to cell mechanical constraints and explain what this point has to see with the question and experimental work proposed here?)

      We thank you for this suggestion. We have reworded key parts of the introduction to make the reasons for using the three selected promoters clearer. We also mention now other proteins expressed in the salivary glands which have been characterized in more detail because of their effect on blood homeostasis (e.g. anticoagulants) (lines 136-139).

      The mention of stack formation of salivary gland sporozoites served only to clarify that almost nothing is known about the behavior of sporozoites within the salivary glands in vivo to explain why new methods are needed to make these processes visible. We have now reworded this passage to make this clearer, and we also mention that stack formation could also occur due to mechanical constraints, as suggested by the reviewer (lines 101-102, 106-110).

      5- The selection of hGrx1-roGFP2 is quite interesting and justified but there is then no use of this reporter property in the preliminary characterization of the Sg and Sg-sporozoite interaction. Could the authors provide such characterization?

      We have now implemented data testing the functionality of hGrx1-roGFP2 in the salivary glands. We also show qualitatively that the redox state of glutathione does not change upon infection with P. berghei sporozoites (Figure 6). We now describe and discuss these new data in lines 337-354.

      6- Figure 1: it would be nice to add in the legend at what time the dissection/imaging has been made (age, blood feeding timing?). I would also omit the double mutant trio-Dsred/aapDsred in the main figure (may be supplemental) since the two single mutants Dsred separately together with the double mutant (with different fluorescence) already provide the information. I would suggest to regroup the phenotypic presentation of the transgenic line made in the KI mosquitoes (current figure 5) in the main figure 1.

      We have now added the missing information about the age of dissected mosquitoes and their feeding status in the legend of Figure 1. We also thank the reviewer for the suggestion to replace one image displaying aapp and trio promoter activity in trans-heterozygous mosquitoes with an image of the pigment deficient mutant yellow(-)KI. Still, due to the changes made to the manuscript based on the reviewers comments in general, we have now implemented new data highlighting the functionality of the generated salivary gland reporter lines investigating the redox state of glutathione as well as the expression pattern of the saglin and trio promoters at the single cell level (see Figure 3 and 6). Therefore it would no longer seem logical to introduce the yellow(-)KI mutant in Figure 1 while further data on this mutant are provided in the last two figures of the manuscript and discussed later in the manuscript (Figure 7 and 8). In addition we believe that co-expression of different transgenes (carrying fluorescent reporters) in the median and the distal lobes could potentially be interesting for certain applications. We believe that readers who might actually be interested in combining both transgenes in a cross would like to see the outcome to better evaluate the usefulness before experiments are planned and performed. This is especially true because localization as well as expression strength may differ between different fluorescence reporters while using the same promoter (e.g. the hGrx1-roGFP2 construct appears less bright and more localized to the apex of the distal-lateral lobes than dsRed, while expression of both reporters is driven by the aapp promoter in aapp-hGrx1-roGFP2 and aapp-DsRed, respectively).

      7- Figure 2:

      1. a) Is there anything known on the Sgs' size change overtime. It seems that between day 1 and 2 there is an increase of size and volume as much as I can evaluate the volume (Fig S4). Could that mean that there is increase in cell number in the lobes and therefore more cells expressing the transgene which would account for the signal intensity increase rather than more transcripts per cell? Thank you for this interesting question. The changes in the morphology of the salivary glands in Anopheles gambiae following eclosion have been studied in detail by Wells et al., 2017 (PMID: 28377572) which we cite now in the introduction (line 122-123). According to this reference, cell counts of the salivary gland are not changing upon emergence of the adult mosquito. However, we agree with the reviewer that the glands appear smaller and differ in morphology directly after eclosion. We noted that glands of freshly emerged females are more „fragile“ during dissections and lack secretory cavities, as reported by Wells et al., 2017. We believe that the increase in size occurs through the formation and filling of the secretory cavities which has been reported to take place within the first 4 days after emergence (Wells et al., 2017). This observation is in accordance with our observations that the promoters of the saliva proteins AAPP and Saglin display only weak activity after hatching, or, in the case of TRIO are not yet active directly after emergence. The timing of the formation of the secretory cavities is also in agreement with our time course experiment (Figure 2) which shows a strong increase in fluorescence intensity in dissected glands within the first 4 days after emergence.

      2. b) why choosing 24h after the blood meal to assess promoter activity in the Sgs? Do we have any information on how the blood meal impact on the Sgs'development. At this time anyway the sporozoites are far from being made. Yosshida and Watanabe 2006 mentioned at significant decrease of Sg proteins post-blood feeding. Could the authors detail their rationale based on what the questions they wish to address Thank you for this question. Unfortunately, the data available in the literature on this topic are very sparse, so we could only refer to few previous publications. The decision to quantify the fluorescence signals as early as 24 hours after blood feeding was based on Yoshida et al, Insect Mol. Biol, 2006, PMID: 16907827. The authors of this study generated the first salivary gland reporter line in A. stephensi by using the aapp promoter sequence to drive DsRed expression, and showed by qRT-PCR that DsRed transcripts increase 1-2 days after blood feeding compared to controls. Consistent with this observation and because we were concerned that putative changes in protein levels would only be visible for a short period of time, we began quantification one day after feeding. Since we observed significant changes in fluorescence intensity for the aapp-DsRed and sag(-)KI lines 24 hours after blood feeding, we retained the experimental setup and did not change it further. Nevertheless, we agree with the reviewer that different time points could help determine how long the effect lasts, and whether trio expression might also be regulated by blood feeding, but at a later time point. Still, our main objective here was to validate that the ectopic expression of DsRed driven by the aapp promoter in the aapp-DsRed line was indeed induced upon blood feeding as previously reported (PMID: 16907827). This experiment allowed us to confirm the inducibility of aapp in a different way and to show for the first time that saglin, but not trio, is induced one day after blood feeding. Our transgenic lines could be used for follow-up studies investigating the inducibility of salivary gland-specific promoters by different stimuli, or after infection with Plasmodium sporozoites. For example, for trio, transcription has been shown to increase after infection of the salivary gland by Plasmodium (PMID: 29649443).

      8- Figure 3: The figure is quite informative in terms of subcellular localization. Concerning the section "Natural variation of DsRed expression in trio-DsRed mosquitoes", I think it could be shortened because because it is a bit out of the focus the study.

      We agree with the reviewer that this part of the manuscript sticks a bit out and is not perfectly in line with the remaining results because it doesn’t deal with the salivary gland. Still, we would like to emphasise that in this work, we particularly want to show possible applications of the generated mosquito lines to address unanswered questions in host-parasite interactions and salivary gland biology. As a result, this manuscript establishes potentially important tools. For this reason, we feel it is important to mention the natural variation in DsRed expression, as this natural variation can have a significant impact on crossing schemes (especially with lines inheriting other DsRed-marked transgenes) and experiments (e.g. visualizing DsRed expression by western blot in larval and pupal stages). Furthermore, it is important for the use of the line to show that the transgene is inserted only once, at the expected location, which we try to emphasize with figure 4 – figure supplement 1 and figure 4 – figure supplement 2.

      We would also like to note that transgenesis in Anopheles is a relatively young field of research and altered expression patterns of ectopically used promoters have rarely been described so far, although this could have major implications e.g. in the case of gene drives. Therefore, we hope that the data shown will bring this previously neglected observation more into focus and highlight the importance of accurate characterization of generated transgenic mosquito lines.

      9- In contrast the last section of live imaging of P. berghei sporozoites in the vicinity and within salivary gland should be expanded. The 2 sentences summarizing the data are quite frustrating "We also observed single sporozoites moving actively through tissues in a back and forth gliding manner (Fig. 6B, Movie 3) or making contact with the salivary gland although no invasion event could be monitored"

      We have now implemented new data and extended Figure 8 showing the results of the in vivo imaging in a qualitative manner. We have rephrased the result and discussion section accordingly.

      10- I am aware of the technical difficulties to perform live imaging of sporozoite on whole mosquitoes, even when the salivary gland lobe under observation is closely apposed to the cuticle but that seems to be the final aim of the authors. I looked very carefully to the three movies and I am sorry but at this stage I could not make meaningful analysis out of them, and could not agree with the conclusions: for instances, the authors specify that sporozoites were undergoing back and forth movements (movie 3) but I do not see that and do not see the Sg contours in the available movies? The authors should also add bar and time scales to their movies. Having an in-depth description with regards to the sub-domain marked by a relevant reporter would strengthen the study, even if images are not collected in the whole mosquito to get higher resolution.

      We thank the reviewer for this comment. We have to admit that parasite imaging in fluorescent salivary glands in vivo is an ambitious goal given the complex biological system we are working with. We believe that the system presented in our manuscript is a first and important step to enable the analysis of the interaction of sporozoites with salivary glands, although in-depth analysis will require further optimization and considerable time, especially to generate quantitative data. Therefore, we now downstate the significance of our results in this respect and changed the title accordingly. Still, we also provide a more detailed analysis of the data we have already collected (Figure 8 and lines 406-427). Because we focus on the analysis of sporozoites in the thorax area in the revised manuscript, the outlines of the salivary gland are not necessarily visible in the images.

      I am not sure I understand the relevance of this quite condensed sentence in the text. Could the authors rephrase and expand if they wish to keep the issues they refer to. "The sporozoites' distinctive cell polarization and crescent shape, in combination with high motility, allows them to „drill" through tissues". I would stress more on the main unknown in terms of sporozoite-Sg interactions and the need to get right models for applying informative approaches (i.e. here, imaging).

      We thank you for this suggestion. The sentence mentioned has been removed in its entirety. We have also adjusted the text accordingly and reworded most of the introduction to make the narrative clearer (lines 91-119).

      Of note, it could help to point that the "Sgs is a niche in which the sporozoites which egress from the oocyst could mature and be fully competent when co-deposited with the saliva into the dermis of their intermediary hosts"

      We have now implemented a similar sentence in the introduction (lines 93-98).

      Reviewer #2 (Significance (Required)):

      1- Clear technical significance with the challenging molecular genetics achieved in the mosquito A. coluzzii.

      2- More limited biological significance: fair analysis and gain of knowledge of spatio-temporal of reporter expression under the selected promoter but limited significance of the final goal analysis which concerns the Plasmodium sporozoite biology once egressed from oocysts

      As stated above, we changed the title to place the focus on the engineered mosquito lines.

      3- Previous reports cited by the authors have used the DsRed reporter and the aap promoter in another Anopheles (i.e. A. stephensi, Yoshida and Watanabe, Insect Mol Biol, 2006; Wells and Andrew, 2019) which is also a natural host and vector for human Plasmodium spp.) with significantly more resolutive 3D visualization of GFP-fluorescent P. berghei but in dissected salivary glands and not in whole mosquitoes. The Wells and Andrew publication entitled "Salivary gland cellular architecture in the Asian malaria vector mosquito Anopheles stephensi" in Parasite Vectors, 2015 would deserve to be reference and described.

      Thank you very much for this suggestion. We considered citing Wells and Andrews (PMID: 26627194). However, this reference focuses very specifically on the subcellular localization of AAPP and shows only highly magnified sections of immunostained dissected and fixed salivary glands. Working only with the AAPP promoter, we felt it important to refer to the previously observed expression pattern along the entire salivary gland, as shown in Yoshida and Watanabe (PMID: 16907827). Nevertheless, we have cited two other publications by Wells and Andrews (PMID: 31387905 and 28377572) at various points in the manuscript.

      4- Audience: I would say that this work should be of interest of mostly scientists investigating Plasmodium biology (basic and field research) or in entomology of Diptera.

      5- To describe my fields of expertise, I can refer to my extensive initial training in entomology including at one point in the genetic basis of mosquito-virus interaction. I have also been working for more than 20 years in the field of Apicomplexa biology (Plasmodium and Toxoplasma) and I have long-standing interest in live and static high-resolution imaging.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Klug et al. generated salivary gland reporter lines in the African malaria mosquito Anopheles coluzzii using salivary gland-specific promoters of three genes. Lobe-specific reporter activity from these promoters was observed within the salivary glands, restricted either to the distal lobes or the medial lobe. They characterized localization, expression strength and onset of expression in four mosquito lines. They also investigated the possibility of influences of the expressed fluorescent reporters on infection with Plasmodium berghei and salivary gland morphology. Using crosses with a pigmentation deficient mosquito line, they demonstrated that their salivary gland reporter lines represent a valuable tool to study the process of salivary gland colonization by Plasmodium parasites in live mosquitoes. SG positioning close to the cuticle in 20% of females in this strain is another key finding of this study.

      The key findings from this study are largely quite convincing. The authors have created a suite of SG reporter strains using modern genetic techniques that aid in vivo imaging of Plasmodium sporozoites.

      Vesicular staining within salivary acinar cells should be stated as "vesicle-like" staining unless a co-stain experiment in fixed SGs is conducted using antisera against the marker protein(s) and antisera against a known vesicular marker (e.g. Rab11). It may also be possible to achieve this in vivo using perfusion of a lipid dye (e.g. Nile Red), but this is not necessary. As is, in Fig. 3A, there are images in which it appears that the vesicle-like staining is located both within acinar cells' cytoplasm and in the secretory cavities (e.g. Fig. 3A: aapp-DsRed bottom and middle), and this is fine, but should be more inclusively stated. Fixed staining of the reporter strain SGs would allow for clarification of this point. In previous work, other groups have observed vesicle-like structures in both locations (e.g. PMID: 33305876).

      Thank you very much for this suggestion. Indeed, when we observed the vesicle-like localization, we had similar ideas and considered investigating the identity of the observed particles in more detail. Ultimately, however, we concluded that the localization of DsRed does not play a critical role in the use of the lines as such and believe that a more detailed investigation of the trafficking of the fluorescent protein DsRed is beyond the scope of this study.

      We have thus followed the suggestion of the reviewer and now use the phrase „vesicle-like“ throughout the manuscript. In addition, we extended the discussion on the different localizations observed and presented some explanations that might have led to this observation. We also included a new reference that investigated the localization of AAPP using immunofluorescence (PMID: 28377572).

      Morphological variation is extensive among individual mosquito SGs, thought to impact infectivity, and well documented in the literature. The manuscript should be edited to make it much clearer (e.g. n = ?) exactly how many SGs, especially in microscopy experiments, were imaged before a "representative" image was selected from each data point and in any additional experiment types where this information is not already presented. Figure S8 is an example where this was done well. Figure 3A-B is an example where this was not well done. All substantial variation (e.g. "we detected a strangulation..." - line 189) across individual SGs within a data point should be noted in the Results. Because of the genetics and labor involved, acceptable sample sizes for minor conclusions may be small (5-10), but should be larger for major conclusions when possible.

      Thank you for this comment. We have improved this point by specifying precisely the number of samples and of repetitions in the respective figure legends. For example, we have now quantified the proportion of moving sporozoites and report both the number of sporozoites evaluated and the number of microscopy sessions required (see Figure 8).

      Thank you for this comment. We have improved this point by specifying precisely the number of samples and of repetitions in the respective figure legends. For example, we have now quantified the proportion of moving sporozoites and report both the number of sporozoites evaluated and the number of microscopy sessions required (see Figure 8). Regarding Figure 3, fluorescence expression and localization in salivary gland reporter lines was actually very uniform in each line. We added the following sentence in the legend of revised figures 3 and 5: “Between 54 and 71 images were acquired for each line in ≥3 independent preparation and imaging sessions. Representative images presented here were all acquired in the same session”.

      Sporozoite number within SGs has been shown to be quite variable across the infection timeline, by mosquito species, by parasite strain, in the wild vs. in the lab, and according to additional study conditions. The authors mention that the levels they observed are consistent with their prior studies and experience, but they did not utilize the reporter strains and in vivo imaging to support these conclusions, instead relying on dissected glands and a cell counter. It is important for these researchers to attempt to leverage their in vivo imaging of SG sporozoites for direct quantification, likely using the "Analyze Particles" function in Fiji. The added time investment for this additional analysis would be around two weeks for one person experienced in the use of the imaging software.

      Thank you for this interesting suggestion. Indeed, it would be beneficial to use an imaging based approach to quantify the sporozoite load inside the salivary glands. We already used „watershed segmentation“ in combination with the „Analyze Particles“ function in Fiji on images of infected midguts to determine oocyst numbers. Still, we believe this analysis cannot be applied to images of infected salivary glands mainly because of differences in shape and location of the oocyst and sporozoite stages. Sporozoites inside salivary glands form dense, often multi-layered stacks. Because of this close proximity, watershedding cannot resolve them as single particles which could subsequently be counted. This creates an unnecessary error by counting accumulations of sporozoites as one, likely leading to an underestimation of actual parasite numbers. Furthermore, given that the proximity issue could be resolved e.g. by performing infections yielding lower sporozoite densities, another problem would be that infected salivary glands prepared for imaging are often slightly damaged leading to a leak of sporozoites from the gland into the surrounding. These leaked sporozoites are likely not included on images which would then be used for analysis, potentially leading again to an underestimation of counts. Since these issues are circumvented by the use of a cell counter, we believe that this method is still the method of choice in acquiring sporozoite numbers.

      Nevertheless, we can understand the reviewer's concern that counts performed with a hemocytometer do not reflect the variability in the sporozoite load of individual mosquitoes. To highlight that all generated reporter lines can have high sporozoite counts, we have now included images of highly infected salivary glands for each line in Figure 7D.

      This manuscript is presented thoughtfully and such that the data and methods could likely be well-replicated, if desired, by other researchers with similar expertise.

      The statistical analysis is appropriate for the experiments conducted. It is currently unclear if some experiments were adequately replicated. That information should be added to the paper throughout where it is missing.

      We do appreciate your comments on our efforts to give all required information for other laboratories to replicate our experiments. We have added the missing information about the number of independent experiments in the respective figure legends wherever appropriate.

      Studies from multiple groups should be more thoroughly referenced when the authors are describing the "vesicle-like" staining patterns observed in SGs from reporter strains (e.g. Fig. 3A). Is this similar to the SG vesicle-like structures observed previously (e.g. PMIDs: 28377572, 33305876, and others)?

      Thank you for this comment. We did not discuss this observation in detail in the first version of our manuscript because the observed localization was rather unexpected, as DsRed was not fused to the AAPP leader/signal peptide. The observed localization is therefore difficult to explain, however, we have expanded the discussion on this (lines 465-482) and now cite one of the proposed references (PMID: 28377572, lines 468-469).

      There are minor grammar issues in the manuscript text (e.g. "Up to date" should be "To date"). The figures are primarily presented very clearly and accurately. One minor suggestion: In cases such as Fig. S2A images 3 and 6, where some of the staining labels are very difficult to read, please move all labels for the figure to boxes located directly above the image.

      We are sorry for the grammatical errors we have missed in the first version of our manuscript. We have now performed a grammar check over the whole manuscript. We have also increased the font size of the captions in the above figures and tried to make them better readable by moving the captions over the images.

      The data and conclusions are presented well.

      Reviewer #3 (Significance (Required)):

      This report represents a significant technical advance (improved in vivo reporter strain and sporozoite imaging), and a minor conceptual advance (active sporozoite active motility), for the field.

      This work builds off of previous SG live imaging studies involving Plasmodium-infected mosquitoes (e.g. Sinnis lab, Frischneckt lab, etc.), addressing one of the major challenges from these studies (reliable in vivo imaging inside mosquito SGs).

      This work will appeal to a relatively small audience of vector biology researchers with an interest in SGs. Many in the field still see the SGs as intractable, instead choosing to focus on the midgut due to ease of manipulation. Perhaps work like this will spark new interest in tangential research areas.

      I have sufficient expertise to evaluate the entirety of this manuscript. Some descriptors of my perspective include: bioinformatics, SG molecular biology, mosquito salivary glands, microscopy, RNA interference, SG infection, and SG cell biology.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Klug et al generated transgenic mosquito lines expressing fluorescent reporters regulated by salivary gland specific promoters and characterized fluorescent reporter expression level over the time, subcellular localization of fluorescent reporters, and impact on P. berghei oocyst and salivary gland sporozoite generation. In addition, by crossing one of the lines (aapp-DsRed) with yellow(-) KI mosquitoes, they open up the possibility to perform in vivo visualization of salivary glands and sporozoites.

      Overall the generation and characterization of these transgenic lines is well-done and will be helpful to the field. However, there are several concerns with the in vivo imaging data shown in Figure 6, which does not convincingly show fluorescent sporozoites in the lobe or secretory cavity of a fluorescent salivary gland lobe. This needs to be addressed. Points related to this concern are outlined below:

      (1) Although the authors mention that the DsRed signal was strong enough to see with GFP channel, it would be more appropriate to show that the DsRed signal from salivary glands and GFP channel image co-localize.

      We now show a merge of the GFP and DsRed signal in Figure 7 – figure supplement 2 The yellow appearance of the salivary gland in the merge likely indicates the spillover of the DsRed signal into the GFP channel. In addition we discuss the issue in lines 416-412 and 565-567.

      (2) Mosquitoes were pre-sorted using the GFP fluorescence of the sporozoites on day 17-21. From figure 4B, median salivary gland sporozoite number was about 10,000 sporozoites/mosquito on day 17-18. However, in Figure 6A there are no sporozoites in the secretory cavities. They should be able to see sporozoites in the cavities at this time. Can the authors confirm that they can visualize sporozoites in secretory cavities in vivo and perhaps show a picture of this.

      This is entirely correct. We also examined mosquitoes for the presence of sporozoites in the salivary glands and wing joints prior to imaging, as shown in Figure 7B and Figure 7 – figure supplement 2A, to increase the probability that sporozoites could be observed. Nevertheless, the area of the salivary gland that comes to the surface is often small and limited to a few cells that can be imaged with good resolution. Unfortunately, these same cells were often not infected although other regions of the salivary glands must have been very well infected based on the previously observed GFP screening (Figure 7B). In addition, with the confocal microscope available to us, we struggled to achieve the necessary depth to image sporozoites in the cavities of the salivary gland cells. For this reason, we were often able to detect a strong GFP signal in the background, but not always to resolve the sporozoites sufficiently well. Still, we have now included an image showing sporozoites in salivary glands (Figure 8C). However, we believe that the method can be further improved to be more efficient and provide better resolution. We discuss possible ways to further improve the imaging in lines 563-586.

      (3) There is no mention of the number of experiments performed (reproducibility) and no quantification of the imaging data. In the results (line 287-288), the authors state that sporozoites are present in tissue close to the gland and sometimes perform active movement. How can this be? Do they believe these sporozoites are on route to entering? More relevant to this study would be a demonstration that they can see sporozoites in the secretory cavities of the salivary gland epithelial cells, this should shown. If they have already performed a number of experiments, I would suggest to do quantification of the number of sporozoites observed in defined regions . The mention that sporozoites are moving is confounded by the flow of hemolymph. How do they know that the sporozoites are motile versus being carried by the hemolymph. Perhaps it's premature to jump to sporozoite motility in the mosquito when they haven't even shown sporozoite presence in the salivary glands.

      Thank you very much for this comment. We have followed the suggestions of the reviewer and have now quantified the behavior of sporozoites in the thorax area of the mosquito. For the analysis, we only considered sporozoites that could be observed for at least 5 minutes. This analysis revealed that 26% of persistent sporozoites performed active movements, which in most cases resembled patch gliding previously described in vitro. We adjusted the results section accordingly. In addition, we have changed the figure legend to accurately indicate the number of experiments performed. Likewise, we now also provide an image of sporozoites that we assume are located in the salivary gland (Figure 8C). Although we have not yet been able to image and quantify vector-sporozoite interactions extensively (further improvements would be required, as mentioned previously), we believe these results illustrate the potential of the transgenic lines.

      (4) In vivo imaging has been performed with the mosquito' sideways. Was this the best orientation? Have you tried other orientations like from the front (Figure 5B orientation).

      It is true that in the abdominal view as shown in Figure 7B the fluorescence in the salivary glands is very well visible. This is mainly due to the fact that in this area the cuticle is almost transparent and therefore serves as a kind of "window". Nevertheless, the salivary glands are not close to the cuticle in this position, which makes good confocal imaging impossible. Imaging always worked best where the salivary gland was very close to the cuticle, and this was always laterally. However, there were differences in the position of the salivary glands in individual mosquitoes, which also led to slight differences in the imaging angle.

      Overall, the text is easy to follow and I have only few suggestions.

      Thank you for this comment.

      In the result section, the authors describe the DsRed expression during development of mosquito (line 194-236) after they describe subcellular localization of fluorescent reporters. I felt the flow was disrupted. Thus, this part (line 194-236) could summarize and move to line 135. In this way, the result section flow according to the main figures.

      Thank you very much for this suggestion. We have considered your idea, but based on the changes we have made in response to reviewer comments and new data implemented in the form of two new figures, we believe the current order in the results section is more appropriate. The rationale was primarily to first characterize the expression of fluorescent reporters in the salivary glands of all lines before going into more detail on expression in other tissues of a single line. We then finish with potential applications like in vivo imaging of sporozoite interactions with salivary glands.

      Also, and as mentioned previously (reviewer 2, point 8), we believe it is important to describe the variability of ectopic promoter expression at a given locus with sufficient details, as this has not been characterized thus far despite its importance.

      In the result section, text line 186-190, the authors describe the morphological alternation of salivary gland in aapp-hGrx1-roGFP2. I would suggest to mention that this observation was only in one of lateral lobe. (I saw that it was mentioned in the figure legend but not in the main text.)

      We believe there has been a misunderstanding. The morphological alteration in salivary glands expressing aapp-hGrx1-roGFP2 was observed in all distal-lateral lobes to varying degrees (quantification in Figure 6E). To include as many salivary glands as possible in the quantification and because in some images only one distal-lateral lobe was in focus, only the diameter of one lobe per salivary gland was measured and evaluated. We have now revised the legend to prevent further misunderstandings.

      In the discussion section, author discuss localization of fluorescent reporters (line 322-331). When I looked at aapp-DsRed localization pattern (Figure 3A), the pattern looked similar to the previous publication by Wells et al 2017 (https://www.nature.com/articles/s41598-017-00672-0). This publication used AAPP antibody and stain together with other markers (Figure 4-7). This publication could be worth referring in the discussion section.

      Thank you for this suggestion. According to the information available through Vectorbase, we did not fuse DsRed with any coding sequence of AAPP that could potentially encode a trafficking signal. Therefore, it is rather unlikely that the observed DsRed localization in our aapp-DsRed line and the localization observed by AAPP immunofluorescence staining in WT mosquitoes match. This is further exemplified by the cytoplasmic localization of hGrx1-roGFP2 in the aapp-hGrx1-roGFP2 line, where the reporter gene was cloned under the control of the same promoter. For this reason, we had not mentioned this reference in the first version of the manuscript. In the revised manuscript, we have included now the suggested reference (lines: 475-476) and extended the discussion on possible reasons which led to the observed localization pattern.

      In the text, authors describe salivary gland lobes as distal lobes and middle lobe. It would be more accurate to refer to the lobes as the lateral and medial lobes. The lateral lobes can then be sub-divided into proximal and distal portions. I would suggest to use distal lateral lobes, proximal lateral lobes and median lobe as other references use (Wells M.B and Andrew D.J, 2019).

      Thank you for this suggestion. We have corrected the nomenclature for the description of the salivary gland anatomy as suggested throughout the manuscript.

      Overall, the figures are easy to understand and I have following suggestions and questions.

      Figure 1C) It is hard to see WT salivary gland median lobe. If authors have better image, please replace it so that it would be easier to compare WT and transgenic lines.

      We have replaced the wild-type images of salivary glands in this figure and labeled the median and distal-lateral lobes accordingly (see Figure 1).

      Figure 2) While it was interesting to observe the significant expression differences between day 3 and day 4, have you checked if this expression maintained over time or declines or increases (especially on day 17-21 when author perform in vivo imaging)?

      Thank you for this interesting question. We have not quantified fluorescence intensities in mosquitoes of higher age. Nevertheless, we regularly observed spillover of DsRed signaling to the GFP channel during sporozoite imaging, suggesting that expression levels, at least in aapp-DsRed expressing mosquitoes, remain high even in mosquitoes >20 days of age (see Figure 8A). We also confirmed this observation by dissecting salivary glands from old mosquitoes, whose distal lateral lobes always showed a strong pink coloration even in normal transmission light (data not shown).

      Figure 3A) There is no description of "Nuc" in figure legend. If "nuc" refers to nucleus, have you stained with nucleus staining dye (example, DAPI)?

      Thank you for spotting this missing information in the legend. Initial images shown in this figure were not stained with a nuclear dye. To test whether the observed GFP expression pattern really colocalizes with DNA, we performed further experiments in which salivary glands from both aapp-hGrx1-roGFP2 and sag(-)KI mosquitoes were stained with Hoechst. We have now included these new data in Figure 3 - figure supplement 1. It appears that GFP is concentrated around the nuclei of the acinar cells, which makes the nuclei clearly visible even without DNA staining.

      Figure 4B) The number of biological replicates in the figure and the legend do not match (In the figure, there are 3-5 data points and, in the legend, text says 3 biological replicates.)

      Thank you for spotting this inconsistency. The number of biological replicates refers to the number of mosquito generations used for experiments. The difference is due to the fact that sometimes two experiments were performed with the same generation of mosquitoes using two different infected mice. We have clarified the legend accordingly to avoid misunderstandings.

      Figure 4C) The number of data points from (B) is 5. However, in (C) only 4 data points are presented.

      We have corrected this mistake. In the previous version, the results of two technical replicates were inadvertently plotted separately in (B) instead of the mean.

      Figure 5) I would suggest to have thorax image of P. berghei infected mosquito to show both salivary glands and parasites.

      Thank you for this suggestion. Images in Figure 7B (previously Figure 5) were replaced with an infected specimen to show salivary glands (DsRed) and sporozoites (GFP) together.

      Reviewer #4 (Significance (Required)):

      The transgenic lines that authors created have potential for in vivo imaging of salivary gland and sporozoite interactions. Since the aapp and trio lines have distinct fluorescence expression, they could help elucidate why sporozoites are more likely to invade distal lateral lobes compare to median lobe.

      My areas of expertise are confocal microscope imaging, mosquito salivary gland and Plasmodium infection and sporozoite motility.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Klug et al generated transgenic mosquito lines expressing fluorescent reporters regulated by salivary gland specific promoters and characterized fluorescent reporter expression level over the time, subcellular localization of fluorescent reporters, and impact on P. berghei oocyst and salivary gland sporozoite generation. In addition, by crossing one of the lines (aapp-DsRed) with yellow(-) KI mosquitoes, they open up the possibility to perform in vivo visualization of salivary glands and sporozoites.

      Overall the generation and characterization of these transgenic lines is well-done and will be helpful to the field. However, there are several concerns with the in vivo imaging data shown in Figure 6, which does not convincingly show fluorescent sporozoites in the lobe or secretory cavity of a fluorescent salivary gland lobe. This needs to be addressed. Points related to this concern are outlined below:

      (1) Although the authors mention that the DsRed signal was strong enough to see with GFP channel, it would be more appropriate to show that the DsRed signal from salivary glands and GFP channel image co-localize.

      (2) Mosquitoes were pre-sorted using the GFP fluorescence of the sporozoites on day 17-21. From figure 4B, median salivary gland sporozoite number was about 10,000 sporozoites/mosquito on day 17-18. However, in Figure 6A there are no sporozoites in the secretory cavities. They should be able to see sporozoites in the cavities at this time. Can the authors confirm that they can visualize sporozoites in secretory cavities in vivo and perhaps show a picture of this.

      (3) There is no mention of the number of experiments performed (reproducibility) and no quantification of the imaging data. In the results (line 287-288), the authors state that sporozoites are present in tissue close to the gland and sometimes perform active movement. How can this be? Do they believe these sporozoites are on route to entering? More relevant to this study would be a demonstration that they can see sporozoites in the secretory cavities of the salivary gland epithelial cells, this should shown. If they have already performed a number of experiments, I would suggest to do quantification of the number of sporozoites observed in defined regions . The mention that sporozoites are moving is confounded by the flow of hemolymph. How do they know that the sporozoites are motile versus being carried by the hemolymph. Perhaps it's premature to jump to sporozoite motility in the mosquito when they haven't even shown sporozoite presence in the salivary glands.

      (4) In vivo imaging has been performed with the mosquito' sideways. Was this the best orientation? Have you tried other orientations like from the front (Figure 5B orientation).

      Overall, the text is easy to follow and I have only few suggestions.

      In the result section, the authors describe the DsRed expression during development of mosquito (line 194-236) after they describe subcellular localization of fluorescent reporters. I felt the flow was disrupted. Thus, this part (line 194-236) could summarize and move to line 135. In this way, the result section flow according to the main figures.

      In the result section, text line 186-190, the authors describe the morphological alternation of salivary gland in aapp-hGrx1-roGFP2. I would suggest to mention that this observation was only in one of lateral lobe. (I saw that it was mentioned in the figure legend but not in the main text.)

      In the discussion section, author discuss localization of fluorescent reporters (line 322-331). When I looked at aapp-DsRed localization pattern (Figure 3A), the pattern looked similar to the previous publication by Wells et al 2017 (https://www.nature.com/articles/s41598-017-00672-0).

      This publication used AAPP antibody and stain together with other markers (Figure 4-7). This publication could be worth referring in the discussion section.

      In the text, authors describe salivary gland lobes as distal lobes and middle lobe. It would be more accurate to refer to the lobes as the lateral and medial lobes. The lateral lobes can then be sub-divided into proximal and distal portions. I would suggest to use distal lateral lobes, proximal lateral lobes and median lobe as other references use (Wells M.B and Andrew D.J, 2019).

      Overall, the figures are easy to understand and I have following suggestions and questions. Figure 1C) It is hard to see WT salivary gland median lobe. If authors have better image, please replace it so that it would be easier to compare WT and transgenic lines.

      Figure 2) While it was interesting to observe the significant expression differences between day 3 and day 4, have you checked if this expression maintained over time or declines or increases (especially on day 17-21 when author perform in vivo imaging)? Figure 3A) There is no description of "Nuc" in figure legend. If "nuc" refers to nucleus, have you stained with nucleus staining dye (example, DAPI)? <br /> Figure 4B) The number of biological replicates in the figure and the legend do not match (In the figure, there are 3-5 data points and, in the legend, text says 3 biological replicates.) Figure 4C) The number of data points from (B) is 5. However, in (C) only 4 data points are presented. Figure 5) I would suggest to have thorax image of P. berghei infected mosquito to show both salivary glands and parasites.

      Significance

      The transgenic lines that authors created have potential for in vivo imaging of salivary gland and sporozoite interactions. Since the aapp and trio lines have distinct fluorescence expression, they could help elucidate why sporozoites are more likely to invade distal lateral lobes compare to median lobe.

      My areas of expertise are confocal microscope imaging, mosquito salivary gland and Plasmodium infection and sporozoite motility.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Klug et al. generated salivary gland reporter lines in the African malaria mosquito Anopheles coluzzii using salivary gland-specific promoters of three genes. Lobe-specific reporter activity from these promoters was observed within the salivary glands, restricted either to the distal lobes or the medial lobe. They characterized localization, expression strength and onset of expression in four mosquito lines. They also investigated the possibility of influences of the expressed fluorescent reporters on infection with Plasmodium berghei and salivary gland morphology. Using crosses with a pigmentation deficient mosquito line, they demonstrated that their salivary gland reporter lines represent a valuable tool to study the process of salivary gland colonization by Plasmodium parasites in live mosquitoes. SG positioning close to the cuticle in 20% of females in this strain is another key finding of this study.

      The key findings from this study are largely quite convincing. The authors have created a suite of SG reporter strains using modern genetic techniques that aid in vivo imaging of Plasmodium sporozoites.

      Vesicular staining within salivary acinar cells should be stated as "vesicle-like" staining unless a co-stain experiment in fixed SGs is conducted using antisera against the marker protein(s) and antisera against a known vesicular marker (e.g. Rab11). It may also be possible to achieve this in vivo using perfusion of a lipid dye (e.g. Nile Red), but this is not necessary. As is, in Fig. 3A, there are images in which it appears that the vesicle-like staining is located both within acinar cells' cytoplasm and in the secretory cavities (e.g. Fig. 3A: aapp-DsRed bottom and middle), and this is fine, but should be more inclusively stated. Fixed staining of the reporter strain SGs would allow for clarification of this point. In previous work, other groups have observed vesicle-like structures in both locations (e.g. PMID: 33305876).

      Morphological variation is extensive among individual mosquito SGs, thought to impact infectivity, and well documented in the literature. The manuscript should be edited to make it much clearer (e.g. n = ?) exactly how many SGs, especially in microscopy experiments, were imaged before a "representative" image was selected from each data point and in any additional experiment types where this information is not already presented. Figure S8 is an example where this was done well. Figure 3A-B is an example where this was not well done. All substantial variation (e.g. "we detected a strangulation..." - line 189) across individual SGs within a data point should be noted in the Results. Because of the genetics and labor involved, acceptable sample sizes for minor conclusions may be small (5-10), but should be larger for major conclusions when possible.

      Sporozoite number within SGs has been shown to be quite variable across the infection timeline, by mosquito species, by parasite strain, in the wild vs. in the lab, and according to additional study conditions. The authors mention that the levels they observed are consistent with their prior studies and experience, but they did not utilize the reporter strains and in vivo imaging to support these conclusions, instead relying on dissected glands and a cell counter. It is important for these researchers to attempt to leverage their in vivo imaging of SG sporozoites for direct quantification, likely using the "Analyze Particles" function in Fiji.

      The added time investment for this additional analysis would be around two weeks for one person experienced in the use of the imaging software.

      This manuscript is presented thoughtfully and such that the data and methods could likely be well-replicated, if desired, by other researchers with similar expertise.

      The statistical analysis is appropriate for the experiments conducted. It is currently unclear if some experiments were adequately replicated. That information should be added to the paper throughout where it is missing.

      Studies from multiple groups should be more thoroughly referenced when the authors are describing the "vesicle-like" staining patterns observed in SGs from reporter strains (e.g. Fig. 3A). Is this similar to the SG vesicle-like structures observed previously (e.g. PMIDs: 28377572, 33305876, and others)?

      There are minor grammar issues in the manuscript text (e.g. "Up to date" should be "To date"). The figures are primarily presented very clearly and accurately. One minor suggestion: In cases such as Fig. S2A images 3 and 6, where some of the staining labels are very difficult to read, please move all labels for the figure to boxes located directly above the image.

      The data and conclusions are presented well.

      Significance

      This report represents a significant technical advance (improved in vivo reporter strain and sporozoite imaging), and a minor conceptual advance (active sporozoite active motility), for the field.

      This work builds off of previous SG live imaging studies involving Plasmodium-infected mosquitoes (e.g. Sinnis lab, Frischneckt lab, etc.), addressing one of the major challenges from these studies (reliable in vivo imaging inside mosquito SGs).

      This work will appeal to a relatively small audience of vector biology researchers with an interest in SGs. Many in the field still see the SGs as intractable, instead choosing to focus on the midgut due to ease of manipulation. Perhaps work like this will spark new interest in tangential research areas.

      I have sufficient expertise to evaluate the entirety of this manuscript. Some descriptors of my perspective include: bioinformatics, SG molecular biology, mosquito salivary glands, microscopy, RNA interference, SG infection, and SG cell biology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The first achievements of the Klug et al. study are the (i) genetical engineering of the Anopheles coluzzii mosquitoes reared in insectarium, that stably express distinct fluorescent reporters (DsRed and hGrx1-roGFP2 and EGFP) under the putative "promoters" of genes reported to encode proteins expressed differentially in the pluri-lobal salivary glands(Sg) of anthropophilic blood-feeding adult females, (ii) the analysis of the promoter activity - based on the selected fluorescent reporter - with a primary focus on the salivary gland/Sg (including at the Sg lobe level) of the adult female but also considering the preimaginal developmental time with larvae and pupa samples. Of note, some data confirm the already reported time-dependent and blood meal-dependent promoter activity for the related Anopheles species. The last part presents preliminary dataset on live imaging of Plasmodium berghei sporozoites with the aim of highlighting the usefulness of these A. coluzzii transgenic lines to better understand how the rodent Plasmodium sporozoites first colonize and then settle as packed cells in Sg acinar host cells.

      Major comments

      The two first objectives presented by the authors have been convincingly achieved with (i) the challenging production of four different lines expressing different single or double reporters chosen by the authors (and appropriately presented in the result text and figure sections), (ii) the careful analysis of the spatiotemporal expression of the DsRed reporter under two "promoters" studied and with regards to the blood feeding event parameter. However, if the reason why the authors have put so much effort in the production of their transgenic mosquitoes is (and as mentioned) to provide a significant improved setting enabling the behavioral analysis of sporozoites upon colonization and survival in the Sg, it seems this part is kind of limited. Likely in relation with this perception is the fact I found the introductory section often confusing and not enough direct to the points: in particular distinguishing the rationale from the necessity to produce appropriate models, and clarifying what is/are the added value(s) offered by these new transgenic lines models when compared to what exist (in Anopheles stephensi) with specific evidence that argue for this knowledge gain. At this stage, it is unfortunately not clear to me, what is the bonus of imaging the Plasmodium fluorescent sporozoites in hosts with fluorescent salivary gland lobes if one can not monitor key events of the Sg-sporozoite interaction that were not reachable without the fluorescent mosquito lines. Furthermore, it should be better explained why the rodent Plasmodium species has been chosen rather Plasmodium falciparum (or other human species) for which A. coluzzii is a natural host; may be just mentioning that this study would serve as a proof of concept but bringing real biological insights would be fine.

      1- The three genes and gene products selected by the authors should definitively be more systematically explained, which means for example the authors need to introduce the different mosquito species and the parasite-mosquito host pairs they are then referring to for the promoter/encoded proteins of their interest. In the same vein, I did not find any information as to the choice of the mosquito specie (A. Coluzzii) for the current work. I was curious to know what is the advantage since better knowledge was available with Anopheles stephensi with respect to (i) Saglin and its promotor activity, (ii) aap driven dsRed expression (lines already existing) and (iii) sporozoite-gland interaction.

      2- To help clarifying the added value of the present study, introducing the species names of the mosquito and the Plasmodium that serve as a model would be appreciated.

      3- Since a focus is the salivary gland of the blood feeding female Anopheles sp., a rapid description of the glands with different lobes and subdomains the results and figure 1 nicely refer to, would help in the introduction.

      4- That description could logically introduce the few proteins actually identified with lobe specific or cell domain specific expression (apical versus basal side, intracellular or surface expose, vacuole, duct...) profiles. The context with regards to sporozoite biology would then easily validate the "promoter choice". As a minor remark, I miss the reason why the authors wrote " the astonishing degree of order of the structures (referring to the packing of sporozoites within the Sg acinars) raise the question whether sporozoite can recognize each other". Please clarify since packing/accumulation can be passive due to cell mechanical constraints and explain what this point has to see with the question and experimental work proposed here?)

      5- The selection of hGrx1-roGFP2 is quite interesting and justified but there is then no use of this reporter property in the preliminary characterization of the Sg and Sg-sporozoite interaction. Could the authors provide such characterization?

      6- Figure 1: it would be nice to add in the legend at what time the dissection/imaging has been made (age, blood feeding timing?). I would also omit the double mutant trio-Dsred/aapDsred in the main figure (may be supplemental) since the two single mutants Dsred separately together with the double mutant (with different fluorescence) already provide the information. I would suggest to regroup the phenotypic presentation of the transgenic line made in the KI mosquitoes (current figure 5) in the main figure 1.

      7- Figure 2:

      a) Is there anything known on the Sgs' size change overtime. It seems that between day 1 and 2 there is an increase of size and volume as much as I can evaluate the volume (Fig S4). Could that mean that there is increase in cell number in the lobes and therefore more cells expressing the transgene which would account for the signal intensity increase rather than more transcripts per cell?

      b) why choosing 24h after the blood meal to assess promoter activity in the Sgs? Do we have any information on how the blood meal impact on the Sgs'development. At this time anyway the sporozoites are far from being made. Yosshida and Watanabe 2006 mentioned at significant decrease of Sg proteins post-blood feeding. Could the authors detail their rationale based on what the questions they wish to address

      8- Figure 3: The figure is quite informative in terms of subcellular localization. Concerning the section "Natural variation of DsRed expression in trio-DsRed mosquitoes", I think it could be shortened because because it is a bit out of the focus the study.

      9- In contrast the last section of live imaging of P. berghei sporozoites in the vicinity and within salivary gland should be expanded. The 2 sentences summarizing the data are quite frustrating "We also observed single sporozoites moving actively through tissues in a back and forth gliding manner (Fig. 6B, Movie 3) or making contact with the salivary gland although no invasion event could be monitored"

      10- I am aware of the technical difficulties to perform live imaging of sporozoite on whole mosquitoes, even when the salivary gland lobe under observation is closely apposed to the cuticle but that seems to be the final aim of the authors. I looked very carefully to the three movies and I am sorry but at this stage I could not make meaningful analysis out of them, and could not agree with the conclusions: for instances, the authors specify that sporozoites were undergoing back and forth movements (movie 3) but I do not see that and do not see the Sg contours in the available movies? The authors should also add bar and time scales to their movies. Having an in-depth description with regards to the sub-domain marked by a relevant reporter would strengthen the study, even if images are not collected in the whole mosquito to get higher resolution.

      I am not sure I understand the relevance of this quite condensed sentence in the text. Could the authors rephrase and expand if they wish to keep the issues they refer to. "The sporozoites' distinctive cell polarization and crescent shape, in combination with high motility, allows them to „drill" through tissues". I would stress more on the main unknown in terms of sporozoite-Sg interactions and the need to get right models for applying informative approaches (i.e. here, imaging).

      Of note, it could help to point that the "Sgs is a niche in which the sporozoites which egress from the oocyst could mature and be fully competent when co-deposited with the saliva into the dermis of their intermediary hosts"

      Significance

      1- Clear technical significance with the challenging molecular genetics achieved in the mosquito A. coluzzii.

      2- More limited biological significance: fair analysis and gain of knowledge of spatio-temporal of reporter expression under the selected promoter but limited significance of the final goal analysis which concerns the Plasmodium sporozoite biology once egressed from oocysts

      3- Previous reports cited by the authors have used the DsRed reporter and the aap promoter in another Anopheles (i.e. A. stephensi, Yoshida and Watanabe, Insect Mol Biol, 2006; Wells and Andrew, 2019) which is also a natural host and vector for human Plasmodium spp.) with significantly more resolutive 3D visualization of GFP-fluorescent P. berghei but in dissected salivary glands and not in whole mosquitoes. The Wells and Andrew publication entitled "Salivary gland cellular architecture in the Asian malaria vector mosquito Anopheles stephensi" in Parasite Vectors, 2015 would deserve to be reference and described.

      4- Audience: I would say that this work should be of interest of mostly scientists investigating Plasmodium biology (basic and field research) or in entomology of Diptera.

      5- To describe my fields of expertise, I can refer to my extensive initial training in entomology including at one point in the genetic basis of mosquito-virus interaction. I have also been working for more than 20 years in the field of Apicomplexa biology (Plasmodium and Toxoplasma) and I have long-standing interest in live and static high-resolution imaging.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      This manuscript reports the generation and characterization of transgenic lines in the African malaria mosquito Anopheles coluzzii that express fluorescent proteins in the salivary glands, and their potential use for in vivo imaging of Plasmodium sporozoites. The authors tested three salivary gland-specific promoters from the genes encoding anopheline antiplatelet protein (AAPP), the triple functional domain protein (TRIO) and saglin (SAG), to drive expression of DsRed and roGFP2 fluorescent reporters. The authors also generated a SAG knockout line where SAG open reading frame was replaced by GFP. The reporter expression pattern revealed lobe-specific activity of the promoters within the salivary glands, restricted either to the distal lobes (aapp) or the middle lobe (trio and sag). One of the line, expressing hGrx1-roGFP2 under control of aapp promoter, displayed abnormal morphology of the salivary glands, while other lines looked normal. The data show that expression of fluorescent reporters does not impair Plasmodium berghei development in the mosquito, with oocyst densities and salivary gland sporozoite numbers not different from wild type mosquitoes. Salivary gland reporter lines were crossed with a pigmentation deficient yellow(-) mosquito line to provide proof of concept of in vivo imaging of GFP-expressing P. berghei sporozoites in live infected mosquitoes.

      Major comments

      Overall the manuscript is very well written with a clear narrative. The data are very well presented. The generation of the transgenic mosquito lines is elegant and state-of-the art, and the new reporter lines are thoroughly characterized.

      This is a nice piece of work that is suitable for publication, although the in vivo imaging of sporozoites is somewhat preliminary and would benefit from additional experiments to increase the study impact.

      The reporter mosquito lines express fluorescent salivary gland lobes, yet the authors only provide imaging of parasites outside the glands. It would be relevant to provide images of the parasite inside the fluorescent glands.

      The advantage of the pigmentation-deficient line over simple reporter lines is not clear, essentially due to the background GFP fluorescent in figure 5C. Imaging of GFP-expressing parasites should be performed in mosquitoes after excision of the GFP cassette under control of the 3xP3 promoter. This would probably allow to document the value of the reporter lines more convincingly.

      Along the same line, it is unclear if the DsRed spillover signal in the GFP channel is inherent to the high expression level or to a non-optimal microscope setting. This is a limitation for the use of the reporter lines to image GFP-expressing parasites.

      The authors should fully exploit the SAG(-) line, which is knockout for saglin and provides a unique opportunity to determine the role of this protein during invasion of the salivary glands. This would considerably augment the impact of the study. In this regard, line 131 and Fig S3E: why is there persistence of a PCR band for non-excised in the sag(-)EX DNA?

      Did the authors search for alternative integration of the construct to explain the trioDsRed variability?

      Line 254-255. Does the abnormal morphology of SG from aapp-hGrx1-roGFP2 result in reduced sporozoite transmission?

      Minor comments

      -Line 51: sporogony rather than schizogony

      -Line 56: sporozoites are not really deformable as they keep their shape during motility

      -In the result section, it is not clearly explained where constructs were integrated. Line 106 and 434-435: for the non-expert reader, it is not clear what X1 refers to, strain or locus for integration?

      -Line 112-115: the rational of integrating GFP instead of SAG is not clearly explained here, but become clearer in the discussion (line

      -Line 140: FigS2A instead of S3A

      -Perhaps mention that GFP reporters (SG) might be useful to image RFP-expressing parasites.

      -Line 236: the authors cannot exclude integration of an additional copy (as mentioned in the discussion line 367-368).

      -Line 257-258. The title of this section should be modified as SG invasion was not captured.

      -Line 287: remove "considerable number" since there is no quantification.

      -Line 400-402: Klug and Frischknecht have shown that motility precedes egress from oocysts (PMID 28115054), so the statement should be modified.

      -Line 404: remove "significant number" since there is no quantification.

      -Line 497: typo "transgenesis"

      -FigS1: add sag-DsRed in the title

      -Stats: Mann Whitney is adequate for analysis in fig 2C but not 2B, where ANOVA should be used (more than 2 groups).

      Significance

      This work describes a technical advance that will mainly benefit researchers interested in vector-Plasmodium interactions. Invasion of salivary glands by Plasmodium sporozoites is an essential step for transmission of the malaria parasite, yet remains poorly understood as it is not easily accessible to experimentation. The development of transgenic mosquitoes expressing fluorescent salivary glands and with decreased pigmentation provides novel tools to allow for the first time in vivo imaging in live mosquitos of the interactions between sporozoites and salivary glands.

      Reviewer's expertise: malaria, Plasmodium berghei, genetic manipulation, host-parasite interactions

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the two reviewers for the precious time devoted in the evaluation of our original manuscript and for the useful feedback that guided its revision. We addressed all points raised by the reviewers as detailed below. The changes in the revised manuscript are reported using green characters so that they can be more easily identified in the next evaluation. We would like to comment on the sentence of Reviewer 2 that

      “The bulk of this paper was either demonstrated or predicted by another paper Hanemaaijer et al. in 2020 using very similar methods.”

      We disagree with this statement and here is why. In Hanemaaijer et al. (2020) the authors demonstrate Nav1.2 permeability to calcium by experiments in HEK-293 cells expressing the channel. Then, they show that a calcium transient associated with the action potential (AP) in the AIS is mediated by sodium channels, but the pharmacology they used (TTX) could not distinguish between Nav1.2 and Nav1.6 (both channels being TTX-sensitive) and produced the blockade of the action potential. Thus, they inferred that at least part of the calcium transient associated with the AP is mediated by Nav1.2, but they could not exclude the involvement of Nav1.6. This is extremely important since we clearly demonstrate that it is only Nav1.2 that carries a calcium current and not Nav1.6. Regarding the interaction between Nav1.2 and BK channels, this possibility was only suggested in the Discussion on the basis of the channel distribution and its biophysical properties. However, here again, there was no formal demonstration. This seminal study is of course important nevertheless and hence was cited in our manuscript six times (reference [12]). However, this study did not reach formal conclusions. The reason why the authors could not further advance from this hypothesis relies on the lack of methods. In our study we fulfilled this limitation by adding two technical advances that were not available before. First, we achieved a selective partial block of Nav1.2 thanks to the newly optimized G1G4Huwentoxin-IV peptide. Second, we directly estimated the effect of Nav1.2 block on the sodium current underlying the AP generation using an imaging technique that we recently developed (Filipis & Canepari 2021, reference [15] in the manuscript). Thanks to these significant methodological differences with respect to Hanemaaijer et al. (2020), we unambiguously demonstrated a Nav1.2 calcium influx component associated with the AP and we assessed the question of the target of this calcium signal which turned out to be the BK channel. In line with that, we argue reasonably that when a research study reports interesting results and from there on suggests important working hypotheses that are not formally demonstrated, and then a second study demonstrates experimentally that these hypotheses are correct (thanks to innovative methodologies), then this second study is also of major importance and cannot be considered a confirmation of the first study.

      Rebuttal to Reviewer 1

      However, statistical analysis should be performed with non-parametric tests such as Mann-Whitney or Wilcoxon tests and not t-test because of the small samples (t-test can be used only for large samples).

      We strongly agree that a “two-population” t-test, being parametric, is not adequate when comparing two or more small sets of independent samples. However, in this study, we never compared sets of independent samples, but we compared two sets of correlated samples where the first sample is the measurement in control condition and the second sample is the same measurement after addition of the channel blocker. Thus, for this type of datasets, we used a “paired” t-test. While the standard Mann-Whitney or Wilcoxon are not appropriate for sets of correlated samples, we followed the reviewer recommendation to perform a non-parametric test and we applied the “Wilcoxon ranked sign test” (non-parametric equivalent to the paired t-test). The results of this additional test were consistent with the paired t-test. As reported in the manuscript, all data and metadata used in this study will be available in the public repository Zenodo (doi: 10.5281/zenodo.5835995) after the manuscript will be accepted by a journal. Thus, while in the manuscript we only report p

      1) abstract, line 5, "...by a recent peptide,...". I guess the authors mean "...by a peptide recently identified..."

      We replaced with “recently modified peptide” since the wild-type Huwentoxin-IV was identified some time ago.

      Rebuttal to Reviewer 2

      1. It would be fair to point out somewhere in the introduction or discussion that the role of Nav1.2 and its interaction with BK channels was already predicted by Hanemaaijer et al. (2020).

      We pointed out this prediction in the Discussion of the revised manuscript. It must be said that the important work of Hanemaaijer et al. (2020) (reference [12]) is cited six times in the manuscript. As stated above, the interaction between Nav1.2 and BK channels was only suggested in Hanemaaijer et al. (2020) and not demonstrated.

      I would recommend citing Huang & Rasband (2018) for a relatively up to date review of channels in the AIS.

      This review is now cited in the Discussion of the revised manuscript (reference [38]).

      The authors should state whether the L5 cells are L5a or L5b (or whether they didn't distinguish).

      Although L5a and L5b could be in principle recognised also by their morphology, no attempt was done to distinguish between two functionally different groups of cells. This is now stated in the Introduction of the revised manuscript.

      The choice of colors in the figures made it hard to see which line was which. The authors use blue vs light blue and green vs light green, which were indistinguishable on my screen and in print. The orange vs yellow figs could just be made out.

      In control conditions, we use green for sodium, red for voltage and blue for calcium, which also makes it easy to follow in the simulations. We followed the reviewer’s recommendation and used always grey traces for all traces after addition of the channel blocker. It should be easier now in the revised manuscript to visually discriminate the traces.

      Are the plots on the RHS of Fig. 2b averages?

      The plots are from the cell in panel a. This is now stated in the figure legend of the revised manuscript.

      I don't understand the sentence starting, "This signal translated in a substantial delay..."

      We replaced this sentence, in the revised manuscript, with “In our experiments, the peak of the Ca2+ current in the distal axon preceded both the peak of the somatic AP and the peak of the Ca2+ in the proximal part of the AIS”. We thank the reviewer for finding this ambiguous sentence.

      I also didn't understand the point being made in the sentence: "The evident anticipation of the AP peak, also with respect to the somatic AP peak, suggests that Ca2+ influx associated with the AP is mainly mediated by VGNCs in the distal axon, whereas it is mainly mediated by VGCCs in the proximal axon." The block of calcium with VGCC-blockers is convincing in itself but this argument based on timing isn't convincing. Isn't the point that VGCCs are to be found closer to the soma, despite the fact that Nav1.2 is also found closer to the soma, so that the presence of Ca-conducting Na+ channels near the cell body could be masked?

      We erased this sentence in the revised manuscript.

      I think it would make more sense to write, "the effects produced by 1 µM IK CAKC inhibitor tram-34 were not significant".

      We replaced “variable” with “not significant” in the revised manuscript.

      There is a mismatch in Fig. S5 between the 400 nM, 800 nM, 1600 nM labelling in the figure and the "40 nM, 80 nM or 160 nm" in the legend.

      We thank the reviewer for having found this mismatch. The values are those in the figure labels, so we corrected the values in the figure legend in the revised manuscript.

      In Fig. S7c-d, the authors write "... the widening of the distal axonal AP was observed in 4 cells, whereas the AP waveforms did not change in 3 cells". I think this is not appropriate if the results are statistically insignificant. It implies that the authors know somehow that the outliers are not just noise.

      We changed this inappropriate sentence in the revised manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Filipis et al. present a study of the underlying mechanisms for the generation of action potentials in the axon initial segment (AIS) of layer 5 somatosensory pyramidal neurons. The experiments employed three different imaging approaches for following sodium, voltage and calcium changes at two different locations in the axon near the cell body. This was combined with pharmacology for comparing the contribution of two different types of sodium channels: Nav1.6 and Nav1.2, the latter having a concomitant Ca2+ conductance. They confirm the previously established findings in terms of the location of Nav1.6 lying more distally than Nav1.2 on the AIS, and show that while Nav1.6 contributes mostly to the initiation of the AP, Nav1.2 plays an important role in controlling the shape of the AP via Ca2+ activation of BK (potassium) channels. The study represents a careful attempt to investigate a difficult subject owing to the small size of the axon initial segment and the difficulty of disentangling the different channels and influences. I find the paper well carried out and well presented and a useful contribution to understanding the firing properties of these important neurons.

      I only have a few minor points:

      1. It would be fair to point out somewhere in the introduction or discussion that the role of Nav1.2 and its interaction with BK channels was already predicted by Hanemaaijer et al. (2020).
      2. I would recommend citing Huang & Rasband (2018) for a relatively up to date review of channels in the AIS.
      3. The authors should state whether the L5 cells are L5a or L5b (or whether they didn't distinguish).
      4. The choice of colors in the figures made it hard to see which line was which. The authors use blue vs light blue and green vs light green, which were indistinguishable on my screen and in print. The orange vs yellow figs could just be made out.
      5. Are the plots on the RHS of Fig. 2b averages?
      6. I don't understand the sentence starting, "This signal translated in a substantial delay..."
      7. I also didn't understand the point being made in the sentence: "The evident anticipation of the AP peak, also with respect to the somatic AP peak, suggests that Ca2+ influx associated with the AP is mainly mediated by VGNCs in the distal axon, whereas it is mainly mediated by VGCCs in the proximal axon." The block of calcium with VGCC-blockers is convincing in itself but this argument based on timing isn't convincing. Isn't the point that VGCCs are to be found closer to the soma, despite the fact that Nav1.2 is also found closer to the soma, so that the presence of Ca-conducting Na+ channels near the cell body could be masked?
      8. I think it would make more sense to write, "the effects produced by 1 µM IK CAKC inhibitor tram-34 were not significant".
      9. There is a mismatch in Fig. S5 between the 400 nM, 800 nM, 1600 nM labelling in the figure and the "40 nM, 80 nM or 160 nm" in the legend.
      10. In Fig. S7c-d, the authors write "... the widening of the distal axonal AP was observed in 4 cells, whereas the AP waveforms did not change in 3 cells". I think this is not appropriate if the results are statistically insignificant. It implies that the authors know somehow that the outliers are not just noise.

      Hanemaaijer, N. A., Popovic, M. A., Wilders, X., Grasman, S., Arocas, O. P., & Kole, M. H. (2020). Ca2+ entry through NaV channels generates submillisecond axonal Ca2+ signaling. Elife, 9, e54566. Huang, C. Y. M., & Rasband, M. N. (2018). Axon initial segments: structure, function, and disease. Annals of the New York Academy of Sciences, 1420(1), 46-61.

      Significance

      • The results are useful and important (but perhaps not major).
      • The bulk of this paper was either demonstrated or predicted by another paper Hanemaaijer et al. in 2020 using very similar methods.
      • The audience for this paper will be a relatively specialized group focusing on biophysical explanations for axonal excitability and/or modellers.
      • My expertise is a specialization on the electrical properties of these neurons (L5 pyramidal neurons).
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This paper investigates the mechanisms of BK potassium channels activation in the axon initial segment (AIS) of cortical neurons. Using whole-cell patch-clamp recording, voltage imaging of the AIS, pharmacological tools, use of peptides and computer simulations, the authors show that calcium influx via Nav1.2 channels activates BK channels, thus shaping the action potential waveform.

      The manuscript is well written and the data are clear. The conclusions are convincing as they are in agreement with a recent study showing that Nav1.2 are permeable to calcium ions (Hanemaaijer et al., eLife 2020). The results and methods are presented in a way that they can be reproduced. However, statistical analysis should be performed with non-parametric tests such as Mann-Whitney or Wilcoxon tests and not t-test because of the small samples (t-test can be used only for large samples).

      Minor points:

      abstract, line 5, "...by a recent peptide,...". I guess the authors mean "...by a peptide recently identified..."

      Significance

      The study is interesting as it provides a target (activation of BK channels) for the Nav1.2-mediated calcium influx. Audience: specialized journal in neurobiology.

      My field of expertise is the cellular neurophysiology (axon, ion channels, synapse and plasticity). I have sufficient expertise to evaluate all the aspects of this paper.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to the reviewers

      Manuscript number: RC-2022-01407

      Corresponding author(s): Ivana, Nikić-Spiegel

      1. General Statements

      We would like to thank the reviewers for careful reading of our manuscript and for their insightful and useful comments. We are happy to see that the reviewers find these results to be of interest and significance. The way we understand reviewers’ reports, their main concerns can be roughly divided in following categories: 1) providing more quantitative data 2) interpretation of the Annexin V/PI assay 3) additional evidence for calpain involvement. We intend to address these experimentally or by modifying the text, as outlined below.

      2. Description of the planned revisions

      Reviewer #1

      Fig1A/B o SYTO 16 staining suggests slight reshaping of nucleus upon spermine NONOate, showing less blurry punctae. From the SYTO 16 profile, this should be quantifiable.

      By looking at the shown examples and the entire dataset, it appears to us as if neuronal nuclei are shrinking upon spermine NONOate treatment resulting in their less blurry appearance. We are not sure if this is what the reviewer is referring to, but this can also be quantified by measuring changes in neuronal nuclear size. We already have this data from the measurements shown in Fig4 and we intend to show it in the revised version of the manuscript. Line profile measurements are also possible, but the nuclear size quantification might be more suitable for this purpose.

      o There is a subset of neuron nuclei that are SYTO 16 positive. Please quantify the ratio

      We will use our existing dataset to quantify the ratio of NFL positive and SYTO16 positive nuclei.

      FigS1A o Show NeuN with Anti-NFL merged figures

      We will show merged NeuN and anti-NFL images, which might require rearrangement of the existing figures and figure panels. We will do this in the revised manuscript.

      FigS1C o Show quantification and timeline. I want to know whether there is also a plateau reached here.

      As the data shown in the FigS1C do not include NeuN staining, we will do additional experiments and perform proposed quantifications.

      FigS2A-F o Though the statements might be true, selecting one nucleus for a line profile as a statement for the whole dataset seems problematic. Average a larger number of unbiased selected nuclei profiles across multiple cultures to make a stronger statement, or a percentage of positive nuclei as in FigS1b.

      Corresponding images and line profiles are representative of the entire dataset. However, we agree with the reviewer that this is not obvious from the current manuscript version. Thus, to strengthen our findings, we intend to quantify the percentage of positive nuclei as in FigS1b. The only difference will be that instead of NeuN, we will use SYTO16 as a nuclear marker. The reason being that the existing datasets contain images of NFL and SYTO16 and not NeuN.

      FigS3 • There are no fluorescence profiles, no quantification

      As the reviewer suggests, we will quantify the ratio of NFL positive and SYTO16 positive nuclei, and include the quantifications in the revised manuscript.

      General statement: There do seem to be punctated patterns of non-nucleus accumulating NFL fragments. Can they be localized to any specific structure?

      We assume that the reviewer is referring to neuronal/axonal debris. They are present after injury but they do not colocalize with nuclear stains. We will address this in the revised manuscript.

      Fig1C-F • I find it too simplistic to categorize c+f and d+e together. There is a huge difference in the examples of nuclear localization between d and e. To not comment on their distinction (if that is consistent) is problematic. Also, since we don't see a merge with either NeuN or SYTO 16, reader quantification is difficult.

      We thank the reviewer for bringing this up. We will carefully check our entire dataset and we will update the figures and the text accordingly. We will also show the corresponding SYTO16 images, as the reviewer suggested.

      Would the microfluidic device construction allow for time to transport any axonally damaged fragments to the soma?

      Yes, the construction of the microfluidic devices allows the transport of axonal proteins back to the soma. Based on our experiments, it seems that damaged NFL from the axonal compartment could be contributing to the accumulation of NFL fragments in the nuclei. However, this contribution seems to be minimal as we cannot detect nuclear NFL upon the injury of axons alone. Alternatively, it could be that the processing of axonal NFL fragments proceeds differently if neuronal bodies are not injured and that this is the reason we don’t detect the NFL nuclear accumulation upon injury of axons alone. We will discuss this in the revised manuscript.

      Fig2C+D • The statement ".... no annexin V was detected on the cell membrane" needs to be shown more clearly

      We will modify figures to address this comment.

      • Please provide merged AnnexinV/PI images

      We will modify figures to address this comment.

      • The conclusion about 2D, that nuclear accumulated NFL overlaps with PI is not supported by the example image shown. There are plenty of PI positive spots that are not NFL positive and even several NFL positive ones that do not have a clear PI staining. Please quantify and then show a very clear result in order to be able to suggest necrosis as the underlying process.

      We are not sure if we understand the reviewer’s concern correctly. We will try to clarify it here and in the revised text. If necessary, we will tone down our conclusion, but the reason why not all of PI positive spots are NFL positive is most likely due to the fact that not all injured nuclei are NFL positive. We quantified in FigS1 that up to 60% of nuclei under injury conditions show NFL accumulations. That is why we are not surprised to see some PI positive/NFL negative nuclei. And the fact that there are some NFL positive nuclei which appear to be PI negative is most likely related to the fact that the PI binding is affected. In addition, upon closer inspection of NFL and PI panels in Fig2d it can be observed that NFL positive nuclei are also PI positive, albeit with a lower PI fluorescence intensity. We will modify the figure to show this clearly in the revised manuscript.

      FigS5 C+D • If the case is made that nitric oxide damage induces necrosis, then why is it that the AnnexinV example of Staurosporine exposure (which induces apoptosis) looks similar to that of nitric oxide damage in Fig2d and necrosis induction with Saponin looks very different?

      We thank the reviewer for bringing this up. We will try to clarify this in the revised manuscript. Regarding the specific questions, the most likely explanation why staurosporine treated neurons look similar to the ones treated with spermine NONOate is that in the late stages of apoptosis cell membrane ruptures and allows for the PI to label nuclei. This is probably the case here as illustrated by the nucleus in the middle of the image (FigS5c) that shows the fragmentation characteristic for the apoptosis. This is not happening in early apoptotic cells due to the presence of an intact plasma membrane. On the other hand, the reason why saponin treated cultures look different compared to spermine NONOate is that membranes are destroyed by saponin so that the PI can enter the cell. For that reason, there could have not been any AnnexinV binding to the membrane which would correspond to the AnnexinV signal of spermine NONOate treated neurons. As we will discuss below, we did not try to mimic spermine NONOate-induced injury with saponin treatment. Instead this was a control condition for PI labeling and imaging. We also used a rather high concentration of saponin which probably destroyed all the membranes which was not the case with spermine NONOate treatment. We intend to do additional control experiments to address this.

      • Additionally, does necrosis induction with Saponin also cause NFL fragment accumulation in the nucleus? Please show a co-staining of them. Also, the authors want to make a claim about reduce PI binding in NFL accumulated necrotic cells. In these examples, the intensity of the nuclear stain of PI with Saponin looks dimmer than with Staurosporine. Are the color scalings similar? It might be that the necrotic process itself causes reducing binding of PI and is not related to the presence of NFL.

      With regards to this question, it is important to note that Annexin V and PI imaging was done in living cells. To obtain the corresponding anti-NFL signal as shown in Fig 2c,d we had to fix the neurons, perform immunocytochemistry and identify the same field of view. We tried to do the same procedure after saponin treatment (Supplementary Figure 5d) but the correlative imaging was very difficult due to the detachment of neurons from the coverslip after the saponin treatment. For this reason, we could not identify the same field of view co-stained with NFL. However, other fields of view did not show NFL fragment accumulation. This could also be the consequence of the high saponin concentration that we used as we discuss above. We have also noticed the reduced intensity of PI binding in the nuclei of saponin-treated neurons. However, if the necrotic process itself reduces the binding of PI to the DNA, then all of the neurons treated with spermine NONOate would have an equally low PI signal. In our experiments, only the nuclei which contained NFL accumulations had a low PI signal, while the signal of NFL-negative nuclei was higher (as shown in Fig2d). We would also like to point out again that the saponin treatment was our control of the PI’s ability to penetrate cells and bind the DNA, as well as our imaging conditions, and not the control of the necrotic process itself. This is the reason why we didn’t go into details about neuronal morphology and NFL localization upon saponin treatment. We thank the reviewer for pointing this out since it prompted us to reevaluate what we wrote in the corresponding paragraph of the manuscript. We realized that the confusion might stem from our explanation of the AnnexinV/PI assay controls in the lines 196-198 (“Additional control experiments in which neurons were treated with 10 μM staurosporine (a positive control for induction of apoptosis) or with 0.1% saponin (a positive control for induction of necrosis) confirmed the efficiency of the annexin V/PI assay (Supplementary Fig. 5c,d).”). We will modify this portion of the text to clearly state that staurosporine and saponin treatments were controls of the AnnexinV and PI binding to their respective targets and not of the apoptosis/necrosis process. When it comes to the saponin treatment, our intention was only to permeabilize the membranes in order to allow PI penetration and DNA binding and not to induce necrosis or to mimic the effect of the spermine NONOate. We also intend to perform experiments with lower concentration of saponin to try to address this experimentally in addition to the text modifications.

      Fig3d • Please show similarly scaled images from controls for proper comparison

      We will show similarly scaled images of the control neurons so that they can be properly compared. They were initially not scaled the same for visualization purposes, but we will modify this in the revised manuscript.

      • How do the authors scale the degree and kinetics of induced damage between application of hydrogen peroxide/CCCP and glutamate toxicity? Does glutamate toxicity take longer to affect the cell, not allowing enough time to accumulate NFL fragments in the nucleus?

      It is challenging to scale the degree and kinetics of induced damage with different stressors. That is why we did not intend to do this. Instead we set different injury conditions based on the published literature. That is why can only speculate when it comes to this. In this regard, it can be that the glutamate toxicity takes “longer” to affect the cells even though it is very difficult to compare them on a timescale, especially when considering different mechanisms of action. We will discuss this limitation in the revised manuscript.

      Fig4B • Some groups (like NO and NO + emricasan) have much larger numbers of close to 0 intensity, compared to the control group. Why?

      We were wondering the same when we analyzed the data. The fact that our nuclear fluorescence intensity analysis picked up NFL signal in control neurons which had no nuclear NFL accumulation made us realize that the intensity measured in the nuclei of control group comes entirely from the out of focus fluorescence – from neurofilaments in cell bodies, dendrites and axons (an example can be seen in the FigS6). That is why we presented the corresponding data with a cut-off value based on the control signal (as mentioned in lines 238-240). Since the oxidative injury causes NFL degradation (not only in neuronal soma, but also neuronal processes), the overall fluorescence intensity of the NFL immunocytochemical staining is reduced in injured neurons. We can see that in all of our images. Consequently, there is no contribution of out of focus fluorescent signal to the measured fluorescence intensity in the majority of nuclei. Due to that, the nuclei without NFL accumulation (at least 40% of injured nuclei) will appear to have a close to 0 intensity of the fluorescent signal. We will discuss and clarify this additionally in the revised manuscript.

      • Please add the ratio of above/below threshold (50/50 obviously in controls)

      We will update the figure in the revised manuscript.

      • The description of the CTCF value calculation seems a little... muddled? Several parameters are described whereas "integrated density" is not even used. Why not simply mean intensity of nuclear ROI-mean intensity of background ROI?

      We included the integrated density in the description since it is measured together with the raw integrated density and can also be used for the CTCF value calculation. However, since we didn’t use it for the CTCF calculation, we will remove it from the corresponding section of the manuscript. We calculated the CTCF value instead of calculating mean intensity of the nuclear ROI - mean intensity of the background ROI, since the CTCF value also takes into account the area of the ROI and not just the mean intensity.

      • Also, please tell me if the areas for nuclear ROIs change, as I noted for Fig1A/B

      We will include this information in the revised manuscript.

      • To make sure that one of the 3 experimental repeats didn't skew the results, please show the median fluorescence intensity for each individual experiment to clarify that the supposed effect is repeated across experiments.

      We have already noticed that in the earliest of the three experiments overall fluorescence intensity was higher, but this was consistent across all the experimental groups and did not skew the results or affect the overall conclusion. However, we will double-check this and revise the figure.

      • From the text "...and due to the NFL degradation during injury...": this seems to contradict the process? Either the NFL fragment accumulates in the nucleus or it is degraded during injury. And isn't the degradation through calpain what supposedly allows this fragment of NFL to go to the nucleus in the first place? I reckon that the authors are possibly trying to reconcile why there are many close-to-0 intensity nuclei in the NO and NO + emricasan groups, but I don't feel the explanation given here fits.

      As we tried to explain in our response above, we think that the overall degradation of neurofilaments in neurons affects the fluorescence intensity originating from the out of focus neurofilaments. Therefore, the nuclei without NFL accumulation in injured conditions have a close to 0 fluorescence intensity. Additionally, we think that this is not an either/or situation, but that both degradation and nuclear accumulation of NFL happen simultaneously. We also think that degradation of axonal NFL and the transport of its tail domain to the soma will at least partially contribute to the accumulation in the nucleus. In any case, degradation and nuclear accumulation seem to be differentially regulated in individual neurons, as some of them show nuclear NFL accumulation and some not. Furthermore, calpain and other mechanisms could also cause NFL degradation up to the point at which these fragments can no longer be recognized by the anti-NFL antibody leading to the loss of signal. We will try to clarify this in the revised version of the manuscript.

      Fig5 • Does the distribution of this GFP in B match any of the various antibody stainings of different NFL fragments? Perhaps this is still a valid fragment of NFL, just not picked up by any AB?

      The GFP signal in B appears rather homogenous and it does not match any of the various antibody stainings of different NFL fragments. As the reviewer points out, this could also be a valid fragment of NFL fused to GFP that none of our antibodies is recognizing. We will clarify this in the revised manuscript.

      • "... and was indistinguishable from the full277 length NFL-GFP." Based on what parameters?

      We will clarify this in the revised text, but we meant in terms of overall neurofilament network and cell appearance, which is commonly used to test the effect of NFL mutations.

      • The authors claim that b is different from d, but I am not convinced. I would like to see a time dependent curve from multiple cells showing a differential change in nuclear and cytosolic GFP signal.

      As we also wrote in the manuscript, in the majority of neurons that were monitored during injury we were not able to detect an increase in the GFP fluorescence intensity in the nucleus. This is what prompted further experiments with NFL(ΔA461–D543)-FLAG. We will clarify this additionally in the revised manuscript and perform line profile intensity measurements to show the difference in nuclear and cytosolic GFP signal.

      • Secondly, the somatic GFP intensity for NFL increases for full length NFL-GFP. How is this explained, if it is only a separation of NFL and GFP? If anything, GFP should float away. And if the answer is that NFL is recruited to the nucleus, you showed that inhibition of calpain activity partially prevents that. So, if calpain activity is necessary for the transport of NFL to the nucleus, then wouldn't it also cut the GFP from NFL before it reaches the nucleus?

      We thank the reviewer for bringing this up and we apologize for the confusion. This can be explained by the fact that the images were scaled in a way that the GFP signal over time could still be seen easily (i.e. differently across different time points which we unfortunately forgot to mention in the figure legend). In the revised manuscript, we will either scale the images the same or we will alternatively show the displayed grey values in individual panels.

      Fig6 • It is recommended to overlap the transfected cells with a stain for endogenous NFL to show that despite the absence of the FLAG-tag, there is still NFL.

      We did not overlap the anti-NFL with anti-FLAG and SYTO16 staining, due to the space constraint and the intent to clearly show the overlap of FLAG and SYTO16 signals in the merged images above the graphs. However, the line profile intensity measurements were done in all three channels and show that despite the absence of FLAG, there is still NFL in the nucleus (Fig6b), or that both FLAG and NFL are present in the nucleus (Fig6d, NFL signal shown in gray). However, as this is not obvious and can easily be overlooked, we will show the endogenous NFL staining overlap in the revised version of the manuscript.

      Fig7 • „ ...all disrupted neurofilament assembly...": this sounds like the staining for native NFL supposedly shows a distortion due to a dominant negative effect of the expression of these constructs? Please clarify.

      Yes, we were referring to the disruption of neurofilament assembly due to a dominant negative effect of the expression of NFL domains. We will clarify this in the revised version of the manuscript.

      Discussion: • The authors show that after overepression of the head domain only, it possibly passively diffuses into the nucleus even in the absence of oxidative injury. However, it seems to be suggested as well that the head domain would not be freely floating around if it wouldn't be for increased calpain activity as a result of oxidative injury in the first place. Therefore, a head domain fragment localized in the nucleus would still more prominently happen upon oxidative injury and interact with DNA through prior identified putative DNA interaction sites from Wang et al. Please comment.

      That is correct. Upon injury and calpain cleavage, it is conceivable that a fragment containing the NFL head domain would also be present in the cell and could potentially diffuse to the nucleus and interact with the DNA. However, by staining injured neurons with an antibody that recognizes amino acids 6-25 of the NFL head domain, we were not able to detect an NFL signal in the nucleus (FigS2a,b). It could be that either the NFL head domain does not localize in the nuclei upon injury, or that the fragment localizing in the nucleus does not contain amino acids 6-25 of the NFL head domain. As the putative DNA-binding sites described by Wang et al involve 7 amino acids located in the first 25 residues of the NFL head domain, we would expect to detect it with the aforementioned antibody. However, as that was not the case we speculated that the interaction of NFL and DNA occurs differently in living cells, as opposed to the test tube conditions utilized by Wang et al. We will comment and clarify this in the revised version of the manuscript.

      • Reviewer #2*

      • Major Comments:

      • The initial data presented in the paper is good, does response of oxidative damage with proper controls, testing the antibodies to NF-L and etc. (Fig. 1-Fig. 4). *

      We thank the reviewer for their positive feedback.

      1. The evidence for calpain involvement in NF-L cleavage during oxidative damage is missing. Provide the evidence for full length NF-L construct and deletion mutants transfected into cells by immunoblot for cleavage of NF-L, perform nuclear and cytoplasmic extract preparations and show that enrichment of the tagged cleaved NF-L fragment in nuclear fraction.

      We thank the reviewer for their comments and suggestions. Since we saw in our microscopy experiments that calpain inhibition reduced the accumulation of NFL in the nucleus, and since it is known that NFL is a calpain substrate (Schlaepfer et al., 1985; Kunz et al., 2004 and others), we did not perform additional experiments to confirm the involvement of calpain in NFL degradation during injury. However, to strengthen our findings, we intend to perform the suggested experiments and include the results in the revised manuscript.

      1. Show calpain activation during oxidative damage by performing alpha-Spectrin immunoblots identify calpain specific 150-kda Spectrin and caspase specific 120-kDa fragment generation in these cells. Also, calpain activation can be measured by MAP2 level alteration and p35 to p25 conversion. Without this evidence it's very hard to believe if the calpain activity is increased or decreased during oxidative damage and these markers are altered by using calpain inhibitors.

      To confirm the calpain activation, we intend to perform anti-alpha spectrin and/or anti-MAP2 blots in lysates of control and injured neurons and include the results in the revised manuscript.

      1. The premise that NF proteins are absent in cell bodies and present only in axons is not correct. It has been demonstrated by multiple investigators that NFs are present in the perikaryon and dendrites of many types of neurons (Dahl, 1983, Experimental Cell Research)., Dr. Ron Liem's group showed NF protein expression in cell bodies of dorsal root ganglion cells (Adebola et ., 2015, Human Mol Genetics) and also showed N-terminal antibodies for NF-L, NF-M and NF-H stain rat cerebellar neuronal cell bodies and dendrites (Kaplan et al., 1991, Journal of Neuroscience Research) when NFs are less phosphorylated. (Schlaepfer et al., 1981, Brain Research) show staining of cell bodies of cortex and dorsal root ganglion cell bodies with NF antibody Ab150, and Yuan et al., 2009 in mouse cortical neurons with GFP tagged NF-L.

      We are not sure what the reviewer is referring to since we cannot find a corresponding section in which we claim that NF proteins are absent in cell bodies. We wrote the following “Anti-NFL antibody staining of neurons treated with the control compound showed the expected neurofilament morphology, that is, a strong fluorescence intensity in axons and lower intensity in cell bodies and dendrites (Fig. 1a)” in our results section (lines 119-121), but the claim we were trying to make there was that NF proteins are particularly abundant in axons. We will clarify this in the revised manuscript.

      1. Quantifying NF-L signal or tagged NF-L fragment signals in the cell body by ICC has many problems and making conclusions. It's extremely difficult to have control over levels of proteins in transfected overexpression models and comparing two or three different constructs with each other by ICC. Not every cell expresses same levels of protein in transfected cells and quantifying it by ICC again has a major problem. This can be addressed if there are stable lines that express equal levels of protein in all cells that comparisons can be made. Under thesese circumstances validation of the hypothesis presented in the study has no strong direct evidence to demonstrate that calpain is activated and NF-L fragment translocate to the nucleus.

      We agree that the results from overexpression-based experiments should be interpreted with caution as levels of expression vary between the cells. We intend to discuss this in the revised manuscript. However, we find it difficult to experimentally address this comment since we are not sure which specific experiments the reviewer is referring to. With regards to this, we would like to emphasize that most of the initial experiments in which we observed NFL accumulation in the nuclei of injured neurons were based on the ICC labeling of endogenous NFL and didn’t involve its overexpression. This includes labeling of endogenous NFL in various types of neurons, comparing the effects of different types of oxidative injury, as well as testing the effects of calpain inhibition on the observed nuclear accumulation (Figures 1-4; Supplementary Figures 1-6). We later resorted to the overexpression experiments in primary neurons (Figures 5-7; Supplementary Figure 7, 10) to gain more information about the identity of NFL fragment which was detected in the nucleus. Due to the low transfection efficiency of primary neurons, we performed an additional set of overexpression experiments in neuroblastoma ND7/23 cells (Figure 8; Supplementary Figures 8,9) and obtained similar results in a higher number of cells. We agree that having stable cell lines which e.g. express same levels of NFL domains would be a more elegant approach and we intend to make them for our follow-up studies, however the generation of said stable cell lines might be beyond the scope of this revision. Furthermore, looking at our data with overexpression of NFL domains in ND7/23 cells (Supplementary Figure 8,9), it appears to us as if different domains are rather homogenously expressed in different cells. While the expression levels might vary, it seems that they all show the same trend when it comes to their localization (which was the main point of those experiments).

      1. The interpretation that NF-L preventing DNA labeling cells is misinterpretation. NFs have very long half-life compared to other proteins. Due to oxidative damage, DNA is degraded in the cells but NFs that have very long half-life you see as NFs rings in the dead cells. So, NFs do not prevent DNA labeling, but DNA or chromatin is degraded in dead cells.

      We thank the reviewer for their useful insight. DNA degradation could certainly be the reason why we observe a lower fluorescence intensity of the propidium iodide fluorescence in the nuclei of injured neurons. We intend to discuss this in the revised manuscript. However, if the DNA degradation is the only reason for the lower PI fluorescence intensity, then the PI fluorescence intensity would be the same in all injured nuclei. In our experiments, we saw the reduced PI fluorescence intensity in nuclei that contained NFL accumulations and not in other nuclei. Additionally, we observed a reduction of SYTO16 fluorescent labeling of nuclei which contained accumulations of the NFL tail domain, even in the absence of oxidative injury. Due to these reasons we speculated that NFL accumulation in the nucleus might hinder nuclear dyes from interacting with the DNA. But this is only a speculation and we will try to clarify this further in the revised manuscript including alternative explanations.

      Minor comments: 1. In the introduction on page 4 reference is missing for NF transport, aggregation and perikaryal accumulation (on line 93).

      We will add a reference to the revised manuscript.

      1. The statement in discussion on page 14 line 454 for Zhu et al., 1997 study is not accurate. It should be modified to sciatic nerve crush not spinal cord injury.

      We will correct this mistake in the revised manuscript.

      1. What is the size of the calpain cleaved NF-L tail domain? If you perform immunoblots on cell extracts treated with oxidative agents one would know it.

      We will perform immunoblots on cell lysates and incorporate the corresponding results in the revised manuscript.

      1. Authors could make their conclusions clear. This is particularly true for the experiments in Figure 4 panels c and d. It is very difficult to understand the conclusions of the experiments. First state the expectation and then described whether the expectation is true or different.

      We will do as the reviewer suggested in the revised manuscript.

      1. The ICC images are at extremely low magnification. They should be shown at 100x or 120x so that details of the cell body and the nucleus can be seen.

      Our intention was to show larger fields of view and wherever appropriate insets, but we will try to improve this in the revised manuscript by either zooming in, cropping or adding additional insets with individual cell bodies and nuclei. In general, images were taken with an optimal resolution/pixel size in mind for any of the used objectives (60x/1.4 NA or 100x/1.49 NA) and we can easily modify our figure panels to show more details.

      1. Oxidative damage leads to beaded accumulation of NF-L in neurites and axons. Authors should address this issue.

      We will discuss this in the revised manuscript.

      1. The combination treatment of the inhibitors (last 3 sets of the Fig. 4 b) has no statistical significance should be removed.

      Actually, these differences were statistically significant (Supplementary Table 1). For clarity and as described in the figure legend (line 516: “The most relevant significant differences are indicated with an asterisk”) we showed only a subset of them on the graph, but we will change this in the revised manuscript.

      1. Why only two antibodies recognize cleaved NF-L? If the antibodies at directed at tail region, they should recognize it unless the phosphorylated tail at Ser473 may inibit the antibody binding. In that case NF-L Ser473 specific antibody (EMD Millipore: MABN2431) may be used to test this idea.

      This is a very good point that we also wonder about. Even if all antibodies are directed at tail region, exact epitopes are not described for all of them. That makes it also difficult for us to understand and speculate on this. However, we have already ordered the new antibody as suggested by the reviewer and we will experimentally test it.

      **Referees cross-commenting**

      I agree with the reviewer#1 about presenting the quantification data for the indicated figures to make conclusions strong and see how much of variation is there among sampled cells.

      As discussed in our response to reviewer #1, we will provide additional quantifications.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      4. Description of analyses that authors prefer not to carry out

      Reviewer #2, major comment 7. Authors could do chromatin immunoprecipitation (chip) analysis to identify NF-L binding sites on chromatin and perform gel shift assays to show NF-L tail domain binding to specific consensus DNA sequences.

      We thank the reviewer for their suggestion. We are very interested in performing additional experiments and identifying the NFL binding sites on the DNA (either by chromatin immunoprecipitation or DamID-seq) and we intend to perform these experiments as soon as possible. Unfortunately, at the moment we do not have the expertise to perform such experiments in our lab. Instead, this type of follow-up project requires establishing a collaboration which is beyond the scope of this revision.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors hypothesize that Neurofilament-L subunit of NFs participates in oxidative medicated damage by calpain activation and cleavage of its C-terminal region, translocation to nucleus and activation of toxicity driven gene expression. Authors used cell culture system, NF-L gene transfections and immunocytochemistry to make their conclusions.

      Major Comments:

      1. The initial data presented in the paper is good, does response of oxidative damage with proper controls, testing the antibodies to NF-L and etc. (Fig. 1-Fig. 4).
      2. The evidence for calpain involvement in NF-L cleavage during oxidative damage is missing. Provide the evidence for full length NF-L construct and deletion mutants transfected into cells by immunoblot for cleavage of NF-L, perform nuclear and cytoplasmic extract preparations and show that enrichment of the tagged cleaved NF-L fragment in nuclear fraction.
      3. Show calpain activation during oxidative damage by performing alpha-Spectrin immunoblots identify calpain specific 150-kda Spectrin and caspase specific 120-kDa fragment generation in these cells. Also, calpain activation can be measured by MAP2 level alteration and p35 to p25 conversion. Without this evidence it's very hard to believe if the calpain activity is increased or decreased during oxidative damage and these markers are altered by using calpain inhibitors.
      4. The premise that NF proteins are absent in cell bodies and present only in axons is not correct. It has been demonstrated by multiple investigators that NFs are present in the perikaryon and dendrites of many types of neurons (Dahl, 1983, Experimental Cell Research)., Dr. Ron Liem's group showed NF protein expression in cell bodies of dorsal root ganglion cells (Adebola et ., 2015, Human Mol Genetics) and also showed N-terminal antibodies for NF-L, NF-M and NF-H stain rat cerebellar neuronal cell bodies and dendrites (Kaplan et al., 1991, Journal of Neuroscience Research) when NFs are less phosphorylated. (Schlaepfer et al., 1981, Brain Research) show staining of cell bodies of cortex and dorsal root ganglion cell bodies with NF antibody Ab150, and Yuan et al., 2009 in mouse cortical neurons with GFP tagged NF-L.
      5. Quantifying NF-L signal or tagged NF-L fragment signals in the cell body by ICC has many problems and making conclusions. It's extremely difficult to have control over levels of proteins in transfected overexpression models and comparing two or three different constructs with each other by ICC. Not every cell expresses same levels of protein in transfected cells and quantifying it by ICC again has a major problem. This can be addressed if there are stable lines that express equal levels of protein in all cells that comparisons can be made. Under thesese circumstances validation of the hypothesis presented in the study has no strong direct evidence to demonstrate that calpain is activated and NF-L fragment translocate to the nucleus.
      6. The interpretation that NF-L preventing DNA labeling cells is misinterpretation. NFs have very long half-life compared to other proteins. Due to oxidative damage, DNA is degraded in the cells but NFs that have very long half-life you see as NFs rings in the dead cells. So, NFs do not prevent DNA labeling, but DNA or chromatin is degraded in dead cells.
      7. Authors could do chromatin immunoprecipitation (chip) analysis to identify NF-L binding sites on chromatin and perform gel shift assays to show NF-L tail domain binding to specific consensus DNA sequences.

      Minor comments:

      1. In the introduction on page 4 reference is missing for NF transport, aggregation and perikaryal accumulation (on line 93).
      2. The statement in discussion on page 14 line 454 for Zhu et al., 1997 study is not accurate. It should be modified to sciatic nerve crush not spinal cord injury.
      3. What is the size of the calpain cleaved NF-L tail domain? If you perform immunoblots on cell extracts treated with oxidative agents one would know it.
      4. Authors could make their conclusions clear. This is particularly true for the experiments in Figure 4 panels c and d. It is very difficult to understand the conclusions of the experiments. First state the expectation and then described whether the expectation is true or different.
      5. The ICC images are at extremely low magnification. They should be shown at 100x or 120x so that details of the cell body and the nucleus can be seen.
      6. Oxidative damage leads to beaded accumulation of NF-L in neurites and axons. Authors should address this issue.
      7. The combination treatment of the inhibitors (last 3 sets of the Fig. 4 b) has no statistical significance should be removed.
      8. Why only two antibodies recognize cleaved NF-L? If the antibodies at directed at tail region, they should recognize it unless the phosphorylated tail at Ser473 may inibit the antibody binding. In that case NF-L Ser473 specific antibody (EMD Millipore: MABN2431) may be used to test this idea.

      Significance

      The study has very high significance. The results obtained with proper experimentation great implications in understanding how NF proteins are degraded in the cells and how these degraded fragments would alter neurodegeneration during oxidative stress and other conditions.

      Referees cross-commenting

      I agree with the reviewer#1 about presenting the quantification data for the indicated figures to make conclusions strong and see how much of variation is there among sampled cells.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The manuscript presented by Arsić and Nikić-Spiegel investigates a physiological consequence when neurons in vitro are exposed to oxidative stress injury, specifically a supposed interaction of the tail subdomain of the neurofilament light chain (NFL), after cleavage of the full NFL protein by calpain.

      General comments:

      The conclusions the authors draw from individual non-quantified example images are sometimes seen to be too simplistic when the shown examples ask for a more thorough investigation, especially when specific merged images are not available. It is highly recommended that the authors use the available data to come to more comprehensive answers across the entire acquired dataset. This for instance happens only in figures 4 and 8 and should be extended to other figures as well. There is not necessarily doubt about the author's general claims, but convincing the reader requires showing the variability and effect size of the entire group beyond a single selected example.

      If these more thorough quantifications continue to support the author's claims, then I find no objections for publication of this data.

      Specific comments:

      Fig1A/B - SYTO 16 staining suggests slight reshaping of nucleus upon spermine NONOate, showing less blurry punctae. From the SYTO 16 profile, this should be quantifiable. - There is a subset of neuron nuclei that are SYTO 16 positive. Please quantify the ratio

      FigS1A - Show NeuN with Anti-NFL merged figures

      FigS1C - Show quantification and timeline. I want to know whether there is also a plateau reached here.

      FigS2A-F - Though the statements might be true, selecting one nucleus for a line profile as a statement for the whole dataset seems problematic. Average a larger number of unbiased selected nuclei profiles across multiple cultures to make a stronger statement, or a percentage of positive nuclei as in FigS1b.

      FigS3 - There are no fluorescence profiles, no quantification

      General statement:

      There do seem to be punctated patterns of non-nucleus accumulating NFL fragments. Can they be localized to any specific structure?

      Fig1C-F - I find it too simplistic to categorize c+f and d+e together. There is a huge difference in the examples of nuclear localization between d and e. To not comment on their distinction (if that is consistent) is problematic. Also, since we don't see a merge with either NeuN or SYTO 16, reader quantification is difficult. - Would the microfluidic device construction allow for time to transport any axonally damaged fragments to the soma?

      Fig2C+D - The statement ".... no annexin V was detected on the cell membrane" needs to be shown more clearly - Please provide merged AnnexinV/PI images - The conclusion about 2D, that nuclear accumulated NFL overlaps with PI is not supported by the example image shown. There are plenty of PI positive spots that are not NFL positive and even several NFL positive ones that do not have a clear PI staining. Please quantify and then show a very clear result in order to be able to suggest necrosis as the underlying process.

      FigS5 C+D - If the case is made that nitric oxide damage induces necrosis, then why is it that the AnnexinV example of Staurosporine exposure (which induces apoptosis) looks similar to that of nitric oxide damage in Fig2d and necrosis induction with Saponin looks very different? - Additionally, does necrosis induction with Saponin also cause NFL fragment accumulation in the nucleus? Please show a co-staining of them. Also, the authors want to make a claim about reduce PI binding in NFL accumulated necrotic cells. In these examples, the intensity of the nuclear stain of PI with Saponin looks dimmer than with Staurosporine. Are the color scalings similar? It might be that the necrotic process itself causes reducing binding of PI and is not related to the presence of NFL.

      Fig3d - Please show similarly scaled images from controls for proper comparison - How do the authors scale the degree and kinetics of induced damage between application of hydrogen peroxide/CCCP and glutamate toxicity? Does glutamate toxicity take longer to affect the cell, not allowing enough time to accumulate NFL fragments in the nucleus?

      Fig4B - Some groups (like NO and NO + emricasan) have much larger numbers of close to 0 intensity, compared to the control group. Why? - Please add the ratio of above/below threshold (50/50 obviously in controls) - The description of the CTCF value calculation seems a little... muddled? Several parameters are described whereas "integrated density" is not even used. Why not simply mean intensity of nuclear ROI-mean intensity of background ROI? - Also, please tell me if the areas for nuclear ROIs change, as I noted for Fig1A/B - To make sure that one of the 3 experimental repeats didn't skew the results, please show the median fluorescence intensity for each individual experiment to clarify that the supposed effect is repeated across experiments. - From the text "...and due to the NFL degradation during injury...": this seems to contradict the process? Either the NFL fragment accumulates in the nucleus or it is degraded during injury. And isn't the degradation through calpain what supposedly allows this fragment of NFL to go to the nucleus in the first place? I reckon that the authors are possibly trying to reconcile why there are many close-to-0 intensity nuclei in the NO and NO + emricasan groups, but I don't feel the explanation given here fits.

      Fig5 - Does the distribution of this GFP in B match any of the various antibody stainings of different NFL fragments? Perhaps this is still a valid fragment of NFL, just not picked up by any AB? - "... and was indistinguishable from the full277 length NFL-GFP." Based on what parameters? - The authors claim that b is different from d, but I am not convinced. I would like to see a time dependent curve from multiple cells showing a differential change in nuclear and cytosolic GFP signal. - Secondly, the somatic GFP intensity for NFL increases for full length NFL-GFP. How is this explained, if it is only a separation of NFL and GFP? If anything, GFP should float away. And if the answer is that NFL is recruited to the nucleus, you showed that inhibition of calpain activity partially prevents that. So, if calpain activity is necessary for the transport of NFL to the nucleus, then wouldn't it also cut the GFP from NFL before it reaches the nucleus?

      Fig6 - It is recommended to overlap the transfected cells with a stain for endogenous NFL to show that despite the absence of the FLAG-tag, there is still NFL.

      Fig7 - „ ...all disrupted neurofilament assembly...": this sounds like the staining for native NFL supposedly shows a distortion due to a dominant negative effect of the expression of these constructs? Please clarify.

      Discussion:

      • The authors show that after overepression of the head domain only, it possibly passively diffuses into the nucleus even in the absence of oxidative injury. However, it seems to be suggested as well that the head domain would not be freely floating around if it wouldn't be for increased calpain activity as a result of oxidative injury in the first place. Therefore, a head domain fragment localized in the nucleus would still more prominently happen upon oxidative injury and interact with DNA through prior identified putative DNA interaction sites from Wang et al. Please comment.

      Significance

      This in vitro study, despite its acknowledged caveats, can provide novel support for the claim that calpain induced cleavage of the NFL may play a role in downstream gene expression in order to regulate a neural response upon oxidative injury. Further investigation into this topic may provide further understand of physiological gene expression through interaction with cleavage products as well as yield possible therapeutic targets for pathological conditions. This study therefore may be of interest to a broad audience.

  2. Jun 2022
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript Woglar et al. use several light and electron microscopy techniques combined with averaging/registration methodologies to produce a comprehensive molecular map of the centriole in the C. elegans gonad. The images produced are very impressive and potentially very informative, allowing the authors to draw several important conclusions (e.g. about the chirality of the structure, and the potential organisation of Sas-6 in the cartwheel, the latter of which has been controversial in this species). Thus, although the manuscript is largely descriptive, there is a lot here that will be of great interest to the centriole field. The manuscript is generally well written and well presented, and, although I am not a great expert in all of these techniques, the data seems to solidly support the main conclusions. I therefore have only a small number of relatively minor suggestions for improvements.

      Minor Comments:

      1. It should be clarified whether the centrioles being examined here are organising genuine PCM and MTs. I know that in the embryo SPD-2 and SPD-5 are considered the main organisers of the mitotic PCM, and these centrioles are in S-phase or G2 (so I'm not sure if they are organising any PCM). SPD-5 is located internally to SPD-2, perhaps suggesting that these centrioles are not organising a bona fide PCM? On the other hand, TBG-1 and MZT-1 are located at the periphery, so I assume these centrioles are organising MTs?
      2. I think the labels (A, B, C) in Figure S1 are probably in the wrong order and are not referred to correctly in the main text.
      3. In Figure S1A two centrioles are shown that seem to be touching at their proximal ends, which I initially interpreted as meaning the centrioles were engaged. If so, there seems to be a long tail of Sas-6 connecting the two centrioles that extends well below the centriole MTs. However, reading the legend, I think this interpretation is incorrect, and the images are showing two separate centrioles that just happen to be touching? Perhaps swap in another image that won't lead to this potential confusion?

      Significance

      Although several papers have reported high resolution molecular mapping of centrioles, this one is perhaps the most detailed and does a nice job of superimposing the molecular structures on high quality EM images. Not all of these C. elegans proteins are obviously conserved, but C. elegans is a 'poster-child' model organism for centriole research, and this broad architecture will be of great interest to the entire centriole/centrosome (and also cilia) fields. In addition, the observation of chirality that is intrinsic to the inner centriole structure, and that Sas-6 is likely organised into rings rather than a steep helix, are important conclusions.

      I am an expert in centrioles and high resolution imaging, but not EM.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Woglar et al describe molecular features of the C. elegans centriole with unprecedented detail. By adapting U-ExM to extracted gonads and combining it with EM and TEM data, the authors precisely mapped the location of 12 components. They uncovered that these centrioles are shorter than in the embryo, have the same structural elements, and show an offset of centriolar proteins distribution relative to microtubules which results in chirality. Their detailed analysis also identified two novel electron-dense regions: the Inter Paddlewheel Density (IPD); and the SAS-6/4/1 Containing Density (SCD). This manuscript is a very nice description of C. elegans centrioles and we have mostly minor comments to improve it.

      1. Regarding the duplication and maturation section, the authors state in the abstract: "We uncovered that the procentriole assembles from a location on the centriole margin characterized by SPD-2 and ZYG-1 accumulation.". The data collected by the authors do not provide evidence of enrichment of ZYG-1 and SDP-2 prior to procentriole assembly (in the main text the authors clearly say they are speculating). This statement in the abstract should be corrected to more accurately match what is described in the main text and supported by the results.
      2. It is stated in the main text that the procentrioles can emanate from the middle of the centriole but no representative image is shown (only shown for off-centered procentrioles or very short templates). It is also referred that this may have implications on chirality- it would be important to explain better those implications, as well as offer an example of this configuration.
      3. The authors mention "core PCM" throughout the manuscript without explaining or referencing its definition. Would be useful to the reader if more information is provided.
      4. FigS1.A looks strange because procentrioles seem much longer than centrioles and their relative orientation does not seem to be orthogonal. If this image is representative, it would be helpful to have a diagram explaining the image.
      5. In the main text it is said: "Four components were found to localize to the paddlewheel: HYLS-1[N], SPD-2, SPD-5 and PCMD-1." and SDP-5 is represented in the final scheme (Fig. 7). However, an overlay of SPD5 and EM data is never shown. The authors may extrapolate that SPD-5 localizes there because it is interior to SPD-2 with no offset compared to α-tubulin, but if this is the case it should be clearer in the text.
      6. A statistics section is missing in which the program used is detailed and whether the {plus minus} values in the figures depict SD or SEM. The number of independent experiments should also be mentioned.
      7. Although symmetrization has been increasingly adopted by the field, it would still be useful to reference previous examples of its application in centriole structure analysis.
      8. S1B and S1C figure labels are swapped.
      9. The authors claim that "the procentriole likewise harbors little SAS-4 initially and that more protein is recruited at prometaphase, resulting in similar levels of SAS-4 in the centriole and the procentriole by then (Fig. 2D)". Can the authors provide some sort of semi-quantitative readout?
      10. In Figure 5A side view, the presence of an inner tube is not very clear. Given that diameter quantifications were done using the mostly side views, it would be beneficial if the authors could provide a clearer image.

      Significance

      Overall, these observations contribute toward a better understanding of centriole structure, molecular composition and diversity, with a particular focus on C. elegans. The precision of the approach developed by the authors (U-ExM and EM overlay) is a valuable tool and will be of interest to the centriole biology field and to cell biologists in general.

      Reviewer expertise: Cellular and molecular biologists working in the field of centrioles.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors analyze here the organization of centrioles in C. elegans, by combining the physical expansion of the specimens (by about 5-fold) with stimulated emission depletion (STED) microscopy. They analyze a large number of centriole components in different experiments, and they combine the data into a convincing model of the centriole, which is presented in conjunction with electron microscopy images of this structure. The work is solid, well-performed and technically sound. While this reviewer is not a centriole expert, the work also appears to be sufficiently novel, simply due to its precision, to warrant publication.

      Significance

      I only have one suggestion, which the authors may consider. Most of their work involves analyzing the symmetry of the structures, as presented, for example, in Fig. 4. However, symmetry problems, observable in individual structures, may also be informative. Are specific proteins more prone to variable localization, as, for example, SPD-2-C or SPD-5, while others are more stereotypically organized? Could an analysis of the variability of the stainings provide information on flexibility in the centriole organization?

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The team explores a previously developed "centriole stability assay" to monitor the disappearance of centrioles after RNAi-dependent depletion of various centrosome components. Important roles in centriole stability are found for the PCM and for cartwheel proteins, in addition to proteins of the centriole wall. The remainder of the study focuses on the centriole wall protein ANA1: induced degradation of ANA1 during Drosophila oogenesis strongly reduces the PCM and other centriole markers, and ANA1-dependent defects cannot be prevented by GFP-Polo-PACT, which is otherwise known to protect from the loss of PCM. In complementary experiments, forced targeting of ANA1 to the PCM, or overexpression of AN1 protects centriole integrity.

      Significance

      The study shows that ANA1 is important for the integrity of centrosomes. Generally, this work is well executed and correctly controlled. The novelty of the results is somewhat limited, since a role of ANA1 in centrosome assembly has already been reported by others. The present work emphasizes aspects of centrosome protein maintenance, but doesn't provide mechanistic details of protein turnover. The manuscript should be of interest to the scientific community working on the centrosome.

      Other comments:

      I wonder whether the results from the centrosome maintenance experiment with GFP-Polo-PACT (Figure 3) are really very telling: since PCM and other centriole markers are lost upon ANA1-depletion, GFP-Polo-PACT cannot target to the PCM, and it is therefore unsurprising that GFP-Polo-PACT fails to provide its protective effect. Would expression of GFP-Polo-PACT prior to addition of ANA1-RNAi have a protective effect?

      Minor point:

      Figure 1H: it is unclear to me how centrioles are identified with the BLD10 marker in samples that have been treated with BLD10 RNAi.

      Referees cross-commenting

      I agree very much with reviewers 1 and 2 that a role of ANA1 "downstream" of the PCM is not really supported by the data.

      I also think that all other points raised in the reviews merit attention.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper, the authors show that the turnover of centriole components is necessary for proper centriole maintenance within Drosophila cultured cells (during prologued cell cycle arrest) and within Drosophila oocytes, where centrioles are normally degraded prior to fertilisation. They highlight Ana1 as an important player in centriole maintenance. The authors begin with a candidate screen to identify core centriole proteins that are required to properly maintain centrioles. They then focus on Ana1, given that its depletion had the strongest effect, and show that its depletion leads to a reduction in the levels of centriole components in Drosophila oocytes. They show that the previously observed ability of centriole-targeted Polo to counteract centriole loss depends at least in part on Ana1 and that targeting Ana1 to centrioles also counteracts centriole loss. The authors conclude that Ana1 is a component of the PCM-promoted centriole integrity pathway.

      Major comments

      1. The authors say that Plk4 depletion does not lead to centriole loss, but there are significant differences in centriole number between the control and Plk4 depletion cells in Fig 1F and S1D. Please comment.
      2. One of the main results is that depletion of centriole components leads to a reduction in centrosome numbers when measured 8 days after S-phase arrest. I wonder whether a restriction of centriole duplication could add to this effect? Any cells that were in G2 or M phase when the drugs were added would presumably progress into the following S-phase and duplicate their inherited centrioles, but not if centriole duplication proteins had been depleted. It's true that Plk4 depletion leads to a relatively mild centriole loss phenotype, but can the authors be sure that this is not due to variations in the efficiency of different RNAi constructs? Perhaps the authors can show that Plk4 depletion efficiently prevents centriole duplication under otherwise normal conditions.
      3. The authors show that Ana1 depletion has the strongest effect, but this could in theory be due to differences in RNAi efficiency. I don't expect the authors to show the efficiency of all RNAi constructs, but they could state in the text that this is a caveat e.g. "...although we cannot rule out the possibility that differences in RNAi efficiency lead to the observed differences in severity of phenotype..."
      4. A key conclusion is that core centriole components turnover to some extent and that the incorporation of new molecules is necessary for centriole maintenance. This is a very interesting and important point and so it would be nice to have more direct data to support it. This could be done in different ways, including transfecting fluorescently tagged centriole components after S-phase arrest and showing that some molecules become incorporated into the centrioles, or by performing FRAP experiments. Of course, it is possible that the turnover is so low that the incorporated fluorescent molecules cannot be detected...
      5. The authors show that depletion of Ana1 from oocytes leads to a reduction in the intensity of centriole markers. They do not measure centrosome numbers, as the centrosomes cluster too tightly. The authors therefore can't be certain that Ana1 depletion leads to a reduction in centrosome numbers. The authors could show this by inhibiting centrosome clustering while depleting Ana1. There is a recent BioRxiv paper showing that centrosome clustering can be inhibited by depletion of Kinesin-1.
      6. In Figure 3B the authors show that expression of GFP-Polo-PACT partially rescues the effect of "all PCM" depletion, but this seems strange given that Polo's role is presumably to recruit PCM (which has been depleted). Can the authors comment? Also, it would make sense to test whether GFP-Polo-PACT can rescue centriole loss after the depletion of Ana1 alone (not Ana1 and all PCM). If Ana1 has a role in recruiting Polo (either directly or indirectly), which has been shown previously in mitotic cells, then there should be a rescue to some extent.
      7. In Fig4A,C, the authors say that γ-tubulin levels at centrosomes increase when GFP-Polo is forced onto the centrosomes - the graph seems to show a big increase, but the pictures do not...? Are the authors measuring total levels at all centrosomes? If so, I think they should be measuring the average at individual centrosomes. Also, why is the level of GFP alone not much higher when expressed with GFPnanoPACT (Fig 1B)? Presumably GFP should be recruited to the centrosomes by GFPnanoPACT.
      8. The authors show that tethering Ana1-GFP to the centrioles counteracts centriole loss in oocytes (Fig4G). They say that the centrosomes are most likely inactive because they don't recruit PCM, but they have only looked at γ-tubulin, which is a downstream component of the PCM. I think it is important to check whether Polo is recruited, given that tethering Polo to centrioles also counteracts centriole loss and that a recent paper showed that Ana1 has a role in recruiting Polo to centrosomes (Alvarez-Rodigo et al., 2021). The authors also say that these centrosomes do not organise microtubules but do not show the data.
      9. The authors propose that Ana1 is downstream of the PCM, and so over-expressing Ana1 should at least partially rescue centriole loss after PCM depletion. But I don't really agree with this. If Ana1 relies on the PCM then how would its overexpression manage to rescue the phenotype in the absence of the PCM? The finding that over-expressing Ana1 partially rescues centriole loss may instead suggest that Ana1 is either upstream of the PCM or part of an independent pathway. Indeed, the authors show that depletion of both the PCM and Ana1 has a stronger effect than either depletions individually - this is indicative of two independent pathways.

      Minor comments

      1. When the authors say that the centriole wall and cartwheel components are "dynamic" I think that they need to make it clear that this "dynamicity" is not very fast. Using the term dynamic tends to suggest rapid turnover (like in the PCM). Perhaps the authors could use the term "slow exchange" or something similar.
      2. The authors currently use a 0 or 1 centriole categorisation - it would be nice to see the breakdown of what percentage of cells have 0, 1, 2, or >2 centrioles, perhaps in a supplementary excel file.

      Significance

      How centrioles are eliminated in certain cells is an interesting question and the data presented is also relevant to understanding centriole biology in general, because it seems that some apparently very stable structural proteins actually turnover. It is widely known that PCM proteins turnover relatively quickly, but core centriole proteins are considered to be stably incorporated. The data will therefore raise interest in the centrosome field. I do, however, feel that for the authors to make this point more strongly it would be good to show this more directly. Overall, this is a very interesting paper that is well written. The data is well presented and supports the conclusions that centriole components turnover and that Ana1 is involved in maintaining centriole integrity.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript Pimenta-Marques build on their previous work addressing how centrioles are stabilized and maintained or destabilized and disassembled, depending on the cell type and developmental context. Using Drosophila cell culture and oogenesis as an in vivo model for centriole destabilization, they identify the centriole wall protein Ana1 as a central player in centriole stability. Its presence is required for the maintenance even of mature centrioles, suggesting that there is continued turnover of centriole structural components.

      Major comments:

      1. The experiments and results are very well described and most of the conclusions are supported by the data. One aspect needs clarification though. It is not clear to this reviewer how the authors envision the regulation and mechanism by which Ana1 functions in centriole stability. The data suggest that it can stabilize centrioles independent of PCM (Fig. 3B, 5B), yet the authors claim in the results and discussion that it functions downstream of PCM. As presented, this does not make sense. I would argue the opposite, it may function upstream or in parallel to the PCM. Related to the above, the last sentence of the intro states: "Finally, we found that both Polo and the PCM require ANA1 to promote centriole structural integrity." This is shown for Polo, but where is the data showing that PCM requires ANA1 for promoting centriole stability?
      2. I have a concern regarding the number n used for statistics in the quantifications. In many cases it seems that the number n of cells etc. was used (e.g. n>100 cells) rather than the number of experiments (e.g. n=3). The statistics should measure variability between experimental repetitions, not between cells etc. If statistics were indeed not done on experiments and would have to be changed, some of the observed effects may not be statistically significant and would require additional experimental replicates, which would increase the time needed for revision.

      Minor comments:

      1. I would advice the authors to improve the presentation of the figures. In particular the labels are in many cases very small and difficult to read. Readability is also reduced by the use of bold font in the labels and a mix of various font sizes within single figure panels.
      2. The result section could be shortened/become more readable by moving several paragraphs to the intro or discussion.
      3. The introduction is quite long and some parts read more like an introduction of a review on the topic.

      Significance

      This is a nice, focused study on the requirements underlying centriole stability and maintenance. The first part identifies the cartwheel, the centriole wall, and the PCM as important for centriole maintenance. The remaining parts identify and focus on the essential role of ANA1 in this process. This is an important finding, since the mechanisms underlying centriole stability and maintenance are poorly understood, yet highly relevant. Some cell types inactivate and/or disassemble centrioles during differentiation and this is likely important to their function. Providing more mechanistic insight, for example, regarding the relationship between ANA1 and PCM recruitment or the regulation of ANA1's centriole function by Polo, would have further strengthened the study. The audience interested in this work will be cell and developmental biologists. My expertise is in centrosome biology and microtubule organization.

      Referees cross-commenting

      I agree with the additional points raised by the other reviewers. I still think that overall the paper is fine and most things could be addressed in a reasonable time frame. The work does not provide much mechanism though. In this regard, the confusing placement of ANA1 downstream of PCM, would be the only mechanistic aspect, and it seems the authors got it wrong, at least based on the provided data. Here, additional experiments could elucidate these relationships further, but if this is not the goal, text changes could also address this and it would remain a smaller, more focused study.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manuscript describes how a rhythmically active transcription factor is important for molting cycles. The first part of the manuscript focuses on oscillating genes, and the authors nicely show a rhythmic transcription of these genes. Indeed, using RNAPolII Chip-Seq experiments, they show a rhythmic recruitment of the RNAPolII to the promoters of the oscillatory genes they have previously described. They then demonstrate that GFPs driven by the promoter of these oscillating genes, and inserted as a single copy, can very accurately recapitulate the rhythmic transcription of oscillating genes. It is interesting to see the weak impact of introns and 3'UTR on the rhythmic expression. In the second part of the manuscript, the authors perform an RNAi screen, looking for oscillating transcription factors (TF) important for molting. The goal of this approach is to identify core oscillators that could control molting cycles. Focusing their screen on oscillating TF allows them to exclude TF that would be required for embryonic or larval viability unrelated to molting. Among the 6 candidates they have identified, 5 have already been linked to molting. They focus their study on grh-1, which has never before been described during larval development. They characterize the molting phenotype of grh-1 defective worms using the AID degron system. They monitor the molting cycles using a luciferase assay in a liquid culture where transgenic worms for luciferase are grown in the presence of bacteria supplemented with luciferin. Using this approach (which is quantitative and allows high-throughput analysis), they show that grh-1 is required for each molt in a dose-dependent manner. The GRH-1 protein oscillates and peaks shortly before each molt entry, reinforcing the idea that GRH-1 is an important core TF for molting. The authors finally show that the oscillating activity of GRH-1 is crucial for the molting.

      Major comments:

      Overall, the data are clear and convincing, and the results are quantified with care. The first part of the manuscript represents a significant amount of work with the RNAPolII Chip-Seq, the different single-copy integrants and RT-PCRs. Then, the authors provide a quantitative assessment of the molting process by combining their grh-1-AID construct with the luciferase system. This strengthens the quality of the manuscript.

      My substantial suggestion is that the authors could consider extending the scope of the study in two alternative ways:

      • One question that immediately springs to mind is: what are the targets of grh-1? By applying their luciferase assay to further possible downstream targets of grh-1, they could attempt to phenocopy the grh-1 molting defect, and then look if the oscillating expression of these targets is eliminated in grh-1 defective animals. The binding site of grh-1 is apparently known (Venkatesan, 2003), so is it possible to reduce the potential target list among the oscillating genes using a bioinformatics approach? This requires a substantial amount of work (2 to 3 months) but it would help tell a more complete story.
      • It is striking that myrf-1 and nhr-23 RNAi display the same molting defects as grh-1 RNAi as show in figure 2B. Have the authors considered testing the genetic interactions of grh-1 with these two other candidates? Do they belong to the same GRN? Does grh-1 depletion impact the expression of the nhr-23 and myrf-1, or vice versa? Do they have the same target genes (Chip-seq data for nhr-23 are available)? This, again, would significantly strengthen the paper and would make the second part more complete. This alternative piece of work would require less experiments than the first suggestion but would be also of great interest. These two points are only suggestions as it represents a significant amount of work, and the paper could very well be published in its current form.

      About the luciferase assay for grh-1, nhr-23 and myrf-1 RNAi, the authors observe "an apparent arrest in development or death following atypical molts". What do they mean by "atypical molt" at this stage of the paper? Indeed, for these candidates, the luminescence traces are highly perturbed after the second molt (for grh-1 and nrh-23 RNAi) or the third molt (for myrf-1), but these abnormal traces seem to reflect an arrest in development or larval lethality rather than an atypical molting. Can the authors clarify this point?

      In the part on the phenotypic analysis of GRH-1 depleted animals, the authors conclude the paragraph with "GRH-1 is required for viability at least in part through its role in proper cuticle formation". This role in proper cuticle formation refers to the cuticle break in the head region as observed in time lapse. It would be useful to have a visual test of the cuticle permeability using an Hoechst staining.

      The authors show GFP::GRH-1 pictures at different stages to describe a rhythmic protein accumulation (see also my minor comment on GFP picture quality). From the perspective of whether all tissues are oscillating, it would be interesting to see if all the cells they mention in the text are showing the same rhythmic fluorescence.

      In relation to the previous comment, which tissue is responsible for the defects observed by the degradation of GRH-1? Is it possible to use a tissue-specific depletion of AID-tagged GRH-1 using Seam-cell specific, rectal cell specific, vulval precursors specific promoters, etc...?

      In the last part of the results, the authors show that molting requires oscillatory GRH-1 activity by depleting GRH-1 at variable times in L2. It would be interesting to know what happens if a stable (non-oscillating) amount of GRH-1 protein is maintained over time in the worms (using a non-oscillating promoter).

      Minor comments:

      In figure 5 B, C, D, it seems that right before entering the M2, M3 and M4 respectively, there is a peak of luminescence (a thin bright line) and a strong luminescent signal is detected at the molt exit. Can the authors comment on that?

      If I understand correctly, for the GRH-1 GFP CRISPR reporter (Fig S7, S8 and S9), the authors have imaged single worms in microchambers on a spinning disk microscope. I fail to see why they used such a sophisticated approach to describe the expression pattern of GRH-1. This imaging setup is ideal for timelapse. However, in the context of which cell express GRH-1, the resolution is not good enough to fully assess cell identity. Indeed, the GFP images are a bit blurry, and it is difficult to make out the difference between real GFP fluorescence and gut autofluorescence. It would be helpful to have better quality pictures with a more regular setup, i.e., a 2% agarose pad mounted on a regular microscope or confocal. For non-specialists of the C. elegans anatomy, small insets for each category of cells mentioned (seam cells, vulval precursors etc.) would be appreciated.

      It would be easier to assess the GRH-1 expression decrease in adults if the pictures were shown in parallel with the larval pictures, with the same brightness/contrast correction (if any). Make insets to compare the same cells between different stages.

      How can the authors quantify the duration of molts 3 and 4 in fig S4 when these molts are not seen in the luciferase assay in fig 2B? Can the authors clarify this point?

      Writing/clarity: For non-specialists, mention why the authors used a PEST sequence in their constructs and explain what the eft-3 promoter is (they mention it in the Luciferase assay, but it is not clear enough).

      In Fig S4, make clearer that EV = MOCK.

      In the methods, the authors refer to the we146 mutant strain, but they neither use it nor mention it in the body of the text. Producing such a mutant strain is great and it should be mentioned in the results, with an explanation as to why they are dying. Otherwise, it should be removed from methods.

      In the Methods, the genotype of the strains is misleading. For example HW1372 : EG6699; xeSi... is not the regular way to write a C. elegans genotype. It should be written as: HW1360 xeSi131[F58H1.2p::GFP::H2B::Pest::unc-54 3', unc-119+] II; unc-119(ed3) III as the strain used to generate the MosSci insertion is described in the paragraph on Transgenic reporter strain generation.

      For the GFP CRISPR strain, the authors write either GFP::GRH-1 in Fig S7, S8 and S9, grh-1::gfp::3xflag in the methods or GRH-1-GFP fusion in the results. The authors should homogenize the way they write this reporter strain. Whether it is an Nterminal or a Cterminal fusion will determine how they should label it.

      Enlarge the font size for Fig S8 and 9 for the scale bar.

      Significance

      The author's prior analysis (Meeuse, et al 2020) showed that mRNA oscillations are coupled with developmental processes, including molting. The present paper extends that finding by showing that oscillating transcript levels are directly linked to a rhythmic recruitment of the RNAPolII on their promoter. Then Meeuse and her colleagues use the molting as a model system to access the importance of oscillating TF for rhythmic processes. Through an RNAi screen, they have identified 6 candidates involved in the molting process. One of the candidates, grh-1, is characterized further. They combine a quantitative-based analysis (luciferase assay) with a time and dose-controlled degradation of GRH-1 to clearly describe the impact of grh-1 depletion on molting. This time and dose control is very smart and key to their study. Overall, the paper adds some interesting piece of information to the field of rhythmic control of molting cycles, as it shows that oscillating transcription factors provide a developmental clock in this process. But this notion is not completely new, as it has been shown in other developmental processes like the circadian clock. Moreover, how molting cycles are controlled by GRH-1 remains to be elucidated.

      My field of expertise is GNRs studies, the genetic of C. elegans, embryonic and larval development in C.elegans, timelapse and confocal imaging. I do not have expertise in Chip-Seq analysis.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this paper, Meeuse et al. analyze the determinants of rhythmical molting in C. elegans using a combination of RNA-polymerase ChIP-seq, RNA-seq, RNAi and imaging coupled to targeted degradation. The main finding is the discovery of the importance for the molting process of GRH-1, a transcription factor homolog to Vertebrate Grainyhead, as well as the identification of 5 additional transcription factors for which molting is defective.

      Major comments:

      The experiments are generally well performed, and the results convincingly demonstrate the function of GRH-1 in molting.

      The coupled RNA-seq and ChIP-seq experiment (Fig. 1A/B) was done only once.

      One of the claims (p3) is that rhythmic transcription of oscillating genes is driven by rhythmic RNA pol II occupancy. If this is suggested visually by Figure 1A/B, I think this claim merits further statistical analysis. How is ChIP-seq correlated with RNA-seq globally and at the individual gene level? How good is the correlation? Can the authors provide a supplementary table with this correlation at the gene level? In the same paragraph, a couple examples of genes for which RNA polII ChIP does not correlate with RNA-seq would be helpful to the reader (they are currently cited as "instances where oscillating mRNA levels were not accompanied by rhythmic RNAPII promoter binding"). Please provide gene names.

      The oscillatory transcription of several promoters is tested using GFP fusions coupled with q-RT PCR. It is not clear to me whether these experiments were repeated. Additionally, the authors state that each qPCR was repeated (only once), hence both data points should be shown in the graphs on Fig. 1C and S2.

      The authors perform then RNAi knock-down of 92 transcription factors involved in molting using bioluminescence and identify 6 genes involved in molts, three of which have been characterized previously. They focus on one of the other three, grh-1, as its orthologs are involved in epidermal biology in other organisms. Targeted degradation of GRH-1 using the auxin degron confirms the function of GRH-1 in each molt, in an auxin-concentration dependent manner. GFP tagging of GRH-1 shows an accumulation prior to each molt, suggesting the transcription factor is necessary for the onset of molting. As the GRH-1 target site is known (PMID 12888489), the paper would be greatly strengthened if the authors could loop back to the RNA-seq/ChIP-seq dataset and highlight which cycling genes have indeed a GRH-1 binding site in their promoter sequence and whether this correlates with one specific phase of the molting cycle.

      Similar to the ChIP/RNA-seq experiments, it is not clear to me whether the different bioluminescence experiments were performed once or twice.

      As stated above, the results are very convincing and the conclusions quite clear. Additionally, all wet lab experiments are described in very many details. However, I feel that the description of the data analysis is too succinct to allow reproducing the experiments, even using the GEO data. I would expect the authors to provide a github public link with the scripts to perform the RNA-seq and ChIP-seq analyses, the Matlab scripts to analyze bioluminescence experiments (Fig. 2,5,7) and the CNN used to analyze single worm molting, even if the method is to be described elsewhere.

      Minor comments:

      The authors should provide descriptive statistics for their high-throughput sequencing (read number, mapping statistics etc...).

      In figure 1C, it would be helpful to the reader to write peak phase and amplitude of the tested genes on the graph.

      In the same figures, for gene F58H1.2, for which the correlation between the reporter and the endogenous gene is not perfect, the authors "suspect [...] that the reporter may lack relevant promoter or intronic enhancer elements". An alternative explanation is that post-transcriptional regulation occurs for this mRNA, a hypothesis which should be added to the text.

      Significance

      As stated above, this manuscript highlights convincingly that GRH-1 is involved in the molting cycle in the nematode C. elegans, a conserved function of the gene between evolutionary distant species for skin biology.

      Field of expertise: C. elegans, high-throughput sequencing methods, imaging.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Meeuse and colleagues describes a series of detailed and elegant experiments addressing the molecular mechanisms underlying oscillatory gene expression patterns in the nematode Caenorhabditis elegans and how these are required for molting between larval stages. By performing ChIP-seq on synchronised populations at 12 different timepoints (each separated by 1h), the authors find that RNA pol II (RNAPII) occupancy at >2000 promoters shows an oscillating behaviour that is paralleled by similar changes in mRNA abundance. To identify transcription factors (TFs) involved in generating these cycles, the authors conducted a candidate RNAi screen on 92 TFs annotated as been expressed in an oscillating manner themselves. Monitoring developmental progression of individual animals by a luminescence-based assay, the authors identify 6 TFs that caused either death (or arrest) after aberrant molts (3 TFs) or prolonged duration on molts (3 TFs). One of these, the Grainyhead/LSF transcription factor GRH-1 is then investigated in further details. Temporal control of GRH-1 depletion is achieved by TIR1/auxin-mediated protein degradation. This revealed that GRH-1 is required during each larval molt. In agreement with the criteria for including grh-1 among the candidate RNAi clones, GRH-1 protein levels are shown to peak immediately before entry to each molt.

      I am very enthusiastic about the manuscript at several levels: conceptualisation, experimental design, quality of data and clarity of text and discussion. I have only two comments in the category of "major comments":

      1. I do not see how the experiments presented in Figure 7 can be used as argument to conclude that "Molting requires oscillatory GRH-1 activity" (title of last Results section and also reflected in the title of the manuscript). I think the experiment nicely shows that there is a timepoint beyond which removal of GRH-1 no longer interferes with the upcoming molt but that doesn't imply that GRH-1 necessarily needs to oscillate. Stronger evidence could be provided by inducible, non-oscillating GRH-1 expression.
      2. After reading the manuscript, one is left with the obvious question: which of the many oscillating genes are direct targets of the oscillating TF GRH-1? Experiments to answer this are not strictly needed for the current manuscript, which stands perfectly on its own, but it would make a significant increase in the overall understanding of oscillating genes and GRH-1 in particular.

      Minor comments:

      The sampling used to generate the data represented in Figure 1 correspond to from 22 hours until 33 hours of post-embryonic development at 25{degree sign}C. It would be useful to indicate how these time points relate to larval development L1-L4.

      The authors state that co-oscillation of RNAPII and mRNA is not observed for all genes. Although this might indeed be due to technical limitations are suggested by the authors, it would be relevant to provide more information (e.g. number or percentage of genes).

      Are the images of GRH-1::GFP expression in larvae (Fig S8) and adults (Fig S9) acquired with identical settings? I assume so, but it is not obvious, particularly because the images are in separate figures.

      Have the authors examined if GRH-1 activity is not only controlled by oscillating transcription, but also post-translationally (e.g. phosphorylation, nuclear import, etc.) as is the case for many TFs. The authors could potentially "overlay" the GRH-1::GFP data with transcriptional data (available at least for 22-33h) to see if the shapes coincide.

      Significance

      Oscillating genes are found in many biological contexts and are fascinating examples of tightly controlled gene expression. The Großhans laboratory has previously identified close to 4,000 oscillating transcripts and is a leader in this field. The current manuscript incorporates a variety of sophisticated techniques that together enable the authors to identify six genes that are required for rhythmic molting in C. elegans. The protein most deeply studied in the manuscript, GRH-1, is homologous to Grainyhead, which is involved in ECM remodeling and other cyclic processes. The findings in this manuscript are therefore of potential relevance across a broad evolutionary scale.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      The manuscript is dedicated to a study of functional roles of a panel of cell migration regulatory factors, and notably the highly homologous family of ARF GTPases. The chosen model is a prostate cancer cell line, used in a number of assays in culture, as well as in a study of primary tumor formation and metastasis in mice. The authors apply a rigorous quantitative approach to their assays of 2D and 3D migration in culture, and use artificial intelligence for the analysis of results, thus upgrading their work from mere phenotypic observations, and gaining statistically significant results. The main finding of the study is the discovery of a unique role of ARF3, a regulatory protein that is shown to control a switch between individual and collective cell migration depending on its abundance. In fact, a depletion of ARF3 leads to an increased individual cell migration and invasion, and to increased metastasis formation in mice, whereas an overexpression of ARF3 favors a sheet-like collective cell migration, which is also more efficient than control in culture, but does not induce metastasis in vivo. This phenomenon appears to depend on the levels of cellular N-cadherin, that is shown to be positively regulated by ARF3 on the protein level, by a mechanism that remains unclear. Finally, the authors analyze the expression of ARF3 and N-cad in a variety of tumors of different origins and grades, and attempt to show the prognostic value of these factors for progression-free survival.

      Major comments:

      • Are the key conclusions convincing?

      The majority of conclusions are convincing, with the exception of the observations I will address in the next paragraph. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      In Fig. 2B, the phenotypes 1 (Spread, pink line) and 5 (Spindle, yellow line) are described in the text as showing a "modest but robust increase". No such increase is evident by eye, and if it is statistically robust, the information should be presented in the Figure, or the statement should be modified/removed from the article.

      In Fig. 2E, the lower middle panel shows a dramatically increased quantity of acini, whereas the authors specifically say that proliferation is not impacted by the KD of ARF3, and indeed, the KD2 looks very much like the control in this respect. It is misleading, and a more typical panel should be presented for ARF3_KD1.

      In Fig. 3, the authors study the effects of simple versus double KD of ARF1 and ARF3, and conclude that a double KD leads to a phenotype "midway" between the two simple KDs. However, with regard to the 3D invasion assays (Figs. 3DEJ), it looks like a double KD is less efficient than either of the simple ones, as if ARF1 and 3 were partially mutually dependent in this regulation. It is not clear what "midway phenotype" the authors are talking about. - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      The authors make a strong case for physical interaction and mutual stabilization of ARF3 and N-cad, though the negative regulation of N-cad following ARF3 depletion is not obvious from Fig. 5B (positive regulation is very clear). Moreover, it is difficult to understand why a total disappearance of ARF3 has such a discreet effect on N-cad, whereas a very modest overexpression of ARF3 leads to such a dramatic increase of N-cad. Perhaps, some experiments with proteasome inhibition (using MG132, for example) could substantiate the authors' claim about the mutual stabilization of the two proteins. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Yes, the experiments are realistic and should not take more than a month. - Are the data and the methods presented in such a way that they can be reproduced?

      Yes. - Are the experiments adequately replicated and statistical analysis adequate?

      I am not sufficiently qualified in artificial intelligence algorithms to judge this part of the study. In general, as mentioned above, all differences characterized as "significant" or "robust" should have a statistical basis for this statement, which was not always the case in the manuscript.

      Minor comments:

      • Specific experimental issues that are easily addressable.

      Please see above. - Are prior studies referenced appropriately?

      To the best of my knowledge, yes. - Are the text and figures clear and accurate?

      Yes, with the exceptions described before. - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Please see above.

      Referees cross-commenting

      I fully agree with the detailed and careful analysis made by the reviewers 1 and 2. I do not have any additional comments.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The study shows, for the first time and in a very clear way, that the small GTPase ARF3 has a unique function in determining the pattern of human cancer cell migration, that this function depends on the C-terminal domain of ARF3 and on N-cadherin (by mechanisms that remain to be elucidated), and that this phenomenon is important for metastases formation in vivo. - State what audience might be interested in and influenced by the reported findings.

      The study will be interesting for the field of cell migration, but also for specialists in cancer and metastasis formation. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Cell migration 2D 3D, acini assay, RAC-WAVE-ARP2/3 pathway. I am not sufficiently qualified to evaluate the robustness of the artificial intelligence algorithms, nor to judge the relevance of the analysis presented in Fig. 7.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Sandilands et al. analyze the role of ARF GTPases ARF1 and ARF3 in human prostate cancer cell setting with specific focus on 3D versus 2D growth and invasion. The study connects to previous work where the authors conducted similar studies with ARF GTPase exchange factor IQSEC1 acting as an invasion promoting factor. Now, the authors interrogate the "ARFome" in prostate cancer cell line PC3 by use of a lentiviral shRNA library. The authors conduct detailed 3D and 2D cell culture analyses to identify specific differences in cell morphology between the different single knockout clones, showing loss of ARF1 and ARF3 as key switches of a spindle like morphology that is associated with enhanced migration and invasion. Interestingly, the authors find ARF3 functioning significantly dependent on the C-terminal region, and, further, that ARF3 is a direct companion of N-cadherin levels, whose downregulation leads to enhanced migratory capability. The main findings of the manuscript are: (1) ARF1 and ARF3 knockdown elicits key-differences in 3D morphology and migratory capacity, (2) specifically ARF3 is associated with maintenance of N-cadherin levels via PSD and RAB11FIP4 effectors, (3) whose downregulation leads to enhanced metastatic spread in a orthotopic xenograft mouse model, and (4) lowered N-cadherin protein / ARF3 mRNA levels identify more aggressive human prostate tumors. Analyses and experiments conducted in the manuscript are highly extensive, they follow robust methods and are well controlled. Results described are highly detailed and are impressively visualized and presented. Key data are highlighted, and the manuscript is clearly structured. However, some data could be described and argued more concisely, which would strongly support the results shown. I recommend publication after some minor but important changes.

      Major comments:

      1. The entire results are based on studies of a single cell line PC3 derived from a highly aggressive metastasic lesion. To infer such an essential principle of tumor invasion and migration from this may be a bit precarious, and perhaps this principle should also be demonstrated in another cell line. The choice of PC3 cells and its implications should at least be discussed.
      2. Results showing cell morphology page 6 / Fig.2: end of paragraph: stating "normally suppressing invasion": seems too far at this state of the manuscript, as these experiments are shown in the next section. maybe better "involved in preservation of a rounded phenotype". Results Suppl.Fig.3f: please use the same colors for the morphologies as in Fig. 2 etc. (round - red, spindle - green, spread - blue).
      3. Conclusive sentences should not be put at the beginning of an experiment, before one can know its outcome: Results page 8, Fig.4g, bottom: "This revealed that the C-terminus of ARF3 is required for sheet type invasive activity" maybe put that rather as a conclusion of the whole section.
      4. Results showing 2D migration and 3D invasion: In the illustrations of migration and invasion assays shown throughout Figs 3-5 and Suppl. Figs 3 and 4, please clearly state and indicate for each case whether this is 2D migration or 3D invasion, as these two assays are very similar, which is a little confusing throughout the manuscript. Suppl.Fig.3e,k,l: is this 2D or 3D? Results Fig.4b+d and Fig.5g: please amend "3D invasion".
      5. Summarizing sketch Results Fig.5k: in the scheme on the right side indication of respective presence / absence of ARF3/N-cadherin is missing. Which state induces which condition? Please amend.
      6. ARF3 suppresses PC3 xenograft metastasis shown in Fig.6: N-cadherin stainings of mouse xenografts and metastases are missing.
      7. Patient data ARF3 mRNA correlations Fig.7 and Suppl.Fig.6ab / Results page 11: the whole section describing ARF3 mRNA levels in diverse tumor types is too long and a little bit confusing: maybe shorten the text, put Fig.7f+g supplemental, and please indicate the combined GENT2 database in Fig. S6b.
      8. Patient data CDH2 mRNA/protein correlations: Fig.7 and Suppl.Fig.6cd / Results page 12: one should also shorten and sharpen this section. Please also exchange Suppl. Fig. panels 6 dc to have the same order as Suppl. Fig.6ab. Regarding the detailed analyses of CDH2 mRNA levels, make a too long story short, essential are N-cadherin protein levels, and these results shown in Fig. 7 hik and Fig. 7rst should be enlarged and highlighted. All other data (Fig. 7jl) and the right bars of panels Fig. 7mno, as well as Fig. 7pq are rather supplemental.
      9. The finding of elevated N-cadherin levels correlated with reduced invasion/migration and according better tumor outcome is surprising, particularly with regard to the quite established "EMT" dogma of N-cadherin driven single cell migration. Could you go into more detail about this property of N-cadherin driven mode of reduced tumor spread in the discussion?

      Minor comments:

      1. Manuscript title: maybe rephrase and reverse order of events: first invasion, then metastasis?
      2. Results Suppl.Fig.1ij: which cells are shown? Please amend RWPE-1 and PC3.
      3. Results page 5: although already described in Nacke et al 2021, please explain the term "acinus".
      4. Results page 6: maybe introduce an additional subchapter? Some subchapter titles could be more explicative.
      5. Results page 6, 2nd paragraph: please describe what was done: "revealed that knockdown of ..."
      6. Results page 7: please explain more detailed class I ARFs, which ARFs are included in this class?
      7. Results page 7, Fig.3f-j and 2nd paragraph: maybe better switch: first 3D, then 2D? Results Fig.3j: maybe amend indication of knockdown "KD" as indicated in Figure 3b-e.
      8. The conclusion on page 8 top "this suggests that a function of ClassI ARFs may be to regulate molecules that control collective bahaviours" is quite broad, please be more specific.
      9. Suppl.Fig.4n: should be named "l"?
      10. Results page 8, introducing sentence: please formulate more clearly and following the previous results.

      Significance

      Contents/Level of interest/merit:

      This study by Sandilands et al. analyzes cellular models of ARF-GTPase linked changes of cell morphology and migration to detect altered prostate cancer cell metastatic behaviour. Understanding the contribution of specific ARF GTPases in cancer cell shape and movement might help to identify markers of disease progression and metastasis.

      Strengths/Conclusions:

      The authors perform whole ARF-compendium knockdown and conduct detailed data analysis and visualization. The authors perform morphological analyses and conduct migration and invasion studies. Mechanistically, they confirm expression changes of N-cadherin, the key adhesive protein that is regulated. This study underlines the importance of analyzing subtle GTPase pathway differences by detailed morphological observations and methods.

      Comment/Weakness:

      The authors show extensive and detailed data that have been thoroughly analyzed, and results are presented and described fluently. There is a clear sequence of results description that is presented detailed. Form and contents of the paper is sound. The experiments are highly connected to previous experiments and data and this is also the major drawback of this manuscript: there is a lack of clear description of what is shown because it is already presupposed. Therefore, some sections should be worded and presented in more detail to present results more explicitly. The manuscript can be accepted with minor but essential revisions.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study represents a tour de force in advancing our understanding of the Arf family and it's associated regulators/effectors in controlling the morphology of individual cells and collective behaviours in 3D ECM. The authors use shRNA-mediated knockdown in high throughput imaging and AI approaches to define the influence of the ARFome on the morphology of prostate cancer PC3 cells growing as acini structures in 3D ECM. This allowed classification of ARFs and effectors/regulators on the basis of associated phenotypes, and this in itself is a useful resource for the community. It also drove the authors to focus on ARF3 and its regulator ESD and effector Rab11-FIP4. Analysing cell motility in elegant 2D and 3D assays showed that ARF3 levels control the collective migration of PC3 cells in 3D invasion, and the authors relate these findings to an interaction between Arf3 and N-cadherin. Using a mouse model of intra-prostatic injection of PC3 cells they demonstrate clear links between ARF3 levels and metastasis in vivo, and patient data further supports the link between Arf3, N-cadherin and metastasis. Experiments are well controlled and complex data are beautifully presented. In general data support the conclusions, where this is not as clear is highlighted below.

      Major comments:

      Figure 4: The use of the Arf3/4 chimeras is an interesting approach, used to show that the ARF3 C-terminus is important for its function related to migration/invasion. However the effect this has is not clear- it is not GTP loaded efficiently and may therefore act as a dominant negative. Furthermore the authors do not indicate which intracellular compartment ARF3 associates with, or if this is altered when the ARF3 C-term is replaced by that of ARF4 (ARF3N/1C).

      Figure 5: Links to N-cadherin are clearly interesting, but the model proposed in Figure 5K is a little speculative. Clearly it is possible that ARF3/Rab11-FIP4 regulate N-cadherin trafficking such that loss of the pathway leads to degradation and gain promotes stability, but it is also possible that expression levels are controlled at the level of transcription. This could be assessed by a simple surface labelling experiment in wt, overexpressing and knockdown cells, and/or by analysing localisation of N-cad with respect to ARF3 and late endosomes/lysosomes when ARF3 levels are manipulated. Does ESD knockdown similarly impact N-cad?

      Figure 6: The metastasis experiments are highly relevant to metastatic prostate cancer. Essentially the overall conclusion that high Arf3 (overexpression) suppresses and low Arf3 (knockdown) supports metastasis are well supported by the data, and the wildtype Arf3 levels sit in between (hence trends are observed but aren't statistically significant). Here it would be interesting to compare ARF3 levels in patient tumours with those in wt, overexpressing and knockdown PC3 cells in the mouse model (if sections are available) to give confidence that the overexpression is within the physiological range. If it were also possible to analyse N-cadherin levels in tumours or metastases that would provide an even stronger case for the mechanism proposed.

      Minor comments:

      Figure 2: The classification of phenotypes into groups is interesting, but the trends in some groups (eg Group4, 6 and 7) seem very similar. It wasn't clear to me if these are AI generated? Also, knockdown of individual ARFs is often in different groups- is this a reflection of knockdown level?

      Significance

      The manuscript provides a very significant advance in our understanding of the function of the ARFome with respect to cell morphodynamics. The first two figures and supporting data represent a fantastic resource for the field, and the remaining figures provide new insight into the function of ARF3 in collective cell movement and metastasis in mouse models and patients. Whilst ARF6 and its function in cell migration/invasion/metastasis is well studied, ARF3 has received relatively little attention. This study is therefore of broad interest to the trafficking community, and the new links between ARF3 and invasion/metastasis are broadly of interest to the cell biology and cancer communities. The mechanistic link between N-cadherin and ARF3 is fairly well defined and the fact that high/low levels of both correspond to improved/poor outcomes is a major strength of the study. Expertise: Vesicle trafficking/cell migration/invasion/cancer

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The work of Sako and co-workers tries to employ a single (two) molecule technique in nanodisks to learn about EGFR TM-JM dimerization versus oligomerization. The authors vary the lipid composition in the disk as well as the phosphorylation state of Thr654 which resides in close proximity to several basic amino acids. The outcome suggested by the authors is that Thr654 phosphorylation switches EGFR from a signaling state to a scaffold state. This a tough problem to investigate and single molecule techniques such as FRET might be a successful way of finding answers. I should mention that I am not a single molecule specialist.

      The authors also performed live cell experiments in CHO cells that do not express endogenous EGFR. The results in figure 9 are interesting. However, receptor internalization has not been mentioned.

      In general, the work is premature for publication. My major critique points are 1) that there is a lack of statistical analysis in most of the figures and 2) that the lipid composition lacks highly negatively charged phosphoinositides, which are known to bind EGFR at the membrane interface. Likewise, what do we learn from artificial lipid compositions that lack cholesterol? Even non-phase-separated membrane areas contain cholesterol (maybe less). 3) Receptor internalization is not discussed.

      Major critique points:

      Figure 2. FRET changes after acceptor photobleaching. The excitation and emission wavelengths need to be stated. It is not clear from the traces that there is only a single donor peptide in the disk. Only the traces in the two left panels and the one in the upper right support FRET between Cy3 and Cy5. The "FRET efficiency" is misleading because it suggests that all traces show an increase in donor fluorescence after acceptor bleaching which is clearly not the case. Why is the donor fluorescence dropping? Donor bleaching? Figure 2a supports this. If so, this gives a nonsense FRET readout. There should be an accompanying bar graph with full statistical analysis. There should be control experiments in which two Cy3-tagged EGFR TM-JM molecules are used and photobleaching is still aiming for Cy5.

      Figure 3. There is no statistical analysis. As said for Figure 2: FRET efficiency seems only partially reflecting the single fluorophore traces. A change from 0.9 to 0.95 FRET efficiency seems small and statistical relevance is not determined.

      Figure 4. The result that cholesterol overwrote the Thr-P effect means that phosphorylation is not relevant in the cellular environment. Does this not contradict the main hypothesis?

      Figure 5. It is unclear how fluorescence intensity of CY3-labelled EGFR TM-JM peptides change. Is this due to homo-FRET? Again, a lack of cholesterol is not meaningful.

      Figure 9. I don't understand the results from the EGFR expression control. 30 min after EGF stimulation, the EGFR should have been internalized and destroyed in the lysosome. Why is there more EGFR? Do CHO cells lack the machinery to internalize RTKs? Again, no statistics in Figure 10.

      Significance

      As the test system is very artificial, I don't see major impact. The role of Ser/Thr phosphorylation sites is investigated quite substantially.

      I think the study should be re-designed. The cell work is quite interesting but major factors such as receptor internalization have not been considered.

      Audience: cell biologists

      My expertise: we work on EGFR signaling and EGFR phosphorylation.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The main goal of this paper is to investigate the assembly of the transmembrane (TM)-juxtamembrane (JM) region of EGFR, using single-pair FRET imaging and a nanodisc technique. In particular, the authors address the role of a threonine residue (Thr654) that is phosphorylated after ligand association and is located in the JM domain of EGFR. The authors showed that anionic lipids, cholesterols, and EGFR Thr654 phosphorylation (pT654) regulate the dimerization and/or oligomerization of EGFR.

      Previously, these authors reported that anionic lipids caused the dimerization of JM domains, and that pT654 together with acidic lipids induced the dissociation of the EGFR dimer. In this manuscript, authors show that EGFR dimers assembled in cholesterols vs. anionic lipids display significant conformational changes mainly concerning the positioning of JM and TM. Furthermore, they state that phosphorylation of Thr654 (pT654) promoted oligomerization of TM-JM peptides in cholesterol containing membranes. Finally, the authors show distinct pT654-dependent functional roles and oligomerization states for EGFR in living cells. Authors propose a model in which membrane cholesterol and pT654 work cooperatively to switch the EGFR function from the kinase dimer to the scaffold oligomer

      Major comments:

      •Not clear what Figure 2 is supposed to indicate? Not clear what peptides are being tested. There is an indication in figure legend that photobleaching was performed but it is not clear how, when or why? If the idea is to validate FRET signal and analysis, then maybe photobleaching was performed to obliterate the acceptor dye and unquench the donor dye? If that is the case the authors must explain the experiment and what they are achieving with it. Maybe showing unquenching of the donor would be advised. Also, there is no mention in the text of the photobleaching experiment. If the goal of figure 2 is only to validate FRET analysis, then maybe it should be moved to suppl data.

      •Figure 3-4 address the role of cholesterol and pThr654 on the positioning of TM or JM. TM domains come closer in the presence of pT654 and membrane cholesterol. However, cholesterol but not pTh654 increased the proximity between the C-terminus of JM domains in PC 152 and PC/PS membranes. Although interesting results, the connection between the TM and JM conformational changes and oligomerization results is not direct or even interpretable into a model. The experiment that may be lacking is the testing of the role of cholesterol and pThr654 on the positioning of JM vs TM peptides. These experiments were suggested in Figure 2 but Figure 3-4 test the conformational changes of TM or JM but not of TM vs JM. Actually, Table 1 includes the values for Fig 3-4 FRET measurements which are TM-TM and JM-JM not TM vs JM as suggested in the title. The authors need to clarify this issue.

      •Figure 5: The authors suggest that a cooperative effect of cholesterol and Thr654 phosphorylation induces higher-order assembly of the TM-JM peptides. The relationship between these experiments and that of Figure 3-4 needs to be better described.

      • igure 6-7: Authors state that in all conditions other than non-phosphorylated peptides in the PC/PS membrane, Cy3 distributions after Cy5 photobleaching (red) peaked at the fluorescence intensity of ~100. However, the absence of statistical analysis of the comparison between different distributions makes this figure difficult to interpret and analyze. If not possible then the authors should stress that the results interpretation is based solely on a qualitative analysis of the curves.

      •Figure 8 model in which PS and cholesterol are proposed to exert competitive effects on dimerization and oligomerization, while pT654 disrupts the PS-induced JM dimer, and promotes oligomerization of the peptides, is not readily supported by the presented data.

      •Regarding the experiments of EGFR activation/phosphorylation using pT654 and pY1068 in Figure 9.

      -Blots should be clearly labeled (mainly pEGFR versus total EGFR and pT654 versus pY1068). Can authors clarify why in panel a only is showed 0 and 30 min (in pY1068 there is 3 time points) and why two main bands (one about 160 kDa and another about 210 kDa) are observed in all phospho or total EGFR immunoblots?

      -In panel a, it would be expected a stronger pT654 signal upon EGF in wt PMA(-)? Additionally, here the total EGFR blot is missing.

      -In panel b, it is important to clarify what band(s) are being quantified and statistical analyses (t-test or ANOVA for example) should be provided. Also, the graph indicates an about 2-fold increase in pY1068 signal at time 0 when wt PMA(-) is compared with pT654A PMA(-). This is apparently given by low total EGFR in pT654A condition, perhaps lower levels of transfection? it would be also a plus if it is possible to have beta actin or other loading control for blots or mention lower levels of transfection if it is the case.

      -Figure 9 presentation and analysis should be improved since it is an important part of main conclusions of the present work. Did authors considered to analyze also the effect a phosphomimetic in T654?

      •Figure 10b:

      -this reviewer cannot detect any difference between conditions/experiments, if there is something going on, authors need to find a better way to visualize the results and use statistical analysis to determine significance. Considering the lack of statistical analysis, Figure 10d overinterprets results since 10a-c do not provide strong supporting claims.

      -In methods authors mention that fluorescence is measured at the basal plasma membrane. However, it is not clear why at the condition pma- egf- the signal of egfr-gfp is so weak. Also, it not very clear why convexness indicates high immobilization.

      The following experiments and claims are over-interpreted or speculative. The reviewer would suggest limiting the conclusions extracted from these experiments. Moreover, Discussion is too long, convoluted, and speculative.

      •Oligomerization experiments are not convincing, and they are overinterpreted

      •homo-FRET (self-quenching) between two N-terminal labeled Cy3 peptides are speculative and not clear how to validate and demonstrate the presence of home-FRET. Maybe fluorescence-anisotropy-based homo-FRET detection could be included.

      Minor comments:

      •This sentence "Fractions of PS 97 (inner leaflet) and cholesterol (inner and outer leaflets) mimicked those in the plasma membrane" should be referenced

      •Figure 2a: the authors should explain what is the meaning of 5 seconds between the two images?

      •Figure 3 legend should include what type of peptide was used. According to the text the N-terminal regions of the TM domains were used in Fig. 3. However, that is not included in Fig 3 figure legend.

      •Figure 3-7 need the distribution and the 95% percentile section as in Figure 1 for statistical analysis of FRET measurements

      •Figures 9 and 10 should include statistical analysis to demonstrate significance

      Significance

      It should be mentioned that a significant amount of work included in this manuscript was already extensively covered in previous publications of the group. In particular, this paper follows the previous one "Lipid-Protein Interplay in Dimerization of Juxtamembrane Domains of Epidermal Growth Factor Receptor" (Biophys J. 2018 Feb 27; 114(4): 893-903), where authors evaluate the effects of anionic lipids on EGFR dimerization. The authors cite this previous work in the manuscript, but nevertheless the first figures of the present manuscript are very similar to the previous work mentioned. The only difference being that the authors now are adding cholesterol as a new variable. Moreover, text throughout the main part of the manuscript (especially description of Results and Introduction) are very similar to the previous work published. Overall, this similarity between results and approach, decrease the significance and novelty of the present work.

      Maeda et al., describes an advance regarding the role of cholesterol in EGFR molecular assembly. This is compared here to the previous findings about effect of phosphorylation in EGFR Thr654 residue and anionic lipids in EGFR molecular assembly. Since EGFR is a widely studied receptor both in health and disease, especially in cancer, the mechanisms of EGFR activation and assembly are of high interest both in basic and clinical research. It would be interesting if the authors of the present work could contextualize the physiological relevance of the findings and how for instance depletion or increase in cholesterol physiologically could impact in EGFR related functions.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the manuscript titled "Threonine phosphorylation regulates the molecular assembly and signaling of EGFR in cooperation with membrane lipids", Maeda et al. investigate the associations of the EGF receptor, in particular the short TM-JM linker peptide that connects the extracellular to the intracellular domain and couples ligand binding to kinase activity. A particular focus lies on the influence of cholesterol and the phosphorylation of T654 on this process. Based on single-molecule FRET and intensity measurements in lipid nanodiscs, biochemical activity assays and analysis of EGFR cluster formation in live cell, they conclude that phosphorylation of T654 promotes clustering through dissociation of anti-parallel JM dimers and cholesterol supports the clustering process. In particular, they challenge the view that clusters contain kinase-active EGFR dimers, but rather are tight assemblies of EGFR in form with reduces kinase activity. This is a very interesting story and could explain observations made by the authors and others, e.g. decreased activity of EGFR after initial clustering. The experiments are convincing, but the authors should revisit the interpretation of their results; in particular, they should consider the possibilities (i) that nanodiscs might have different sizes depending on the lipid composition, (ii) that a bias towards repeated interactions is introduced when selecting nanodiscs containing three peptide molecules. Also, the normalization to basal levels of EGFR expression and activity in the Western blot and luminescence assay should be treated more carefully. Nevertheless, it is a good story that would be a good contribution to the EGFR field.

      Major comments:

      1.Line 93: the authors write that with cholesterol, there were two peaks. But there are three peaks (Fig. 1e, red line). I understand the following sentence that they used the peak at 7-8mL for their experiments and the analysis in Fig. 1f. They then write that this fraction had a similar size without cholesterol. But I see only one peak without cholesterol (at 11-13mL), and it shows a bigger size, as they also wrote before. Or do they refer to the very small bump in the black line at 7-8mL?

      2.Figure 6: If the density of proteins in the lipid is low (see previous comment), the selection of nanodiscs with two Cy3 and one Cy5 introduces a bias, in particular in the conditions where there is no higher-order assembly occurring (i.e. with non-phosphorylated T654). Since the three proteins are then restricted to a very small membrane area, they are experiencing a high apparent density, and the equilibrium between assembled and dissociated 3-protein clusters will be shifted to the trimer.

      3.Figure 6: In connection to the previous comment, the previous comment #1 becomes very critical. If I understand the red line (PC/PS+chol) and black line (PC/PS) of Fig. 1 correctly, the size of the nanodiscs is bigger for PC/PS than for PC/PS+cholesterol. If this is the case, it will have an impact on the apparent density of the three protein molecules in the nanodisc, since a larger nanodisc means lower membrane density or lower encounter probability. Also, a quick MC simulation shows that the average distance of two randomly positioned proteins in a disc is 0.9*radius, which would be 5nm for the 11nm nanodiscs, and therefore within the Forster radius of Cy3 and Cy5 (5.3nm). With larger nanodiscs, this would change and less FRET would occur in the dissociated state. Therefore, the size of the PC only and PC/PS nanodiscs should be measured similar to the PC/PS+cholesterol. The effects of nanodisc size on apparent density and on FRET in the dissociated state should be considered.

      4.Fig. 9b: It seems that the T654A mutation increases EGFR expression since the EGFR-normalized levels are higher in wt, but lower in T654A. Was this effect directly quantified, e.g. by comparing to a standard protein? How can this be explained? The authors claim that pY1068 levels were higher for T654A than for WT. Although this is correct, the levels normalized to total EGFR were lower for T654A than for WT. This is somehow contradictory and should be resolved.

      5.Is it possible that basal Grb2 recruitment is different for WT and T654A, and therefore the baseline for the luminescence measurement is different? This should be carefully investigated, since Fig. 9b suggests that T654A has a critical impact on EGFR expression in its inactive state, and therefore can be assumed to influence basal EGFR activity.

      6.Line 401: The claim that pT654 promotes downstream signal transduction is based on the apparently increased EGF-induced Grb2 recruitment in the presence of PMA (which leads to T654 phosphorylation) in the EGFR wt but not in the T654A mutant. This claim might not hold when considering that maybe basal levels of Grb2 interaction were different under the different conditions. Then the apparent increase would only be caused by the normalization to the basal luminescence level (see comment #5).

      Minor comments:

      1.Line 66: the first work should probably be "although" instead "also".

      2.Line 101: it is unclear what "16 types of nanodiscs" mean. Were there more lipid compositions than the four compositions in Fig. 1d investigated? Or was it always nanodiscs prepared with the same lipid composition (and same fraction) but 16 different proteins or combinations of proteins?

      3.Fig. 3-7 (all FRET and intensity distribution diagrams): An error for a each bin should be calculated or estimated so it is easier to assess if a bump in the line is more than random, and if the difference between two distributions is significant.

      4.Line 128: "PS competed with cholesterol when T654 was non-phosphorylated (Fig. 3d)." What does compete exactly mean here and how is this deduced from the data?

      5.Line 139: "These results confirmed the results of our previous study". The authors could clarify this by stating what the conclusion was/is (T654 phosphorylation dissociates dimers when acidic lipids are present).

      6.Figure 5: adding the conditions into the panels would clarify the figure.

      7.Line 167: In how far can be assumed that co-localization of multiple peptides in the same nanodisc is caused by interaction? Could they not be incorporated by coincidence, because of a high density? What is the probability to obtain two non-interacting proteins in the same nanodisc?

      8.Figure 6: I cannot find the details to determine pre- and post-bleaching intensities. The authors should state in the methods the time they include to determine the average value on each side of the bleaching event. The same for all values where intensities or FRET efficiencies were calculated by averaging over multiple points of the time course.

      9.Line 177: If the low intensity for N-terminally labeled peptides was caused by homo-FRET, there should be an increase of intensity after the first Cy3 bleached. Did the authors observe that? Also, when the trajectories are getting complicated by homo-FRET during dissociation/association, the selection of nanodiscs with two Cy3 might become ambiguous. What were the exact selection criteria? There is no information in the text or methods.

      10.Figures 6a/7a: The authors should double check the origin of the data. The panels in the bottom left of Fig. 6a and the bottom left of Fig. 7a seem to contain identical data.

      11.Figure 7b-e: Although there are differences in the intensity histograms, I find them to be quite subtle, compared to the effect of pT654. Therefore, I find the claims made in lines 224-229 (induction of oligomerization by cholesterol, and oligomerization instead of dimer by pT654) too strong. Based only on the visual assessment of the intensity distributions, I would rather suggest a shift towards one or the other condition. A model calculation with assumptions for FRET values and distances for certain conditions would strengthen their claims.

      12.Figure 8: The different configurations and why they are induced in certain condition needs to better explained. In the main text, there is only one sentence (lines 224-229), and in the figure legend, there is no explanation at all.

      13.Line 230: "...whether the peptides in the oligomers directly interacted or not." Since there are no other proteins involved except the peptides, I am wondering how an indirect interaction in the oligomer would work?

      14.Fig. 10b: It is surprising that the authors observe a large fraction of oligomers for EGFR WT without EGF or PMA, since EGFR WT is thought to be primarily monomeric in its inactive state. This should be discussed.

      15.Fig. 10b: The distributions might better reflect the process of oligomerization if the fraction is multiplied by the size of the oligomer. This would show reflect how many receptors are clustered, instead of how many clusters are present (one large cluster should make the same fraction as the sum of two smaller clusters half the size).

      16.Fig. 10a: The authors should present high quality movies of the observations to allow the reader to appreciate the clustering/immobilization better and also to facilitate the comparison of raw data for other researchers in the field.

      17.Many experiments in this work assume that the intensity of a fluorophore remains constant while it remains in a specific monomer or dimer state. This should be confirmed with nanodiscs containing positive and negative control proteins for dimerization. E.g. a monomeric protein should give a constant intensity, and a constitutively dimeric protein should give two intensity levels, one before and one after photobleaching of the first dye molecule. Also, FRET should remain quite constant over time in a constitutive dimer labeled with Cy3 and Cy5.

      18.It is questionable in how far the postulated trimer formation will happen if the extracellular and kinase domains are present in the full EGFR, since these domains are quite bulky and might not allow certain trimer configurations, e.g. the close trimer of the JM domain. This should be discussed.

      Significance

      The prevalent view that EGFR clusters contain kinase-active dimers is challenged. The authors claim that instead, tight assemblies of EGFR in form with reduces kinase activity after the initial activation phase. Based on the novel lipid nanodisc FRET experiments, different configurations of the TM-JM domain can be discriminated in the membrane environment. This study is a good contribution to the EGFR field and will be interesting also for people in the single-molecule imaging field. The reviewer has experience in single-molecule imaging and has also worked on the EGFR.

      Referess cross-commenting

      I want to add two comments:

      1. For reviewer #2 's first major comment on the photobleaching: From my point of view as a single-molecule researcher, this experiment is quite clear. Photobleaching is a process that inevitably occurs when using high intensity, as required for single-molecule imaging; therefore, it was not "performed", but (unfortunately) occurs. Only the data before the photobleaching event are useful. But I agree with the reviewer that Fig. 2 could be moved to the supplement, or, in a condensed form, could be merged, e.g. with Fig. 3 and Fig. 4.

      2. Concerning reviewer #2 's significance statement: I was not aware of the fact that the nanodisc approach has been used and published by the same authors before. They should have clearly indicated this in a written form, e.g. by saying that they used a previously developed nanodisc FRET assay. Also there is no reference to the previous work in the first results section or the nanodisc section of the methods. This diminishes the significance and novelty of the work considerably in my opinion. Also, as the reviewer correctly observes, in the text of this manuscript there are a number of similarities to the mentioned publication which also reduces the originality of this work.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Lemonidis et al perform a structural and biochemical investigation of FANCI-Ub/FANCD2 complex. The key findings are that FANCD2-Ub is more rapidly deubiquitinated that FANCI-Ub by USP1:UAF1, particularly in the context of the full FANCD2-Ub:FANCI-Ub complex. They present a structure of FANCI-Ub/FANCD2 and show key features that distinguish it from the other FA core complex structures. Several regions are unstructured in their model, which the authors propose is because of the absence of ubiquitin on FANCD2. The exposure of Ubiquitinated lysine of FANCD2 is suggested to provide a mechanism for the rapid reubiquitination of this site when bound to FANCI-Ub, and they also show this biochemically as a faster ubiquitination of FANCD2 when bound to FANCI-Ub instead of FANCI-Ub. A very neat conclusion is made about the dual role of FANCI-Ub in maintaining FANCD2-Ub in a DNA clamped state.

      Major comments:

      No major issues identified. The data supports the conclusions. Some mutagenesis studies that investigate the hypotheses generated by the structure would increase the overall impact of the study.

      Minor comments:

      The role of FANCI-Ub and DNA binding in protecting both FANCI and FANCD2 from deubiquitination was first shown in van Twest et al 2017 Molecular Cell, however this is not referred to in the manuscript.

      USP1:UAF1 has been shown to have DNA binding activity (eg Liang et al 2016 Cell Reports). Could this DNA binding activity potentially reduce its activity on I-Ub-D2-Ub in the presence of DNA (ie through sequestration away from the substrate by binding to other DNA?). This could be tested by showing that titrating the amount of DNA in the reaction does not inhibit deubiquitination. Also, one previous study showed that DNA stimulates deubiquitination of FANCD2. This study used complexes where FANCI was not ubiquitinated, but is a different result to what is seen in Figure 4A (but perhaps because deubiquitination is already 100% in the absence of DNA in these experiments). Some investigation/discussion of this discrepancy would be good to include. Some further discussions speculating why a FANCI-Ub is necessary in the FA pathway (ie why not just have FANCD2-Ub as the clamping mechanism given that it is sufficient to lock the protein onto DNA?) would increase the interpretation of the work by non-specialists.

      Significance

      The work provides a significant advance in our understanding of the mechanism by which di-monoubiquitinated FANCD2:FANCI clamps the complex onto DNA. The first structures of I-UbD2 are presented, and some evidence for why they exist in cells (as an intermediate state of deubiquitination of the complex). Previous studies already showed that FANCI-Ub and DNA binding prevent the deubiquitination activity (eg van Twest et al 2017, Arkinson et al 2018, Rennie et al 2020, Wang et al 2020) of USP1:UAF1.

      The work has clinical relevance - USP1:UAF1 inhibitors and FA core complex inhibitors are being developed by several biotech for cancer therapy (this is not referenced in the manuscript, and could perhaps be included in the introduction?). Researchers in several fields will be interested in the work: those working on FA pathway and mechanistic understanding of FANCD2 monoubiquitination, as well as those working more generally in ubiquitination and deubiquitination pathways. The structures included "complete the set" for all states of FANCD2:FANCI ubiquitination states, and this is a highly cited field. My expertise is in understanding the mechanistic basis of FANCD2 and FANCI ubiquitination and deubiquitination so I am well suited to evaluate all aspects of the proposal.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by Lemonidis et al describes a structural and biochemical basis for the interdependency of FANCI-FANCD2 monoubiquitylation. Continuing their landmark work in the FANCI-FANCD2-Ub Cryo-EM structure, this study shows that the homeostasis of FANCI-Ub-FANCD2-Ub (ID2-diUb) can be partly explained by accessibility differences of the ubiquitylated forms of FANCI and FANCD2 complex. Using structure and enzymatic assays, the authors show that FANCI-Ub is not as susceptible for deubiquitylation by USP1-UAF1 deubiquitinase (DUB) when in complex with FANCD2 and DNA. In contrast, FANCD2 deubiquitiylation is more efficient when FANCI is unmodified, and more protected when it is in complex with FANCI-Ub and DNA. Interestingly, FANCI ubiquitylation progresses at a faster rate when in complex with FANCD2-Ub and DNA. Even though this is somewhat an incremental study, nevertheless, the mechanisms of USP1 can or cannot remove ubiquitylated FANCI-FANCD2-DNA complex is intriguing and opens new areas of study to understand whether several different states of modified and unmodified FANCI-FANCD2 exists physiologically or not and what functional differences they may have.

      The major limitions of this study is the lack of functional connection to the described mechanisms of deubiquitylation or reubiquitylation of FANCI-FANCD2 complexes. In the future, hopefully, this group or others will generate point mutants that can selectively separate the functions of FANCI or FANCD2 deubiquitylation or FANCI or FANCD2 reubiquitylation to show the importance of dynamic regulation of these processes in a cellular or in vivo context.

      A major comment is whether the authors can by structural inference figure out what part of USP1 N-terminal extension is required for interaction and recognition of FANCI-FANCD2-Ub-DNA complexes. This is the only part of the study that wasn't very satisfying. Perhaps the authors can address this with some modeling and/or with additional mutational analyses on USP1.

      One minor comment is that the authors only cite the Smorgorzewska et al, Cell, 2007 paper for observing the first interdependency of FANCI-FANCD2 ubiquitylation in cells (Citations in the Intro and Discussion sections). The authors should also cite Sims et al NSMB, 2007 for the exact same observation shown in the two papers just to be consistent.

      Significance

      Significance of the study is as indicated above.

      The audience is the Fanconi Anemia and ubiquitin field.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Lemonidis et al concentrate on the role of FANCI ubiquitination in the ubiquitination and deubiquitination of the FANCD2 in the FANCI-FANCD2 complex. They work with purified FANCI and FANCD2 proteins that were in vitro ubiquitinated and with purified USP1-UAF1, allowing them to test the kinetics of the ubiquitination and deubiquitination reactions under different conditions. They also show a cryo EM structure of IUbD2 at a resolution of 4.1Å.

      Using biochemical assays, the authors show that ubiquitination of one of the subunits (FANCI or FANCD2) enhances the ubiquitination of the other. D2Ub can be de-ubiquitinated with high efficiency, with no protection by DNA and FANCI. However, when FANCI is ubiquitinated, FANCD2 de-ubiquitination is decreased. It is also more easily re-ubiquinated. The efficiency of de-ubiquitination of FANCIUb is strongly decreases when complexed to DNA and FANCD2, and it is independent of the FANCD2 ubiquitination. The structure shows that the IUbD2, like IUbD2Ub and ID2 Ub, is clamped on the DNA, but the FANCD2K561 site in exposed explaining how re-ubiquitination of the FANCD2 is facilitated.

      Major comments:

      The key conclusions are overall convincing. The IUbD2 structure shows a conformation in which the FANCD2 lysine is more easily accessible than in ID2 (3B). Biochemical assays support the claim that the ubiquitination of FANCI and FANCD2 promote the ubiquitination of their opposing subunit in ID2 (FANCD2, and FANCI, respectively). Furthermore, robust biochemical assays and structures in Figure 4 support the claim that the ubiquitin on FANCI is relatively more protected than the ubiquitin on FANCD2, of which the latter can be de-ubiquitinated with high efficiency.

      1. The major caveat is that these structures were generated using "engineered Ube2T and SpyCatcher-SpyTag", and not via the FA core complex. Recent structural work by the Passmore and Pavletich labs shows the intricate process of ID2 complex ubiquitination, and propose a sequential ubiquitination model that is tied closely to the unique structural properties of the FA core and the ID2 interface. Similarly, the biochemistry is done in the absence of the core complex which may change the kinetics of the ubiquitination and deubiquitination. So, although the biochemical experiments performed are robust, these caveats need to be explicitly noted in the manuscript.
      2. Ideally, the authors should also solve cryo EM structure of IUbD2Ub and compare this to 6VAE to show that their in vitro ID2 assembly leads to structures aligning with the FA-core complex ubiquitinated form of ID2.
      3. I would suggest including ubiquitinated FANCD2 in the PIFE experiments in Figure 2, to assess differential binding affinity of I and IUb. This would help serve as a control that FANCD2 ubiquitination also strengthens the ID2 complex binding the DNA.
      4. The authors claim that "ubiquitination of either of the two subunits of ID2 (FANCI or FANCD2) actually favors ubiquitination of the other subunit" (supported by biochemical and structural data in Figure 3). In Figure 3A-B, it is difficult to draw a conclusion on the "accessibility of the lysine residues" from these figures. Authors should include a measurement of the distances between the Lysines and the NTD of the opposite subunit. This would add a numeric measure to "accessibility".
      5. Wang et al, 2020 show that the FANCI K523 is only accessible for ubiquitination upon prior FANCD2 ubiquitination. However, the IUbD2 state could be formed by FANCD2-deubiquitination of IUbD2Ub. Figure 5 may need to changed to remove the arrow ID2 state to IUbD2.
      6. In Figure 3B, the structure of IUbD2Ub should be included to allow a proper comparison of the Lysine accessibility of IUbD2 and ID2Ub versus IUbD2Ub.

      Minor comments:

      1. Please include specific method of expressing, purifying and ubiquitinating FANCI and FANCD2. This method doesn't become clear by referencing previous papers, and these methods might have changed.
      2. The final paragraph needs to be reworded. The following sentence is very confusing "Of these two interfaces, one is absent in equivalent ubiquitin-FANCI interactions in ID2Ub-DNA structure (Wang et al, 2020)(Fig. 4D-E)." It does not come across that ubiquitin of FANCIUb lacks an interface with FANCD2.
      3. "Our structure indicates that this ubiquitin-FANCD2 interface ..." probably should be "Our structure indicates that the interface between FANCD2 and the ubiquitin of FANCIUb ..."
      4. I would suggest adding additional supplemental figures of Figure 3B including 1 extra angle for each structure. This would help gauge "Lysine accessibility".
      5. Please add PDB images in Figure 4D from the IUbD2Ub structure of both the interface between FANCD2 and ubiquitin of FANCIUb, and FANCI and the ubiquitin of FANCD2Ub. This is important to allow for the comparison of the different ubiquitin interfaces amongst the IUbD2Ub, ID2Ub, and IUbD2 structures

      References:

      Previous in vitro and in vivo data well referenced in introduction and discussion.

      Statistics:

      Satisfactory

      Comments on time and resources:

      Solving the cryo EM structure of IUbD2Ub is not a trivial ask, but the authors may already have it or may be inclined to do it since they already have access to purified IUb D2Ub and have optimized cryo EM conditions for IUbD2 in this paper. The remaining suggestions seems to be minor (repeat of biochemical experiment, measurement of pre-existing data, and minor changes to figures).

      Significance

      This work fits very well with the recent structural advances describing ubiquitination and de-ubiquitination of the Fanconi ID2 complex: Wang et al. 2020 and 2021, Alcon et al. 2020. These papers were groundbreaking in describing the mechanism in which FANCD2 and FANCI are ubiquitinated by the FA core complex. The same group (Rennie et al. 2021) solved a structure of ID2 bound to USP1-UAF1 which was also important for understanding of the Fanconi anemia pathway function. The current manuscript sheds light on the role of the Ubiquitinated FANCI in the function of the FANCID2 dimer.

      The manuscript will be of interest to the DNA repair, genome integrity, hereditary diseases and ubiquitin fields. This review was written by an investigator in the genome integrity and Fanconi anemia field.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      To reviewer #1

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): Please see combined review below in the next section, Reviewer #1 (Significance (Required)):

      This is a descriptive manuscript providing a few new insights into a well-recognized and biologically important phenomenon - the lymphatic endothelial cells have heterogeneous origins in different organs. Overall, the idea of Islet1 lineage contributes to regional lymphatic vessel formation during a particular developmental stage is exciting and proven with detailed and careful lineage tracing. The first observation that Islet1 lineage gives rise to cardiac lymphatic vessels was published by the same group in Dev Bio in 2019 so the novelty here is dampened, although the pharyngeal lymphatics and the exact time of these non-venous origin lymphatic vessels arise were not previously characterized - so the current manuscript does provide new important insights. Both the data quality and manuscript layout need improvements, especially when it comes to defining where Islet1 is expressed at all the stages and statistics. The following suggestions will deepen the scope of the manuscript:

      First of all, we would like to express our appreciation to the reviewer for all the constructive comments. We carefully read the reviewer’s comments and discussed it. We agree with the reviewer that our manuscript needs improvements with changes in layout several additional experiments. We have also included several description and new immunostaining data (e.g., Isl1,VEGFR3 and LYVE1 co-staining), to confirm our new findings and highlight the importance of the current manuscript beyond our previous one in Dev Biol in 2019. We also have included detailed quantification methods, single-channel images with improved data resolution, and improved clarity of the manuscript.

      Specific points were addressed as follows:

      COMMENTS BY THE REVIEWER

      1. Whether Isl1 lineage is independent of venous-derived endothelial cells.

      2. This point is very important: the manuscript does not actually show Isl1 expression through the stages they are inducing. I would want to be sure that lymphatic endothelial cells at this stage don't express Isl1. Another way to get at this is to maybe use other second-heart fields or even broader mesoderm drivers that are known to be never expressed in endothelial cells to confirm the findings.

      Response:

      In our previous work (Maruyama et al, Dev Biol 452:134–143, 2019, Figure 2C), we demonstrated that Isl1+ lineages using Isl1-Cre mice did not contribute to endothelial cells in the cardinal vein and its branches (intersomitic vessels: ISVs), which had been thought to the primary and biggest sources of lymphatic endothelial cells (LECs). In this paper, we confirmed this finding using Isl1-MerCreMer mice with tamoxifen treatment at E8.5 (Figure 4J). We have scanned whole embryos and detected no eYFP+ cells in the cardinal vein or ISVs (the detailed quantification methods have been added in Methods section). Consistently, another group (Lioux et al., Dev Cell 52:350-363, 2020) re-evaluated this point using Isl1-Cre mice that the Isl1+ lineage contribute to endothelial cells of the cardinal vein only by less than 2%, which neither explains the abundant contribution of the Isl1+ lineage to coronary lymphatics (>50%) nor its restriction to the ventral heart. Based on these reports, we supposed that the Isl1+ lineage was independent of LECs derived from the cardinal vein and ISVs.

      In the revised manuscript, we added new data showing thorough expression patterns of Isl1, Prox1, Flk1, and PECAM in the E9.0 to E11.5 pharyngeal arches and cardinal veins by immunostaining and presented them as Supplemental Figure 4. In these sections, we detected Isl1 and Prox1 expression with partial overlapping within the pharyngeal mesodermal core, whereas Isl1 was co-expressed with Flk1, or PECAM neither in vessel-like structures around the mesodermal core nor in the cardinal vein and their surrounding Prox1+/PECAM+ LECs (Supplemental Figure 4H’ and J’) confirming the independency. These findings have been described in the manuscript as follows:

      To identify possible Isl1+ LEC progenitors, we investigated the expression patterns of Isl1, Prox1, and vascular endothelial markers (Flk1 and PECAM) by immunostaining sections of E9.0 to E11.5 pharyngeal arches and cardinal veins. Consistent with the previous report (Cai et al., 2003), Isl1 was abundantly expressed in the core mesoderm of the first and second pharyngeal arches corresponding to the CPM from E9.0 to E11.5 (Nathan et al., 2008), where Prox1+ cells also aggregated and partially overlapped with Isl1 signals (Supplemental Figure 4A, A’ C, C’ E, E’ G, G’ I, I’). By contrast, Flk1+ or PECAM+ cells were distributed mainly around the CPM and not expressed Isl1 (Supplemental Figure 4A, A’ C, C’ E, E’ G, G’ I, I’). Furthermore, Isl1 was expressed neither in the endothelial layer of the cardinal vein nor in surrounding Prox1+/PECAM+ LECs (Supplemental Figure 4B, B’ D, D’, F, F’, H, H’, J, and J’). Taken together with the result from Myf5-CreERT2 mice, these results indicate that Isl1+ non-myogenic CPM cells may serve as LEC progenitors independent of venous-derived LECs and the commitment to LEC differentiation occurs before E9.5 in the pharyngeal arch region. (Page 6-7, lines 187-200)

      Regarding the use of other second-heart field drivers as the reviewer recommended,

      Lioux et al., have already shown the contribution of Mef2c-AHF+ , which marked CPM-derived cells including second heart field, cranial musculatures and connective tissues(Adachi et al., 2020), to ventral cardiac lymphatics. We are also trying to introduce Mef2c-AHF-Cre mice, but it is unfortunately delayed due to the pandemic of COVID-19.

      The author stated that Islet1 lineage gives rise to lymphatic endothelial cells via the Tie2 mechanism but did not elaborate on this part. What is the potential relationship between Islet1 and Tie2? Or Tie2 just serves as a pan-endothelial lineage marker here?

      Response:

      To clearly demonstrate the relationship between Isl1+ and Tie2+ lineages in facial lymphatics, we added schematic representation in Figure 6G, which showed the differential Tie2 expression in lymphatic vessels in the tongue and facial skin.

      Related to point 1 and 2, it has been thought that almost all LECs are formed from cardinal vein-derived Tie2+ endothelial cells. However, we identified the presence of Isl1+/Tie2+ LECs in the tongue, which are apparently not originated from the cardinal vein. In previous reports using Tie2-GFP mice or in situ hybridization of Tie2, Tie2 was not detected in the developing LECs at E9.5, 11.5, 13.5, and E15.5(Motoike et al., 2000; Srinivasan et al., 2007). In adult mice, Tie2 expression in lymphatics was only observed in restricted regions (Morisada et al., 2005; Tammela et al., 2005). Taken together with our present data that the differentiation fate of Isl1+ CPM-derived LECs was determined between E6.5 and E9.5 (Figure 2-4, Supplemental Figure 3), Tie2 is supposed to be transiently expressed during LEC differentiation in the tongue from early Isl1+ CPM cells, although it remains difficult to identify the Tie2-expressing stage during non-venous LEC differentiation.

      It will be an important future subject to identify the stage and implication of transient Tie2 expression in the lineage and, in this paper, we want to just note that the Tie2+ lineage does not always mean the derivation from cardinal vein endothelial cells.

      This point has already been included in the manuscript as follows:

      The present study further indicates that the LECs in the tongue are derived from Tie2-expressing cells among the Isl1+ lineage. Although it is unclear whether Isl1+-derived cells at the Tie2-expressing stage represent a venous endothelial identity, this result means that Tie2+ LECs are not equivalent to cardinal vein-derived LECs. (Page 10, 298-301)

      The effect of lineage-specific Prox1 knockout is very descriptive, without any discussion of the potential biological function of such cellular origin heterogeneity. This part may be worth a few follow-up experiments in later embryonic stages or even in postnatal stages. The authors demonstrated that loss of Prox1 in Islet1 lineage decreases the number of lymphatic vessels and leads to lymphangiectasia, but whether this phenotype can be later compensated or shows any clinical impact was not proven. Therefore, the statement made in line 206 is questionable, and whether Islet1 lineage-derived lymphatic endothelial cells are dispensable/indispensable remains unclear.

      Response:

      We agree with the reviewer in that additional follow up experiments using later embryonic or postnatal stages will give an insight into the potential biological function of cellular origin heterogeneity. We are generating lineage-specific Prox1 knockout mice by treating Isl1-MerCreMer; Prox1fl/fl mice with tamoxifen at E8.5 to analyze phenotypes in the facial lymphatic vessels.

      The layout of the manuscript needs to be reorganized: 1. Details in statistical methods and quantification logic were completely missing from the manuscript. For example, definitions of "a sample" (how many sections are taken from one biological sample and how many fields take from one section, etc.), "number of vessels per field", "diameters", and of what parameters the numbers were normalized to, etc. need to be described in the materials and methods section. For instance, it is not clear how "tomato+ lymphatic vessels per field/Vegfr3+ lymphatic vessels" was defined. First, what proportion of tomato+ cells need to colocalize with Vegfr3 expression cells in a specific vessel to make this vessel being determined as a "tomato+ lymphatic vessel"? Most data provided here are section immunostaining where "multiple vessels" are very likely coming from different cross-sections of one same vessel in the same field. Second, Vegfr3 can stain venous endothelial cells in earlier stages so the specificity of this marker can be controversial. These are some important technical aspects to include in the revised version. Figures needing more description in quantification methods include but are not limited to Fig 1H, 2H, 2P, 5K-R.

      We have revised the statistical methods from the ratio of the count of the number to ratio of the area of lymphatic vessels in Figure 1H, 2H, P, and Supplemental Figure 3I to represent more precisely the contribution of Tomato+ cells to lymphatic vessels. We also added more detailed description of the quantification methods in ‘Materials and Methods’ section, as follows:

      Quantification of the section and whole mount images

      For the quantification of section immunostaining at E16.5 embryos, the average of two 16-μm-thick sections taken every 50 μm and 10x power field of views (0.42 mm2/field) for each anatomical part (the larynx, the skin of the lower jaw, the tongue, and the cardiac outflow tracts) were subjected to the analyses. In the facial skin, lymphatic vessels in superficial layers of dermis were subjected to the analyses. The middle sagittal sections, including the aorta, larynx, and tongue, which were selected as hall marks of midline, was chosen from created sections. The coronal sections, including both eyes, tongue, and olfactory lobes with left and right symmetrical features, was selected. For E12.5 embryos (Figure 4O), two 16-μm-thick sagittal sections taken every 50 μm, including the 1st and 2nd pharyngeal arches and outflow tracts, were subjected to analyses. The area and the number of cells were measured manually using ImageJ software. For the whole mount immunostaining of embryos and the heart, the whole samples were scanned every 20 μm and confirmed eYFP contribution to LECs (Figure 3) and cardinal veins (Figure 4J, and Supplemental Figure 4B, D, F, H, J). (Page 13, lines 402-416)

      We also have tested expression patterns of VEGFR3 with Prox1 or LYVE1 as Supplemental Figure 1. At E14.5, VEGFR3 was widely co-expressed with Prox1 in the tongue, facial skin, and around the pulmonary artery (Supplemental Figure 1A-C’). At E16.5, VEGFR3 was co-expressed with LYVE1 in the tongue, facial skin, and around the pulmonary artery. Thus, we thought that VEGFR3 could be used as a marker of LECs in these cardiopharyngeal region.

      This point has been included in the manuscript as follows:

      Co-immunostaining of platelet endothelial cell adhesion molecule (PECAM) and vascular endothelial growth factor receptor 3 (VEGFR3), which we confirmed its co-localization with lymphatic vessel endothelial hyaluronan receptor 1 (LYVE1) at E14.5 and E16.5 (Supplemental Figure 1), revealed tdTomato+ LECs in and around the larynx, the skin of the lower jaw, the tongue, and the cardiac outflow tracts, at various frequencies, whereas no such cells were found on the dorsal side of the ventricles, which agrees with our previous study (Maruyama et al., 2019). (Page 4, lines 112-119)

      Data resolution needs to be improved. The magnification of the figures in Fig 1-4 is not sufficient to demonstrate the marker colocalization as described in the texts. Single-channel images (such as the ones shown in Fig 5-6 but in higher magnifications) are also necessary to show the co-expression of markers.

      Response:

      There was a limit on the data capacity when submitting the manuscript. We were therefore obliged to reduce the quality of images and file size. We have revised the figures to add several higher magnification and single-channel images with improved data resolution throughout Figure 1-4.

      The experimental design is not well-elaborated in the context. For example, the scientific logic of choosing a particular time point/stage for lineage-knockout induction or sample collection needs to be justified. Also, it seems that the authors are using fl/+ as control littermates in most of the experiments. Any specific reason favors using fl/+ heterozygous instead of fl/fl littermates without cre exposure, which is the more commonly used control sample in this kind of comparison, should be addressed.

      Response:

      Knockdown of Prox1 in the Tie2+ lineage has shown to cause an initial failure in specification of LECs at E14.5 with no appearance of lymphatics even at E17.5(Klotz et al., 2015; Lioux et al., 2020; Maruyama et al., 2019), indicating that the effect on lymphatic vessels would not be compensated even at E16.5. In addition, the systemic lymphatic network formation is almost completed at E16.5(Srinivasan et al., 2007), and the lineage trace was also evaluated at this stage. Thus, it was reasonable to compare the phenotype at E16.5.

      This point has been addressed in the text as follows:

      When Prox1 is knocked down in the Tie2+ lineage, an initial failure in specification of LECs was confirmed at E14.5 with a lack of LECs even at E17.5(Klotz et al., 2015; Lioux et al., 2020; Maruyama et al., 2019). Therefore, we compared lymphatic vessel phenotypes at E16.5, by which systemic lymphatics formation is normally completed(Srinivasan et al., 2007). (Page 7, lines 208-212)

      In Prox1-flox(Prox1fl/+) mice, recombinant cells were labeled with EGFP(Iwano et al., 2012), as already described in the manuscript (Page 7, lines206-208). Therefore, the recombined cells can be visualized by EGFP expression in both heterozygous (fl/+) and homozygous (fl/fl) mice, which enables phenotype analysis referring to the recombined (knocked-out in fl/fl) cells. Importantly, these mice showed no specific phenotypes(Klotz et al., 2015; Maruyama et al., 2019). It is therefore reasonable to use heterozygous mice as controls to compare the phenotype appropriately. Although fl/fl littermates without cre exposure could usually serve as controls, they do not express EGFP in the Prox1 lineage, detracting from their utility(Klotz et al., 2015; Maruyama et al., 2019).

      Some of the phrases are not clear in the text- either because of the writing style or because the corresponding figures failed to support the statements. These include but are not limited to lines 104-106, 122, 206, 226, and 228-233.

      104-106: we crossed Isl1-Cre mice, which express Cre recombinase under the control of the Isl1 promoter and in which second heart field-derivatives are effectively labeled, with the transgenic reporter line R26R-tdTomato at E16.5.

      Response:

      We have re-phrased this sentence as follows:

      we crossed Isl1-Cre mice, which express Cre recombinase under the control of the Isl1 promoter and in which second heart field-derivatives are effectively labeled, with the transgenic reporter line R26R-tdTomato and analyzed at E16.5, when lymphatic networks are distributed throughout the whole body. (Pages 4, lines 109-112)

      122: After tamoxifen was administered at E8.5, tdTomato+ cells were broadly detected in the muscle in the head and neck regions at E16.5, indicating effective Cre-mediated recombination of the target gene.

      Response;

      We have re-phrased this sentence as follows:

      After tamoxifen was administered at E8.5, tdTomato+ cells were broadly detected in the skeletal muscle in the head and neck regions at E16.5, indicating effective Cre recombination in CPM-derived musculatures. (Page 4, lines 130-132)

      We have also included red arrowheads, indicating CPM-derived musculatures in Supplemental Figure 1.

      206: These results suggested that defects in LEC differentiation and/or maintenance due to Prox1 deletion in the Isl1+ lineage were compensated for by other cell sources, probably of venous origin, in facial skin, but not in the tongue, resulting in impaired lymphatic vessel formation in the tongue.

      Response:

      We have re-phrased this sentence as follows:

      These results suggested that defects in LEC differentiation and/or maintenance due to Prox1 deletion in the Isl1+ lineage were compensated for by LECs from other cell sources, probably of venous origin, in facial skin, but not in the tongue. (Page 7-8, lines 231-233)

      226:  Almost all of the LYVE1+/PECAM+ lymphatic vessels in the tongue were positive for eGFP in the Tie2-Cre;Prox1fl/+ heterozygous mice (Figure 6A and Supplemental Figure 3D), indicating that the majority of LECs derived from Isl1+ CPM cells developed through Tie2 expression in the tongue.

      Response:

      We have added new cartoon in Figure 6G to more clearly show the relation of Tie2 expression in Isl1+ lineages. Previous reports have used Tie2-Cre mice to show the vein-derived LECs (Klotz et al., 2015; Srinivasan et al., 2007) , because most of cardinal vein endothelium were composed of Tie2+ lineages. In our present study, in the tongue, most of the LECs were derived from Isl1+/Tie2+ lineages (Figure 1D, H, Figure 2D, Figure 4G, Figure 5B, N, Figure 6A and Supplemental Figure 3F, I). These data suggested that there was a group of Tie2+ lineages even though they are derived from non-venous Isl1+ lineages.

      Reference needed for Myf5-Cre as a driver for Myogenic CPM in the results section. Response:

      We have included several reference, as shown below:

      (Harel et al., 2012, 2009; Heude et al., 2018)

      1. Harel et al., Dev cell, 2009
      2. Harel et al., PNAS, 2012
      3. Heude et al., eLife, 2018
      4. In discussion the reference to Pitx2-driven mesenteric lymphatic heterogeneity (Mahadevan et al 2014) is missing yet Islet1 has been shown downstream of Pitx2 (Davis et al 2008). The authors should discuss their findings of gut lymphatic heterogeneity in this context, considering that mediastinum is mesentery-derived.

      Response:

      Isl1+ CPM-derived LECs have been distributed to the anterior mediastinum and their relationship to mesenteric lymphatic vessels, which continuous with the thoracic duct in the posterior mediastinum, is currently unclear. However, since this paper is valuable for understanding the heterogeneity of the origins of LECs, we have included the indicated paper (Mahadevan et al., 2014) in ‘Introduction’ section to show gut lymphatic heterogeneity. (Page 3, line 67)

      To reviewer #2

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript entitled "The cardiopharyngeal mesoderm contributes to lymphatic vessel development" identified a novel non-venous origin of craniofacial and cardiac LECs using genetic lineage tracing. Their results also revealed the spatiotemporal difference between CPM- and venous-derived LECs. Overall, the paper is well-organized and has certain implications for understanding lymphatic development. However, some issues still need to be improved:

      First of all, we would like to express our appreciation to the reviewer for all the constructive comments. We carefully read the reviewer’s comments and discussed it. We agree with the reviewer’s comments to make the text easier to understand and emphasize what we really want to say.

      Specific points were addressed as follows:

      (1). Clearly, the introduction needs to be more concise and focused on the main questions you propose to answer and why these questions are important.

      Response:

      We have revised introduction section to be more concise and focus on the developmental process of lymphatic vessels and its relation to CPM. (Page 2-4, lines 41-103)

      (2). In the discussion section, you should focus on how the questions have been answered and what they mean. And it would be rash to infer the role of LECs in lymphatic malformation. It would be helpful to validate the changes of CPM-derived LECs in LM patient samples.

      Response:

      We have revised the discussion section to be more concise. To demonstrate our findings more clearly, we have also revised and added some cartoons in Figure 6G and Figure 7.

      (3). For the statistical analysis, all the quantitative data should be tested for statistical significance. There are several bar charts lacking P values.

      Response:

      We have included P values in the Figure legends.

      Reviewer #2 (Significance (Required)):

      This study enriched the contribution of CPMs to broader regions of the facial, cardiac and laryngeal lymphatic network and revealed the spatiotemporally difference between CPM- and venous-derived LECs, which provided some basic reference for understanding lymphatic vessel development.

      To reviewer #3

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Short summary of the findings and key conclusions:

      The work from Murayama and colleagues traces the ontogenetic origin of the endothelial cells of the lymphatic vessels in the head and neck region. Using the Cre-lox-based mouse genetics approach, they conclude that the lymphatic endothelial cells (LECs) in this region have mixed origin, with contributions from both the cardiopharyngeal mesoderm (CPM) as well as from cardinal vein. The lineage tracing study is buttressed by assaying LEC formation following selective deletion of the key LEC regulator Prox1 in CPM lineage.

      First of all, we would like to express our appreciation to the reviewer for all the constructive comments. We carefully read the reviewer’s comments and discussed it.

      Specific points were addressed as follows:

      Major comments: 1) The key conclusions: LECs in the head and neck region derive from CPM. LECs in this region have mixed developmental origins. Both these conclusions are convincingly supported by the study. However, the work would greatly be strengthened by Pax3-Cre lineage tracing. This would complement the Isl1-Cre lineage tracing. As the authors observe, the LEC descendants of Isl1+ cells also appear to go through Tie2+ state. Therefore, Tie2-Cre study has not helped to delineate the LECs of CPM and cardinal vein origins. In this context, tracing with Pax3-Cre is likely to give a very clear picture of LEC origins.

      Response:

      We agree with the reviewer in that the data using Pax3-Cre mice will strengthen our manuscripts. Unfortunately, we could not find out researchers who had this line in our society in Japan. For using this line, we need to get cryo-recovered mice from Jaxon laboratory. It will take at least several months. Therefore it is not realistic for us to use Pax3-Cre mice in this work because of time limitation. Instead, we addressed this issue by rewriting the discussion on the possible complementation with the Pax3-Cre lineage by citing (Lupu et al., 2022; Stone and Stainier, 2019).

      This point has been addressed in the text as follows:

      A recent study has suggested that Pax3+ paraxial mesoderm-derived cells contribute to the cardinal vein and therefore venous-derived LECs originate from the Pax3+ lineage (Stone and Stainier, 2019). The same group has further argued that the Pax3+ lineage gives rise to lymphatic vessels on the trunk side through lymphangiogenesis(Lupu et al., 2022). Therefore, the Isl1+ and Pax3+ lineages may complement each other to form systemic lymphatic vessels. (Page 10, lines 314-319)

      2) In addition, the article should be revised to include the number of sections and the number of cells counted per embryo in the Figure legend in each case. This will help assess how robust and reliable are the measurements.

      Response:

      We have revised the statistical methods from the ratio of the count of the number to the area in Figure 1H, 2H, P, and Supplemental Figure 3I to demonstrate more precisely the contribution of Tomato+ cells in lymphatic vessels. We also added more detailed description of the quantification methods in ‘Materials and Methods’ section, as follows:

      Quantification of the section and whole mount images

      For the quantification of section immunostaining at E16.5 embryos, the average of two 16-μm-thick sections taken every 50 μm and 10x power field of views (0.42 mm2) for each anatomical part (the larynx, the skin of the lower jaw, the tongue, and the cardiac* outflow tracts) were subjected to the analyses. In the facial skin, lymphatic vessels in superficial layers of dermis were subjected to the analyses. The middle sagittal sections, including the aorta, larynx, and tongue, which were hall marks of midline, was selected from created sections. The coronal sections, including both eyes, tongue, and olfactory lobes with left and right symmetrical features, was selected. For E12.5 embryos (Figure 4O), two 16-μm-thick sagittal sections taken every 50 μm, including the 1st and 2nd pharyngeal arches and outflow tracts, were subjected to analyses. The area and the number of Prox1+ cells were measured manually using ImageJ software. For the whole mount immunostaining of embryos and the heart, the whole samples were scanned every 20 μm and confirmed eYFP contribution to LECs (Figure 3) and cardinal veins (Figure 4J, and Supplemental Figure 4B, D, F, H, J). * (Pages 13, lines 402-416)

      We have also included the number of eYFP+/Prox1+ cells among Prox1+ cells in the first and second pharyngeal in the Figure 4O legends as follows;

      • (the number of eYFP+/Prox1+ cells (10.83 (mean) ± 1.249 (SEM)): Prox1+ cells (30.83 ± 4.549)) or E9.5 (the number of eYFP+/Prox1+ cells (2.833 ± 1.108): Prox1+ cells (35.50 ± 5.847)). (Page 23, lines 684-686)*

      Minor comments: 1) Several groups have contributed to the CPM literature. The citation of seminal works from Tzahor and Kelly groups is good, however, work from other groups has not been cited. For example, reports such as Heude et al and Grimaldi et al from Tajbakhsh group are very relevant to this work.

      Response:

      According to reviewer’s suggestion, we have included following references in the introduction section for the explanation of CPM derivatives. (P3, line 70)

      1. (Heude et al., 2018)
      2. (Grimaldi et al., 2022)

        2) It would help the reader if the authors explain the reasons for selecting specific regions, such as the tongue, and the skin of the lower jaw, for the study.

      Response:

      This is because many lymphatic vessels are distributed in these cardiopharyngeal area and these area is well known as anatomical parts where lymphatic malformation most often occurs. This has been mentioned in the manuscript as follows:

      From a clinical viewpoint, head and neck regions contributed by the CPM are the most common sites of lymphatic malformations (LMs) (Page 3-4, lines 99-100)

      3) The authors should consider presenting the wholemount images, such as those in Figures 3A and 3E for Figures 5 and 6. This would help assess the lymphatic vessel development in a holistic manner.

      Response:

      Although we tried to do the whole mount images of facial and tongue lymphatics, we could not succeed. Antibodies did not penetrate well on the tongue and, as for lymphatics of facial skin, their complicated morphology prevented clear visualization. Whole-mount imaging of the entire head was difficult for the same reason. In our experience, the antibody was useful for immunostaining of the early-stage embryos (up to E11.5) and the surface area of the heart, where lymphatic vessels were distributed on the epicardium. Even in the whole-mount heart, we have not succeeded in clear and estimable imaging of the vascular structure in the myocardium. Instead, we improved the quality of images and statistical comparisons in the revised manuscript, which we believe makes it more convincing.

      Reviewer #3 (Significance (Required)):

      The nature and significance of the advance for the field & the work in the context of the existing literature: Groups working in the domain of cardiopharyngeal mesoderm (CPM) have focussed on skeletal muscle and heart development. This pool is also known to give rise to skeletal tissues as well as blood vessel endothelium. A recent work Nomaru et al. (Morrow group, Nat Commun 2021) has identified a multi-lineage primed population in the cardiopharyngeal field. In this context, the work from Maruyama and colleagues highlights the versatility of CPM by providing evidence for the emergence of LEC from this multipotent pool. This complex developmental potential of CPM has implications to understand the evolutionary origin of CPM itself.

      The connective tissues in the head/neck have mixed origins (Heude et al, 2018 and Grimaldi et al 2022 from Tajbakhsh group)- from CPM as well as neural crest. This work shows mixed origin for LECs. These works begin to put together the pieces of the puzzle of vertebrate head evolution. Jacob proposed evolution is tinkering. This appears to be true both at the molecular level as well as the cellular level. Head tissues appear to have been put together by exploiting varied sources.

      The study is of broad interest to developmental biologists.

      Reviewer: A developmental biologist with an interest in understanding the axial patterning of mesoderm early during mammalian development. Not an expert in lymphatic vasculature development.

      References for the revision

      Adachi N, Bilio M, Baldini A, Kelly RG. 2020. Cardiopharyngeal mesoderm origins of musculoskeletal and connective tissues in the mammalian pharynx. Development 147:dev185256. doi:10.1242/dev.185256

      Cai C-L, Liang X, Shi Y, Chu P-H, Pfaff SL, Chen J, Evans S. 2003. Isl1 Identifies a Cardiac Progenitor Population that Proliferates Prior to Differentiation and Contributes a Majority of Cells to the Heart. Dev Cell 5:877–889. doi:10.1016/s1534-5807(03)00363-0

      Grimaldi A, Comai G, Mella S, Tajbakhsh S. 2022. Identification of bipotent progenitors that give rise to myogenic and connective tissues in mouse. Elife 11:e70235. doi:10.7554/elife.70235

      Harel I, Maezawa Y, Avraham R, Rinon A, Ma H-Y, Cross JW, Leviatan N, Hegesh J, Roy A, Jacob-Hirsch J, Rechavi G, Carvajal J, Tole S, Kioussi C, Quaggin S, Tzahor E. 2012. Pharyngeal mesoderm regulatory network controls cardiac and head muscle morphogenesis. Proc National Acad Sci 109:18839–18844. doi:10.1073/pnas.1208690109

      Harel I, Nathan E, Tirosh-Finkel L, Zigdon H, Guimarães-Camboa N, Evans SM, Tzahor E. 2009. Distinct Origins and Genetic Programs of Head Muscle Satellite Cells. Dev Cell 16:822–832. doi:10.1016/j.devcel.2009.05.007

      Heude E, Tesarova M, Sefton EM, Jullian E, Adachi N, Grimaldi A, Zikmund T, Kaiser J, Kardon G, Kelly RG, Tajbakhsh S. 2018. Unique morphogenetic signatures define mammalian neck muscles and associated connective tissues. Elife 7:e40179. doi:10.7554/elife.40179

      Klotz L, Norman S, Vieira JM, Masters M, Rohling M, Dubé KN, Bollini S, Matsuzaki F, Carr CA, Riley PR. 2015. Cardiac lymphatics are heterogeneous in origin and respond to injury. Nature 522:62–67. doi:10.1038/nature14483

      Lioux G, Liu X, Temiño S, Oxendine M, Ayala E, Ortega S, Kelly RG, Oliver G, Torres M. 2020. A Second Heart Field-Derived Vasculogenic Niche Contributes to Cardiac Lymphatics. Dev Cell 52:350–363. doi:10.1016/j.devcel.2019.12.006

      Lupu I-E, Kirschnick N, Weischer S, Martinez-Corral I, Forrow A, Lahmann I, Riley PR, Zobel T, Makinen T, Kiefer F, Stone OA. 2022. Direct specification of lymphatic endothelium from non-venous angioblasts. Biorxiv 2022.05.11.491403. doi:10.1101/2022.05.11.491403

      Mahadevan A, Welsh IC, Sivakumar A, Gludish DW, Shilvock AR, Noden DM, Huss D, Lansford R, Kurpios NA. 2014. The Left-Right Pitx2 Pathway Drives Organ-Specific Arterial and Lymphatic Development in the Intestine. Dev Cell 31:690–706. doi:10.1016/j.devcel.2014.11.002

      Maruyama K, Miyagawa-Tomita S, Mizukami K, Matsuzaki F, Kurihara H. 2019. Isl1-expressing non-venous cell lineage contributes to cardiac lymphatic vessel development. Dev Biol 452:134–143. doi:10.1016/j.ydbio.2019.05.002

      Morisada T, Oike Y, Yamada Y, Urano T, Akao M, Kubota Y, Maekawa H, Kimura Y, Ohmura M, Miyamoto T, Nozawa S, Koh GY, Alitalo K, Suda T. 2005. Angiopoietin-1 promotes LYVE-1-positive lymphatic vessel formation. Blood 105:4649–4656. doi:10.1182/blood-2004-08-3382

      Motoike T, Loughna S, Perens E, Roman BL, Liao W, Chau TC, Richardson CD, Kawate T, Kuno J, Weinstein BM, Stainier DYR, Sato TN. 2000. Universal GFP reporter for the study of vascular development. Genesis 28:75–81. doi:10.1002/1526-968x(200010)28:23.0.co;2-s

      Nathan E, Monovich A, Tirosh-Finkel L, Harrelson Z, Rousso T, Rinon A, Harel I, Evans SM, Tzahor E. 2008. The contribution of Islet1-expressing splanchnic mesoderm cells to distinct branchiomeric muscles reveals significant heterogeneity in head muscle development. Development 135:647–57. doi:10.1242/dev.007989

      Srinivasan RS, Dillard ME, Lagutin OV, Lin F-J, Tsai S, Tsai M-J, Samokhvalov IM, Oliver G. 2007. Lineage tracing demonstrates the venous origin of the mammalian lymphatic vasculature. Gene Dev 21:2422–2432. doi:10.1101/gad.1588407

      Stone OA, Stainier DYR. 2019. Paraxial Mesoderm Is the Major Source of Lymphatic Endothelium. Dev Cell 50:247-255.e3. doi:10.1016/j.devcel.2019.04.034

      Tammela T, Saaristo A, Lohela M, Morisada T, Tornberg J, Norrmén C, Oike Y, Pajusola K, Thurston G, Suda T, Yla-Herttuala S, Alitalo K. 2005. Angiopoietin-1 promotes lymphatic sprouting and hyperplasia. Blood 105:4642–4648. doi:10.1182/blood-2004-08-3327

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Short summary of the findings and key conclusions:

      The work from Murayama and colleagues traces the ontogenetic origin of the endothelial cells of the lymphatic vessels in the head and neck region. Using the Cre-lox-based mouse genetics approach, they conclude that the lymphatic endothelial cells (LECs) in this region have mixed origin, with contributions from both the cardiopharyngeal mesoderm (CPM) as well as from cardinal vein. The lineage tracing study is buttressed by assaying LEC formation following selective deletion of the key LEC regulator Prox1 in CPM lineage.

      Major comments:

      1. The key conclusions: LECs in the head and neck region derive from CPM. LECs in this region have mixed developmental origins. Both these conclusions are convincingly supported by the study. However, the work would greatly be strengthened by Pax3-Cre lineage tracing. This would complement the Isl1-Cre lineage tracing. As the authors observe, the LEC descendants of Isl1+ cells also appear to go through Tie2+ state. Therefore, Tie2-Cre study has not helped to delineate the LECs of CPM and cardinal vein origins. In this context, tracing with Pax3-Cre is likely to give a very clear picture of LEC origins.
      2. In addition, the article should be revised to include the number of sections and the number of cells counted per embryo in the Figure legend in each case. This will help assess how robust and reliable are the measurements.

      Minor comments:

      1. Several groups have contributed to the CPM literature. The citation of seminal works from Tzahor and Kelly groups is good, however, work from other groups has not been cited. For example, reports such as Heude et al and Grimaldi et al from Tajbakhsh group are very relevant to this work.
      2. It would help the reader if the authors explain the reasons for selecting specific regions, such as the tongue, and the skin of the lower jaw, for the study.
      3. The authors should consider presenting the wholemount images, such as those in Figures 3A and 3E for Figures 5 and 6. This would help assess the lymphatic vessel development in a holistic manner.

      Significance

      The nature and significance of the advance for the field & the work in the context of the existing literature:

      Groups working in the domain of cardiopharyngeal mesoderm (CPM) have focussed on skeletal muscle and heart development. This pool is also known to give rise to skeletal tissues as well as blood vessel endothelium. A recent work Nomaru et al. (Morrow group, Nat Commun 2021) has identified a multi-lineage primed population in the cardiopharyngeal field. In this context, the work from Maruyama and colleagues highlights the versatility of CPM by providing evidence for the emergence of LEC from this multipotent pool. This complex developmental potential of CPM has implications to understand the evolutionary origin of CPM itself.

      The connective tissues in the head/neck have mixed origins (Heude et al, 2018 and Grimaldi et al 2022 from Tajbakhsh group)- from CPM as well as neural crest. This work shows mixed origin for LECs. These works begin to put together the pieces of the puzzle of vertebrate head evolution. Jacob proposed evolution is tinkering. This appears to be true both at the molecular level as well as the cellular level. Head tissues appear to have been put together by exploiting varied sources.

      The study is of broad interest to developmental biologists.

      Reviewer: A developmental biologist with an interest in understanding the axial patterning of mesoderm early during mammalian development. Not an expert in lymphatic vasculature development.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript entitled "The cardiopharyngeal mesoderm contributes to lymphatic vessel development" identified a novel non-venous origin of craniofacial and cardiac LECs using genetic lineage tracing. Their results also revealed the spatiotemporal difference between CPM- and venous-derived LECs. Overall, the paper is well-organized and has certain implications for understanding lymphatic development. However, some issues still need to be improved:

      1. Clearly, the introduction needs to be more concise and focused on the main questions you propose to answer and why these questions are important.
      2. In the discussion section, you should focus on how the questions have been answered and what they mean. And it would be rash to infer the role of LECs in lymphatic malformation. It would be helpful to validate the changes of CPM-derived LECs in LM patient samples.
      3. For the statistical analysis, all the quantitative data should be tested for statistical significance. There are several bar charts lacking P values.

      Significance

      This study enriched the contribution of CPMs to broader regions of the facial, cardiac and laryngeal lymphatic network and revealed the spatiotemporally difference between CPM- and venous-derived LECs, which provided some basic reference for understanding lymphatic vessel development.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Please see combined review below in the next section,

      Significance

      This is a descriptive manuscript providing a few new insights into a well-recognized and biologically important phenomenon - the lymphatic endothelial cells have heterogeneous origins in different organs. Overall, the idea of Islet1 lineage contributes to regional lymphatic vessel formation during a particular developmental stage is exciting and proven with detailed and careful lineage tracing. The first observation that Islet1 lineage gives rise to cardiac lymphatic vessels was published by the same group in Dev Bio in 2019 so the novelty here is dampened, although the pharyngeal lymphatics and the exact time of these non-venous origin lymphatic vessels arise were not previously characterized - so the current manuscript does provide new important insights. Both the data quality and manuscript layout need improvements, especially when it comes to defining where Islet1 is expressed at all the stages and statistics. The following suggestions will deepen the scope of the manuscript:

      1. Whether Isl1 lineage is independent of venous-derived endothelial cells.
      2. This point is very important: the manuscript does not actually show Isl1 expression through the stages they are inducing. I would want to be sure that lymphatic endothelial cells at this stage don't express Isl1. Another way to get at this is to maybe use other second-heart fields or even broader mesoderm drivers that are known to be never expressed in endothelial cells to confirm the findings.
      3. The author stated that Islet1 lineage gives rise to lymphatic endothelial cells via the Tie2 mechanism but did not elaborate on this part. What is the potential relationship between Islet1 and Tie2? Or Tie2 just serves as a pan-endothelial lineage marker here?
      4. The effect of lineage-specific Prox1 knockout is very descriptive, without any discussion of the potential biological function of such cellular origin heterogeneity. This part may be worth a few follow-up experiments in later embryonic stages or even in postnatal stages. The authors demonstrated that loss of Prox1 in Islet1 lineage decreases the number of lymphatic vessels and leads to lymphangiectasia, but whether this phenotype can be later compensated or shows any clinical impact was not proven. Therefore, the statement made in line 206 is questionable, and whether Islet1 lineage-derived lymphatic endothelial cells are dispensable/indispensable remains unclear.

      The layout of the manuscript needs to be reorganized:

      1. Details in statistical methods and quantification logic were completely missing from the manuscript. For example, definitions of "a sample" (how many sections are taken from one biological sample and how many fields take from one section, etc.), "number of vessels per field", "diameters", and of what parameters the numbers were normalized to, etc. need to be described in the materials and methods section. For instance, it is not clear how "tomato+ lymphatic vessels per field/Vegfr3+ lymphatic vessels" was defined. First, what proportion of tomato+ cells need to colocalize with Vegfr3 expression cells in a specific vessel to make this vessel being determined as a "tomato+ lymphatic vessel"? Most data provided here are section immunostaining where "multiple vessels" are very likely coming from different cross-sections of one same vessel in the same field. Second, Vegfr3 can stain venous endothelial cells in earlier stages so the specificity of this marker can be controversial. These are some important technical aspects to include in the revised version. Figures needing more description in quantification methods include but are not limited to Fig 1H, 2H, 2P, 5K-R.
      2. Data resolution needs to be improved. The magnification of the figures in Fig 1-4 is not sufficient to demonstrate the marker colocalization as described in the texts. Single-channel images (such as the ones shown in Fig 5-6 but in higher magnifications) are also necessary to show the co-expression of markers. Figure numbers are missing from the main figures making it a bit challenging to read.
      3. The experimental design is not well-elaborated in the context. For example, the scientific logic of choosing a particular time point/stage for lineage-knockout induction or sample collection needs to be justified. Also, it seems that the authors are using fl/+ as control littermates in most of the experiments. Any specific reason favors using fl/+ heterozygous instead of fl/fl littermates without cre exposure, which is the more commonly used control sample in this kind of comparison, should be addressed.
      4. Some of the phrases are not clear in the text- either because of the writing style or because the corresponding figures failed to support the statements. These include but are not limited to lines 104-106, 122, 206, 226, and 228-233.
      5. Reference needed for Myf5-Cre as a driver for Myogenic CPM in the results section.
      6. In discussion the reference to Pitx2-driven mesenteric lymphatic heterogeneity (Mahadevan et al 2014) is missing yet Islet1 has been shown downstream of Pitx2 (Davis et al 2008). The authors should discuss their findings of gut lymphatic heterogeneity in this context, considering that mediastinum is mesentery-derived.
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, Dobson and colleagues use the well-established Drosophila mitonuclear model to ask how diversity in mitochondrial and nuclear background impacts reproductive response to various diets (DMN, diet-by-mito-by-nuclear interactions). For this, they generated populations from three different geographical locations by introgressions to produce different mitonuclear combinations, which they produced in triplicates, and after preliminary analyses they excluded one geographical population (Canada), which brought down the total number of populations from 27 to 12. Then, they studied at the effects of 3 diets: a standardized control diet, a diet enriched in essential amino acids, and one enriched in lipids on various reproductive traits in two feeding regimes: one chronic (parents and offspring fed the diets), or parental (parents fed the experimental diets, eggs developed on a standard medium). They found that the magnitude of the dietary effects and the effect of parental nutrition on offspring fitness were dependent on mitonuclear genotype, with some diets unexpectedly deleterious in specific cases. They also show that this DMN variation is repeatable among independent replicates, and that mitonuclear epistasis plays a major role in the response to nutrition. Finally, they find an association between DMN interactions and a polymorphism in the mitochondrial gene long ribosomal RNA (mt:lrRNA), showing that a mitochondrial regulatory factor plays a role in DMN effects.

      Major comments:

      The authors have collected an impressive and very useful amount of data, with rigorous breeding regimes and statistical analysis. Their findings, interpretations and conclusions are in my opinion strongly supported by their experimental data. I have no major concerns with the study, and I believe it deserves to be in a wide-audience and high-impact journal. I only have some minor corrections and suggestions listed as follows.

      Minor comments:

      • Have the authors thought about looking at rates of food consumption, and could some populations be consuming less (or none) of the experimental diets, possibly explaining lethality?

      • Line 136: this study's data does indeed show diet-dependent effects of mitonuclear interactions, but it not the first one, maybe rephrase? See refs (a couple are cited in the supplementary text, but it would be useful to add to the main text): Aw, W. C. et al (2018). PLoS Genetics doi:10.1371/journal.pgen.1007735 Camus, M. F., Moore, J., & Reuter, M. (2020). Biology Letters, 16(2), 20190891. doi:10.1098/rsbl.2019.0891 Camus, M. F., Kotiadis, V., Carter, H., Rodriguez, E., & Lane, N. (2022). bioRxiv, doi:10.1101/2022.02.10.479862 Cormier, R. P. J. et al. (2019). Scientific Reports. doi:10.1038/s41598-018-36060-5 Rodríguez, E. et al. (2021). Frontiers in Genetics. doi:10.3389/fgene.2021.734255 Towarnicki, S. G., & Ballard, J. W. O. (2018). Frontiers in Genetics. doi:10.3389/fgene.2018.00593

      • Figure 1D and results section lines 156-178: why is the term "eggs" included in development? I assume it refers to the measure of the amount of eggs, could the authors explain and mention it in the main text and/or figure legend? And why is it "eggs+1" in fig S2D, I could not find an explanation in the methods for this difference in the label.

      • Line 221, last sentence of the paragraph: I would make it clearer that this was in the "chronic" paradigm, I found it difficult to follow here.

      • Line 446 typo in the figure legend, should read "24h on..."

      • Figure 2A: "Genome origin" section of the legend could be placed on the top of the legend, I would find it easier to see as we read the figure from left to right.

      • Figure 2B: some kind of separation between T and C (dashed lines for eg.) would be appreciated to be able to compare better at a glance.

      • Figure S2D: boxplot lines seem very thin, could the outline be made clearer? It was especially difficult to see on a printed version.

      • Line 506, S2B legend: should "Benin" be used instead of "Dahomey"? Keeping the Dahomey mentioned only in Supporting text line 153.

      • Figure S3B and its legend text: the diets now appear completely different, except for EAA. Why is it "sya", should it be the same as "rearing diet" in S2B, or is it the "control" diet? "Mar", margarine, is to my understanding the lipid diet mentioned throughout the paper? I assume this is a matter of fixing the legend labels, unless I misunderstood this part. In the legend text lines 532-536, I could not find any explanation for this difference in labels.

      • Supplementary text line 43: "wild-type" meaning Benin flies?

      • Supp. text line 46: "development" medium is sya? See comment above about confusing diet names.

      • Supp. text line 51: I think the figures referenced are not the correct ones (S2B and S2C)? Should they be S2D and S2E?

      • Supp. text line 64: "AA1" is repeated twice in the parentheses, should it read "AA3" instead?

      CROSS-CONSULTATION COMMENTS

      I have read reviewer #1's comments and I see that we both agree on the quality and relevance of the paper, and the very minor corrections needed. Looking forward to seeing this published.

      Significance

      This study appears to me as very relevant in the context of predicting outcomes of dietary regimes (and more) given a certain mitonuclear background, which is very much needed in the field and in the context of personalized medicine. Repeatability of the effects gives confidence to the type of breeding regime and analyses done in the field of mitonuclear interactions (for example, in the research done by the authors cited previously). Pinpointing a synonymous SNP in a non-coding region of mtDNA is important in the context of deciphering the mechanism(s) behind DMN. As I come from the mitochondrial physiology and aging field, and currently work on mitonuclear interactions, these findings appear to me of great importance, but they will also appeal to a wider audience in the evolutionary biology and biomedicine fields; these results will contribute to advancing our knowledge of mitochondria-nucleus interactions in different environments, up to the human personalized medicine perspective.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Here, the authors investigate mitochondrial x nuclear x diet interactions in Drosophila melanogaster. They start by constructing D. melanogaster populations comprising fully-factorial combinations of mitochondrial and nuclear genomes from Australia, Benin, and Canada. This creates discrete populations with distinct combinations of mitochondrial and nuclear genomes. These populations are then exposed to various diets, notably a control diet, a diet high in essential amino acids (which promotes fecundity), and a diet high in plant-based lipids (which represses fecundity). They first screened for evidence of repeatable mitonuclear effects on fecundity, which they found. They then screened for additional fitness traits looking for influence of mito-nucleotype on response to chronic vs parental dietary changes. Once again, they were able to find evidence of phenotypic variation dependent on mito-nucleotype; however, the effect of mito-nucleotype on traits was variable. Certain mito-nucleotypes exhibited lower to near lethality progeny counts after amino acid feeding (which is thought to promote fecundity), while others exhibited normal counts. When quantifying the size of various effects, they found that mitonucleotype interactions often had comparable effect size to that of diet:mitotype interactions, diet:nucleotype interactions, and diet on its own. Finally, they were able to associate this mito-nucleotype interaction with a mt:lrRNA C/T polymorphism that had nucleotype-dependent effects on fertility.

      Major Comments:

      I find the key conclusions convincing, and I feel as if though the authors sufficiently show that there are mitonucleotype x diet interactions on fecundity/fertility traits. Furthermore, their ability to associate the phenotypic variation with a mt:lrRNA polymorphism is also of interest. I do not feel as if though the authors make claims that lack support in the paper. I do not feel as if though additional experiments would be essential to support their claims; however, I would be remiss not to acknowledge that the bulk of the experimental results came down to a 2 x 2 population exploration (Australia-Australia, Australia-Benin, Benin-Australia, Benin-Benin). This study would have been aided by more mitonucleotypes; however, I understand that to generate additional mitonucleotypes would have taken additional time and resources. As most of my comments are minor and should be easily addressed by the authors, I place them in the following section.

      Minor Comments:

      Below are minor comments and questions I had while reading the manuscript. I apologize in advance if any of my questions are answered in the main text, and I missed them while reviewing.

      • Page 3, line 35 - you state that variation in mitochondrial function can contribute to variation in dietary optima. I would add a citation here.

      • Page 4, line 74 - If possible, I would add a little more clarifying detail regarding the crossing scheme to the main text. I would just be clear that the generation of these mitonucleotypes takes advantage of the maternal transmission of the mitochondrial genome by crossing virgin females from population 1 to males from population 2, followed by continual backcrossing of virgin females each generation to males from population 2. You go into specific details in the materials & methods section, but I was surprised not to see it earlier. I know this is probably more detail, but I think it's better to be very clear about the crossing scheme used.

      • Page 5, line 90 - I would move the citations for the fact that enriching essential amino acids promotes fecundity from the supplement to the main text. You say that this is an established manipulation, and it would be best to point to examples of this manipulation, especially as some of your results find that fertility is negatively affected.

      • Page 8, line 129 - In this paragraph, you speak about the impacts of chronic EAA feeding in the AA, BA, and AB mitonucleotypes, but you do not coment on BB? Is there a reason for this? (This connects to one of my comments later)

      General Questions - This may be outside the scope of your study, but I was surprised that there was no commentary on co-adaptation (or the potential lack of) between the mitochondrial and nuclear genome in the discussion. It's not immediately clear to me how isolated and for how long the Australian and Benin populations are, but I would expect there to be co-evolution/co-adaptation between the mitochondrial and nuclear genomes. Consequently, I would have expected AA and BB populations to show elevated fitness values; however, it actually seems as if though the AA mitonucleotype performs the worst when given the chronic EAA diet. I wonder if you could comment on the co-evolutionary potential between these two genomes.

      Significance

      • It is becoming increasingly clear when studying mitochondrial x nuclear interactions that the environmental context of the study can significantly influence the results (see Camus et al. 2019, Rodriguez et al. 2021, and Towarnicki and Ballard 2018). This study furthers our understanding of mitochondrial x nuclear x environment interactions by exploring fully-factorial combinations of mitonucleotypes on two distinct diets (enriched for essential amino acids or plant-based lipids) and evaluating the fertility/fecundity of the different mitonucleotypes. They were able to identify a signle mt:lrRNA SNP that was associated with the measured phenotypes, raising the question of whether and how mitochondria-derived regulatory factors influence phenotypic variation. I also find the comparison to the omnigenic model compelling, particularly their commentary that mitochodnrial genes could contribute to the "core gene" set for several notable phenotypes.

      • I believe that this work would be of interest to those obviously studying the evolution and importance of mitochondrial x nuclear x environmental interactions, and I also believe that this work would be of interest to those interested in the effects of various nutrients/diets in both Drosophila and humans.

      • My expertise is as a theoretical population and evolutionary geneticist. I am primarily interested in genetic conflict, including between the mitochondrial and nuclear genome. I am interested in how these two genomes evolve to both cause and resolve genetic and sexual conflict.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      In this article, Amico et al. explore how Spindly self-regulates its interaction with Dynein-Dynactin. They propose that Spindly adopts an auto-inhibited, closed conformation that blocks the CC1 box and Spindly motif, preventing its interaction with dynein-dynactin. The authors used a combination of X-ray crystallography, biochemistry, and structure predictions to detail the intramolecular interactions in Spindly that mediate this closed state. They then use analytical SEC to test their proposed auto-inhibition mechanism by monitoring Spindly binding to the pointed end complex. They suggest that auto-inhibited Spindly is unable to bind Dynein-Dynactin regardless of the presence or absence of Spindly's cargo, the RZZ complex. In contrast, by using mutagenesis to prevent this auto-inhibition, the authors show that uninhibited Spindly can interact with members of the Dynein-Dynactin complex. Finally, they use cellular experiments to show that relieving autoinhibition prevents the proper localization of Spindly and Dynein-Dynactin to kinetochores during mitosis, likely due to the formation of ectopic Spindly-Dynein-Dynactin complexes in these cells.

      This is an interesting paper that provides important insights into the mechanism of Spindly regulation and its associations with its interacting partners. However, additional work is necessary to support some of their conclusions. In addition, the text is at times quite dense and harder to follow, which prevents their findings as being impactful as could be possible for the bigger picture paradigms of kinetochore function.

      We thank the reviewer for a supportive assessment and for raising some concerns that we have now fully addressed in our revision.

      Major Points:

      The crosslinking and mass photometry experiments are done at highly differing concentrations (5 μM vs. 10 nM). The mass photometry should be performed at the same concentration as the crosslinking experiments to determine if Spindly forms a higher order oligomer at the higher concentration. These results will aid in the interpretation of the crosslinking mass spectrometry experiments, as the observed interactions could be intermolecular contacts rather than intramolecular contacts if Spindly is tetrameric at these concentrations, as is suggested in figure 4E for specific Spindly constructs.

      We thank the reviewer for raising this point. Mass photometry (MP) requires very low sample concentrations as it is essentially a single molecule technique, and therefore the particle density cannot be increased arbitrarily. To assess whether the Spindly construct is prevalently tetrameric at the concentration of the crosslinking experiment, we performed the crosslinking experiments at the standard concentration, and only then diluted the samples and performed MP measurements. The results, displayed in two new panels (Figure 1 – Supplement 1M-N), show that crosslinked samples are primarily dimeric, providing further evidence that we are looking at bona fide intra-dimer contacts.

      In figure 2, more conclusive evidence is needed to show that full length Spindly does not form a complex with Dynein-Dynactin. My interpretation of the gels in figure 2D suggests that full length Spindly does form a complex with Dynein-Dynactin, as in the final gel (red outline) it looks as if full length Spindly is indeed peaking with the rest of the Dynein-Dynactin proteins, albeit with excess Spindly eluting later. Figure legends containing protein concentrations used in SEC assays would aid in the interpretation of this data.To conclusively show that full length Spindly doesn't form a complex with Dynein-Dynactin, additional assays will be necessary, such as pull-down assays, or mass photometry.

      We have now added concentrations of binding species at the relevant points of the figure legends.

      The essence of the reviewer’s concern is that full length Spindly, like BicD2, binds the DD, which would invalidate our model that Spindly is auto-inhibited in absence of a second trigger (other than DD), or alternatively showing that auto-inhibition can be easily overcome. Our conclusion that Spindly remains auto-inhibited, however, is strongly supported by the gels in Figures 2D-G. There, the peak containing DD and BicD2 and eluting around 6.2 ml (panel D) is not visible when BicD2 is replaced with Spindly (panel E), and RZZ does not change this (panels F-G). Note that the peak at 6.6 ml appears to be a contaminant, possibly DNA, and it is visible also with individual Dynein and Dynactin samples. These experiments strongly support our point and we have tried to improve the presentation of the results by boxing relevant fractions of the displayed SDS-PAGEs.

      We have now also repeated these experiments with recombinant human Dynactin. The new results are displayed in Figure 2 – Supplement 2. Also in this case, we see minimal complex formation with Spindly and complex formation with BicD2, even if the trailing of Dynein, Dynactin, and Spindly in the earlier elution fractions (already in the absence of complex formation) makes the gels harder to interpret. We also note that these experiments are consistent with those with the isolated PE complex.

      Regretfully, we cannot gather additional information by mass photometry because even our positive control dissociates at the extremely low concentrations required to image this very large complex.

      In figure 3C, 3E, and figure 5C, there is a shift in the PE peaks in the presence of Spindly, but it isn't clear why doesn't the complex doesn't elute earlier than Spindly alone. If the complex is dissociating on the column, additional assays are necessary to confirm that these Spindly constructs stably interact with PE. If this shift is also accompanied by a major change in shape, thus allowing Spindly to elute later than it does alone, this needs to be explored or explained further.

      Elution from a size exclusion chromatography column is dominated by the hydrodynamic radius of the macromolecule. In this particular case, Spindly is highly elongated and essentially sets an upper limit for the elution volume of both the un-complexed and complexed protein. We have described this behavior in many other cases of highly elongated proteins (e.g. Huis in ‘t Veld et al. eLife 2019). We are aware that the binding affinity for the interaction of Spindly and the PE complex is low, and therefore are not surprised to observe dissociation of the complex during the SEC run, i.e. upon dilution of the sample after incubation. In these experiments, we have tried to focus on the shift in elution volume of the PE complex from its elution position in isolation.

      The authors should provide better a rationale for why the pointed-end complex is used in figure 3 in lieu of the complex used in figure 2.

      We now write that the Spindly motif of adaptors binds the pointed end complex with measurable affinity also in absence of Dynein (near line 294). We then clarify that “As the Spindly motif is predicted to sit within the autoinhibited portion of the protein, we hypothesized that the PE-Spindly motif interaction could be used as a proxy to measure the autoinhibition status of Spindly, bypassing the need to form the entire Dynein-Dynactin-Spindly complex.”

      In Figure 5I, WT Spindly also binds to LIC, although less WT Spindly is bound to LIC than Spindly CC2* or Spindly deltaRV. This should be addressed in the text.

      Thank you for pointing this out. We have now clarified this in the text near line 440.

      The authors claim that the mechanism they describe may be a paradigm for dynein activation by other adaptors at various cellular locations, but they aren't able to identify a mechanism for how Spindly converts from its auto-inhibited state to its permissive state. A more thorough examination of this mechanism is necessary to claim that this mechanism could be paradigmatic, or a revision of the text is needed.

      Following an additional concern by reviewer 3, we have now revised the text to meet this concern. So, both in the last sentence of the abstract, and in the last paragraph of the discussion, we do not any longer discuss our results as paradigmatic, although we have reasons to believe that they might be eventually recognized as such, after additional examples will have been analyzed.

      Minor Points:

      1) The manuscript could benefit from careful review of the text, captions, and figures, as a few minor typos and inconsistencies in the figures and text were present.

      We have now re-reviewed the text and figures to try eliminate residual inconsistencies.

      2) The list of common structural and functional features of Dynein-Dynactin adaptors could be indicated more clearly.

      We have re-written this part of the Introduction, where we now indicate more clearly the features of the DD complex

      3) Several times the authors use alpha fold predictions to confirm their data. Although the predictions support several of their conclusions, saying that predictions can confirm the data is an overstatement.

      We thank the reviewer for pointing this out. We now replaced “confirmed” with “also supported” on line 190, where we explicitly referred to AF2 predictions as “confirmatory”. We also re-wrote a statement in the Discussion where we had commented on the power of AF2 and indicate that it “became available in the late phases of our work as a guiding and validation tool” (line 524).

      4) Figure 1H would be improved by the addition of the amino acid numbers in the domain diagram.

      Fixed – we also added amino acid numbers in 1G for consistency.

      5) Concentrations used for each protein for the analytical SEC experiments should be listed in the figure or caption.

      Thank you for suggesting this. We have now added the protein concentrations for these experiments directly in the legends.

      6) In addition to the caption, it would be helpful to the reader to indicate which experiments use farnesylated Spindly.

      Done in legends wherever applicable.

      7) Error bars are missing from the WT sample in figure 5J. This figure would benefit from statistical analysis.

      Done – see also point 4, Reviewer 2.

      Significance:

      This paper builds on recent work from the Mussachio lab and others exploring the nature of the fibrous corona at kinetochores and the molecular basis for dynein recruitment. This paper is focused on the structural nature of the interactions that underlie Spindly recruitment to kinetochores and its interactions with dynein and other factors. Although reductionist in its approach, this paper has the potential to have broad implications for thinking about the control of corona assembly and dynein recruitment with an elegant auto-regulation of Spindly. Researchers interested in cell division, chromosome segregation, kinetochore function, dynein regulation, and the structural basis for core cellular processes should be interested in this paper.

      Reviewer #2

      The study by d'Amico et al. presents an in-depth analysis of how intramolecular folding of the coiled-coil adaptor Spindly regulates its interaction with the motor dynein and its obligatory co-factor dynactin. Using biochemical reconstitution and diverse biophysical approaches (including cross-linking mass spectrometry, X-ray crystallography, AF2-based structure prediction, size exclusion chromatography, and analytical ultracentrifugation), the authors uncover and dissect an intricate Spindly autoinhibition mechanism. At kinetochores Spindly is known to co-oligomerize into filaments with the RZZ complex (its kinetochore receptor/cargo), which drives expansion of the outermost kinetochore region (the corona). Here the authors show that Spindly is a dimer in solution and that successive coiled-coil segments interact with each other in an asymmetric 'closed' conformation that is unable to form a complex with dynein and dynactin. Specifically, a 2-residue insertion in the middle of Spindly's first coiled-coil (CC1) creates a kink that allows CC1 to fold back on itself, which has two important structural consequences: it brings a key segment in CC2 (residues 276-309) in contact with a CC1 region called the CC1 box (previously shown to bind dynein light intermediate chain), and it blocks a motif at the beginning of CC2, called the Spindly motif, from accessing the pointed end complex that caps dynactin's minifilament. Mutations in either the CC1 box, the CC1 2-residue insertion, or the CC2(276-309) segment, 'open up' full-length Spindly and promote its interaction with the dynactin pointed end complex and, in case of the latter two types of mutants, with dynein light intermediate chain. CC1 box-deficient Spindly and the CC2 segment mutant (which corresponds to two charge-inverting point mutations) also support complex formation of Spindly and intact dynein-dynactin. Interestingly, while the CC2 mutant can bind to RZZ, the interaction between RZZ and wild-type Spindly is insufficient to make Spindly competent for dynein-dynactin binding (even when RZZ-Spindly are phosphorylated by mitotic kinases). The authors therefore propose that releasing Spindly from autoinhibition requires an additional trigger at the kinetochore, which likely involves an interaction between the Spindly CC2(276-309) segment and an as yet unidentified kinetochore component. The CC2 mutant is also shown to be defective in kinetochore recruitment and in Spindly-RZZ filament formation in vitro, suggesting kinetochore recruitment of Spindly is coupled to kinetochore expansion through a mechanism involving CC2(276-309).

      The experiments are of excellent technical quality and the results are presented in a logical and concise manner. There is clarity in the writing (the introduction deserves particular praise), and the authors' conclusions are fully supported by the data. Although there is no direct structural evidence for Spindly's closed conformation, as the authors themselves are careful to point out, the numerous Spindly mutants that are characterized (only some of which are mentioned in the summary above) in aggregate make a convincing case for the proposed autoinhibition mechanism.

      We are very grateful to the reviewer for supporting our work

      Minor comments:

      • Page 5: "605-residue adaptor Spindly". State that "605-residue" refers to the human protein.

      We have added this clarification

      • Page 8: "The region of Spindly downstream of the Spindly box (residues 281-322) is very conserved among Spindly orthologues, but not among other members of the BICD adaptor family (Figure 1 - Supplement 1L)." This is not very obvious from the alignment shown in the figure.

      We agree with the reviewer that the text, as written, was confusing. We have now rephrased it and write “Downstream of the Spindly box, sequences of Spindly orthologues and BICD family adaptors diverge”

      • Page 13: "...(A23V-A24V) mutant, which has been previously shown to inhibit the interaction with the LIC2 in a similar assay (Gama et al., 2017)." The LIC isoform used in the referenced study was LIC1.

      Thank you for identifying this error. We have corrected the text accordingly.

      -Figure 5J: Information about statistical significance should be added.

      Done. See also Minor point 7, Reviewer 1.

      -Figure 7B - D: Red on black is not an ideal color choice for these graphs.

      We now replaced red with yellow

      -Page 15: When discussing the recently discovered interphase functions of Spindly, also cite Clemente et al. (2018; doi:10.3390/jdb6020009) and Conte et al. (2018; doi:10.1242/bio.033233).

      We apologize for the involuntary omission of these two references, which have now been included in the revised manuscript.

      -Page 17: "Evidence supporting this idea is that mutations in the 276-306 region, including the deletion of this entire fragment or the introduction of charge-inverting point mutations at residues 295 and 297 respectively abolish or largely decrease the kinetochore recruitment of Spindly ((Raisch et al., 2021) and this study),...". Sacristan et al. (2018) should also be cited in this context, as this study established the importance of residues 274-287 for Spindly recruitment to kinetochores.

      We agree and apologize for the inadvertent omission. We have now included the Sacristan et al. reference in this context.

      • Page 17: "In vitro, the 276-306 region is also required for the assembly of RZZ-Spindly filaments (this study and (Raisch et al., 2021))." It could also be mentioned here that residues 274-287 of Spindly are necessary for RZZ-Spindly filament formation in cells, as shown by Sacristan et al. (2018).

      We have now reported this fact on lines 560-561.

      • Page 17: "Plausibly, the solution to this conundrum will require biochemical reconstitutions addressing the spectrum of interactions that this protein establishes at the kinetochore." Presumably, "this protein" refers to Spindly, but this is not clear since the subject of the preceding sentence is RZZ.

      Done – line 565

      Significance

      Cargo transport by cytoskeletal motors must be tightly regulated to establish and maintain intracellular organization and for faithful execution of development, including cell division. Much of this regulation occurs at the motor-cargo interface but remains poorly understood at the molecular level. In recent years it has become clear that adaptor proteins not only provide a physical link between motors and their cargo but also participate in motor activation. Adaptor-coupled activation is particularly important for dynein, because adaptors promote dynein's interaction with its essential co-factor dynactin.

      BICD2 (along with other Bicaudal D proteins) is the most intensely studied dynein adaptor and has long been known to be subject to autoinhibition with regard to dynein-dynactin binding, which is relieved by cargo binding to the BICD2 C-terminal region. A important question has been whether the same regulatory logic applies to other dynein adaptors. The study by d'Amico et al. presents the first evidence that conformational inhibition extends to adaptors other than Bicaudal D proteins. The study also reveals that Spindly's autoinhibition mechanism is more complex than that of BICD2. This likely reflects Spindly's dual function in dynein-dynactin recruitment and kinetochore expansion. The results of d'Amico et al. suggest that the Spindly autoinhibition mechanism has evolved to coordinate the two processes, and this idea is further supported by a recent study on the RZZ-Spindly interaction from the same group (Raisch et al. 2021; doi:10.1101/2021.12.03.471119). One of the most important insights from d'Amico et al. is that there must be another binding partner of Spindly at kinetochores besides the RZZ complex that participates in the relief of Spindly autoinhibition. The study has therefore identified an important future research direction. It will be interesting to investigate whether additional adaptors follow the multi-step activation model proposed here for Spindly.

      Regarding the technical aspects, the study illustrates that AF2-based structure prediction is a powerful tool for investigating conformational regulation, and it introduces an important innovation: the ability to generate recombinant human dynactin opens the door to the engineering of dynactin mutants, which promises to accelerate mechanistic dissection of this essential dynein co-factor. In conclusion, the study represents a significant step forward in our understanding of how dynein-cargo interactions are regulated by adaptor proteins and is therefore of general interest for researchers studying the molecular mechanisms of chromosome segregation as well as intracellular transport.

      Reviewer #3

      The Dynein-Dynactin (DD) complex interacts with different activating adaptors to assemble functional motor complexes capable of moving along microtubules while transporting various cargoes. However, it remains poorly understood how DD activation is precisely controlled so that Dynein-mediated transport is only stimulated at the appropriate time and place. DD adaptor regulation is likely a crucial piece of this puzzle. In this manuscript, the authors show that Spindly, a mitotic adaptor of DD complex, undergoes a series of conformational rearrangements that result in efficient Spindly autoinhibition and affect its ability to bind DD. The work from d'Amico et al includes an impressive amount of biochemical and biophysical data, supported by well-designed experiments that are carefully documented. Resorting to crosslinking experiments and protein structural modelling, the authors find that several intramolecular contacts occur between specialized domains within Spindly N-terminus. The resulting compact conformation occludes important DD-binding motifs in Spindly and, thus, limits the access of DD to the adaptor. By utilizing different Spindly mutants predicted to render the adaptor more elongated, the authors bypass Spindly autoinhibition and rescue binding to DD in vitro. Surprisingly, unlike other DD adaptors, Spindly autoinhibition is not relieve upon binding to its cargo (the RZZ complex) arguing that the interaction with an additional binding partner is require to fully unleash the potential of Spindly to bind DD. In line with this, the authors identify a Spindly mutant that is unable to localize to kinetochores from human cells, despite its open conformation. Collectively, this work provides significant advances in the understanding of Spindly regulation and brings a new perspective to the mechanism of DD adaptor activation and therefore should be of interest for a wide audience.

      We are very grateful to the reviewer for the support and for the thorough and constructive evaluation of our work.

      Major concerns:

      • The authors show that Spindly 33-605 is able to form a complex with DD which eventually enables the recruitment of Dynactin to kinetochores from Spindly 33-605-expressing cells. This result is unexpected since this Spindly mutant lacks CC1 box, which has been previously shown to be required for the kinetochore localization of Dynactin (Sacristan et al 2018). A more comprehensive discussion about this discrepancy would enrich the article and benefit the audience.

      We thank the reviewer for pointing this out. We now write (line 492): “This result was unexpected, because the CC1 box has been previously shown to be required for kinetochore localization of Dynactin (Sacristan et al., 2018)”

      • In Fig.7, the authors show that two Spindly mutants (Spindly CC2* and Spindly chimera) are unable to fully decorate the kinetochores from human cells. The same is true for Spindly AA/VV mutant. Do the authors know whether these mutants are expressed as stable proteins in cells for example by performing a western blot analysis?

      In this revised version of our manuscript, we have explained more clearly that in this experiment we electroporate recombinant proteins. These are essentially the same proteins that we use for the experiments in vitro. This provides an internal test in these experiments, because we can verify, through their successful expression and purification, that the proteins are stable. We cannot exclude, however, that the proteins are “treated differently” in cells, for instance because they interact differently with certain binding partners in ways that modifies their stability. As the proteins are not expressed continuously, but rather introduced in the cells in a single electroporation event several hours before imaging, the overall levels of these proteins may differ. We have now included a representative western blot (Figure 7 – Supplement 1C) that demonstrates the levels of electroporated proteins in the experiments in Figure 7. SpindlyCC2*appears to be present at somewhat lower levels than the other constructs. mChSpindly33-605 and Spindlychimera, on the other hand, were present at very similar levels, supporting our conclusion that a kinetochore-binding region is impaired in the latter. We now refer in the main text to the uncertainty created by the comparatively lower cellular levels of SpindlyCC2*. We have also chosen more representative kinetochores for the insets of CC2* and Chimera in Figure 7A.

      • In line with the previous point, could the authors tether each Spindly mutant to the kinetochore for example by fusing the construct to known kinetochores proteinssuch as Mis12 and test whether these fusion constructs are now able to recruit Dynactin to kinetochores?

      This would be a potentially interesting experiment. However, reasoning that Spindly is a strong dimer that needs to interact with another strong hexamer like the RZZ complex, discouraged us as these stoichiometries would almost certainly complicate the interpretation of these experiments. It is clear that further work will be required to define the complete picture for this complex system.

      • The authors conclude that the 2-step or multistep mechanism involved in the regulation of Spindly activation may be a common mechanism to different DD adaptors. However, the authors point out to existing differences between the conformational arrangement of Spindly and another DD adaptor, BICD2, arguing against a common mode of regulation for all adaptors. This needs to be clarified.

      The reviewer has a good point and we have indeed tuned this down. We have re-written the last sentence of the abstract and replaced it with “Thus, our work illustrates how Dynein can be specifically activated at a defined cellular locale.” We also write (line 592): Whether a similar 2-step or multistep mechanism applies to additional cargo-adaptor systems is an important question for future studies.

      Minor concerns:

      • In Fig.2D, full length Spindly does not bind DD in vitro. This is most likely to occur because Spindly N-terminus adopts a compacted conformation and hinders the access to DD-binding motifs. In Fig.2B, the authors show a structural prediction for Spindly 1-275 which should adopt a more elongated shape. According to prevailing model, this construct should now be able to bind DD in a similar biochemical assay.

      We agree with the reviewer that Spindly1-275 (and Spindly∆276-306) might be expected to be strong DD binders based on our model. Indeed, these proteins bind to the PE, albeit apparently weakly. Nevertheless, as explained in lines 350 and following, these mutants appear to form higher oligomers and we have not been able to show convincingly that they are fully open and available to bind DD.

      • In Gama et al 2017, LIC1 was able to pull down a wild-type N-terminal Spindly construct. How do the authors reconcile this with the data presented in this manuecript?

      We have expanded the discussion, also to answer major point 5, reviewer 1, on line 446 and following, where we also refer to the observation of Gama et al. 2017.

      • The section where the authors test point mutations to open Spindly ("Opening up Spindly with point mutations") should be better contextualized. The transition is difficult to follow as it is.

      We have now rephrased this part of the text to make are thoughts clearer.

      • In the text, it is not clear whether Mps1 kinase is required to promote RZZ oligomerization in the presence of Spindly chimera, an uninhibited Spindly mutant. According to the model, this mutant construct should drive oligomerization independently of Mps1 (as the N-terminal deletion construct from Sacristan et al 2018).

      The reviewer is correct and we have rephrased this part of the text to clarify

      • The nomenclature the authors adopt for the CC1 second conserved motif (SCM) and for the Spindly motif (SM) can be confusing at some point when identifying each mutant in the text and figures. Nomenclature should be standardized.

      We agree with the reviewer and have now adopted a different nomenclature for the CC2 box or second conserved motif, namely HBS1, for Heavy Chain Binding Site 1. This functional annotation derives from work of one of our laboratories (Carter) and has been discussed with, and approved by, Geert Kops, whose laboratory had originally proposed the name “CC2 box”, as well as Reto Gassmann, Erika Holzbaur, Roberto Dominguez, Sam Reck Peterson, Rick McKenney, and Ahmet Yildiz.

      • In Fig.6A, mCh-Spindly 33-605 and mCh-Spindly chimera lines have the same color.

      Thank you for spotting this subtle mistake. We have corrected the color line.

      Significance

      The work represents a significant advance in the field and it would be of interest for a wide range of audiences.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The Dynein-Dynactin (DD) complex interacts with different activating adaptors to assemble functional motor complexes capable of moving along microtubules while transporting various cargoes. However, it remains poorly understood how DD activation is precisely controlled so that Dynein-mediated transport is only stimulated at the appropriate time and place. DD adaptor regulation is likely a crucial piece of this puzzle. In this manuscript, the authors show that Spindly, a mitotic adaptor of DD complex, undergoes a series of conformational rearrangements that result in efficient Spindly autoinhibition and affect its ability to bind DD. The work from d'Amico et al includes an impressive amount of biochemical and biophysical data, supported by well-designed experiments that are carefully documented. Resorting to crosslinking experiments and protein structural modelling, the authors find that several intramolecular contacts occur between specialized domains within Spindly N-terminus. The resulting compact conformation occludes important DD-binding motifs in Spindly and, thus, limits the access of DD to the adaptor. By utilizing different Spindly mutants predicted to render the adaptor more elongated, the authors bypass Spindly autoinhibition and rescue binding to DD in vitro. Surprisingly, unlike other DD adaptors, Spindly autoinhibition is not relieve upon binding to its cargo (the RZZ complex) arguing that the interaction with an additional binding partner is require to fully unleash the potential of Spindly to bind DD. In line with this, the authors identify a Spindly mutant that is unable to localize to kinetochores from human cells, despite its open conformation. Collectively, this work provides significant advances in the understanding of Spindly regulation and brings a new perspective to the mechanism of DD adaptor activation and therefore should be of interest for a wide audience.

      Major concerns:

      • The authors show that Spindly 33-605 is able to form a complex with DD which eventually enables the recruitment of Dynactin to kinetochores from Spindly 33-605-expressing cells. This result is unexpected since this Spindly mutant lacks CC1 box, which has been previously shown to be required for the kinetochore localization of Dynactin (Sacristan et al 2018). A more comprehensive discussion about this discrepancy would enrich the article and benefit the audience.
      • In Fig.7, the authors show that two Spindly mutants (Spindly CC2* and Spindly chimera) are unable to fully decorate the kinetochores from human cells. The same is true for Spindly AA/VV mutant. Do the authors know whether these mutants are expressed as stable proteins in cells for example by performing a western blot analysis?
      • In line with the previous point, could the authors tether each Spindly mutant to the kinetochore for example by fusing the construct to known kinetochores proteinssuch as Mis12 and test whether these fusion constructs are now able to recruit Dynactin to kinetochores?
      • The authors conclude that the 2-step or multistep mechanism involved in the regulation of Spindly activation may be a common mechanism to different DD adaptors. However, the authors point out to existing differences between the conformational arrangement of Spindly and another DD adaptor, BICD2, arguing against a common mode of regulation for all adaptors. This needs to be clarified.

      Minor concerns:

      • In Fig.2D, full length Spindly does not bind DD in vitro. This is most likely to occur because Spindly N-terminus adopts a compacted conformation and hinders the access to DD-binding motifs. In Fig.2B, the authors show a structural prediction for Spindly 1-275 which should adopt a more elongated shape. According to prevailing model, this construct should now be able to bind DD in a similar biochemical assay.
      • In Gama et al 2017, LIC1 was able to pull down a wild-type N-terminal Spindly construct. How do the authors reconcile this with the data presented in this manuecript?
      • The section where the authors test point mutations to open Spindly ("Opening up Spindly with point mutations") should be better contextualized. The transition is difficult to follow as it is.
      • In the text, it is not clear whether Mps1 kinase is required to promote RZZ oligomerization in the presence of Spindly chimera, an uninhibited Spindly mutant. According to the model, this mutant construct should drive oligomerization independently of Mps1 (as the N-terminal deletion construct from Sacristan et al 2018).
      • The nomenclature the authors adopt for the CC1 second conserved motif (SCM) and for the Spindly motif (SM) can be confusing at some point when identifying each mutant in the text and figures. Nomenclature should be standardized.
      • In Fig.6A, mCh-Spindly 33-605 and mCh-Spindly chimera lines have the same color.

      Significance

      The work represente a significant advance in the field and it would be of interest for a wide range of audiences.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The study by d'Amico et al. presents an in-depth analysis of how intramolecular folding of the coiled-coil adaptor Spindly regulates its interaction with the motor dynein and its obligatory co-factor dynactin. Using biochemical reconstitution and diverse biophysical approaches (including cross-linking mass spectrometry, X-ray crystallography, AF2-based structure prediction, size exclusion chromatography, and analytical ultracentrifugation), the authors uncover and dissect an intricate Spindly autoinhibition mechanism. At kinetochores Spindly is known to co-oligomerize into filaments with the RZZ complex (its kinetochore receptor/cargo), which drives expansion of the outermost kinetochore region (the corona). Here the authors show that Spindly is a dimer in solution and that successive coiled-coil segments interact with each other in an asymmetric 'closed' conformation that is unable to form a complex with dynein and dynactin. Specifically, a 2-residue insertion in the middle of Spindly's first coiled-coil (CC1) creates a kink that allows CC1 to fold back on itself, which has two important structural consequences: it brings a key segment in CC2 (residues 276-309) in contact with a CC1 region called the CC1 box (previously shown to bind dynein light intermediate chain), and it blocks a motif at the beginning of CC2, called the Spindly motif, from accessing the pointed end complex that caps dynactin's minifilament. Mutations in either the CC1 box, the CC1 2-residue insertion, or the CC2(276-309) segment, 'open up' full-length Spindly and promote its interaction with the dynactin pointed end complex and, in case of the latter two types of mutants, with dynein light intermediate chain. CC1 box-deficient Spindly and the CC2 segment mutant (which corresponds to two charge-inverting point mutations) also support complex formation of Spindly and intact dynein-dynactin. Interestingly, while the CC2 mutant can bind to RZZ, the interaction between RZZ and wild-type Spindly is insufficient to make Spindly competent for dynein-dynactin binding (even when RZZ-Spindly are phosphorylated by mitotic kinases). The authors therefore propose that releasing Spindly from autoinhibition requires an additional trigger at the kinetochore, which likely involves an interaction between the Spindly CC2(276-309) segment and an as yet unidentified kinetochore component. The CC2 mutant is also shown to be defective in kinetochore recruitment and in Spindly-RZZ filament formation in vitro, suggesting kinetochore recruitment of Spindly is coupled to kinetochore expansion through a mechanism involving CC2(276-309).

      The experiments are of excellent technical quality and the results are presented in a logical and concise manner. There is clarity in the writing (the introduction deserves particular praise), and the authors' conclusions are fully supported by the data. Although there is no direct structural evidence for Spindly's closed conformation, as the authors themselves are careful to point out, the numerous Spindly mutants that are characterized (only some of which are mentioned in the summary above) in aggregate make a convincing case for the proposed autoinhibition mechanism.

      Minor comments:

      • Page 5: "605-residue adaptor Spindly". State that "605-residue" refers to the human protein.
      • Page 88: "The region of Spindly downstream of the Spindly box (residues 281-322) is very conserved among Spindly orthologues, but not among other members of the BICD adaptor family (Figure 1 - Supplement 1L)." This is not very obvious from the alignment shown in the figure.
      • Page 13: "...(A23V-A24V) mutant, which has been previously shown to inhibit the interaction with the LIC2 in a similar assay (Gama et al., 2017)." The LIC isoform used in the referenced study was LIC1.
      • Figure 5J: Information about statistical significance should be added.
      • Figure 7B - D: Red on black is not an ideal color choice for these graphs.
      • Page 15: When discussing the recently discovered interphase functions of Spindly, also cite Clemente et al. (2018; doi:10.3390/jdb6020009) and Conte et al. (2018; doi:10.1242/bio.033233).
      • Page 17: "Evidence supporting this idea is that mutations in the 276-306 region, including the deletion of this entire fragment or the introduction of charge-inverting point mutations at residues 295 and 297 respectively abolish or largely decrease the kinetochore recruitment of Spindly ((Raisch et al., 2021) and this study),...". Sacristan et al. (2018) should also be cited in this context, as this study established the importance of residues 274-287 for Spindly recruitment to kinetochores.
      • Page 17: "In vitro, the 276-306 region is also required for the assembly of RZZ-Spindly filaments (this study and (Raisch et al., 2021))." It could also be mentioned here that residues 274-287 of Spindly are necessary for RZZ-Spindly filament formation in cells, as shown by Sacristan et al. (2018).
      • Page 17: "Plausibly, the solution to this conundrum will require biochemical reconstitutions addressing the spectrum of interactions that this protein establishes at the kinetochore." Presumably, "this protein" refers to Spindly, but this is not clear since the subject of the preceding sentence is RZZ.

      Significance

      Cargo transport by cytoskeletal motors must be tightly regulated to establish and maintain intracellular organization and for faithful execution of development, including cell division. Much of this regulation occurs at the motor-cargo interface but remains poorly understood at the molecular level. In recent years it has become clear that adaptor proteins not only provide a physical link between motors and their cargo but also participate in motor activation. Adaptor-coupled activation is particularly important for dynein, because adaptors promote dynein's interaction with its essential co-factor dynactin.

      BICD2 (along with other Bicaudal D proteins) is the most intensely studied dynein adaptor and has long been known to be subject to autoinhibition with regard to dynein-dynactin binding, which is relieved by cargo binding to the BICD2 C-terminal region. A important question has been whether the same regulatory logic applies to other dynein adaptors. The study by d'Amico et al. presents the first evidence that conformational inhibition extends to adaptors other than Bicaudal D proteins. The study also reveals that Spindly's autoinhibition mechanism is more complex than that of BICD2. This likely reflects Spindly's dual function in dynein-dynactin recruitment and kinetochore expansion. The results of d'Amico et al. suggest that the Spindly autoinhibition mechanism has evolved to coordinate the two processes, and this idea is further supported by a recent study on the RZZ-Spindly interaction from the same group (Raisch et al. 2021; doi:10.1101/2021.12.03.471119). One of the most important insights from d'Amico et al. is that there must be another binding partner of Spindly at kinetochores besides the RZZ complex that participates in the relief of Spindly autoinhibition. The study has therefore identified an important future research direction. It will be interesting to investigate whether additional adaptors follow the multi-step activation model proposed here for Spindly.

      Regarding the technical aspects, the study illustrates that AF2-based structure prediction is a powerful tool for investigating conformational regulation, and it introduces an important innovation: the ability to generate recombinant human dynactin opens the door to the engineering of dynactin mutants, which promises to accelerate mechanistic dissection of this essential dynein co-factor.

      In conclusion, the study represents a significant step forward in our understanding of how dynein-cargo interactions are regulated by adaptor proteins and is therefore of general interest for researchers studying the molecular mechanisms of chromosome segregation as well as intracellular transport.

      Reviewer expertise keywords: same as the keywords of this manuscript.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this article, Amico et al. explore how Spindly self-regulates its interaction with Dynein-Dynactin. They propose that Spindly adopts an auto-inhibited, closed conformation that blocks the CC1 box and Spindly motif, preventing its interaction with dynein-dynactin. The authors used a combination of X-ray crystallography, biochemistry, and structure predictions to detail the intramolecular interactions in Spindly that mediate this closed state. They then use analytical SEC to test their proposed auto-inhibition mechanism by monitoring Spindly binding to the pointed end complex. They suggest that auto-inhibited Spindly is unable to bind Dynein-Dynactin regardless of the presence or absence of Spindly's cargo, the RZZ complex. In contrast, by using mutagenesis to prevent this auto-inhibition, the authors show that uninhibited Spindly can interact with members of the Dynein-Dynactin complex. Finally, they use cellular experiments to show that relieving autoinhibition prevents the proper localization of Spindly and Dynein-Dynactin to kinetochores during mitosis, likely due to the formation of ectopic Spindly-Dynein-Dynactin complexes in these cells.

      This is an interesting paper that provides important insights into the mechanism of Spindly regulation and its associations with its interacting partners. However, additional work is necessary to support some of their conclusions. In addition, the text is at times quite dense and harder to follow, which prevents their findings as being impactful as could be possible for the bigger picture paradigms of kinetochore function.

      Major Points:

      1. The crosslinking and mass photometry experiments are done at highly differing concentrations (5 μM vs. 10 nM). The mass photometry should be performed at the same concentration as the crosslinking experiments to determine if Spindly forms a higher order oligomer at the higher concentration. These results will aid in the interpretation of the crosslinking mass spectrometry experiments, as the observed interactions could be intermolecular contacts rather than intramolecular contacts if Spindly is tetrameric at these concentrations, as is suggested in figure 4E for specific Spindly constructs.
      2. In figure 2, more conclusive evidence is needed to show that full length Spindly does not form a complex with Dynein-Dynactin. My interpretation of the gels in figure 2D suggests that full length Spindly does form a complex with Dynein-Dynactin, as in the final gel (red outline) it looks as if full length Spindly is indeed peaking with the rest of the Dynein-Dynactin proteins, albeit with excess Spindly eluting later. Figure legends containing protein concentrations used in SEC assays would aid in the interpretation of this data. To conclusively show that full length Spindly doesn't form a complex with Dynein-Dynactin, additional assays will be necessary, such as pull-down assays, or mass photometry.
      3. In figure 3C, 3E, and figure 5C, there is a shift in the PE peaks in the presence of Spindly, but it isn't clear why doesn't the complex doesn't elute earlier than Spindly alone. If the complex is dissociating on the column, additional assays are necessary to confirm that these Spindly constructs stably interact with PE. If this shift is also accompanied by a major change in shape, thus allowing Spindly to elute later than it does alone, this needs to be explored or explained further.
      4. The authors should provide better a rationale for why the pointed-end complex is used in figure 3 in lieu of the complex used in figure 2.
      5. In Figure 5I, WT Spindly also binds to LIC, although less WT Spindly is bound to LIC than Spindly CC2* or Spindly deltaRV. This should be addressed in the text.
      6. The authors claim that the mechanism they describe may be a paradigm for dynein activation by other adaptors at various cellular locations, but they aren't able to identify a mechanism for how Spindly converts from its auto-inhibited state to its permissive state. A more thorough examination of this mechanism is necessary to claim that this mechanism could be paradigmatic, or a revision of the text is needed.

      Minor Points:

      1. The manuscript could benefit from careful review of the text, captions, and figures, as a few minor typos and inconsistencies in the figures and text were present.
      2. The list of common structural and functional features of Dynein-Dynactin adaptors could be indicated more clearly.
      3. Several times the authors use alpha fold predictions to confirm their data. Although the predictions support several of their conclusions, saying that predictions can confirm the data is an overstatement.
      4. Figure 1H would be improved by the addition of the amino acid numbers in the domain diagram.
      5. Concentrations used for each protein for the analytical SEC experiments should be listed in the figure or caption.
      6. In addition to the caption, it would be helpful to the reader to indicate which experiments use farnesylated Spindly.
      7. Error bars are missing from the WT sample in figure 5J. This figure would benefit from statistical analysis.

      Significance

      This paper builds on recent work from the Mussachio lab and others exploring the nature of the fibrous corona at kinetochores and the molecular basis for dynein recruitment. This paper is focused on the structural nature of the interactions that underlie Spindly recruitment to kinetochores and its interactions with dynein and other factors. Although reductionist in its approach, this paper has the potential to have broad implications for thinking about the control of corona assembly and dynein recruitment with an elegant auto-regulation of Spindly. Researchers interested in cell division, chromosome segregation, kinetochore function, dynein regulation, and the structural basis for core cellular processes should be interested in this paper.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): **Summary:** Techniques to probe the local environment of membrane proteins are sparse, although the influence of lipids on the membrane protein's function are known since many years. Therefore, the paper by Umebayashi et al. is important. The environment-sensitive dye Nile red (NR) coupled to a membrane protein is an appropriate sensor for monitoring the local membrane fluidity. Linking of Nile red to the receptor via a flexible tether was achieved with the acyl carrier protein (ACP)-tag method. Experiments showed that depending on the ACP site a certain linker length is required to have NR inserted in the membrane and thus be an effective sensor for lipid disorder. This technology could be of general usability to study the environment of membrane proteins in the context of their function. As an example, the technique allowed insulin induced membrane disorder in the close insulin receptor vicinity to be observed. Further, results suggested that tyrosine activity is required for this disorder to happen. The experimental results appear to be complete and controls were made.

      **Major comments:** 1) Sometimes technical terms are used without explanation: What is the GP value? What is ACP-IR? The spectrum was measured in number of rois? The reader can find those abbreveations out, but it would be nice to have them defined.

      We have made a list of abbreviations.

      2) Fig. 1d) is confusing. The ACP-IR labelling is evident in 3 panels, but there is no difference in the color (emission spectra of 1992-ACP-IR vs 2031-ACP-IR should be visible??). The DAPI staining is very different. When doing the latter, how difficult is it to get the staining equal?

      The differences in spectra cannot be seen because we used pseudo colors for display of the DAPI and CoA-PEG-NR staining. The reviewer’s comments about the unequal DAPI staining is correct. The reason for this is most likely that the cell membrane is unequally permeabilized by PFA treatment. As the point of this figure is just to show that the plasma membrane is labeled, dependent upon the expression of the ACP-tagged insulin receptor, we don’t think that the variable intensities of the DAPI staining is important. DAPI is simply used to indicate the position of the cells.

      3) How can one interpret Fig. 4: a) Control goes over 4 frames, at 240" insulin is added, and 10 frames should show a fluctuation difference?

      We showed 4 frames after control treatment that showed no significant change was observed by control treatment. We expected that clear changes would be invoked by insulin treatment in GP images, however these changes, while visible in the GP images, are difficult to see for the untrained observer. This is the reason why we used the ZNCC method in the subsequent figures to better visualize the changes.

      1. b) A color shift from blue to green is visible after insulin addition. But it is faint - difficult to assess from the pseudo color scheme. What does 1000 pixel top/1000 pixel bottom mean in c). Is it an attempt to better visualize the fluctuation? It is difficult to recognize a difference before and after adding insulin. d) It seems that the kymograph set should show this. What is the color scale? Why is 3 so untypical, i.e., no change? Box 6 is also peculiar: the left side does not show a strong change upon insulin administration, the right side does. Why? We appreciate the helpful comments for improving our manuscript.

      As pointed out, the change of GP value is extremely small before and after insulin addition, so it is difficult to fully visualize the change with normal pseudo-color expression. To deal with this, we adopted the following two methods to visualize minute changes.

      1) Visualization of local changes of the statistical GP value showed by ZNCC throughout the time-lapse images (Fig. 6 and Fig. S2B).

      2) Visualization of the top/bottom 1000 pixels of the sorting ZNCC value in each image (Fig. 7 and Fig. S2C). The top 1000 pixels are the ones that showed the largest changes. The bottom 1000 pixels are the ones that showed the smallest changes.

      Owing to these expressions, we found out that the level of the response against the insulin signal was spatially and temporally heterogeneous in the membrane.

      As for the color scale, in order to clarify the meaning of the difference of color, we have added the description about the relationship between the color and the ZNCC value in the results section.

      4) How is the kymogram calculated? The legend says 'The horizontal dimension represents the averaged ZNCC inside the rectangular area, and the vertical dimension represents time'. The averaged ZNCC is a single value, so it is not clear why the kymogram shows a variation from left to right. May it be the ZNCC was averaged just vertically?

      We apologize that we did not provide information regarding making the kymograph.

      In the yellow rectangular area (Fig. 6B), the ZNCC values of the pixels with the same x coordinate value were vertically averaged, which were represented as the horizontal direction of the kymograph. That is, one horizontal line of the kymograph holds the spatial distribution of the ZNCC value along the horizontal direction of the membrane, and the vertical direction shows their time changes. To make it easier to understand, we refined the description about the kymograph in the legend of Fig. 6.

      5) When calculating cross-correlation values on images, they need to be aligned. What fraction of the total image does the selected 19x19 box represent? As described, I imagine that a rolling CC over 19x19 pixels is calculated over an image from the time lapse series comparing it with the reference Iave(x,y). Compared to the 3x3 median filtered CP image, the ZNCC image should then be much more blurred??

      Below we provide more information regarding the calculation of ZNCC.

      Each local window for ZNCC calculation is set to a 19x19 pixels centered on every single pixel excluding the edges of an image. The ZNCC value calculated in that window is set to a center pixel of that area. After that, a new window centered on the adjacent pixel is set and calculate the new ZNCC. That is, the calculation window is slid throughout the image. Also, the calculated ZNCC value is not set to all the pixels of the window, but is set to only the center pixel of the window, so there is no blur effect like median filtering.

      The figure below shows a schematic view of our ZNCC calculation.

      Schematic view of our ZNCC calculation

      **Minor comment:** On page 16 supplementary is not spelled properly.

      corrected

      Reviewer #1 (Significance (Required)):

      The key point of this paper is convincing and the new technology appears to have a lot of potential. It can be applied to study membrane protein function in the context of its environment, the lipid bilayer.

      Membrane fluidity measurements have been developed (e.g., using fluorescent probes like laurdan). However, the trick to link a probe like nile red by ACP technology to the insulin receptor and to observe its activity is quite new.

      A most recent description of such a technology is in TrAC Trends in Analytical Chemistry Volume 133, December 2020, 116092.

      This is an interesting review, but not directly impacting on our work.

      **Referees cross-commenting**

      All comments are constructive and important. The paper is important but needs to be amended as proposed.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): **Summary:** In this manuscript, authors generated an ACP-attached Nile Red probe in order to specifically label Insulin receptor in the membrane. Owing to this specificity, one can measure the lipid membrane properties around a specific protein in the membrane. **Major comments:**

      For the conclusions in the manuscript to be convincing, in my opinion, these additional data need to be added. Some of these are new experiments, and some are detailed analysis of existing data. The new experiments are not for new line of investigation, instead it is to confirm their statements and conclusions. The major point is the reliability of spectral shift. In usual environment sensitive probes, it is certain that they are in the membrane whatever is done to the membrane. However, when the probe is attached to a protein, it is not trivial to have the same confidence that the probe is always inside the membrane, and it is in the same plane of the membrane. 1992-ACP-IR is a good example; authors state that it binds to the protein outside the membrane, but when there is cholesterol addition and -maybe more interestingly- cholesterol removal, the dye still reacts and changes its emission (even PreCT changes its emission quite a bit at the 570 nm region). This is a clear indication of a change in localization of the probe upon some changes in the membrane. This implies that observed spectral shifts may not be due to lipid packing differences, but due to localization of the probes. For this reason, it is crucial to know where any environment sensitive probe localize in the membrane with respect to membrane normal, and this knowledge is more important for this probe. Related to this, the spectral difference upon insulin treatment and activation of insulin receptor could be due to changes in probe's localization in the membrane. Especially because authors show in Fig1e, the spectra can change depending on the probe localization. Relatedly, quantum yield of NR should be significantly different when it is inside vs outside membrane. Authors should show QY for 1992-ACP-NR and 2031-ACP-NR with different PEG lengths and upon insulin treatment.

      We understand the logic of the request to measure the QY, since the QY of Nile red is much higher in organic solvents than in aqueous solutions, so it might be predicted that the QY of Nile red is higher in a lipid bilayer than when covalently bound to the protein in an aqueous environment. However, this argument depends upon the mechanism for the increase in quantum yield when going from aqueous to a non-polar solution. One possible explanation is based on the intrinsic properties of the dye under the two conditions. The alternative explanation would be that the dye would aggregate (be insoluble) in aqueous solution and therefore either not fluoresce or self-quench. In this case, we believe that the latter is the explanation because we and others have previously shown the turn-on properties of the probe when binding to proteins (SNAP-tag and others). It is not simple to measure QY in the cell under a microscope, but we have done something similar shown in supplementary figure 4. We labeled the three ACP-receptor complexes with PEG11-Nile red and co-stained with antibody to the Insulin Receptor. We then calculated a relative quantum yield. There were very little differences at all between the relative quantum yields, so we conclude that it is not the environment of the probe, which affects the quantum yield under these conditions, but the fact that it is covalently attached to a protein and incapable of forming aggregates. What distinguishes these constructs is the emission spectrum, not the quantum yield. In supplementary Table 2 we also did QY measurements in vitro and we could reproduce the increase of quantum yield by association with liposomes or in organic solvents. We tested whether non-covalent association with a protein would increase the QY by incubation with the lipid binding protein, BSA, in PBS. This was not the case, strongly pointing to the conclusion that it is the covalent association with the protein that increases the QY, not association with a protein. We believe that our demonstration of changes in fluorescent spectra with changes in cholesterol, large changes in fluorescent spectra with linker length for the 1992 construct and voltage sensitivity using patch-clamp prove that the Nile red is reporting on the membrane environment under the conditions we propose.

      **Minor comments:** - Fig 1d requires quantification We do not agree on this. This is simply to show that the labeling is dependent upon expression of the relevant ACP-IR constructs. There is no detectable labeling of the control.

      • Voltage sensitivity of different PEG length of 2031-ACP probe should be added. We have added this data in figure 2 panel E.

      • Fig 3a graph should show all data points, not only bar graphs. Also, the band in 3a for +CoA-PEG-NR is dimmer than other bands, is it specific to this particular gel since quantification does not show any difference?

      There is no significant difference- Fig 4d, colour code is needed.

      Done

      • Fig 5b and Fig3d are basically the same experiments in terms of control measurement, why is the difference in 3b is 0.04 GP unit while it is 0.007 GP unit?

      We explain in the MS, but have improved the title of Y-axis in Fig.5 b graph so that the difference in what is plotted is clear. - Why is inhibitor data so noisy? We should discuss.

      We don’t know the exact reason why inhibitor data is noisy, but we speculate that the actin cytoskeleton and phosphoinositide-dependent signaling could affect the membrane stability, and the membrane environment would be fluctuated in the presence of latrunculin B or PI3K inhibitor.

      Reviewer #2 (Significance (Required)): Overall, this is a very useful approach, and this line of research will yield very useful tools to shed light on how lipids surrounding proteins can change their function. Major advance of the paper is the new chemical biology tool. There is also biological data on how insulin can change the insulin receptor's membrane environment which is contradictory to some old literature claiming that InsR becomes more "rafty" upon insulin treatment (e.g., PMID: 11751579).

      If this type of tagging proves robust and reproducible (limitations and concerns listed above and below), it could be used by other researchers to tag their protein of interest and investigate the lipid environment around those proteins.

      The downside of this method is that the probe requires ACP tag, a relatively less used tag than others in biology, therefore researchers interested in using this probe should have their proteins with ACP tag. Moreover, the linker length and ACP-tag position are quite crucial parameters (and probably should be optimized for each protein). Longer PEG lengths cannot report on changes efficiently (Fig3b), while shorter lengths are prone to artefacts as they can go out of membrane (Fig1 and Fig2). This might limit its widespread use.

      The reason for using the ACP tag is that neither the SNAP tap nor the HALO tag working. The tethered Nile Red preferred to bind to the tqg rather than inserting into the membrane.

      **Referees cross-commenting** I agree with all comments and concerns of other reviewers. I see the usability and potential of this new technology along with its limitations as all three reviewers pointed out.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): See below. No concerns on any of these issues.

      Reviewer #3 (Significance (Required)): **Critique:** This MS reports a proof-of-principle for using site-directed environmentally sensitive probe technology to assess the local membrane environment of a receptor tyrosine kinase (IR) upon activation. This technology addresses a major gap in our arsenal of tools to study the mechanisms of membrane signaling as the parameters of interest are biophysical parameters rather than purely biochemical ones. How to do this with spatial and temporal resolution is a major challenge. This study builds on previous work by the Riezman group that develops an extrinsic labeling system to tether Nile Red to specific sites on the ectodomain of a signaling receptor and then probe local membrane environments as a function of receptor activity. This is a carefully done study is well-controlled, is clever in design and is well-described. Although the major issues to which such a general technology could contribute involve intracellular (and not extracellular) event, the advances described will be of general interest -- particularly that local membrane order decreases when IR becomes activated. Specific comments for the authors' consideration follow:

      **Specific Comments:** (i) As a general comment, the authors are measuring extracellular plasma membrane leaflet properties that may or may not translate to what is happening in the local inner leaflet environment. A general reader may well miss the significance of this. This point needs to be more explicitly emphasized in the Discussion.

      This has been discussed in the revised version.

      (ii) Why not treat cells with a PLC inhibitor to block PIP2 hydrolysis and ask if that inhibits membrane disorder. It is PIP2 hydrolysis/resynthesis that regulates the actin cytoskeleton at signaling receptors and this seems an attractive candidate for study.

      There is a long list of attractive post-signaling events of the insulin receptor and how this works in different cell types that could be tested. We believe that this is beyond the scope of this study and we encourage others to do this.

      (iii) The data acquisition time is at least 4 min which is long enough for activated receptors to be recruited to sites of endocytosis. Can the authors exclude the possibility that what they are measuring isn't reflective of such spatial reorganization? Does a clathrin inhibitor block the observed change in local membrane order for activated IR? We determined localization to AP2 adaptor containing clathrin coated pits at the cell surface and showed that during the time-course of the experiment that there is no significant change in co-localization or evidence for endocytosis (new figure 9). Therefore, we decided not to do the clathrin inhibitor blocking experiment because we believe that it could only lead to indirect effects.

      (iv) Receptor activation is accompanied by other transitions such as dimerization, etc. Can the authors exclude the possibility that what they are measuring is related to changes in depth of insertion of the NR probe into the plasma membrane outer leaflet that is a consequence of IR conformational transitions associated with activation? This is highly unlikely given the fact that fluidification of the membrane environment is found with all length linkers. Given the intervals in increases in linker length on the 2031 construct, which is the closest to the membrane, it is very difficult to conceive that any of the ones larger than 5 PEGs restrict significantly the membrane insertion of the dye. **Referees cross-commenting**

      I think we have a consensus opinion

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      See below. No concerns on any of these issues.

      Significance

      Critique:

      This MS reports a proof-of-principle for using site-directed environmentally sensitive probe technology to assess the local membrane environment of a receptor tyrosine kinase (IR) upon activation. This technology addresses a major gap in our arsenal of tools to study the mechanisms of membrane signaling as the parameters of interest are biophysical parameters rather than purely biochemical ones. How to do this with spatial and temporal resolution is a major challenge. This study builds on previous work by the Riezman group that develops an extrinsic labeling system to tether Nile Red to specific sites on the ectodomain of a signaling receptor and then probe local membrane environments as a function of receptor activity.

      This is a carefully done study is well-controlled, is clever in design and is well-described. Although the major issues to which such a general technology could contribute involve intracellular (and not extracellular) event, the advances described will be of general interest -- particularly that local membrane order decreases when IR becomes activated. Specific comments for the authors' consideration follow:

      Specific Comments:

      (i) As a general comment, the authors are measuring extracellular plasma membrane leaflet properties that may or may not translate to what is happening in the local inner leaflet environment. A general reader may well miss the significance of this. This point needs to be more explicitly emphasized in the Discussion.

      (ii) Why not treat cells with a PLC inhibitor to block PIP2 hydrolysis and ask if that inhibits membrane disorder. It is PIP2 hydrolysis/resynthesis that regulates the actin cytoskeleton at signaling receptors and this seems an attractive candidate for study.

      (iii) The data acquisition time is at least 4 min which is long enough for activated receptors to be recruited to sites of endocytosis. Can the authors exclude the possibility that what they are measuring isn't reflective of such spatial reorganization? Does a clathrin inhibitor block the observed change in local membrane order for activated IR?

      (iv) Receptor activation is accompanied by other transitions such as dimerization, etc. Can the authors exclude the possibility that what they are measuring is related to changes in depth of insertion of the NR probe into the plasma membrane outer leaflet that is a consequence of IR conformational transitions associated with activation?

      Referees cross-commenting

      I think we have a consensus opinion

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, authors generated an ACP-attached Nile Red probe in order to specifically label Insulin receptor in the membrane. Owing to this specificity, one can measure the lipid membrane properties around a specific protein in the membrane.

      Major comments:

      For the conclusions in the manuscript to be convincing, in my opinion, these additional data need to be added. Some of these are new experiments, and some are detailed analysis of existing data. The new experiments are not for new line of investigation, instead it is to confirm their statements and conclusions. The major point is the reliability of spectral shift. In usual environment sensitive probes, it is certain that they are in the membrane whatever is done to the membrane. However, when the probe is attached to a protein, it is not trivial to have the same confidence that the probe is always inside the membrane, and it is in the same plane of the membrane. 1992-ACP-IR is a good example; authors state that it binds to the protein outside the membrane, but when there is cholesterol addition and -maybe more interestingly- cholesterol removal, the dye still reacts and changes its emission (even PreCT changes its emission quite a bit at the 570 nm region). This is a clear indication of a change in localization of the probe upon some changes in the membrane. This implies that observed spectral shifts may not be due to lipid packing differences, but due to localization of the probes. For this reason, it is crucial to know where any environment sensitive probe localize in the membrane with respect to membrane normal, and this knowledge is more important for this probe. Related to this, the spectral difference upon insulin treatment and activation of insulin receptor could be due to changes in probe's localization in the membrane. Especially because authors show in Fig1e, the spectra can change depending on the probe localization. Relatedly, quantum yield of NR should be significantly different when it is inside vs outside membrane. Authors should show QY for 1992-ACP-NR and 2031-ACP-NR with different PEG lengths and upon insulin treatment.

      Minor comments:

      • Fig 1d requires quantification
      • Voltage sensitivity of different PEG length of 2031-ACP probe should be added.
      • Fig 3a graph should show all data points, not only bar graphs. Also, the band in 3a for +CoA-PEG-NR is dimmer than other bands, is it specific to this particular gel since quantification does not show any difference?
      • Fig 4d, colour code is needed.
      • Fig 5b and Fig3d are basically the same experiments in terms of control measurement, why is the difference in 3b is 0.04 GP unit while it is 0.007 GP unit?
      • Why is inhibitor data so noisy?

      Significance

      Overall, this is a very useful approach, and this line of research will yield very useful tools to shed light on how lipids surrounding proteins can change their function. Major advance of the paper is the new chemical biology tool. There is also biological data on how insulin can change the insulin receptor's membrane environment which is contradictory to some old literature claiming that InsR becomes more "rafty" upon insulin treatment (e.g., PMID: 11751579).

      If this type of tagging proves robust and reproducible (limitations and concerns listed above and below), it could be used by other researchers to tag their protein of interest and investigate the lipid environment around those proteins.

      The downside of this method is that the probe requires ACP tag, a relatively less used tag than others in biology, therefore researchers interested in using this probe should have their proteins with ACP tag. Moreover, the linker length and ACP-tag position are quite crucial parameters (and probably should be optimized for each protein). Longer PEG lengths cannot report on changes efficiently (Fig3b), while shorter lengths are prone to artefacts as they can go out of membrane (Fig1 and Fig2). This might limit its widespread use.

      Referees cross-commenting

      I agree with all comments and concerns of other reviewers. I see the usability and potential of this new technology along with its limitations as all three reviewers pointed out.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Techniques to probe the local environment of membrane proteins are sparse, although the influence of lipids on the membrane protein's function are known since many years. Therefore, the paper by Umebayashi et al. is important. The environment-sensitive dye Nile red (NR) coupled to a membrane protein is an appropriate sensor for monitoring the local membrane fluidity. Linking of Nile red to the receptor via a flexible tether was achieved with the acyl carrier protein (ACP)-tag method. Experiments showed that depending on the ACP site a certain linker length is required to have NR inserted in the membrane and thus be an effective sensor for lipid disorder. This technology could be of general usability to study the environment of membrane proteins in the context of their function. As an example, the technique allowed insulin induced membrane disorder in the close insulin receptor vicinity to be observed. Further, results suggested that tyrosine activity is required for this disorder to happen. The experimental results appear to be complete and controls were made.

      Major comments:

      1) Sometimes technical terms are used without explanation: What is the GP value? What is ACP-IR? The spectrum was measured in number of rois? The reader can find those abbreveations out, but it would be nice to have them defined.

      2) Fig. 1d) is confusing. The ACP-IR labelling is evident in 3 panels, but there is no difference in the color (emission spectra of 1992-ACP-IR vs 2031-ACP-IR should be visible??). The DAPI staining is very different. When doing the latter, how difficult is it to get the staining equal?

      3) How can one interpret Fig. 4: a) Control goes over 4 frames, at 240" insulin is added, and 10 frames should show a fluctuation difference? b) A color shift from blue to green is visible after insulin addition. But it is faint - difficult to assess from the pseudo color scheme. What does 1000 pixel top/1000 pixel bottom mean in c). Is it an attempt to better visualize the fluctuation? It is difficult to recognize a difference before and after adding insulin. d) It seems that the kymograph set should show this. What is the color scale? Why is 3 so untypical, i.e., no change? Box 6 is also peculiar: the left side does not show a strong change upon insulin administration, the right side does. Why?

      4) How is the kymogram calculated? The legend says 'The horizontal dimension represents the averaged ZNCC inside the rectangular area, and the vertical dimension represents time'. The averaged ZNCC is a single value, so it is not clear why the kymogram shows a variation from left to right. May it be the ZNCC was averaged just vertically?

      5) When calculating cross-correlation values on images, they need to be aligned. What fraction of the total image does the selected 19x19 box represent? As described, I imagine that a rolling CC over 19x19 pixels is calculated over an image from the time lapse series comparing it with the reference Iave(x,y). Compared to the 3x3 median filtered CP image, the ZNCC image should then be much more blurred??

      Minor comment:

      On page 16 supplementary is not spelled properly.

      Significance

      The key point of this paper is convincing and the new technology appears to have a lot of potential. It can be applied to study membrane protein function in the context of its environment, the lipid bilayer.

      Membrane fluidity measurements have been developed (e.g., using fluorescent probes like laurdan). However, the trick to link a probe like nile red by ACP technology to the insulin receptor and to observe its activity is quite new.

      A most recent description of such a technology is in TrAC Trends in Analytical Chemistry Volume 133, December 2020, 116092.

      Referees cross-commenting

      All comments are constructive and important. The paper is important but needs to be amended as proposed.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      First we would like to express our deep gratitude to the reviewers for thoroughly and fairly reviewing our work.


      Reviewer #1:

      Major Concerns

      1. A major concern I have is with the use of DAPT to modulate Notch signaling, and investigate the impact on integrins, Yap, cadherins, etc. Gamma-secretase, the target of DAPT, cleaves not only Notch receptors, but also IntegrinB1, Nectins, Cadherins, Ephrins and more. This recent review lists 149 substrates (Guner & Lichtenthaler Seminars in Cell & Developmental Biology 2020). The risk that some of the results reflect DAPT impact on IntegrinB1, Cadherins etc themselves is significant. The authors should validate their findings with more specific modulation of Notch activity, for example with a Notch blocking antibody, with siRNA, or with SAHM1. We agree with the reviewer´s comment and will add additional key experiments using SAHM1 as alternative inhibitor of Notch activity.

      Furthermore, EGTA was used to "acutely destabilize VE-Cadherin". But EGTA chelates Calcium, which is essential for Notch structure, and EGTA is thus a well-known activator of Notch signaling (see eg Rand MD et al. (2000) Calcium depletion dissociates and activates heterodimeric notch receptors. Mol Cell Biol). The authors rightfully describe and cite this paper, but the use of EGTA nonetheless confounds interpretation. The authors check for NICD levels (at what timepoint?) but the staining is cytoplasmic (also not labelled in the figure per se, but described in the figure legend? - please label the staining in the panel). And in any case, NICD is very short-lived and nuclear staining cannot be taken as a hallmark of signaling activity. In particular if staining is performed at a time point at which the receptor and NICD may have been exhausted/depleted. The authors should validate these observations/conclusions with the Notch reporter to conclusively demonstrate whether EGTA does not activate Notch in their system.

      To test whether transient treatment with EGTA causes Notch activation we will repeat this experiment with Notch reporter activity as readout.

      Trans-endocytosis of NECD on different substrates: the authors suggest that trans-endocytosis of NECD by Dll4 increases on softer substrates. But the authors also show that soft substrates lead to spreading out of cells, which could confound interpretation (is overlapping membranes, not internalization). The authors could validate trans-endocytosis by FACS: check if red Dll4+ cells contain more NECD. It is also not clear to me in this experiment whether the authors are looking at green NECD, or Notch1 full length, since they write "overlap of Notch1 and Dll4", which would not reflect trans-endocytosis but interactions at the cell surface for both cells. Please also define "overlay intensity", or explain further.

      We will validate the trans-endocytosis by flow cytometry. In addition, we describe the procedure for microscopic analysis more clearly (methods section, p 4; results section, p 17-19)

      The authors conclude their introduction with a statement that mechanosensitivity of Notch is linked to endocytosis, but their conclusion from Fig 6C was that Notch stiffness-dependence was independent of endocytosis, using the rhDll4..?

      We have now rephrased this sentence.

      • *

      Minor concerns

      1. In the introduction, the authors describe Dll3 as a Notch ligand that activates Notch signaling in trans. To my knowledge, Dll3 has only been described as a cis-inhibitor of Notch signaling. (I think this may have arisen during repeated edits of the manuscript!) This has now been corrected in the current version.

      In the introduction, the authors state that Notch1, Dll4 and Jag1 control angiogenesis, but then they only describe what Notch1/Dll4 do in the next few sentences. Perhaps one sentence to describe the role of Jag1 would help avoid the feeling of being "left hanging".

      This has now been corrected in the current version.

      Data presentation: please show all bar graphs with the individual replicates (dotplots).

      We have now changed all bar graphs into scatter plots.

      Data analysis/normalization: many graphs represent normalization of values in multiple steps which are not described in the methods/legends/results. For example, Notch reporter gene activity (Fig 1A) is Firefly divided by Renilla, and presumably normalized to the control condition at 1 (or an average of 1 for the three controls?). This is not explained. Also, it is not clear whether the data reported for the Control condition are Huvec on rhDll4 compared (normalized) to Huvec on control substrate (and similar for each other condition). What controls are included in this experiment? Please provide the full data to provide insight into the magnitude of activation by Dll4 itself. Perhaps "Control" is without rhDll4? But the bar underneath A/B implies this rhDll4 was used in all conditions.

      We have edited our manuscript accordingly to avoid these ambiguities.

      Statistics: data should be presented as means +/- standard deviation, not standard error of the mean (see for example Barde & Barde Perspect Clin Res. 2012): "SEM quantifies uncertainty in estimate of the mean whereas SD indicates dispersion of the data from mean. As readers are generally interested in knowing the variability within sample, descriptive data should be precisely summarized with SD."

      We now use SD instead of SEM.

      Statistics: In the Methods section, the authors state that one-way ANOVA was followed by Dunnett's multiple comparison test, and two-way ANOVA was followed by Tukey's multiple comparison test. Dunnett is used to compare every mean to a control mean, while Tukey is used to compare every mean with every other mean. Fig 1 describes using Dunnett for Fig 1B, but the end of the legend days Tukey was used. However Fig 1A,C show internal pairwise comparisons to plastic. Please be sure to explain which statistics were used where, and why, and if plastic was set as the comparator, please be explicit about this. Fig 3 uses "Sidak's corrected two-way ANOVA" and "Sidak's multiple comparison test"? I think Sidak is a method to correct alpha or p for multiple comparisons, as stated in the first instance, but it is described why this was used here, and not in other analyses, and whether the authors then applied Tukey's post-hoc test as described in the methods section? Similar comments for Fig 6. It is counter-intuitive that the plastic -1.5kPa PDMS difference with no error-bar overlap in 1A would be 1-star significance, while the plastic-70kPa difference with almost overlapping error bars in 1B would be 4-star significance. Please check/show values. In Fig 1B Figure legend, the authors write "Data is presented in a bar plot and compared with the integrin β____1 intensities without DAPT treatment", but this is not the statistical comparison presented. Fig 3B shows a very minor difference with overlapping error bars as 3-star significance? Is this correct?

      We have checked all statistical issues and corrected where necessary. Since the sample size and variance were homogenous in all comparisons we now uniformly use ANOVA and Tukey´s multiple comparison test as post hoc to keep things simple.

      How much nuclear NICD (NICD intensity) is there in control conditions? (Control missing from Fig 1B, D).

      We will repeat the experiment and compare the NICD levels with those in non-activated cells on plastic.

      A DAPI counterstaining for 1B/D right panels would facilitate evaluation of whether NICD nuclear intensity is increased. The same applies for nuclear YAP assessment in Fig 3B. I assume a nuclear counter-stain was done for quantification of nuclear NICD intensity, and nuclear YAP intensity, but this is not described in the Materials and Methods, please add a description of how intensity was quantified, and provide nuclear counterstain images. (Also, what is the unit on the y-axis of "intensity" graphs? Arbitrary units (a.u.)?

      The counterstaining method with Hoechst as well as the use of the nuclear staining for quantitative analysis of images are now described in the Methods section and where needed in the figure legends. The y-axis of the intensity graphs now has a dimension (a.u.). We decided against overlay of the nuclear staining with the NICD or YAP images for graphical reasons (visibility of the respective staining).

      How much "overall" integrin B1 is there in DAPT-treated conditions in Fig 2C? (related to the concept that DAPT could be cleaving integrin B1, it could be depleted at 24 hours..?)

      We will additionally add this experiment and validate the effect of Noch inhibition on the overall intergrin level by the alternative inhibitor SAHM1

      More details regarding the analysis procedure need to be added to the Methods Section. Were cells segmented and then mean intensity estimated for the whole cell? Was this done by means of Intensity Ratio Nuclei Cytoplasm Tool plugin for Fiji alone? Were images background corrected, corrected for inhomogeneous illumination, normalized? In the case of Integrin beta 1 active, the expression seems to be patterned, was intensity expressed as mean intensity of every pixel corresponding to cytoplasm? For VE Cadherin staining, how was intensity estimated (only pixels corresponding to membrane were considered or every pixel of the cell)? Many figures are originated from a confocal microscope: were z-stacks acquired and then maximum projections done? Were z-stacks acquired and then fluorescence quantified in 3D images? Was a single plane acquired or analyzed, and if that is the case, how was this plane chosen?

      The requested information has now been inserted in the respective results and method sections.

      In Fig 4A, how is VE-Cadherin intensity quantified? As an average per field of view? Or per cell? And if per cell, how was each cell delineated? And if not per cell, how were equal cell numbers ensured? In FRAP experiment, how was intensity quantified? Was it per cell, per field of view or per region? Was each bleached region analyzed separately, or each cell? The datapoints should be either added to Figure 4C or as supplementary to assess the fitting. How many bleached regions per cell were done and how many cells were analyzed? In FRAP experiment, was bleaching done with an increased pixel dwell time? Was laser intensity increased? Do you have an estimation of laser power (not percentage) or flux?

      These issues are now described in more detail in the respective figure legend.

      Figure S2 is not referenced in the manuscript - I think a reference to "Figure S3" in the NECD transendocytosis section (no page numbers or line numbering) should be to Fig S2 instead?

      Sorry for this mistake! We corrected this now.

      In Figure 5A NICD nuclear intensity normalized somehow (normalization not explained), and stiffness no longer appears to regulate NICD levels as shown in Figure 1B.

      We have now described the normalization better in the figure legend. The difference to the results in Fig. 1B is that in Fig. 5A the cells were not activated by Dll4 sender cells or rhDll4 (endogenous Notch activity). This is now stated more clearly.

      Fig 6B: From the immuno at right there is a clear stiffness-dependent difference in Transferrin uptake. How were "single cell uptake" and "number of particles" quantified? (How were cell bodies identified?) Uptake could also be verified with FACS.

      In this point, we disagree with the reviewer: we really do not see a systematic difference in intensities between the different substrates. The process of image analysis is now better described in the figure legend. The result was so clear that we did not use FACS as complementary approach.

      Fig 6C: there appear to be very different numbers of cells in the brightfield image at right. Are the 70, 1.5, and 0.5 kPa Notch reporter activities different from one another or only different from plastic? Might these results reflect cell density/increased Notch signaling due to more cell-cell contacts?

      Unfortunately, with decreasing stiffness the PDMS gels become optically more and more cloudy, giving the false impression of a higher cell number. We tried to circumvent this by changing contrast and brightness of the images, but to no satisfying effect. We now mention this issue in the figure legend.

      How was the Dll4 coating of the different substrates done?

      The coating of the substrates is now described under a specific subheading in the Methods section.

      It would be helpful to describe the composition of Collagen G (Collagen I) in the text (it is a risk to expect vendor information to remain available indefinitely).

      The role and composition of the Collagen G coatings was included in the text (p 7). Further information on the manufacturer of the product used is included in the methods section.

      Please list catalog numbers for all reagents, and dilutions used for antibodies.

      We have added this information wherever possible.

      Instead of using red and green for images, maybe cyan, yellow and/or magenta could be used to help the reader see what is being shown (especially if the reader might be color blind).

      We will of course adhere to the respective policy of the publishing journal, once the manuscript is accepted.

      Packages and tools such as Intensity Ratio Nuclei Cytoplasm Tool plugin for FIJI should be referenced.

      We have now referenced respective tools.

      Reviewer #2:

      *Major comments: *

      Is there difference on a growth rate of cells on softer vrs stiffer gels that could affect cell morphology/signaling pathways?

      This is an important point and we will perform additional respective experiments.

      Nuclear localization of NICD and YAP would be good to validate with western blot.

      Quantification of Western Blots (especially after nuclear isolation) is – at least in our hands – much less sensitive and reliable then quantitative imaging. We do not think that this experiment would strengthen our study.

      In Figure 3 and Figure 5, siRNA experiments would strengthen the data. DAPT is not only an inhibitor of Notch but affects to other proteins as well. This should be stated.

      A similar point was raised by Reviewer#1 with the suggestion to use SAHM1 as an alternative to DAPT. As suggested we will add these experiments.

      How was the mean VE-cadherin branch length determined? This term often refers to angiogenesis assay/sprout formation and maybe another one should be considered here to describe VE-cadherin junction morphology.

      Add to all figure texts how many cells were used for the analyses*. *

      The cell number is now added wherever appropriate.

      In Fig. 6C the cell morphology of HUVECs look abnormal in comparison to other images and should be re-done.

      In contrast to all other experiments the cells where not confluent in this case. The different morphology is a sign of the lack of neighbours, not of some problem with the cells.

      Was all the data normally distributed and thus ANOVA was used? Please add more details on the statistics part. Did you remove outliers?

      Like also suggested by Reviewer #1 we have added more information on statistics and streamlined this. The data are normally distributed, outliers wer not removed.

      MTT assay of DAPT would need to be presented as it can be cytotoxic. Cells are not well visible in Fig 2C with DAPT. DAPI and F-actin staining would help to see the cell morphology.

      We will add respective data on cell viability after DAPT (and SAHM1) treatment in a revised version of the manuscript.

      Minor comments:

      Please clarify how coating with rhDDL4 is done as this was unclear at least for this reviewer.

      The coating of the substrates is now described under a specific subheading in the Methods section.

      HUVECs are known to be hard to transfect. Please provide data on transfection efficiencies of all transiently transfected cells.

      We did not systematically monitor transfection efficiencies in this context, since there was always an internal control (e.g. co-reporter in the reporter gene assay) or the data were obtained on a single cell based quantification. Generally, we yield transfection efficiencies around 30% with HUVECs.

      Reviewer #3:

      Major comments:

      • *

      1) The authors use recombinant Dll4 or Dll4-expressing ("sender") cells to activate Notch in co-cultured cells. This is per se fine however, one might over-estimate all other observed downstream effects as endogenous Notch activity is lower. It would be important to see how naïve HUVEC or other primary endothelial cells respond to changes in stiffness. qPCR of Notch target genes such as Hey1, Hey2, Hes5, Dll4 is frequently used as a readout of Notch activity in this context. Also. the Notch transcriptional reporter assay might be a suitable read-out-

      In Fig.5A we show data on endogenous Notch activity (- EGTA) on substrates with different stiffness. In this case NICD levels in the nucleus do not differ. It will definitely be interesting to repeat this experiment based on the reporter gene assay.

      2) As the authors mention in the Discussion, cell density could be of utmost importance given the fact that Notch signaling usually is assumed as an in trans signaling event between adjacent cell membranes. However, also other signaling modes (in cis, cis inhibition, JAG1 vs DLL4 ratio) might be important. As such, the authors should carefully document an report on cell density in all experiments. Secondly, the authors should use other conditions such as sparse cell density and thirdly the authors should measure transcriptional effects of stiffness on Notch ligand expression.

      In all experiments (with the exception of Fig. 6C) we used confluent cells. With the sparse cells (Fig. 6C) we also observe stiffness dependency. Investigating Notch ligand expression is definitely a good idea and will be investigated in the revised manuscript.

      3) The authors need to compare stiffness in their model with physiological conditions in developing tissues and ideally also in tumor which often have increased tissue stiffness.

      *Good point! We have now integrated such comparisons in the Discussion. *

      4) Is Notch activation due to changes in stiffness dependent on the presence of ligands or could it be that (unspecific) binding of Notch receptors to ECM could trigger cleavage just by conformational change?

      Since there is no stiffness dependent response on collagen (Fig. 6C, left panel), an effect of unspecific binding is highly unlikely.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors use different cell culture conditions to alter stiffness (DPMS model) and to measure the effect on Notch signaling and potential upstream and downstream factors. The experiments suggest that softer stiffness leads to higher Notch signaling activity in cultured endothelial cells which had been further stimulated by the Notch ligand DLL4. The data suggest that beta1 integrin activity is promoted by Notch which supports previous findings by others. Also, there is a bidirectional interaction with VE-Cadherin also supporting previous findings. This is a solid study using cultured cells only. The topic is of interest for researches investigating vascular biology, potentially also tumor vascular biology, ECM stiffness and its effect on signaling and Notch signaling per se.

      Major comments:

      • Are the key conclusions convincing? YES
      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? NO
      • Would additional experiments be essential to support the claims of the paper? YES, SEE BELOW-
      • Are the suggested experiments realistic in terms of time and resources? YES within about a six months' time period.
      • Are the data and the methods presented in such a way that they can be reproduced? YES, however, more information is needed about cell density on the plates and the DLL4 expression level on the sender cells.
      • Are the experiments adequately replicated and statistical analysis adequate? YES, however showing data points within the bar graphs would improve this study.

      • The authors use recombinant Dll4 or Dll4-expressing ("sender") cells to activate Notch in co-cultured cells. This is per se fine however, one might over-estimate all other observed downstream effects as endogenous Notch activity is lower. It would be important to see how naïve HUVEC or other primary endothelial cells respond to changes in stiffness. qPCR of Notch target genes such as Hey1, Hey2, Hes5, Dll4 is frequently used as a readout of Notch activity in this context. Also. the Notch transcriptional reporter assay might be a suitable read-out-

      • As the authors mention in the Discussion, cell density could be of utmost importance given the fact that Notch signaling usually is assumed as an in trans signaling event between adjacent cell membranes. However, also other signaling modes (in cis, cis inhibition, JAG1 vs DLL4 ratio) might be important. As such, the authors should carefully document an report on cell density in all experiments. Secondly, the authors should use other conditions such as sparse cell density and thirdly the authors should measure transcriptional effects of stiffness on Notch ligand expression.
      • The authors need to compare stiffness in their model with physiological conditions in developing tissues and ideally also in tumor which often have increased tissue stiffness.
      • Is Notch activation due to changes in stiffness dependent on the presence of ligands or could it be that (unspecific) binding of Notch receptors to ECM could trigger cleavage just by conformational change?

      Significance

      It was shown that Notch1 acts as a mechanosensor in endothelial cells. However, it is unclear how blood flow activates Notch1. Also, it is clear that stiffness influences blood vessel formation, which is under genetic control of Notch signaling. The importance of this study is to show that stiffness has a strong effect on Notch1 activation (maybe by increasing pulling force of ligands and subsequent endocytosis).

      The major limitations of this study are:

      1. work was only performed in cell culture, unclear whether there is any relevance in vivo
      2. there is an artificial (over)-activation of endothelial Notch signaling by Dll4 expressing cells. Unclear whether this reflects physiological Notch signaling activity.
      3. The mechanism how Notch1 gets activated remained elusive.
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Kretchmer et al. investigates the role of substrate stiffness on Notch signalling pathway. They show increased Notch activity on softer substrates. Transendocytosis of NECD is suggested to be regulated by the substrate stiffness. They also conclude that the softer the substrate the more integrin beta 1 is activated.

      Major comments:

      Is there difference on a growth rate of cells on softer vrs stiffer gels that could affect cell morphology/signaling pathways?

      Nuclear localization of NICD and YAP would be good to validate with western blot.

      In Figure 3 and Figure 5, siRNA experiments would strengthen the data. DAPT is not only an inhibitor of Notch but affects to other proteins as well. This should be stated.

      How was the mean VE-cadherin branch length determined? This term often refers to angiogenesis assay/sprout formation and maybe another one should be considered here to describe VE-cadherin junction morphology.

      Add to all figure texts how many cells were used for the analyses.

      In Fig. 6C the cell morphology of HUVECs look abnormal in comparison to other images and should be re-done.

      Was all the data normally distributed and thus ANOVA was used? Please add more details on the statistics part. Did you remove outliers?

      MTT assay of DAPT would need to be presented as it can be cytotoxic. Cells are not well visible in Fig 2C with DAPT. DAPI and F-actin staining would help to see the cell morphology.

      Minor comments:

      Please clarify how coating with rhDDL4 is done as this was unclear at least for this reviewer. HUVECs are known to be hard to transfect. Please provide data on transfection efficiencies of all transiently transfected cells.

      Significance

      The paper is interesting for the researchers studying angiogenesis and also cancer as the matrix stiffness regulates cancer progression.

      My expertise lies in understanding mechanisms of angiogenesis, endothelial cell function and crosstalk with other cell types of the vessel wall. My group also studies Hippo signaling and has vast experience on isolation, culturing and doing experiments on HUVECs and other types of endothelial cells.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Kretschmer and colleagues investigate the role of matrix stiffness in Notch signaling using a series of gain and loss of function experiments (over-expression and inhibitors). As read-outs they use Notch reporter assays, FRAP, transferrin uptake, and immunofluorescence analyses. The authors conclude that softer substrates potentiate Notch signaling. While the questions are interesting and important, I am concerned with the use of inhibitors with off-target or unintended effects, as listed below. There is also some information missing from Materials and Methods which makes it difficult to assess the methodology and resulting conclusions.

      Major Concerns

      1. A major concern I have is with the use of DAPT to modulate Notch signaling, and investigate the impact on integrins, Yap, cadherins, etc. Gamma-secretase, the target of DAPT, cleaves not only Notch receptors, but also IntegrinB1, Nectins, Cadherins, Ephrins and more. This recent review lists 149 substrates (Guner & Lichtenthaler Seminars in Cell & Developmental Biology 2020). The risk that some of the results reflect DAPT impact on IntegrinB1, Cadherins etc themselves is significant. The authors should validate their findings with more specific modulation of Notch activity, for example with a Notch blocking antibody, with siRNA, or with SAHM1.
      2. Furthermore, EGTA was used to "acutely destabilize VE-Cadherin". But EGTA chelates Calcium, which is essential for Notch structure, and EGTA is thus a well-known activator of Notch signaling (see eg Rand MD et al. (2000) Calcium depletion dissociates and activates heterodimeric notch receptors. Mol Cell Biol). The authors rightfully describe and cite this paper, but the use of EGTA nonetheless confounds interpretation. The authors check for NICD levels (at what timepoint?) but the staining is cytoplasmic (also not labelled in the figure per se, but described in the figure legend? - please label the staining in the panel). And in any case, NICD is very short-lived and nuclear staining cannot be taken as a hallmark of signaling activity. In particular if staining is performed at a time point at which the receptor and NICD may have been exhausted/depleted. The authors should validate these observations/conclusions with the Notch reporter to conclusively demonstrate whether EGTA does not activate Notch in their system.
      3. Trans-endocytosis of NECD on different substrates: the authors suggest that trans-endocytosis of NECD by Dll4 increases on softer substrates. But the authors also show that soft substrates lead to spreading out of cells, which could confound interpretation (is overlapping membranes, not internalization). The authors could validate trans-endocytosis by FACS: check if red Dll4+ cells contain more NECD. It is also not clear to me in this experiment whether the authors are looking at green NECD, or Notch1 full length, since they write "overlap of Notch1 and Dll4", which would not reflect trans-endocytosis but interactions at the cell surface for both cells. Please also define "overlay intensity", or explain further.
      4. The authors conclude their introduction with a statement that mechanosensitivity of Notch is linked to endocytosis, but their conclusion from Fig 6C was that Notch stiffness-dependence was independent of endocytosis, using the rhDll4..?

      Minor concerns

      1. In the introduction, the authors describe Dll3 as a Notch ligand that activates Notch signaling in trans. To my knowledge, Dll3 has only been described as a cis-inhibitor of Notch signaling. (I think this may have arisen during repeated edits of the manuscript!)
      2. In the introduction, the authors state that Notch1, Dll4 and Jag1 control angiogenesis, but then they only describe what Notch1/Dll4 do in the next few sentences. Perhaps one sentence to describe the role of Jag1 would help avoid the feeling of being "left hanging".
      3. Data presentation: please show all bar graphs with the individual replicates (dotplots).
      4. Data analysis/normalization: many graphs represent normalization of values in multiple steps which are not described in the methods/legends/results. For example, Notch reporter gene activity (Fig 1A) is Firefly divided by Renilla, and presumably normalized to the control condition at 1 (or an average of 1 for the three controls?). This is not explained. Also, it is not clear whether the data reported for the Control condition are Huvec on rhDll4 compared (normalized) to Huvec on control substrate (and similar for each other condition). What controls are included in this experiment? Please provide the full data to provide insight into the magnitude of activation by Dll4 itself. Perhaps "Control" is without rhDll4? But the bar underneath A/B implies this rhDll4 was used in all conditions.
      5. Statistics: data should be presented as means +/- standard deviation, not standard error of the mean (see for example Barde & Barde Perspect Clin Res. 2012): "SEM quantifies uncertainty in estimate of the mean whereas SD indicates dispersion of the data from mean. As readers are generally interested in knowing the variability within sample, descriptive data should be precisely summarized with SD."
      6. Statistics:
        • a. In the Methods section, the authors state that one-way ANOVA was followed by Dunnett's multiple comparison test, and two-way ANOVA was followed by Tukey's multiple comparison test. Dunnett is used to compare every mean to a control mean, while Tukey is used to compare every mean with every other mean. Fig 1 describes using Dunnett for Fig 1B, but the end of the legend days Tukey was used. However Fig 1A,C show internal pairwise comparisons to plastic. Please be sure to explain which statistics were used where, and why, and if plastic was set as the comparator, please be explicit about this.
        • b. Fig 3 uses "Sidak's corrected two-way ANOVA" and "Sidak's multiple comparison test"? I think Sidak is a method to correct alpha or p for multiple comparisons, as stated in the first instance, but it is described why this was used here, and not in other analyses, and whether the authors then applied Tukey's post-hoc test as described in the methods section? Similar comments for Fig 6.
        • c. It is counter-intuitive that the plastic -1.5kPa PDMS difference with no error-bar overlap in 1A would be 1-star significance, while the plastic-70kPa difference with almost overlapping error bars in 1B would be 4-star significance. Please check/show values.
        • d. In Fig 1B Figure legend, the authors write "Data is presented in a bar plot and compared with the integrin β1 intensities without DAPT treatment", but this is not the statistical comparison presented.
        • e. Fig 3B shows a very minor difference with overlapping error bars as 3-star significance? Is this correct?
      7. How much nuclear NICD (NICD intensity) is there in control conditions? (Control missing from Fig 1B, D).
      8. A DAPI counterstaining for 1B/D right panels would facilitate evaluation of whether NICD nuclear intensity is increased. The same applies for nuclear YAP assessment in Fig 3B. I assume a nuclear counter-stain was done for quantification of nuclear NICD intensity, and nuclear YAP intensity, but this is not described in the Materials and Methods, please add a description of how intensity was quantified, and provide nuclear counterstain images. (Also, what is the unit on the y-axis of "intensity" graphs? Arbitrary units (a.u.)?
      9. How much "overall" integrin B1 is there in DAPT-treated conditions in Fig 2C? (related to the concept that DAPT could be cleaving integrin B1, it could be depleted at 24 hours..?)
      10. More details regarding the analysis procedure need to be added to the Methods Section. Were cells segmented and then mean intensity estimated for the whole cell? Was this done by means of Intensity Ratio Nuclei Cytoplasm Tool plugin for Fiji alone? Were images background corrected, corrected for inhomogeneous illumination, normalized? In the case of Integrin beta 1 active, the expression seems to be patterned, was intensity expressed as mean intensity of every pixel corresponding to cytoplasm? For VE Cadherin staining, how was intensity estimated (only pixels corresponding to membrane were considered or every pixel of the cell)? Many figures are originated from a confocal microscope: were z-stacks acquired and then maximum projections done? Were z-stacks acquired and then fluorescence quantified in 3D images? Was a single plane acquired or analyzed, and if that is the case, how was this plane chosen?
      11. In Fig 4A, how is VE-Cadherin intensity quantified? As an average per field of view? Or per cell? And if per cell, how was each cell delineated? And if not per cell, how were equal cell numbers ensured?
      12. In FRAP experiment, how was intensity quantified? Was it per cell, per field of view or per region? Was each bleached region analyzed separately, or each cell? The datapoints should be either added to Figure 4C or as supplementary to assess the fitting. How many bleached regions per cell were done and how many cells were analyzed?
      13. In FRAP experiment, was bleaching done with an increased pixel dwell time? Was laser intensity increased? Do you have an estimation of laser power (not percentage) or flux?
      14. Figure S2 is not referenced in the manuscript - I think a reference to "Figure S3" in the NECD transendocytosis section (no page numbers or line numbering) should be to Fig S2 instead?
      15. In Figure 5A NICD nuclear intensity normalized somehow (normalization not explained), and stiffness no longer appears to regulate NICD levels as shown in Figure 1B.
      16. Fig 6B: From the immuno at right there is a clear stiffness-dependent difference in Transferrin uptake. How were "single cell uptake" and "number of particles" quantified? (How were cell bodies identified?) Uptake could also be verified with FACS.
      17. Fig 6C: there appear to be very different numbers of cells in the brightfield image at right. Are the 70, 1.5, and 0.5 kPa Notch reporter activities different from one another or only different from plastic? Might these results reflect cell density/increased Notch signaling due to more cell-cell contacts?
      18. How was the Dll4 coating of the different substrates done?
      19. It would be helpful to describe the composition of Collagen G (Collagen I) in the text (it is a risk to expect vendor information to remain available indefinitely).
      20. Please list catalog numbers for all reagents, and dilutions used for antibodies.
      21. Instead of using red and green for images, maybe cyan, yellow and/or magenta could be used to help the reader see what is being shown (especially if the reader might be color blind).
      22. Packages and tools such as Intensity Ratio Nuclei Cytoplasm Tool plugin for FIJI should be referenced. https://github.com/MontpellierRessourcesImagerie/imagej_macros_and_scripts/wiki/Intensity-Ratio-Nuclei-Cytoplasm-Tool#how-to-cite-the-tool

      Significance

      The concept of how stiffness regulates Notch signaling is of timely interest. While the mechanobiology of Notch has attracted a fair amount of attention (publications), less is known of how stiffness impacts Notch signaling.

      The work could be of interest to the Notch field, biomechanics, cell biology/adhesion experts. It could be relevant for designing cellular scaffolds for biological or medical applications.

      The expertise of this reviewer is Notch and imaging.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This is a well-executed and interesting study addressing a still controversial issue in clathrin-mediated endocytosis, namely the nature of curvature generation during formation of endocytic clathrin coated vesicles. The authors have applied new techniques to this old question, including state-of-the-art high resolution 3D single-molecule localization microscopy (SMLM, i.e. Super-resolution microscopy), a new maximum-likelihood based fitting framework to fit complex geometric models into localized point clouds (Wu et al., 2020, BioRxix) and mathematical modeling leading to a new cooperative curvature model of clathrin coat remodeling and temporal reconstruction of CCP structural dynamics based on the distribution of static super-resolution images. This is an important contribution, but will it resolve the controversy of constant curvature vs constant area for CCP invagination? I doubt it. In some ways the controversy is somewhat contrived and, as this paper shows the answer is unlikely to be either or. Below are some specific comments, in somewhat random order, from someone (a curmudgeon?) who has reviewed and/or carefully read these papers since 1980. Points that the authors should address are in bold. All can be addressed with modifications to the text, as the one experiment I asked for (quantification of clathrin recruitment) is impossible with this approach).

      • I wonder how many people who cite Heuser's 1980 paper have ever read it carefully. Indeed, many of the observations made here were also made by Heuser. Below, for example, is a summary I wrote, but then removed from a review as it was too lengthy "While Heuser favored the model that CCPs assemble first as flat structures and then rearrange during invagination, he was also careful to note several caveats. First, he observed that the edges of CCPs were 'ragged', likely reflecting sites of assembly of new polygons and that pentagons were more abundant at the edges. Thus, he argued that 'if even a few of these edge pentagons were destined to become completely surrounded with hexagons, it would be necessary to conclude that some degree of curvature can be built into coats as soon as they form". Second, by examining tilted sections he observed that "even the flattest baskets have a small degree of inward curvature, and many were complete hemispheres". Finally, he cautioned that his images were snap-shots and a precursor-product relationship could not, therefore, be unambiguously established and that the very large flat lattices he observed might well be 'prove to be some sort of dead end'. We now know that fibroblasts, in particular, have large numbers of static flat clathrin plagues."

      Thus, many of the author's conclusions, i.e. that 'completely flat clathrin coats are rare (pg 12, although they're not numbered), and that curved structures can be seen to emerge from the edges of flat lattices (see Supplemental Figure 1a, 3 examples on the right) are indeed consistent with Heuser's observations. In many ways, Heuser's 1980 paper is used as a straw man argument for the constant area model. The authors should more accurately cite and acknowledge this seminal paper.

      Response: __We thank the reviewer for this insightful and constructive input on the interpretation of the constant area model (CAM). We have revised the discussion (Page 14, Lines 397-402), citing Heuser’s observations more carefully and in similarity of what was already suggested eloquently by the reviewer. We agree that the strict interpretation of the CAM is misleading, and early evidence already suggests its flawed approximation of the endocytic mechanism (further mentioned now on __Page 15, Lines 429-431).

      • As Heuser did in his 1980 classic, the authors here would do well to note several caveats related to their analyses. These include:

      +

      Like Heuser they have assembled static imaged to create a pseudotemporal model, albeit using a much more quantitative approach. Nonetheless, it seems that this assumes only a single, stereotypic pathway for CCV formation. How good is this assumption? We know from dynamic imaging that there exists significant heterogeneity in both the kinetics and the molecular composition of CCPs. The authors should acknowledge this limitation.

      __Response: __We agree with the reviewer that the lack of direct temporal information is a clear limitation of our approach.

      We now introduce this limitation on Page 16, Lines 474-484, where we discuss the disadvantage of reconstructing an average trajectory based on static images. Here, the assumption of a single, stereotypic pathway of endocytosis is addressed. We cannot exclude the possibility of slight mechanistic variations being averaged out using our approach. However, we want to highlight the fact that our approach seems sensitive enough to distinguish between structures that originate via endocytosis, and structures that derived from a different pathway, potentially from the Golgi.

      We further address the kinetic variability in terms of abortive events on Page 14, Lines 405-411, __and discuss their effect on the mechanistic interpretation of our results. Generally speaking, abortive events are characterized as dim and short-lived structures in live-cell acquisitions. As the earliest structures in our data set already contain half the final coat area, we are most likely not capturing these abortive events in the first place (potential technical reasons for not capturing earlier structures are discussed on __Page 14, Lines 385-395).

      • The method, which required that they 'optimized the sample preparation to densely label clathrin at endocytic sites' involves labeling cells to near saturation with rabbit polyclonal antibodies to both clathrin light chains and clathrin heavy chains followed by detection with a second polyclonal donkey anti-rabbit. This gives 20 nm of additional and presumably flexible linker on the label. How might this effect the measurements and modeling? The Wu et al paper, which BTW has not been peer-reviewed, shows high precision fitting of the nuclear pore structure, but using endogenously tagged NUP-95, not two-layers of antibodies. The authors will need to discuss this limitation, it is my biggest concern regarding the analysis shown.

      Response: __We acknowledge the limitations imposed by indirect immunolabelling and formulated a hypothesis on how this could affect our model fit (mentioned on __Page 13, Line 363, illustrated in Supplementary Figure 6). A larger linkage error between label and target molecule would increase the distribution of localizations around the true underlying structure. As LocMoFit fits our spherical model directly to the localization coordinates, it is able to take this distribution into account, and will weigh the fit results based on the uncertainty of the localization estimation. A uniform distribution of labels around the true underlying structure should therefore be fitted accurately also at larger linkage error. A non-uniform labeling could occur should e.g. the densely crowded space between the coat and the plasma membrane not allow for the diffusion of the antibody to the clathrin epitopes. In that case, labeling would be one-sided, and instead of the true underlying structure, LocMoFit would optimize the spherical model to the highest probability density of label around + 10 nm from the true clathrin coat. This would result in an overestimation of the radius by the model, which we could correct by substracting 10 nm from the experimentally determined radius. This was done in Supplementary Figure 6 for the hypotheses of (1) uniform displacement by the antibodies; (2) biased displacement of the antibodies towards the cytosol; and (3) biased displacement of the antibodies towards the plasma membrane. Whilst we see that the fitting parameters scale with the corrected radii, the mechanistic interpretation of partial flat pre-assembly on the membrane, and subsequent bending and surface area growth still holds true.

      • One reason for continued controversy in this field is the lack of rany attempt to resolve findings obtained using different methods. Can a parsimonious explanation be found, or are their artifacts or misinterpretations of previous findings that can explain the discrepancies? Any valid model should fit all of the valid data. For example, the authors fail to cite a recent paper by Willy et al in Dev Cell (PMID 34774130), which has been on BioRxiv since 2019 (doi: https://doi.org/10.1101/715219). Here, similar to this present study, the authors used high resolution SIM-TIR to analyze ~1000 CCPs in 3 different cells lines (sadly non-overlapping with the cells used herein) and in Drosophila embryos to quantitatively test the two models. They conclude that their findings unambiguously support a constant curvature model. The authors would do the field a favor if they carefully read this paper and identified areas of commonality (i.e. that curvature is detected at early stages in both cases) and possible explanations for the discrepancies. Certainly, they should not ignore it.

      Response: __We agree with the reviewer on the importance of consolidating findings from different studies to converge to a generally accepted mechanism of clathrin coat formation. We had indeed cited Willy et al in the introduction, but agree that further discussion of their findings should be included. We therefore discuss their findings in more detail, also in comparison to our work, on __Page 17, Lines 502-511. We agree that we reach contradictory conclusions, which we think lies at least in part with the way that Willy et al. analyze their data. Willy et al. acquire 2D projections of the endocytic clathrin structures, whose size is just at the limit of their image resolution. They then compare their projected sizes to a purist constant area model, which assumes that a coat has to grow to its entire surface as an entirely flat structure and then instantaneously snaps to an increased curvature, resulting in a sudden drop of the projected area (footprint). As we and others (e.g. Bucher et al 2011, Heuser, 1980) have observed, completely flat lattices are rare, and curvature is initiated before final surface area is acquired. We do not agree that the absence of a purist constant area model implies that clathrin mediated endocytosis follows a constant curvature trajectory. Instead, we imagine that our cooperative curvature model is likely to fit well with the observations of Willy and colleagues.

      • An important body of evidence that is not considered in their model or discussion is that derived from live cell imaging. In addition to the heterogeneity mentioned above, studies have shown that the clathrin addition to CCPs is complete (i.e. the growth phase) occurs within the first ~20-30s, followed by a variable length (0->100s) plateau phase (Loerke et al, PMID 21447041). Both the current study and the Willy et al study admit that they may not be able to detect the earliest intermediates in CCP assembly. Indeed, in this study the smallest surface area CCPs are only 2-fold smaller than the largest CCPs, suggesting that over half of the triskelions have been recruited before a CCP can be distinguished from the background of clustered, nonspecifically-bound antibodies. Could the authors be monitoring events during the plateau phase and not the earliest events? Regardless, the findings are important as they address the nature of curvature generation during this plateau phase. While monitoring curvature generation during early events in CME, a recent study (Wang et al., eLife, PMID 32352376) showed that the acquisition of curvature within the first 20s of CCP assembly was a distinguishing feature between abortive and productive events. The authors might discuss how these studies on CCP dynamics might (or might not) inform their models.

      __Response: __We thank the reviewer for this very insightful comment and discuss this hypothesis on __Page 16-17, Lines 485-511. __We suggest that part of the initiating/growth phase observed in live-cell dynamics falls into the fast, flat assembly that we are unable to capture with our approach. It is challenging to clearly identify at which point in real-time we are detecting our earliest sites. We would however argue that the plateau phase in real-time could coincide with curvature generation and final addition of triskelia at the lattice rim. The variability in the duration of this plateau phase could therefore result from variable recruitment speed of triskelia and other factors during the finalizing of the vesicle neck.

      • The authors advertise 'quantitative' description of clathrin coated structure and indeed their measurements and models are quantitative; but there is no measure of intensity/numbers of triskelions and CCP growth: an important piece of quantitative data. I expect this is impossible with indirect immunofluorescence but should be considered as a limitation of the approach. Indeed, to my knowledge no one has yet quantitatively measured curvature generation in parallel to clathrin addition at CCPs (closest is Saffarian and Kirchhausen, PMID 17993495), but they don't discuss the relationship.

      Response: __We agree with the reviewer that quantifying the number of triskelia would be an essential piece of information to correlate area growth and curvature generation with dynamic information retrieved from fluorescence intensity in live-cell studies. Unfortunately, the indirect immunolabelling approach used in this work complicated this quantification, and direct comparison between number of localizations and fluorescence intensity cannot be made. However, we do observe a correlation between coat surface area and number of localizations in our data and show this in the newly added __Supplementary Figure 7. This allows us to formulate the hypothesis on Page 16-17, Lines 485-511, which suggests that the plateauing of fluorescence intensity coincides with curvature generation and final triskelia addition to the coat rim. We further highlight the necessity of capturing both high spatial and temporal resolution simultaneously, to ultimately overcome this limitation.

      • On page 7 equation 1, you assume a constant growth rate for addition of triskelia, but later describe that the rate might be cooperative (as the number of edges increases). How would this affect your modeling?

      Response: __We formulate the __surface area growth rate of the clathrin coat to be proportional to the rim length with a constant____ rate. The cooperativity between clathrin molecules we consider to affect the rate of curvature generation. The more molecules are present, the more the entire coat is inclined to bent. We rephrased that section to emphasize this distinction (Page 8, Line 217).

      Minor points:

      • Can you indicate in the first paragraph of the results that you are using indirect immunofluorescence with rabbit anti-CLCA, anti-CHC and detection with donkey anti-rabbit for labeling, to augment the rather vague statement 'we optimized the sample preparation to densely label clathrin at endocytic sites'.

      Response: __We added a clear indication on the labelling strategy used in this work on __Page 4, Lines 109-110.

      • I'm not comfortable with the conclusioin on page 5 that your data 'indicates that at the time point of scission, the clathrin coat of nascent vesicles is still incomplete'. Other explanations might be the relative kinetics of scission vs CCP growth (i.e. these structures are too transient to detect), or that deeply invaginated pits are sheered-off the membrane during sample preparation (there is evidence that most biochemically isolated CCVs are derived from sheered CCPs).

      Response: __We extended the explanation for the absence of fully closed vesicles with the hypotheses mentioned by the reviewer on __Page 5, Lines 159-161.

      • Bottom of page 5, can you briefly mention what data is shown in Supplemental Figure 2 (ie. Figure 2D and examples of likely non-endocytic CCPs shown in Supplemental Figure 2). When I read this, I questioned your speculation.

      Response: __We clarified the cross reference to (now) Supplementary Figure 3 accordingly on __Page 6, Lines 184-185.

      • Can you indicate N CCPs from N cells in the data in Tables 2-3 for fibroblasts and U2OS cells? Do you observe and have to ignore a larger number of flat/clustered CCPs in the fibroblasts?

      Response: __We indicated the number of cells and sites per data set in the Table captions on __Page 36, Lines 51; 959; and 967. We did not quantify the number of flat/clustered, plaque like structures in our data sets. During data acquisition, we would specifically select cells with minimal number of these structures present, and even within this cell chose an area in the periphery exhibiting low number of plaques. Our data is therefore not ideal to reliably quantify plaque density between different cell lines. Qualitative observations showed that whilst we had to disregard a few cells from the U2OS and SK-MEL-2 cell-lines due to high plaque formation, the 3T3 fibroblasts were relatively straight forward to image, as few cells showed high plaque density. A recent study by Hakanpää et al., 2022 (bioRxiv) showed the decreased formation of plaques when cells were seeded on fibronectin. The fact that fibroblasts excrete their own fibronectin agrees well with our observations of relatively few 3T3 cells exhibiting extensive plaque formation.

      • The last 3 paragraphs of the Introduction are results. The Introduction might best be used to review literature in more detail, discuss the reasons why uncertainty still exists and perhaps indicate how the methods applied here will help.

      Response: __We re-wrote the last 3 paragraphs of the introduction, now clearly stating the knowledge gap in the field, and what methods would be required to bridge it (Page 3, Lines 80-102).__

      Reviewer #1 (Significance (Required)):

      This is another excellent addition to a growing list of papers seeking to define the process of curvature generation at endocytic clathrin coated pits. In my opinion, its impact would be increased by better integrating the results presented here with other studies and methods, including the recent paper by Willy et al and the large body of literature on coated pit dynamics, some of which might be relevant in interpreting results, or at least placing them in a real vs pseudo-temporal perspective. The methods introduced and the quality of imaging, modeling and quantification further increase the study's significance. The finds will be of interest to those in the CME field, those studying membrane curvature generation in other contexts, those modeling CME, vesicle formation and curvature generation and those using SMLM to discern the structure of macromolecular assemblies.

      Reviewer expertise: Clathrin-mediated endocytosis (Sandra Schmid)

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary In this article, the authors aimed to investigate the dynamic of clathrin lattice during clathrin-mediated endocytosis (CME). Overall, they successfully achieved the goal by observing a large number of clathrin spots from several cell lines with 3D single-molecule localization microscopy (SMLM). With the help of this high-resolution imaging technique, they were able to describe the physical properties of each spot and reconstruct the assembly and remodeling of the clathrin coat. Moreover, by comparing the constant area/curvature model with their own data, the authors highlighted that neither of the prevailing models perfectly explained what they observed and proposed 'cooperative curvature model'. With the novel model, the authors were able to reconstruct the clathrin coat remodeling in different cell lines and concluded that the simultaneously bending and assembly of the clathrin coat is a homogenous property of endocytosis.

      The experiments and analytical procedures are well-designed and performed, and the manuscript is well-organized. The conclusion 'cooperative curvature model' was deduced from a large amount of data analysis and clearly stated in the text. I would like to recommend its publication if the following issues will be clarified.

      Major comments:

      1. The authors compared the morphological dynamics of clathrin-coated pit among three different cell lines (SK-MEL-2, U2OS, and 3T3) and found slight differences. As U2OS cells was derived from bone tissues, it has different mechanical properties (membrane tension, elasticity of cortical layer, etc..). It would be interesting to consider those mechanical properties in understanding the morphology (Figure 2) and progress (Figure 4) of the CME. Considering the fact that the bending energy of the plasma membrane is dependent on the membrane tension, they may be able to find some relationships between mechanical properties of the cell cortex and CME.

      __Response: __We thank the reviewer for this comment and very much agree that the relationship between mechanical properties structural adaptation of the endocytic machinery is a highly interesting question. We came to the same conclusion and are therefore exploring this relationship at the moment. This is however not a straightforward task, and the complex nature of plasma membrane mechanics necessitates careful experimental design. It is therefore outside the scope of this publication. We do think this point further highlights the potential of the method presented here, as it allows the investigation of additional principles in clathrin-mediated endocytosis mechanics. We do hope to share our insights on this topic soon.

      In Figure 4, the authors estimated the progression of the CME using the frequency distribution of theta. However, I wonder how they handled the events which were aborted in the middle of the CME. It had been suggested that some CME are aborted during the initial step of the CME. The authors should consider (at least discuss) those abortive events, which can disturb the analysis.

      Response: __Generally speaking, abortive events (now discussed on __Page 14, Lines 405-411) are characterized as dim and short-lived structures in live-cell acquisitions. As the earliest structures in our data set already contain half the final coat area, we are most likely not capturing these abortive events in the first place (potential technical reasons for not capturing earlier structures are discussed on Page 14, Lines 385-395).

      Abortive events throughout the later process of endocytosis would, according to our data, still follow the same mechanistic trajectory as other sites. They could potentially slightly skew our pseudotime analysis, as they would result in an overestimation of specific endocytic stages. The overall mechanistic insight of our work would not be greatly affected, as curvature generation would still occur according to the same trajectory. Due to the low impact on our overall results we do not discuss these late abortive events further.

      Minor comments:

      1. Page5, result section 2. The author should further explain why vesicles from trans Golgi could responsible for the small disconnected set of data points corresponding to the vesicles with larger curvatures.

      Response: __We extended our explanation for the presence of non-endocytically derived structures in our data set on __Page 6, Lines 184-189. We further extended the supplementary information with an additional experiment (Supplementary Figure 4), highlighting the absence of AP2-positive structures within the disconnected population. As AP2 is a specific marker for CME, these results further solidify our hypothesis. Further experiments would be required to determine their exact origin, and are outside of the scope of this publication.

      Page7, line 6. The author assumed that the clathrin coat starts growing on a flat membrane. However, as is mentioned in the discussion, clathrin has been proved to have curvature sensing ability which could be further amplified by adapter proteins by several times (Zeno et al., 2021). So, it seems that clathrin preferred a highly curved membrane instead of a flat one. Is it still reasonable to make this assumption?

      Response: __Whilst our assumption states the growing of clathrin coat on flat membranes, we do not restrict our model to an intercept through 0, and it would therefore still hold true even in the case of growth starting on slightly bent membranes. The impact of the preference of clathrin for curvature is considered as a potential mechanistic explanation for the positive feedback in curvature generation described by our model. We therefore already cite the reference mentioned by the reviewer on __Page 8, Line 224.

      As we do observe flat structures in our data set (discussed more in detail now on Page 14, Lines 396-404), we still think the assumption of early flat growth holds true.

      Page 9, result section 4. In the sentence: "we effectively generated the average trajectories of how curvature, surface area, projected area and lattice edge change during endocytosis in SK-MEL-2 cells (Figure 4B-E)." Here I think the authors are describing Figure 4C-F.

      __Response: __That is correct, an oversight on our part. We changed the cross-reference.

      Page 11, discussion. In the sentence: "A deviation of the cross-sectional profile from a circle is nevertheless preserved in the averaging (Supplementary Figure 5)." I didn't see supplementary figure 5 in the article.

      Response: __We changed the cross-reference. We were addressing a subsection of __Supplementary Figure 8.

      Reviewer #2 (Significance (Required)):

      From a vast amount of microscopic images and data analysis, the manuscript gives a clear model on the progress of the CME, which integrates two opposing models; constant area and constant curvature models. This is a big progress in our understanding of the molecular mechanism of CME, and will attract many researchers in the field of cell biology. From a viewpoint of my expertise (molecular imaging of plasma membrane and endocytic processes), this manuscript has significant impact on the related research fields.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors used single-molecule localization microscopy of clathrin in fixed cells (2 human cell lines, one mouse) to capture snapshots of a clathrin-mediated endocytosis (CME), fitted these localizations to a geometric model of a forming vesicle, and used these fitted measurements to test existing models of clathrin-mediated vesicle formation before refining their own. Specifically, the closing angle, a measure of vesicle completeness, was used as a proxy for growth-stage of the vesicle such that the many captured snapshots could reconstruct a pseudo-timeline with an unknown parameterization of time on closing angle. Two standard models of CME vesicle formation, where the surface area is kept constant or where the curvature is kept constant, were examined and determined to be incommensurate with the pseudo timelines of curvatures and surface area. The authors then describe their own model for CME vesicle formation, in which neither surface area nor curvature are constant in evolution of the vesicle, and cooperative forces are hypothesized to non-linearly modulate the curvature-growth as a function of closing angle. Additionally, by binning snapshots and then aligning, scaling, and azimuthally smoothing each bin, they reconstruct representations of distinct endocytic stages.

      Major comments:

      Most results are quite convincing, and the authors do a nice job of displaying examples of SMLM data, both with fit results as well as example clathrin assemblies that are too far removed from their budding-vesicle model to be included for analysis, for example. It is also worth noting that the clathrin images themselves appear to be very high-quality - clearly, as detailed in the methods, attention was given to each step of the imaging and reconstruction process.

      While the presented cooperative curvature model seems reasonable and surely fits the curvature-, surface area-, and rim length-vs. closing angle data better than the simplistic constant surface-area and constant curvature models, it also has more parameters, namely: gamma (the initial rate of curvature change with closing angle) and H_0 (the final preferred curvature). It would be appropriate to calculate an information criterion (e.g. Bayesian), using an assumption of Gaussian-distributed errors (presumably the data fitting in R was least squares, so this would match) to justify the additional parameters.

      Response: __This is an important observation by the reviewer. Indeed, our model uses one more parameter compared to the models we compare it with. To justify this, we performed the calculation as suggested by the reviewer, and found that the cooperative curvature model (CoopCM) indeed results in the lowest BIC (__Supplementary Notes). We therefore are confident that out of the three models tested in this work, our CoopCM fits best to the underlying experimental data (Page 8, Lines 232-235).

      A related issue relates to the error in the extracted value of the closing angle from a single 3D reconstruction - the error distribution should be quantified for this very important parameter. The errors in the other parameters extracted from the fits are less important, but would enhance the paper.

      Response: __We thank the reviewer for pointing out the importance of the estimation error of the key parameter closing angle. To address this point, based on the geometrical model, we simulated clathrin-coated structures with closing angles evenly distributed across the entire range (0-180°). This realistic simulation represents the data quality (e.g., localization precision and labeling efficiency) of the experimental data (corresponding methods are included in __Pages 22- 23, Lines 679-706). The result of fitting these structures using LocMoFit shows an unbiased estimation with small spread of the error (overall STD = 2.82°; see the newly included Supplementary Figure 2a).

      Pseudo-temporal sorting on closing angle makes sense and I appreciate the authors mentioning potential caveats to the monotonicity, etc. However, a comment about the impact of closing angle errors on the pseudo-time determinations would be helpful. The agreement of theta-rank plots with the hypothesized sqrt(t) scaling is reassuring.

      I additionally appreciate the robustness of fitting a geometric structure from localizations rather than relying on pseudo-temporal sorting on clathrin count extracted from localization-merging of multi-blinking emitters.

      Response: __The pseudo-temporal sorting is based on the precisely estimated closing angle, and therefore is also precise, as the distribution of the fitted closing angle has no significant distortion compared to the expectation (__Supplementary Figure 2b).

      The authors did a nice job of qualifying their more speculative claims, in particular I appreciated their mentioning the possibility that smaller clathrin coats could be below their detection limit.

      The authors state a set of data points in suppl. figure 2D (and suppl. Fig 3A-C) are "likely" small clathrin-coated vesicles from the trans Golgi. I appreciate the examples rendered in that figure so a reader can appraise, but if they have my background they might not know how reasonable exclusion of this data is from model testing. This claim could be rephrased or the rationale expanded upon to justify the Golgi hypothesis.

      Response: __We agree with the reviewer and further expanded on our hypothesis on the origin of the structures within the disconnected cloud of data points (Page 6, Lines 184-189). We further performed an additional experiment (Supplementary Figure 4)__, where we simultaneously imaged the clathrin coat at high resolution, and the CME specific AP2 complex tagged with GFP at diffraction limited resolution. We observed that there were no AP2-GFP positive structures present in the disconnected cloud of our data set, and conclude that these structures indeed must originate via a different pathway.

      The data and methods are presented such that they could be reproduced, and replicating their experiment in multiple cell lines, across multiple species, would seem to be adequate replication. As mentioned above, the statistical analysis of whether the model complexity is justified by improved goodness of fit is currently missing but can readily be checked and added.

      Minor comments:

      Last paragraph of the introduction, positive feedback is mentioned but not the slowing down as preferred curvature is realized (inclusion of which might help foster a clearer understanding of the model early on).

      Response: __We now mention the slowing down towards a preferred curvature in our introduction on __Page 3, Lines 100-102.

      In Fig. 1, please state in the figure caption what is being displayed in the two large panels and what is the color map. Is this the 3D data from the overlapping elliptical Gaussians projected on the plane in a "hot" map? Further, in the top right small panels, are the x-y images projections of all z, or measured at a specific z?

      Response: __We adjusted Figure 1 and the figure caption to clearly explain what is mentioned in each superresolution panel. The exact details for image rendering, including the color map and gaussian blurring of the localization coordinates are now described in the methods on __Page 21, Lines 625-627. Ultimately, the x-y images represent an enlarged view of the projections as visible in the previous two panels. We hope that rephrasing of Figure 1 legend clarifies this accordingly.

      In Eqn. (1), epsilon is not defined.

      Response: __The definition is mentioned on __Page 8, Line 210, right before the equation, same as for kon.

      For the theta-rank plots (Fig4 B, SFig D-F ii) moving the theta(t)=sqrt(t) red curves behind sorted theta data would make the data easier to see.

      __Response: __We adjusted the Figures according to the reviewer's suggestion.

      "Laser" in sentence about the speckle reducer should probably be plural.

      Response: __We corrected this grammar mistake, and changed “laser” to “lasers” on __Page 20, Line 586.

      I would like to see the "custom" algorithm based on redundant cross-correlation for drift correction briefly described.

      Response: __We added an explanation on the algorithm used for the drift correction on __Pages 20-21, Lines 611-617.

      A legend for supplemental figure 3 A-C would be nice.

      Response: __We added a legend for the various models in (now) __Supplementary Figure 5, and further made some clarifications in the figure caption.

      If the definition of the abbreviation flat-to-curved-transition as FTC was explicit I missed it.

      Response: __As we do not use this abbreviation anywhere else in the manuscript, we removed it from the __Supplementary Note to avoid confusion.

      Resolution of 20 and 30 nm (laterally and axially, respectively) was quoted once towards the beginning of the manuscript as being an improvement resulting from the localization method described in Li et al., 2018. Resolution can be difficult to speak about precisely, but the methods section would seem to indicate that localizations are filtered at 20 nm lateral localization precision (potentially 30 nm axially?), and I think the authors could consider rephrasing to depict this unless I am missing elsewhere a description of the resolution metric being used.

      Response: __The original 20 and 30 nm resolution (laterally and axially) was calculated based on the median localization precision values in x-y and z for a representative image, using the FWHM approach (described in Methods __Page 21, Lines 621-624). After consideration of the reviewer's question, we found the modal value to be a better quantity to calculate the resolution, and changed this in the text accordingly (Page 4, Lines 113-115, and Methods Page 21, Lines 621-624).

      Reviewer #3 (Significance (Required)):

      Proteins involved with inducing curvature in membranes are in general very exciting targets for localization microscopy, yet still for many systems questions remain unanswered. The authors tackle one such question in this manuscript. In other, unresolved, discussions, the posed hypotheses are quite similar to the simplistic models surpassed in this work (e.g. that curvature scales linearly with local protein copy number, or that surface area scales linearly with local protein copy number). The idea of cooperativity may be useful for others to consider, and the authors additionally demonstrate a seemingly smooth workflow using their separately described tools (primarily LoMoFit; Wu et al. 2021).

      I myself am not an expert on CME or vesicle trafficking. My background is primarily in SMLM method development and SMLM / fluorescence image analysis. From my perspective, the novelty of the biological conclusions appears to be the authors' specific cooperative model and the presence of two structural states which are enriched (closing angle 70{degree sign} and 130{degree sign}). As referenced, and authors F. Frey and U. S. Schwarz nicely present in Bucher et al. 2018, the constant curvature and constant surface area models are known to be inaccurate descriptions of CME evolution, and further it is also known that clathrin first assembles small flat structures before beginning to curve the membrane. However, the 3D super-resolution imaging and direct evaluation of a 3D model geometry in this work is a nice extension of the 2D super-resolution imaging and projection evaluation in the authors' previous work studying endocytosis through ensemble averaging in yeast (Mund et al. 2018) as well as the analysis on projections in Bucher et al. 2018. Fully 3D treatment of the clathrin structures allows the authors to orient asymmetric assemblies such that they are averaged out in their ensemble reconstruction, and as they point out the molecular specificity afforded by a fluorescence-based technique ensures unbiased segmentation of clathrin-involved endocytic sites. In other words, while this work does not describe a technical advance not already described elsewhere, it sets a nice example for those researching protein-membrane interactions of how to leverage the right tools to clearly and directly answer their questions. With their additional work to make these tools extensible to other geometries, multiple color channels, etc., I expect their work to inspire quality studies in other systems. That significance is complementary to their proposal of a reasonable model for the geometric evolution of CME.

      References:

      Maximum-likelihood model fitting for quantitative analysis of SMLM data, Yu-Le Wu, Philipp Hoess, Aline Tschanz, Ulf Matti, Markus Mund, Jonas Ries, bioRxiv 2021.08.30.456756; doi: https://doi.org/10.1101/2021.08.30.456756

      Bucher, D., Frey, F., Sochacki, K.A. et al. Clathrin-adaptor ratio and membrane tension regulate the flat-to-curved transition of the clathrin coat during endocytosis. Nat Commun 9, 1109 (2018). https://doi.org/10.1038/s41467-018-03533-0

      Markus Mund, Johannes Albertus van der Beek, Joran Deschamps, Serge Dmitrieff, Philipp Hoess, Jooske Louise Monster, Andrea Picco, François Nédélec, Marko Kaksonen, Jonas Ries, Systematic Nanoscale Analysis of Endocytosis Links Efficient Vesicle Formation to Patterned Actin Nucleation, Cell, 174, 4, (2018). https://doi.org/10.1016/j.cell.2018.06.032.

      s

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The authors used single-molecule localization microscopy of clathrin in fixed cells (2 human cell lines, one mouse) to capture snapshots of a clathrin-mediated endocytosis (CME), fitted these localizations to a geometric model of a forming vesicle, and used these fitted measurements to test existing models of clathrin-mediated vesicle formation before refining their own. Specifically, the closing angle, a measure of vesicle completeness, was used as a proxy for growth-stage of the vesicle such that the many captured snapshots could reconstruct a pseudo-timeline with an unknown parameterization of time on closing angle. Two standard models of CME vesicle formation, where the surface area is kept constant or where the curvature is kept constant, were examined and determined to be incommensurate with the pseudo timelines of curvatures and surface area. The authors then describe their own model for CME vesicle formation, in which neither surface area nor curvature are constant in evolution of the vesicle, and cooperative forces are hypothesized to non-linearly modulate the curvature-growth as a function of closing angle. Additionally, by binning snapshots and then aligning, scaling, and azimuthally smoothing each bin, they reconstruct representations of distinct endocytic stages.

      Major comments:

      Most results are quite convincing, and the authors do a nice job of displaying examples of SMLM data, both with fit results as well as example clathrin assemblies that are too far removed from their budding-vesicle model to be included for analysis, for example. It is also worth noting that the clathrin images themselves appear to be very high-quality - clearly, as detailed in the methods, attention was given to each step of the imaging and reconstruction process.

      While the presented cooperative curvature model seems reasonable and surely fits the curvature-, surface area-, and rim length-vs. closing angle data better than the simplistic constant surface-area and constant curvature models, it also has more parameters, namely: gamma (the initial rate of curvature change with closing angle) and H_0 (the final preferred curvature). It would be appropriate to calculate an information criterion (e.g. Bayesian), using an assumption of Gaussian-distributed errors (presumably the data fitting in R was least squares, so this would match) to justify the additional parameters.

      A related issue relates to the error in the extracted value of the closing angle from a single 3D reconstruction - the error distribution should be quantified for this very important parameter. The errors in the other parameters extracted from the fits are less important, but would enhance the paper.

      Pseudo-temporal sorting on closing angle makes sense and I appreciate the authors mentioning potential caveats to the monotonicity, etc. However, a comment about the impact of closing angle errors on the pseudo-time determinations would be helpful. The agreement of theta-rank plots with the hypothesized sqrt(t) scaling is reassuring. I additionally appreciate the robustness of fitting a geometric structure from localizations rather than relying on pseudo-temporal sorting on clathrin count extracted from localization-merging of multi-blinking emitters.

      The authors did a nice job of qualifying their more speculative claims, in particular I appreciated their mentioning the possibility that smaller clathrin coats could be below their detection limit.

      The authors state a set of data points in suppl. figure 2D (and suppl. Fig 3A-C) are "likely" small clathrin-coated vesicles from the trans Golgi. I appreciate the examples rendered in that figure so a reader can appraise, but if they have my background they might not know how reasonable exclusion of this data is from model testing. This claim could be rephrased or the rationale expanded upon to justify the Golgi hypothesis.

      The data and methods are presented such that they could be reproduced, and replicating their experiment in multiple cell lines, across multiple species, would seem to be adequate replication. As mentioned above, the statistical analysis of whether the model complexity is justified by improved goodness of fit is currently missing but can readily be checked and added.

      Minor comments:

      Last paragraph of the introduction, positive feedback is mentioned but not the slowing down as preferred curvature is realized (inclusion of which might help foster a clearer understanding of the model early on).

      In Fig. 1, please state in the figure caption what is being displayed in the two large panels and what is the color map. Is this the 3D data from the overlapping elliptical Gaussians projected on the plane in a "hot" map? Further, in the top right small panels, are the x-y images projections of all z, or measured at a specific z?

      In Eqn. (1), epsilon is not defined.

      For the theta-rank plots (Fig4 B, SFig D-F ii) moving the theta(t)=sqrt(t) red curves behind sorted theta data would make the data easier to see.

      "Laser" in sentence about the speckle reducer should probably be plural.

      I would like to see the "custom" algorithm based on redundant cross-correlation for drift correction briefly described.

      A legend for supplemental figure 3 A-C would be nice.

      I would enjoy hearing the authors' thoughts on why resting points at closing angle 70{degree sign} and 130{degree sign} are present. If these thoughts can be readily rationalized/referenced some speculation might even be warranted in the text.

      If the definition of the abbreviation flat-to-curved-transition as FTC was explicit I missed it.

      Resolution of 20 and 30 nm (laterally and axially, respectively) was quoted once towards the beginning of the manuscript as being an improvement resulting from the localization method described in Li et al., 2018. Resolution can be difficult to speak about precisely, but the methods section would seem to indicate that localizations are filtered at 20 nm lateral localization precision (potentially 30 nm axially?), and I think the authors could consider rephrasing to depict this unless I am missing elsewhere a description of the resolution metric being used.

      Significance

      Proteins involved with inducing curvature in membranes are in general very exciting targets for localization microscopy, yet still for many systems questions remain unanswered. The authors tackle one such question in this manuscript. In other, unresolved, discussions, the posed hypotheses are quite similar to the simplistic models surpassed in this work (e.g. that curvature scales linearly with local protein copy number, or that surface area scales linearly with local protein copy number). The idea of cooperativity may be useful for others to consider, and the authors additionally demonstrate a seemingly smooth workflow using their separately described tools (primarily LoMoFit; Wu et al. 2021).

      I myself am not an expert on CME or vesicle trafficking. My background is primarily in SMLM method development and SMLM / fluorescence image analysis. From my perspective, the novelty of the biological conclusions appears to be the authors' specific cooperative model and the presence of two structural states which are enriched (closing angle 70{degree sign} and 130{degree sign}). As referenced, and authors F. Frey and U. S. Schwarz nicely present in Bucher et al. 2018, the constant curvature and constant surface area models are known to be inaccurate descriptions of CME evolution, and further it is also known that clathrin first assembles small flat structures before beginning to curve the membrane. However, the 3D super-resolution imaging and direct evaluation of a 3D model geometry in this work is a nice extension of the 2D super-resolution imaging and projection evaluation in the authors' previous work studying endocytosis through ensemble averaging in yeast (Mund et al. 2018) as well as the analysis on projections in Bucher et al. 2018. Fully 3D treatment of the clathrin structures allows the authors to orient asymmetric assemblies such that they are averaged out in their ensemble reconstruction, and as they point out the molecular specificity afforded by a fluorescence-based technique ensures unbiased segmentation of clathrin-involved endocytic sites. In other words, while this work does not describe a technical advance not already described elsewhere, it sets a nice example for those researching protein-membrane interactions of how to leverage the right tools to clearly and directly answer their questions. With their additional work to make these tools extensible to other geometries, multiple color channels, etc., I expect their work to inspire quality studies in other systems. That significance is complementary to their proposal of a reasonable model for the geometric evolution of CME.

      References:

      Maximum-likelihood model fitting for quantitative analysis of SMLM data Yu-Le Wu, Philipp Hoess, Aline Tschanz, Ulf Matti, Markus Mund, Jonas Ries bioRxiv 2021.08.30.456756; doi: https://doi.org/10.1101/2021.08.30.456756

      Bucher, D., Frey, F., Sochacki, K.A. et al. Clathrin-adaptor ratio and membrane tension regulate the flat-to-curved transition of the clathrin coat during endocytosis. Nat Commun 9, 1109 (2018). https://doi.org/10.1038/s41467-018-03533-0

      Markus Mund, Johannes Albertus van der Beek, Joran Deschamps, Serge Dmitrieff, Philipp Hoess, Jooske Louise Monster, Andrea Picco, François Nédélec, Marko Kaksonen, Jonas Ries, Systematic Nanoscale Analysis of Endocytosis Links Efficient Vesicle Formation to Patterned Actin Nucleation, Cell, 174, 4, (2018). https://doi.org/10.1016/j.cell.2018.06.032.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      In this article, the authors aimed to investigate the dynamic of clathrin lattice during clathrin-mediated endocytosis (CME). Overall, they successfully achieved the goal by observing a large number of clathrin spots from several cell lines with 3D single-molecule localization microscopy (SMLM). With the help of this high-resolution imaging technique, they were able to describe the physical properties of each spot and reconstruct the assembly and remodeling of the clathrin coat. Moreover, by comparing the constant area/curvature model with their own data, the authors highlighted that neither of the prevailing models perfectly explained what they observed and proposed 'cooperative curvature model'. With the novel model, the authors were able to reconstruct the clathrin coat remodeling in different cell lines and concluded that the simultaneously bending and assembly of the clathrin coat is a homogenous property of endocytosis. The experiments and analytical procedures are well-designed and performed, and the manuscript is well-organized. The conclusion 'cooperative curvature model' was deduced from a large amount of data analysis and clearly stated in the text. I would like to recommend its publication if the following issues will be clarified.

      Major comments:

      1. The authors compared the morphological dynamics of clathrin-coated pit among three different cell lines (SK-MEL-2, U2OS, and 3T3) and found slight differences. As U2OS cells was derived from bone tissues, it has different mechanical properties (membrane tension, elasticity of cortical layer, etc..). It would be interesting to consider those mechanical properties in understanding the morphology (Figure 2) and progress (Figure 4) of the CME. Considering the fact that the bending energy of the plasma membrane is dependent on the membrane tension, they may be able to find some relationships between mechanical properties of the cell cortex and CME.
      2. In Figure 4, the authors estimated the progression of the CME using the frequency distribution of theta. However, I wonder how they handled the events which were aborted in the middle of the CME. It had been suggested that some CME are aborted during the initial step of the CME. The authors should consider (at least discuss) those abortive events, which can disturb the analysis.

      Minor comments:

      1. Page5, result section 2. The author should further explain why vesicles from trans Golgi could responsible for the small disconnected set of data points corresponding to the vesicles with larger curvatures.
      2. Page7, line 6. The author assumed that the clathrin coat starts growing on a flat membrane. However, as is mentioned in the discussion, clathrin has been proved to have curvature sensing ability which could be further amplified by adapter proteins by several times (Zeno et al., 2021). So, it seems that clathrin preferred a highly curved membrane instead of a flat one. Is it still reasonable to make this assumption?
      3. Page 9, result section 4. In the sentence: "we effectively generated the average trajectories of how curvature, surface area, projected area and lattice edge change during endocytosis in SK-MEL-2 cells (Figure 4B-E)." Here I think the authors are describing Figure 4C-F.
      4. Page 11, discussion. In the sentence: "A deviation of the cross-sectional profile from a circle is nevertheless preserved in the averaging (Supplementary Figure 5)." I didn't see supplementary figure 5 in the article.

      Significance

      From a vast amount of microscopic images and data analysis, the manuscript gives a clear model on the progress of the CME, which integrates two opposing models; constant area and constant curvature models. This is a big progress in our understanding of the molecular mechanism of CME, and will attract many researchers in the field of cell biology. From a viewpoint of my expertise (molecular imaging of plasma membrane and endocytic processes), this manuscript has significant impact on the related research fields.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a well-executed and interesting study addressing a still controversial issue in clathrin-mediated endocytosis, namely the nature of curvature generation during formation of endocytic clathrin coated vesicles. The authors have applied new techniques to this old question, including state-of-the-art high resolution 3D single-molecule localization microscopy (SMLM, i.e. Super-resolution microscopy), a new maximum-likelihood based fitting framework to fit complex geometric models into localized point clouds (Wu et al., 2020, BioRxix) and mathematical modeling leading to a new cooperative curvature model of clathrin coat remodeling and temporal reconstruction of CCP structural dynamics based on the distribution of static super-resolution images. This is an important contribution, but will it resolve the controversy of constant curvature vs constant area for CCP invagination? I doubt it. In some ways the controversy is somewhat contrived and, as this paper shows the answer is unlikely to be either or. Below are some specific comments, in somewhat random order, from someone (a curmudgeon?) who has reviewed and/or carefully read these papers since 1980. Points that the authors should address are in bold. All can be addressed with modifications to the text, as the one experiment I asked for (quantification of clathrin recruitment) is impossible with this approach).

      1. I wonder how many people who cite Heuser's 1980 paper have ever read it carefully. Indeed, many of the observations made here were also made by Heuser. Below, for example, is a summary I wrote, but then removed from a review as it was too lengthy

      "While Heuser favored the model that CCPs assemble first as flat structures and then rearrange during invagination, he was also careful to note several caveats. First, he observed that the edges of CCPs were 'ragged', likely reflecting sites of assembly of new polygons and that pentagons were more abundant at the edges. Thus, he argued that 'if even a few of these edge pentagons were destined to become completely surrounded with hexagons, it would be necessary to conclude that some degree of curvature can be built into coats as soon as they form". Second, by examining tilted sections he observed that "even the flattest baskets have a small degree of inward curvature, and many were complete hemispheres". Finally, he cautioned that his images were snap-shots and a precursor-product relationship could not, therefore, be unambiguously established and that the very large flat lattices he observed might well be 'prove to be some sort of dead end'. We now know that fibroblasts, in particular, have large numbers of static flat clathrin plagues."

      Thus, many of the author's conclusions, i.e. that 'completely flat clathrin coats are rare (pg 12, although they're not numbered), and that curved structures can be seen to emerge from the edges of flat lattices (see Supplemental Figure 1a, 3 examples on the right) are indeed consistent with Heuser's observations. In many ways, Heuser's 1980 paper is used as a straw man argument for the constant area model. The authors should more accurately cite and acknowledge this seminal paper. 2. As Heuser did in his 1980 classic, the authors here would do well to note several caveats related to their analyses. These include: - a. Like Heuser they have assembled static imaged to create a pseudotemporal model, albeit using a much more quantitative approach. Nonetheless, it seems that this assumes only a single, stereotypic pathway for CCV formation. How good is this assumption? We know from dynamic imaging that there exists significant heterogeneity in both the kinetics and the molecular composition of CCPs. The authors should acknowledge this limitation. - b. The method, which required that they 'optimized the sample preparation to densely label clathrin at endocytic sites' involves labeling cells to near saturation with rabbit polyclonal antibodies to both clathrin light chains and clathrin heavy chains followed by detection with a second polyclonal donkey anti-rabbit. This gives 20 nm of additional and presumably flexible linker on the label. How might this effect the measurements and modeling? The Wu et al paper, which BTW has not been peer-reviewed, shows high precision fitting of the nuclear pore structure, but using endogenously tagged NUP-95, not two-layers of antibodies. The authors will need to discuss this limitation, it is my biggest concern regarding the analysis shown.<br /> 3. One reason for continued controversy in this field is the lack of rany attempt to resolve findings obtained using different methods. Can a parsimonious explanation be found, or are their artifacts or misinterpretations of previous findings that can explain the discrepancies? Any valid model should fit all of the valid data. For example, the authors fail to cite a recent paper by Willy et al in Dev Cell (PMID 34774130), which has been on BioRxiv since 2019 (doi: https://doi.org/10.1101/715219). Here, similar to this present study, the authors used high resolution SIM-TIR to analyze ~1000 CCPs in 3 different cells lines (sadly non-overlapping with the cells used herein) and in Drosophila embryos to quantitatively test the two models. They conclude that their findings unambiguously support a constant curvature model. The authors would do the field a favor if they carefully read this paper and identified areas of commonality (i.e. that curvature is detected at early stages in both cases) and possible explanations for the discrepancies. Certainly, they should not ignore it. 4. An important body of evidence that is not considered in their model or discussion is that derived from live cell imaging. In addition to the heterogeneity mentioned above, studies have shown that the clathrin addition to CCPs is complete (i.e. the growth phase) occurs within the first ~20-30s, followed by a variable length (0->100s) plateau phase (Loerke et al, PMID 21447041) . Both the current study and the Willy et al study admit that they may not be able to detect the earliest intermediates in CCP assembly. Indeed, in this study the smallest surface area CCPs are only 2-fold smaller than the largest CCPs, suggesting that over half of the triskelions have been recruited before a CCP can be distinguished from the background of clustered, nonspecifically-bound antibodies. Could the authors be monitoring events during the plateau phase and not the earliest events? Regardless, the findings are important as they address the nature of curvature generation during this plateau phase. While monitoring curvature generation during early events in CME, a recent study (Wang et al., eLife, PMID 32352376) showed that the acquisition of curvature within the first 20s of CCP assembly was a distinguishing feature between abortive and productive events. The authors might discuss how these studies on CCP dynamics might (or might not) inform their models. 5. The authors advertise 'quantitative' description of clathrin coated structure and indeed their measurements and models are quantitative; but there is no measure of intensity/numbers of triskelions and CCP growth: an important piece of quantitative data. I expect this is impossible with indirect immunofluorescence but should be considered as a limitation of the approach. Indeed, to my knowledge no one has yet quantitatively measured curvature generation in parallel to clathrin addition at CCPs (closest is Saffarian and Kirchhausen, PMID 17993495), but they don't discuss the relationship. 6. On page 7 equation 1, you assume a constant growth rate for addition of triskelia, but later describe that the rate might be cooperative (as the number of edges increases). How would this affect your modeling?

      Minor points:

      • Can you indicate in the first paragraph of the results that you are using indirect immunofluorescence with rabbit anti-CLCA, anti-CHC and detection with donkey anti-rabbit for labeling, to augment the rather vague statement 'we optimized the sample preparation to densely label clathrin at endocytic sites'.
      • I'm not comfortable with the conclusioin on page 5 that your data 'indicates that at the time point of scission, the clathrin coat of nascent vesicles is still incomplete'. Other explanations might be the relative kinetics of scission vs CCP growth (i.e. these structures are too transient to detect), or that deeply invaginated pits are sheered-off the membrane during sample preparation (there is evidence that most biochemically isolated CCVs are derived from sheered CCPs).
      • Bottom of page 5, can you briefly mention what data is shown in Supplemental Figure 2 (ie. Figure 2D and examples of likely non-endocytic CCPs shown in Supplemental Figure 2). When I read this, I questioned your speculation.
      • Can you indicate N CCPs from N cells in the data in Tables 2-3 for fibroblasts and U2OS cells? Do you observe and have to ignore a larger number of flat/clustered CCPs in the fibroblasts?
      • The last 3 paragraphs of the Introduction are results. The Introduction might best be used to review literature in more detail, discuss the reasons why uncertainty still exists and perhaps indicate how the methods applied here will help.

      Significance

      This is another excellent addition to a growing list of papers seeking to define the process of curvature generation at endocytic clathrin coated pits. In my opinion, its impact would be increased by better integrating the results presented here with other studies and methods, including the recent paper by Willy et al and the large body of literature on coated pit dynamics, some of which might be relevant in interpreting results, or at least placing them in a real vs pseudo-temporal perspective. The methods introduced and the quality of imaging, modeling and quantification further increase the study's significance. The finds will be of interest to those in the CME field, those studying membrane curvature generation in other contexts, those modeling CME, vesicle formation and curvature generation and those using SMLM to discern the structure of macromolecular assemblies.

      Reviewer expertise: Clathrin-mediated endocytosis (Sandra Schmid)

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their constructive comments and are pleased that all reviewers share our opinion, that the present study “makes an important contribution to the molecular architecture of mitochondria”, is in addition “an important advancement in our understanding of the mechanism by which Cqd1 regulates CoQ distribution” and will “thereby appealing to the broad readership of the journals”. We are convinced that addressing the important points raised by the reviewers will further strengthen the manuscript and result in additional significant insights in the molecular function of Cqd1.

      Reviewer #1:

      The major concerns affecting the conclusions are: 1) Experimental evidence is lacking on the contribution of contact site formation by Cqd1 to the effects on mitochondrial architecture and respiration-dependent growth. Determining the effects of the overexpression of the kinase-dead mutant on mitochondrial morphology and contact site formation with Por1-Om14 can address that.

      We thank reviewer #1 for raising these important points. Indeed, the various functions of Cqd1 might be independent from each other and so far we cannot distinguish between them. As suggested by the reviewer we will analyze the effect of overexpression of CQD1 in the Dups1 deletion mutant and make use of the point mutant in the conserved ATP binding domain which cannot complement the phenotype of the Dups1 Dcqd1 double deletion mutant. We generated a yeast mutant strain expressing Om14-3xHA in the absence of wild type Cqd1. Expression of the cqd1(E330A) mutant in the Om14-3xHA background and subsequent immunoprecipitation will allow us to test whether ATP binding is also essential for contact site formation. Preliminary experiments showed that the overexpression of cqd1(E330A) in the Dcqd1 deletion background results in a growth defect comparable to that caused by overexpression of CQD1 WT. Therefore, we think it might be more promising to analyze the interaction of Om14 and Cqd1 E330A at wild type level in order to avoid pleiotropic effects.

      In addition, we will further characterize the cqd1(E330A) mutant by analyzing the effect of its overexpression on mitochondrial morphology, cell growth and assembly of MICOS and F1FO ATP synthase in the Dcqd1 deletion background.

      2) Related to point #1, Cqd1 overexpression in deltaUsp1 cells could have addressed whether the role of Cqd1 in contact sites and mitochondrial architecture is independent of its role on CoQ distribution and phospholipid metabolism. Further characterization of the kinase-dead Cqd1 mutant on CoQ distribution, contact sites, mitochondrial archictecture and phsophsolipid metabolism might help discerning how these activities can be separated.

      We agree that the related points 1) and 2) raised by reviewer #1 are important and addressed our plans in the response on point 1).

      3) It is unclear how both Cqd1 overexpression and deletion induce mitochondrial fragmentation. Performing live cell imaging with a mitochondrial-phoactivatable GFP to measure mitochondrial fusion rates could help discerning the causes for fragmentation. It is a possibility that overexpression induced fragmentation by activating fission without changing fusion, while deletion induced fragmentation by blocking fusion.

      We thank reviewer #1 for bringing up this point. Perhaps our explanation in this respect was too short. Fig. 4E shows that deletion of CQD1 does not result in altered mitochondrial morphology, however, deletion of CQD1 in the Dups1 background leads to virtual complete fragmentation of the mitochondrial network. This is likely due to inhibition of mitochondrial fusion through disturbed processing of the fusion protein Mgm1 (see Fig. 4D). In contrast, overexpression of CQD1 does NOT result in formation of small mitochondrial fragments, but in formation of huge mitochondrial clusters which in addition contain a large proportion of ER membranes. So, we don’t think that this phenotype is related to either enhanced fission or reduced fusion. We will clarify this point in text of the revised manuscript.

      Minor comment:

      1) Figure 4 claims that mitochondrial function is impaired by ups1 deletion, which Cqd1 deletion exacerbates. However, no respiration data is shown in figure 1, only measurements of mitochondrial architecture are shown. Thus, oxygen consumption measurements are needed to claim effects on mitochondrial function.

      We did not want to claim that mitochondria lose respiratory competence upon simultaneous deletion of CQD1 and UPS1. Actually, our results indicate that the Dups1 Dcqd1 double deletion mutant grows like wild type on complete medium containing glycerol. Therefore, respiration is not impaired in this mutant. However, mitochondrial function is not restricted to ATP production by oxidative phosphorylation. The reviewer probably refers to Figure 4 where we show that mitochondrial biogenesis and dynamics are impaired in the Dups1 Dcqd1 double deletion mutant – the heading of the legend summarizes this as "mitochondrial function". We will be more precise in the revised version on this point and add a panel showing growth of the mutant strain on non-fermentable carbon source to avoid any further confusion.

      2) Some Western blots lack quantifications and statistical analyses of independent experiments.

      It is correct that some quantification and the respective statistics were missing in the initially submitted manuscript. We will add the requested information in the revised version of the manuscript.

      Reviewer #2:

      I have the following concerns for the authors to consider. (1) Although biochemical evidence shows that Cqd1 is likely a factor that forms CS structures in mitochondria, it would make the manuscript stronger if the authors can observe uneven distribution of Cqd1 in the mitochondrial membranes (assessed by fluorescent microscopy or ideally high-resolution microscopy) and the presence of Cqd1 in the region of close apposition of the OM and IM by immunogold labeling for electron microscopy.

      Two independent lines of evidence show that Cqd1 is a novel contact site protein: (i) it is found in the contact site fraction in density gradients (Fig. 6A), and (ii) it can be co-immunoprecipitated with outer membrane proteins (Fig. 6G, H, I). Furthermore, the co-IP is supported by cross-links of expected size (Fig. 6F). In sum, we feel that this is solid evidence to support our claim that Cqd1 is present in mitochondrial contact sites. However, it still might be interesting to check an uneven distribution of Cqd1 in mitochondria, as suggested by the reviewer. We will do this by 3D deconvolution fluorescence microscopy.

      (2) Since the structural characterization of Cqd1 is important to understand its interactions with the OM proteins and other UbiB protein kinase-like family proteins, Coq8 and Cqd2, take different orientations, the membrane topology of Cqd1 should be experimentally analyzed. The authors state, "two hydrophobic stretches can be identified in the Cqd1 sequence, of which the first one (amino acids 125-142) might be a bona fide transmembrane segment" (lines 97-100); then is Cqd1 a single membrane spanning protein or two-membrane spanning protein?  

      Unfortunately, it was not possible to test the location of the N terminus experimentally because an N-terminally tagged variant of Cqd1 (tag inserted between presequence and mature part) turned out to be unstable. We consider it very unlikely that the second hydrophobic stretch is a transmembrane domain as it is rather short (only 11 amino acids). Furthermore, several Cqd1 homologs in other fungi, including Yarrowia lipolytica, Aspergillus niger and Schizosaccharomyces pombe, are lacking the second hydrophobic stretch. Therefore, we propose that the major part of Cqd1 including the protein kinase-like domain is exposed to the intermembrane space. We will point out this more clearly in the revised manuscript.

      (3) The authors state, "conserved GxxxG dimerization motif (amino acids 504‐508)" (Fig. 1A caption), but this description needs a reference. The GxxxG motif was proposed to mediate transmembrane helix-helix association (https://doi.org/10.1006/jmbi.1999.3489), which is not consistent with the membrane topology proposed by the authors.

      We thank reviewer #2 for this comment. It is correct that GxxxG motifs are usually present in transmembrane a-helices. However, there is information available indicating that these motifs may also be present in soluble proteins and are stabilizing dimeric interactions for instance in the homodimeric Holliday-junction protein resolvase (Kleiger et al., 2002; doi: 10.1021/bi0200763.). However, as this point is not critical for our conclusions we will remove the discussion of the GxxxG motif from the revised manuscript.

      (4) What is the role of the kinase activity of Cqd1 in the CS formation? The effects of overexpression of Cqd1 (Fig. 7) should be tested for its E330A mutant.

      We also thank reviewer #2 for raising this important point similar to reviewer #1. Please see our response to point 1) of reviewer #1.

      (5) Is there stoichiometric as well as quantitative information on the 400 kD complex consisting of Cqd1, Por1 and Om14? Does the stoichiometry and amount of the complex depend on the growth condition? Does the complex contain other Por1 interacting IM proteins like Mdm31?

      We appreciate that reviewer #2 points out this important aspect. It might well be that the amount of the Cqd1 containing complex depends on growth conditions since its presence might be important for phospholipid homeostasis, CoQ distribution and mitochondrial architecture and morphology which for sure strongly depend on growth conditions. Therefore, we will try to analyze the amount of the Cqd1 complex present in mitochondria isolated from yeast cells grown on different media by BN-PAGE. So far we do not have any information on the stoichiometry of this complex and we feel that an analysis would go beyond the scope of this study. We agree with reviewer #2 that Mdm31 is an obvious candidate for an interaction partner of Cqd1. We actually tested this by co-immunoprecipitation using Cqd1-3xHA or Mdm31-3xHA. However, none of these approaches resulted in successful co-isolation of the potential interaction partner. We will mention this result in the revised manuscript.

      (6) For Fig. 7E, the authors state, "consistently, we observed dramatically increased mitochondria‐ER interactions Cqd1 overexpression", but this observation could be due to secondary effects because overexpression of Cqd1 itself already caused abnormal morphology of mitochondria.

      We thank reviewer #2 for bringing up this important point. To check whether the increased mitochondria‐ER interactions are a secondary effect due to altered mitochondrial morphology we will analyze the mitochondria‐ER interactions in other mitochondrial morphology mutants by fluorescence microscopy. This will reveal whether abnormal mitochondrial morphology generally leads to disturbed ER structure.

      (7) Since the antagonistic role of Cqd2 to Cqd1 was proposed, the results of the experiments for Cqd1 can be compared with those for Cqd2. For example, what will become of overexpression of Cqd2 instead of Cqd1 for Fig. 7? What is the lipid composition of the cqd1Dcqd2D double deletion mutant cells (the decreased PA level is recovered?)? Lines 424-425: In summary, overexpression of Cqd1 causes severe phenotypes on growth, formation of mitochondrial structural elements, and mitochondrial architecture and morphology. Is this phenotype affected by overexpression of Cqd2?

      This point raised by reviewer #2 is very interesting. Our preliminary experiments and previously published data (Tan et al., 2013) indicate that overexpression of Cqd2 is also toxic and results in the formation of huge mitochondrial clusters. Therefore, we will extend our study and analyze the effect of overexpression of CQD2, either alone or in combination with overexpression of CQD1.

      Reviewer #3:

      1) The central point of the paper is that Cqd1 is part of a novel contact site between the inner and the outer membrane. Om14 and Por1 were identified as outer membrane components of this contact site by immunoprecipitation. The data look convincing but they were generated from targeted experiments to test the involvement of suspected proteins. Ideally, one would like to see a cross-linking mass spectrometry (XL-MS) experiment that identifies the physical interactions of Cqd1 without bias.

      We thank reviewer #3 for acknowledging the presented data as convincing. Considering the significant amount of experiments planned for the revised version of the manuscript, we hope that reviewer #3 agrees that this point is not essential.

      2) Could an analogous blot of the MICOS complex be added to Figure 6D?

      Of course, we are happy to include BN-PAGE analysis showing the running behavior of MICOS next to the Cqd1 containing complex in Fig. 6D.

      3) In the Introduction, a host of contact sites is mentioned, which are partly from older papers. I'm not sure whether this is the accepted view of the field. Also, newer data suggest that the permeability transition pore is derived from complex V rather than ANT, CK, and VDAC. The authors should double check in order to represent the current state of the art

      We thank reviewer #3 for this comment. We will update this part according to the more recent literature.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Khosravi et al present a comprehensive characterization of the mitochondrial protein Cqd1. They show that Cqd1 is an integral inner membrane protein that affects the mitochondrial lipid composition and that Cqd1 deletion exacerbates the delta-ups phenotype, which is also related to abnormalities in lipids. Importantly, Cqd1 is part of a large protein complex that behaves like an inner-outer membrane contact site upon sucrose density gradient centrifugation. The outer membrane proteins Por1 and Om14 were identified as likely interaction partners of Cqd1. The authors demonstrate clearly that the complex is distinct from MICOS. The data are logically presented and the paper is well organized. The results are interesting and offer a new prospective on the function of Cqd1. Although the potential involvement in lipid metabolism is not developed from the mechanistic point of view, the discovery of a new contact site between the two mitochondrial membranes is important.

      Minor critique

      1. The central point of the paper is that Cqd1 is part of a novel contact site between the inner and the outer membrane. Om14 and Por1 were identified as outer membrane components of this contact site by immunoprecipitation. The data look convincing but they were generated from targeted experiments to test the involvement of suspected proteins. Ideally, one would like to see a cross-linking mass spectrometry (XL-MS) experiment that identifies the physical interactions of Cqd1 without bias.
      2. Could an analogous blot of the MICOS complex be added to Figure 6D?
      3. In the Introduction, a host of contact sites is mentioned, which are partly from older papers. I'm not sure whether this is the accepted view of the field. Also, newer data suggest that the permeability transition pore is derived from complex V rather than ANT, CK, and VDAC. The authors should double check in order to represent the current state of the art.

      Significance

      The paper makes an important contribution to the molecular architecture of mitochondria.

      My expertise is mainly in mitochondrial lipids

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      I have the following concerns for the authors to consider.

      1. Although biochemical evidence shows that Cqd1 is likely a factor that forms CS structures in mitochondria, it would make the manuscript stronger if the authors can observe uneven distribution of Cqd1 in the mitochondrial membranes (assessed by fluorescent microscopy or ideally high-resolution microscopy) and the presence of Cqd1 in the region of close apposition of the OM and IM by immunogold labeling for electron microscopy.
      2. Since the structural characterization of Cqd1 is important to understand its interactions with the OM proteins and other UbiB protein kinase-like family proteins, Coq8 and Cqd2, take different orientations, the membrane topology of Cqd1 should be experimentally analyzed. The authors state, "two hydrophobic stretches can be identified in the Cqd1 sequence, of which the first one (amino acids 125-142) might be a bona fide transmembrane segment" (lines 97-100); then is Cqd1 a single membrane spanning protein or two-membrane spanning protein?  
      3. The authors state, "conserved GxxxG dimerization motif (amino acids 504‐508)" (Fig. 1A caption), but this description needs a reference. The GxxxG motif was proposed to mediate transmembrane helix-helix association (https://doi.org/10.1006/jmbi.1999.3489), which is not consistent with the membrane topology proposed by the authors.
      4. What is the role of the kinase activity of Cqd1 in the CS formation? The effects of overexpression of Cqd1 (Fig. 7) should be tested for its E330A mutant.
      5. Is there stoichiometric as well as quantitative information on the 400 kD complex consisting of Cqd1, Por1 and Om14? Does the stoichiometry and amount of the complex depend on the growth condition? Does the complex contain other Por1 interacting IM proteins like Mdm31?
      6. For Fig. 7E, the authors state, "consistently, we observed dramatically increased mitochondria‐ER interactions Cqd1 overexpression", but this observation could be due to secondary effects because overexpression of Cqd1 itself already caused abnormal morphology of mitochondria.
      7. Since the antagonistic role of Cqd2 to Cqd1 was proposed, the results of the experiments for Cqd1 can be compared with those for Cqd2. For example, what will become of overexpression of Cqd2 instead of Cqd1 for Fig. 7? What is the lipid composition of the cqd1Dcqd2D double deletion mutant cells (the decreased PA level is recovered?) ? Lines 424-425: In summary, overexpression of Cqd1 causes severe phenotypes on growth, formation of mitochondrial structural elements, and mitochondrial architecture and morphology. Is this phenotype affected by overexpression of Cqd2?

      Significance

      Mitochondrial functions rely on the formation of intramitochondrial contact sites (CS) between the outer membrane (OM) and inner membrane (IM). It is established that MICOS, involved in cristae junction formation, contributes to the formation of the CS through its interactions with the OM proteins including the SAM complex, TOM complex, Por1 etc. However, it is also recognized that CS can be formed independently of MICOS. Here Khosravi et al. report that Cqd1 in the IM could interact with Por1 and Om14 in the OM to form MICOS-independent CS. Cqd1 was previously reported to be involved in normal cellular CoQ distribution. Now Cqd1 was shown to be genetically and functionally related to the mitochondrial lipid biosynthetic pathway involving Ups1 and Crd1. Deletion of the CQD1 gene causes PA (phosphatidic acid) to decrease and overexpression of Cqd1 causes abnormal IM morphology. Most of the experiments were carefully performed and the results are properly interpreted. The present findings will extend our understanding of the mitochondria membrane architecture significantly, thereby appealing to the broad readership of the journals.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Khosravi et al show that the protein cqd1, which was shown to export CoQ outside the mitochondria, forms a contact site by interacting with por1-om14. They conclude that the main role of this complex is to control mitochondrial architecture and phospholipid metabolism. The data shown to draw this conclusion are the effects of cqd1 overexpression altering mitochondrial morphology, as well as the exacerbation of the effects of usp1 deletion by Cqd1 deletion.

      The major concerns affecting the conclusions are:

      1. Experimental evidence is lacking on the contribution of contact site formation by Cqd1 to the effects on mitochondrial architecture and respiration-dependent growth. Determining the effects of the overexpression of the kinase-dead mutant on mitochondrial morphology and contact site formation with Por1-Om14 can address that.
      2. Related to point #1, Cqd1 overexpression in deltaUsp1 cells could have addressed whether the role of Cqd1 in contact sites and mitochondrial architecture is independent of its role on CoQ distribution and phospholipid metabolism. Further characterization of the kinase-dead Cqd1 mutant on CoQ distribution, contact sites, mitochondrial archictecture and phsophsolipid metabolism might help discerning how these activities can be separated.
      3. It is unclear how both Cqd1 overexpression and deletion induce mitochondrial fragmentation. Performing live cell imaging with a mitochondrial-phoactivatable GFP to measure mitochondrial fusion rates could help discerning the causes for fragmentation. It is a possibility that overexpression induced fragmentation by activating fission without changing fusion, while deletion induced fragmentation by blocking fusion.

      Minor comment:

      1. Figure 4 claims that mitochondrial function is impaired by ups1 deletion, which Cqd1 deletion exacerbates. However, no respiration data is shown in figure 1, only measurements of mitochondrial architecture are shown. Thus, oxygen consumption measurements are needed to claim effects on mitochondrial function.
      2. Some Western blots lack quantifications and statistical analyses of independent experiments.

      Significance

      The finding that Cqd1 forms new contact sites and interacts with Usp1 is significant and is an important advancement in our understanding of the mechanism by which Cqd1 regulates CoQ distribution. This work will be of high interest to researchers on the mitochondria field, CoQ biogenesis, and inter and intra-organellar communication.

      However, it is still unclear whether the effects observed on mitochondrial architecture are just secondary to disturbed CoQ distribution or whether they are a primary consequence of Cqd1 forming these contact sites (effects independent of CoQ distribution and lipid metabolism as concluded by the authors).

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01288R

      Corresponding author(s): Florence Naillat, Seppo Vainio, Dagmar Iber

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We would like to thank the reviewers for their positive and constructive reviews. We have already addressed their major concerns by including additional data, especially in the new Figure 3. We also detail the planned experiments that we propose to perform to address their remaining comments. Some points are mentioned in both section 2 and 3 of the revision plan.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Major Comments:

      1. Fig 3, I don't understand why the control lost Six2 expression (Fig 3A). Fig 3E is not consistent withFig 3d, which showed that the treatment of Fgf8 antibody significantly decreases the number of live cells. Lastly, the 3D culture matrix experiments did not provide evidence on the role of Fgf8 for NPC condensation.
      2. *

      The figure 3A showed a reduced expression of SIX2 expression (Red) in the NPC population in the kidney. When the NPC population is cultured without any FGF ligands, the SIX2 expression in the NPCs disappeared. Our results are similar to Dapkunas et al 2019 (Arvydas Dapkunas, Ville Rantanen, Yujuan Gui, Maciej Lalowski, Kirsi Sainio, Satu Kuure, and Hannu Sariola. Simple 3d culture of dissociated kidney mesenchyme mimics nephron progenitor niche and facilitates nephrogenesis wnt-independently. Scientific reports, 9(1):1–10, 201).

      • *

      The figure 3D showed that the antibody against FGF8 prevented the SIX2 expression in the NPC population from the nephrosphere experiment. We have modified the legend of figure 3D.

      • *

      We agreed with the reviewer that this result might confuse the reader. We are carrying another set of experiment for the Figure 3E where nephrospheres will be treated with and without FGF8 ligand instead of culturing a full kidney with and without FGF8 ligand for 24 hours as mentioned in figure 3E.

      We would like to mention that Dapkunas et al. 2019 demonstrated that dissociated kidney mesenchyme which contain the NPC population formed spontaneously self-organized spheres with the addition of FGF2 ligand and PP2 a Src inhibitor. By staining with Pax2 antibody which is a marker for progenitors and early nephron precursors we could show similarly as in Dapkunas et al that ectopic FGF8 ligand induced PAX2 expression whereas the antibody against FGF8 did not induce PAX2 expression in the cultured nephrosphere (Figure 3E-G)

      • *

      At the end of "A model based on Fgf8-induced motility leads to robust condensation of NPC", there is not a conclusive sentence.

      We have modified the text. We have written “We conclude from these simulation results that the chemokinetic effect of FGF8 enables the niche-wide distribution of NPCs. This allows them to reach the vicinity of the UB and also to enter the sphere of influence of epithelial factors that support the immobilization of NPCs. The corresponding motility gradient that appeared in the simulations (Supplementary Fig. Sup3) is in agreement with experimental observations Combes et al. 2016 (Alexander N. Combes, James G. Lefevre, Sean Wilson, Nicholas A. Hamilton, and Melissa H. Little. Cap mesenchyme cell swarming during kidney development is influenced by attraction, repulsion, and adhesion to the ureteric tip. (2016) Developmental Biology, 418(2):297–306. The simulations also show that excess FGF8 can override the guidance of epithelial signaling and prevent mesenchymal condensation.”

      Whole kidney qPCR results are not enough to support the claim of incomplete deletion of Fgf8 in mouse models. Protein staining or mRNA detection in section is required to support the claim. In addition, clear explanation is required on how the phenotypes of Fgf8 KO mice are associated the function of Fgf8 for NPC condensation.

      The qPCR results of the figure 3I and J have been carried out on the nephrospehere assays. In figure 3I, the nephrosphere assay which consist of culturing the kidney mesenchyme with ectopic FGF8 ligand for 24 hours. This showed that the NPCs markers were sustained due to FGF8 ligand. This is further confirmed with the staining of PAX2 a marker for progenitors and early nephron precursors which stained the aggregated NPC cells expressing SIX2 marker as in Dapkunas et al. 2019 (Figure 3E-G).

      • *

      I can not understand the last sentence well "Further work is required to reveal how Fgf8 along with its receptors and inhibiting factors orchestrates NPC condensation, its ......"

      We have modified the text. We have written “It is known that FGF8 ligand interacts with several FGF receptors and such interaction can also be modulated by heparan sulfate proteoglycan which will consequently regulate the gradient of FGF8 concentration (Harish et al. 2022 bioRXiv). Towards this goal a detailed ligand/receptor interactions in \textit{in vivo} is required to fully understand how FGF8 imparts its function.”

      • *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Major concerns:

      • Parametric tests are not appropriate for a small sample size; non-parametric tests should be applied to examine if data are robust. Specifically, in Figures 3F and 3G, where n=3.

      *We would like to thank the reviewer as we did not write the correct statistical test that we have carried out. It is the 2-way Anova Sidak multiple comparison that is a non-parametric test. We have corrected the legend of the figure 3. *

      • *

      • In Figure 2E it is unclear how many kidneys were analyzed since the graph shows an n=5 per genotype while the figure legend indicates an n=6. Similarly, please indicate the number of independent biological samples analyzed in panel 2G rather than only the total number of cells, and specify the statistical test used for data analysis.

      We would like to thank the reviewer for catching our written mistake for the figure 2E. We have analysed 5 kidneys and we have corrected the legend. We have added the number of independent biological samples analyzed in the figure 2G and the statistical test (Wilcoxon signed-rank test) used in the legend.

      • *

      • Can the authors clarify how the experiment described in Figure 3E was performed? It is unclear if NPCs were treated with FGF8 ligand (as indicated in the chart legend) or with an anti-FGF8b antibody (as described in the figure legend). Moreover, the authors stated that "the loss of Six2 expression as a result of the absence of FGF8 was not completely due to cells death as more live cells were observed", however, the number of live cells seems similar between control and FGF8-treated nephrospheres. Can the authors comment on that?

      *The figure was mislabeled the NPC were treated with the ectopic FGF8 ligand (mistake has been corrected in manuscript). *

      To detect the number of cells (dead/live), the flow cytometry was utilized on a full cultured kidney for 24 hours with or without ectopic FGF8 ligand. This method requires several washing steps which can remove the dead cells during the procedure. However, we are planning to repeat the same experiment using the nephrosphere assay where the nephrosphere will be cultured with or without ectopic FGF8 ligand for 24 hours before being sorted to check the live and dead cells.

      • *

      • The authors argued that "the NPCs, the Six2+ cells, accumulate around the UB tip and that NPC induction is interrupted failing the PTAs formation". Please include quantification of Lhx1-positive structures to assess the number of PTA structures in wildtype, as well as, in Pax8Cre;Fgf8n/c and Wnt4Cre;Fgf8n/c mutant kidneys.

      As we never have worked with Lhx1 antibody before, we are optimizing the protocol for the staining of the Lhx1 antibody combined with Troma (epithelial marker) and Six2 (NPC population marker) antibodies to highlight the NPC population from the ureteric bud in the WT kidney slides. We have few sample slides for Pax8Cre;Fgf8n/c and Wnt4Cre;Fgf8n/c mutant kidneys and we would like a working protocol before any staining.

      • It is unclear if both male and female offspring were collected. If so, did the authors observe sex-related differences in outcomes?

      *We have use male and female embryos. We did not genotype for the sex of the embryos in any of the experiments. In such way the sample collection was unbiased regarding the sex of the embryos. *

      Minor concerns:

      • Abbreviations should be defined at first mention in the text (e.g. "MET" in the second paragraph of the Introduction) and in each figure/table legend (e.g. UB, CM, tNPCs, PTA in Figure 1).

      We appreciate the suggestion and include all the abbreviation in the text and in the legend of Fig1.

      • In Figure 7, please label the structures (kidney; ureters, and bladder) in the urogenital system of control and mutant mice to facilitate the reader's understanding.

      We appreciate the suggestion and have labelled the structures in the figure 7.

      • For consistency, please include an inset showing a higher magnification image for Figure 8C. We followed the reviewer’s suggestion and have included a higher magnification image for the figure 8C.

      • Please revise the following sentence for clarity: "Primary antibody incubation duration and temperature was."

      We clarified the sentence and have added the temperature and the time for the staining for each staining in the table 2.

      • The legend of Supplementary Figure 1 needs to be improved, as it does not contain all information required to fully understand the presented results.

      We have rewritten the legend of the supplementary figure 1 as the reviewer suggested.

      • In Supplementary Figure 2, please include the phospho-GSK3β relative expression normalized to GSK3β expression.

      We have calculated the ratio of phospho Gsk3 to GSK3 and no statistical difference was found.

      • *

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      • *

      Several minor comments regarding typos and simple errors have already been incorporated in the transferred manuscript. The changes are highlighted in blue in the revised submission.

      *We have addressed all the minor comments that the reviewers have kindly highlighted to us. We feel these were straightforward to do and feasible in a short time, so do not require a detailed listed plan. *

      As mentioned above we are planning to stain the samples WT, Pax8Cre;Fgf8n/c and Wnt4Cre;Fgf8n/c mutant kidneys with Lhx1 antibody counterstained with Troma and Six2 markers and quantify the number of observed PTA structures


      Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      • *
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This is a well-written and organized manuscript that investigated the role of FGF8 in chemokinesis and condensation of nephron progenitor cells to the ureteric bud during metanephric kidney development. The results described in this present study are scientifically relevant, and the figures clearly support the content and authors' conclusions. However, there are some major and minor concerns that should be addressed by the authors.

      Major concerns:

      • Parametric tests are not appropriate for a small sample size; non-parametric tests should be applied to examine if data are robust. Specifically, in Figures 3F and 3G, where n=3.
      • In Figure 2E it is unclear how many kidneys were analyzed since the graph shows an n=5 per genotype while the figure legend indicates an n=6. Similarly, please indicate the number of independent biological samples analyzed in panel 2G rather than only the total number of cells, and specify the statistical test used for data analysis.
      • Can the authors clarify how the experiment described in Figure 3E was performed? It is unclear if NPCs were treated with FGF8 ligand (as indicated in the chart legend) or with an anti-FGF8b antibody (as described in the figure legend). Moreover, the authors stated that "the loss of Six2 expression as a result of the absence of FGF8 was not completely due to cells death as more live cells were observed", however, the number of live cells seems similar between control and FGF8-treated nephrospheres. Can the authors comment on that?
      • The authors argued that "the NPCs, the Six2+ cells, accumulate around the UB tip and that NPC induction is interrupted failing the PTAs formation". Please include quantification of Lhx1-positive structures to assess the number of PTA structures in wildtype, as well as, in Pax8Cre;Fgf8n/c and Wnt4Cre;Fgf8n/c mutant kidneys.
      • It is unclear if both male and female offspring were collected. If so, did the authors observe sex-related differences in outcomes?

      Minor concerns:

      • Abbreviations should be defined at first mention in the text (e.g. "MET" in the second paragraph of the Introduction) and in each figure/table legend (e.g. UB, CM, tNPCs, PTA in Figure 1).
      • In Figure 7, please label the structures (kidney; ureters, and bladder) in the urogenital system of control and mutant mice to facilitate the reader's understanding.
      • For consistency, please include an inset showing a higher magnification image for Figure 8C.
      • Please revise the following sentence for clarity: "Primary antibody incubation duration and temperature was."
      • The legend of Supplementary Figure 1 needs to be improved, as it does not contain all information required to fully understand the presented results.
      • In Supplementary Figure 2, please include the phospho-GSK3β relative expression normalized to GSK3β expression.

      Significance

      This study provides provide conceptual and methodological insights relevant to the field and it will be of considerable interest to the readers.

      My field of expertise: kidney development and disease (mouse models, kidney explants, cell culture). I do not have sufficient expertise to evaluate the 2D simulations of NPC condensation to the ureteric epithelium.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      NPC condensation is essential for nephron formation, but underlying regulation mechanisms remain elusive. Previous studies have demonstrated the important role of Fgf8 for the survival of NPCs. In this manuscript, Sharma et al reveal a novel function of Fgf8. By using mouse models, quantitative imaging assays, and data-driven computational modeling, they demonstrated the crucial role of Fgf8 signaling for the coordination of NPCs behaviors to the UB, especially for NPC condensation. Generally speaking, the manuscript was well organized and written. The experiments and analysis were well done. However, I have following concerns:

      1. Fig 3, I don't understand why the control lost Six2 expression (Fig 3A). Fig 3E is not consistent withFig 3d, which showed that the treatment of Fgf8 antibody significantly decreases the number of live cells. Lastly, the 3D culture matrix experiments did not provide evidence on the role of Fgf8 for NPC condensation.
      2. At the end of "A model based on Fgf8-induced motility leads to robust condensation of NPC", there is not a conclusive sentence.
      3. Whole kidney qPCR results are not enough to support the claim of incomplete deletion of Fgf8 in mouse models. Protein staining or mRNA detection in section is required to support the claim. In addition, clear explanation is required on how the phenotypes of Fgf8 KO mice are associated the function of Fgf8 for NPC condensation.
      4. I can not understand the last sentence well "Further work is required to reveal how Fgf8 along with its receptors and inhibiting factors orchestrates NPC condensation, its ......"

      Significance

      It is novel to study the role of Fgf8 for NPC condensation as a chemokine.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all reviewers for their very helpful comments. We feel that the comments pointed to a few main issues that we could remedy. First, we found that many comments and concerns could be addressed with work from our previous paper (doi.org/10.1101/2020.11.24.396002). To fix this, we added additional descriptions of experiments done previously and additional citations. We discussed more in depth an experiment that shows that ciliary membrane and membrane proteins can indeed come from the cell body plasma membrane, we talked more about how we determined that the actin puncta are representative of membrane remodeling functions like endocytosis, and we discussed some of the mechanistic insights provided by our previous work that are applicable here. We hope that this helps to answer several of the reviewer questions. Second, there were a few experiments we thought would be useful to add. These are represented in bold in our responses below. Briefly, we added a measure of internalization or endocytosis in the drp3 mutant, we added some images of cilia to the phalloidin figure to orient readers’ views of the cell, we added some additional mechanistic insight (supplemental figure 3), and we added an axoneme stain to confirm that the axoneme was extending (supplemental figure 4). Finally, we fixed some of our wording in the paper to represent our findings more accurately. Together, we hope that these revisions will address the reviewer concerns.

      Additionally, we added some data that we collected while waiting on reviews. We investigated the requirement for myosin in this pathway and include this data in the supplement.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The current manuscript by Bigge et al. demonstrated that the chemical inhibition of GSk3 causes ciliary elongation in Chlamydomonas reinhardtii. They show that lithium induced ciliary lengthening is majorly due to GSK3 inhibition. Consistent with earlier reports, they show that new protein synthesis is not required for lithium induced ciliary elongation. The authors report that targeting endocytosis either by using chemical inhibitors (dynasore and CK-666) or genetic mutants (dpr3 and Arpc4) does not cause lithium induced ciliary elongation. They further reveal enhanced actin dynamics in lithium treated cells and such activity is lost in Arpc4 mutants. Based on these results, the authors concluded that endocytic pathways may be involved in lithium induced ciliary lengthening. The results are interesting, and this work is important in understanding more about ciliary length regulation. However, more experimental evidence addressing the current interpretation that endocytic pathways may be involved in lithium induced ciliary lengthening is required.

      Major comments: 1 The authors use chemical inhibitors as major tools for their study. However, the specificity of these inhibitors is a concern. How specific are these GSK3 inhibitors such as LiCl? Can authors show that LiCl mediated ciliary lengthening is due to inhibition of GSK3? Authors used BFA and Dynasore to show that not the Golgi, but the endocytosis derived membrane is required for ciliary lengthening. Again, here the specificity of these inhibitors is a concern. Especially as Dynasore has been shown to have non-specific effects.

      We agree that the specificity of chemical inhibitors can be a concern. This is why we used 4 separate inhibitors of GSK3, each showing elongation of cilia and an increase in actin puncta (suggesting an increase in actin dynamics at the membrane). While these different inhibitors may have different off-target effects. Their intended target, GSK3, is the same, suggesting that the shared phenotype from each inhibitor is conserved. The ability of LiCl to affect GSK3 activity in Chlamydomonas was also investigated in depth with a kinase assay and a western blot in Wilson, 2004 (doi: 10.1128/EC.3.5.1307-1319.2004). To address the off-target effects of Dynasore, we employed the drp3 mutant to confirm genetically what we saw from the chemical inhibition. We also show in our previous paper that Dynasore and PitStop2 have similar effects in Chlamydomonas, both of them inhibiting the internalization of a dye-labelled membrane, suggesting that they both function to block endocytosis (doi.org/10.1101/2020.11.24.396002). While no mutant or alternative inhibitor is available to look at the effects of BFA, this inhibitor and its effects on cilia have been well-characterized in Dentler, 2013 (doi.org/10.1371/journal.pone.0053366).

      Does inducing/enhancing endocytosis independent of GSK3 by other means has any effect on ciliary length regulation?

      Our concern with the proposed experiment is that even if elongation requires endocytosis, all endocytosis might not lead to ciliary elongation when endocytosis is for other purposes. For example, endocytosis could occur for other purposes, like nutrient uptake, that will have no effect on cilia. The plasma membrane to cilium pathway may be a targeted pathway triggered by specific disruptions. Therefore, we don’t feel that the proposed experiments will add to our model.

      The major claim of this paper is that LiCl mediated ciliary lengthening is due to enhanced endocytosis. Although authors showed that inhibition of endocytosis results in reduced ciliary length, it is important to show if GSK3 inhibition by LiCl (or any other inhibitor) causes any increased cellular endocytosis? Similarly, what is the effect of GSK3 mutants on endocytosis?

      *We show an increase in actin dynamics at the membrane and actin puncta following treatment with LiCl and the other GSK3 inhibitors. We show here and in our previous paper (doi.org/10.1101/2020.11.24.396002), that these puncta are likely endocytic based on the timing of their appearance and the proteins required for puncta formation (including the Arp2/3 complex and Clathrin) (Figure 7, previous paper). We updated our latest version to reflect the data we have already collected and presented as follows: *

      “Further, they rely on proteins typically thought to be involved in endocytosis including the Arp2/3 complex and clathrin, and they form at times when it makes sense for endocytosis to be occurring, like immediately following deciliation when membrane and protein must be recruited to cilia in a timeframe too short for new protein and membrane synthesis, sorting, and trafficking (Bigge et al. 2020). Thus, we stained cells with phalloidin to visualize filamentous actin and these endocytosis-like punctate structures when cells are treated with GSK3 inhibitors.”

      A phenotypic mutant of GSK3 does not currently exist in Chlamydomonas, and methods of reliably introducing mutants in Chlamydomonas do not currently exist. Thus, we used the array of GSK3 inhibitors.

      Are these endocytic processes enhanced specifically at/or around the cilium during the ciliary lengthening process?

      *Based on our phalloidin staining data, these processes are primarily enhanced near the cilium, but puncta also exist throughout the cell. To more clearly show this and in response to a comment from reviewer 2, we added a set of images with brightfield to demonstrate where the dots are in relation to cilia. We also added arrows to the images in the figure to point out the apex of the cell as determined by the filamentous actin structures in the cells. *

      Authors claim that drp3 is a target of GSK3 and, similar to the canonical dynamin, functions in endocytosis. While, it is an important observation, experiments are required to show the role of drp3 in endocytosis and also to show that it is indeed a target of GSK3.

      To address this comment, we are employing an experiment that was designed in our previous paper (doi.org/10.1101/2020.11.24.396002, Figure 5B-E). This experiment uses a lipophilic membrane dye, FM4-46FX. The dye binds to the membrane but is unable to enter the cell alone. It is quickly endocytosed and results in vesicular-like structures within the cell. We added a panel to Figure 3 where we do this experiment in wild-type and ____drp3 mutant cells. This shows that endocytosis is affected by the mutation in DRP3. The discussion of this new data is summarized in the text as follows:

      “Additionally, we showed that this DRP is required for internalization of a lipophilic membrane dye, FM4-46FX through endocytosis. This dye binds to the membrane but is unable to enter the cells on its own and must be endocytosed. In wild-type cells it is quickly endocytosed and visible as puncta within the cell (Figure 3F, H) (Bigge et al. 2020). However, in drp3 mutants the amount of dye endocytosed is significantly lower (Figure 3G-H), suggesting that DRP3 is required for optimal endocytosis in these cells.”

      Mechanistic insights into how endocytosis/actin dynamics regulate ciliary lengthening would be interesting to see. Further, it is interesting to see if the ciliary signaling defects caused by abnormal ciliary length can be rescued by inhibition of endocytosis.

      *In our previous paper (doi.org/10.1101/2020.11.24.396002), we dive into the mechanisms tying together actin dynamics, endocytosis, and cilia. We find that Arp2/3 complex-nucleated actin networks are required for endocytosis to reclaim ciliary membrane and membrane proteins from a pool in the plasma membrane for the rapid early stages of ciliary assembly. We believe that this is a similar mechanism to what is occurring when cells elongate following lithium treatment. This is because there are several parallels in phenotypes: *

      -The Arp2/3 complex is required for both ciliary assembly (Figure 1, previous paper) and ciliary elongation resulting from lithium treatment. In the case of ciliary assembly, treating with cycloheximide to block the synthesis of new protein fully eliminates regrowth in the absence of the Arp2/3 complex, suggesting this Arp2/3 complex dependent mechanism in early ciliary assembly does not involve new protein synthesis (Figure 2, previous paper). Similarly, the process of ciliary elongation in response to lithium does not require new protein synthesis.

      *-A burst in actin dynamics/actin puncta occurs immediately following deciliation during early regrowth and during growth initiated by lithium treatment. We know these puncta are Arp2/3 complex and clathrin dependent (Figures 4 and 7, previous paper). *

      *-Both initial ciliary assembly or ciliary maintenance and elongation of cilia due to lithium treatment require endocytosis (Figures 5, 7-8, previous paper) but not require Golgi-derived membrane (Figure 3, previous paper). *

      *-Also in the previous paper, we find that this mechanism is required for the internalization and relocalization of a ciliary membrane protein for mating (Figure 6, previous paper). We also find that ciliary membrane proteins move from the plasma membrane to the cilia during ciliary assembly (Figure 7-8, previous paper). *

      *This is summarized in the text as follows: *

      *In the introduction we added: *

      “Previous data from our lab suggest that the Arp2/3 complex and actin are involved in reclaiming material from the cell body plasma membrane that is required for normal ciliary assembly (Bigge et al. 2020). We show that the Arp2/3 complex is required for the normal assembly of cilia and for endocytosis of both plasma membrane and plasma membrane proteins in various contexts. Further, we find that deciliation triggers Arp2/3 complex-dependent endocytosis by observing an increase in actin puncta immediately following deciliation (Bigge et al. 2020).”

      And in the discussion we added:

      “Previous work has shown that while the Golgi is required for ciliary maintenance and assembly (Dentler 2013), it is not the only source of membrane. Instead, we found that membrane reclaimed through actin and Arp2/3-complex dependent endocytosis is required for ciliary assembly or growth from zero length (Bigge et al. 2020). More specifically, we found that the Arp2/3 complex is required for normal ciliary maintenance and ciliary assembly, especially in the early stages when membrane and protein are needed quickly. The Arp2/3 complex is also required for the internalization of membrane and a specific ciliary membrane protein required for mating. Further, we show that endocytosis-like actin puncta form immediately following deciliation in an Arp2/3 complex and clathrin-dependent manner, and that membrane from the cell body plasma membrane can be reclaimed and incorporated into cilia (Bigge et al. 2020). This led us to question whether that same mechanism might be required for ciliary elongation from steady state length induced by lithium treatment.”

      Minor comments: 1. The paper needs a thorough proof reading as it harbors many spelling mistakes, grammatical errors, and poor sentence formation in multiple instances.

      *The paper was thoroughly read, and spelling mistakes and grammar were fixed. *

      Supplemental Figure S2A and S2B should be quoted separately from S2C and S2D.

      *This was updated in the latest version of the paper. *

      In Page 6 paragraph 2 - "authors wrote "To determine if GSK3 could be a potential kinase for this protein, we employed ScanSite4.0, which confirmed that of the 9 DRPs of Chlamydomonas, the only one with a traditional GSK3 target sequence was DRPs (Supplemental Figure 2)." No data is shown in S2 with regard to this. Either data needs to be shown or change the text in a way to avoid confusion.

      *The text was changed in a way to avoid confusion. *

      It would be nice to see if GSK3 can actually phosphorylate DRP3.

      *This would be interesting, however there is not currently a simple way to test this. There is not an antibody for DRP3 that shares enough of its immunogen sequence with the Chlamydomonas DRP3 sequence to use for a western blot. *

      The authors observe that arpc4 mutants do not form actin puncta upon LiCl treatment. Could this phenotype be rescued by complementing with WT ARPC4.

      *We showed in our previous paper (doi.org/10.1101/2020.11.24.396002) that the actin puncta could be rescued by re-expression of wild-type ARPC4 (Figure 4). *

      The concentration of inhibitors is described differently in the text and figure legends (for example Fig. 4A)

      *In the figure legend of figure 4, the concentration of 6-BIO was accidentally reported as 100 µM instead of the correct value (100 nM) as it was throughout the rest of the paper. This was addressed in the latest version. *

      The p values are not significant in some of the figures. (Fig. 4D &Fig. 5C)

      P values were provided for all comparisons in an effort to be transparent and so that readers could draw their own conclusions about the data.

      Reviewer #1 (Significance (Required)):

      The current manuscript by Bigge et al. demonstrates that endocytosis is required for GSK3 inhibition mediated ciliary lengthening. Maintenance of proper length of cilia is crucial and its dysregulation results in pathogenesis. This work takes the field forward and helps in our understanding of how ciliary length is regulated. This work is of interest to researchers working in the field of ciliary biology as well as to those working on endocytosis.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors show in this study that Lithium and other GSK3-beta inhibitors induce cilia elongation in Chlamydomonas. They further demonstrate that inhibition of endocytosis by Dynasore prevents the induced elongation of cilia. They speculate that a Dynamin-related protein might be involved in this process, and determine 9 Dynamin related proteins (DRPs) in Chlamydomonas of which DRP3 shows the highest sequence similarity. Lithium-induced ciliary elongation is prevented in DRP3 mutants supporting the author's hypothesis and indicating that DRP3 might be a GSK3-beta target, similar to some animal Dynamins. Since Dynamins interact with the F-actin regulator ARP3/3-complex, and because F-actin reorganization is observed in cells after GSK3-beta inhibition, they test the induction of ciliary elongation in arpc4 mutants and after blocking the ARP-complex by CK-666. Indeed, F-actin remodeling and cilia elongation were prevented after loss of ARP-complex function. The induction of ciliary elongation and F-actin remodeling also correlates with the emergence of strong F-actin punctae in cells, and the authors interpret that as induction of Dynamin-dependent endocytosis (also addressed in a current preprint from the group). From that, the conclude that endocytosis is required for delivering membrane to the growing cilium and that this is required for the observed effects. While this claim is somewhat supported by a lack of cilia elongation inhibition after treatment to prevent protein synthesis or Golgi function, direct evidence for membrane delivery to the cilium, the need for membrane delivery for ciliary elongation, and presence of bona fide endocytotic vesicles is sadly missing. Therefore, this study sheds new light on an important process in ciliary functional regulation and also furthers our understanding on why GSK3-beta inhibition induces elongated cilia in many cell systems, but I am not convinced that the conclusions are actually supported by the data, as the two key points in question were not experimentally addressed at this point.

      Main points: 1. The authors need to demonstrate that new membrane is delivered in the process to the growing cilium. E.g. this could be done by membrane stains (pulse) and static or live-cell imaging analysis in untreated, GSK3-beta inhibitor treated and in mutants.

      *In our previous paper (doi.org/10.1101/2020.11.24.396002), we do an experiment similar to the one described here (Figure 8, previous paper). We biotinylated all surface proteins, then removed the cilia (and therefore all labelled ciliary surface proteins) and allowed them to regrow. We then isolated the new cilia and probed for biotinylated proteins because any biotinylated proteins must have come from the surface of the cell. We found that the cilia did contain membrane proteins from the surface of the cell. This experiment shows that membrane and membrane proteins derived from the plasma membrane are entering growing cilia during regeneration. We added a description of this experiment to the text as follows: *

      “Conversely, when treated with Dynasore to inhibit endocytosis, cilia could not elongate to the same degree as untreated cells (Figure 3A-B), implying endocytosis is required for lithium-induced elongation and that endocytosis requires dynamin. This is consistent with results from our previous studies which show that ciliary membrane and membrane proteins are delivered from the cell body plasma membrane to the cilia. In an experiment first performed in Dentler 2013 and then later in Bigge et al. 2020, we biotinylated all cell surface proteins. Then, deciliated cells and allowed cilia to regrow. We then isolated cilia and probed for biotinylated proteins. Any biotinylated proteins present must have come from the cell body plasma membrane, and we found that indeed biotinylated proteins exist in the newly grown cilia, suggesting that ciliary membrane and membrane proteins can be recruited from the cell body plasma membrane (Dentler 2013; Bigge et al. 2020).”

      However, this experiment cannot be done in the case of lithium because cilia are not removed meaning they already will contain labelled surface proteins. Additionally, cells do not regrow cilia in the presence of lithium, meaning that we cannot add a regeneration. Regardless, work from our previous paper described above does establish that ciliary membrane and membrane proteins are able to come from the cell body plasma membrane as the reviewer requested.

      Along the same line, the authors need to demonstrate that the punctae are truly endocytotic vesicles. For that uptake assays/stains could be used and additional markers. Furthermore, there are multiple modes of endocytosis (e.g. Clathrin) besides Dynamin. The authors should determine if blocking other modes of endocytosis has similar or divergent effects on cilia elongation.

      *In our previous paper (doi.org/10.1101/2020.11.24.396002) we supplement the actin puncta data with membrane labelling to show that the puncta are likely endocytic pits (doi.org/10.1101/2020.11.24.396002, Figure 5). We also show that the puncta require both the Arp2/3 complex and active clathrin to form, further suggesting that they are endocytic (Figure 7, previous paper). We added this to the paper as follows: *

      “Further, they rely on proteins typically thought to be involved in endocytosis including the Arp2/3 complex and clathrin, and they form at times when it makes sense for endocytosis to be occurring, like immediately following deciliation when membrane and protein must be recruited to cilia in a timeframe too short for new protein and membrane synthesis, sorting, and trafficking (Bigge et al. 2020). To provide additional evidence that these are endocytic puncta, we also showed that a corresponding increase in membrane internalization occurs during this same timeframe using a fluorescent membrane dye that is endocytosed in wild-type cells (Bigge et al. 2020).”

      Additionally, Dynamin is required for most forms of endocytosis, including clathrin mediated endocytosis. In the previous paper (doi.org/10.1101/2020.11.24.396002), which we cite here, we do a deep dive into which endocytic proteins are present in Chlamydomonas. We found that clathrin mediated endocytosis is the most highly conserved on the endocytic processes we looked at (Figure 5, previous paper).

      We did add a new figure to this paper (Figure 4) using a dye that labels membrane in lithium treated cells. This dye binds to the plasma membrane but is unable to enter cells by itself and must be endocytosed. We found that during the first 30 minutes of lithium treatment there is increased membrane dye internalization.

      No cilia are actually shown in the study. I personally, would like to see how these cilia look like, especially in relation to the sites of F-actin remodeling and punctae formation. What comes first? Please also provide a axoneme staining to confirm elongation of the ciliary core and what happens to the tubulin pool when cilia cannot elongate any more? Is it accumulating at the ciliary base?

      We added a panel demonstrating where the puncta are in relation to cilia in Figure 4 with a brightfield overlay.* We also look at the appearance and timing of these puncta more in depth in our previous paper (doi.org/10.1101/2020.11.24.396002, Figure 7). We find that puncta form immediately following deciliation and start to return to normal following about 10 minutes of regrowth. We think that this mechanism of ciliary elongation in lithium is similar to what occurs during those early steps of ciliary assembly suggesting that the dots likely form very early on. *

      We also included axoneme staining in Supplemental figure 4*. We show that the axoneme does continue to elongate with the cilia. After about 90 minutes, the cilia actually stop growing and detach from the cells (doi: 10.1128/EC.3.5.1307-1319.2004, doi: doi.org/10.1247/csf.12.369). However, we are interested in the more acute mechanisms that result in ciliary elongation. *

      The authors also claim that the method of GSK3 inhibition is not important. It would be more correct to say that the mode/drug of GSK3 inhibition is not important, but discuss how some of the minor variance between treatments could be explained (incl. the timeline and temporal dynamics of the diverging effects; and the dose-dependency as low concentrations of BIO seem to induce shortening but high doses induce elongation of cilia).

      *We further discussed this in the text as follows: *

      “The minor variances between the drugs could be explained by the timeline in which we tested cilia (90 minutes) or the exact dosages we used. An example of this is 6-BIO where treatment with a low dose of 100 nM caused ciliary lengthening, but treatment with a higher concentration of 2 µM reportedly caused ciliary shortening (Kong et al. 2015). Together, the data suggest that the mode of inhibition by chemical targets of GSK3 is not important for ciliary lengthening. Whether GSK3 was inhibited via competition for ATP binding or phosphorylation, cilia were able to elongate.”

      They propose here a positive effect of F-actin build up in cilia length regulation, while most studies to date report ciliary shortening to correlate with increased F-actin at the ciliary base. I believe that this is not highlighted and discussed enough, which I find reduces the overall quality of the paper (but is easy to improve). It might be also interesting to test if other F-actin inducers/stabiliziers have the same effect?

      *This is addressed in the discussion in the latest version in depth as follows: *

      “One important detail to point out is that Chlamydomonas differ from mammalian cells in that they have a cell wall. The stability awarded by the cell wall means that Chlamydomonas does not require a cortical actin network as mammalian cells do. Thus, in Chlamydomonas, we are able to investigate actin dynamics and functions without the interference of the cortical actin network. This also means that some of the effects we see might be masked in mammalian cells by the presence of the cortical actin network and the effect that it has on ciliary assembly and maintenance.”

      *We also added a section to the introduction to address this concern early on so that readers will have this difference in mind as they read the paper: *

      “Additionally, unlike mammalian cells, Chlamydomonas lacks a cortical actin network which simplifies the relationship between cilia and actin and makes this an ideal model to study such interactions.”

      Also, F-actin inducers/stabilizers do not typically have the same effect because the filamentous actin needed for these processes must be dynamic, or able to undergo rapid depolymerization and repolymerization as needed during this fairly quick timeframe. This is demonstrated in Avasthi, 2014 (*doi.org/10.1016/j.cub.2014.07.038). Cells were treated with several actin targeting inhibitors including LatB which results in depolymerization of filaments and Jasplakinolide which results in stabilization of filaments. In both cases, ciliary regeneration is impaired suggesting that actin must be dynamic for its functions related to cilia. *

      Minor points: 1. In many Figures, the x-axis is labeled "Number of values", but I think that maybe number of observations might be more appropriate.

      We discussed this point and decided to change the axis titles to “Number of cilia”.

      The author often use the word "normally" elongating, but in all cases the elongation is induced = abnormal situation. Maybe the authors could use a different term.

      We originally used “normally” because there are times when we get defective elongation but not no elongation. In the latest version we changed this to “elongation consistent with untreated wild-type cells” or something along those lines.

      It is puzzling as to why DRP3 was chosen, while DRP2 actually is most similar in terms of domain composition. Maybe they could discuss that. They also could explain a bit better how the mutants were generated in which a "cassette was inserted early in the gene". What kind of disruption is expected?

      DRP3 was chosen because it has the highest sequence identity (and similarity). DRP2 while containing all domains, has low overall sequence conservation. DRP3 is also the only DRP that showed a potential GSK3 target site when investigated with ScanSite4.0. This was all made clearer in the text as follows:

      “Chlamydomonas contains 9 DRPs with similarity to a canonical dynamin (DRP1-9). Despite lacking 2 of the canonical dynamin domains, the DRP with the highest sequence similarity and identity to canonical dynamin is DRP3 (Supplemental Figure 2C-D). To determine if GSK3 could be a potential kinase for this protein, we employed ScanSite4.0, which confirmed that of the 9 DRPs of Chlamydomonas, the only one with a traditional GSK3 target sequence was DRP3.”

      The representative images in Figure 4A do not really seem to match the quantifications.

      *The quantitative data suggest that these different treatments have increased dots, which we believe the representative images do show. LiCl and CHIR99021 have the most dots, while 6-BIO and Tideglusib have more dots, but less than LiCl and CHIR99021. *

      line 109: "of-targets" should be off-targets

      Fixed in the latest version, thanks for pointing this out.

      line 141: "delivery form the Golgi" should be FROM the Golgi

      Fixed in the latest version, thanks for pointing this out.

      line 160: "was DRPs" should be was DRP3

      Fixed in the latest version, thanks for pointing this out.

      line 204/205: the sentence starting "Thus, we phalloidin..." should be rephrased. It sounds not quite correct

      Fixed in the latest version, thanks for pointing this out.

      line 209: Figure 4A should refer to Figure 4B

      Fixed in the latest version, thanks for pointing this out.

      line 211: "times or rapid ciliary" should be of rapid ciliary...

      Fixed in the latest version, thanks for pointing this out.

      line 257: "in lithium." Should be in lithium treated cells Fixed in the latest version, thanks for pointing this out.

      Reviewer #2 (Significance (Required)):

      This study sheds new light on an important process in ciliary functional regulation and also furthers our understanding on why GSK3-beta inhibition induces elongated cilia in many cell systems, but I am not convinced that the conclusions are actually supported by the data, as the two key points in question were not experimentally addressed at this point.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Chlamydomonas maintains relatively regular length of cilia (flagella). However, when the cell is exposed to high concentration of lithium ions, it elongates cilia further. In this work, Bigge and Avasthi made experiments to build a potential hypothesis of molecular mechanism of this unusual cilia elongation. Their hypothesis is (1) cilia elongation is triggered, depending on supply of extra membrane (not proteins), (2) membrane is supplied from plasma membrane by clathrin-dependent endocytosis (not from Golgi), (3) this endocytosis contains Arp2/3 complex, (4) GSK3 downregulates Arp2/3 dependent endocytosis and (5) GSK3 is suppressed by lithium. They conducted well-organized experiments to prove each step. While some of them are indirect, their hypotheses were supported experimentally in outline.

      (1) is undoubted, since the authors demonstrated that inhibition of protein production by cycloheximide did not influence cilia elongation.

      (2) The authors clearly demonstrated that source of ciliary membrane for elongation is plasma membrane and not Golgi by examining specific inhibitors' effect. They also showed protein transfer from plasma membrane to cilia, by biotinylaing surface proteins in the cell, deciliating and growing cilia and detecting biotinylated proteins in cilia. This part rather characterizes initial growth of cilia, not elongation. Therefore this result must be properly described in the context of this work (which is elongation of cilia).

      This comment was particularly helpful as it also helps us address some of the comments from the other reviewers. We updated the description of this experiment in the context of this work in the latest version as follows:

      “Further, they rely on proteins typically thought to be involved in endocytosis including the Arp2/3 complex and clathrin, and they form at times when it makes sense for endocytosis to be occurring, like immediately following deciliation when membrane and protein must be recruited to cilia in a timeframe too short for new protein and membrane synthesis, sorting, and trafficking (Bigge et al. 2020). To provide additional evidence that these are endocytic puncta, we also showed that a corresponding increase in membrane internalization occurs during this same timeframe using a fluorescent membrane dye that is endocytosed in wild-type cells (Bigge et al. 2020).”

      For (3)-(4), they visualized Arp2/3 localization, showing highly condensed Arp2/3. They interpreted these particles as sign of clathrin endocytosis. Since so far such an endocytosis particle has not been reported in Chlamydomonas, the authors confirmed that DRPs are target of GSK3 to indirectly show GSK3 influences formation of endocytosis. This reviewer thinks the author should be able to directly confirm endocytosis for example by electron microscopy (of traditional epon-embedded and stained cells).

      We visualized Arp2/3 complex-dependent filamentous actin localization. We provide DRP3 as a potential target of GSK3, but do not report that it is the target that results in increased endocytosis or increased ciliary length. We agree that electron microscopy would be ideal to visualize endocytosis in these cells. However, we feel this is outside the scope of this current work. But, we do have plans to look at endocytosis in Chlamydomonas *using electron microscopy in the future and hope that the increased context from the previous data are sufficient at this time. *

      (5) was elegantly proved by multiple drugs (all known as inhibitor of GSK3), including lithium.

      After fixing these points, this manuscript will be ready for publication.

      Minor points: Line188-191: not clear. What are *** and ****?

      Fixed in the latest version, thanks for pointing this out.

      Line262-264: It would be helpful how the initial cilia growth of the arpc4 cell.

      We agree that this would be helpful information, and included more of a description of how ciliary growth is affected by loss of Arp2/3 complex function in the latest version: “Specifically, we found that the Arp2/3 complex is required for reclamation of membrane from a pool in the plasma membrane during the rapid growth that occurs during early ciliary assembly”.

      Line321: it should read as follows. Cang 2014; carlsson and Bayly 2014). While we...

      Fixed in the latest version, thanks for pointing this out.

      Line329: were -> where

      Fixed in the latest version, thanks for pointing this out.

      Line365-366: Lithium-treated cells are not motile. Any thought why? Maybe protein production is not necessary for apparent cilia elongation, but necessary for elongation of functional cilia.

      *This is an interesting idea. However, even when protein production is allowed to proceed, Lithium-treated cells are not motile. This is a ciliary dysfunction, and in fact, after about 90 minutes incubation with lithium, the cilia of these cells start to crash out or fall off, demonstrating that these are not healthy cells or healthy cilia. *

      Reviewer #3 (Significance (Required)):

      This work is an important step toward the understanding of cilia elongation and thus growth mechanism. It will attract wide audience who have interest in cell biology and motility. My expertise is about motile cilia and their 3D structure.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Chlamydomonas maintains relatively regular length of cilia (flagella). However, when the cell is exposed to high concentration of lithium ions, it elongates cilia further. In this work, Bigge and Avasthi made experiments to build a potential hypothesis of molecular mechanism of this unusual cilia elongation. Their hypothesis is (1) cilia elongation is triggered, depending on supply of extra membrane (not proteins), (2) membrane is supplied from plasma membrane by clathrin-dependent endocytosis (not from Golgi), (3) this endocytosis contains Arp2/3 complex, (4) GSK3 downregulates Arp2/3 dependent endocytosis and (5) GSK3 is suppressed by lithium. They conducted well-organized experiments to prove each step. While some of them are indirect, their hypotheses were supported experimentally in outline.

      (1) is undoubted, since the authors demonstrated that inhibition of protein production by cycloheximide did not influence cilia elongation.

      (2) The authors clearly demonstrated that source of ciliary membrane for elongation is plasma membrane and not Golgi by examining specific inhibitors' effect. They also showed protein transfer from plasma membrane to cilia, by biotinylaing surface proteins in the cell, deciliating and growing cilia and detecting biotinylated proteins in cilia. This part rather characterizes initial growth of cilia, not elongation. Therefore this result must be properly described in the context of this work (which is elongation of cilia).

      For (3)-(4), they visualized Arp2/3 localization, showing highly condensed Arp2/3. They interpreted these particles as sign of clathrin endocytosis. Since so far such an endocytosis particle has not been reported in Chlamydomonas, the authors confirmed that DRPs are target of GSK3 to indirectly show GSK3 influences formation of endocytosis. This reviewer thinks the author should be able to directly confirm endocytosis for example by electron microscopy (of traditional epon-embedded and stained cells). (5) was elegantly proved by multiple drugs (all known as inhibitor of GSK3), including lithium. After fixing these points, this manuscript will be ready for publication.

      Minor points:

      Line188-191: not clear. What are and *?

      Line262-264: It would be helpful how the initial cilia growth of the arpc4 cell.

      Line321: it should read as follows.

      Cang 2014; carlsson and Bayly 2014). While we...

      Line329: were -> where

      Line365-366: Lithium-treated cells are not motile. Any thought why? Maybe protein production is not necessary for apparent cilia elongation, but necessary for elongation of functional cilia.

      Significance

      This work is an important step toward the understanding of cilia elongation and thus growth mechanism. It will attract wide audience who have interest in cell biology and motility. My expertise is about motile cilia and their 3D structure.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors show in this study that Lithium and other GSK3-beta inhibitors induce cilia elongation in Chlamydomonas. They further demonstrate that inhibition of endocytosis by Dynasore prevents the induced elongation of cilia. They speculate that a Dynamin-related protein might be involved in this process, and determine 9 Dynamin related proteins (DRPs) in Chlamydomonas of which DRP3 shows the highest sequence similarity. Lithium-induced ciliary elongation is prevented in DRP3 mutants supporting the author's hypothesis and indicating that DRP3 might be a GSK3-beta target, similar to some animal Dynamins. Since Dynamins interact with the F-actin regulator ARP3/3-complex, and because F-actin reorganization is observed in cells after GSK3-beta inhibition, they test the induction of ciliary elongation in arpc4 mutants and after blocking the ARP-complex by CK-666. Indeed, F-actin remodeling and cilia elongation were prevented after loss of ARP-complex function. The induction of ciliary elongation and F-actin remodeling also correlates with the emergence of strong F-actin punctae in cells, and the authors interpret that as induction of Dynamin-dependent endocytosis (also addressed in a current preprint from the group). From that, the conclude that endocytosis is required for delivering membrane to the growing cilium and that this is required for the observed effects. While this claim is somewhat supported by a lack of cilia elongation inhibition after treatment to prevent protein synthesis or Golgi function, direct evidence for membrane delivery to the cilium, the need for membrane delivery for ciliary elongation, and presence of bona fide endocytotic vesicles is sadly missing. Therefore, this study sheds new light on an important process in ciliary functional regulation and also furthers our understanding on why GSK3-beta inhibition induces elongated cilia in many cell systems, but I am not convinced that the conclusions are actually supported by the data, as the two key points in question were not experimentally addressed at this point.

      Main points:

      1. The authors need to demonstrate that new membrane is delivered in the process to the growing cilium. E.g. this could be done by membrane stains (pulse) and static or live-cell imaging analysis in untreated, GSK3-beta inhibitor treated and in mutants.
      2. Along the same line, the authors need to demonstrate that the punctae are truly endocytotic vesicles. For that uptake assays/stains could be used and additional markers. Furthermore, there are multiple modes of endocytosis (e.g. Clathrin) besides Dynamin. The authors should determine if blocking other modes of endocytosis has similar or divergent effects on cilia elongation.
      3. No cilia are actually shown in the study. I personally, would like to see how these cilia look like, especially in relation to the sites of F-actin remodeling and punctae formation. What comes first? Please also provide a axoneme staining to confirm elongation of the ciliary core and what happens to the tubulin pool when cilia cannot elongate any more? Is it accumulating at the ciliary base?
      4. The authors also claim that the method of GSK3 inhibition is not important. It would be more correct to say that the mode/drug of GSK3 inhibition is not important, but discuss how some of the minor variance between treatments could be explained (incl. the timeline and temporal dynamics of the diverging effects; and the dose-dependency as low concentrations of BIO seem to induce shortening but high doses induce elongation of cilia).
      5. They propose here a positive effect of F-actin build up in cilia length regulation, while most studies to date report ciliary shortening to correlate with increased F-actin at the ciliary base. I believe that this is not highlighted and discussed enough, which I find reduces the overall quality of the paper (but is easy to improve). It might be also interesting to test if other F-actin inducers/stabiliziers have the same effect?

      Minor points:

      1. In many Figures, the x-axis is labeled "Number of values", but I think that maybe number of observations might be more appropriate.
      2. The author often use the word "normally" elongating, but in all cases the elongation is induced = abnormal situation. Maybe the authors could use a different term.
      3. It is puzzling as to why DRP3 was chosen, while DRP2 actually is most similar in terms of domain composition. Maybe they could discuss that. They also could explain a bit better how the mutats were generated in which a "cassette was inserted early in the gene". What kind of disruption is expected?
      4. The representative images in Figure 4A do not really seem to match the quantifications.
      5. line 109: "of-targets" should be off-targets
      6. line 141: "delivery form the Golgi" should be FROM the Golgi
      7. line 160: "was DRPs" should be was DRP3
      8. line 204/205: the sentence starting "Thus, we phalloidin..." should be rephrased. It sounds not quite correct
      9. line 209: Figure 4A should refer to Figure 4B
      10. line 211: "times or rapid ciliary" should be of rapid ciliary...
      11. line 257: "in lithium." Should be in lithium treated cells

      Significance

      This study sheds new light on an important process in ciliary functional regulation and also furthers our understanding on why GSK3-beta inhibition induces elongated cilia in many cell systems, but I am not convinced that the conclusions are actually supported by the data, as the two key points in question were not experimentally addressed at this point.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The current manuscript by Bigge et al. demonstrated that the chemical inhibition of GSk3 causes ciliary elongation in Chlamydomonas reinhardtii. They show that lithium induced ciliary lengthening is majorly due to GSK3 inhibition. Consistent with earlier reports, they show that new protein synthesis is not required for lithium induced ciliary elongation. The authors report that targeting endocytosis either by using chemical inhibitors (dynasore and CK-666) or genetic mutants (dpr3 and Arpc4) does not cause lithium induced ciliary elongation. They further reveal enhanced actin dynamics in lithium treated cells and such activity is lost in Arpc4 mutants. Based on these results, the authors concluded that endocytic pathways may be involved in lithium induced ciliary lengthening. The results are interesting, and this work is important in understanding more about ciliary length regulation. However, more experimental evidence addressing the current interpretation that endocytic pathways may be involved in lithium induced ciliary lengthening is required.

      Major comments:

      1. The authors use chemical inhibitors as major tools for their study. However, the specificity of these inhibitors is a concern. How specific are these GSK3 inhibitors such as LiCl? Can authors show that LiCl mediated ciliary lengthening is due to inhibition of GSK3? Authors used BFA and Dynasore to show that not the Golgi, but the endocytosis derived membrane is required for ciliary lengthening. Again, here the specificity of these inhibitors is a concern. Especially as Dynasore has been shown to have non-specific effects.
      2. Does inducing/enhancing endocytosis independent of GSK3 by other means has any effect on ciliary length regulation?
      3. The major claim of this paper is that LiCl mediated ciliary lengthening is due to enhanced endocytosis. Although authors showed that inhibition of endocytosis results in reduced ciliary length, it is important to show if GSK3 inhibition by LiCl (or any other inhibitor) causes any increased cellular endocytosis? Similarly, what is the effect of GSK3 mutants on endocytosis?
      4. Are these endocytic processes enhanced specifically at/or around the cilium during the ciliary lengthening process?
      5. Authors claim that drp3 is a target of GSK3 and, similar to the canonical dynamin, functions in endocytosis. While, it is an important observation, experiments are required to show the role of drp3 in endocytosis and also to show that it is indeed a target of GSK3.
      6. Mechanistic insights into how endocytosis/actin dynamics regulate ciliary lengthening would be interesting to see. Further, it is interesting to see if the ciliary signaling defects caused by abnormal ciliary length can be rescued by inhibition of endocytosis.

      Minor comments:

      1. The paper needs a thorough proof reading as it harbors many spelling mistakes, grammatical errors, and poor sentence formation in multiple instances.
      2. Supplemental Figure S2A and S2B should be quoted separately from S2C and S2D.
      3. In Page 6 paragraph 2 - "authors wrote "To determine if GSK3 could be a potential kinase for this protein, we employed ScanSite4.0, which confirmed that of the 9 DRPs of Chlamydomonas, the only one with a traditional GSK3 target sequence was DRPs (Supplemental Figure 2)." No data is shown in S2 with regard to this. Either data needs to be shown or change the text in a way to avoid confusion.
      4. It would be nice to see if GSK3 can actually phosphorylate DRP3.
      5. The authors observe that arpc4 mutants do not form actin puncta upon LiCl treatment. Could this phenotype be rescued by complementing with WT ARPC4.
      6. The concentration of inhibitors is described differently in the text and figure legends (for example Fig. 4A)
      7. The p values are not significant in some of the figures. (Fig. 4D &Fig. 5C)

      Significance

      The current manuscript by Bigge et al. demonstrates that endocytosis is required for GSK3 inhibition mediated ciliary lengthening. Maintenance of proper length of cilia is crucial and its dysregulation results in pathogenesis. This work takes the field forward and helps in our understanding of how ciliary length is regulated. This work is of interest to researchers working in the field of ciliary biology as well as to those working on endocytosis.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Point-by-point response

      We thank the reviewers for their constructive comments. We have addressed all of them to the best of our knowledge. Our responses are shown below in bold and all changes in the text are highlighted in yellow.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This study of Rizk, Bekiaris and colleagues is well written, carefully edited, and nicely placed into the trending context of the juvenile immune system development.

      They suggest that cIAP ubiquitin ligases cIAP1 and cIAP2 sustain type 3 γδ T after 4-5 weeks of age in mice. As a mechanism they show that these ubiquitin ligases are required in a cell-intrinsic manner to maintain cMAF and RORγt levels, and that this depends likely on overt activation of the non-canonical NF-κB pathway.

      Extrinsic factors such as microbiota did not seem to play a major role in this context.

      **Major comments:**

      It is absolutely crucial to directly and stringently control the efficiency of cIAP depletion via RORgt-cre, which may take some time and thus perhaps only reaches relevant (exponential) penetrance at early adulthood?

      Fig 4C is nice, however the Birc2 loxP sites may be far less efficient than those in the ROSA26-LSL-RFP system.

      - We thank the reviewer for this comment. In this regard, we sorted day 1 old γδT17 cells from the thymus of Cre+ and Cre- mice and screened for Birc2 mRNA (cIAP1) expression. We additionally compared expression to CD27+ γδ T cells, from the same thymi as RORγt-neg controls. Please see new Fig S5A and text line 208-209.

      However, the pre-puberty timing aspect is surprising, but without this aspect the conclusions would be similarly exciting.

      - The fact that Birc2 is indeed deleted in newborn thymocytes, supports our conclusions that its impact is seen progressively while mice are aging

      **Minor comments:**

      • To understand the general impact of cIAP on gdT17 homeostasis, the authors should consider investigating them in additional organs, as these gdT17 are quite tissue-resident and differentially adapt to their environment, where they use specific anti-apoptotic strategies to persist, including expression of Bcl2a1 family proteins.

      - We have investigated lung from adult ΔIAP1/2 and found significantly reduced γδT17 cells, in accordance with our data in the LN, gut and skin. Please see new Fig S1E and text lines 134-136.

      • Fig. 3: Has the presence of gdT17 in the graft been analyzed or enumerated? Experiments shown in Fig 3 AB and FG might collectively suggest that co-transferred gdT17 from the 45.1 BM graft could have reconstituted the regenerated gdT17 compartment in competition with the radioresistant 45.1/2 host gdT17 cells. This would actually not compromise the results, as the cIAP deficient cells did not persist.

      - We are not entirely sure what the reviewer means with this comment. We believe that the data in Fig 3 clearly shows that ΔIAP1/2 cells cannot compete with WT cells. This is also reinforced in Fig S3 where the host is ΔIAP1/2.

      Reviewer #1 (Significance (Required)):

      **Significance**

      This work is very original and might be of pharmacological interest for approaches targeting cIAP, e.g. in order to enhance anti-viral therapies.

      **Referee Cross-commenting**

      No further comments.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors studied the effect of the inactivation of cIAP1 and 2 on the development and evolution of γδ T cells and in type 3 innate lymphoid cells (ILC3) using RORc-Cre induced inatiction of cIAP1 in combination or not with cIAP2 whole body KO. The authors showed that these two E3 ubiquitin ligases that regulate the NF-κB pathway, are important to maintain a population of IL-17 producers γδ T and ILC3 in adult animals. This lack of maintenance is correlated with a loss of c-MAF and RORγt expression in the two cell types and may be related to a deficiency in entering cell cycle in response to various cytokines. The authors also established that the mechanism is independent of the TNFR1 pathway. The article is well written, clear and most of the conclusions are well supported by the data showed. The results presented are novel and interesting for the field. However, I would suggest some major changes to make the story suitable for publication.

      1- The study of 2 different cell types brings some confusion to the story, even if I understand it makes some sense to pool these two parts in the same article. The γδ T cell part is more complete than the ILC3 part, which brings some frustration for the reader, as nothing indicates that the mechanisms leading to the loss of maintenance are similar in the 2 cell types. I would suggest to simply remove the ILC3 part and keep it for another article. If the authors wish to keep it in this article, they must perform a similar set of experiments already done for the γδ T cell part, especially the lineage tracing performed in figure 5 as c-Maf is known to be important in ILC plasticity for ILC3 and ILC1. They would also need to confirm the mechanisms involved in the process leading to ILC3 decrease.

      - We thank the reviewer for this comment. We do realize that the ILC3 part of the story may seem incomplete. For this reason, we have taken into consideration the reviewer’s advice and performed lineage tracing in ILC3 cells. In adult ΔIAP1/2 mice that were reporting RFP in RORγt+ cells, we found that within the ILC population, there was 10-fold reduction in RFP+ cells, suggesting that it is unlikely they convert to a non-ILC3 population. Please see new Fig S9C and text lines 297-302. In accordance KLRG1+ ILC2 numbers were not affected (Fig S9C).

      Next, we isolated sLP lymphocytes from 4-week old mice and treated them with cytokines that are known to induce ILC3 proliferation including IL-7, IL-1β and IL-23. We also chose these cytokines to concur with our γδT17 findings. However, we could not induce cell cycle in either WT or ΔIAP1/2 cells. We contacted experts in the field, namely Dr David Withers at the University of Birmingham, who contacted further experts (Dr Matt Hepworth), in order to ask for advice of how to induce gut ILC3 proliferation. We quote David Withers “we have never had any joy making ILC3 proliferate much in vitro”, and Matt Hepworth “have been looking at this and have struggled to make them proliferate in vivo or in vitro”. So, unfortunately, we cannot test ILC3 proliferation in the same way we did for γδT17 cells.

      2- Although the authors nicely excluded the TNFR1 pathway from the mechanisms leading to γδ T cell loss in adult, the overt activation of the cRel pathway is not enough established as far as I am concerned. It would at least require a more thorough quantification of the immunofluorescent staining done. Showing only one cell is not enough. If possible, using another approach to confirm these data would also be needed.

      - We have now quantified RelB nuclear translocation over 4 experiments and found a significant increase in ΔIAP1/2 cells. Please see new panel in Fig 5F and text lines 244-250. Furthermore, there was a significant increase in Relb mRNA in ΔIAP1/2 newborn thymic γδT17 cells, which is consistent with activation of the non-canonical NF-kB pathway. Please see new Fig S6C and text lines 244-246.

      3- The expression level/quantity of protein of cIAP1/2 in γδ T cells from WT animal at the various stages of development has not been analyzed. Does it remain constant? Does it vary throughout development of γδ T cells? This information is important to further enforce and understand the role of these protein in the development of γδ T cells.

      - Unfortunately, we cannot quantify cIAP1/2 protein levels in these cells for technical reasons. There is no Ab for flow and only a cIAP1 Ab for western blots, which is of course impossible when dealing with such low cell numbers.____ However, we have contacted Dr Dominic Grün who had done a single-cell RNA-seq profiling of γδ T cells throughout different developmental stages, and asked to analyze expression levels of Birc2 and Birc3. We found that both Birc2 and Birc3 were expressed across all subsets of fetal and adult thymic γδ T cells with no specific enrichment and no apparent up- or down-regulation between the two time points. Please see attached Figure 1.

      Attached Figure 1: expression patterns of Birc2 and Birc3 at a single cell level in the different populations of fetal and adult thymic γδ T cells.

      4- In Figure 4D-E, the authors showed that in vitro, γδ T cells fail to progress through cell cycle in response to IL-7 or IL1b+ IL-23. Is a similar block detectable directly ex-vivo? Furthermore, it appears that Imiquimod treatment restore at least partially the deficiency in γδ T cells in the double KO mice. It would mean other cytokines or TCR triggering is rescuing this phenotype. Could the author test in vitro other stimuli and test whether γδ T cells are reactive to some stimuli but not others? It would bring some lights on the signaling regulated by cIAP1/2.

      - There is little if any detectable active cell cycling of these cells directly ex vivo, as shown by near absence of Ki67+7AAD+ cells (see below). We can still pick up small differences in Ki67+ cells but this is not sufficient to conclude whether there is more or less cell division. Please see attached figure 2.

      Attached Figure 2: ex-vivo cell cycle analysis of γδT17 cells form 4 week old- ΔIAP2 or ΔIAP1/2 mice.

      - We have now tested how 4-week old γδT17 cells from ____Δ____IAP1/2 mice respond to IL-2 and TCR stimulation. We found that similar to IL-7, IL-1b and IL-23, cells lacking IAPs proliferate less under both conditions. See new Fig S5C and lines 227-228.

      **Minor point:**

      although the authors cite a reference in the result section, could they show a dot plot confirming that the CD44hi CCR6+ or CCR6+ are the only population producing IL-17 among, γδ T cells?

      - We now show this in Fig S1D and lines 129-130

      Reviewer #2 (Significance (Required)):

      This article describes a new role for the cIAP1 and 2 in the maintenance of γδ T cells and ILC3s. In line with their previous work (Rizk, 2019), they show that this effect is correlated with a loss in c-MAF expression, which is a major transcription factor for these 2 cell types. These discoveries are of interest for specialists in the field, including myself. I am an expert in T cells and ILCs, with an interest in c-MAF function in these cell types.

      **Referee Cross-commenting**

      No further comments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors studied the effect of the inactivation of cIAP1 and 2 on the development and evolution of T cells and in type 3 innate lymphoid cells (ILC3) using RORc-Cre induced inatiction of cIAP1 in combination or not with cIAP2 whole body KO. The authors showed that these two E3 ubiquitin ligases that regulate the NF-B pathway, are important to maintain a population of IL-17 producers T and ILC3 in adult animals. This lack of maintenance is correlated with a loss of c-MAF and RORγt expression in the two cell types and may be related to a deficiency in entering cell cycle in response to various cytokines. The authors also established that the mechanism is independent of the TNFR1 pathway. The article is well written, clear and most of the conclusions are well supported by the data showed. The results presented are novel and interesting for the field. However, I would suggest some major changes to make the story suitable for publication.

      1- The study of 2 different cell types brings some confusion to the story, even if I understand it makes some sense to pool these two parts in the same article. The T cell part is more complete than the ILC3 part, which brings some frustration for the reader, as nothing indicates that the mechanisms leading to the loss of maintenance are similar in the 2 cell types. I would suggest to simply remove the ILC3 part and keep it for another article. If the authors wish to keep it in this article, they must perform a similar set of experiments already done for the T cell part, especially the lineage tracing performed in figure 5 as c-Maf is known to be important in ILC plasticity for ILC3 and ILC1. They would also need to confirm the mechanisms involved in the process leading to ILC3 decrease.

      2- Although the authors nicely excluded the TNFR1 pathway from the mechanisms leading to T cell loss in adult, the overt activation of the cRel pathway is not enough established as far as I am concerned. It would at least require a more thorough quantification of the immunofluorescent staining done. Showing only one cell is not enough. If possible, using another approach to confirm these data would also be needed.

      3- The expression level/quantity of protein of cIAP1/2 in T cells from WT animal at the various stages of development has not been analyzed. Does it remain constant? Does it vary throughout development of T cells? This information is important to further enforce and understand the role of these protein in the development of T cells.

      4- In Figure 4D-E, the authors showed that in vitro, T cells fail to progress through cell cycle in response to IL-7 or IL1b+ IL-23. Is a similar block detectable directly ex-vivo? Furthermore, it appears that Imiquimod treatment restore at least partially the deficiency in T cells in the double KO mice. It would mean other cytokines or TCR triggering is rescuing this phenotype. Could the author test in vitro other stimuli and test whether T cells are reactive to some stimuli but not others? It would bring some lights on the signaling regulated by cIAP1/2.

      Minor point:

      although the authors cite a reference in the result section, could they show a dot plot confirming that the CD44hi CCR6+ or CCR6+ are the only population producing IL-17 among , T cells?

      Significance

      This article describes a new role for the cIAP1 and 2 in the maintenance of T cells and ILC3s. In line with their previous work (Rizk, 2019), they show that this effect is correlated with a loss in c-MAF expression, which is a major transcription factor for these 2 cell types. These discoveries are of interest for specialists in the field, including myself. I am an expert in T cells and ILCs, with an interest in c-MAF function in these cell types.

      Referee Cross-commenting

      No further comments.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study of Rizk, Bekiaris and colleagues is well written, carefully edited, and nicely placed into the trending context of the juvenile immune system development.

      They suggest that cIAP ubiquitin ligases cIAP1 and cIAP2 sustain type 3 γδ T after 4-5 weeks of age in mice. As a mechanism they show that these ubiquitin ligases are required in a cell-intrinsic manner to maintain cMAF and RORγt levels, and that this depends likely on overt activation of the non-canonical NF-κB pathway. Extrinsic factors such as microbiota did not seem to play a major role in this context.

      Major comments:

      It is absolutely crucial to directly and stringently control the efficiency of cIAP depletion via RORgt-cre, which may take some time and thus perhaps only reaches relevant (exponential) penetrance at early adulthood? Fig 4C is nice, however the Birc2 loxP sites may be far less efficient than those in the ROSA26-LSL-RFP system.

      However, the pre-puberty timing aspect is surprising, but without this aspect the conclusions would be similarly exciting.

      Minor comments:

      • To understand the general impact of cIAP on gdT17 homeostasis, the authors should consider investigating them in additional organs, as these gdT17 are quite tissue-resident and differentially adapt to their environment, where they use specific anti-apoptotic strategies to persist, including expression of Bcl2a1 family proteins.
      • Fig. 3: Has the presence of gdT17 in the graft been analyzed or enumerated? Experiments shown in Fig 3 AB and FG might collectively suggest that co-transferred gdT17 from the 45.1 BM graft could have reconstituted the regenerated gdT17 compartment in competition with the radioresistant 45.1/2 host gdT17 cells. This would actually not compromise the results, as the cIAP deficient cells did not persist.

      Significance

      Significance

      This work is very original and might be of pharmacological interest for approaches targeting cIAP, e.g. in order to enhance anti-viral therapies.

      Referee Cross-commenting

      No further comments.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Sasaki et al titled "Conditional GWAS of non-CG transposon methylation in Arabidopsis thaliana reveals major polymorphisms in five genes" employed conditional GWAS to identify trans-regulators of mCHG levels in Arabidopsis natural accessions, after controlling for mCHH. Using loss of function mutants for couple of these genes, the authors also tested their effects on mCHG levels.

      Overall, this manuscript makes a nice contribution. I suggest the following improvements to enhance the quality of this manuscript.

      Comments:

      1. MSI1 has been shown to be copurified with TCX5, a component of DREAM Complex. The DREAM complex transcriptional regulates CMT3, MET1, DDM1 in a cell cycle dependent manner (ref: Yong-Qiang Ning, 2020 nature plants). Tcx5/6 double mutants have ectopic gain of TE and genic mCHG. It would be nice to refer this paper and add to the MSI1 part accordingly. Absolutely: thanks for suggesting this!

      Multifaceted regulation of mCHG levels seems to be evident from this and previous studies. Why would such complex pathways evolv to regulate mCHG? Bewick et al 2016 and Wendte et al 2019 showed lack of CMT3 or ectopic expression of CMT3 can influence CG gene body methylation (gbM). One possibility is that these five factors regulate CHG to maintain it at a level that is just enough to target TE. Irrespective of the functional relevance of gbM, differences in the levels of these five factors might result in erroneous gbM. It would be interesting to look for the rates of gbM and number of gbM genes in the natural accession carrying 1 to 4 number of mCHG-decreasing alleles. Also, in the one line from Iberian peninsula carrying polymorphisms in all five genes.

      Yes, the connection between CHG and gbM is very interesting and deserves more attention. We looked for the effect of cumulative mCHG-decreasing alleles on gbM, but there was no association with gbM — but this is really not expected given the stable epigenetic inheritance of gbM. The Iberian peninsula line carrying all decreasing alleles did slightly lower gbM levels, but it is impossible to exclude the effects of population structure. Since we have nothing to add beyond speculation, we prefer not to go into this topic.

      The authors mentioned a significant peak for mCHG|mCHH on RdDM-targeted transposons was located 196 bp downstream of MIR823a and not on mature miRNA. Therefore, this cannot directly impair miR823 base pairing with CMT3 mRNA transcripts and its cleavage. Moreover, natural accessions carrying alternative MIRNA823 allele show reduced CMT3 and mCHG levels, meaning more miR823 levels? Does this 196 downstream region contain any regulatory feature that effects miR823 transcription? Or this region still falls in the primary miRNA hairpin region? A single nucleotide change in pri-miRNA can have a significant impact on its secondary structure that can impede DICER processivity and effectively levels of mature miR823 molecules? It will be beyond the scope of this paper to pin down the exact mechanism. But a simple stem loop RT-PCR for miR823 levels in reference and alternative accessions would be informative (on accessions that grow at the same speed). Perhaps, the authors can at least model SNP induced pri-miRNA secondary structure variations using Vienna RNAFold (http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi) and present MEF values (maximum free energy) for representative accessions.

      Stem-loop qRT-PCR for MIR823a expression would indeed be helpful to confirm allelic effects. However, comparing lines with wildly different genetic backgrounds is fraught with difficulty due to trans-effects. Furthermore, MIR823a is expressed specifically during embryogenesis, and the expression quickly decreases after the early heart stage (Papareddy et al., 2021). Thus, we would need to extract microRNA from embryos at exactly the same developmental stage, from lines that may develop at different speeds.. Most likely, time-series data would be required, and generating such data is a massive undertaking. As noted in the paper, we did measure MIR823a expression by stem-loop qRT-PCR for several lines carrying reference and alternative alleles but the results were inconclusive. A proper study of this is beyond the scope of this paper.

      Testing predicted effects on RNA secondary structure, on the other hand, is eminently feasible. As suggested, we used Vienna RNAFold for the region, including the GWAS peak. Since the SNP is linked to a 35 bp deletion (shown in S4A), it is closer to the MIR823A coding region than 196 bp. However, the results indicate that the SNP (Chr3:4496626) is not within the stem-loop. It remains possible that this SNP tags multiple SNPs in the annotated stem regions. This is now mentioned.

      Figure 1A can be made more reader friendly. Perhaps this can be broken down into correlation plots for individual conditions or tissue types. In addition, it might be good to add individual r-square values for each of them instead of compound r-square.

      We respectfully disagree, since the main point of the figure is the overall correlation and heterogeneity, rather than the correlation within sub-sets. Instead of splitting the plot, we changed color contrasts to make it easier to read.

      Page 3, Paragraph 1 from line 3 to end of paragraph. The authors wrote "Much of this variation is due to differences in the environment (including tissue, which can be viewed as a cellular environment)". A possible explanation is these two tissues have different mitotic indices (fraction of cells diving and non-diving; flowers have more dividing cell, leaves have more non dividing and endoreduplicated cells) that explains non-CG variation. I would suggest authors to change the text to this and refer to Filipe Borges et al 2021 Current biology paper.

      This is certainly a possibility, although higher mCHG levels in flower buds presumably also reflect higher CMT3 expression during embryogenesis (Feng et al. 2020; Gutzat et al. 2020; Papareddy et al. 2021). We now mention both explanations and cite Borges et al. (2021).

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Sasaki et al. carried out a conditional GWAS analysis of TE-CHG methylation in Arabidopsis thaliana natural accessions. They revealed multiple associations with SNPs in known DNA methylation genes. A new finding is the association found proximal to JMJ26, which had no previously described role in the maintenance/establishment of RdDM-targeted transposons. The authors validate the JMJ26 association using a loss-of-function mutant of JMJ26, which essentially recapitulates the GWAS effect, suggesting that JMJ26 is likely causal. An important point of the study is that the associations detected with conditional GWAS have not been seen in previous univariate (i.e. unconditional) GWAS, probably due to to a lack of power. At the sub-genome-wide threshold the authors discovered further, albeit weaker, associations that were also highly enriched for known DNA methylation genes.

      Overall impression:

      The manuscript is clearly written, and the functional validation of the JMJ26 GWAS signal is commendable and certainly goes beyond the typical GWA study. Beyond this validated association however, the GWAS results are mainly confirmatory. They essentially highlight that methylation genes previously identified by way of mutant screens are variable in natural populations, and (probably) causative of non-CG methylation variation in TEs. What I personally found very distracting throughout the manuscript was the strong emphasis on the methodological aspect; that is, the conditional GWAS, which is really not new. Furthermore, the conceptual/philosophical discussion about what is a complex trait or what can be called polygenic was slightly pedantic and distracted from the biological message.

      There are three points here. First, we disagree that the GWAS results are confirmatory. Sure, only one of our associations is connected with a novel gene, but the fact that the four other genes apparently harbor major polymorphism is a new finding that contributes to our understanding of the function of this trait (and, possibly, these genes). Second, while it is possible that we emphasize statistical methodology too much, we do this for clarity, not to claim that what we are doing is novel. Third, we are similarly not interested in defining what is polygenic and what isn’t, but rather put the results in the context of other studies. We have changed the writing in various places to make it clearer (and hopefully less distracting/pedantic).

      A conceptual comment:

      • The conditional GWAS presented here is conceptually very similar to conditional QTL mapping approaches where candidate loci are included, a priori, as covariates in the model, and a scan is performed to search for additional modifiers. It is known that this approach increases power because the scan is performed on the residual trait variation (having accounting for effect of candidate loci). This is also the idea behind MQM mapping, although in the latter the inclusion is not restricted to candidate loci. Instead of including candidate SNPs as covariate the authors include TE-CHH methylation levels as a covariate as it is highly correlated with TE-CHG methylation. By doing this, the authors essentially "control" for any SNP affecting the covariance between CHG and CHH, even if these SNPs (and their genetic architecture) remain unknown. Hence, the conditional scan is mainly on the residual variation in TE-CHG methylation that is unique to this context (i.e. independent of CHH). That additional TE-CHG associated loci pop up in this scan is perhaps not so surprising.

      We agree, and have even written papers on this very subject. We were surprised by this comment as we felt we had included lengthy sections (see also comment above) about methodology, emphasizing that multi-trait analysis is a good idea in principle. One of our purposes here is to provide a beautiful example demonstrating this. We have tried to make these points clearer.

      The finding that this conditional GWAS yields again a handful of loci of that explain a considerable part of the trait (now residual trait) variation leads the authors to suggest that the genetic architecture underlying non-CG methylation of TEs is not "polygenic". I think this is semantics. All the authors have done is relegate any causal SNPs underlying the covariance between TE-CHG and TE-CHH to the right hand side of the equation of their GWAS model, and subsumed it under the predictor "TE-CHH methylation levels". That is, the genetic architecture underlying this covariance is still unknown, difficult to identify and probably highly polygenic.

      Again we agree, and fail to see why the reviewer thinks we do not. Nowhere do we claim that the overall covariance has a simple basis, and we explicitly state that it is the conditional mCHG variation that has an oligogenic basis. We did write that “univariate GWAS of mCHG variation failed to detect any significant associations, leading us to conclude, erroneously, that the trait was simply too polygenic”, which was imprecise, and arguably erroneous. The word “erroneously” has been removed in the revision.

      The authors essentially decompose a complex traits into parts and map genetic architectures for each part. Although each part seems less complex and more oligogenic than polygenic, when putting all the parts back together, I would argue we are getting close to a complex trait with a polygenic architecture. The study by Hüther et al, which the authors also cite, is another example of how a complex trait can be decomposed into parts. In reference to one of the authors' GWAS associations, they say "...this association was also recently found by Hüther et al. (2022) using GWAS for unconditional mCHG levels of individual transposons. The MIR823A polymorphism appears to almost exclusively affect mCHG (Figs. S2, S3), primarily targeting the same transposons as a CMT3 knock-out...". In the case of Hüther et al., the complex TE-CHG methylation trait is simplified by selecting specific TEs, a priori, that are differential methylated in CMT3 knock-out lines. One could go on like this, and continue to peel away this complex trait. But, again, this does not mean that the overall TE-CHG methylation trait is not complex nor polygenic. It spirals down into a discussion of what is actually meant by "complex" or "polygenic", which is an interesting discussion, but - in the case - of this manuscript takes away from the biological message. My point is perhaps best reflected in the following statement from the discussion section: "Despite high heritability, univariate GWAS of mCHG variation failed to detect any significant associations, leading us to conclude, erroneously, that the trait was simply too polygenic (Kawakatsu et al., 2016)." But a few lines below the authors seem to realize what they have actually done "We believe that, by controlling for mCHH, we have effectively simplified the trait, revealing genetic factors affecting mCHG only, perhaps by affecting the maintenance of this type of DNA methylation."

      The phrase “seem to realize” is unwarranted and unnecessary sarcasm. Given that we cite the two century-old papers that first demonstrated that it was possible to decompose complex traits into Mendelian ones, it should be obvious that we understand what we have done. That our writing could have been better is another matter. As noted above, the word “erroneously” has been dropped, and we have also changed the second sentence to make it obvious that this is obvious. We suspect that whether one finds this part of the Discussion “distracting” or not depends on training and background — our objective was to explain our results to readers who (unlike us and the reviewer) are not well-versed in quantitative genetics.

      Specific comments

      1. A large part of the manuscript focuses on SNPs that enriched for a priori genes that fall below the genome-wide significance threshold. While I see the reasons for doing this in this particular manuscript, I do not see how this is useful in general (again this approach is partly "sold" on methodological grounds). The approach can obviously not be extended to study traits where a priori gene sets are unavailable or incomplete. Moreover, the "FDR" approach based on the a priori gene set labels GWAS hits that are not within the a priori set "false discoveries", which may or may not be true. Moreover, there is no "natural" stopping point for going below GWAS thresholds. An alternative, to this would be to perform a targeted GWAS for a priori genes (+ a LD window around them). Since this alleviates the multiple testing burden, I would be curious to see what this yields both in terms of conditional and unconditional analysis. Candidates that show a signal could be included as covariates and a conditional scan for unknown genes could then be performed.

      The FDR analysis using a set of a priori genes should be explained in detail in this ms. It is cumbersome to go to another manuscript to see what was done exactly, especially since this information is also difficult to dig up in the Atwell 2010 study. Although I understand the idea behind this approach, I would be concerned that this type of "FDR" analysis assumes that that all methylation genes are known. A novel candidate that was perhaps never identified in mutants screens before would be classified as a false discovery. Similarly, known candidates that carry no functional polymorphisms in nature, perhaps because they are highly constraint, will never become a discovery.

      Comments 1 and 9 largely overlap, and so we moved 9 here for clarity and respond to both at the same time. We agree that the enrichment analysis should be explained in this article as well, so as to save the reader from finding the supplement to an old paper. A new section has been added to Methods. In this section, we also try to preempt some of the misunderstandings in the reviewer's comments.

      First, our approach is indeed generally applicable. Whether it is useful depends on what you want to do, and yes, the utility will depend on the quality of the independent data, but note that the a priori gene set does not have to be genes: you could use this approach to compare coding vs non-coding regions of the genome, for example.

      Second, we are not trying to “sell” our approach (or anything else for that matter).

      Third, the approach does not label GWAS hits that are not within the a priori set as false discoveries: it says nothing about these hits.

      Fourth, we are not sure what is meant by a ‘“natural” stopping point for going below GWAS thresholds’, but our approach does provide a simple way to explore how FDR (in the a priori set!) depends on the threshold used.

      Fifth, the proposed alternative of “targeted GWAS” (non-genomewide association, as it were) is not equivalent, because our approach was not designed to increase power by alleviating the multiple testing burden, but rather to rigorously demonstrate that there is a signal in the data when faced with uncalibrated p-values. That it can also be used to explore sub-significant associations is a nice side-effect that we exploit here.

      Sixth, we do not assume that all methylation genes are known, nor is our goal to find them all.

      With regards to the CMT2 signals (particularly section "Further evidence for allelic heterogeneity at CMT2") it would have been useful/clearer to break down CHH into CWA and non-CWA.

      While this is a sensible suggestion, the focus of this paper is on mCHG, and refining the mCHH measurement would essentially amount to re-doing all analyses.

      I understand that the authors set out to do this conditional analysis because previously no hits could be found for CHG TE methylation. However, have the authors considered going the other way around and performing a CHH|CHG analysis to find additional QTL affecting CHH methylation, partly indepedently of CHG?

      Yes, this was in the paper, but we only mention it in the Discussion (and Fig S13) as the results were only of methodological interest (as expected, they were very similar).

      The authors write: "While both mCHG and mCG showed high heritability, GWAS yielded little in terms of significant associations. This might be because these "traits" are highly polygenic, or because they are at least partly transgenerationally inherited, and hence do not behave like standard phenotypes." Please clarify what they mean by "not behave like standard phenotypes".

      Done.

      The authors write: "Our starting point is the observation that mCHG and mCHH levels on transposons are strongly correlated in the 1001 Epigenomes data set (Kawakatsu et al., 2016), especially for RdDM- targeted transposons (Fig. 1A; see Methods). Much of this variation ....". What is mean by "this variation"?

      The sentence has been changed to make this clearer.

      A few lines below, they write "...huge". Please rephrase.

      Done.

      The authors write: "sample data set ("Leaf SALK ambient temperature"; n=846). Interestingly, the covariance between mCHH and mCHG showed the same pattern in data generated by knocking out known or potential DNA methylation regulators in the same genetic background (Fig. 1B) (Stroud et al., 2013). This demonstrates strong co-regulation of these types of methylation, in particular for RdDM-targeted transposons." It is noticeable that many double mutants are off the diagonal. To me this indicates that they affect one context more than the other (i.e. they break covariance). Second, it suggests that they are probably interacting non-additively. It would be great if the authors could comment on this observation; perhaps also later in the ms, where they make a case for additivity.

      We are not convinced that the double- or triple-mutant show non-additivity. Adding up effects in Figure 1 works pretty well. As for our GWAS results, it is clear that small effects (like the ones in our GWAS) will always tend to look additive for simple mathematical reasons. This does not mean that no interactions exist, and we emphasize this in the paper. We also have an example of non-linearity when it comes to TE activity. This is now also emphasized.

      The authors write: " it is difficult to say what fraction of these factors is genetic and what is environmental, but, regardless of this, we hypothesized that the substantial covariance could reduce power of GWAS for either mCHH or mCHG (when using a standard univariate model), and that an analysis accounting for this covariance might perform better...". The arguments given thus far are not sufficient to understand why a "substantial covariance" between traits would reduce the power to map individual traits. I think more needs to be done here to motivate this.

      The sentence following the one quoted is “In essence, we sought to simplify a complex trait by breaking it into constituent parts”, which is very much part of the motivation. As the reviewer noted above, it is not surprising that a conditional analysis turns out to be more powerful. The comment may have arisen from the statement “This insight is the basis for this paper”, which is misleading — there is no insight here, just a very obvious hypothesis, which turned to be correct. We have changed the writing to make this clearer.

      The authors write" "However, MSI1 is required to control DNA methylation via repression of MET1, and a loss of FAS2 in CAF-1 induces mCHG hypermethylation (Fig 1B) (Stroud et al., 2013; Jullien et al., 2008)...", where is the "FAS2 in CAT-1" result visible in Fig. 1B?

      fas2 induces mCHG hypermethylation in CMT2-targeted TEs, presumably via a complex that also involves MSI1. It is marked in Fig. 1B. We have rephrased the sentence to make this clearer.

      The results presented in "A jmjC gene is a novel modifier of mCHG in RdDM-targeted transposons" could have been showcased better. Only after reading the methods part did I realize that the authors generated CRISPR mutants. It reads as if the authors just picked up some available loss of function mutants and profiled them. But, clearly, much more work was involved here and the authors could have brought that out more. Perhaps more generally, I think all the new functional analysis the authors perform is largely "under-sold" in this manuscript at the expense of unnecessary methodological/concpetual discussion (see point above).

      We actually generated CRISPR/CAS9 mutants only for MIR823A (Table S5). For JMJ26, a t-DNA insertion line was available, and results based on this and rescue lines provided sufficient results. To clarify this, we corrected the subsection titles.

      In section "The power and complexity of conditional GWAS", the authors write "The performance of GWAS relies on using the right model for the relation between genotype and phenotype. As with other statistical methods, using the wrong model may lead to unpredictable results." This seems like a too obvious of a statement.

      Indeed: it is meant ironically. It is obvious, yet people do it.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Sasaki et al. carried out a conditional GWAS analysis of TE-CHG methylation in Arabidopsis thaliana natural accessions. They revealed multiple associations with SNPs in known DNA methylation genes. A new finding is the association found proximal to JMJ26, which had no previously described role in the maintenance/establishment of RdDM-targeted transposons. The authors validate the JMJ26 association using a loss-of-function mutant of JMJ26, which essentially recapitulates the GWAS effect, suggesting that JMJ26 is likely causal. An important point of the study is that the associations detected with conditional GWAS have not been seen in previous univariate (i.e. unconditional) GWAS, probably due to to a lack of power. At the sub-genome-wide threshold the authors discovered further, albeit weaker, associations that were also highly enriched for known DNA methylation genes.

      Overall impression:

      The manuscript is clearly written, and the functional validation of the JMJ26 GWAS signal is commendable and certainly goes beyond the typical GWA study. Beyond this validated association however, the GWAS results are mainly confirmatory. They essentially highlight that methylation genes previously identified by way of mutant screens are variable in natural populations, and (probably) causative of non-CG methylation variation in TEs. What I personally found very distracting throughout the manuscript was the strong emphasis on the methodological aspect; that is, the conditional GWAS, which is really not new. Furthermore, the conceptual/philosophical discussion about what is a complex trait or what can be called polygenic was slightly pedantic and distracted from the biological message.

      A conceptual comment:

      • The conditional GWAS presented here is conceptually very similar to conditional QTL mapping approaches where candidate loci are included, a priori, as covariates in the model, and a scan is performed to search for additional modifiers. It is known that this approach increases power because the scan is performed on the residual trait variation (having accounting for effect of candidate loci). This is also the idea behind MQM mapping, although in the latter the inclusion is not restricted to candidate loci. Instead of including candidate SNPs as covariate the authors include TE-CHH methylation levels as a covariate as it is highly correlated with TE-CHG methylation. By doing this, the authors essentially "control" for any SNP affecting the covariance between CHG and CHH, even if these SNPs (and their genetic architecture) remain unknown. Hence, the conditional scan is mainly on the residual variation in TE-CHG methylation that is unique to this context (i.e. independent of CHH). That additional TE-CHG associated loci pop up in this scan is perhaps not so surprising.

      The finding that this conditional GWAS yields again a handful of loci of that explain a considerable part of the trait (now residual trait) variation leads the authors to suggest that the genetic architecture underlying non-CG methylation of TEs is not "polygenic". I think this is semantics. All the authors have done is relegate any causal SNPs underlying the covariance between TE-CHG and TE-CHH to the right hand side of the equation of their GWAS model, and subsumed it under the predictor "TE-CHH methylation levels". That is, the genetic architecture underlying this covariance is still unknown, difficult to identify and probably highly polygenic.

      The authors essentially decompose a complex traits into parts and map genetic architectures for each part. Although each part seems less complex and more oligogenic than polygenic, when putting all the parts back together, I would argue we are getting close to a complex trait with a polygenic architecture. The study by Hüther et al, which the authors also cite, is another example of how a complex trait can be decomposed into parts. In reference to one of the authors' GWAS associations, they say "...this association was also recently found by Hüther et al. (2022) using GWAS for unconditional mCHG levels of individual transposons. The MIR823A polymorphism appears to almost exclusively affect mCHG (Figs. S2, S3), primarily targeting the same transposons as a CMT3 knock-out...". In the case of Hüther et al., the complex TE-CHG methylation trait is simplified by selecting specific TEs, a priori, that are differential methylated in CMT3 knock-out lines. One could go on like this, and continue to peel away this complex trait. But, again, this does not mean that the overall TE-CHG methylation trait is not complex nor polygenic. It spirals down into a discussion of what is actually meant by "complex" or "polygenic", which is an interesting discussion, but - in the case - of this manuscript takes away from the biological message. My point is perhaps best reflected in the following statement from the discussion section: "Despite high heritability, univariate GWAS of mCHG variation failed to detect any significant associations, leading us to conclude, erroneously, that the trait was simply too polygenic (Kawakatsu et al., 2016)." But a few lines below the authors seem to realize what they have actually done "We believe that, by controlling for mCHH, we have effectively simplified the trait, revealing genetic factors affecting mCHG only, perhaps by affecting the maintenance of this type of DNA methylation."

      Specific comments

      • A large part of the manuscript focuses on SNPs that enriched for a priori genes that fall below the genome-wide significance threshold. While I see the reasons for doing this in this particular manuscript, I do not see how this is useful in general (again this approach is partly "sold" on methodological grounds). The approach can obviously not be extended to study traits where a priori gene sets are unavailable or incomplete. Moreover, the "FDR" approach based on the a priori gene set labels GWAS hits that are not within the a priori set "false discoveries", which may or may not be true. Moreover, there is no "natural" stopping point for going below GWAS thresholds. An alternative, to this would be to perform a targeted GWAS for a priori genes (+ a LD window around them). Since this alleviates the multiple testing burden, I would be curious to see what this yields both in terms of conditional and unconditional analysis. Candidates that show a signal could be included as covariates and a conditional scan for unknown genes could then be performed.
      • With regards to the CMT2 signals (particularly section "Further evidence for allelic heterogeneity at CMT2") it would have been useful/clearer to break down CHH into CWA and non-CWA.
      • I understand that the authors set out to do this conditional analysis because previously no hits could be found for CHG TE methylation. However, have the authors considered going the other way around and performing a CHH|CHG analysis to find additional QTL affecting CHH methylation, partly indepedently of CHG?
      • The authors write: "While both mCHG and mCG showed high heritability, GWAS yielded little in terms of significant associations. This might be because these "traits" are highly polygenic, or because they are at least partly transgenerationally inherited, and hence do not behave like standard phenotypes." Please clarify what they mean by "not behave like standard phenotypes".
      • The authors write: "Our starting point is the observation that mCHG and mCHH levels on transposons are strongly correlated in the 1001 Epigenomes data set (Kawakatsu et al., 2016), especially for RdDM- targeted transposons (Fig. 1A; see Methods). Much of this variation ....". What is mean by "this variation"?
      • A few lines below, they write "...huge". Please rephrase.
      • The authors write: "sample data set ("Leaf SALK ambient temperature"; n=846). Interestingly, the covariance between mCHH and mCHG showed the same pattern in data generated by knocking out known or potential DNA methylation regulators in the same genetic background (Fig. 1B) (Stroud et al., 2013). This demonstrates strong co-regulation of these types of methylation, in particular for RdDM-targeted transposons." It is noticeable that many double mutants are off the diagonal. To me this indicates that they affect one context more than the other (i.e. they break covariance). Second, it suggests that they are probably interacting non-additively. It would be great if the authors could comment on this observation; perhaps also later in the ms, where they make a case for additivity.
      • The authors write: " it is difficult to say what fraction of these factors is genetic and what is environmental, but, regardless of this, we hypothesized that the substantial covariance could reduce power of GWAS for either mCHH or mCHG (when using a standard univariate model), and that an analysis accounting for this covariance might perform better...". The arguments given thus far are not sufficient to understand why a "substantial covariance" between traits would reduce the power to map individual traits. I think more needs to be done here to motivate this.
      • The FDR analysis using a set of a priori genes should be explained in detail in this ms. It is cumbersome to go to another manuscript to see what was done exactly, especially since this information is also difficult to dig up in the Atwell 2010 study. Although I understand the idea behind this approach, I would be concerned that this type of "FDR" analysis assumes that that all methylation genes are known. A novel candidate that was perhaps never identified in mutants screens before would be classified as a false discovery. Similarly, known candidates that carry no functional polymorphisms in nature, perhaps because they are highly constraint, will never become a discovery.
      • The authors write" "However, MSI1 is required to control DNA methylation via repression of MET1, and a loss of FAS2 in CAF-1 induces mCHG hypermethylation (Fig 1B) (Stroud et al., 2013; Jullien et al., 2008)...", where is the "FAS2 in CAT-1" result visible in Fig. 1B?
      • The results presented in "A jmjC gene is a novel modifier of mCHG in RdDM-targeted transposons" could have been showcased better. Only after reading the methods part did I realize that the authors generated CRISPR mutants. It reads as if the authors just picked up some available loss of function mutants and profiled them. But, clearly, much more work was involved here and the authors could have brought that out more. Perhaps more generally, I think all the new functional analysis the authors perform is largely "under-sold" in this manuscript at the expense of unnecessary methodological/concpetual discussion (see point above).
      • In section "The power and complexity of conditional GWAS", the authors write "The performance of GWAS relies on using the right model for the relation between genotype and phenotype. As with other statistical methods, using the wrong model may lead to unpredictable results." This seems like a too obvious of a statement.

      Significance

      The manuscript is clearly written, and the functional validation of the JMJ26 GWAS signal is commendable and certainly goes beyond the typical GWA study. Beyond this validated association however, the GWAS results are mainly confirmatory. They essentially highlight that methylation genes previously identified by way of mutant screens are variable in natural populations, and (probably) causative of non-CG methylation variation in TEs.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Sasaki et al titled "Conditional GWAS of non-CG transposon methylation in Arabidopsis thaliana reveals major polymorphisms in five genes" employed conditional GWAS to identify trans-regulators of mCHG levels in Arabidopsis natural accessions, after controlling for mCHH. Using loss of function mutants for couple of these genes, the authors also tested their effects on mCHG levels. Overall, this manuscript makes a nice contribution. I suggest the following improvements to enhance the quality of this manuscript.

      Comments:

      1. MSI1 has been shown to be copurified with TCX5, a component of DREAM Complex. The DREAM complex transcriptional regulates CMT3, MET1, DDM1 in a cell cycle dependent manner (ref: Yong-Qiang Ning, 2020 nature plants). Tcx5/6 double mutants have ectopic gain of TE and genic mCHG. It would be nice to refer this paper and add to the MSI1 part accordingly.
      2. Multifaceted regulation of mCHG levels seems to be evident from this and previous studies. Why would such complex pathways evolv to regulate mCHG? Bewick et al 2016 and Wendte et al 2019 showed lack of CMT3 or ectopic expression of CMT3 can influence CG gene body methylation (gbM). One possibility is that these five factors regulate CHG to maintain it at a level that is just enough to target TE. Irrespective of the functional relevance of gbM, differences in the levels of these five factors might result in erroneous gbM. It would be interesting to look for the rates of gbM and number of gbM genes in the natural accession carrying 1 to 4 number of mCHG-decreasing alleles. Also, in the one line from Iberian peninsula carrying polymorphisms in all five genes.
      3. The authors mentioned a significant peak for mCHG|mCHH on RdDM-targeted transposons was located 196 bp downstream of MIR823a and not on mature miRNA. Therefore, this cannot directly impair miR823 base pairing with CMT3 mRNA transcripts and its cleavage. Moreover, natural accessions carrying alternative MIRNA823 allele show reduced CMT3 and mCHG levels, meaning more miR823 levels? Does this 196 downstream region contain any regulatory feature that effects miR823 transcription? Or this region still falls in the primary miRNA hairpin region? A single nucleotide change in pri-miRNA can have a significant impact on its secondary structure that can impede DICER processivity and effectively levels of mature miR823 molecules? It will be beyond the scope of this paper to pin down the exact mechanism. But a simple stem loop RT-PCR for miR823 levels in reference and alternative accessions would be informative (on accessions that grow at the same speed). Perhaps, the authors can at least model SNP induced pri-miRNA secondary structure variations using Vienna RNAFold (http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi) and present MEF values (maximum free energy) for representative accessions.
      4. Figure 1A can be made more reader friendly. Perhaps this can be broken down into correlation plots for individual conditions or tissue types. In addition, it might be good to add individual r-square values for each of them instead of compound r-square.
      5. Page 3, Paragraph 1 from line 3 to end of paragraph. The authors wrote "Much of this variation is due to differences in the environment (including tissue, which can be viewed as a cellular environment)". A possible explanation is these two tissues have different mitotic indices (fraction of cells diving and non-diving; flowers have more dividing cell, leaves have more non dividing and endoreduplicated cells) that explains non-CG variation. I would suggest authors to change the text to this and refer to Filipe Borges et al 2021 Current biology paper.

      Significance

      The manuscript by Sasaki et al titled "Conditional GWAS of non-CG transposon methylation in Arabidopsis thaliana reveals major polymorphisms in five genes" employed conditional GWAS to identify trans-regulators of mCHG levels in Arabidopsis natural accessions, after controlling for mCHH. Using loss of function mutants for couple of these genes, the authors also tested their effects on mCHG levels. Overall, this manuscript makes a nice contribution. I suggest the following improvements to enhance the quality of this manuscript.

      Comments:

      1. MSI1 has been shown to be copurified with TCX5, a component of DREAM Complex. The DREAM complex transcriptional regulates CMT3, MET1, DDM1 in a cell cycle dependent manner (ref: Yong-Qiang Ning, 2020 nature plants). Tcx5/6 double mutants have ectopic gain of TE and genic mCHG. It would be nice to refer this paper and add to the MSI1 part accordingly.
      2. Multifaceted regulation of mCHG levels seems to be evident from this and previous studies. Why would such complex pathways evolv to regulate mCHG? Bewick et al 2016 and Wendte et al 2019 showed lack of CMT3 or ectopic expression of CMT3 can influence CG gene body methylation (gbM). One possibility is that these five factors regulate CHG to maintain it at a level that is just enough to target TE. Irrespective of the functional relevance of gbM, differences in the levels of these five factors might result in erroneous gbM. It would be interesting to look for the rates of gbM and number of gbM genes in the natural accession carrying 1 to 4 number of mCHG-decreasing alleles. Also, in the one line from Iberian peninsula carrying polymorphisms in all five genes.
      3. The authors mentioned a significant peak for mCHG|mCHH on RdDM-targeted transposons was located 196 bp downstream of MIR823a and not on mature miRNA. Therefore, this cannot directly impair miR823 base pairing with CMT3 mRNA transcripts and its cleavage. Moreover, natural accessions carrying alternative MIRNA823 allele show reduced CMT3 and mCHG levels, meaning more miR823 levels? Does this 196 downstream region contain any regulatory feature that effects miR823 transcription? Or this region still falls in the primary miRNA hairpin region? A single nucleotide change in pri-miRNA can have a significant impact on its secondary structure that can impede DICER processivity and effectively levels of mature miR823 molecules? It will be beyond the scope of this paper to pin down the exact mechanism. But a simple stem loop RT-PCR for miR823 levels in reference and alternative accessions would be informative (on accessions that grow at the same speed). Perhaps, the authors can at least model SNP induced pri-miRNA secondary structure variations using Vienna RNAFold (http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi) and present MEF values (maximum free energy) for representative accessions.
      4. Figure 1A can be made more reader friendly. Perhaps this can be broken down into correlation plots for individual conditions or tissue types. In addition, it might be good to add individual r-square values for each of them instead of compound r-square.
      5. Page 3, Paragraph 1 from line 3 to end of paragraph. The authors wrote "Much of this variation is due to differences in the environment (including tissue, which can be viewed as a cellular environment)". A possible explanation is these two tissues have different mitotic indices (fraction of cells diving and non-diving; flowers have more dividing cell, leaves have more non dividing and endoreduplicated cells) that explains non-CG variation. I would suggest authors to change the text to this and refer to Filipe Borges et al 2021 Current biology paper.
  3. May 2022
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01392R

      Corresponding author(s): Ilan Davis

      General Statements

      We thank the reviewers for their constructive and helpful comments on our manuscript. We are delighted to find their consensus that the manuscript represents a useful resource for the Drosophila community in particular, and for the fields of neural development and post-transcriptional gene regulation. The following is our detailed responses and plan for how we will address all the major points raised by the reviewers. We also plan to address all minor points fully and have been through them in great detail one by one, so we are confident this is feasible within a reasonable and expected time frame.

      Description of the planned revisions

      Reviewer #1

      Major 1: For the wildtype CS flies, there is no YFP mRNA signal in neuroblast region and how about YFP mRNA signal in MB, OL VNC and NMJ regions? What is the criterion of setting laser power and gain for the mRNA level of 200 genes? Is it difficult to distinguish background and true signal of the mRNA in different area?

      This is a good point about background intensity levels (from non-specific binding of the YFP smFISH probe) across different tissue regions. We thank the review for raising it. Signal:background decreases with depth in all of the tissues, with superficial cells displaying similarly high signal:background in the CNS and NMJ, while signal:background in neuropil regions of the CNS are slightly lower. To address this point, we plan to include a supplementary figure to show background fluorescence of the smFISH probe across all regions of the CNS and NMJ.

      To address the point about image acquisition settings, we will included the following additional information in the Methods section (Page 17):

      “Consistent image acquisition settings (laser power, pixel dwell time or camera exposure, detector gain) were used for experimental and control experiments. Acquisition settings were optimized to achieve fast acquisition and high signal:background for each instrument.”

      We will add a further explicit explanation to the manuscript referring to previous publications, that the nature of the smFISH method makes it relatively simple to distinguish background from true signal. True punctae have a relatively uniform size, symmetrical shape, and consistent intensity distribution. Whereas background punctae that are either larger than diffraction-limited punctae or have lower intensity can easily be separated from real signal.

      Major 2: Would the insertion of YFP affect gene expression? Comparing to CS in Fig 1K, the dlg1 mRNA signals in dlg1::YFP line (Fig 1F) increases a lot. I do not know if this phenotype happens only in this area. So could you show some other regions for dlg1::YFP flies.

      This is a good point raised by both Reviewer #1 and Reviewer #2 (Major point 1). We agree that a proper quantification of the effect of YFP-insertion will bolster our conclusion, highlighting the utility of protein-trap collections for systematic analysis of post-transcriptional regulations. To address this, we plan to: (i) provide quantifications of dlg1 transcript expression in the CNS and NMJ and compare the levels between dlg1::YFP and wild-type lines, and (ii) provide new figure visuals reflecting our quantification results.

      Major 3: Is the dlg::YFP homozygous available? Among 200 gene trap lines, how many of them can be homozygous?

      This is a good point raised by both Reviewer #1 and Reviewer #3 (Major point 1). The dlg1::YFP (CPTI-000207) line used for the control experiments is homozygous. However, it is a great point that not all of the YFP insertions are homozygous viable. Out of the 200 lines we screened, 131/200 (65.5%) insertions are homozygous viable, whereas 69/200 (34.5%) are homozygous lethal or are unknown. We have addressed this caveat in the Methods section (Page 16) with the following statement:

      The majority of YFP insertion lines are homozygous (65.5%, 131/200), those that are not homozygous viable were kept over balancer chromosomes.”

      Our provisional analysis shows that the number of nervous system compartments expressing YFP-fused protein or mRNA are not affected by homozygous lethality. We plan to include this analysis in the revised manuscript.

      Major 4: Have you tried to investigate the mRNA and protein localization in adult brains?

      Yes, in a related study, we demonstrated that this approach also works in the adult brain (Mitchel et al., 2021, DOI:10.7554/eLife.62770). A systematic analysis of protein and mRNA expression patterns in the adult brain would be highly interesting and is certainly possible, however it is beyond the scope of the manuscript. To address this point, we will cite our related work and emphasise more clearly the wider applicability of our technique.

      Major 5: In Fig 3C, the authors claimed in MB or OL soma regions, some genes are protein expression only but no mRNA present. I wonder how do you explain this phenotype in soma.

      Our favoured explanation is that protein is more stable than mRNA. Therefore, after the mRNA is translated, it could get degraded while the protein is still present in the cell. We will add text in the relevant section to mention potential differential stability of protein/mRNA.

      Major 6: Since sgg mRNA localize to both sides of NMJ, would KCl stimulus affect sgg mRNA amount and localization in muscle?

      That is an interesting question. The data in Fig. 8I-J show that there is no additional Sgg::YFP protein accumulation at the muscle post synaptic density in response to KCl stimulus. It’s been shown elsewhere (Ataman et al., 2008, DOI:10.1016/j.neuron.2008.01.026) that Sgg protein translocates to the muscle nucleus in response to KCl stimulus. Determining whether that mechanism requires translation of new protein would require a complete new study with translational analysis and would distract from the message of the current study.

      Reviewer #2

      Major 1: Although the group is using an established and published set of gene traps, it would be good to confirm protein expression for same gene to increase confidence or provide more details on how is known that the YFP insertions do not affect mRNA stabilization or transcription or protein expression/localization. For example in Figure 1 F' versus K it is unclear why in the DlgYFP insertion there are more Dlg in situ signals than are observed in and around a neuroblast as compared to the wild type control. From the description provided these appear to the maximum intensity images. Is this due to background or an effect of the YFP insertion itself? Because of the increased level of expression is there a feedback loop of the protein regulating the mRNA expression? If had expression of Dlg protein in this figure would also confirm the YFP insertion mirrored the endogenous and it would be easier to discern if there were any changes in the number of Dlg mRNA molecules present. As this was the proof of principle example for the screen this information would increase confidence in the remainder of the data presented. AS an important part of the screen is looking at the potential for post transcriptional regulation this is an important factor to address.

      Thank you for the valuable suggestion. We agree with the reviewer that the comparison of dlg1 transcript levels would provide a valuable control. This point was raised by both Reviewer #1 and Reviewer #2. Please see [Reviewer #1 - Major point 2] for our response.

      Major 2: Will this pipeline capture information on whether is secreted (contain a signal regulatory peptide) or not as then would expect to be discordant. This should be clarified or commented on.

      The reviewer’s comment is correct. Secreted proteins may show discordant distribution of protein and mRNA between cell types even in absence of post-transcriptional regulations. Note that Shaggy (Sgg) is a secreted protein but we observe that most of the protein products are expressed in the same cell as the RNA. We propose to follow the reviewer’s suggestion and revise the text to discuss the limitation of our pipeline in identifying proteins regulated via secretory modes.

      Major 3: General molecular function is listed in supplementary table 1 but will other types of information be able to be correlated from datasets or databases as well.

      This question highlights a major feature of our dataset and associated metadata The analysis in Supplementary Table 1 is used to assess the functional representation of the 200 genes in our screen against the all known genes. We found that ~90% of GOSlim terms are covered by the 200 genes, highlighting the diversity of our list of genes. On the other hand, our Zegami resource (Accompanying data for Zegami) contains a rich collection of metadata (including the full list of GO terms) associated with each gene in the dataset, and extends that information to the entire genome. We anticipate that the Zegami resource will be a valuable platform to query data from our analysis and other databases. To address this, we plan to: (i) revise the legend for the Supplementary Table 1, and (ii) revise the text to clarify what kind of information is available in our Zegami resource.

      Reviewer #3

      Major 1: The approach relies on gene traps that often fail to be made homozygous, presumably due to deleterious function of the YFP insert. This is an obvious limitation of the study, which the authors address, but do so insufficiently by only analyzing a single case Dlg1. The authors should report how many of the 200 YFP-traps can produce viable homozygous animals, whether phenotypes can be observed, and any other relevant information to assess the functional properties of the tagged genes.

      Thank you for requesting further information on homozygous viability of the YFP-trap collection. This point was raised by both Reviewer #1 and Reviewer #3. Please see [Reviewer #1 - Major point 3] for our response.

      Major 2: The term "discordant" is used for non-congruous RNA/Protein levels in soma and distal processes, and sometimes the two are analyzed in the same figure (e.g Fig 3A). When it is stated that 98% of genes are discordant, this is an over-simplification as what the authors describe as "discordant" is expected to occur frequently in the distal process, but less often in the soma (which is what the authors find when presenting the data for individual compartments - Fig 3B-C). This is confusing because the observation means completely different things in the two compartments, though both are interesting to describe. These analyses, and their interpretation, should be kept separate.

      This is a fair point raised by the reviewer. To address this point we plan to: (i) prepare two separate tables summarising our annotation in soma and neurite compartments, and (ii) revise the text accordingly to explain and discuss how the discordant protein and mRNA expression pattern can arise both within different compartments of a cell or between different cell types in a cell lineage

      Major 3: There is not enough emphasis placed on the cell-type specific regulation of RNAs. There are very few studies that have investigated how localization of individual RNAs changes in different cell types or regions of the nervous system, and the authors find that this is quite prevalent. Therefore, the rather superficial analysis of these data fails to take advantage of a major strength of the data. For example, for the discordant genes that differ in neuropil localization between different regions of the CNS, what types of molecules do they encode, what is their function in neurons (if known), and why might they be required locally in one region of the CNS but not the other?

      We appreciate that the Reviewer recognizes the power of comparing RNA localization patterns across different brain regions (Figure 5R). We reported on a common set of synaptic mRNAs that encode nuclear proteins across the different regions of the nervous system. Per the Reviewer’s suggestion, we have begun to look into region-specific patterns of expression. In Figure 5R, two categories with the largest number of genes are ‘protein_MB_syn’ and ‘protein_OL_syn’, which contain proteins that are specific to those regions. However, given the small number of 15-16 genes, gene ontology enrichment analysis has limited power to infer information on the entire genome.

      We plan to revise the manuscript:

      to include tables with lists of genes specific to MB and OL regions. to revise the manuscript to include in the discussion a caveat of the limited power of analysis based on a small number of genes.

      Major 4: The authors conclude that mRNA and protein co-localization in glia processes shows that mRNA localization makes a major contribution of the proteome in processes. However, there is not enough evidence for such conclusion since neither translation of these mRNAs nor lack of protein trafficking from the somas was shown.

      The significant role of RNA localisation in shaping the local proteome and performing proteostatic regulation has been studied in detail (Zappulo et al., 2017, von Kugelgen and Chekulaeva 2022 Giandomenico et al., 2022). However, the reviewer’s comment is correct that we do not show direct evidence of mRNA translation or protein trafficking. Therefore, we propose to: (i) clarify the text by including the citation of these publications, and (ii) qualify our claim that mRNA localization is a major contribution of the proteome in neurite or glial processes.

      Zappulo et al., 2017, DOI: 10.1038/s41467-017-00690-6

      von Kugelgen and Chekulaeva 2022 DOI: 10.1002/wrna.1590

      Giandomenico et al., 2022, DOI: 10.1016/j.tins.2021.08.002

      Major 5: An important caveat of this technique that should be discussed is the lack of knowledge about the translation of these mRNAs, if the mRNA that is being detected is the same as the one that is translated. While the authors emphasize the discordance between mRNA and protein localization, it is not possible to know whether these mRNAs are being translated where they are found, e.g. soma vs neuropil. Moreover, there are many examples (e.g. BDNF) where the isoform influences the subcellular localization of the mRNA. There is no way of studying the isoforms here, and we could be looking for a different mRNA isoform localized to a specific compartment compared to the protein. These points must be discussed.

      We agree with the reviewer that our method does not provide information on whether the detected mRNA is being translated in time and space. Elucidating the relative contribution of localised mRNA in shaping the local proteome is not a trivial task and it is being actively investigated in the field. However, we believe our dataset provides a unique high-resolution map of transcripts that are potentially regulated at post-transcriptional and translational levels. It would be promising to follow up the ‘discordant’ genes identified from our survey using experimental methods that are able to track mRNA-ribosome associations (e.g. TRICK) in future studies. To address this point, we will revise the text to discuss this caveat.

      Thank you for pointing out the matter with mRNA isoforms. Our preliminary analysis indicates that 71% of the screened genes have constitutive YFP-insertions (i.e. YFP-cassette traps all mRNA isoforms). However, we agree that our approach cannot discriminate the case where protein produced from an mRNA isoform is trafficked and co-localises with another mRNA isoform that did not give rise to that protein. We plan to revise the text to discuss this point explicitly.

      Description of the revisions that have already been incorporated in the transferred manuscript

      Several minor comments regarding typos and simple errors have already been incorporated in the transferred manuscript. The changes are highlighted in yellow in the revised submission.

      We plan to address all the useful numerous minor comments that the reviewers have kindly highlighted to us. We feel these are straightforward to do and feasible in a short time, so do not require a detailed listed plan. If the reviewers feel they do afterall need such a list, we will be happy to provide it. However, there is one minor comment that we feel requires a little more explanation:

      Description of analyses that authors prefer not to carry out

      Reviewer #3 - Minor Comment on Figure 8: “...____they should characterize the (khc) mutant NMJs: what is the change in size, synapse number, etc..

      The khc mutants are already known to show synapse morphology phenotypes (Kang et al., 2014), though the khc23/khc27 transheterozygous allele has previously been used to assess localization defects at the larval NMJ (Gardiol and St. Johnston, 2014). Moreover, our manuscript (Figure 8) focuses on post-developmental stimulus-dependent processes, rather than cellular-level synapse developmental parameters with this mutant. The reviewer correctly points out that the khc developmental phenotypes are likely to have other secondary defects as a result of impaired microtubule transport. The purpose of that mutant was to assess the molecular-level question of whether microtubule-based transport is required for sgg mRNA localization at the axon terminal. The consequences and exact mechanism of disrupted transport are beyond the scope of this study. To address this point explicitly, we will:

      Revise the manuscript to quote more explicitly and clearly the developmental khc phenotype. Revise the manuscript to explain the difference between the developmental role of khc and role in the transport of sgg specifically to the axon terminal. Revise the manuscript to explain more explicitly the limitations of this mutant.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, the authors address the important topic of post-transcriptional gene regulation using the larval nervous system in Drosophila. They utilize a novel approach taking advantage of existing protein trap library, which permits use of the same smFISH probe to detect an array of 200 RNAs and visualize their corresponding protein expression. Furthermore, the authors developed a computational pipeline to visualize and analyze the resulting data, which should enhance the application of this method by other researchers. A major strength of the data comes from the analysis of multiple cell types in distinct compartments of the nervous system, cell types (neuron, glia, neuroblast), and subcellular domains. From the cumulative data, the authors are able to describe several interesting observations relating to cell-specific post-transcriptional regulation, regulation within a central-neuroblast lineage and glial post-transcriptional regulation, among others.

      However, in spite of these strengths, there are several concerns related to the organization and interpretation of the manuscript that the authors should address in order to improve the manuscript:

      General concerns:

      1. The approach relies on gene traps that often fail to be made homozygous, presumably due to deleterious function of the YFP insert. This is an obvious limitation of the study, which the authors address, but do so insufficiently by only analyzing a single case Dlg1. The authors should report how many of the 200 YFP-traps can produce viable homozygous animals, whether phenotypes can be observed, and any other relevant information to assess the functional properties of the tagged genes.
      2. The term "discordant" is used for non-congruous RNA/Protein levels in soma and distal processes, and sometimes the two are analyzed in the same figure (e.g Fig 3A). When it is stated that 98% of genes are discordant, this is an over-simplification as what the authors describe as "discordant" is expected to occur frequently in the distal process, but less often in the soma (which is what the authors find when presenting the data for individual compartments - Fig 3B-C). This is confusing because the observation means completely different things in the two compartments, though both are interesting to describe. These analyses, and their interpretation, should be kept separate.
      3. There is not enough emphasis placed on the cell-type specific regulation of RNAs. There are very few studies that have investigated how localization of individual RNAs changes in different cell types or regions of the nervous system, and the authors find that this is quite prevalent. Therefore, the rather superficial analysis of these data fails to take advantage of a major strength of the data. For example, for the discordant genes that differ in neuropil localization between different regions of the CNS, what types of molecules do they encode, what is their function in neurons (if known), and why might they be required locally in one region of the CNS but not the other?
      4. The authors conclude that mRNA and protein co-localization in glia processes shows that mRNA localization makes a major contribution of the proteome in processes. However, there is not enough evidence for such conclusion since neither translation of these mRNAs nor lack of protein trafficking from the somas was shown.
      5. An important caveat of this technique that should be discussed is the lack of knowledge about the translation of these mRNAs, if the mRNA that is being detected is the same as the one that is translated. While the authors emphasize the discordance between mRNA and protein localization, it is not possible to know whether these mRNAs are being translated where they are found, e.g. soma vs neuropil. Moreover, there are many examples (e.g. BDNF) where the isoform influences the subcellular localization of the mRNA. There is no way of studying the isoforms here, and we could be looking for a different mRNA isoform localized to a specific compartment compared to the protein. These points must be discussed.

      Minor suggestions:

      • The authors should identify GO terms to understand what types of molecules are subjected to RNA regulation. They provide a supplementary table for all genes, but it would be useful to have a chart showing the proportion of different GO terms represented in the overall gene set, genes that show cell-specific regulation, genes that show neuron vs glia specific regulation, etc.
      • "However, post-transcriptional regulation can also manifest itself within a cell, so that a protein is localised to a distinct site from the mRNA that encodes it". While subcellular RNA localization may represent a regulatory layer, I do not agree that proteins that function in the cell at a different location than their translation site represents regulation per se. Many such cases exist for proteins that are trafficked!
      • "The majority of individual puncta appearing in the dlg1::YFP line (51% in the brain, 64% in larval muscles". Why is the agreement between YFP and endogenous FISH so low? Do many individual RNAs fail to hybridize? This should be discussed.
      • "However, one gene, indy, is highly transcribed in neuroblasts and a single ganglion mother cell before it is rapidly shut off (Figure S1A)". This figure does not exist. Where are the data?
      • The authors should be consistent about calling perineurial or perineural glia (both correct) in their images and text.
      • "We only observe a minority of localised axonal mRNAs that lack the protein they encode at the axon extremities, in contrast to our findings in the mushroom body, optic lobe, and ventral nerve cord neuropils" These results are not contrasted, as in all neuropils the minority of localized mRNAs are those lacking their corresponding proteins. For example, 9% in NMJ vs 7.5% in OL neuropil according to Fig. 1B. What is conflicting with the conclusion?
      • "These results suggest that motor axons are more selective than the other neuronal extensions in the mRNAs that are transported over their very long distances from the soma to the neuromuscular synapse" The current literature says that the same mechanism (cis-elements) is used to transport mRNAs to subcellular compartments, which would be inconsistent with the idea of motor axons being "more selective" than other neurons for the same mRNA, but just a result of fewer mRNAs being found in motor neurons: 34.% of the mRNAs are found in motor neurons soma vs 83% in OL soma, 86.5% in VNC soma, and 70.5% in MB soma. To get to this conclusion, the authors should show that mRNAs previously found in the neuronal extensions of other neurons are not found in the axons of motor neurons but are still expressed in thesir somas. They might want to suggest different RBPs involved in the transport or discussing the very long distance they need to travel which can influence their detection in the tips. Figures
      • Figure 1. Experimental approach summary
        • Some colors do not show well and should be changed, e.g: grey in Fig. 1A, and Fig. 1B probe sites indicated in light blue and pink within the introns of dlg1.
        • Fig. 1E': There appears to be a large discrepancy in co-detection % for CNS and muscle in the graph judging by the size of circles, yet in the text, it is stated that there is average of 51% and 64% in the two, respectively. I don't see any green circles with over 25% agreement in the graph. Are the colors correct here?
        • Fig. 1D-I: It's difficult to identify where the zoomed panels come from. E has its own square (indicating zoom in E'). Please make this square dashed or a different color in E so it is clear F and G do not come from there.
        • Comparing Fig. 1F vs K: Why does there appear to be so much more dlg1 mRNA in the YFP-tag condition? If this is due to selection of imaging area, please choose a more similar region to image so the RNA levels are comparable. Otherwise it indicates the YFP-tag line has more RNA expression, which is likely not the case.
      • Figure 2. Analysis pipeline overview
        • The lines for the first two zoomed panels are switched: The optic lobe is going to VNC and vice-versa.
      • Figure 3. Overall summary of results
        • Figure 3A: Soma/Neuropil/muscle should be separate or at least ordered such that they are next to each other to facilitate direct comparison of genes in the same region of the cell in neurons from different CNS areas. Why are glia not included in this summary? A third color should be used to indicate when there is neither mRNA nor protein expression.
        • "Compiling all the information together shows that there are that 196/200 or 98% of the genes show discordance between RNA and protein expression" However, 5 genes shown in Fig. 3A do not show "discordance": CG9650, cup, Lasb, rg, and vsg!!
      • Figure 4. Neuroblast lineage analysis
        • Is clustering around the NB sufficient to determine lineage relationship? There seems to be other neurons around the NB.
        • More examples should be shown for the post-transcriptional category, as it is the most interesting category, and there are many different possible outcomes. Are there cases of transcriptional control and post-transcriptional regulation? Are there cases where the youngest neurons (closer to the NB) in the progeny are expressing the protein while the oldest are not? If not, could this be an artifact from a slow translation and the protein being detected only after building up in the cell? Top1 protein (Fig. 4D) seems to be less expressed in the youngest neurons.
        • "The transcription rate of these genes, as indicated by the relative intensity of smFISH nuclear transcription foci, is similar across the neuroblast lineage, however protein signal is only detectable in a minority of the progeny cells (Figure 4E)". Many nuclei lack clear large spots, but have small spots indicative of RNA; how is this interpreted? Do they lack transcription, or is this due failure of the smFISH to capture all transcription sites? Were transcripts actually counted to assess cell-specific differences? This should be possible with smFISH
      • Figure 5. RNA synaptic localization
        • A have global analysis comparison of all neuropil areas would be welcome in this figure.
        • "Surprisingly, another 59 transcripts are present at synapses without detectable levels of protein (Figure 5E-H)" This text does not correspond to Fig 5E-H but 5I-L. Where is the text about 5E-H?
        • For Fig. 5J and 5N RNA appears scattered regularly throughout the entire panel area. How sure are the authors that this is not due to poor signal/noise? For example, perhaps too much probe being used for these targets.
        • Fig. 5R is not cited in the text.
      • Figure 6. RNA localization in glia
        • For Fig. 6B-G it is hard to tell if there is any overlap of the RNA and Glia. Maybe show multiple zoomed-in merged images and/or highlight the structures with lines that are present in all panels.
        • For Fig. 6L-O: How reproducible is this small amount of RNA puncta in the NMJ glia? Is this possibly biologically important?
        • Why do cartoons labelling subnuclear/perinuclear glia in Fig.6 and Fig.S6 show different localization?
        • The cartoons seem to extrapolate from the data: While in Fig 6B-D, we see neither the big bright spot of transcription in the glial nucleus nor as many transcripts in the neuropil, they are both present in the cartoon. In Fig. 6E-G there is no indication of cortical glia soma nor the transcription spot only in glia nuclei.
        • "To assess glial localisation for the 200 genes of interest, we used a pan-glial gal4 driving a membrane mCherry marker (repo-GAL4>UAS-mcd8-mCherry) to learn the expression pattern of all glial cells, and then classified the pattern in the YFP lines (without the marker) based on knowledge of that expression pattern. We validated this approach by combining the RFP marker" Did the authors use mCherry or RFP for these experiments? Also, the previous sentence is redundant.
      • Figure 7. RNA localization at neuromuscular synapse
        • RNA for these genes seems far too spread throughout the muscle to draw any conclusions
        • Also with so many RNAs distributed in the muscle, specific localization of RNA molecule to the precise PSD would have no conceivable benefit
        • I suggest drawing lines around the protein expression to facilitate visualization of the mRNA localization for panels B, F and J. It is especially hard to conclude anything from panels B and F.
        • Light grey with white dots is hard to see in the cartoons
      • Figure 8. Role of khc and activity in sgg localization
        • Presumably there is a huge number of developmental problems associated with this mutant that could cause decrease in sgg localization
        • If the authors include this, then they should characterize the mutant NMJs: what is the change in size, synapse number, etc..
        • Is there more sgg accumulated in soma as a result of less transport? Is sgg being expressed at the same level?
        • Fig. 8F-H: Why is Dlg1 accumulated in the entire axon, not just the presume synapse?
        • Fig. 8J: Why is sgg signal occurring in circles disconnected from the main axon? The authors should show a different image

      Significance

      This is a significant and complex paper that contributes with novel tools to an important issue

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Titlow et al present a data resource paper for mRNA localization and protein expression in vivo focusing on the larval nervous system which is an area of high interest currently. They screen a known group of YFP gene trap lines (200 lines) and looked at specific aspects of the nervous system such as expression in neuroblasts, the mushroom bodies, glia or the NMJ. They also present a computational workflow using this set of 200 genes for the investigation of the subcellular localization and potential role of post transcriptional regulation in whole larval tissues. This uses the image data obtained experimentally and then compares with existing datasets to obtain more information.

      Major comments

      The authors results largely support the claims made in the manuscript. Is a clear proof of concept analysis of specific examples and then presentation of examples from different part of the nervous system. Different aspects of the gene trap lines are taken into account. Is a high level analysis of the sub cellular localization of mRNA and protein in different parts of the nervous system. Some interesting new insights which can lead to more in depth analysis of mechanism are presented. Is an interesting idea and presents a method in which to approach a fieId that has many remaining open questions. This manuscript is an important and timely analysis that will be of high interest in the field.<br /> Is a positive that the authors confirmed the YFP mRNA in situs with an endogenous gene in situ. Although the group is using an established and published set of gene traps, it would be good to confirm protein expression for same gene to increase confidence or provide more details on how is known that the YFP insertions do not affect mRNA stabilization or transcription or protein expression/localization. For example in Figure 1 F' versus K it is unclear why in the DlgYFP insertion there are more Dlg in situ signals than are observed in and around a neuroblast as compared to the wild type control. From the description provided these appear to the maximum intensity images. Is this due to background or an effect of the YFP insertion itself? Because of the increased level of expression is there a feedback loop of the protein regulating the mRNA expression? If had expression of Dlg protein in this figure would also confirm the YFP insertion mirrored the endogenous and it would be easier to discern if there were any changes in the number of Dlg mRNA molecules present. As this was the proof of principle example for the screen this information would increase confidence in the remainder of the data presented. AS an important part of the screen is looking at the potential for post transcriptional regulation this is an important factor to address Will this pipeline capture information on whether is secreted (contain a signal regulatory peptide) or not as then would expect to be discordant. This should be clarified or commented on. General molecular function is listed in supplementary table 1 but will other types of information be able to be correlated from datasets or databases as well.

      Minor comments

      On page 9 refer to Figure 6S which I think is supposed to be Figure S6. In text refer to an example of gli but show gs2 in the figure so it is unclear what is being referred to or shown. Could include more description on the generation of the supplementary tables and analysis of the tables. I could not find any description/legend which made analysis of some of the tables more difficult. The data set was trained on a known set of data (analyzed by experts. It would be interesting to see what it could do with a novel set of genes in the context of post transcriptional regulation, but that is beyond the overall scope of this manuscript.

      Significance

      This is an interesting idea and is a useful resource for the genes analyzed. Gives an initial tool to analyze the expression of genes. Allows for systematic analysis of mRNA (smFISH) and protein on a larger scale but with high resolution. Adds new knowledge in terms of the localization of mRNAs and protein in the periphery of neural and glia processes which may inform future analyses of the role of these genes in these tissues.

      Is a useful resource within neurodevelopment in Drosophila and post transcriptional regulation. Would be of interest to a general audience as workflow could be applied to any tissue or set of genes. Covers a very broad set of genes with disparate biological functions again making this of interest to a broader audience.

      Expertise of reviewer Drosophila, neurodevelopment, RNA regulation, post transcriptional regulation, polarity and adhesion.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript by Titlow et al. systematically analyzed spatial distribution of 200 gene's mRNA and protein, and found common discordance between them. Moreover, the browsable resource is pretty useful to most fly people. Though the authors did huge amount of experiments and analysis, and got several really interesting findings, there are some basic questions need to be answered.

      Major 1: For the wildtype CS flies, there is no YFP mRNA signal in neuroblast region and how about YFP mRNA signal in MB, OL VNC and NMJ regions? What is the criterion of setting laser power and gain for the mRNA level of 200 genes? Is it difficult to distinguish background and true signal of the mRNA in different area?

      Major 2: Would the insertion of YFP affect gene expression? Comparing to CS in Fig 1K, the dlg1 mRNA signals in dlg1::YFP line (Fig 1F) increases a lot. I do not know if this phenotype happens only in this area. So could you show some other regions for dlg1::YFP flies.

      Major 3: Is the dlg::YFP homozygous available? Among 200 gene trap lines, how many of them can be homozygous?

      Major 4: Have you tried to investigate the mRNA and protein localization in adult brains?

      Major 5: In Fig 3C, the authors claimed in MB or OL soma regions, some genes are protein expression only but no mRNA present. I wonder how do you explain this phenotype in soma.

      Major 6: Since sgg mRNA localize to both sides of NMJ, would KCl stimulus affect sgg mRNA amount and localization in muscle?

      Minor 1: You claimed that Fig 1E shows high magnification image of the inset in D, but the scale bars are the same.

      Minor 2: Figure 1 legend: K-N, are the images individual channels shown in E? Or in J?

      Minor 3: In Fig 2A, optic lobe neuropil and VNC neuropil are mislabeled.

      Minor 4: Only one panel has scale bar in Fig 4.

      Minor 5: What is Fig 5B'and F'? You should describe them in the Figure legends.

      Significance

      The browsable resource is pretty useful to most fly people. The authors did huge amount of experiments and analysis, and got several really interesting and important findings.This work will provide mRNA localization information for post-transcriptional regulation studies.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Statements [optional]

      We are grateful for the very kind, thoughtful, and detailed comments of the reviewers, which we have strived to fully integrate into the revised manuscript.

      Of note are the concerns with the data from stages S21 and S22, which we acknowledge do appear to be qualitatively and quantitatively distinct from the other samples. While we are unable to completely disambiguate meaningful biological variation from technical or experimental noise using our data, we hope a few additional analyses and visualization tools we have included can provide greater confidence in the reliability of our findings.

      Additionally, while attempting to evaluate Reviewer #2’s suggestions about examining the distribution of intergenic peaks along the genome, we discovered an error in our code that resulted in the improper assignment of peak categories. The error resulted in the improper assignment of intronic and exonic peaks as intergenic peaks. While the largest group of peaks in our dataset remains distal intergenic peaks (30.2%), and distal intergenic peaks remain a larger proportion of our intergenic peaks than proximal intergenic peaks, many of the peaks originally assigned to the intergenic categories have been reclassified as exonic or intronic peaks. We have updated our code and figures upon reanalysis of our data and have revised our findings and discussion accordingly.

      Description of the planned revisions

      Reviewer #3, Comment #3 of 11_

      “In general, I thought that the bioinformatic methods (i.e., the code or the options used for each program) would have been helpful for my understanding in some cases. The authors say that these will be published on an accompanying GitHub repository, which should be fine if this is sufficient for journal policy.”_

      We are still at work compiling the code for our analyses into a more reader-friendly form and setting up a GitHub repository to enable easy access to more detailed methods for interested readers. Some of the most important settings have been included in the Methods and Supplementary Methods sections, but we hope to include more thorough detailing of our pipelines in the GitHub repository. The raw data for portions of the RNA-Seq and all of the ATAC-Seq data have been uploaded to the Sequence Read Archive, and we are finalizing additional raw data submission. We are also in the process of determining what data to include in our Gene Expression Omnibus submission, which we hope to include all pertinent final data analysis files as well as any intermediate or accompanying datasets which would facilitate downstream analyses. The large size and number of our final analysis files has resulted in some challenges with data transfer and storage, which has delayed the upload and submission process.

      We are also collating several of the data visualization scripts built for this manuscript into a Jupyter notebook. This tool will enable the visualization of ImpulseDE2 models and peak classifications for arbitrary genes and genome regions of a user’s choice, alongside additional functions which are discussed in this revision plan.

      Description of the revisions that have already been incorporated in the transferred manuscript

      We have addressed the following substantive concerns with the manuscript:

      Reviewer #2, Comment #2 of 3:_

      “Authors have repeatedly used S21 and S22 throughout the manuscript to support their claims with clustering etc. May authors shed some light on the differences in replicates for these timepoints. Furthermore, I could not find Fig 3J, perhaps author would like to point out Fig 3H.”_

      Reviewer #3, Cross-comment #2 of 3:_

      “Focus on stages S21/S22: This might indeed be somewhat problematic. The libraries from these two stages (particularly S21) seem to be very different from those from the other stages. In the PCA (Fig. 1C), S21 doesn't cluster well with anything, and the difference between the two replicates is massive compared to other stages. The accessibility pattern (Fig. 1D) also looks odd. The libraries also have the lowest scores for % of mapped reads (Fig. S2B), fragment size distribution (S2E), and Spearman correlation (S2I). All this could be biologically sound and be due to a major developmental transition at this point, but maybe it justifies revisiting the data and testing whether leaving out S21 (and/or S22) makes a big difference for the clustering analyses.”_

      1. Reviewers #2 and #3 discussed concerns with the outlying nature of libraries S21 and S22. We had also previously held concerns about these samples and had performed some analyses to examine whether the global properties of our dataset are dramatically changed upon removing those samples. We did not observe dramatic changes to the structure of our data in the absence of the S21/S22 samples.

        • a. Samples S21 and S22 appear to be highly separated from the rest of our data using Principal Components Analysis. We had also previously believed that this suggested that these samples might be problematic. However, a colleague indicated to us that researchers in microbiome ecology had observed similar phenomena, often caused by strong single axes of variation (or “linear gradients”) in the datasets. In “Uncovering the Horseshoe Effect in Microbial Analyses” (mSystems, 2017) by Morton et al., the authors describe how a strong linear gradient can create a “horseshoe effect” or “Guttman effect”, where PCA results in the two ends of a linear gradient appearing to come together in ordinal space. The authors also describe a similar “arch effect” which strongly resembles the general shape of our PCA curve. We suggest that the strong apparent “outlier” appearance of S21 and S22 may be exaggerated or induced by the technical “arch effect” phenomenon, and may be caused by a strong single biological gradient – a developmental timecourse – which our data aimed to capture.
        • b. We also performed PCA on our dataset with the S21 and S22 time points removed prior to performing the analysis (see right panel, bottom). When we did so, we observed that the relative positions of the remaining libraries remains largely similar, with time points closer to the middle of development showing a positive loading in PC2, and time points closer to the beginning and end of development showing a negative loading. This suggests that the second major axis of variation in our dataset would remain a contrast between middle vs. terminal timepoints, even without the S21/S22 data, and that the relative positioning of the remaining data within PC-space is not entirely driven by S21/S22.
        • c. To further assess the degree of the S21/S22 samples’ outlying effects, we also performed ImpulseDE2 analysis to generate model fits without S21/S22 data. Doing so allowed us to determine to what degree the S21/S22 stages are necessary for driving the accessibility trajectory of individual peaks, and of the data more broadly. We performed IDE2 with either all data, or the S21/S22 data removed prior to input into IDE2. This generated two sets of model fits to the “cloud” of accessibility vs. time measurements: one that included the S21/S22 data, and one without. We evaluated, for each peak in our dataset, the time point at which the IDE2 model achieved maximum accessibility (the “IDE2 max fit”), and plotted both the “all” and “noS21S22” data as a histogram (see right panel, top graph). The presence of peaks that achieve predicted maximum accessibility in the S21/S22 stages in the “no S21/S22” data is a result of how we calculate “max fit”, which does not require that there is a known accessibility value at a given timepoint; only that the time point during which the model fit is maximum is closest to the timing of that developmental stage. Overall, we still observed early, middle, and late enrichment of IDE2 max fit even when the S21/S22 data are removed. We do see a rightward shift in the middle timepoint histogram in the direction of later stages, although this may be expected given the absence of concrete accessibility values at S21/S22 in the “no S21/S22” data. This indicates that our data globally retain the general trends of early, middle, and late enrichment of accessibility in the absence of the S21/S22 data. Moreover, this suggests that, even without the S21/S22 data, the remaining data from early and late stages result in a model fit that still predicts maximum accessibility at middle developmental stages for many peaks.
        • d. To further measure the influence of the S21/S22 data in IDE2 model fit, we also evaluated the degree of change in the global behavior of a peak when the S21/S22 stages were removed. This analysis aimed to assess whether removing S21/S22 data resulted in an IDE2 model with the same general trajectory as with all data, as opposed to the more stringent requirement of evaluating whether the exact developmental stage of the peak was changed. To perform this analysis, we grouped developmental stages into five quintiles, each representing three stages of development. We asked, for each peak in our dataset, whether that peak’s IDE2 max fit was “stable” when the S21/S22 data were removed; that is, if the quintile of the IDE2 max fit was altered when the S21/S22 data were removed (i.e. if a peak moved more than 3 developmental stages away from its original position), a peak was considered “unstable”. We observed that over 80% of peaks in each quintile remained “stable” after removing the S21/S22 data, suggesting that the vast majority peaks show the same general trajectory of accessibility even without the S21/S22 data. Peaks within the middle time points appeared to be more unstable than peaks at the terminal timepoints, which could be expected given that the S21/S22 timepoints constituted the middle-most timepoints in our dataset.

      We acknowledge that the S21/S22 timepoints still appear to be qualitatively different in other ways. Moreover, we acknowledge that some of the peaks in our dataset are “dependent” on the S21/S22 stages, given that their accessibility trajectory changes when these stages are removed. It is difficult to determine whether a change in accessibility trajectory for a given peak caused by the removal of S21/S22 data is indicative of technical differences in sample preparation, such as batch effects; biological variation, such as a potentially unknown mutant or sick embryo; or due to genuine wildtype biological processes that occur at the S21/S22 stages.

      These caveats acknowledged, a comparative analysis of the data in the absence of the S21/S22 stages suggests that much of the global picture of development remains the same. In the interest of providing the data we generated as a resource, we decided to include the S21/S22 data in the final manuscript we have prepared for submission.

      We have included an additional supplementary figure (Supp. Fig. 2.2) highlighting these further analyses, which we hope future readers will consider when performing their own analyses with these timepoints, as well as a summary of the ways we evaluated this potential concern in the Supplementary Methods. To facilitate future users of this dataset, we will include the model parameters calculated from IDE2 using both the full dataset and the data with S21/S22 removed in the GEO accession data, as well as a Jupyter notebook (ParhyaleATACExplorer.ipynb) that allows users to plot the raw accessibility data and IDE2 model fits for individual peaks of interest (C, example on right panel), so that downstream experiments can consider the potential differences with the S21/S22 samples.

      Reviewer #2, Comment #3:_

      “The majority of ATAC-seq peaks in the distal intergenic regions is a very surprising result. Authors defend this result by suggesting that this organism has big genome. May author perform a short analysis that shows that these peaks are indeed represent nearby genes or may point towards 3D genome organisation. For example, I see that this genome might have regions in the genomes that are densely organised in gene clusters, in those cases does the pattern remains same i.e he majority of the genes are very distant from each other and hence use vital regulatory elements?”_

      Reviewer #3, Cross-comment #3 of 3:_

      Peaks in distal intergenic regions: I agree that this could be elaborated on. It might also be that >10 kb is not actually that distal for Parhyale. I would suggest to split the "distal peaks" further (e.g., in 10 kb or 2-log steps, or whatever makes most sense) and try to understand if >10 kb is mostly <20 kb, or if most of them are hundreds of kb from the nearest gene?_

      1. Reviewers #2 and #3 expressed interest in understanding the absolute distribution of distal intergenic peak distances from nearby genes in our dataset. In generating the analyses to address this question, we stumbled upon an error in our code that reveals that the true number of intergenic peaks is much lower than we had originally reported. We discuss the nature of the error below. Moreover, we address the previous question using the new data, which overall still indicates that distal intergenic peaks remain a large portion of the Parhyale genome.
        • a. To address Reviewer #2’s comments with respect to the presence of potential clusters of intergenic regions, we built a Python tool (included in ParhyaleATACExplorer.ipynb) enabling the visualization of different cis-regulatory element categories along a genomic coordinate. Upon plotting our data with this tool, we observed problems with the categorization of the peaks – namely, that intronic and exonic peaks were erroneously classified as intergenic peaks (see right panel, top). We analyzed our script for classifying annotations more carefully and realized that we had erroneously used “bedtools closest” instead of “bedtools intersect” to try to identify all peaks overlapping with gene annotations in our genome. We corrected this error and observed the expected distribution and categories of peaks in our data (right panel, bottom).
        • b. The revised peak categories have been added to the updated manuscript in Fig. 3H and Fig. 5C. The categories of peaks we observed differ substantially from our previous results, in that we observe a much higher representation of exonic and intronic peaks in our dataset, with intronic peaks now representing 28.2% of all peaks (increased from <1%), and distal intergenic peaks representing 30.2% (decreased from 51.2%). While distal intergenic peaks remain the largest category over time, the proportion is relatively equal to the fraction of intronic peaks. Intergenic peaks (distal and proximal combined) now make up only a slightly larger fraction of peaks (37.2%) than gene body peaks (exon, intron; total 34.4%). This updated result is a significant departure from our previous report, and we have updated the text of the manuscript to correct this mistake.-
        • c. While intergenic and distal intergenic peaks constitute a much smaller portion of our data, we still wanted to address Reviewer #2 and #3’s questions about the distribution of distances between intergenic peaks and nearby genes. We generated a plot to illustrate the number of intergenic peaks at variable distances to the nearest gene (B, right panel). As illustrated in the plot, there are a very large number of distal intergenic peaks, including many peaks >100kb away from the nearest gene. The average distance of intergenic peaks from the nearest gene was 73,351bp. We neglected to mention in the original manuscript that one of the rationales for choosing a 10kb cutoff as “distal intergenic” was that peaks beyond this distance would be considerably more difficult to isolate as single fragments combined with a proximal promoter using PCR, agnostic of their orientation with respect to the promoter element. Such peaks could not have been easily identified using previous transgenic approaches, and are thus distinguished from “proximal” peaks by their necessary identification using techniques such as ATAC-Seq. We have updated the text to reflect this distinction.
        • d. Given that both intergenic and gene body peaks appeared to comprise large fractions of our revised data, we also examined the relative enrichment of intergenic and gene body peaks with respect to time (after normalizing for the fraction of “unknown” peaks, as suggested by Reviewer #3). We observed that the proportion of peaks belonging to intergenic and promoter regions declined slightly as development progressed, while the proportion of gene body peaks increased (E, below). There appeared to be slightly more intergenic peaks than gene body peaks at all developmental time points, and the ratio of intergenic peaks to gene body peaks declined very slightly over time (F, below). These data indicate that intergenic and gene body peaks have different enrichment trajectories over time. As development progresses, gene body peaks are increasingly enriched, and may have a greater impact on gene regulation. We have added these additional observations to the text and to a new Supplementary Figure 2.3.

      We have also addressed the following textual and conceptual concerns with the manuscript:

      Reviewer #3, Comment #1 of 11_

      I felt that the first paragraph of the introduction is not necessary._

      1. We believe the introductory paragraph helps frame the paper in the context of the broader scope of advances in technologies for emerging research organisms – currently, it has become straightforward to both generate a genome sequence and to identify and manipulate coding genes of interest across diverse taxa, but the identification of gene regulatory mechanisms remains more difficult. We have edited the introduction to better reflect this perspective and to link the first paragraph to the rest of the paper.

      Reviewer #2, Comment #1 of 3_

      “In Introductory paragraph 2, sentence one, authors suggest that gene regulation plays more important role in evolutionary process than genes. Although a significant amount of research has been dedicated to gene regulation based evolution still this field is in nascent form. For example evidence of inheritance of the gene regulation pattern across generation is scarce and requires more evidence. I suggest authors to modulate the claim that still gene based evolution is the main paradigm instead otherwise.”_

      Reviewer #3, Cross-comment #1 of 3_

      Evolution via gene regulation vs. coding sequence: While (to my understanding) it is largely accepted in the field that changes to the CDS will often have more deleterious effects than changes to the expression of a gene, I agree that this could be elaborated on a bit.

      1. As requested by Reviewers #2 and #3, we have clarified the language surrounding the debate between gene functional and gene regulatory evolution to indicate that both mechanisms appear to be important for evolutionary processes, with the importance of the latter more recently revealed.

      Reviewer #3, Comment #2 of 11_

      Use of Genrich: I presume this was run on both duplicates simultaneously? This is not clear from the methods section. It might have implications for downstream analyses (e.g., differential accessibility between time points) because running on both sequencing library replicates simultaneously leads to a single "replicate" of peaks per time point, while running it individually leads to two. However, I have never tested if this actually does make a difference. Maybe the authors have and can comment on this?

      1. In response to Reviewer #3’s inquiry about Genrich, we have added additional clarifying information into the Methods section. “Genrich analysis was run on both duplicate libraries simultaneously; Genrich performs peak calling on each peak individually, and then merges the p-values of the replicates using Fisher’s method to generate a q-value, obviating the need to calculate an Irreproducible Discovery Rate (IDR).” We did not test running Genrich on individual libraries, opting for the more conservative approach of using the combined q-value as a filtering score for peak quality. For further information, the reviewer can see the Genrich Github repository section here: < [https://github.com/jsh58/Genrich#multiple-replicates]

      Reviewer #3, Comment #4 of 11_

      The section on the IDE2 models (the paragraph at the end of page 4/beginning of page 5) was unclear to me but appears sound. (The only instance where I didn't quite understand what the program actually does.) Maybe this can be explained a bit easier?_

      1. As requested by Reviewer #3, we have attempted to explain the methods and logic of using ImpulseDE2 a bit more clearly:

      “To identify regions of dynamically accessible chromatin, we used the ImpulseDE2 (IDE2) pipeline (Fischer et al., 2018). IDE2 differs from other software for differential expression analysis in that it allows the investigation of trajectories of dynamic expression over large numbers of timepoints. It does so by modeling a gene expression trajectory as an “impulse” function that is the product of two sigmoid functions (Chechik and Koller, 2009; Yosef and Regev, 2011). This approach enables the modeling of a trajectory of gene expression in three parts: an initial value, a peak value, and a steady state value, thus summarizing an expression trajectory using a fixed number of parameters. With the ability to capture the differences between early, middle, and late expression values for each gene in a dataset, IDE2 also enables the detection of transient changes in gene expression or accessibility during a time course. Identifying differential expression over large numbers of timepoints is difficult for more categorical differential expression software such as edgeR and DESeq2, which generally use pairwise comparisons between timepoints to assess change over time (Love et al., 2014; Robinson et al., 2010).”

      Reviewer #2, Comment #2 of 3_

      2-2) Authors have repeatedly used S21 and S22 throughout the manuscript to support their claims with clustering etc. May authors shed some light on the differences in replicates for these timepoints. Furthermore, I could not find Fig 3J, perhaps author would like to point out Fig 3H.

      Reviewer #3, Comment #5 of 11_

      On page 7, Fig.3J needs changing to 3H. This figure should, in my opinion, also contain the absolute number of peaks for each time point to set the individual proportions into context.

      1. As requested by Reviewer #3, we have added a bar charts representing the number of peaks found at each time point (Fig. 3H) and the number of peaks found in each cluster (Fig. 5C) to the peak type proportion plots. We have also fixed references to Fig. 3J to instead refer to Fig. 3H – we apologize for the confusion.

      Reviewer #3, Comment #6 of 11_

      Last paragraph of the "Improving the Parhyale genome annotation" section: I think this needs to focus on those regions of the genome for which the location is known - after all, the "unknown" regions" could all be "distal transgenic", which would significantly change the relative proportions._

      1. We have revised our analysis of this topic with our updated peak type proportions, as described above in point 2d above under “substantive concerns”.

      Reviewer #3, Comment #7 of 11_

      “On page 9, t-SNE is mentioned but doesn't seem to be cited.”

      1. As requested by Reviewer #3, we have added citations for the t-SNE method, as well as scikit-learn, the software we used for t-SNE visualization.

      Reviewer #3, Comment #8 of 11_

      “The third paragraph on page 9 ("We evaluated the differences...") should mention the fact that clusters 1 and 2 are the only ones with significant proportions of exonic and intronic peaks. In the accompanying figure (5C), the total number of peaks would again be helpful.”_

      1. After identifying the error in our peak category classification pipeline, this observation was no longer true. However, upon examining the new distributions by cluster, we observed that in Clusters 3–7, for which we observed GO enrichment for developmental processes, there appeared to be slightly higher enrichment of intronic regulatory elements than distal intergenic regulatory elements. These results resemble the observation from recent work showing that tissue-specific enhancers are enriched in intronic regions in various human cell types (e.g. Borsari et al. 2021, Genome Research). We have noted this new observation in the text.

      Reviewer #3, Comment #9 of 11_

      In figure 5D, I can't quite make out at which stage the dip in the peak of Cluster 8 occurs. This is quite an unusual pattern of accessibility change, and I can't help but wonder if it has something to do with the quality of one of the libraries? Also, the fact that half of the peaks fall into unmapped regions of the genome is unusual, and I feel this deserves more discussion._

      1. In Figure 5D, Reviewer #3 asks about a dip in accessibility for Cluster 8 peaks. The dip in accessibility was actually observed for Cluster 9 peaks and is marked by the asterisk in that panel. We have updated the figure legend to clarify the significance of the asterisk and have referred readers to examine Supp. Fig. 5.1B, where the IDE2 model fits more clearly show a collective dip in accessibility for Cluster 9 peaks. Upon examining the size distribution of the clusters, we have also noticed that Cluster 8 is the smallest cluster. We have noted the small cluster size and high “unknown” peak enrichment for Cluster 8 in the text.

      Reviewer #3, Comment #10 of 11_

      “On page 10, the abbreviation PFM appears, but it is only explained in the legend of Fig.4. This should appear in the text.”_

      1. Reviewer #3 mentions that on page 10, we use the abbreviation for position frequency matrices (PFMs) without previous reference. We first introduce the abbreviation on page 8, but given the repeated use of “PFM” on page 10, we have added an additional explanation of the abbreviation on page 10, for ease of reading.

      Reviewer #3, Comment #11 of 11_

      “The section on "Concordant and discordant expression and accessibility" is the one I disagree most with. The authors seem to suggest that a repressive cis-regulatory module should become less accessible when the gene is activated. However, they leave trans-acting factors completely out of their conceptualisation here. It is in general likely the availability of transcription factors that leads to repression, while the "silencer" can be well accessible in all cells. Moreover, it has become clear in recent years that CRMs are not just repressors or enhancers per se but can act as either depending on the availability of transcription factors. I think these facts could partially explain the weak correlation and should be discussed.”_

      1. We appreciate the comments from Reviewer #3, which alerted us to the more recent literature around the bifunctional potential of regulatory elements. We have revised our claims to clarify that concordance and discordance analysis cannot be used to directly assign “enhancer” or “silencer” identity to given regulatory elements. Instead, we suggest that evaluating concordance and discordance can be useful for downstream users of our data, such as those aiming to build reporter constructs for a given gene of interest. To facilitate such tool development, we have built additional functions into a Jupyter notebook to enable the visualization of accessibility, gene expression, fold change of accessibility and gene expression, significance of fold change, and concordance/discordance assignment for arbitrary peak-gene pairs. An example of this visualization is shown on the following page. Panel A shows the region around the Engrailed-1 and Engrailed-2 loci in Parhyale (text labels within the plot region were added manually in Illustrator). Panel B shows visualization of the En1 promoter peak alongside En1 expression. Significant log fold changes (DESeq2 padj < 0.05) are marked by asterisks in the bar plots, and concordance/discordance assignment at each time point is indicated by the color of the comparison text (red = concordant, blue = discordant). Panels C and D show accessibility and expression visualization for a single peak (En1 peak5) compared to two nearby genes (En1 and En2). We hope to include sufficient documentation in our GitHub repository such that using these tools is accessible for most researchers, even with limited programming knowledge.

      Description of analyses that authors prefer not to carry out

      We were unable to easily visualize the distribution of regulatory elements across the whole genome as suggested by Reviewer #2. One challenge of working with the Parhyale genome is the lack of complete chromosomes. The genome is distributed across ~290,000 contigs of variable size. We were unable to find any software that could be easily and quickly set up to visualize our data, although we will provide in a Jupyter notebook the tools for local visualization of peak types that we developed.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Sun et al. use RNAseq and ATAC-seq in 15 stages of embryonic development of the amphipod crustacean Parhyale hawaiensis to analyse gene regulation genome-wide. They assess the data in multiple ways to provide a more complete genome annotation, understand temporal changes in gene regulation, and identify different classes of cis-regulatory elements including associated GO terms and putative transcription factor binding site enrichment. The authors have made a great effort to account for potential biases in their datasets (one impressive example is the comparison of multiple transcriptome assemblies and the following quality assessment) and I enjoyed reading this manuscript for its great explanations of method usage (i.e., what each bioinformatic package does, why it was used etc.) and the overall style.

      I want to make a few suggestions that would make the study - in my opinion - even better:

      • I felt that the first paragraph of the introduction is not necessary.
      • Use of Genrich: I presume this was run on both duplicates simultaneously? This is not clear from the methods section. It might have implications for downstream analyses (e.g., differential accessibility between time points) because running on both sequencing library replicates simultaneously leads to a single "replicate" of peaks per time point, while running it individually leads to two. However, I have never tested if this actually does make a difference. Maybe the authors have and can comment on this?
      • In general, I thought that the bioinformatic methods (i.e., the code or the options used for each program) would have been helpful for my understanding in some cases. The authors say that these will be published on an accompanying GitHub repository, which should be fine if this is sufficient for journal policy.
      • The section on the IDE2 models (the paragraph at the end of page 4/beginning of page 5) was unclear to me but appears sound. (The only instance where I didn't quite understand what the program actually does.) Maybe this can be explained a bit easier?
      • On page 7, Fig.3J needs changing to 3H. This figure should, in my opinion, also contain the absolute number of peaks for each time point to set the individual proportions into context.
      • Last paragraph of the "Improving the Parhyale genome annotation" section: I think this needs to focus on those regions of the genome for which the location is known - after all, the "unknown" regions" could all be "distal transgenic", which would significantly change the relative proportions.
      • On page 9, t-SNE is mentioned but doesn't seem to be cited.
      • The third paragraph on page 9 ("We evaluated the differences...") should mention the fact that clusters 1 and 2 are the only ones with significant proportions of exonic and intronic peaks. In the accompanying figure (5C), the total number of peaks would again be helpful.
      • In figure 5D, I can't quite make out at which stage the dip in the peak of Cluster 8 occurs. This is quite an unusual pattern of accessibility change, and I can't help but wonder if it has something to do with the quality of one of the libraries? Also, the fact that half of the peaks fall into unmapped regions of the genome is unusual, and I feel this deserves more discussion.
      • On page 10, the abbreviation PFM appears, but it is only explained in the legend of Fig.4. This should appear in the text.
      • The section on "Concordant and discordant expression and accessibility" is the one I disagree most with. The authors seem to suggest that a repressive cis-regulatory module should become less accessible when the gene is activated. However, they leave trans-acting factors completely out of their conceptualisation here. It is in general likely the availability of transcription factors that leads to repression, while the "silencer" can be well accessible in all cells. Moreover, it has become clear in recent years that CRMs are not just repressors or enhancers per se but can act as either depending on the availability of transcription factors. I think these facts could partially explain the weak correlation and should be discussed.

      Significance

      This manuscript will greatly advance research in the emerging model organism Parhyale through a more complete genome annotation and vast amounts of gene expression and chromatin accessibility data (and accompanying analyses) at various stages of development. However, the impact goes far beyond the Parhyale community, and I believe this paper can be seen as a blueprint for similar studies in other organisms. The excellent documentation and comparison of their bioinformatic methods makes their re-use straightforward and much of the authors' pipeline can be used for a "standard" ATAC-seq data analysis - I am likely to use many of their methods myself. Therefore, I think the audience can range from the "classic" evo-devo community to developmental biologists, scientists interested in gene regulation in general, and bioinformaticians.

      My own expertise is in gene regulation through transcriptional control, and I use different seq approaches (ATAC, CUT&RUN, RNAseq) to study this process.

      Referees cross-commenting

      Thank you to my colleagues for their comments. Since Reviewer 1 was happy with the manuscript as it is, I'll only add my views to the points raised by Reviewer 2: - Evolution via gene regulation vs. coding sequence: While (to my understanding) it is largely accepted in the field that changes to the CDS will often have more deleterious effects than changes to the expression of a gene, I agree that this could be elaborated on a bit. - Focus on stages S21/S22: This might indeed be somewhat problematic. The libraries from these two stages (particularly S21) seem to be very different from those from the other stages. In the PCA (Fig. 1C), S21 doesn't cluster well with anything, and the difference between the two replicates is massive compared to other stages. The accessibility pattern (Fig. 1D) also looks odd. The libraries also have the lowest scores for % of mapped reads (Fig. S2B), fragment size distribution (S2E), and Spearman correlation (S2I). All this could be biologically sound and be due to a major developmental transition at this point, but maybe it justifies revisiting the data and testing whether leaving out S21 (and/or S22) makes a big difference for the clustering analyses. - Peaks in distal intergenic regions: I agree that this could be elaborated on. It might also be that >10 kb is not actually that distal for Parhyale. I would suggest to split the "distal peaks" further (e.g., in 10 kb or 2-log steps, or whatever makes most sense) and try to understand if >10 kb is mostly <20 kb, or if most of them are hundreds of kb from the nearest gene?

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Sun et al used omni-ATAC sequencing that is a modified version of classical ATAc-seq to identify and characterise the cis-regulatory elements in the P. hawaiensis genome. They further use long and short reads to improve upon existing gene annotation for this organism. The in-depth analysis ensures the results and conclusions to be sound however few points below might be needed to be addressed before the acceptance of manuscript.

      In Introductory paragraph 2, sentence one, authors suggest that gene regulation plays more important role in evolutionary process than genes. Although a significant amount of research has been dedicated to gene regulation based evolution still this field is in nascent form. For example evidence of inheritance of the gene regulation pattern across generation is scarce and requires more evidence. I suggest authors to modulate the claim that still gene based evolution is the main paradigm instead otherwise.

      Authors have repeatedly used S21 and S22 throughout the manuscript to support their claims with clustering etc. May authors shed some light on the differences in replicates for these timepoints. Furthermore, I could not find Fig 3J, perhaps author would like to point out Fig 3H.

      The majority of ATAC-seq peaks in the distal intergenic regions is a very surprising result. Authors defend this result by suggesting that this organism has big genome. May author perform a short analysis that shows that these peaks are indeed represent nearby genes or may point towards 3D genome organisation. For example, I see that this genome might have regions in the genomes that are densely organised in gene clusters, in those cases does the pattern remains same i.e he majority of the genes are very distant from each other and hence use vital regulatory elements?

      Significance

      The study by Sun et al is timely in nature and significantly improve the gene annotation of P. hawaiensis. It definitely advances the current knowledge for this organism regulatory elements. The comparison to other model organisms can be further improved by extending the discussion of the results especially in context of distal regulatory elements. The resource generated will be helpful for the researchers working in the field of developmental biology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The contribution by Sun et al. describes a very deep and thorough analysis of an Omni-ATAC-seq approach to identifying cis-regulatory elements in the crustacean Parhyale. This is a resource paper, so it does not explicitly have a research question or conclusions. The findings are a detailed dataset of putative regulatory elements, tested and validated with a number of different computational approaches, and - to a lesser extent - with a number of experimental approaches.

      The authors' work is very thorough, and while it may be possible to add more analyses and more validations, the work presented in the manuscript is impressive and stands on its own as a useful body of data. No additional work is needed to make this a complete contribution.

      The text is very well written and clear. It is a bit arduous in some places, but that is understandable, given the technical nature of the paper. The figures are clear and many of them are very eye-catching (in a positive sense).

      All in all, I have no criticism of this contribution. It is a very carefully executed and thorough analysis.

      Significance

      I am not aware of any other species outside of the main experimental model organisms for which there is data about putative regulatory elements that is as detailed as that presented in this manuscript. It is thus not only a fantastic resource for people working on Parhyale, but also a model for how such data can and should be generated for other species. The authors say this explicitly in their concluding paragraphs and I agree. The Parhyale community will pounce on this paper as a useful resource, whereas people working on other species might be inspired by it to generate equivalent data for their communities.

      I am an evolutionary developmental biologist who has worked on a number of species that are not traditional model species (I avoid the term "non-model", since every species is a model for something). I for one, fall into the category of people who will be inspired to generate equivalent data, although I must confess that I do not have the bioinformatic expertise of the authors, and therefore I am not able to critically assess the specifics of the tools they have used to generate and validate their data.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): Excellent quality of cell biology and biochemistry. the additional supports are needed for the claim of actin elongation using different formin variants.

      Reviewer #1 (Significance (Required)): Ingrid Billault-Chaumartin and co-authors described interesting research that provides insights on formin-isoform specific function in fission yeast and a new role of Fus1 FH2 domain in cell-cell fusion event. While three formin isoforms have different localization, research proposed an additional dissection in their functional differences by having different functions in C-terminus, including FH1 FH2 and formin C-terminus. The work also described additional factors that regulate cell fusions from autotrophy effect and formin expression level, in addition to the well-accepted formin biochemical activities. Here are my comments regarding the strengths of the work and improvements that could further strengthen the story.

      Major comments 1. Fig.1 shows Cdc12C could recapitulate Fus1 function by ~80% if fused with Fus1C, whereas deletion of the C-terminal tail of Cdc12 following FH2 introduces drastic dysfunction. Together with Fig. 3, these results indicate Cdc12 Cter plays more important roles than Fus1 Cter for there respective functions. Such results suggested a Cter-mediated mechanism that differentiates the functions of three fission yeast formin isoforms. The authors examined contributions from the difference in FH1 (Figs 4,5) and FH2 residues (Fig. 6). Whereas the obvious phenotype of Cter was not further investigated and not much discussed. The Cter of budding yeast formins interacts with nucleation-promoting factors, Bud6 and Aip5. Although S. Pombe does not have orthologs of budding yeast Bud6 and Aip5, I wonder would the author discuss the potential contribution of Cter in differentiating S. Pombe formins.

      The reviewer is correct that the C-terminal tail region of Cdc12 beyond the FH1-FH2 domains has a strong influence on the ability of Cdc12C to replace Fus1C. This is one reason why we specifically investigated the possible role of Fus1 C-terminal tail, which is much shorter than that of Cdc12. We found that Fus1 C-terminal tail plays only very minor role in regulating Fus1 function, as described in Figure 3. We note that contrary to what the reviewer states, Bud6 exists in S. pombe and binds the C-terminal tail of the formin For3 (see Martin et al, MBoC 2007), but whether it binds Fus1 is unknown. We have expanded our discussion to include a paragraph on the role of formin C-termini.

      Because the manuscript is focused on the function of Fus1 formin, we did not explore further the role of the Cdc12 C-terminal tail. It was previously shown that this region of Cdc12 contains an oligomerization domain that promotes actin bundling (Bohnert et al, Genes and Dev 2013). It is thus likely that this helps Cdc12 FH1-FH2 perform well in replacement of Fus1. In fact, it is likely that oligomerization boosts formin function, as we have discovered that Fus1 N-terminus contains a disordered region that fulfils exactly this function. This is described in a distinct manuscript under review elsewhere and just deposited on BioRxiv (Billault-Chaumartin et al, BioRxiv 2022; DOI: 10.1101/2022.05.05.490810). We have now cited this point in the discussion.

      1. Here, the study focuses on the FH1 between Fus1 and Cdc12 to understand their different functions in actin polymerization. FH1 mediated actin elongation through its interaction with profilin via polyP. The transfer rate of G-actin from profilin and profilin sliding depends on the polyP patterns regarding the length of each polyp motif and their distance to FH2 (Naomi Courtemanche and Thomas D. Pollard, JBC, 2012). To better understand the mechanisms by which these engineered FH1 variants on both Fus1 and Cdc12 in Fig. 4, the author may want to list the sequence of these engineered FH1 domains, including the information of the number and length of polyp motifs, and discuss these patterns.

      This list and discussion were available in the initial paper that characterized each of the constructs in vitro (Scott et al, MBoC 2011). We have now re-drawn it in a supplemental figure for convenience (as also answered in response to minor point 2), which is already provided in the revised manuscript as Figure S1. (Previous supplementary figures are re-numbered S1>S2, S2>S3 and S3>S4).

      1. Figs.4,5 cell biology results do not directly support the point of specific elongation rate unless the LifeAct-labeled actin cable elongation speed could be followed and quantified. The fluorescent tagging of tropomyosin does not show the actin cable pattern, which makes it very difficult to be used to study actin cable dynamics, such as elongation. Therefore, I feel the data in current Fig. 4 and Fig. 5 could not claim the differences in actin elongation without a quantitative comparison of elongation rate. I suggest a CK666 treatment to increase the visibility of the actin cable pattern of LifeAct, used before in both fission and budding yeasts, which would allow the author to quantify the actin cable elongation rate. Another way is to use the TIRF assay used in this study, which would give a better quantitation of formin nucleation and profilin-aided elongation.

      We respectfully disagree with the reviewer on this point. All the constructs we use in vivo have been characterized in vitro and their elongation rate carefully measured (Scott et al, MBoC 2011). These values are thus known and can be directly compared to our results in vivo.

      Of course, it would be fantastic to be able to directly measure formin elongation rates in vivo, but we are not aware that this has been done in any system. The proxy experiments that the reviewer suggests would be good ones, but each faces technical challenges that make them impossible in our system. First, because the fusion focus is a structure that forms in response to cell-cell pheromonal communication, we cannot add CK-666 or any other drug during this phase, as this perturbs the pheromone signal. Indeed, we had shown that simple buffer wash leads to loss of the fusion focus (see Dudin et al, Genes and Dev 2016). Second, the fusion focus is at the contact site between partner cells, i-e somewhat distant (1-2µm) from the coverslip during imaging. It is thus impossible to use TIRF. Finally, the fusion focus is a tightly packed actin structure. This is the reason why (rather than use of the tropomyosin marker) we cannot image single actin filaments (or even bundles) of which we could follow the dynamics as has been done to measure the retrograde flow of actin cables in yeast.

      What we have done is to use a better tropomyosin tag, mNeonGreen-Cdc8, which was just described (Hatano et al, BioRxiv 2022; DOI: 10.1101/2022.05.19.492673) to quantify amounts of linear actin. Although this is not a measure of elongation rate, it would give some sense about amounts of polymer assembled. We have obtained images with mNeonGreen-Cdc8 of all experiments previously conducted with GFP-Cdc8 and have replaced them in Figure 4C, Figure 5E, Figure 6E and Figure S2B. We have also quantified the relevant strains. The relative intensities of mNeonGreen-Cdc8 at the fusion focus at fusion time reflect remarkably well the measured elongation rates of the various formin constructs characterized in vitro. These data are now provided as new panels Figure 4F and Figure 5F.

      1. I appreciated the detailed biochemical dissections of multiple aspects of WTFus1 and Fus1R1054E, although the biochemical assays could not identify the mechanism by which R1054E causes the cell fusion. In many cases, the formin functions are diverse in diverse biological processes and sophisticated that cannot be explained well only from its biochemical activities in actin polymerization, such as the bundling, nucleation, and elongation studied in this story regarding fusion. This exciting information allows us to think of more possibilities that might regulate formin function rather than a direct change of formin activities in actin polymerization. I think a discussion of different aspects of functional regulation of formin might inspire society to investigate new possibilities to solve the mysteries. For example, the changes in formin behaviors and functions could be regulated by stress-induced formin turnover by degradation, cell signaling-regulated formin clustering and complex assembly, and their potential relevance to recruit protein constituents for fusion progression.

      We have added a paragraph on the role of Fus1 C-terminus. If you feel we should expand more on the diverse modes of regulation of formins, we could, but we have so far kept the discussion centred around the points of investigation in this paper, whose aim was to probe how changes in nucleation and elongation rates, rather than other regulations, affect the in vivo function of Fus1.

      Minor comments. 1. There are two types of "C", one includes FH1/FH2 and one following FH2, used in the manuscript, and it is a bit confusing. Better to differentiate them that allows an easy following. Fig. 1 uses Cdc12C-deltaC, Fig. 3 uses Fus1-delta Cter.

      We have updated the nomenclature to make this clearer: the C-terminal region beyond the FH1-FH2 domains is now called Cter throughout the manuscript.

      1. It's better to specify the amino acid position on the schematic of formins, such as panel A in many figures. It's always more informative to compare formin activities by considering the domain lengths, especially for the C-terminal tail that is variable in lengths and sequences. With similar thoughts, I suggest a supplementary figure that lists the sequence of all FH1 domains variants and Cter domains, such as the FH2 domain in Fig. S1.

      We have made a supplementary figure (new Figure S1) listing all constructs with specific aa positions as well as the FH1 domain variants and their sequences (see also answer to point 2 above). We have not added the sequence of the Cter domains in this figure, as these are extremely divergent and not particularly informative at this point.

      1. "n" for the statistic needs to be provided for Fig. S3.

      We have added the information to the legend of the figure (now Fig S4).

      1. The SDS-PAGE staining gel of the purified recombinant proteins for biochemical assays should be provided, particularly for these newly reported mutant variants.

      This is now provided as new panel S4C. We show the purified recombinant Cdc122FH1-Fus1FH2 proteins, which are the newly reported ones.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): In this study, Billaut-Chaumartin and colleagues investigate the molecular specialization of the S. pombe formin, Fus1. The authors systematically modulate the actin filament elongation and nucleation activities of Fus1 by expressing chimeric constructs that contain Formin Homology 1 and 2 domains from two other formins with known polymerization activities. By characterizing the architecture of the fusion focus and the efficiency of cell fusion, they find that both the elongation and nucleation properties of Fus1 are specifically tailored for its cellular role. Comparison of formin constructs with similar elongation and nucleation activities also reveals that the Fus1 FH2 domain possesses a specific property that promotes efficient cell fusion. Using sequence alignment and homology modeling, the authors identify R1054 as the residue that confers this novel, fusion-specific activity to Fus1, despite producing no effect on its bundling or polymerization properties in vitro.

      Overall, this study is well motivated, and the results support the conclusions that are drawn. I have only minor suggestions, as described below.

      Minor comments: (1) The schematic diagrams of the chimeric formin constructs are very helpful. However, it is difficult to distinguish the colors from one another, especially in the case of the Cdc12FH1-Fus1FH2 variant, which requires discernment of the relatively small purple region within the dark blue molecule. Would it be possible to modify the colors to increase their contrast? Similarly, the blue and gray data sets in Figure 3B are very difficult to discern.

      We have changed the colours to improve contrasts.

      (2) The affinities (Kd) with which the formins bind the barbed ends as described in the second-to-last paragraph on page 8, in Figure Legend 7G, and in the "Analysis of pyrene data" section of the Materials and Methods should be defined as dissociation "constants", rather than dissociation "rates". Also, these affinities are lacking units in the following sentence on page 8.

      We have corrected this. The unit is nM.

      (3) When comparing the TIRF micrographs in Figure S3A, it looks as though both formins (but especially the R1054E variant) nucleate more filaments in the presence of profilin than in its absence. Is this a reproducible effect? If so, can the authors provide an explanation for this?

      There is strong variability in the filament numbers observed by TIRF in replicate experiments, which makes it difficult to use this technique to determine the nucleation efficiency. This may be due for instance to the stickiness of the glass, which may influence the number of observed filaments. We have measured the number of filaments after 130s of polymerization for each condition to test whether there are any significant differences between conditions despite overall variability. The measurements suggest that the addition of profilin increases the number of actin filaments. However, these results should be taken very carefully due to the experimental variations (very large error bars). Additionally, because Fus1-associated filaments are very short in absence of profilin, it is quite likely that this influences their crowding at the glass surface compared to longer filaments (in presence of profilin). Since in TIRF we can only observe the filaments at the glass surface, we may miss a portion of short Fus1-bound actin filaments in absence of profilin.

      For these reasons, and because the possible role of profilin in modulating nucleation efficiency by formins is not the object of the work here, would thus prefer not to include this graph in the manuscript.

      Reviewer #2 (Significance (Required)): This study contributes a key advancement towards understanding how the polymerization activities of formins are tailored to support diverse and specific cellular functions. The results in this study nicely complement and expand upon similar recent work that dissected the polymerization requirements of the formin Cdc12, which mediates cytokinetic ring assembly in S. pombe, and For2, which drives the assembly of apical networks that are necessary for polarized growth in Physcomitrella patens. As such, this work will likely be of significant interest to scientists who study mechanisms of actin dynamics regulation. The identification of R1054 as a residue that confers a novel regulatory activity to the FH2 domain of Fus1 will also likely be of great interest to biochemists and other scientists who study formins at the molecular level.

      My expertise is in the field of formins and actin polymerization.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, Billaut-Chaumartin and colleagues investigate the molecular specialization of the S. pombe formin, Fus1. The authors systematically modulate the actin filament elongation and nucleation activities of Fus1 by expressing chimeric constructs that contain Formin Homology 1 and 2 domains from two other formins with known polymerization activities. By characterizing the architecture of the fusion focus and the efficiency of cell fusion, they find that both the elongation and nucleation properties of Fus1 are specifically tailored for its cellular role. Comparison of formin constructs with similar elongation and nucleation activities also reveals that the Fus1 FH2 domain possesses a specific property that promotes efficient cell fusion. Using sequence alignment and homology modeling, the authors identify R1054 as the residue that confers this novel, fusion-specific activity to Fus1, despite producing no effect on its bundling or polymerization properties in vitro.

      Overall, this study is well motivated, and the results support the conclusions that are drawn. I have only minor suggestions, as described below.

      Minor comments:

      1. The schematic diagrams of the chimeric formin constructs are very helpful. However, it is difficult to distinguish the colors from one another, especially in the case of the Cdc12FH1-Fus1FH2 variant, which requires discernment of the relatively small purple region within the dark blue molecule. Would it be possible to modify the colors to increase their contrast? Similarly, the blue and gray data sets in Figure 3B are very difficult to discern.
      2. The affinities (Kd) with which the formins bind the barbed ends as described in the second-to-last paragraph on page 8, in Figure Legend 7G, and in the "Analysis of pyrene data" section of the Materials and Methods should be defined as dissociation "constants", rather than dissociation "rates". Also, these affinities are lacking units in the following sentence on page 8.
      3. When comparing the TIRF micrographs in Figure S3A, it looks as though both formins (but especially the R1054E variant) nucleate more filaments in the presence of profilin than in its absence. Is this a reproducible effect? If so, can the authors provide an explanation for this?

      Significance

      This study contributes a key advancement towards understanding how the polymerization activities of formins are tailored to support diverse and specific cellular functions. The results in this study nicely complement and expand upon similar recent work that dissected the polymerization requirements of the formin Cdc12, which mediates cytokinetic ring assembly in S. pombe, and For2, which drives the assembly of apical networks that are necessary for polarized growth in Physcomitrella patens. As such, this work will likely be of significant interest to scientists who study mechanisms of actin dynamics regulation. The identification of R1054 as a residue that confers a novel regulatory activity to the FH2 domain of Fus1 will also likely be of great interest to biochemists and other scientists who study formins at the molecular level.

      My expertise is in the field of formins and actin polymerization.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Excellent quality of cell biology and biochemistry. the additional supports are needed for the claim of actin elongation using different formin variants.

      Significance

      Ingrid Billault-Chaumartin and co-authors described interesting research that provides insights on formin-isoform specific function in fission yeast and a new role of Fus1 FH2 domain in cell-cell fusion event. While three formin isoforms have different localization, research proposed an additional dissection in their functional differences by having different functions in C-terminus, including FH1 FH2 and formin C-terminus. The work also described additional factors that regulate cell fusions from autotrophy effect and formin expression level, in addition to the well-accepted formin biochemical activities. Here are my comments regarding the strengths of the work and improvements that could further strengthen the story.

      Major comments

      1. Fig.1 shows Cdc12C could recapitulate Fus1 function by ~80% if fused with Fus1C, whereas deletion of the C-terminal tail of Cdc12 following FH2 introduces drastic dysfunction. Together with Fig. 3, these results indicate Cdc12 Cter plays more important roles than Fus1 Cter for there respective functions. Such results suggested a Cter-mediated mechanism that differentiates the functions of three fission yeast formin isoforms. The authors examined contributions from the difference in FH1 (Figs 4,5) and FH2 residues (Fig. 6). Whereas the obvious phenotype of Cter was not further investigated and not much discussed. The Cter of budding yeast formins interacts with nucleation-promoting factors, Bud6 and Aip5. Although S. Pombe does not have orthologs of budding yeast Bud6 and Aip5, I wonder would the author discuss the potential contribution of Cter in differentiating S. Pombe formins.
      2. Here, the study focuses on the FH1 between Fus1 and Cdc12 to understand their different functions in actin polymerization. FH1 mediated actin elongation through its interaction with profilin via polyP. The transfer rate of G-actin from profilin and profilin sliding depends on the polyP patterns regarding the length of each polyp motif and their distance to FH2 (Naomi Courtemanche and Thomas D. Pollard, JBC, 2012). To better understand the mechanisms by which these engineered FH1 variants on both Fus1 and Cdc12 in Fig. 4, the author may want to list the sequence of these engineered FH1 domains, including the information of the number and length of polyp motifs, and discuss these patterns.
      3. Figs.4,5 cell biology results do not directly support the point of specific elongation rate unless the LifeAct-labeled actin cable elongation speed could be followed and quantified. The fluorescent tagging of tropomyosin does not show the actin cable pattern, which makes it very difficult to be used to study actin cable dynamics, such as elongation. Therefore, I feel the data in current Fig. 4 and Fig. 5 could not claim the differences in actin elongation without a quantitative comparison of elongation rate. I suggest a CK666 treatment to increase the visibility of the actin cable pattern of LifeAct, used before in both fission and budding yeasts, which would allow the author to quantify the actin cable elongation rate. Another way is to use the TIRF assay used in this study, which would give a better quantitation of formin nucleation and profilin-aided elongation.
      4. I appreciated the detailed biochemical dissections of multiple aspects of WTFus1 and Fus1R1054E, although the biochemical assays could not identify the mechanism by which R1054E causes the cell fusion. In many cases, the formin functions are diverse in diverse biological processes and sophisticated that cannot be explained well only from its biochemical activities in actin polymerization, such as the bundling, nucleation, and elongation studied in this story regarding fusion. This exciting information allows us to think of more possibilities that might regulate formin function rather than a direct change of formin activities in actin polymerization. I think a discussion of different aspects of functional regulation of formin might inspire society to investigate new possibilities to solve the mysteries. For example, the changes in formin behaviors and functions could be regulated by stress-induced formin turnover by degradation, cell signaling-regulated formin clustering and complex assembly, and their potential relevance to recruit protein constituents for fusion progression.

      Minor comments.

      1. There are two types of "C", one includes FH1/FH2 and one following FH2, used in the manuscript, and it is a bit confusing. Better to differentiate them that allows an easy following. Fig. 1 uses Cdc12C-deltaC, Fig. 3 uses Fus1-delta Cter.
      2. It's better to specify the amino acid position on the schematic of formins, such as panel A in many figures. It's always more informative to compare formin activities by considering the domain lengths, especially for the C-terminal tail that is variable in lengths and sequences. With similar thoughts, I suggest a supplementary figure that lists the sequence of all FH1 domains variants and Cter domains, such as the FH2 domain in Fig. S1.
      3. "n" for the statistic needs to be provided for Fig. S3.
      4. The SDS-PAGE staining gel of the purified recombinant proteins for biochemical assays should be provided, particularly for these newly reported mutant variants.
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      From the start, the authors would like to thank all the reviewers for their careful and constructive consideration of our manuscript. We have now made several changes to the paper and believe it to be better for the feedback.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this study, Rees et al. perform an RNA-seq circadian time course experiment in the recently formed allopolyploid wheat. Through comparisons with other circadian transcriptomic datasets in other species it appears that the period of rhythmic genes is much more variable in wheat with a shift to longer periods compared to the other species examined. Interestingly, by analyzing circadian parameters among expressed genes, they find evidence that this newly formed allopolyploid already shows signs of divergence in circadian traits among homoeologs. A thorough comparison with circadian regulated genes in Arabidopsis reveals overlap in phasing of genes involved in certain biological processes such as photosynthesis and light signaling whereas genes involved in starch metabolism were found to have different levels of rhythmicity and phasing. This dataset will be a great resource for the community and enable new predictions about the influence of polyploidy on the circadian control of important crop improvement traits and the circadian regulation of gene expression.

      Major Comments

      1. The results section starts with very little explanation of the experiment. It would help to provide a little more detail at the start of the results to explain the context for the experiment and what was done, when samples were collected and for how long. For the methods section, it isn't until line 650 that it is clearly stated that the sampling started at ZT0. It would be better to put this in the plant materials and growth condition section.

      Thank you for highlighting the need for this context, we agree that the manuscript is improved by an introduction to the experiments. We have now included an “Experimental context” section in the results and have taken the opportunity to explain how the full 0-68h and 24-68h datasets are used within our analysis. Ln 74-82. We have also edited the Methods as suggested Ln 610-615.

      The low proportion of circadian regulated genes is likely due to the very low cutoff for calling a gene expressed, especially when there are three days of repeated timepoints. If a gene is expressed across the time course it should have values above TPM 0 for at least 3 time points in order for it to be expressed each day. I'd also be suspicious of a gene with a TPM value less than 0.5. Comparing these types of numbers is always challenging due to the various cutoffs used. Along those lines, why was a different filtering scheme used for Arabidopsis (line 657)?

      We completely agree that the proportion of genes described as rhythmic changes a great deal with the threshold at which you exclude low expression transcripts as well as the window over which measurements are taken and the q-value cut-off for rhythmicity. We performed an analysis to test the effects of applying a pre-filtering step to exclude low-expression genes and discuss our findings in Supplementary Note 1. Briefly, we removed genes with expression less than 0.1 TPM in six or more timepoints and again ran Metacycle to define numbers of rhythmic genes. Our results are discussed in Supplementary Note 1 and are presented in Supplementary Table 1. Regardless of the cut-offs applied, Arabidopsis and wheat data was treated identically, and our findings reported in the main results were consistent with those reported in the Supplementary analysis. Thank you for raising this point, as we have now improved our description of this analysis in the main text (Ln 92-95).

      Regarding the different filtering schemes, the filtering mentioned by Reviewer 1 was applied to both Arabidopsis and wheat data for a stricter retention of rhythmic genes, as part of the pre-WGCNA clustering analysis. Filtering to retain genes with >0.5TPM across 3 timepoints was applied to reduce lowly expressed genes, that act as background 'noise' when defining clusters. We applied this across 3 timepoints rather than the WGCNA suggestion of 90% of samples - because the patterns of expression in our rhythmically filtered datasets were cyclical in nature.

      In reference to the shortening of the period every day, this should be interpreted with caution. Period estimate of a single cycle are not very reliable and the SD for each day is around 3h so it is difficult to draw any conclusions about changes in period each day. One option would be to only include genes with an SD less than 1h or alternatively to remove the discussion surrounding the comparison of period across the three days and focus on the period results for the full 24h-68h window shown in 1b. While 2 days is better it is still not ideal for calling period; however, your first day will still have a strong diurnal driven pattern that will likely skew your circadian period.

      Thank you for your comments. Our question here was to determine whether the mean period lengths of rhythmic transcripts in wheat were always immediately longer upon transfer to constant light, or whether they got progressively longer over time. Upon reading the reviewer’s comment, we realize that the explanation provided of how we conducted this analysis was misleading. Our approach was to take a 44h sliding window (almost 2 days) and measure period at 0-44h, 12-56h and 24-68h. We have now added the previously missing statistics that support our findings in the main text, and which hopefully show the significance of the period changes over time (supplementary note 2). One of the most surprising findings from this analysis was that the periods in the first window were the longest 28.61h (SD=3.421), suggesting that the diel (driven) oscillation had little impact upon immediate transfer to free run. Our interpretation is that the mean period initially lengthens trying to follow the missing dusk signal, before the free-running endogenous period asserts itself in later cycles (Ln 129-128).

      Line 87-93: If the dusk cue is important for clock expression you would think this would be biased towards genes that peak later in the day or near dusk. This argument should be connected better to the period results discussed on lines 98-101.

      Following on from our statement above, we have now combined our hypothesis for why wheat transcripts expressed at dusk have longer periods with the discussion about longer periods upon transfer to constant light. We agree that the two processes are likely to be connected and have now placed them together in Ln 129-128.

      1. Lines 650-652 of the Methods mentions that one of the main interests was the response to transfer to L:L, but this isn't mentioned in the introduction and doesn't come up much in the Results section. Most of the expression comparisons are focused on the 24-68h window. It also isn't clearly explained why the first day in LL is still a diurnal cycle. This would be helpful for non-circadian readers who may wonder why the first day is not included in all the analyses.

      We believe this point is now also addressed by the addition of an Experimental Context section in the results (Ln 74-82), in response to the reviewer’s previous comment.

      1. The phase comparisons shown in Figure suppl 4 are confusing. Suppl. Note 3 states that the period from the 24-68h data window was used to establish the bins but then the phase is shown for 3 different windows for each column? When calculating the phase for each of those 3 windows which period was used as the denominator in the phase calculation? Was it the period that matches the window used to calculate phase? What does the plot look like if phase is called on the same window used to calculate period (24-68)? What method was used to call phase in Suppl. Fig 4? As shown in Suppl Fig. 3 the method can influence the phase distributions. The methods suggest that the phase was determined with Metacycle but then FFT and MESA were used to verify. What does this mean verify, were they adjusted if FFT/MESA didn't agree?

      We agree that this Figure was unnecessarily complicated. We have now simplified Supplementary Figure 4 so that only the phases from 24-68h are presented. We have also clarified the legend to explain why we used FFT-NLLS to improve accuracy of Metacycle predictions.

      It is difficult to interpret the value of the period and phase comparisons shown in Fig. 1b, c, e and f after the preceding section about how variable the period and phase is across days. It is also surprising that the full 3 days were used to calculate the circadian statistics considering the first day is still under diurnal control. Do the ratios remain the same if the statistics are performed only on the 24h-68h window? For consistency with the rest of the paper and avoid confusion it would be best to have all circadian parameters measured using the same time window (24h-68h).

      Thank you for your comments, we can see how our logic in using the different data windows was not clear enough. As mentioned above, we have now explained the use of the full and shortened data windows in Experimental context section (Ln 74-82). Fig 1c is a comparison between different circadian datasets and as such we have only compared periods across 24-68h window. Similarly, Fig 1b is a global analysis of periods in rhythmic genes in comparison with Arabidopsis and so is again measured from 24-68h. We have now clarified this in the Figure legend for 1b.

      For comparisons of homoeologs within wheat triads, our question was in identifying homoeologs which behaved differently when placed under free-running conditions. We therefore still feel justified in using the full 0-68h dataset to identify homoeolog periods and phases which indicate differential circadian regulation, but we have now clarified that we are using the full dataset for the triad analysis in the results (Ln 140).

      Fig 1h-m. How were those genes chosen? It would help to see the SD of the replicates shown, since this is just showing one triad. It would be helpful to see a plot that represents the full set of triads rather than just one that looks best. If normalized to a standard phase they could be put on the same plot. For example, panel j is meant to show the 8h lag of subgenome D. If the data is normalized so that A and B are set to the same phase all the triads could be displayed with shaded SD bars to show the variation. Something like this would be a better representation of the data rather than showing just one example.

      Fig. 1h-m are case-studies illustrating the different forms of circadian imbalance between homoeologs. We agree that it is helpful to see the standard deviation as error bars on these triad plots and have added it as suggested. In line with another Reviewer 2’s suggestion we have removed Fig 1k and have replaced this with a comparison of mean normalised data for Triad 408 and Triad 2454, highlighting the difference between imbalanced rhythmicity and imbalanced amplitudes between homoeologs. Fig 1 I and m do not have error bars as adding standard deviations to mean normalised data wasn’t appropriate.

      Thank you for your suggestion on how to display the different phases between homoeologs. We feel that if we were to plot all of the triads displaying imbalanced phases, the differences in period length and accompanying noise differences would make the plot so busy as to be unreadable. We hope that the pie charts Fig 1 d-g give a global overview of the proportions of triads with circadian imbalance, but agree with the point that it is useful to allow readers to view triads of their own preference. Therefore, we have now provided the replicate level TPM data with the triad IDs annotated (Supplementary File 12) and Supplementary file 11 provides the classification of each triad alongside Metacycle statistics, ortholog identification and cluster information discussed elsewhere in the paper. Readers can now look up a triad or gene of interest and see how it was classified and what the expression looks like over the full dataset.

      It is surprising that there aren't more comparisons with the B. rapa dataset, especially when discussing the clock genes that show balanced or imbalanced expression. Are they similar in B. rapa and does it support your hypothesis that unbalance for certain genes are selected against?

      While we agree that a thorough, multiple species, comparative transcriptomic analysis is undoubtably of interest for the future, we feel it is beyond the scope of the questions being addressed in this paper. We do compare paralogs defined as “similar” in the Greenham dataset with homoeologs described as “balanced” in our dataset and find that genes involved with “photosynthesis” and “generation of precursor metabolites and energy” tend to be common between the two groups, potentially suggesting conservation of balance for certain types of genes (Ln 206-217).

      Figure 2 networks. Why were these specific modules selected? Is it actually appropriate to directly compare these modules? I do see that some of the comparisons have high correlations from panel a, but not all. For example, in panel b the W9 and A9 modules have a correlation value of 0.92, which seems appropriate. However, panel c (modules W3 and A2) have a correlation of 0.42, which seems far too low to make any sort of comparison meaningful.

      The modules were selected to simplify the comparison of genes expressed in the dawn, midday, dusk, and night. We were interested in identifying common GO-enrichment in genes peaking throughout the day, although as you have identified, the differences in period length between Arabidopsis and wheat made this difficult. Our reasons for comparing module W3 with module A2, were that, even though their eigengenes are not highly correlated per se, when period length is taken into account, both modules peak during the subjective day (CT 6.34h and 6.19h) and they share commonly enriched GO terms which make sense for day peaking genes.

      Further, as described in methods comments, using a cutHeight as low as 0.15 will likely lead to some number of genes in any given module that do not necessarily "share" a similar expression pattern. These genes could have a pattern that has very low correlation to their module eigengene and were only placed in that module because the pattern was "less similar" to other module eigengenes. The current expression plots in this figure follow a clear pattern, but I suspect this would be even more apparent if the genes within these modules had a higher correlation to the module eigengene. Perhaps the current genes in these modules could just be filtered to have a higher correlation score?

      Thank you for your comments, we have now made changes to the Results and Methods to clarify our approach (Ln 237-239 and Ln738-765). Merging modules with highly correlated module eigengenes (ME) is the final step in constructing our co-expression networks. To do this, as the reviewer describes - we used the WGCNA default parameter of a mergeCutHeight() of 0.15. This results in the merging of modules with highly correlated ME as the 0.15 mergeCutHeight() refers to the dissimilarity metric of 1 minus the eigengene correlation. So for WGCNA, a mergeCutHeight() of 0.15 corresponded to a correlation of 0.85. For the wheat modules, we took the additional step of merging closely related modules (mergeCloseModules()) using a cutHeight of 0.25, again a dissimilarity metric of 1 minus the eigengene correlation (corresponding to a correlation of 0.75). Reducing the stringency of the cutHeight to merge highly correlated wheat modules enabled us to more easily compare significantly correlated wheat and Arabidopsis co-expression modules to identify groups of genes in wheat and Arabidopsis expressed at similar times in the day, and enable the comparison of whether similar phased transcripts in wheat and Arabidopsis had similar biological roles.

      Lines 327-334: I am not following the connection between 'response to abiotic stimulus' and the photoreceptor and light signaling proteins. At the start of this section (line 308) the authors say that the GO analysis was only done on rhythmically expressed genes but the reference to only one PHYA being rhythmic and yet multiple genes are shown in the plot in fig. S16. Does this mean that all the genes were shown and not just the rhythmic ones? This would explain why many of the PHY and CRY genes don't seem to have rhythms. This should be clarified better in the text or indicated in the plot which ones were called rhythmic. Since the first day following transfer is still the diel pattern from the entrainment condition, what does the PHY and CRY expression look like? Does it appear rhythmic under diel but lose rhythmicity in LL? It should be noted in the text that arrhythmicity in circadian conditions doesn't mean there isn't rhythmicity under diel conditions. This could be an additional explanation apart from the current one in the text that the regulation is at the level of protein stability/localization. Overall, this entire section is very long and entirely based on data shown in the supplemental material. I do appreciate having the individual gene plots that supplement Figure 4 and would suggest either providing a main figure to highlight a small subset of genes or pathways in this section or shorten it and focus on the results shown in the main figures.

      Upon reading the reviewer’s comment, we realize that we should have made our motivations and processes clearer within this section. We used the data filtered for rhythmicity to conduct the GO-enrichment analysis and then used that to identify processes which should be of interest for further investigation. We have now added an additional sentence (Ln 352-354) to explain this more clearly. We then considered the orthologs of well-known Arabidopsis gene networks and extracted their expression from our circadian dataset, whether rhythmic or not. Supplementary Table 10 contains all of the genes we investigated, their expression and their MetaCycle statistics. We have also indicated here which genes are plotted in which Supplementary Figure 18-20. The reasons for plotting non-rhythmic genes in some cases was that it illustrates the differences between circadian control in Arabidopsis versus wheat (as is the case for the PHY and CRY genes). We understand that it is useful to see at a glance which genes are classified as rhythmic or arrhythmic, so have now highlighted each row in Supplementary Table 10 to make this more intuitive, and added a read me tab.

      Regarding your point about oscillation under diel cycles, we agree that some transcripts will show rhythmic behaviour under entraining environments but not under constant conditions, and may perform time-of-day specific functions. However, these transcripts are likely to not be regulated by the circadian clock (at the transcriptional level) and so are not discussed in the context of a circadian transcriptome.

      For your interest, here is the full expression of PHY and CRY transcripts starting at ZT0:

      [Image]

      It is difficult to say for definite, but it seems likely that some of these photoreceptors will have rhythmic patterns of expression under diel cycles, but these rhythms do not endogenously persist under constant conditions.

      We appreciate your feedback that this section would benefit from cutting down of text and addition of a Figure to illustrate the text. We have now cut some of this section down and created a new main figure based on some of the oscillation plots from Supplementary Figure 18 and 19. We chose examples that reflect a conservation of relationships between transcripts of different peak phases, as we find it interesting that both species have similar patterns. (Main Figure 4, Ln 361--363, 382).

      1. Primary metabolism section: in terms of the supplemental figure, similar to the previous one I think it would declutter the plots if the genes that are not rhythmic were left out and simply indicate below the plot that they didn't meet the rhythmicity cutoff. This is another area where there is more discussion surrounding the supplemental figures than the main figure 4.

      One of the overall findings of this section was that many of the genes involved in Starch and T6P metabolism which are rhythmically expressed in Arabidopsis are not rhythmically expressed in wheat. We feel removing these genes from the results would detract from the importance of this finding. We have now edited Supplementary Table 10 to highlight which genes are classified as rhythmic. We have also added in a sentence to the start of this section which lays out our motivations for this analysis, summarises our findings and better connects the text with an explanation of Fig. 5 (Ln 408-430).

      For all gene expression figures there should be SD or SE shown either as bars or ribbons to represent the variation in replicates.

      Although we agree that error bars are informative for showing variation between replicates (and have added them to Fig. 1 to show differences within wheat triads) we feel that adding error bars to the gene expression plots in Fig. 3, Fig 4 and Supplementary Fig 19-20 would make these plots difficult to read, particularly where the wheat homeologs are very similar. The purpose of these gene expression plots is to compare circadian profiles in Arabidopsis and wheat orthologs rather than to claim significant differences in expression at any particular timepoint. This is fairly common in other circadian biology studies:

      https://www.pnas.org/doi/10.1073/pnas.1408886111 ,

      https://www.jbc.org/article/S0021-9258(17)49454-3/fulltext#seccestitle20 , https://journals.plos.org/plosone/article/comments?id=10.1371/journal.pone.0169923 , https://www.science.org/doi/10.1126/science.290.5499.2110?url_ver=Z39.88-2003&rfr_id=ori:rid:crossref.org&rfr_dat=cr_pub%20%200pubmed,

      https://www.frontiersin.org/articles/10.3389/fgene.2021.664334/full,

      https://www.science.org/doi/full/10.1126/science.1161403

      The replication level information for each gene has now been made available in Supplementary file 12.

      1. It would be very helpful to include the code used to generate the networks and perform the cross-correlation of eigengenes across networks should be included in the Methods. This will also save you from responding to email requests!

      Thank you for your comment, Code for the cross-correlation analysis, Loom plots and WGCNA network construction is now available from our groups GitHub repository: https://github.com/AHallLab/circadian_transcriptome_regulation_paper_2022/tree/main

      Minor Comments

      1. Figure 1, panel d: - The "unbalanced" triads that are depicted by the lighter shading; do these in fact have a different cutoff than the original rhythmic homoeologs? In the figure it says qThank you for bringing this to our attention, this has now been corrected.

      Hard to directly compare the GO term overlap in Figure 2f. Might be better to only show the results for the 4 pairs shown in b-e and put them side by side in the bubble plot.

      Thank you for this feedback, We have tried to make this plot easier to understand without losing any of the available information. Hopefully it is now more intuitive to understand which columns are being compared. We have changed the coloured lines to make them slightly wider, put the modules in corresponding coloured boxes and highlighted GO-slim terms shared by modules being compared.

      1. Line 314 -316 don't see supp tables 10, 11

      Our apologies, these files were missed previously from the upload are now available.

      1. For the selection of B. rapa circadian paralogs with similar and differential expression patterns (starting line 714), the authors choose a hard cut off of 0.001 (differentially patterned) OR 0.1 (similarly patterned). What happens to the genes that are between these two cut offs or is this a typo. Since all the other cutoffs for rhythmicity was set at 0.01 it seems likely that this is a typo.

      We have now clarified this in the methods, (Ln 807-822). This is not a typo, but it is a different method to the Metacycle approach we have used for our wheat data. We defined similar/different paralogs as characterized in Greenham et al, (2020) using DiPALM p-values. We chose these DiPALM p-value cut-offs as they gave us approximately equal numbers of paralogs in each category, which represent tails of similarly expressed or differently expressed circadian genes. We checked these cut-offs by calculating average Pearson’s correlation statistics between paralogs and found that differential Brassica paralogs had a mean Pearson correlation coefficient of 0.31 (SD = 0.43) and similar Brassica paralogs had a mean Pearson correlation of 0.75 (SD= 0.23) which confirms that the DiPALM method of defining expression patterns makes sense in the context of this analysis.

      Line 681. Should be supplemental Figure 6 not 9.

      1. References to most supplemental figures are not the correct number.

      2. Labels above the plots in Supp Fig5 do not match the legend.

      We apologise for these mistakes. We realize that we had mistakenly submitted an earlier draft of the Supplementary materials file, which was missing Supplementary Figure 5, 6 and 9 which therefore shifted the order of the remaining figures. This is now updated.

      1. Suppl table 7 should be as a separate .csv file or similar to be able to see the full table.

      This is a good suggestion, and we have added this.

      1. Line 723 should be B. rapa not B. napus.

      Thank you for catching this! Corrected.

      1. Figure 4. There is no explanation for what the black boxes represent in the figure legend.

      Thank you for your comment. Figure 4 (new Figure 5) has now been updated.

      Reviewer #1 (Significance (Required)):

      This study provides new insight into the circadian regulation of the transcriptome in a new allopolyploid. It adds a valuable resource to a growing collection of circadian studies in important crops and will greatly improve our efforts to learn more about the circadian control of important crop improvement traits. The dataset will be of interest to other plant circadian biologists as well as the general plant biology community who focus on monocot crops. My expertise is more on the transcriptomic side and I do not have the expertise to evaluate the phylogenetic work presented in this study.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary Rees et al. present an RNAseq time course of bread wheat. Its recent polyploidisation is one motivation for this study as gene expression dosage is known to be important for clock function in other plants. The time course covers 3 days at sampling intervals of 4h of 2-week old wheat plants (all aerial tissues), in triplicates. The subsequent analysis of the RNAseq data includes analysis of the generated data by itself (e.g. GO analysis, rhythmicity, period and phase analysis, rhythmicity of transcription factor families as well as TF binding sites) as well as thorough comparison with published datasets of other species (Arabidopsis, Brassica rapa, Brachypodium dystachion). One of the key findings is that the mean period length and the period spread are larger in wheat than in these other species). Circadian clock genes largely have similar dynamics in wheat compared to Arabidopsis. In addition, one focus is the analysis of the dynamics of three genes of one triad and imbalance / balance of such triads. To the surprise of the authors, circadian regulated and clock genes were not necessarily balanced. Silencing is one of their explanation for imbalance of circadian genes as arrhythmic genes of one triad are typically those with the lowest expression level. Finally, the authors point out more examples of rhythmic processes and genes (photoreceptors and signalling, auxin, carbon metabolism) and their commonalities and differences with Arabidopsis.

      Major comments - The key conclusions and the data are convincing

      We thank the reviewer for their supportive comments.

      • line 120 and figure 1: In my opinion, q > 0.05 is not a good definition of arrhythmicity as non-significant q-values can result from either noise in spite of rhythmicity or from arrhythmicity. A more statistically sound way to detect arrhythmicity could for example be two-one-side tests (for example in the R package 'equivalence', e.g. see usage for time courses by Noordally et al. 2018, https://www.biorxiv.org/content/10.1101/287862v1).

      Thank you for pointing us in the direction of this package, we agree that choosing methods for circadian quantification and q-value cut-offs is always tricky and different approaches will perform better for noisier or non-sinusoidal waveforms. For future work, we will investigate the application of the suggested method in circadian rhythmicity analysis. However, we believe that the criteria used in this paper for rhythmicity quantification is suitable for addressing our questions, and overall, we are satisfied that rhythms with a q-value of >0.05 would also be classified by eye as being arrhythmic, and rhythms with a q-value Many other studies have used meta2d B.H q-values as a metric of rhythmicity: e.g. (https://bmcplantbiol.biomedcentral.com/articles/10.1186/s12870-022-03565-1 , https://link.springer.com/content/pdf/10.1186%2Fs12915-022-01258-7 , https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8782462/pdf/pcbi.1009762.pdf )

      • lines 480-484 and intro: In the introduction, the authors write that expression levels of clock components are important for the function of the clock, and that this is one motivation for the current study where polyploidisation is expected to affect the expression levels of clock genes and their outputs. I wonder what answers or speculations this study provides in the end, or whether such answers / speculations should be made clearer. For example, do the authors think that the higher variability of periods in wheat could be a consequence of lower robustness (in addition to possible spatial differences that are mentioned) due to polyploidisation? Is anything known about the period of rhythms of close wheat relatives that did not undergo polyploidisation? Did you look at dampening over the time course in wheat vs. Arabidopsis?

      The point above is an interesting one, and we thank the reviewer for raising it. We agree that the high variability of periods in wheat may be a product of polyploidisation, as functional redundancy between homoeologs may allow a tolerance for less tightly regulated, non-dominantly expressed circadian transcripts. We have now added this hypothesis to our discussion: Ln536-550.

      In our comparative analysis of period distributions, we looked at periods of transcripts from a diploid relative of hexaploid wheat, Brachypodium distachyon. In Brachypodium, period lengths have around the same SD as in Arabidopsis but the mean period length is slightly longer (Supplementary table 2). We have now edited our results to make the relationship between wheat and Brachypodium clearer (ln 109-110).

      Minor comments:

      Introduction - lines 49: it is unclear what is meant by ppd-1 at this position of the sentence

      We agree this was unclear and have revised it to “notably the ppd-1 locus within TaPRR3/7” Ln 52

      • line 54/55: clarify that this refers to Arabidopsis thaliana

      Corrected.

      Results - line 69 and 76: cite references for these tools here (not only in the methods section)

      Corrected.

      • line 90-93: Why wouldn't the same thing happen on subsequent subjective evenings?

      Thank you for your comments. We have now combined our hypothesis for why wheat transcripts expressed at dusk have longer periods with the discussion about longer periods upon transfer to constant light. We think that the two processes are likely to be connected and have now placed them together in Ln 126-131.

      The behaviour of mean period lengths of wheat transcripts upon transfer to constant light was unexpected and we believe is quite interesting. One explanation is that the influence of the ongoing light zeitgeber when dusk was expected causes a delay in the expression of evening peaking genes which are delayed by the continuous light signal. Then, on subsequent evenings the influence of the diel dusk signal is ‘forgotten’ as the governance of the endogenous clock takes over. The very long period observed at 0-24h (28.61h) may be due to a phase shift rather than an intrinsic lengthening of period per se. Whether this trait is unique to wheat or can also be seen in other plant species is, to our knowledge, unknown.

      • line 118: what is your defined cutoff for significance of the Chi square test (p=0.03 not regarded significant?)

      The reviewer is completely right, we have now clarified this. Ln 145-149

      • figure 1h,i: In order for the reader to see whether A and D (Figure 1h) or A (figure 1i) are indeed arrhythmic, one would need to see plots with a normalisation as done in figure 1m for 1l.

      We have now removed the triad showing one rhythmic gene and two arhythmic genes (as Fig. 1h already illustrates this type of circadian imbalance) and replaced this with a side by side comparison of how imbalance in rhythmicity differs from imbalance in relative amplitude as suggested.

      • figure 1h-m (and others with circadian time course traces): could a measure of variation (e.g. SD, SEM, confidence interval) be plotted as a shaded region around the curves (unless they're so small that they are there but not visible)?

      We have now added error bars to these plots to show standard deviation between replicates, in Fig. 1 h, j, k and l. We could not think of an accurate way to display this information for the mean normalised data (Fig 1. i and m) so have not put error bars on these plots.

      • line 139 (also in 737 and 450): give reference to Ramirez-Gonzalez et al in the same style as the rest of the manuscript (number)

      Thank you for raising this, we believe we have corrected all in-text citations (both narrative and fully parenthetical form) for consistency with the APA format used by the majority of Review Commons affiliate journals.

      • Clustering (modules): What is the reason for choosing 9 clusters? Was this number optimised or chosen for other reasons?

      WGCNA uses an unsupervised clustering algorithm that works within the supplied parameters to determine the optimum number of clusters to explain the dataset, without prior specification of the number of clusters. We have amended the manuscript text to clarify this Ln237-239.

      • lines 280 - 284: The TaELF3-1D phenotype could be explained a bit better to the non-wheat specialist, for example by mentioning in the beginning of this set of sentences.

      Done (Ln 314-318).

      • The authors present an analysis of TF binding sites. Can they say something about binding sites in a less sophisticated manner, such as on some very well-known motifs in promoters like the evening element?

      We agree that this is a very interesting question, and one that we may investigate in more detail with our data in the future. In this paper, we performed a global analysis of wheat TFBS predicted from orthologous Arabidopsis TF targets. These targets have been experimentally validated in Arabidopsis using DAP-seq, but we have not validated that these binding sites exist in wheat promoters. We therefore took a tentative approach, and presented only enrichments at the superfamily level rather than talking about specific regulatory motifs.

      The evening element would fit most likely fit within the MYB or MYB-related TFBS superfamily, however the diversity of transcription factors in this family means that there is significant enrichment of these TFBS in multiple modules throughout the day (Supplementary Figure 11). In summary, a more in depth TFBS analysis of known circadian motifs is of great interest, but we feel would be a substantial work in its own right.

      • Figure 1h-l: If known or meaningful, it would be interesting to know the gene identities behind the triads shown, as in supplementary figure 5.

      These triads were selected as case studies to exemplify the ways in which we were defining imbalanced circadian triads. They have no particular relevance to the figure, but out of curiosity, these are the closest Arabidopsis orthologs for the triads displayed in Fig. 1:

      Triad 408 has highest identity to a hypothetical protein (AT4G26415).

      Triad 2454 is similar to AT3G07600, a heavy metal transport/detoxification superfamily protein

      Triad 13405 is similar to AT3G22360, encoding an ALTERNATIVE OXIDASE 1B, AOX1B

      Triad 10854 is similar to NSE4A, a δ-kleisin component of the SMC5/6 complex, possibly involved in synaptonemal complex formation (AT1G51130).

      Information about wheat gene names in each triad and their Arabidopsis orthologs can be viewed in Supplementary Table 11, so that readers can search for genes of particular interest to them.

      • Figure 4 and text: The illustration of starch metabolism is very helpful. However, I think the paper would benefit from giving a better reason for the selection of this specific set of processes, for example by relating these findings to functional differences in starch metabolism in the two species (in contrast to Arabidopsis, wheat stores little starch in leaves but uses fructans as main reserve carbohydrate)? Are there known differences in the dynamics of starch degradation during the night?

      The reviewer raises an interesting point, and we have now clarified in our results that the stated differences between starch regulation in Arabidopsis and wheat was part of the motivation behind studying this pathway. Starch is at the centre of plant primary metabolism as a carbon storage source and is arguably one of the most important features that breeders look for in regard to grain filling and yields. Additionally, it is of interest to circadian biologists as starch (as well as sucrose) have been shown to transiently cycle and to be regulated by the circadian clock. However, in wheat, carbon storage primarily uses sucrose rather than starch, and we have now added sucrose to Figure 5 to place it in this context. We think your suggestion has now improved our explanation for why we focused on starch in the manuscript, and we are grateful for your input (Ln 408-421).

      We also agree that the differences in the ways that Arbaidopsis and wheat utilise starch versus sucrose, and perhaps the role that fructans have in as a reserve carbohydrate and in protection against freezing in wheat may be one of the reasons we are seeing differences in circadian regulation of starch. We have now added this to our discussion (Ln 584-592).

      • Figure 4: triose-phosphates can be transported in and out of the chloroplast, as is illustrated in the figure. However, the illustration looks as though they are converted to hexose phosphates during the transport process. In order to be consistent with other transport processes of the figure (maltose and glucose), triose-phosphate should be repeated on the cytosolic side.

      We have now amended this (new Fig. 5). Thank you for your feedback.

      Methods - line 543: if I understand correctly that triplicates were collected and analysed for each time point, '18 samples' is mis-leading (18 time points would be more accurate).

      We agree this was badly worded. Changed Ln 615.

      Supplementary - Supplementary figure 3: x axis label very small and contains typo

      Now corrected. Also enlarged axis for Supplementary Figure 2.

      • Supplementary table 1: Romanowski et al 2020 (add year), or use ref. number citation style as in the rest of the manuscript

      Thank you for raising this, we have now hopefully corrected all in text citations (both narrative and fully parenthetical form) to be consistent with APA format used by the majority of Review commons affiliate journals.

      • Supplementary table 9, primary metabolism: does bold highlighting of Arabidopsis accession numbers have a meaning or is it accidental?

      We apologise that this was unclear. We have corrected this. Supplementary Table 10 now also has a “Read me” tab which explains that table.

      Reviewer #2 (Significance (Required)):

      I believe this is a precious, carefully generated and analysed dataset which many biologists will benefit from, beyond wheat or circadian specialists. The dataset expands the knowledge of circadian transcriptome regulation to an important crop and contributes a resource of which only a handful of others exist in other species. Many high impact papers on RNAseq include some follow-up on candidates, for example in Romanowski et al 2020, which is admittedly easier to do in Arabidopsis than wheat due to the availability of genetic resources.

      My expertise: Plant circadian clock (Arabidopsis), dataset analysis (but not specifically for RNAseq)

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This manuscript is based on the analysis of a single experiment consisting in transcriptomic profiling of one (hexaploid) wheat genotype along 3 days (samples taken every 4 hours). The experiment is performed in constant light conditions, allowing detection of transcripts controlled by the circadian clock. The bioinformatic analysis studies the dynamics of the different homoeologous transcript in the polyploid genome and compares cycling transcripts in wheat with what is known from Arabidopsis.

      The manuscript is well written, the methods are correct, the analysis performed is sufficiently extensive and the figures are clear. The manuscript finds interesting expression patterns among homeologous genes, and goes into detail on important differences in circadian regulation of relevant gene families between Arabidopsis and wheat. The work is purely descriptive and does not aim at associations with physiological phenotypes, but the bioinformatic analysis is very thorough and uncovers interesting examples.

      Only one caveat: For what I gather, there is no replication in the RNA-seq experiment, although the exact method does not appear in the text. From the Methods section: "tissue was sampled every 4h for 3 days (18 samples in total)" and "At each timepoint, we sampled the entire aerial tissue from 3 replicate plants". Whether these samples were pooled or not is not described. The "Data Availability" section links to 18 RNA-seq paired end libraries, which suggest that the replicates were pooled, although some type of barcoding might have been used. The text should mention if the replicates were pooled or not, and, if so, what was the method used for poling (tissue, RNA or libraries). Even in the case of no biological replication the manuscript brings interesting insights into wheat transcriptomics and circadian biology. The editor (or the rules of the journal) should decide if they accept articles with no "real" biological replication (I am sure we all understand by now the benefits and limitations of pooling biological replicates into a single RNA-seq library).

      There was replication within the RNA sequencing experiment, and we apologise that this was unclear from our manuscript. Each timepoint consisted of three independent biological replicates. We have now created a new “Experimental context” section in the results to explain this (Ln 74-82) and have clarified in the methods how our data was processed (Ln 609-615 and 636-638).

      We have now included an additional matrix with TPMs at the replicate level to assist readers in looking at specific genes of interest (Supplementary Table 12).

      Minor comments:

      The description of the experimental setup in the first sentence of the Results section is too brief. Could you please talk about for how long the experiment was running? At what intervals the samples were taken? What conditions were used?

      We apologise that this was unclear. We hope that the new Experimental Context section, added in response to comments from several reviewers, makes this much clearer, alongside the clarification in the methods (Ln 609-615 and 636-638).

      Line 280: "...due *to* an introgression..."

      Corrected. Ln 315

      The legend of Figure 3l says elf4 instead of elf3

      We thank the reviewer for noticing this mistake that we have now corrected.

      Line 306 "says Supplementary Note 7 instead of Supplementary Note 7

      We are not sure what is to be corrected here!

      Reviewer #3 (Significance (Required)):

      This works advances our knowledge on how genome wide expression levels are controlled by the circadian clock in polyploids. Although previous works had performed similar analyses in other polyploid plants, this is the first time this is done in an hexaploid. This work is a starting step to understand gene regulation in this important crop, and have interest for researchers working in fundamental and applied plant biology.

      Thank you for your positive comments and your feedback in improving this manuscript. We would like to clarify that to our knowledge, this work presents the first analysis of a circadian transcriptome in a polyploid crop. The work by Greenham et al, although undoubtably providing insight into circadian regulation of ancient paralogs, was performed in the diploid Brassica rapa.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript is based on the analysis of a single experiment consisting in transcriptomic profiling of one (hexaploid) wheat genotype along 3 days (samples taken every 4 hours). The experiment is performed in constant light conditions, allowing detection of transcripts controlled by the circadian clock. The bioinformatic analysis studies the dynamics of the different homoeologous transcript in the polyploid genome and compares cycling transcripts in wheat with what is known from Arabidopsis.

      The manuscript is well written, the methods are correct, the analysis performed is sufficiently extensive and the figures are clear. The manuscript finds interesting expression patterns among homeologous genes, and goes into detail on important differences in circadian regulation of relevant gene families between Arabidopsis and wheat. The work is purely descriptive and does not aim at associations with physiological phenotypes, but the bioinformatic analysis is very thorough and uncovers interesting examples.

      Only one caveat: For what I gather, there is no replication in the RNA-seq experiment, although the exact method does not appear in the text. From the Methods section: "tissue was sampled every 4h for 3 days (18 samples in total)" and "At each timepoint, we sampled the entire aerial tissue from 3 replicate plants". Whether these samples were pooled or not is not described. The "Data Availability" section links to 18 RNA-seq paired end libraries, which suggest that the replicates were pooled, although some type of barcoding might have been used. The text should mention if the replicates were pooled or not, and, if so, what was the method used for poling (tissue, RNA or libraries). Even in the case of no biological replication the manuscript brings interesting insights into wheat transcriptomics and circadian biology. The editor (or the rules of the journal) should decide if they accept articles with no "real" biological replication (I am sure we all understand by now the benefits and limitations of pooling biological replicates into a single RNA-seq library).

      Minor comments:

      The description of the experimental setup in the first sentence of the Results section is too brief. Could you please talk about for how long the experiment was running? At what intervals the samples were taken? What conditions were used?

      Line 280: "...due to an introgression..."

      The legend of Figure 3l says elf4 instead of elf3

      Line 306 "says Supplementary Note 7 instead of Supplementary Note 7

      Significance

      This works advances our knowledge on how genome wide expression levels are controlled by the circadian clock in polyploids. Although previous works had performed similar analyses in other polyploid plants, this is the first time this is done in an hexaploid. This work is a starting step to understand gene regulation in this important crop, and have interest for researchers working in fundamental and applied plant biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Rees et al. present an RNAseq time course of bread wheat. Its recent polyploidisation is one motivation for this study as gene expression dosage is known to be important for clock function in other plants. The time course covers 3 days at sampling intervals of 4h of 2-week old wheat plants (all aerial tissues), in triplicates. The subsequent analysis of the RNAseq data includes analysis of the generated data by itself (e.g. GO analysis, rhythmicity, period and phase analysis, rhythmicity of transcription factor families as well as TF binding sites) as well as thorough comparison with published datasets of other species (Arabidopsis, Brassica rapa, Brachypodium dystachion). One of the key findings is that the mean period length and the period spread are larger in wheat than in these other species). Circadian clock genes largely have similar dynamics in wheat compared to Arabidopsis. In addition, one focus is the analysis of the dynamics of three genes of one triad and imbalance / balance of such triads. To the surprise of the authors, circadian regulated and clock genes were not necessarily balanced. Silencing is one of their explanation for imbalance of circadian genes as arrhythmic genes of one triad are typically those with the lowest expression level. Finally, the authors point out more examples of rhythmic processes and genes (photoreceptors and signalling, auxin, carbon metabolism) and their commonalities and differences with Arabidopsis.

      Major comments

      • The key conclusions and the data are convincing
      • line 120 and figure 1: In my opinion, q > 0.05 is not a good definition of arrhythmicity as non-significant q-values can result from either noise in spite of rhythmicity or from arrhythmicity. A more statistically sound way to detect arrhythmicity could for example be two-one-side tests (for example in the R package 'equivalence', e.g. see usage for time courses by Noordally et al. 2018, https://www.biorxiv.org/content/10.1101/287862v1).
      • lines 480-484 and intro: In the introduction, the authors write that expression levels of clock components are important for the function of the clock, and that this is one motivation for the current study where polyploidisation is expected to affect the expression levels of clock genes and their outputs. I wonder what answers or speculations this study provides in the end, or whether such answers / speculations should be made clearer. For example, do the authors think that the higher variability of periods in wheat could be a consequence of lower robustness (in addition to possible spatial differences that are mentioned) due to polyploidisation? Is anything known about the period of rhythms of close wheat relatives that did not undergo polyploidisation? Did you look at dampening over the time course in wheat vs. Arabidopsis?

      Minor comments:

      Introduction

      • lines 49: it is unclear what is meant by ppd-1 at this position of the sentence
      • line 54/55: clarify that this refers to Arabidopsis thaliana

      Results

      • line 69 and 76: cite references for these tools here (not only in the methods section)
      • line 90-93: Why wouldn't the same thing happen on subsequent subjective evenings?
      • line 118: what is your defined cutoff for significance of the Chi square test (p=0.03 not regarded significant?)
      • figure 1h,i: In order for the reader to see whether A and D (Figure 1h) or A (figure 1i) are indeed arrhythmic, one would need to see plots with a normalisation as done in figure 1m for 1l.
      • figure 1h-m (and others with circadian time course traces): could a measure of variation (e.g. SD, SEM, confidence interval) be plotted as a shaded region around the curves (unless they're so small that they are there but not visible)?
      • line 139 (also in 737 and 450): give reference to Ramirez-Gonzalez et al in the same style as the rest of the manuscript (number)
      • Clustering (modules): What is the reason for choosing 9 clusters? Was this number optimised or chosen for other reasons?
      • lines 280 - 284: The TaELF3-1D phenotype could be explained a bit better to the non-wheat specialist, for example by mentioning in the beginning of this set of sentences.
      • The authors present an analysis of TF binding sites. Can they say something about binding sites in a less sophisticated manner, such as on some very well-known motifs in promoters like the evening element?
      • Figure 1h-l: If known or meaningful, it would be interesting to know the gene identities behind the triads shown, as in supplementary figure 5.
      • Figure 4 and text: The illustration of starch metabolism is very helpful. However, I think the paper would benefit from giving a better reason for the selection of this specific set of processes, for example by relating these findings to functional differences in starch metabolism in the two species (in contrast to Arabidopsis, wheat stores little starch in leaves but uses fructans as main reserve carbohydrate)? Are there known differences in the dynamics of starch degradation during the night?
      • Figure 4: triose-phosphates can be transported in and out of the chloroplast, as is illustrated in the figure. However, the illustration looks as though they are converted to hexose phosphates during the transport process. In order to be consistent with other transport processes of the figure (maltose and glucose), triose-phosphate should be repeated on the cytosolic side.

      Methods

      • line 543: if I understand correctly that triplicates were collected and analysed for each time point, '18 samples' is mis-leading (18 time points would be more accurate)

      Supplementary

      • Supplementary figure 3: x axis label very small and contains typo
      • Supplementary table 1: Romanowski et al 2020 (add year), or use ref. number citation style as in the rest of the manuscript
      • Supplementary table 9, primary metabolism: does bold highlighting of Arabidopsis accession numbers have a meaning or is it accidental?

      Significance

      I believe this is a precious, carefully generated and analysed dataset which many biologists will benefit from, beyond wheat or circadian specialists. The dataset expands the knowledge of circadian transcriptome regulation to an important crop and contributes a resource of which only a handful of others exist in other species. Many high impact papers on RNAseq include some follow-up on candidates, for example in Romanowski et al 2020, which is admittedly easier to do in Arabidopsis than wheat due to the availability of genetic resources.

      My expertise: Plant circadian clock (Arabidopsis), dataset analysis (but not specifically for RNAseq)

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, Rees et al. perform an RNA-seq circadian time course experiment in the recently formed allopolyploid wheat. Through comparisons with other circadian transcriptomic datasets in other species it appears that the period of rhythmic genes is much more variable in wheat with a shift to longer periods compared to the other species examined. Interestingly, by analyzing circadian parameters among expressed genes, they find evidence that this newly formed allopolyploid already shows signs of divergence in circadian traits among homoeologs. A thorough comparison with circadian regulated genes in Arabidopsis reveals overlap in phasing of genes involved in certain biological processes such as photosynthesis and light signaling whereas genes involved in starch metabolism were found to have different levels of rhythmicity and phasing. This dataset will be a great resource for the community and enable new predictions about the influence of polyploidy on the circadian control of important crop improvement traits and the circadian regulation of gene expression.

      Major Comments

      1. The results section starts with very little explanation of the experiment. It would help to provide a little more detail at the start of the results to explain the context for the experiment and what was done, when samples were collected and for how long. For the methods section, it isn't until line 650 that it is clearly stated that the sampling started at ZT0. It would be better to put this in the plant materials and growth condition section.
      2. The low proportion of circadian regulated genes is likely due to the very low cutoff for calling a gene expressed, especially when there are three days of repeated timepoints. If a gene is expressed across the time course it should have values above TPM 0 for at least 3 time points in order for it to be expressed each day. I'd also be suspicious of a gene with a TPM value less than 0.5. Comparing these types of numbers is always challenging due to the various cutoffs used. Along those lines, why was a different filtering scheme used for Arabidopsis (line 657)?
      3. In reference to the shortening of the period every day, this should be interpreted with caution. Period estimate of a single cycle are not very reliable and the SD for each day is around 3h so it is difficult to draw any conclusions about changes in period each day. One option would be to only include genes with an SD less than 1h or alternatively to remove the discussion surrounding the comparison of period across the three days and focus on the period results for the full 24h-68h window shown in 1b. While 2 days is better it is still not ideal for calling period; however, your first day will still have a strong diurnal driven pattern that will likely skew your circadian period.
      4. Line 87-93: If the dusk cue is important for clock expression you would think this would be biased towards genes that peak later in the day or near dusk. This argument should be connected better to the period results discussed on lines 98-101.
      5. Lines 650-652 of the Methods mentions that one of the main interests was the response to transfer to L:L, but this isn't mentioned in the introduction and doesn't come up much in the Results section. Most of the expression comparisons are focused on the 24-68h window. It also isn't clearly explained why the first day in LL is still a diurnal cycle. This would be helpful for non-circadian readers who may wonder why the first day is not included in all the analyses.
      6. The phase comparisons shown in Figure suppl 4 are confusing. Suppl. Note 3 states that the period from the 24-68h data window was used to establish the bins but then the phase is shown for 3 different windows for each column? When calculating the phase for each of those 3 windows which period was used as the denominator in the phase calculation? Was it the period that matches the window used to calculate phase? What does the plot look like if phase is called on the same window used to calculate period (24-68)? What method was used to call phase in Suppl. Fig 4? As shown in Suppl Fig. 3 the method can influence the phase distributions. The methods suggest that the phase was determined with Metacycle but then FFT and MESA were used to verify. What does this mean verify, were they adjusted if FFT/MESA didn't agree?
      7. It is difficult to interpret the value of the period and phase comparisons shown in Fig. 1b, c, e and f after the preceding section about how variable the period and phase is across days. It is also surprising that the full 3 days were used to calculate the circadian statistics considering the first day is still under diurnal control. Do the ratios remain the same if the statistics are performed only on the 24h-68h window? For consistency with the rest of the paper and avoid confusion it would be best to have all circadian parameters measured using the same time window (24h-68h).
      8. Fig 1h-m. How were those genes chosen? It would help to see the SD of the replicates shown, since this is just showing one triad. It would be helpful to see a plot that represents the full set of triads rather than just one that looks best. If normalized to a standard phase they could be put on the same plot. For example, panel j is meant to show the 8h lag of subgenome D. If the data is normalized so that A and B are set to the same phase all the triads could be displayed with shaded SD bars to show the variation. Something like this would be a better representation of the data rather than showing just one example.
      9. It is surprising that there aren't more comparisons with the B. rapa dataset, especially when discussing the clock genes that show balanced or imbalanced expression. Are they similar in B. rapa and does it support your hypothesis that unbalance for certain genes are selected against?
      10. Figure 2 networks. Why were these specific modules selected? Is it actually appropriate to directly compare these modules? I do see that some of the comparisons have high correlations from panel a, but not all. For example, in panel b the W9 and A9 modules have a correlation value of 0.92, which seems appropriate. However, panel c (modules W3 and A2) have a correlation of 0.42, which seems far too low to make any sort of comparison meaningful. Further, as described in methods comments, using a cutHeight as low as 0.15 will likely lead to some number of genes in any given module that do not necessarily "share" a similar expression pattern. These genes could have a pattern that has very low correlation to their module eigengene and were only placed in that module because the pattern was "less similar" to other module eigengenes. The current expression plots in this figure follow a clear pattern, but I suspect this would be even more apparent if the genes within these modules had a higher correlation to the module eigengene. Perhaps the current genes in these modules could just be filtered to have a higher correlation score?
      11. Lines 327-334: I am not following the connection between 'response to abiotic stimulus' and the photoreceptor and light signaling proteins. At the start of this section (line 308) the authors say that the GO analysis was only done on rhythmically expressed genes but the reference to only one PHYA being rhythmic and yet multiple genes are shown in the plot in fig. S16. Does this mean that all the genes were shown and not just the rhythmic ones? This would explain why many of the PHY and CRY genes don't seem to have rhythms. This should be clarified better in the text or indicated in the plot which ones were called rhythmic. Since the first day following transfer is still the diel pattern from the entrainment condition, what does the PHY and CRY expression look like? Does it appear rhythmic under diel but lose rhythmicity in LL? It should be noted in the text that arrhythmicity in circadian conditions doesn't mean there isn't rhythmicity under diel conditions. This could be an additional explanation apart from the current one in the text that the regulation is at the level of protein stability/localization. Overall, this entire section is very long and entirely based on data shown in the supplemental material. I do appreciate having the individual gene plots that supplement figure 4 and would suggest either providing a main figure to highlight a small subset of genes or pathways in this section or shorten it and focus on the results shown in the main figures.
      12. Primary metabolism section: in terms of the supplemental figure, similar to the previous one I think it would declutter the plots if the genes that are not rhythmic were left out and simply indicate below the plot that they didn't meet the rhythmicity cutoff. This is another area where there is more discussion surrounding the supplemental figures than the main figure 4.
      13. For all gene expression figures there should be SD or SE shown either as bars or ribbons to represent the variation in replicates.
      14. It would be very helpful to include the code used to generate the networks and perform the cross-correlation of eigengenes across networks should be included in the Methods. This will also save you from responding to email requests!

      Minor Comments

      1. Figure 1, panel d: - The "unbalanced" triads that are depicted by the lighter shading; do these in fact have a different cutoff than the original rhythmic homoeologs? In the figure it says q<0.1 but I thought it was q<0.01.
      2. Hard to directly compare the GO term overlap in Figure 2f. Might be better to only show the results for the 4 pairs shown in b-e and put them side by side in the bubble plot.
      3. Line 314 -316 don't see supp tables 10, 11
      4. For the selection of B. rapa circadian paralogs with similar and differential expression patterns (starting line 714), the authors choose a hard cut off of 0.001 (differentially patterned) OR 0.1 (similarly patterned). What happens to the genes that are between these two cut offs or is this a typo. Since all the other cutoffs for rhythmicity was set at 0.01 it seems likely that this is a typo.
      5. Line 681. Should be supplemental Figure 6 not 9.
      6. References to most supplemental figures are not the correct number.
      7. Labels above the plots in Supp Fig5 do not match the legend.
      8. Suppl table 7 should be as a separate .csv file or similar to be able to see the full table.
      9. Line 723 should be B. rapa not B. napus.
      10. Figure 4. There is no explanation for what the black boxes represent in the figure legend.

      Significance

      This study provides new insight into the circadian regulation of the transcriptome in a new allopolyploid. It adds a valuable resource to a growing collection of circadian studies in important crops and will greatly improve our efforts to learn more about the circadian control of important crop improvement traits. The dataset will be of interest to other plant circadian biologists as well as the general plant biology community who focus on monocot crops. My expertise is more on the transcriptomic side and I do not have the expertise to evaluate the phylogenetic work presented in this study.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2021-01219

      Corresponding author(s): Rajan, Akhila

      1) General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      The goal of this study is to:

      • Define how prolonged exposure to a high-sugar diet (HSD) regime alters both the lipid landscape and feeding behavior.
      • Determine how changes in lipid classes within the adipose tissue regulates feeding behavior. Key findings:

      In this study, by taking an unbiased systems level and genetic approach, we reveal that phospholipid status of the fat tissue controls global satiety sensing.

      Impact of Key findings:

      By uncovering a critical role for adipose tissue phospholipid balance as a key regulator of organismal feeding, our work raises the possibility that the rate-limiting enzymes in phospholipid synthesis, including Pect, are potential targets for therapeutic interventions for obesity and feeding disorders.

      Peer review comments:

      This study has immensely benefited from the thoughtful peer-review of three reviewers. As per their recommendations, we have performed a major revision by performing additional experiments (see summary table below in next section) and strived to address the major concerns raised. Based on our reading, there were two major concerns that overlapped between all three reviewers raised. They are as follows:

      • Does the genetic disruption of Pect in fly fat body alter phospholipid levels? Two reviewers (#2 and #3) recommended that we perform lipidomic analyses on adult flies with adipose tissue specific knockdown of For the revised version, we have completed this lipidomic experiment, and present results as a new main Figure 6, Supplemental S7 and S9.
      • Is the dampened HSD induced hunger-driven feeding (HDF) behavior because of increased baseline feeding (#1 and #3)? In addition, reviewer #1, asked us whether HSD flies experience an energy-deficit? In other words, we were asked to uncouple whether what we observed was HSD-driven allostasis or indeed, as we had interpreted, that HSD dampened hunger-driven feeding response.

      Hence, they recommended that we:

      1. Re-analyze our hunger-driven feeding datasets and present non-normalized data (also requested by Reviewer #3) and show baseline feeding behavior on HSD. To address this, we have completed this analysis and present our results in Figure 1B-D and S1.
      2. Determine whether the HSD fed flies display an energy deficit on starvation. To this end, we performed an assayed starvation-induced fat mobilization on HSD, results for this are now presented on Figure 1E-G and S2. Conclusions after the revision:

      First, it is important to note here that the additional experiments have not caused a significant revision of the major conclusions of the original version of our study. In fact, we hope that the revised version provides clarity and further substantiation to our original arguments.

      • The lipidomics experiments on Pect fat-specific knock-down flies show that reducing Pect in fat-body causes a significant reduction in certain PE lipid species (PE 36.2 specifically- Figure 6B). This is consistent with a prior report on lipidomics of the Pect null allele by Tom Clandinin’s group (PMID: 30737130). Furthermore, we note that when Pect is knocked down in the fat body, there is a significant increase in two other classes of phospholipids LPC and LPE (Figure 6A). Together, this suggests that an imbalance in phospholipid composition in the absence of Pect activity in fat.
      • The starvation-induced fat mobilization experiments show that despite being fed a prolonged HSD, adult flies sense starvation and effectively mobilize fat stores, at a level comparable to Normal food (NF) fed adult flies, suggesting that even despite HSD exposure, adult flies experience an energy deficit on starvation.
      • In our non-normalized data, we find that the baseline feeding events are not significantly altered between HSD and NF-fed flies (Figure 1D). This suggests that the effects we observe are not due to an increase in the “denominator”, but a dampening of hunger-driven feeding on HSD. With regard to our original version, all three peer-reviewers found that the study was interesting, significant, important, and novel – Reviewer #1: “The work is potentially novel and interesting”; #2 : “I find the study to be potentially very important - the authors combine a longitudinal study that would be difficult in any other model with the powerful genetic tools available in the fly. The conclusions are mostly convincing”; #3: “This manuscript demonstrates how fat body Pect levels affect HSD induced changes in hunger-driven feeding response. I agree with all the reviewers points; potentially very interesting”. But had requested that we provide further substantiation and clarification.

      We sincerely hope that the peer-reviewers find that our revised version with additional new experimental datasets, improved data visualization, and the presentation of non-normalized raw data points, makes this study clear, compelling, and well-substantiated.

      • Point-by-point description of the revisions This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Below we summarize in Part A, the key experiments that were performed to address the major concerns. In Part B, we provide a point-point response to each reviewer with embedded datasets.

      Part a:

      We performed several new experiments, including:

      • To address the primary concern of Reviewer #1 regarding whether the HSD flies have a similar energy deficit to Normal food (NF) fed flies, we performed analysis of stored neutral fat Triacylglycerol (TAG) reserves and how HSD fed flies mobilized fat stores on starvation. We present these results in Figure 1E-G, S2. These results show that HSD-flies despite accumulating more TAG (S2), breakdown a similar amount of fat reserves as NF-fed flies on starvation at any time-point (Figure 1E-G). This suggests that HSD-fed flies do sense and respond to energy deficit.
      • To address concerns of reviewer #2 and #3 on whether Pect genetic manipulation affects specific phospholipid classes, we performed lipidomic analyses. The table below summarizes the new 3 new figures and 4 supplemental figures (blue text are all new figure numbers and figure panels) and three new Supplementary files as per reviewer’s request.

      Figure #

      Main point

      New datasets in revision

      Companion Supplement

      1

      HSD alters feeding behavior, but flies still breakdown TAG on starvation.

      TAG storage and breakdown over longitudinal HSD shows that HSD and NF fed flies show similar levels of TAG breakdown on starvation, despite consistently elevated TAG on HSD. This supports the idea that flies do sense starvation even on HSD, but there is a uncoupling of the feeding behavior after Day 14. Revised the data representation of Figure 1 to show non-normalized data over time. S1 and S2 companions are new in the revision. Panels 1D to 1E are new for the revision.

      S1- Raw data of feeding events plotted.

      S2 Elevated TAG at all time points.

      2

      HSD causes insulin resistance

      S3A added to show that insulin transcript levels remain the same in response to reviewer #3’s concerns.

      S3

      3

      Phospholipid concentration raw data from lipidomic on Day 7 and Day 14 HSD suggest that PC, PE levels are increased on Day 14 HSD.

      Figure 3 revamped to show new data visualization and non-normalized raw data to address Reviewer #2’s major concerns. S4A and S4B added. In addition Supplementary File 1 and 2 provided with raw lipidomics data as per reviewer #2’s request.

      S4.

      S4A- non normalized raw data of all other lipid classes on HSD.

      S4B- fatty acid species data on Day 14 added as per request of rev.#2.

      4

      HSD regulate Apo-I levels in the IPCs and phenocopies Pect KD.

      Added Figure 4A to show that HSD phenocopies Pect-KD in terms of delivery to brain

      S5 showing the validation of the Apo-I antibody.

      S6 validation of Pect KD and over-expression and Pect mRNA levels dysregulation on HSD.

      5

      Pect RNAi is insulin resistant

      N/A

      N/A

      6

      Pect knockdown shows significant increase in LPC and LPE, and a non-significant reduction in PC, PE levels. Specifically, the PE lipid class PE36.2 is downregulated.

      Fig 6, S7, S9 are completely new based on reviewer #2 and #3 requests. In addition Supplementary File 3 provided with raw lipidomics data as per reviewer #2’s request

      S7, S8, S9#.

      S7- new Pect KD other classes

      S8- new PE classes for day 14 and Pect associated classes.

      S9- Pect OE lipidomics

      7

      Pisd and Pect activity in adipocytes are required for hunger-driven feeding behavior in normal diets

      Pisd RNAi data was moved from supplement to main figure.

      N/A

      Note on revised text: We have revised text not only in the results section, but also as per reviewer #2’s recommendation, we have revamped our introduction and discussion as well. Since the manuscript has been significantly revised to include a main figure 6, fully altered Figure 1 and 3, multiple new supplemental figures, the changes in text are extensive. Hence, they are unmarked in the main text. Nonetheless, we hope that the reviewers will be able to evaluate these changes, as we have provided the specific locations in text and embed key figures in the point-point response below.

      __Part B: __Point-Point responses to reviewer comments.

      Reviewer #1 comments in Blue, author response in black.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Kelly et al. show that the difference between the feeding behavior of fed and starved flies (hunger-driven feeding; HDF) is absent in animals fed a high-sugar diet (HSD) for two weeks or more. The disappearance of HDF with HSD coincides with changes in phospholipid profiles caused by HSD. Furthermore, RNAi-mediated downregulation of Pect in the fat body-a key enzyme in the PE biosynthesis pathway-phenocopies physiological effects of HSD. Moreover, downregulation or overexpression in the fat body abolishes or induces HDF, respectively, abolishes or induces HDF, respectively, independent of HSD treatment.

      Overall, the manuscript is well-written and the phenotypes are clear. However, I have major concerns regarding the authors' interpretation of the data and their conclusion. Most importantly, while it is clear that the authors' high-sugar dietary treatment affects feeding behavior and physiology, I am not convinced that the changes can be considered "hunger-driven"-which is central to the main point of the manuscript. Therefore, it is my recommendation that the authors substantially revise the manuscript by either showing additional/re-analyzed data that rule out alternative hypotheses, or rewriting the manuscript keeping alternative interpretations in mind.

      We are thankful to this reviewer for their thoughtful critique, and constructive and specific suggestions on how we can redress these concerns. We have taken on board the concerns of this reviewer regarding our interpretation of whether the changes in feeding behavior can be considered hunger-driven or not. Based on their advice, we have made significant changes by addressing: i) does HSD increased baseline feeding- we now show non-normalized raw data and data supports conclusion that baseline feeding is not higher; ii) whether HSD- fed flies can sense an energy deficit at levels similar to NF fed flies- we show that HSD flies sense energy deficit. We have provided detailed response below, and we hope the reviewer finds the additional datasets and re-analyzed data are consistent with the interpretation that prolonged HSD dampens starvation induced feeding. In addition to this key concern this reviewer has made a many other salient points that we have addressed with additional data or by clarifying the text.

      Major comments: 1) The data do not sufficiently show that the long-term HSD regime disrupts "hunger-sensing." The manuscript should address alternative hypotheses by showing raw instead of normalized data, rewriting the manuscript with a new central conclusion, or running additional experiments that actually show a defect in hunger-driven response. a. The main results that the authors rely on for the argument is that the ratio of feeding events that the starved and non-starved flies eat is different between the groups fed normal or HSD. However, because the authors only show normalized data (normalized to non-starved flies; Fig. 1), it is difficult to tell whether the change is due to a chronically increased feeding in non-starved HSD flies-maybe in perpetual hunger-like allostasis-or dampened starvation response. Indeed, the data shown in Fig S1 show that flies fed HSD for as short as 5 days show more frequent feeding events compared to age-matched controls fed normal food. It is possible that because the HSD-fed flies eat more than NF-fed flies, even without being starved, the ratio of starved/non-starved feeding is lower in the HSD-fed group-due to changes in the denominator, rather than the numerator.

      We have taken onboard this concern regarding presenting only normalized data, and that clouded the interpretation and left open other possibilities. In the completely revised figure 1 and S1. We now show non-normalized data, as a function of time. First we note that HSD-fed flies, do not show higher baseline feeding that NF fed flies, except on Day 10 of HSD, when there is a modest but significant elevation (Figure 1D).

      Nonetheless, on Day 10 HSD, flies still display increased hunger-driven feeding HDF (Figure 1C), it is only after Day 14 HSD that HSD dampens the starvation induced feeding.

      1. It is also possible that the HSD-fed flies are simply not in as big an energy deficit physiologically, due to the increased fat deposits they've accumulated (as the authors show later in the manuscript). It may take longer for the fat HSD flies to reach substantial energy deficiency than the NF flies, but they still may eventually be able to appropriately respond to hunger, just like NF flies. In such case, it would be a misnomer to call this behavioral change a 'defect in hunger-driven feeding behavior.' Maybe an experiment with a dose-response curve of "hunger driven feeding response" as a function of duration of starvation would help? Prompted by this reviewers question, we asked whether HSD fed flies, that have a higher baseline neutral fat store (Triacylglycerol-TAG) level, and if HSD-fed flies can sense energy deficit. For this, we revisited the longitudinal assays for neutral fat triacylglycerol (TAG) storage that our lab had generated, along with the HSD-HDF studies. We now present this evidence as Figure 1E-1G and Figure S2. Overall, our experiments point to the idea that adult flies fed HSD, are able to sense and mobilize TAG stores effectively throughout the 28-day time point that we analysed.

      First as shown in Figure S2, flies fed HSD display an increase in TAG levels. But it is to be noted that while TAG stores increase, the increase is not linear with time. This suggests that adult flies exposed to HSD store excess energy as TAG, but the increased TAG stores stay within a certain range despite the length of HSD exposure. This suggests that adult flies on HSD still display TAG homeostasis.

      Next, to directly address the reviewers point about HSD fed flies not sensing an energy deficit, we subject HSD-fed flies to an overnight starvation, same regime as used in the overnight feeding experiments, and asked whether they mobilize TAG. We noted that flies exposed to HSD breakdown TAG throughout the 28-day exposure at statistically significant levels for Day 3- Day 28, except on 14 and 21 days (Figure 1F). While there is TAG mobilization on Day 14 and 21, the difference is not statistically significant. Nonetheless, we note the same levels TAG breakdown for normal lab food (NF) fed flies on Day 14 and 21 (Figure 1E). Overall, HSD fed flies sense and display energy deficit, as measured by TAG store mobilization, throughout the 28 days of HSD exposure, at levels comparable to NF-fed flies (Figure 1G).

      Taken together, these results suggest that while HSD-fed flies experience an energy deficit on starvation, at levels comparable to NF-fed flies, throughout the 28-day time point assayed. But, their starvation driven feeding-response is dampened by Day 14 and by Day 28, the HSD-fed flies display more feeding events than HSD starved flies. These results are consistent with the interpretation that in HSD-fed flies the starvation-induced feeding behavior becomes desynchronized from the starvation induced TAG-mobilization, suggesting that there is an absence of hunger-driven feeding.

      2) How can you be sure that lower Dilp5 immunofluorescence is indicative of increased Dilp5 secretion? Wouldn't decreased production of dilp5 also have the same results?

      It has been shown previously in HSD fed larvae are hyperinsulinemic, i.e., they have 55% increase in circulating Dilp2 ( PMID: 22567167). Additionally, we have shown that ectopic activation of the insulin-producing neurons by expressing TRPA1, an ion channel that activates neurons, reduces Dilp5 accumulation without a change in Dilp5 mRNA levels (PMID: 32976758), suggesting that reduced Dilp5 accumulation, without alterations to mRNA levels is a proxy for increased secretion. Now, in response to this concern, in the revised manuscript, we have added qPCR data of Dilp2 and 5 (Figure S3A), which show no difference in expression levels after 14 days on HSD. Therefore, there is no dip in Dilp5 mRNA production. Given that Dilp2 and Dilp5 mRNA levels remain the same, but we see reduced Dilp5 accumulation, we interpret this to mean that Dilp5 secretion is increased.

      1. Also, the authors should state in the main text that it is Dilp5, not just any Dilp. Thanks for this suggestion and we have fixed this and referred to Dilp5 specifically throughout the text in the results section.

      3) Data presentation: a. Sometimes the data are normalized to NF (Fig 4B-C), sometimes not (ex. Fig 4A, S4C). Unless there is a specific rationale for the data transformation, it would be more appropriate to show untransformed data (ex. Fig 4A, S4C), especially as the authors use two-way ANOVA to determine significance. Only showing the differences implies comparison against a hypothetical mean (i.e. μ0=0), not between two group means.

      We thank the reviewers for bringing this issue to our attention. We updated all the figures to show untransformed data in the revised manuscript.

      1. Some figures show both individual data points and summary statistics (mean, SD, ... ex. Fig 2A)-which I believe is ideal-but some show only one or the other (ex. Fig 2B, no summary statistics; Fig. 3, no data points. The manuscript would read more convincing if data visualization is consistent across figures. We thank the reviewers for their feedback. We have made changes to all the figures in the revised manuscript to improve visual consistency.

      Minor comments: 1) High sugar diet: what is the actual sugar concentration in the NF v. HSD diets? The authors write that the HSD diet contains "30% more sugar" than the NF, but providing the final sugar concentrations-sucrose or others-would be informative for other scientists studying the effect of high sugar diets.

      We thank the reviewer for their suggestion and now we have updated the methods to include this sentence. After 7 days, flies were either maintained on normal diet or moved to a high sugar diet (HSD), composed of the same composition as normal diet but with an additional 300g of sucrose per liter”.

      1. Additionally, the definition of HSD is inconsistent. Main text (Page 5, line 17) states that their HSD is "60% more sugar than normal media," whereas the figure legend (Fig 1) and the Methods state that the HSD contains "30% more sugar." We apologize for this egregious typo in the figure legend! We have now fixed this to say 30% HSD. Only 30% HSD was used throughout this study.

      2) Starvation medium: please provide justification for why the authors used 1% sucrose/agar for starvation medium, instead of plain agar/water that most labs use. At least clarify and provide a reference for the claim that the 1% sucrose/agar "is a minimal food media to elicit a starvation response."

      We are very grateful for this reviewer identifying this this methods description error and bring it to our attention. We used 0% sucrose agar for overnight starvation in this study as most labs do. The error occurred because we were using another manuscript from the lab to help draft the methods section (PMID: 29017032). In that study, where we assayed the effect of chronic starvation our lab used: “1% sucrose agar for 5 days at 25C”. However, in this current study, because we are testing acute effects of overnight starvation, we are using 0% sucrose agar.

      3) Pect mRNA level is higher with HSD. This is surprising because not only, as authors mention, is increased PC32.2 with HSD suggests lower Pect activity, but also because Pect RNAi phenocopies long-term HSD in HDF behavior, lipid morphology, FOXO accumulation in fat body. The authors speculate that the data "likely shown an upregulation in an attempt to mediate the Pect dysregulation occurring at the protein level." If that were true, a western blot may be informative. Zhao and Wang (2020, PLoS Genetics) generated a Pect antibody that seems compatible with western blot applications. That being said, I don't think such data is critical for the manuscript. I mention this simply as a suggestion for the authors. a. page 8, line 22-23, did you mean to write "Given how PC32.2 is elevated after 14 days of exposure to HSD, we assumed that Pect levels would be low for flies under HSD," not "high?" Otherwise the subsequent 2 sentences don't make sense.

      We agree that the most confusing aspect of the study was that Pect mRNA levels being very high on Day 14 HSD, but nonetheless the effects of Pect-KD phenocopied HSD. To resolve this, we have now performed lipidomic analyses on whole adult flies, when Pect is knocked-down (KD) by RNAi in the fat tissue. We now present a new dataset in Figure 6. Two striking changes occur. They are:

      1. Pect-KD shows increase in the phospholipid classes LPC and LPE (Figure 6A). In contrast, LPE is significantly downregulated on HSD Day 14 (Figure 3).
      2. Pect-KD shows a significant reduction in specific class of PE 36.2 (Figure 6B). Our data regarding increase in PE 36.2 agree with a previous lipidomic analyses of Pect mutant retina (PMID: 30737130). In contrast, PE 36.2 trends upwards on 14 day HSD (Figure S7C) though not significantly. On 14-day HSD consistent with extreme upregulation of Pect mRNA fed flies (Figure S6A; Pect mRNA 200-250 fold), PE trends upwards on 14-day HSD (Figure 3) and PE 36.2 trends higher (Figure S7C). We note that on the surface of it PE and LPE per se are contrasting between 14-day HSD lipidome and fat-specifc Pect-KD. But there is a significant commonality that under both states there is an imbalance of phospholipids classes PE and LPE. Hence, we propose that maintaining the compositional balance of phospholipid classes PE and LPE is critical to hunger-driven feeding and insulin sensitivity. Hence, either increase or decrease, of these key phospholipid species, may lead to abnormal hunger-driven feeding.

      We agree that a western blot would be informative as well, but we were unable to obtain the reagent from Dr. Wang’s group, precluding us from performing this request. See email snapshot.

      To ensure that we appropriately discuss and clarify this issue, we have now included a section in the discussion - Page 14 Lines 26-34- under the subtitle “The implications of relationship between Pect levels and HSD”. We have pasted an excerpt from that subsection below for this reviewers assessment.

      Also, we note that over-expression of Pect cDNA in the fat-body does not alter phospholipid balance (Figure S9) and indeed improves HDF on HSD (Figure 7B). While this may appear inconsistent, it is critical to note that over-expression of Pect cDNA using UAS/Gal4 only increases Pect mRNA expression by 7-fold (Figure S6A), whereas HSD causes its upregulation by 250-fold (Figure S6B). Hence, we speculate that an increased ‘basal’ level of Pect such as by that provided by a cDNA over-expression in fat, may be protective to the negative effects of HSD (Figure 7B) without affecting overall phospholipid levels (Figure S9) , but extreme upregulation Pect on HSD affects the PE and LPE balance (Figure 3).”

      Reviewer #1 (Significance (Required)):

      The work is potentially novel and interesting, but at this stage it's difficult to interpret what the phenotype signifies. Although the manuscript could be revised simply by modifying the text, experimentally addressing the concerns would significantly improve the work.

      In sum, we hope we have addressed the key concern for Reviewer #1 as to whether the behavior we report here is indeed a dampening of starvation-induced feeding, or an effect of increase in baseline feeding. We hope that by reviewing our non-normalized data, they can appreciate that it is the former. Also, we hope that Reviewer #1 appreciates that we have strived to address the concerns by additional experiments, to clarify our findings and improve the impact of the work.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This intriguing manuscript by Kelly and colleagues uses the fruit fly Drosophila melanogaster as a model to understand how diet-induced obesity alters the feeding response over time. In particular, the authors findings indicate that chronic exposure to a high-sugar diet significantly alters the starvation-induced feeding response. These behavioral studies are complemented by a lipidomics approach that reveals how a chronic high sugar affects many lipid species, including phospholipids. The authors then pursue mechanistic studies that indicate phospholipid metabolism within the fat body appears to remotely affect insulin secretion from the insulin producing cells. Moreover, the changes in phospholipid abundance are associated with changes in insulin-signaling, including increased insulin secretion from the IPCs and elevated levels of FOXO within the nucleus.

      I find the study to be potentially very important - the authors combine a longitudinal study that would be difficult in any other model with the powerful genetic tools available in the fly. The conclusions are mostly convincing, but a few follow-up experiments are required:

      We are grateful for the reviewers constructive, detail-oriented, and balanced feedback, and their recognition of the value of this study. Now, we have performed additional experiments to address the key concerns raised by all reviewers. We hope that on reading the revised version of our study, that the reviewer continues to feel positive about the message of this study and its potential impact.

      1. The key conclusions from the manuscript assume that manipulation of Pect expression levels alters phosphatidylethanolamine (PE) levels. However, the authors make no attempt to verify that the genetic experiments described herein actually affect PE levels. At a minimum, changes in PE levels should be verified for the Pect knockdown and overexpression lines. Similarly, there is no evidence that manipulation of either EAS or Pcyt2 induces the expected metabolic effects. I'm not asking that the longitudinal feeding experiments be repeated, simply that the authors measure the relevant lipid species, preferably with a targeted LC-MS approach.

      Prompted by this reviewer, we performed targeted LC-MS on whole adult flies, on normal diet, to assess lipid levels for fat-specific Pect-KD and overexpression. We decided to focus on Pect, as its knock-down even on normal diet causes a dampened hunger-driven feeding behavior (Figure 7A) and phenocopied a 14-day HSD feeding phenotype.

      We now present a new dataset in Figure 6. Two striking changes occur:

      They are:

      Pect-KD shows a significant reduction in specific class of PE 36.2 (Figure 6B). Our data regarding decrease in PE 36.2 agree with a previous lipidomic analyses of Pect mutant retina (PMID: 30737130). It is to be noted that though overall levels of all PE species trend downwards, like the Clandinin lab study on Pect (PMID: 30737130), we did not find a significant change in the overall PC and PE levels.

      • Pect-KD shows increase in the phospholipid classes LPC and LPE (Figure 6A). In contrast, LPE is significantly downregulated on HSD Day 14 (Figure 3). On 14-day HSD consistent with extreme upregulation of Pect mRNA fed flies (Figure S6A; Pect mRNA 200-250 fold), PE trends upwards on 14-day HSD (Figure 3) and PE 36.2 trends higher (Figure S7C). We note that on the surface of it PE and LPE per se are contrasting between 14-day HSD lipidome and fat-specifc Pect-KD. But there is a significant commonality that under both states there is an imbalance of phospholipids classes PE and LPE. Hence, we propose that maintaining the compositional balance of phospholipid classes PE and LPE is critical to hunger-driven feeding and insulin sensitivity. Hence, either increase or decrease, of these key phospholipid species, may lead to abnormal hunger-driven feeding.

      Finally, fat-specific Pect-OE did not cause significant changes to lipid species (Figure S9). This could either be due to the fact that in fat-specific Pect-OE flies under normal food and that we were assaying whole body lipid levels and not fat-specific lipid changes. But to counter that, even a 60% reduction in Pect mRNA levels (Figure S6A), was sufficient to produce an effect on whole body phospholipid balance (Figure 6). Hence, we speculate that by maintaining a basally higher (7-fold higher Pect mRNA level Figure S6A), might allow 14-day HSD-fed flies to buffer the negative effects of HSD and we predict that it might take longer to disrupt the phospholipid balance and HDF response.

      We have now included a section in the discussion - Page 14 Lines 26-34- under the subtitle “The implications of relationship between Pect levels and HSD”. We have pasted an excerpt from that subsection below for this reviewers assessment.

      Also, we note that over-expression of Pect cDNA in the fat-body does not alter phospholipid balance (Figure S9) and indeed improves HDF on HSD (Figure 7B). While this may appear inconsistent, it is critical to note that over-expression of Pect cDNA using UAS/Gal4 only increases Pect mRNA expression by 7-fold (Figure S6A), whereas HSD causes its upregulation by 250-fold (Figure S6B). Hence, we speculate that an increased ‘basal’ level of Pect such as by that provided by a cDNA over-expression in fat, may be protective to the negative effects of HSD (Figure 7B) without affecting overall phospholipid levels (Figure S9), but extreme upregulation Pect on HSD affects the PE and LPE balance (Figure 3).”

      A central hypothesis in the study is that the HSD over a period of 14 days results in insulin resistant and that these changes are leading to changes in hunger dependent feeding. I would encourage the authors to determine if Foxo mutants are resistant to these HSD-induced effects on HFD.

      We thank the reviewers for this suggestion. However, given that dFOXO nuclear localization rather than expression levels regulate insulin sensitivity, we feel that disrupting dFOXO levels via mutation or knockdown will produce a plethora of indirect effects including developmental abnormalities (PMID: 24778227, PMID: 16179433, PMID: 29180716, PMID: 12893776). Our data suggest that chronic HSD treatment and Pect affect insulin sensitivity in fat tissue. However, we feel that investigating whether insulin sensitivity/FOXO signaling in fat tissue regulates feeding behavior is outside the scope of our work.

      1. In lines 25-30, the authors draw the conclusion that an increase in unsaturated fatty acid species is associated with the HSD and that these changes results in a more fluid lipid environment. While I agree with the model, the manuscript contains no evidence to support such a model. Either test the hypothesis or move the last line of the section to the discussion.

      We thank the reviewer for this important and insightful comment. We agree that the data we presented and discussed in the original version is at the moment speculative. Addressing the hypothesis that increase in unsaturated fatty acid species result in a more fluid lipid environment will require us to build tools and expertise. Hence, this hypothesis is better suited for exploration in a future study. Given this, we have moved this out of the results section into the Discussion section titled “HSD and fat-specific PECT-KD causes changes to phospholipid profile” (See excerpt below from page 13, lines 24-35).

      In addition to changes in phospholipid classes, we found that HSD caused an increase in the concentration of PE and PC species with double bonds (Figure S4C and S4D). Double bonds create kinks in the lipid bilayer, leading to increased lipid membrane fluidity which impacts vesicle budding, endocytosis, and molecular transport14,92. Hence it is possible that a mechanism by which HSD induces changes to signaling is by altering the membrane biophysical properties, such as by increased fluidity, which would have a significant impact on numerous biological processes including synaptic firing and inter-organ vesicle transport.”

      Also, as per the reviewer’s guidance, given that we are speculating here, we have also shifted this dataset from Main figure 4 to supplement S4C and S4D.

      In addition, lines 25-30 state that FFAs are increased after 14 days of a HSD. Figure 3A shows the exact opposite - FFAs are significantly decreased in 14 day fed animals despite being elevated in the 7 day fed animals. This is an interesting result that warrants discussion. Moreover, I would encourage to examine the lipidomic data more carefully to ensure that the text accurately portrays the lipid profiles.

      We apologize for misstating that FFAs are decreased on 14-day HSD in the lines 25-30. It was an error and we have corrected this. We agree with the reviewer that the reduction of FFA on Day 14-HSD is an intriguing and unexpected observation that needs to be emphasized and further discussed. To this end, we have added figure S4B, wherein we have provided the difference in FFA concentration (by species) after days 7 and 14.

      Furthermore, we have discussed what the potential meaning of reduced FFA at Day 14 implies in page 12, lines 19-27 of the Discussion section titled “HSD and fat-specific PECT-KD causes changes to phospholipid profile”. We have stated the following-

      We speculate that this reduction in FFA maybe due to their involvement in TAG biogenesis (PMID: 13843753). We were interested to see if the decrease in FFA correlated to a particular lipid species, as PE and PC are made from DAGs with specific fatty acid chains. However, further analysis of FFAs at the species level did not reveal any distinct patterns. The majority of FFA chains decreased in HSD, including 12.0, 16.0, 16.1, 18.0, 18.1, and 18.2 (Figure S4B). This data was more suggestive of a global decrease in FFA, likely being converted to TAG and DAG, rather than a specific fatty acid chain being depleted.”

      The processed lipidomics data should also be included as supplementary data table so that they can be independently analyzed by the reader.

      We thank the reviewer for this suggestion. As per the reviewers request, we have included the raw data as an attachment in our supplementary material (Supplementary Files 1-3.), so that interested readers can use the datasets generated in this study for future work and further analysis.

      Beyond these experimental suggestions, the manuscript needs significant editing for clarity. While I won't provide a comprehensive list, the authors need to provide accurate descriptions and annotation of genotypes (including w[1118], which is written as W1118), typos, and formatting. I've listed a few examples below:

      1. Page 3, Line 1 and 2: "...have been shown to impact feeding behavior and metabolism that leads to..." This is an awkward and grammatically incorrect sentence.
      2. Page 3, Lines 7-32 is one very large paragraph but contains concepts that should be broken down over at least three paragraphs.
      3. Page 3, Line 25: A description of the reaction catalyzed by Pect would be helpful for a manuscript focused on Pecte activity.
      4. Page 4, Line 10: "previously characterized method of eliciting diet induced feeding behavior." As stated in the text, the method is previously described yet the manuscript characterizing the method isn't cited.
      5. Figure legend 3 contains a random assortment of capitalized lipid species. Also, the names of lipid species are inappropriately broken into multiple names. Please use correct nomenclature throughout the manuscript.

      The list above is nowhere near comprehensive. The manuscript requires significant editing.

      We are grateful to the reviewer for drawing our attention to these errors. We have made significant edits to the revised manuscript to address the above-mentioned concerns, as well as made additional textual changes throughout and copyedited it. We hope that the reviewer will find the manuscript reads better and the clarity and preciseness is significantly improved.

      Reviewer #2 (Significance (Required)):

      I find the study to be potentially very important - the authors combine a longitudinal study that would be difficult in any other model with the powerful genetic tools available in the fly. The findings will significantly advance our understanding of how lipid metabolism links dietary nutrition with feeding behavior.

      Once again, we are grateful for this reviewer’s thoughtful critique and encouraging words regarding our work and its potential impact.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: This manuscript uses Drosophila to investigate how diet-induced obesity and the changes in the lipid metabolism of the fat boy modulate hunger-driven feeding (HDF) response. The authors first demonstrate that chronic exposure (14 days) of high sugar diet (HSD) suppresses HDF response. Through lipidome analysis, the authors identify a specific class of lipids to be elevated upon chronic HSD feeding. This coincided with the changes in expression of Pect, an enzyme that regulates the biosynthesis of these lipids. Modulating the expression of Pect specifically in the fat body affected HDF response.

      We thank this reviewer for their rigorous and thoughtful critique and for identifying a key issue with our original study pertaining to a gap in how Pect mRNA levels on 14-day HSD are elevated but the Pect-KD phenocopies the HDF. Now by performing whole-body adult fly lipidomic on fat-specific Pect-KD we have resolved this issue and provided clarity on role of Pect in maintaining phospholipid homeostasis and thus subsequently impacts hunger-driven feeding. We hope the reviewer finds that the revised manuscript provides further clarity to the functional link between Pect’s role in fat-body and hunger-driven feeding.

      Major comments: The author claim that the HDF response in HSD is distinct between early (5d, 7d) and chronic (day 14) HSD feeding. However, the data seem to indicate that HDF response is significantly decreased at all time points in HSD. For example, at day 5 HDF response was increased only 3-fold in HSD (Figure 1C) compared to around 50-fold increase in NF (Figure 1B). The scale of the Y-axis in Figure 1B and 1C is an order of magnitude different. Including the starved data (NFstv and HSDstv) in Figure S1, normalized to NF fed group, would better visualize the overall trends. Related to this, having the source data for the actual number of feeding events would be useful (e.g., to see the baseline changes in feeding in different time points in Figure 1 and the effect of genetic manipulations in Figure 7).

      As per the reviewers request, we now have modified our graphs to show source data (Figure S1) and show the raw feeding events.

      Then in the non-normalized graphs we plot, over a longitudinal time course, baseline and hunger-driven feeding events (Figure 1B-D). We also show that HSD fed flies do not display increased baseline feeding (Figure 1D) suggesting that the effect we see on HDF are no clouded by increased baseline feeding.

      Yes, the reviewer makes an important point that HDF response on HSD fed flies is of a lower magnitude than NF fed flies. We think that is a biologically meaningful observation, as it suggests that flies have a remarkably fine-tuned ability to coordinate food-intake with nutrient store levels.

      ­­Now we have included a paragraph in the Discussion, Page 11 Lines 23-27, that say the following to ensure the readers appreciate this salient point raised by this reviewer.

      *It is to be noted that the HDF response of HSD-fed flies (Figure 1C, Days 3-10) is of lower order of magnitude than the NF-fed flies. This suggests that that in addition to sensing an energy deficit and mobilizing fat stores (Figure 1F, 1G, S1), HSD fed flies calibrate their starvation-induced feeding to compensate only for the lost amount of fat. Overall, this suggests that flies have a remarkably fine-tuned ability to coordinate food-intake with nutrient store levels. *

      The association between fat body Pect level and phospholipid levels is not clear. Day 14 of HSD feeding shows high expression of Pect in the fat body and elevated levels of PC32.0 and PC32.2. The authors assume the high expression of Pect in the fat body is due to the compensatory response, but there are no data indicating downregulation of Pect levels at the earlier time points of HSD feeding. A previous study demonstrated that Pect mutant flies have lower levels of PC32.0 but higher PC32.2 (PMID: 30737130).

      We agree that one puzzling aspect of the original version of this study was that Pect mRNA levels being very high on Day 14 HSD, but nonetheless the effects of Pect-KD phenocopied HSD. To resolve this, prompted by Reviewer #2 and #3 concerns, for this revised version we have now performed lipidomic analyses on whole adult flies, when Pect is knocked down (KD) by RNAi in the fat tissue. We now present a new dataset in Figure 6. Two striking changes occu. They are:

      1. Pect-KD shows increase in the phospholipid classes LPC and LPE (Figure 6A). In contrast, LPE is significantly downregulated on HSD Day 14 (Figure 3).
      2. Pect-KD shows a significant reduction in specific class of PE 36.2 (Figure 6B). Our data regarding increase in PE 36.2 agree with a previous lipidomic analyses of Pect mutant retina (PMID: 30737130). In contrast, PE 36.2 trends upwards on 14 day HSD (Figure S7C) though not significantly. On 14-day HSD consistent with extreme upregulation of Pect mRNA fed flies (Figure S6A; Pect mRNA 200-250 fold), PE trends upwards on 14-day HSD (Figure 3) and PE 36.2 trends higher (Figure S7C). We note that on the surface of it PE and LPE per se are contrasting between 14-day HSD lipidome and fat-specifc Pect-KD. But there is a significant commonality that under both states there is an imbalance of phospholipids classes PE and LPE. Hence, we propose that maintaining the compositional balance of phospholipid classes PE and LPE is critical to hunger-driven feeding and insulin sensitivity. Hence, either increase or decrease, of these key phospholipid species, may lead to abnormal hunger-driven feeding.

      On day 14, HDF response was increased 70-fold in w1118 flies in NF (Figure 1B; w1118), but only 2.5-fold in lpp>LucRNAi control flies in NF (Figure 7A). This suggests that lpp-gal4 driver lines have a significant effect on HDF response. Using a different fat-body specific Gal4 line would be necessary to validate conclusions.

      Regards reduced HDF magnitude, in our experience using UAS-Gal4 reduces HDF response magnitude consistently and cannot be compared to w1118 which is more robust. To account for background differences, we use Uas-Gal4 with control RNAi. It clearly shows differences in HDF response on starvation, but Pect and Pisd RNAi does not (Figure 7A). Hence, given that this experiment internally controls for any changes in HDF response for UAS-Gal4>RNAi, we conclude that HDF response in disrupted in Pect and PISD KD (Figure 7).

      We only presented the Lpp-driver in our study, as this driver is the only fat-specific driver that has no leaky expression in other tissues, and is specific to fat as apolpp promoter used to generate this Gal4 line is only expressed in fat tissue (Eaton and colleagues, PMID: 22844248). Other widely used fat-specific drivers, including the pumpless-Gal4 (ppl-Gal4) driver has leaky expression in gut or other tissues (See Table 2 of this detailed study by Dr. Drummond- Barbosa https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7642949/). If the reviewer is aware of a fat-specific Gal4 line, other than Lpp-Gal4, which has a highly specific expression in the fat tissue without leaky expression in other tissues, then we are happy to take onboard the reviewer’s suggestion and try that fat-specific Gal4 that they suggest.

      HSD feeding promotes Pect expression (Figure S3C) and global changes in phospholipid levels (Figure 3, 4). Therefore, shouldn't Pect overexpression (not Pect RNAi) in a normal diet mimic HSD feeding state and promote loss of HDF response? Conversely shouldn't knockdown of Pect in HSD rescue loss of HDF response?

      We agree that a puzzling aspect is that Pect mRNA levels are significantly elevated in HSD Day-14, but Pect-KD showed displays the inappropriate HDF response. As we have described in our response to this reviewer on Page 19, we believe that Pect-KD and HSD disrupt PE and LPE balance overall but in different ways. Whereas Pect-OE using cDNA expression in fat body does not cause a significant change to any lipid class (Figure S9), and our results suggest that basally higher level of PECT is likely to be protective on HSD with respect to HDF(Figure 7B).

      To ensure that we appropriately discuss and clarify this issue, we have now included a section in the discussion - Page 14 Lines 26-33- under the subtitle “The implications of relationship between Pect levels and HSD”. We have pasted an excerpt from that subsection below for this reviewers assessment.

      Also, we note that over-expression of Pect cDNA in the fat-body does not alter phospholipid balance (Figure S9) and indeed improves HDF on HSD (Figure 7B). While this may appear inconsistent, it is critical to note that over-expression of Pect cDNA using UAS/Gal4 only increases Pect mRNA expression by 7-fold (Figure S6A), whereas HSD causes its upregulation by 250-fold (Figure S6B). Hence, we speculate that an increased ‘basal’ level of Pect such as by that provided by a cDNA over-expression in fat, may be protective to the negative effects of HSD (Figure 7B) without affecting overall phospholipid levels (Figure S9) , but extreme upregulation Pect on HSD affects the PE and LPE balance (Figure 3).”

      We would have liked to test Pect protein expression on HSD, but since we were unable to access antibodies for Pect published in a prior study (PMID: 33064773) from Dr. Wang’s lab (see Page 10-11, of response to Reviewer #1). Hence, we were unable to test how the proteins levels of Pect correlate with the 250-fold increase mRNA expression.

      In conclusion, we hope the reviewer appreciates that our results regarding Pect function are consistent with the main conclusion that achieving the right phospholipid balance between PE and LPE, is critical for an organism to display an appropriate HDF response.

      Minor comments: All graphs should plot individual data points and showed as box and whisker plot as much as possible.

      Thanks for this suggestion, we have added individual data points to the vast majority of figures in the paper. We have made exceptions to graphs such as seen in figure 1 and FigureS4B-D where we find individual data points add an unnecessary layer of complexity. We hope these changes provide additional clarity and strength to the claims made in this manuscript.

      Data for day 14 missing in Figure S4A and S4B.

      We have provided Day 14 for the PC composition and PE composition, due to changes in Figures, they are now S7A and S7B.

      Reviewer #3 (Significance (Required)):

      The interactions between diet-induced obesity, peripheral tissue homeostasis and feeding behavior is an interesting topic that can be addressed using Drosophila. This manuscript demonstrates how fat body Pect levels affect HSD induced changes in hunger-driven feeding response. However, at this point, the functional association between fat body Pect level, global phospholipid level, and loss of hunger-driven feeding response in chronic HSD feeding is not clear.

      We hope the revised data, and discussion of the paper, provides well-substantiated functional association on the importance of maintaining phospholipid balance, driven by Pect enzyme, as a critical regulator of hunger-driven feeding behavior. As stated in the revised discussion, the key take home message of our manuscript is that on prolonged HSD exposure PC, PE and LPE levels are dysregulated, the loss of phospholipid homeostasis coincided with a loss of hunger-driven feeding. Following this lead on phospholipid imbalance, we then uncovered a critical requirement for the activity of the rate-limiting PE enzyme PECT within the fat tissue in controlling hunger-driven feeding.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This manuscript uses Drosophila to investigate how diet-induced obesity and the changes in the lipid metabolism of the fat boy modulate hunger-driven feeding (HDF) response. The authors first demonstrate that chronic exposure (14 days) of high sugar diet (HSD) suppresses HDF response. Through lipidome analysis, the authors identify a specific class of lipids to be elevated upon chronic HSD feeding. This coincided with the changes in expression of PECT, an enzyme that regulates the biosynthesis of these lipids. Modulating the expression of PECT specifically in the fat body affected HDF response.

      Major comments:

      The author claim that the HDF response in HSD is distinct between early (5d, 7d) and chronic (day 14) HSD feeding. However, the data seem to indicate that HDF response is significantly decreased at all time points in HSD. For example, at day 5 HDF response was increased only 3-fold in HSD (Figure 1C) compared to around 50-fold increase in NF (Figure 1B). The scale of the Y-axis in Figure 1B and 1C is an order of magnitude different. Including the starved data (NFstv and HSDstv) in Figure S1, normalized to NF fed group, would better visualize the overall trends. Related to this, having the source data for the actual number of feeding events would be useful (e.g., to see the baseline changes in feeding in different time points in Figure 1 and the effect of genetic manipulations in Figure 7).

      The association between fat body PECT level and phospholipid levels is not clear. Day 14 of HSD feeding shows high expression of pect in the fat body and elevated levels of PC32.0 and PC32.2. The authors assume the high expression of pect in the fat body is due to the compensatory response, but there are no data indicating downregulation of pect levels at the earlier time points of HSD feeding. A previous study demonstrated that pect mutant flies have lower levels of PC32.0 but higher PC32.2 (PMID: 30737130). To better understand the link the authors should knockdown/OE PECT specifically in the fat body and assess changes in phospholipids.

      On day 14, HDF response was increased 70-fold in w1118 flies in NF (Figure 1B; w1118), but only 2.5-fold in lpp>LucRNAi control flies in NF (Figure 7A). This suggests that lpp-gal4 driver lines have a significant effect on HDF response. Using a different fat-body specific Gal4 line would be necessary to validate conclusions.

      HSD feeding promotes PECT expression (Figure S3C) and global changes in phospholipid levels (Figure 3, 4). Therefore, shouldn't PECT overexpression (not PECT RNAi) in a normal diet mimic HSD feeding state and promote loss of HDF response? Conversely shouldn't knockdown of PECT in HSD rescue loss of HDF response?

      Minor comments:

      All graphs should plot individual data points and showed as box and whisker plot as much as possible. Data for day 14 missing in Figure S4A and S4B.

      Significance

      The interactions between diet-induced obesity, peripheral tissue homeostasis and feeding behavior is an interesting topic that can be addressed using Drosophila. This manuscript demonstrates how fat body PECT levels affect HSD induced changes in hunger-driven feeding response. However, at this point, the functional association between fat body PETC level, global phospholipid level, and loss of hunger-driven feeding response in chronic HSD feeding is not clear.

      Referees cross-commenting

      I agree with all the reviwers points; potentially very interesting, but requires a significant amount of work.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This intriguing manuscript by Kelly and colleagues uses the fruit fly Drosophila melanogaster as a model to understand how diet-induced obesity alters the feeding response over time. In particular, the authors findings indicate that chronic exposure to a high-sugar diet significantly alters the starvation-induced feeding response. These behavioral studies are complemented by a lipidomics approach that reveals how a chronic high sugar affects many lipid species, including phospholipids. The authors then pursue mechanistic studies that indicate phospholipid metabolism within the fat body appears to remotely affect insulin secretion from the insulin producing cells. Moreover, the changes in phospholipid abundance are associated with changes in insulin-signaling, including increased insulin secretion from the IPCs and elevated levels of FOXO within the nucleus.

      I find the study to be potentially very important - the authors combine a longitudinal study that would be difficult in any other model with the powerful genetic tools available in the fly. The conclusions are mostly convincing, but a few follow-up experiments are required:

      1. The key conclusions from the manuscript assume that manipulation of PECT expression levels alters phosphatidylethanolamine (PE) levels. However, the authors make no attempt to verify that the genetic experiments described herein actually affect PE levels. At a minimum, changes in PE levels should be verified for the PECT knockdown and overexpression lines. Similarly, there is no evidence that manipulation of either EAS or Pcyt2 induces the expected metabolic effects. I'm not asking that the longitudinal feeding experiments be repeated, simply that the authors measure the relevant lipid species, preferably with a targeted LC-MS approach.
      2. A central hypothesis in the study is that the HSD over a period of 14 days results in insulin resistant and that these changes are leading to changes in hunger dependent feeding. I would encourage the authors to determine if Foxo mutants are resistant to these HSD-induced effects on HFD.
      3. In lines 25-30, the authors draw the conclusion that an increase in unsaturated fatty acid species is associated with the HSD and that these changes results in a more fluid lipid environment. While I agree with the model, the manuscript contains no evidence to support such a model. Either test the hypothesis or move the last line of the section to the discussion.

      In addition, lines 25-30 state that FFAs are increased after 14 days of a HSD. Figure 3A shows the exact opposite - FFAs are significantly decreased in 14 day fed animals despite being elevated in the 7 day fed animals. This is an interesting result that warrants discussion. Moreover, I would encourage to examine the lipidomic data more carefully to ensure that the text accurately portrays the lipid profiles.

      The processed lipidomics data should also be included as supplementary data table so that they can be independently analyzed by the reader.

      Beyond these experimental suggestions, the manuscript needs significant editing for clarity. While I won't provide a comprehensive list, the authors need to provide accurate descriptions and annotation of genotypes (including w[1118], which is written as W1118), typos, and formatting. I've listed a few examples below:

      1. Page 3, Line 1 and 2: "...have been shown to impact feeding behavior and metabolism that leads to..." This is an awkward and grammatically incorrect sentence.
      2. Page 3, Lines 7-32 is one very large paragraph but contains concepts that should be broken down over at least three paragraphs.
      3. Page 3, Line 25: A description of the reaction catalyzed by PECT would be helpful for a manuscript focused on PECT activity.
      4. Page 4, Line 10: "previously characterized method of eliciting diet induced feeding behavior." As stated in the text, the method is previously described yet the manuscript characterizing the method isn't cited.
      5. Figure legend 3 contains a random assortment of capitalized lipid species. Also, the names of lipid species are inappropriately broken into multiple names. Please use correct nomenclature throughout the manuscript.

      The list above is nowhere near comprehensive. The manuscript requires significant editing.

      Significance

      I find the study to be potentially very important - the authors combine a longitudinal study that would be difficult in any other model with the powerful genetic tools available in the fly. The findings will significantly advance our understanding of how lipid metabolism links dietary nutrition with feeding behavior.

      Referees cross-commenting

      I agree. We all think the manuscript is potentially interesting and important, but requires further experimentation. I agree with all concerns raised by the other reviewers. A revision would likely represent a significant amount of work.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Kelly et al. show that the difference between the feeding behavior of fed and starved flies (hunger-driven feeding; HDF) is absent in animals fed a high-sugar diet (HSD) for two weeks or more. The disappearance of HDF with HSD coincides with changes in phospholipid profiles caused by HSD. Furthermore, RNAi-mediated downregulation of PECT in the fat body-a key enzyme in the PE biosynthesis pathway-phenocopies physiological effects of HSD. Moreover, downregulation or overexpression in the fat body abolishes or induces HDF, respectively, abolishes or induces HDF, respectively, independent of HSD treatment.

      Overall, the manuscript is well-written and the phenotypes are clear. However, I have major concerns regarding the authors' interpretation of the data and their conclusion. Most importantly, while it is clear that the authors' high-sugar dietary treatment affects feeding behavior and physiology, I am not convinced that the changes can be considered "hunger-driven"-which is central to the main point of the manuscript. Therefore, it is my recommendation that the authors substantially revise the manuscript by either showing additional/re-analyzed data that rule out alternative hypotheses, or rewriting the manuscript keeping alternative interpretations in mind.

      Major comments:

      1. The data do not sufficiently show that the long-term HSD regime disrupts "hunger-sensing." The manuscript should address alternative hypotheses by showing raw instead of normalized data, rewriting the manuscript with a new central conclusion, or running additional experiments that actually show a defect in hunger-driven response.
        • a. The main results that the authors rely on for the argument is that the ratio of feeding events that the starved and non-starved flies eat is different between the groups fed normal or HSD. However, because the authors only show normalized data (normalized to non-starved flies; Fig. 1), it is difficult to tell whether the change is due to a chronically increased feeding in non-starved HSD flies-maybe in perpetual hunger-like allostasis-or dampened starvation response. Indeed, the data shown in Fig S1 show that flies fed HSD for as short as 5 days show more frequent feeding events compared to age-matched controls fed normal food. It is possible that because the HSD-fed flies eat more than NF-fed flies, even without being starved, the ratio of starved/non-starved feeding is lower in the HSD-fed group-due to changes in the denominator, rather than the numerator.
        • b. It is also possible that the HSD-fed flies are simply not in as big an energy deficit physiologically, due to the increased fat deposits they've accumulated (as the authors show later in the manuscript). It may take longer for the fat HSD flies to reach substantial energy deficiency than the NF flies, but they still may eventually be able to appropriately respond to hunger, just like NF flies. In such case, it would be a misnomer to call this behavioral change a 'defect in hunger-driven feeding behavior.' Maybe an experiment with a dose-response curve of "hunger driven feeding response" as a function of duration of starvation would help?
      2. How can you be sure that lower Dilp5 immunofluorescence is indicative of increased Dilp5 secretion? Wouldn't decreased production of dilp5 also have the same results?
        • a. Also, the authors should state in the main text that it is Dilp5, not just any Dilp.
      3. Data presentation:
        • a. Sometimes the data are normalized to NF (Fig 4B-C), sometimes not (ex. Fig 4A, S4C). Unless there is a specific rationale for the data transformation, it would be more appropriate to show untransformed data (ex. Fig 4A, S4C), especially as the authors use two-way ANOVA to determine significance. Only showing the differences implies comparison against a hypothetical mean (i.e. μ0=0), not between two group means.
        • b. Some figures show both individual data points and summary statistics (mean, SD, ... ex. Fig 2A)-which I believe is ideal-but some show only one or the other (ex. Fig 2B, no summary statistics; Fig. 3, no data points. The manuscript would read more convincing if data visualization is consistent across figures.

      Minor comments:

      1. High sugar diet: what is the actual sugar concentration in the NF v. HSD diets? The authors write that the HSD diet contains "30% more sugar" than the NF, but providing the final sugar concentrations-sucrose or others-would be informative for other scientists studying the effect of high sugar diets.
        • a. Additionally, the definition of HSD is inconsistent. Main text (Page 5, line 17) states that their HSD is "60% more sugar than normal media," whereas the figure legend (Fig 1) and the Methods state that the HSD contains "30% more sugar."
      2. Starvation medium: please provide justification for why the authors used 1% sucrose/agar for starvation medium, instead of plain agar/water that most labs use. At least clarify and provide a reference for the claim that the 1% sucrose/agar "is a minimal food media to elicit a starvation response."
      3. PECT mRNA level is higher with HSD. This is surprising because not only, as authors mention, is increased PC32.2 with HSD suggests lower PECT activity, but also because PECT RNAi phenocopies long-term HSD in HDF behavior, lipid morphology, FOXO accumulation in fat body. The authors speculate that the data "likely shown an upregulation in an attempt to mediate the PECT dysregulation occurring at the protein level." If that were true, a western blot may be informative. Zhao and Wang (2020, PLoS Genetics) generated a PECT antibody that seems compatible with western blot applications. That being said, I don't think such data is critical for the manuscript. I mention this simply as a suggestion for the authors.
        • a. page 8, line 22-23, did you mean to write "Given how PC32.2 is elevated after 14 days of exposure to HSD, we assumed that PECT levels would be low for flies under HSD," not "high?" Otherwise the subsequent 2 sentences don't make sense.

      Significance

      The work is potentially novel and interesting, but at this stage it's difficult to interpret what the phenotype signifies. Although the manuscript could be revised simply by modifying the text, experimentally addressing the concerns would significantly improve the work.

      The co-reviewer and I have expertise in Drosophila neurobiology and behavior.

      Referees cross-commenting

      Hi all, although the reviews hit upon some overlapping, but mostly different points, I agree with all of the concerns raised. There's some really interesting stuff here but some of the results, as presented, don't make sense. It's possible this will be clarified by revising the text, although I suspect it's more likely that the authors will have to add a number of the experimental suggestions made by the reviewers.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer 1

      Summary: The authors used conventional confocal and super-resolution STED microscopy to characterize the actin filament network in response to SARS-CoV-2 infection in pulmonary cells. They demonstrate that, although total levels of actin are unchanged, F-actin polymerization increases upon infection, with the most significant changes occurring at 48 hours post infection. Notably, F-actin remodels from primarily stress-fiber architectures to circularized, F-actin nanostructures that tend to colocalize with viral M cluster rings at 48 hours post infection. Additionally, there is a significant increase in F-actin-associated filopodia-like structures, with an example of a possible cell-to-cell filopodia that could possibly be a mode of inter-cellular viral transmission. The authors complement their imaging-based experiments with RNAseq to profile the cellular gene expression of SARS-CoV-2 infected pulmonary cells, revealing an upregulation of RHO GTPases activate PKNs and alpha-actinins. They show that treatment of pulmonary cells with Rho/SFR and PKN inhibitors during infection decreases the size of viral M clusters and release to comparable levels as the known viral therapeutic, Remdesivir.

      Major comments:

      1. The majority of the author's conclusions are based off of qualitative and quantitative analysis of their fluorescence images. While they do mention briefly an ImageJ plug-in and the statistical tests performed, the description of their quantitative image-based analyses for each experiment is lacking. For example, how was viral M cluster and actin intensity measured? How was the signal intensity normalized to account for variations in antibody labeling or other cell-to-cell variations? For figures 3C&D, how did the authors calculate viral and actin ring diameter? It is necessary to expand on the details of the quantitative analysis for each parameter mentioned in the methods section and/or include a figure panel demonstrating the details of the analysis (similar to what is nicely displayed for M cluster size in Figure S1B). Response:

      We would like to thank reviewer suggestions to improve the material and methods section. We have incorporated all the suggested details for image analysis and also schematic where ever it is necessary in the figures and SI figures in the revised manuscript to clarify:

      • viral M cluster measurement (Figure S1A) ; no variation in M antibody labelling or in cells was observed per se. the pic of infection regarding M clusters was always 48h pi (maximum of M clusters intensity and area.
      • F-actin intensity was considered for each cells, labelling cells with Phalloidin (for at least 30 cells in each condition), imaging z-stack and then considering the whole F-actin content for each cell.
      • Intracellular viral and F-actin ring diameter was calculating using the scale bar on 3D STED images using ImagJ.

      In particular, the details regarding the F-actin orientation measurements is lacking. Is there a consistent reference point for the orientation of the actin filaments? When comparing across two different cells, it is unclear how the orientations are normalized. Perhaps it would be more informative to plot the difference or the range in angles? Or the distribution of the differences in angles? Another point that is a bit misleading is describing this analysis as "F-actin orientation" since the term "orientation" can has a specific meaning for polar filaments such as actin. For example, given resolution limitations of the imaging approaches used in this manuscript, the authors are reporting on the orientations of bundles/populations of actins and not orientations of individual filaments relative to one another within the bundle (e.g. anti-parallel vs parallel vs branched). The authors should clarify this in the text and also further expand on the utility of their F-actin orientation analysis and how it informs us on the mechanisms of actin-mediated viral infection.

      Response:

      To quantify F-actin rearrangements, we have analyzed the orientation angle of actin nano-fibers from STED images (as in Nature Communications. 8 (2017), doi:10.1038/ncomms14347).

      For this analysis all the images were imaged with STED 2D microscopy for better resolution (axial 60 nm resolution). From STED 2D microscopy images of F-actin, the orientation angle of nano-fibers were evaluated based on the structure tensor of each nano-fibers compares to its local neighborhood using the Java plugin for ImageJ “OrientationJ”. From the given images, the OrientationJ plugin computes the structure tensor for each pixel in the image by sliding the Gaussian analysis window over the entire image. The local angle of orientation properties encoded in color and it is also generating a distribution of angles for each nano-fibers for a given image. Here, in the STED images, it is considered the vertically elongated nano-fibers as the major orientation angle (as around +90 Deg and – 90 Deg from the cell edge) and others orientation angles were calculated accordingly. Area are normalized to the distribution curve of angles to compare the changes in distribution for infected and non-infected cell (as in Fig. 3B).

      We have incorporated above explanations in the material and method section (Image analysis section, Page 11) in the revised manuscript.

      For the majority of figures and findings, they report that between "22

      Response:

      We have incorporated the exact number of cells analyzed for each condition and details about data sets used for analysis in each figure legend in the revised manuscript.

      The actin filament network can assemble into different architectures that are dependent on subcellular location. For example, actin at the basal region of the cell closest to the coverslip often assembles into stress fibers, whereas the cortical actin network often forms astral, highly branched networks. It would be important to take this into account when comparing across different cellular conditions. It is unclear if the authors were consistent with the z-slice examined for the different cellular treatment/infection conditions. Were the analyses performed on individual z-stacks or max projection images?

      Response:

      We agree with the reviewer views on actin network in different planes. Thus, to ensure reasonable quantification and comparison among conditions, all images were taken with the same objective (63x oil N 1.4) and microscope settings (same gain, same laser power). For post-processing, we mainly choose individual cells, which are not in contact with others and individual z-stacks were taken. Z-stacks images with fixed 0.3 micrometer slices for each cells were taken to ensure the whole cell was in focus. The Z-projection images of individual cells were then performed and used to calculate the F-actin or viral M cluster or ER mean intensity in the whole cell. We have analyzed the mean intensity per individual cell using a Fiji/Image J.

      We have incorporated above details in material method (image analysis) section in the revised manuscript.

      Since a major impact of this paper is the first imaging-based characterization of actin filament assembly in response to infection, the authors should provide a more comprehensive display of the raw data images. For example, figure S2 provides a nice gallery of images of actin and viral M particles, however it should show separate image channels in gray scales and consistent scaling across all images. Furthermore, all figure panels showing distinct imaging experiments and quantitative results should be complemented with a supplemental figure showing a gallery of images. This would apply to actin nanostructure rings (Figure 3C/E), filopodia and cell-to-cell contacts (Figure 4A/D), treatment with remdesivir/PKN inihibitor (Figure 6B), and ER localization of M particles (Figure S5).

      Response:

      As the reviewer suggested, we have now created an image gallery for each figure panel (Figures 3, 4, 6 and S5, S3, S8, S9) including STED images that were added as supplemental figures.

      The results in Figure 3D are difficult to interpret. The images should be larger and labeled. Also, based on the 3D STED image in Figure 3D, it appears that the brightest actin signal is actually at the center portion of the viral M cluster. Does this contradict the TEM image and what is described in the text? For Figure 3E: a more relevant analysis might be line scans across multiple images showing how relative actin-M cluster intensity varies within the dimensions of the nanostructure to demonstrate more clearly a pattern of ring assembly of both M clusters and actin.

      Response:

      Since the virus “rings” were mostly found in intracellular places, far from membrane surface, some times during imaging we observed F-actin signal from the upper plane, which is possibly the reason for brightest F-actin signal appears at the center portion of the viral M cluster. Thus, for better clarity of the image and to support our statements we have now incorporated other new images in the Figure 3E (STED 3D images) showing that an heterogeneity of the F-actin labelling but strongly associated with intracellular viral M clusters.

      The authors should address the implications and significance of the described cellular morphological changes in the context of the more physiologically relevant tissue/organ system. How do the changes they observe upon infection in isolated cultured cells compare to when these cells are assembled into tissue/organs?

      Response:

      The significance of the cellular morphological changes upon SARS-CoV2 infection showing a contraction-like effect on the cells as well as higher cells and less contact area could account in a pulmonary tissue by the destructuration of the lung tissue, consistent with the lung damaged seen in the case of COVID19. A sentence in that sense was added in the Discussion section.

      For Figure 6 and S5, the authors infected and treated cells with an inhibitor at the same time point and demonstrate that M cluster size and release is reduced to somewhat comparable levels as treatment with Remdesivir. The authors should expand their analyses for this experiment to include the other quantitative parameters outlined in the paper: F actin/M cluster nanostructures, cellular morphology, filopodia formation, orientation of actin, etc. Additionally, it would be more informative to treat cells post-infection to more closely mimic cellular conditions of infection/treatment.

      Response:

      We have now included quantitative analysis for cellular morphological changes of cells with or without drug treatment (both in the presence of PKN inhibitor and Remdesivir upon SARS-CoV-2 infection) in the revised manuscript (Supplemental Figure S7). We observed a restoration of F-actin nanostructures as well as did not observe any filopodia-like structure formation upon treatment with PKN inhibitor in infected cells.

      Minor comments:

      1. The individual data points should be overlaid on the violin plots for better interpretability of the variability in the data. Response:

      We have incorporated new violin plots with overlaid data points in the revised version of the manuscript for each figure with quantitative data (Figures 1,2,5,6).

      For Figure 3E: the images look "stretched" with an altered relative aspect ratio.

      Response:

      For sake of clarity, we have incorporated new (better) 3D STED images for a better visualization of intracellular F-actin/M clusters “rings” in revised manuscripts (Figure 3E).

      1. The authors should include a cartoon model figure highlighting both (1) how their results contribute to our knowledge of actin-mediated viral assembly/replication and (2) unknown portions of the pathway that need to be further probed to better understand the mechanistic underpinnings of this process.

      Response:

      We have now included a model scheme figure summarizing our results in the revised manuscript, as a new figure 7.

      There have been several high resolution cellular imaging studies using other complementary 3D volumetric imaging approaches (e.g. cryo-electron tomography and FIB/SEM) to characterize the subcellular ultrastructure’s of SARS-CoV-2 infection. The authors should include a brief discussion on how their study complement or compare to these reports, in particular noting whether or not actin filament assemblies were observed in these data.

      Response:

      Thanks to the reviewers for this very pertinent remark, we have added in the Discussion (Page 7,8), a section commenting previous high-resolution cellular imaging studies (REFERENCES: Mendonçà et al Nature Comm 12, 4629, 2021; Klein et al 2020) comparing our 2D/3D STED imaging with complementary 3D EM or 3D cryo-ET or FIB/SEM of SARS-CoV-2 infected cells recently published.

      From Mendonca et al 2021, one can see some intracellular dense structure underneath the CoV-2 budding membrane area, but not able to see if F-actin filaments were present or not. It would be difficult to observe because the vRNP underneath the Spike decorating membrane are very dense. The study was focus on viral assembly and egress using cryo-ET/FIB but not on F-actin filament per se. We don’t know if their imaging conditions would preserve F-actin fibers on membranes. On the other side, when studying virus egress, then we can clearly see CoV-2 individual particles surfing on giant filopodia-like structures very much resembling our STED imaging of viruses on filopodia 48h pi. We can clearly see and recognize parallel F-actin filament bundles inside the enlarged filopodia (Figure 5 D/E) with viruses on it.

      Same results were observed using Cryo-EM tomography in another study (Klein et al 2020) where one can see viruses on filopodia for many cell types A549-hACE2, VeroE6, Calu3 infected cells.

      Reviewer 2

      The authors investigate the role of F-actin in infected human pulmonary alveolar A549-hACE2 cells. They investigate infection progression at different time points by the detection of the M protein by confocal microscopy and western blot. They compare the detection of M with S and N in western blot and with viral RNA detection by Q-PCR. The authors correlate M cluster formation to peak at 48h p.i. with particle assembly and particle release at 72h p.i. An increase in F-actin at 24h and 48h p.i. was monitored by confocal microscopy and z-stacks, whereas the overall amount of actin determined by western blot was not changed. Using 2D STED microscopy the authors identified F-actin rearrangement from stress fibers to filamentous protrusions at 24h-48h p.i. and conclude importance for particle assembly and release. By 3D STED microscopy M labeled intracellular organelles called "viral rings" surrounded by actin called "actin rings" are shown. By transmission electron microscopy (TEM) vesicular structures with budding particles were shown at intracellular membranes. The authors conclude from these findings that F-actin stabilizes assembly platforms at membranes or support the transport of virus loaded vesicles to the plasma membrane. The authors found more and longer filopodia in infected cells which were loaded with virus particles bridging cells suggesting role in cell-cell spread. At the plasma membrane they found bigger particles and at the filopodia smaller, suggesting release from the plasma membrane in packages.

      Transcriptom analysis of non infected and SARS-CoV-2 infected A549-hACE2 revealed upregulation of Rho-GTPases activated proteins like PKN and α-actinins upon infection. The levels of α-actinins in WB were 2-fold higher in infected cells. The authors show that inhibitors of Rho/SRF and PKN restored cell morphology, reduced M cluster formation and virus release. The PKN inhibitor blocked M in the ER. The authors conclude from this data a role of the alpha-actinins superfamily in SARS-CoV-2 assembly and egress.

      Major comments:

      The presented data are convincing but some figures may need improvements, see in minor comments. For some conclusions, more evidences like marker staining may be needed.

      Response:

      In accordance with the reviewer, we have significantly improve the figures in the revised manuscript. We identified that the intracellular compartment containing budding viruses were derived from the ER (gpr78 marker) – shown in revised Figure 6 - and not in lysosomes (Lamp1 marker) or extracellular vesicles (CD81 marker) – See new supplemental revised Figure S10. We have included all the new results and discussion in revised manuscripts.

      Minor comments:

      The authors conclude that F-actin stabilizes assembly platforms at membranes like ERGIC, but an ERGIC marker staining is not provided. The authors suggest that F-actin might also be involved in transport of virus loaded vesicles to the plasma membrane. Here a plasma membrane marker or native staining of particles may help to descriminate between intracellular Exosomes and extracellular particles. Co-staining with exosomal markers would also be more convincing.

      Response:

      Also as per reviewer suggestion we have identified that the intracellular compartment containing budding viruses were derived from the ER (gpr78 marker) – shown in revised Figure 6, Fig. S8. New quantitative analysis (including with PKN inhibitor) to support the data also included in the figures. Also we have used lysosomes marker LAMP1 and extracellular vesicles (EV) marker CD81 and we found that there was no colocalization with Viral M clusters ( Supporting Figure 10). we have tried the ERGIC marker grp53 without any success so far,

      Further, It is well documented on CryoFIB/SEM study of SARS-CoV-2-infected cells suggested the presence of “exit tunnels”, linking virion-rich intracellular vacuoles to the extracellular space (Mendonça, L. et al. Nature Communications 12, (2021)). The size of this vacuoles observed in the cell periphery was approximately 1 µm, which is well corelated with actin and viral ring we have observed from STED 2D images Also the author suggested that SARS-CoV-2 could possibly egress through these tunnels by a mechanism of exocytosis from these large intracellular vacuoles.

      We have now included all above results and discussions in revised manuscripts to support our claims.

      Figure S1 A. Align individual pictures in one line and do not overlap, scale bars not readable, Is in each picture the same magnification shown? Show representative pictures with the same area magnification!

      Response:

      Thanks to the reviewer to point out these imperfections, so we have improved the figures accordingly in the revised manuscript. Individual images are aligned properly, scale bars are readable, images are with the same magnification.

      Figure 3C and 3E for better orientation magnified areas should be indicated as squares, not in circles.

      Response:

      As suggested, we have modified the figure 3 accordingly in the revised manuscript.

      Figure S4 quality of pictures not appropriate to see differences.

      Response:

      We improved the figure quality in the revised manuscript (see new Figures 6 and S8)

      Fig S5 All pictures overlap in one? ER marker in blue very difficult to read.

      Response:

      We have modified the new figure S8 as such as the ER marker is visible (in magenta color) in the revised manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors investigate the role of F-actin in infected human pulmonary alveolar A549-hACE2 cells. They investigate infection progression at different time points by the detection of the M protein by confocal microscopy and western blot. They compare the detection of M with S and N in western blot and with viral RNA detection by Q-PCR. The authors correlate M cluster formation to peak at 48h p.i. with particle assembly and particle release at 72h p.i. An increase in F-actin at 24h and 48h p.i. was monitored by confocal mircroskopy and z-stacks, whereas the overall amount of actin determined by western blot was not changed. Using 2D STED microscopy the authors identified F-actin rearrangement from stress fibers to filamentous protrusions at 24h-48h p.i. and conclude importance for particle assembly and release. By 3D STED microscopy M labeled intracellular organelles called "viral rings" surrounded by actin called "actin rings" are shown. By transmission electron microscopy (TEM) vesicular structures with budding particles were shown at intracellular membranes. The authors conclude from these findings that F-actin stabilizes assembly platforms at membranes or support the transport of virus loaded vesicles to the plasma membrane. The authors found more and longer filopodia in infected cells which were loaded with virus particles bridging cells suggesting role in cell-cell spread. At the plasma membrane they found bigger particles and at the filopodia smaller, suggesting release from the plasma membrane in packages. Transcriptom analysis of non infected and SARS-CoV-2 infected A549-hACE2 revealed upregulation of Rho-GTPaes activated proteins like PKN and α-actinins upon infection. The levels of α-actinins in WB were 2-fold higher in infected cells. The authors show that inhibitors of Rho/SRF and PKN restored cell morphology, reduced M cluster formation and virus release. The PKN inhibitor blocked M in the ER. The authors conclud from this data a role of the alpha-actinins superfamily in SARS-CoV-2 assembly and egress.

      Major comments:

      The presented data are convincing but some figures may need improvements, see in minor comments. For some conclusions more evidences like marker staining may be needed.

      Minor comments:

      The authors conclude that F-actin stabilizes assembly platforms at membranes like ERGIC, but an ERGIC marker staining is not provided. The autors suggest that F-actin migth also be involved in transport of virus loaded vesicles to the plasma membrane. Here a plasma membrane marker or native staining of particles may help to descriminate between intracellular Exosomes and extracellular particles. Co-staining with exosomal markers would also be more convincing. - Figure S1 A. Align individual pictures in one line and do not overlap, scale bars not readable, Is in each picture the same magnification shown? Show representative pictures with the same area magnification! - Figure 3C and 3E for better orientation magnified areas should be indicated as squares, not in circles. - Figure S4 quality of pictures not appropriate to see differences. - Fig S5 All pictures overlap in one? ER marker in blue very difficult to read.

      Significance

      The presented data provide a nice peace of work to the knoweledge on SARS-COV-2 replication in human pulmonary cells. The authors use advanced imaging and molecular biology methods for their experiments. The indentified cellular target may help to develop specific inhibitors for antiviral therapy.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The authors used conventional confocal and super-resolution STED microscopy to characterize the actin filament network in response to SARS-CoV-2 infection in pulmonary cells. They demonstrate that, although total levels of actin are unchanged, F-actin polymerization increases upon infection, with the most significant changes occurring at 48 hours post infection. Notably, F-actin remodels from primarily stress-fiber architectures to circularized, F-actin nanostructures that tend to colocalize with viral M cluster rings at 48 hours post infection. Additionally, there is a significant increase in F-actin-associated filopodia-like structures, with an example of a possible cell-to-cell filopodia that could possibly be a mode of inter-cellular viral transmission. The authors complement their imaging-based experiments with RNAseq to profile the cellular gene expression of SARS-CoV-2 infected pulmonary cells, revealing an upregulation of RHO GTPases activate PKNs and alpha-actinins. They show that treatment of pulmonary cells with Rho/SFR and PKN inhibitors during infection decreases the size of viral M clusters and release to comparable levels as the known viral therapeutic, Remdesivir.

      Major comments:

      1. The majority of the author's conclusions are based off of qualitative and quantitative analysis of their fluorescence images. While they do mention briefly an ImageJ plug-in and the statistical tests performed, the description of their quantitative image-based analyses for each experiment is lacking. For example, how was viral M cluster and actin intensity measured? How was the signal intensity normalized to account for variations in antibody labeling or other cell-to-cell variations? For figures 3C&D, how did the authors calculate viral and actin ring diameter? It is necessary to expand on the details of the quantitative analysis for each parameter mentioned in the methods section and/or include a figure panel demonstrating the details of the analysis (similar to what is nicely displayed for M cluster size in Figure S1B).
      2. In particular, the details regarding the F-actin orientation measurements is lacking. Is there a consistent reference point for the orientation of the actin filaments? When comparing across two different cells, it is unclear how the orientations are normalized. Perhaps it would be more informative to plot the difference or the range in angles? Or the distribution of the differences in angles? Another point that is a bit misleading is describing this analysis as "F-actin orientation" since the term "orientation" can has a specific meaning for polar filaments such as actin. For example, given resolution limitations of the imaging approaches used in this manuscript, the authors are reporting on the orientations of bundles/populations of actins and not orientations of individual filaments relative to one another within the bundle (e.g. anti-parallel vs parallel vs branched). The authors should clarify this in the text and also further expand on the utility of their F-actin orientation analysis and how it informs us on the mechanisms of actin-mediated viral infection.
      3. For the majority of figures and findings, they report that between "22<n<50 cells" were analyzed. The authors should be more specific of the exact sample size for each experiment/figure panel displayed. In particular, it is unclear in a few figure panels showing exemplar images whether or not this is the full sample size (n=1) or just an exemplar image. I recommend reporting specifically in the figure legend and/or a supplemental table outlining the sample size and analysis used for each imaging experiment to add clarify to their quantitative analysis and strengthen their claims.
      4. The actin filament network can assemble into different architectures that are dependent on subcellular location. For example, actin at the basal region of the cell closest to the coverslip often assembles into stress fibers, whereas the cortical actin network often forms astral, highly branched networks. It would be important to take this into account when comparing across different cellular conditions. It is unclear if the authors were consistent with the z-slice examined for the different cellular treatment/infection conditions. Were the analyses performed on individual z-stacks or max projection images?
      5. Since a major impact of this paper is the first imaging-based characterization of actin filament assembly in response to infection, the authors should provide a more comprehensive display of the raw data images. For example, figure S2 provides a nice gallery of images of actin and viral M particles, however it should show separate image channels in gray scales and consistent scaling across all images. Furthermore, all figure panels showing distinct imaging experiments and quantitative results should be complemented with a supplemental figure showing a gallery of images. This would apply to actin nanostructure rings (Figure 3C/E), filopodia and cell-to-cell contacts (Figure 4A/D), treatment with remdesivir/PKN inihibitor (Figure 6B), and ER localization of M particles (Figure S5).
      6. The results in Figure 3D are difficult to interpret. The images should be larger and labeled. Also, based on the 3D STED image in Figure 3D, it appears that the brightest actin signal is actually at the center portion of the viral M cluster. Does this contradict the TEM image and what is described in the text? For Figure 3E: a more relevant analysis might be line scans across multiple images showing how relative actin-M cluster intensity varies within the dimensions of the nanostructure to demonstrate more clearly a pattern of ring assembly of both M clusters and actin.
      7. The authors should address the implications and significance of the described cellular morphological changes in the context of the more physiologically relevant tissue/organ system. How do the changes they observe upon infection in isolated cultured cells compare to when these cells are assembled into tissue/organs?
      8. For Figure 6 and S5, the authors infected and treated cells with an inhibitor at the same time point and demonstrate that M cluster size and release is reduced to somewhat comparable levels as treatment with Remdesivir. The authors should expand their analyses for this experiment to include the other quantitative parameters outlined in the paper: F actin/M cluster nanostructures, cellular morphology, filopodia formation, orientation of actin, etc. Additionally, it would be more informative to treat cells post-infection to more closely mimic cellular conditions of infection/treatment.

      Minor comments:

      1. The individual data points should be overlaid on the violin plots for better interpretability of the variability in the data.
      2. For Figure 3E: the images look "stretched" with an altered relative aspect ratio.
      3. The authors should include a cartoon model figure highlighting both (1) how their results contribute to our knowledge of actin-mediated viral assembly/replication and (2) unknown portions of the pathway that need to be further probed to better understand the mechanistic underpinnings of this process.
      4. There have been several high resolution cellular imaging studies using other complementary 3D volumetric imaging approaches (e.g. cryo-electron tomography and FIB/SEM) to characterize the subcellular ultrastructures of SARS-CoV-2 infection. The authors should include a brief discussion on how their study complement or compare to these reports, in particular noting whether or not actin filament assemblies were observed in these data.

      Significance

      Impact:

      This manuscript provides the first characterization of the architecture of the actin filament network upon SARS-CoV-2 infection. Since actin filament remodeling is a mechanism used by several other viruses, there is considerable interest in targeting these assemblies for the development of therapeutics to prevent and treat infection. This manuscript lays the groundwork for more detailed analysis probing the mechanisms mediating actin-mediated viral entry, replication, and release. Furthermore, it establishes some quantitative tools to standardize how this process is studied and analyzed in future studies.

      Audience:

      I anticipate that this work will motivate future studies aimed at further ultrastructural characterization of actin and other cytoskeletal filaments by complementary, high-resolution imaging techniques, as well as studies aimed at screening for small molecule drugs to inhibit actin-mediated viral infection.

      Field of expertise:

      cellular cryo-electron tomography, quantitative imaging, cytoskeletal-based motility, functional cytoskeleton-organelle interactions. Insufficient expertise to evaluate RNAseq experiments.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      *Reviewers comments in italics *

      We thank all reviewers for their positive and encouraging comments and criticisms to improve our work. Here we present a reviewed version of the manuscript according to the comments risen.

      • Reviewer #1 (Evidence, reproducibility and clarity (Required)): This is an interesting paper that identifies Tns3 as a potential effector of oligodendrocytes differentiation based on an ingenious strategy comparing regulatory binding sites of known master regulators of differentiation, and then shows using in vivo genetics that this role is indeed correct. Next, a potential mechanism is identified by showing co-localization with beta 1 integrin, known to regulate apoptosis of newly-formed oligodendrocytes. The results are well illustrated and the experiments performed with appropriate power using a broad range of techniques that combine in silico, in vitro and in vivo work to great effect.

      I think this represents an important contribution that will be of significant interest to neuroscientists - the mechanisms regulating oligodendrocytes generation remain poorly understood and the evidence that this contributes to adult learning (adaptive myelination) and CNS regeneration makes this a key question. I would suggest that the following are considered before publication: We thank the reviewer for this positive comments and critics to improve the manuscript. The work describing the KO mice that were not used as they proved unsuitable need not be described - it breaks the logical flow.*

      In agreement with the reviewer comment, we have reduced this part to a sort paragraph indicating that our analyses of several Tns3 constitutive KO lines showed developmental lethality and possible genetic compensation in Tns3 expression, leading us to conclude them inappropriate tools to study Tns3 function in oligodendrogenesis. We have summarized the data in Fig. S7 and the description in the method section.

      It would be useful to compare the extent of cell death in the Tns3 cKO mice with that described in the alpha6 integrin KO and the integrin beta1 cKO (the Colognato and Benninger papers). Do they match? If not (and I suspect the Tns3 cKO death is greater) could other mechanisms be downstream of the Tns3?

      In agreement with the reviewer comment, we have added the following paragraph to the discussion:

      ‘Knockout mice for integrin-a6 present a 50% reduction in brainstem MBP+ OLs at E18.5, just before they die at birth, accompanied by an increase in TUNEL+ dying OLs (Colognato et al, 2002). Similarly, conditional deletion of integrin-b1 in immature OLs by Cnp-Cre also leads to a 50% reduction in cerebellar OLs at P5, with a parallel increase in TUNEL+ dying OLs (Benninger et al., 2006). Therefore, given that Tns3-induced deletion in postnatal OPCs also leads to 40-50% reduction in OLs in both grey and white matter regions of the postnatal telencephalon (this study), paralleled by similar increase in TUNEL+ apoptotic oligodendroglia, we suggest that Tns3 is required for integrin-b1 mediated survival signal in immature oligodendrocytes.’

      I'm not sure why the authors argue that the activation of beta 1 would not be informative experiment? This will regulate actin dynamics just as it regulates other integrin signaling pathways. Indeed, I would argue that an integrin activation experiments would be a neat way to prove mechanism (as it would be predicted to rescue the Tns3 cKO phenotype).

      In agreement with the reviewer comment, we have removed this sentence: ‘If so, exogenous activation of integrin a6b1 in cultured OPCs by Mn2+ (Colognato et al., 2004) would not be expected to increase oligodendrogenesis in Tns3-iKO oligodendroglia.’

      In an effort, to understand Tns3 function by acute Tns3-deletion in postnatal OPCs, we have compared the transcriptome of Tns3-iKO oligodendroglia compared to control cells, and we present these results in figure 7 pinpointing deregulated genes leading to reduced oligodendroglial differentiation, integrin dysregulation, increase apoptosis, and conflicting cell cycle signaling, and leaving for further studies the full characterization how the loss of Tns3 leads to the deregulation of these processes.

      Can the authors provide any data on GM oligos and their OPCs? Is the requirement for Tns3 the same, and if so what might the implications be in the adult where new oligodendrocytes are being generated throughout life?

      Indeed, in our analyses of Tns3-iKO mice, we provide quantifications of the cortex as a grey matter territory, showing a similar 40-50% reduction in OLs as in white matter areas (corpus callosum and fimbria, and mixed regions such as the striatum.

      I note in S13 that integrin beta1 is not highly expressed in human oligos at the time in question. Does this call into question the relevance for human disease?

      We realize that scRNAseq plots are never easy to interpret but it is important to note that the levels of expression are coded by the intensity of the color scale, while the surface of the dot plots indicate the experimental sensitivity to detect transcript expression in a larger or smaller proportion of the cells in a given cluster/cell type (due to the drop out limitation of current single cell RNA-seq technologies). Considering this, please note that beyond a stronger expression in neural progenitor cells (NPCs, blue color), integrin-b1 (Itgb1) transcripts are expressed at medium to high levels (green to blue) in human immature OLs (Fig. S13B), similar to their pattern of expression in mouse oligodendroglia (Fig. S13A).

      Reviewer #1 (Significance (Required)): See above

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      *In this article, the authors identify and characterise Tensin3 (Tns3) as a target of key oligodendroglial transcription factors driving differentiation in the mouse. They use multiple transgenic models to describe loss of function, and suggest Tns3's action through integrin B1 signalling, with the key function being oligodendroglial survival.

      There is extensive and impressive work here, including identification of Tns3 by ChIPseq, expression of Tns3 in brain development, analysis of human (ES-derived) and mouse scRNAseq to infer timing of expression in the differentiation pathway, generation of V5-tagged Tns3-KI mice to overcome antibody limitations, identification of its expression in mouse remyelination, generation of a new Tns3KO mouse, in vivo Crispr Tns3KO in development, generation of a conditional KO, for deletion in adulthood, and finally some culture work to investigate potential mechanisms of actions. The bottom line is that Tns3 is required for survival of OPCs and immature oligodendrocytes in development/remyelination in mouse at least, and loss leads to apoptosis (through p53 increase and loss of integrin-B1 signalling), leading to a failure of proper differentiation.

      The experiments are carefully done, convincing and the tools generated impressive. There is clearly more to be done on clarifying the mechanism of action of Tns3, but I do not think further experiments on this topic are needed for this paper - they can wait for the next.*

      We thank the reviewer for the positive and encouraging reviewing comments. In an effort, to understand Tns3 function by acute Tns3-deletion in postnatal OPCs, we have compared the transcriptome of Tns3-iKO oligodendroglia compared to control cells, and we present these results in figure 7 pinpointing deregulated genes leading to reduced oligodendroglial differentiation, integrin dysregulation, increase apoptosis, and conflicting cell cycle signaling, and leaving for further studies the full characterization how the loss of Tns3 leads to the deregulation of these processes.

      My only query is whether the expression of Tns3 is also in immature OLs in human brain (rather than human ES-derived OLs). This should be easily checked with interrogation of online Shiny apps from already published snRNAseq from various groups on human post mortem adult brain, but if not present then in also baby/fetal brain. This would be interesting and may well be different from the ES_derived cells which tend to be very immature and would add interest to the possible translational impact.

      According to the suggestion of the reviewer, we analyzed 69,174 snRNAseq GW9-GW22 from fetal cerebellum,; Aldinger & Miller, 2021; https://doi-org.proxy.insermbiblio.inist.fr/10.1038/s41593-021-00872-y), which we present now in Figure S3, finding a cluster of cells expressing iOL markers, including NKX2-2, TNS3, ITPR2, and BCAS1, similar to the hiPSCs-derived iOL1/iOL2 clusters and mouse iOL1/iOL2 clusters shown in Fig. S2.

      We also analyzed other datasets without finding iOLs given their age or numbers, including:

      • Immunopanned PDGFRA+ cells from human cortex GW20-GW24 (2690 cells, Huang and Kriegstein, Cell 2020) finding OPCs but not iOLs.

      -The recently published dataset from GW8-GW10 human forebrain oligodendroglia (van Brugen & Castelo-Branco, Dev Cell 2022; https://doi.org/10.1016/j.devcel.2022.04.016) containing OPCs but not iOLs.

      -The GW17 to GW18 human cortex (40,000 cells, Polioudakis & Geschwind, 2019, https://doi.org/10.1016/j.neuron.2019.06.011) containing OPCs but not iOLs.

      Reviewer #2 (Significance (Required)): This work extends our knowledge of oligodendroglial differentiation, links it to the ECM and provides interest in manipulating this in diseases including glioma. My expertise: myelin, oligodendroglia, remyelination, human neuropathology

      *Reviewer #3 (Evidence, reproducibility and clarity (Required)): *

      see below Reviewer #3 (Significance (Required)): Using purified oligodendrocytes target genes of key regulators of oligodendrocyte differentiation were analyzed, which led to the identification of Tensin-3. The authors performed a detail characterization of Tensin-3 expression. They found that Tensin-3 is highly expressed in immature mouse and human oligodendrocytes. Interestingly, Tensin-3 is selectively enriched in immature oligodendrocytes, and not present at detectable levels in OPCs and mature oligodendrocytes. Subsequently, the authors characterized Tensin-3 function by a series of knockdown approaches in vitro and in vivo. These series of experiments revealed an essential function of Tensin-3 in supporting oligodendrocytes survival. In the absence of Tensin-3 a large fraction of oligodendrocytes undergo apoptosis while differentiating to mature oligodendrocytes. This is a remarkable study applying an impressive array of methods that led to an important discovery in the field of oligodendrocyte biology. The main advances for the field are: 1) identification of a novel marker for premyelinating oligodendrocytes, 2) elucidation of Tensin-3 as a pro-survival factor in oligodendrocytes differentiation, 3) evidence of link of Tensin-3-integrin signal in survival of oligodendrocytes. The data is well presented and organized, and the paper well written. I recommend publication with only minor suggestions for a revision:

      • *

      We thank the reviewer for this positive comments and critics to improve the manuscript.

      In Figure 2, only images are shown, and the data is referred to as highly expressed or strong co-localization. Even if the data looks clear, the authors should provide some quantification of the data in the figure.

      We thank the reviewer for his comment and we have now provided a quantification of the fraction of Tns3+ cells expressing different markers of oligodendrocyte lineage progression/stages, and the percentage of each stage expressing Tns3.

      Figure 3 is given too much weight in the manuscript text. I would recommend to shorten the text in the result section, and to move this figure to the supplement as it does not advance the story. It mainly shows that the KO mice still express transcripts in the brain. Were the transcripts lost in peripheral tissue?

      • *

      As mentioned above, in agreement with the reviewers #1 and #3 comments, we have reduced this part to a sort paragraph indicating that our analyses of several Tns3 constitutive KO lines showed developmental lethality and possible genetic compensation in Tns3 expression, leading us to conclude them inappropriate tools to study Tns3 function in oligodendrogenesis. We have summarized the data in Fig. S7 and the description in the method section.

      Page 11: the authors describe in the text how the floxed allele was generated. This should be shifted to the supplement.

      According to reviewers suggestion, we have moved the description of Tns3 floxed allele generation to the Methods section. Page 16: the authors refer to Bcas1 as a problematic marker for immature oligodendrocytes, because the transcript is also expressed in mature oligodendrocytes. The authors are correct that the transcript is expressed in mature oligodendrocytes. However, the proteins changes its localization when oligodendrocytes mature. On protein level, it is valuable and a selective marker, as antibodies only label pre-myelinating and actively myelinating cells. In mature oligodendrocytes, antibodies against Bcas1 do not label the cell, only myelin. The text is misleading and needs to be corrected.

      In agreement with reviewers comment we have modified the text as follows: ‘An optimized protocol for immunodetection using Bcas1-recognizing antibodies has been shown to label iOLs (Fard et al., 2017).’

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      see below

      Significance

      Using purified oligodendrocytes target genes of key regulators of oligodendrocyte differentiation were analyzed, which led to the identification of Tensin-3. The authors performed a detail characterization of Tensin-3 expression. They found that Tensin-3 is highly expressed in immature mouse and human oligodendrocytes. Interestingly, Tensin-3 is selectively enriched in immature oligodendrocytes, and not present at detectable levels in OPCs and mature oligodendrocytes. Subsequently, the authors characterized Tensin-3 function by a series of knockdown approaches in vitro and in vivo. These series of experiments revealed an essential function of Tensin-3 in supporting oligodendrocytes survival. In the absence of Tensin-3 a large fraction of oligodendrocytes undergo apoptosis while differentiating to mature oligodendrocytes.

      This is a remarkable study applying an impressive array of methods that led to an important discovery in the field of oligodendrocyte biology. The main advances for the field are: 1) identification of a novel marker for premyelinating oligodendrocytes, 2) elucidation of Tensin-3 as a pro-survival factor in oligodendrocytes differentiation, 3) evidence of link of Tensin-3-integrin signal in survival of oligodendrocytes. The data is well presented and organized, and the paper well written.

      I recommend publication with only minor suggestions for a revision:

      In Figure 2, only images are shown, and the data is referred to as highly expressed or strong co-localization. Even if the data looks clear, the authors should provide some quantification of the data in the figure.

      Figure 3 is given too much weight in the manuscript text. I would recommend to shorten the text in the result section, and to move this figure to the supplement as it does not advance the story. It mainly shows that the KO mice still express transcripts in the brain. Were the transcripts lost in peripheral tissue?

      Page 11: the authors describe in the text how the floxed allel was generated. This should be shifted to the supplement.

      Page 16: the authors refer to Bcas1 as a problematic marker for immature oligodendrocytes, because the transcript is also expressed in mature oligodendrocytes. The authors are correct that the transcript is expressed in mature oligodendrocytes. However, the proteins changes its localization when oligodendrocytes mature. On protein level, it is valuable and a selective marker, as antibodies only label pre-myelinating and actively myelinating cells. In mature oligodendrocytes, antibodies against Bcas1 do not label the cell, only myelin. The text is misleading and needs to be corrected.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this article, the authors identify and characterise Tensin3 (Tns3) as a target of key oligodendroglial transcription factors driving differentiation in the mouse. They use multiple transgenic models to describe loss of function, and suggest Tns3's action through integrin B1 signalling, with the key function being oligodendroglial survival.

      There is extensive and impressive work here, including identification of Tns3 by CHIPseq, expression of Tns3 in brain development, analysis of human(ES-derived) and mouse scRNAseq to infer timing of expression in the differentiation pathway, generation of V5-tagged Tns-KI mice to overcome antibody limitations, identification of its expression in mouse remyelination, generation of a new Tns3KO mouse, in vivo crispr Tns3KO in development, generation of a conditional KO, for deletion in adulthood, and finally some culture work to investigate potential mechanisms of actions. The bottom line is that Tns3 is required for survival of OPCs and immature oligodendrocytes in development/remyelination in mouse at least, and loss leads to apoptosis (through p53 increase and loss of integrinB1 signalling), leading to a failure of proper differentiation.

      The experiments are carefully done, convincing and the tools generated impressive. There is clearly more to be done on clarifying the mechanism of action of Tns3, but I do not think further experiments on this topic are needed for this paper - they can wait for the next.

      My only query is whether the expression of Tns3 is also in immature OLs in human brain (rather than human ES-derived OLs). This should be easily checked with interrogation of online Shiny apps from already published snRNAseq from various groups on human post mortem adult brain, but if not present then in also baby/fetal brain. This would be interesting and may well be different from the ES_derived cells which tend to be very immature and would add interest to the possible translational impact.

      Significance

      This work extends our knowledge of oligodendroglial differentiation, links it to the ECM and provides interest in manipulating this in diseases including glioma.

      My expertise: myelin, oligodendroglia, remyelination, human neuropathology

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is an interesting paper that identifies Tns3 as a potential effector of oligodendrocytes differentiation based on an ingenious strategy comparing regulatory binding sites of known master regulators of differentiation, and then shows using in vivo genetics that this role is indeed correct. Next, a potential mechanism is identified by showing co-localization with beta 1 integrin, known to regulate apoptosis of newly-formed oligodendrocytes. The results are well illustrated and the experiments performed with appropriate power using a broad range of techniques that combine in silico, in vitro and in vivo work to great effect.

      I think this represents an important contribution that will be of significant interest to neuroscientists - the mechanisms regulating oligodendrocytes generation remain poorly understood and the evidence that this contributes to adult learning (adaptive myelination) and CNS regeneration makes this a key question. I would suggest that the following are considered before publication:

      The work describing the KO mice that were not used as they proved unsuitable need not be described - it breaks the logical flow.

      It would be useful to compare the extent of cell death in the Tns3 cKO mice with that described in the alpha6 integrin KO and the integrin beta1 cKO (the Colognato and Benninger papers). Do they match? If not (and I suspect the Tns3 cKO death is greater) could other mechanisms be downstream of the Tns3?

      I'm not sure why the authors argue that the activation of beta 1 would not be informative experiment? This will regulate actin dynamics just as it regulates other integrin signaling pathways. Indeed, I would argue that an integrin activation experiments would be a neat way to prove mechanism (as it would be predicted to rescue the Tns3 cKO phenotype).

      Can the authors provide any data on GM oligos and their OPCs? Is the requirement for Tns3 the same, and if so what might the implications be in the adult where new olligodendrocytes are being generated throughout life?

      I note in S13 that integrin beta1 is not highly expressed in human oligos at the time in question. Does this call into question the relevance for human disease?

      Significance

      See above

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      The manuscript by Tran et al. describes the mechanism by which IFNa treatment prevents the development of liver CRC metastasis in several mouse models. They show how continuous administration of IFNa strength liver vascular barrier by a direct effect on endothelial cells and avoids the trans-sinusoidal migration of tumour cells.

      Major points:

      1. Authors use an elegant orthotopic model of liver metastasis to confirm the effect of continuous IFNa on hepatic colonization (Fig.3). Although they extensively characterize the metastatic lesions, they do not show data on the potential impact of IFNa treatment in the primary caecum tumour. Authors should clarify if the described effects are taken place in the liver or/and in the caecum. It would be interesting to show if IFNa affects the primary tumour size, the extravasation of cancer cells and the immune infiltration since all these factors could have an impact in the number of liver lesions.

      We thank the reviewer for acknowledging the importance of our results particularly in the context of the orthotopic mouse model we developed. We agree that displaying the results of continuous IFNα therapy on primary intracecal tumors, as well as the results pertaining to the few mice that develop microscopic or macroscopic liver metastasis, is important for the interpretation of our work. Thus, we evaluated the dimension of primary intracecal CRC lesions (Fig 3D,E) and we performed additional IHC characterization of the primary tumors (Fig S4A,B). The analysis showed that the dimension of the primary lesions and the markers we analyzed were non significantly modified by continuous IFNα therapy (Fig 3D,E and Fig S4A,B). These results favor the hypothesis that IFNα therapy does not modify the number of cells that spread from the primary tumors and seed into the liver, but it rather impinges on the intravascular containment of CRC cells circulating within the liver (Fig 3F). As said earlier, the data also highlight the possibility that CRC tumors may become refractory to IFNα or that the dose and schedule we adopted does not significantly affect the growth of established liver CRCs at late time points. The data are also consistent with results obtained with MC38Ifnar1_KO CRC cells indicating that continuous IFNα therapy does not require Ifnar1 expression by tumor cells to exert its antimetastatic function (Fig 4A,C-D). This is also in line with the high IFNα concentrations required to activate the "tunable" direct antiproliferative functions of this cytokine that exceed those achieved in our system (Catarinella et al, 2016; Schreiber, 2017). Text has been added in the revised manuscript at lines 175-197 and in the discussion lines 425-431.

      1. Figure 3f right shows liver images without any obvious metastatic lesion. Since authors are analysing the effect of IFNa treatment in proliferation, vascularization and immune composition in liver tumours, they may show and quantify images with metastatic lesions and restrict the analysis to the tumour area.

      Since the main finding of our manuscript regards the prevention of hepatic colonization by continuous IFNα therapy, we think that the original data presented in Fig 3G,H are representative of the overall efficacy of our strategy that confers protection in up to 60% of the mice carrying intramesenteric tumors of increasing dimensions (Fig 3H). We have thus maintained our original results, adding the quantification of all IHC data on groups of Sham control livers (n=6), as suggested. In any case, we also included the same IHC characterization of the few and small intrahepatic lesions that have bypassed the intravascular antimetastatic barrier (Fig S4C,D). Indeed, in agreement with the results observed in primary intracecal lesions, these metastatic lesions that developed in IFNαtreated mice showed similar markers of cell proliferation, neoangiogenesis, F4/80 macrophages and CD3+ T cells, as control lesions detected in NaCl-treated mice. Once again, the results highlight the possibility that CRC tumors, once established as micro/macroscopic metastases, may become refractory and resistant to IFNα therapy by downregulating the Ifnar1 in various components of the tumor microenvironment (Boukhaled et al., 2021; Katlinski et al., 2017). Text has been added in the revised manuscript at lines 175-197 and in the discussion lines 496-515.

      1. Authors analyse the recombination efficiency of different mouse CRE lines by non-quantitative methods (PCR of hepatic genomic DNA and GFP expression by immunofluorescence in healthy liver). Since PDGFRβ-Cre/ERT2 and CD11c-Cre lines are used to exclude a role of IFNa on the targeted cells, authors should provide stronger evidences to support this. They may consider studding the ablation of Ifnar1 in FACS sorted fibroblasts and myeloid cells. Moreover, it would be important showing the proportion of GFP+ cells in the sorted populations to understand how broadly these stromal populations are targeted.

      We thank the referee for raising this important issue, which is related to the relative efficiency of Ifnar1 recombination in each of the Cre-expressing mouse models we have used in the study. To this regard, we newly performed an extensive colocalization analysis quantifying the percentage of GFP+ cells that colocalize with cell specific markers (i.e., PDGFRβ, CD11c, F4/80 and CD31) of the various mouse models (PDGFRβCreERT2, CD11cCre and VeCadCreERT2, respectively) crossed with RosaZsGreen reporter mice. Colocalization analysis of GFP in the different systems was performed using the ImageJ “colocalization” algorithm developed by Pierre Bourdoncle (Institut Jacques Monod, Service Imagerie, Paris; 2003–2004). The method allows the generation of unsupervised profiles of co-localized pixels between two channels. This methodology has been included in the section Methods and Protocols, line 806-809. Of note, we observed an almost complete recombination in liver fibroblast (GFP+/PDGFRβ+), with about 98.2 ± 0.72% hepatic stellate cells that co-expressed GFP+ and PDGFRβ+ signals (see the new Fig S5E). Similarly, hepatic DCs (GFP+/CD11c+) had 94.17 ± 2.16% colocalization, while F4/80+ KCs or LCMs (GFP+/F4/80+) colocalized in 78.14 ± 5.03% (see the new Fig S5E). Finally, HECs, including LSECs, (GFP+/CD31+) showed 85.3 ± 5.03% colocalization (see the new Fig S5E,F), with no expression of GFP signals in cells other than CD31+. Note that these values indicate an almost complete colocalization of the Cre recombinase in the target cell types analyzed (see representative IF shown in Fig S5E). Text has been added in the revised manuscript at lines 225-233. Moreover, DEGs analysis between NaCl-treated VeCadIfnar1_KO and Ifnar1fl/fl HECs showed a significant downregulation of Ifnar1 expression in CD31+ VeCadIfnar1_KO cells, with a log2 fold-change of -0.387 and an adjusted p-value of 0.033, further confirming Cre recombination in HECs isolated from VeCadIfnar1_KO mice (as depicted in the heatmap of Fig 6B; the 12th gene of the Type I IFN response is Ifnar1). We have prepared all source images at higher dimension to better appreciate the colocalization within liver microvasculature. In addition, we performed several flow cytometry analyses to identify liver cell populations of Cre-recombinant mice that express Ifnar1. Unfortunately, the predicted low cellular surface expression of this molecule coupled with the experimental conditions needed to extract viable non-parenchymal cells from the liver have prevented us from obtaining informative results.

      1. Ifnar1 ablation in VeCad+ cells prevents the effect of IFNa on tumour growth (Fig. 4d), suggesting the existence of anti-tumour mechanisms beyond the effects on hepatic colonization. Authors may consider checking proliferation, vascularization and immune infiltration in these tumours to enhance their conclusion.

      We fully agree with the referee’s concern and as above mentioned, we have followed his/her suggestion and examined the existence of antitumor mechanisms beyond the effects on hepatic colonization in VeCadIfnar1_KO mice treated with NaCl or IFNα. To this end, 4 NaCl-Ifnar1fl/fl, 7 IFNα-Ifnar1fl/fl, 4 NaCl-VeCadIfnar1_KO and 4 IFNα-VeCadIfnar1_KO mice were intrasplenically injected with MC38 CRC cells (Fig S7A,B). Twenty-one days after injection, mice were euthanized and their livers analyzed for tumor size, proliferation, signs of angiogenesis (as denoted by CD34 staining) and immune infiltration (F4/80+ macrophages and CD3+ T cells). Consistent with data presented in Fig 4D, histological analysis showed that Ifnar1fl/fl mice did not develop liver metastases in IFNα-treated mice. Furthermore, metastatic lesions detected in VeCadIfnar1_KO mice treated or not with IFNα did not show significant differences in Ki67 positivity, CD34 staining or the amount of F4/80+ resident macrophages and CD3+ T cells. This further supports that the antimetastatic potential of IFNα therapy may be primarily depend on the inhibition of hepatic trans-sinusoidal migration, a limiting step in the metastatic cascade that could secondarily influence colonization and outgrowth (Chambers et al, 2002). Corresponding text has been added at lines 248-252.

      1. Immune properties of LSECs are analysed in vivo by using a mouse CRE line that targets all endothelial cells, including those ones located in lymphoid organs, and evaluating T cell composition in the spleen. I found difficult to conclude that these properties are exerted directly by LSECs and not by other endothelial cells in vivo. To clarify the local effect of LSECs in modulating anti-tumour immunity, T cell composition and activation should be checked in tumours shortly after tamoxifen administration.

      We thank the reviewer for pointing out this issue, which cannot not be tested directly because - as also mentioned by reviewer 2 - LSEC-specific Cre-recombinant driver mice do not exist . As also indicated in the cited literature, central memory T cells accumulate after peripheral priming in secondary lymphoid organs such as the spleen (Sallusto et al, 2004; Stone et al, 2009; Yu et al, 2019). To this end, the generation and regulation of antitumor immunity is a highly orchestrated multistep process involving the uptake of tumor-associated antigens by professional APCs, their time-consuming migration to draining lymph nodes and the generation of protective T cells. Unlike other APCs, HECs/LSECs do not need to migrate to draining lymph nodes to activate effector T cells, leading to a rapid intrahepatic CD8+ T cell activation. In this context, LSECs must not only efficiently uptake, process and present CRC-derived antigens coming from intravascularly contained tumor cells, but they also require the attraction and retention within the liver micro-vasculature of T cell populations necessary for the generation of effective antitumor immune responses, where chemokines play an important role (Lalor et al, 2002). As shown in Fig 6A-C, two prominent chemokines (Cxcl10 and Cxcl9) required for T cell recruitment to the liver are specifically upregulated only in HECs/LSECs from IFNα-treated Ifnar1fl/fl mice, whereas HECs from VeCadIfnar1_KO mice maintained low expression of these chemoattractants in both NaCl- and IFNα-treated mice. These data are also consistent with the in vitro cross-priming results (see Fig 7A,B) showing that in the absence of IFNα, HECs have a low capacity to prime naïve T cells (Katz et al, 2004), indicating that LSEC-primed by tumor-derived antigens coming from apoptotic intravascular CRC metastatic cells play an important role in inducing tolerance (Berg et al, 2006; Katz et al., 2004), especially when CRC cells quickly extravasate and position within the space of Disse, likely becoming less accessible to intravascular patrolling by naïve and effector T cells (Benechet et al, 2019; Guidotti et al, 2015). On the contrary, in IFNα-treated Ifnar1fl/fl mice, CRC cells are rapidly contained in the liver microvasculature (Fig 5A,B) with CRC-derived antigens that could be immediately taken up by LSECs due to their anatomical proximity and efficient endocytosis capacity, which is among the highest of all cell types in the body (Sorensen, 2020). Here, the continuous sensing of IFNα by LSECs upregulates several genes related to antigen processing and presentation pathways (Fig. 6B,D), leading to efficient cross-priming of tumor-specific CD8+ T cells to the same extent as professional APCs, such as splenic DCs (Fig 7B). Text has been added in the revised manuscript at lines 496-515. Finally, regarding the suggestion to analyze the role of HECs/LSECs in inducing antitumor T cell immunity shortly after tamoxifen administration, while we agree that it would be interesting to analyze HEC/LSEC-mediated T cell activation by treating NaCl- and IFNαtreated Ifnar1fl/fl and VeCadIfnar1_KO mice with tamoxifen after CRC cell injection, we would like to point out that tamoxifen treatment will not only induce Cre recombination and Ifnar1 loss on endothelial cells but it may also induce several “off-target” effects complicating the interpretation of the results. Indeed, tamoxifen is known to i) inhibit the in vitro proliferation of several CRC cell lines (Ziv et al, 1994), ii) impair the growth of CRC liver metastases in vivo (Kuruppu et al, 1998) and iii) modify matrix stiffness to reduce tumor cell survival (Cortes et al, 2019). Further, as IFNα modifies the hepatic vascular barrier and the accessibility of antigens by LSECs, the specific timing of tamoxifen treatment could also affect the immunological consequences of Ifnar1 deletion making these experiment impractical. For these reasons, we’d like not to perform the suggested experiment with tamoxifen.

      Reviewer #1 (Significance):

      The conclusions of this study are consistent with previously published literature and the biological insights are potentially useful to the cancer biology community.

      Reviewer #2 (Evidence, reproducibility and clarity):

      In this study Dr. Sitia's group investigated the effect of IFNα1 as perioperative agent preventing liver metastasis formation of colorectal carcinoma (CRC). To this end, various mouse models were used such as liver colonization models, i.e. intrasplenic and mesenterial injections of MC38 and CT26 CRC cell lines. Besides, spontaneous metastasis of CRC was analyzed by orthotopic injection of MC38 into the cecum. To study the influence of IFNα1 in these settings mini-osmotic pumps releasing IFNα1 were used. Moreover, conditional mouse models with a cell-type specific deficiency of Ifnar1 were compared. Altogether, the application of IFNα1 led to a reduction in liver colonization of CRC in all models studied. This was ascribed to decreased trans-sinusoidal migration of CRC and increased cross-priming by LSEC entailing in T cell activation.

      Major comments:

      Overall the study is well performed and the major conclusions seem to be drawn well. However, there are certain points I like to address:

      • First, the authors started their experiments with MC38 and CT26 CRC cell lines. At the end they just applied MC38. The rational behind this should be clearly stated. Second, as in their previous publication (Catarinella et al, 2016) F1 hybrids of C57BL/6 x BALB/c mice were used for the experiments. However, I believe that the genetic heterogeneity might be strongly increased by this approach which might lead to difficult reproducibility of the results.

      We thank the referee for raising this important issue; additional text describing the reason of our choice has been introduced at lines: 203-205. We respectfully disagree with the comment that CB6F1 hybrids may increase genetic heterogeneity and impair reproducibility of our results. Each CB6F1 hybrid individual is genetically identical to its littermates, sharing 50% of genes of each parental mouse line and being tolerant to reciprocal MHC-I genes (thus permitting the correct engraftment of both cell lines). We agree that the use of mismatched backcrosses after the F1 generation would increase genetic heterogeneity and thus may affect outcome. This is also the reason why we could not perform experiments with CT26 in the Ifnar1fl/fl conditional lines that are in C57BL/6 background and would have needed at least 10 generations of backcrossing in the BALB/c background before being suitable to such experiments. Finally, all experiments described in Fig 4, 5, 6 and 7 were performed in C57BL/6 mice using MC38 CRC cells with results that reproduced those obtained in CB6F1 hybrids, and very similarly to what we have previously reported with MC38 in C57BL/6 mice (see Fig 5 (Catarinella et al., 2016)).

      • At page 16 the authors conclude that "patients suffering from chronic liver fibrotic disease... display lower incidence of hepatic metastases". In the community there is contradictory data (see Kondo et al, BJC, 2016, https://www.nature.com/articles/bjc2016155). This should be precisely discussed, otherwise this claim should be removed.

      We thank the referee for raising this issue and modified the discussion accordingly. Text has been added in the revised manuscript at lines 455-457.

      We agree with the reviewer's suggestion and added new text to recognized the interplay between different cell types such as dendritic cells within the hepatic niche (see new text at lines 505-515).

      • Last, multiple times the authors write about data that is "not shown". Please either include these data in the manuscript or delete corresponding phrases because it is not possible for the reader to scrutinize it.

      We fully agree with the referee’s concern and displayed all “not shown results” in Fig S1E and Fig S9C-I.

      • Besides, I suggest additional experiments further substantiating the study:
      • To see if this effect of IFNα1 is cell type-specific liver metastasis of other solid tumors such as breast cancer or melanoma should be investigated.

      We agree with the reviewer's suggestion, as also indicated in our original discussion. We believe that additional experiments with other solid tumor cell lines would be important to generalize the potential of perioperative IFNα therapy. In particular, we believe that pancreatic ductal adenocarcinoma (PDAC), a highly lethal disease that most commonly metastasizes to the liver (Lambert et al, 2017), may benefit from our approach. It should be noted, however, that the pleotropic nature of IFNα allows this cytokine to inhibit tumor growth by several mechanisms. Above all, the ability of IFNα therapy to directly reduce tumor growth depends on the relative surface expression of Ifnar1 on each tumor cell and the ability to maintain such expression in the harsh tumor microenvironment during IFNα therapy. As the degradation of Ifnar1 by CRC tumors has been well described (Katlinski et al., 2017), it is possible that CRC tumors thus escaping the antitumor properties of endogenous type I interferons may respond less efficiently to therapeutic IFNα regimens such as those herein described. This notion is consistent with our data on primary orthotopic tumors (Fig. 3D,E), which are no longer responsive to continuous IFNα therapy as early as 7 days after implantation of CT26LM3 cells. In addition, the definition of the HEC/LSEC antimetastatic barrier has been possible only because CRC cells are not directly susceptible to the IFNα antiproliferative activity, which we observed in vitro at extremely high IFNα dosages (Catarinella et al., 2016) but not in vivo (as formally demonstrated by using MC38Ifnar_ko cells, Fig 4A). At any rate, we followed the reviewer’s suggestion and performed an additional experiment in which we intramesenterically injected the PDAC cell line Panc02 (H-2b, C57BL/6-derived) (Soares et al, 2014) into C57BL/6 mice 7 days after of NaCl or IFNα therapy initiation. As shown below, MRI analysis at day 21 showed that none of the IFNα-treated Panc02 challenged mice developed metastatic lesions, while NaCl controls displayed a high metastatic burden that required euthanization for ethical reasons of about 67% of these mice shortly after MRI analysis. These data indicate that perioperative IFNα therapy completely curbs metastatic development in IFNα-treated PDAC animals. The notion that these cells may be more IFNα-susceptible than CRCs may well depend on the relative capacity of the former cells to maintain Ifnar1 expression, as suggested by others (Zhu et al, 2014). Properly addressing the reviewer’s comment would thus require extensive investigations involving the establishment of new mouse models of metastases from other solid tumors, starting from the in vitro and in vivo regulation of surface Ifnar1 expression in each tumor cell. We strongly believe that this work has merit but we think that it should be reported separately.

      • The authors applied a broad range of cell type-specific mice. However, a thorough characterization of the deletion of Ifnar1 in the corresponding cell types is missing. This is crucial for the manuscript.

      We fully agree with the referee’s concern and as previously mentioned, we have improved the characterization of Ifnar1 deletion (see response to the same critique received from reviewer 1, comment 3).

      • The capillarization of the hepatic vascular niche is a crucial point in this story. I believe that the hepatic endothelium should be further characterized by additional vascular markers.

      In response to the reviewer’s suggestion, we have included in our analysis the characterization of Lyve-1, a marker of hepatic capillarization (Pandey et al, 2020; Wohlfeil et al, 2019). Indeed, IFNα treatment of Ifnar1fl/fl mice significantly increased the expression of Lyve-1, whereas IFNα treatment of VeCadIfnar1_KO mice showed no effect (Fig S9A,B), further corroborating our findings. Text has been added in the revised manuscript at lines 291-294. To better aid readers, we have prepared high-resolution images for each IF channel and have provided these data as source date for Fig S9A.

      • Last, the data and methods appear adequately presented and experiments seem to be reproducible. Just in Figure 4 the exact number of mice and replicates are not clearly presented. Otherwise, everything is fine.

      We thank the reviewer for raising this issue, which apparently was not properly described in our original submission. We have now included the exact number of mice in each experimental group in the figure legend to Fig 4.

      Minor comments:

      Overall the text and figures are accurately presented. However, I would like to add further minor comments:

      • In Fig. 1 you present the IFNα dosing regimen. How do you explain the decrease in serum IFNα after day 2? Besides, the data points at day 0 should be excluded since measuring startet from day 2! Why did you decide to treat for seven days until the start of the experiment? One could think 2 days might already be enough.

      We thank the reviewer for raising these important points. Regarding the pharmacokineticpharmacodynamic (PK-PD) behavior of our approach, we do not believe that MOP reduced its pumping efficacy after day 2 (Theeuwes & Yum, 1976), nor that counterregulatory mechanisms, such as the induction of anti-IFNα blocking antibodies, occurred in such a short time frame (Wang et al, 2001). It is neither feasible that IFNα treatment significantly downregulated Ifnar1 in the liver (as demonstrated by pSTAT1 activation after MOP treatment in Fig S1E). Rather, our results reflect the PK-PD behavior of other long-lasting formulations of IFNα, which depend on intrinsic pharmacological properties of IFNα already described in (Jeon et al, 2013). Text has been added in the revised manuscript at lines 110-112. We also corrected the figures in which we quantified serum IFNα. Indeed, blood was drawn one day before MOP implantation rather than on the same day of surgery to avoid additional blood loss, which could be a source of unnecessary stress for the animals. Therefore, we corrected the results section and Fig S1A-C and Fig 1A,B. The decision to start treatment 7 days rather than 2 days before seeding was made for several reasons: i) this study follows our previous gene/cell therapy approach, in which the time interval between reconstitution of the transduced bone marrow with Tie2-IFNα and tumor challenge was at least 7-8 weeks. We therefore thought that 7 days might be a sufficient/necessary time period to induce similar phenotypes in the liver after continuous IFNα administration; ii) 7 days is a time frame compatible with the perioperative period in humans (Horowitz et al, 2015). Furthermore, the side effects that patients may experience after IFNα therapy are generally limited to the first few days after administration, allowing patients to benefit from IFNα-induced vascular antimetastatic barriers at the time of surgery without potential side effects of IFNα. Because oncologic guidelines recommend starting adjuvant chemotherapy at least 4 weeks after surgery in stage 2-3 CRC patients at risk of later developing liver metastases (Engstrand et al, 2019; van Gestel et al, 2014), our proposed perioperative time frame does not even conflict with these indications (Van Cutsem et al, 2016). We have included additional text in the lines 131-132 to motivate the timing of our regimens.

      • Fig. 2: Did you check for metastases in other organs than the liver at the timepoint of euthanization, e.g. lungs. In the discussion section you talk about a potential influence of IFNα1 on other organs. Therefore, I think that the mice should be thoroughly analyzed and the data presented. The manuscript will benefit from it.

      We thank the reviewer for this valuable comment. Indeed, we always check for dissemination of CRC metastases on MRI analysis and necroscopy. As stated at lines 146-147 and 158 CRC tumors seeded in the liver vasculature after colonizing the liver do not spread to other organs such as the lungs. Indeed, CRC cells intravascularly seeded in the portal circulation, are trapped at the beginning of hepatic sinusoids because their diameter is bigger than that of liver sinusoids (Fig S8A,B). These micro-anatomic peculiarities are also thought to impede the spreading of tumor cells from periportal to centrilobular areas and to the general circulation (Catarinella et al., 2016; Vidal-Vanaclocha, 2008), and this is consistent with studies showing that in CRC patients undergoing surgery the majority of CRC-derived circulating tumor cells are found in the portal vein (Deneve et al, 2013).

      • Overall, MRI pictures and pictures of IHC or IF are sometimes too small to see. Please provide pictures with larger magnification or enlarge the images.

      We thank you for this suggestion and we have indeed increased the size of all MRI, IHC, and IF images to the maximum that will fit within the figure. In addition, we presented the images at the highest magnification available, without making digital enlargements that would significantly reduce resolution.

      • Fig. 3 F, G: immune cell infiltration in the liver was analyzed. Please compare it to untreated, tumor-free wildtype liver tissue.

      We appreciated the reviewer's suggestion and included the results of six Sham mice per each marker in our analysis. The text was added on the figure legends to Fig 3H and Fig S4B,D.

      • Fig. 6: the graphs are too small to be read, especially the volcano plot and the gene names of the heatmap.

      We increased the font size of genes in the volcano plots and heatmap in Fig 6A,B, as suggested.

      • Fig. S6: Pictures of co-immunofluorescences are presented. For the reader it is really hard to distinguish the stainings and to identify colocalized areas. Please provide pictures with one channel to better compare the marker expression.

      We thank the reviewer for pointing this out and we have tried to make each panel as large as possible to fit into a two-column figure. We have also prepared high magnification images of each channel for all immunofluorescence images, which we provide as source data. We hope that this is sufficient to help readers to interpret our results without increasing the number of main or supplementary figures.

      • From page 8 onwards (section about transgenic mice) LSEC was used as kind of synonym for hepatic endothelial cells. Since there is still no LSEC-specific driver mouse, it should be stated "hepatic endothelial cells" instead.

      We agree with this suggestion and thus have indicated that the results refer to HECs but include a large majority of LSECs. Indeed, LSECs make up the majority (~89%) of the total HEC population (Su et al, 2021). In addition, some SEM and TEM analyses were performed only on LSECs, as well as the IF analyses. Therefore, we believe that LSECs play an important role in this process. Although not specifically suggested, we have also changed the title of our manuscript to reflect the reviewer's suggestion. Thus, we propose "Continuous sensing of IFNα by hepatic endothelial cells shapes a vascular antimetastatic barrier" as new title.

      • P. 11: there is a typo: Fig. Fig. S6G,H

      We corrected this typo.

      • P. 13: the authors describe Gata4 as inhibitor of subendothelial matrix deposition. This should be precisely written, since Gata4 originally is described as master-regulator of liver sinusoidal differentiation which leads to liver fibrosis development upon loss of Gata4.<br /> Besides, I came across a study of the same group that investigated the role of Notch signaling in hepatic CRC and melanoma metastasis (Wohlfeil et al, Cancer Res, 2019, https://aacrjournals.org/cancerres/article/79/3/598/638600/Hepatic-Endothelial-Notch-Activation-Protects). Similar to your study they tie the reduction in hepatic metastasis to capillarization of the hepatic microvasculature.

      We agree with this suggestion and modified text accordingly. We are also glad that our results agree with previous reported literature that has now been correctly cited at lines 351-356 and in the discussion lines 474-476.

      • The discussion reads like paraphrasing the results section. The manuscript would clearly benefit if the discussion section had been rewritten short and concisely.

      We agree with this suggestion, and we have modified discussion accordingly. We are also willing to shorten the discussion by removing the schematic model that could possibly be used as a graphical abstract.

      References

      Benechet AP, De Simone G, Di Lucia P, Cilenti F, Barbiera G, Le Bert N, Fumagalli V, Lusito E, Moalli F, Bianchessi V et al (2019) Dynamics and genomic landscape of CD8(+) T cells undergoing hepatic priming. Nature 574: 200-205

      Berg M, Wingender G, Djandji D, Hegenbarth S, Momburg F, Hammerling G, Limmer A, Knolle P (2006) Cross-presentation of antigens from apoptotic tumor cells by liver sinusoidal endothelial cells leads to tumor-specific CD8+ T cell tolerance. Eur J Immunol 36: 2960-2970

      Boukhaled GM, Harding S, Brooks DG (2021) Opposing Roles of Type I Interferons in Cancer Immunity. Annu Rev Pathol 16: 167-198

      Catarinella M, Monestiroli A, Escobar G, Fiocchi A, Tran NL, Aiolfi R, Marra P, Esposito A, Cipriani F, Aldrighetti L et al (2016) IFNalpha gene/cell therapy curbs colorectal cancer colonization of the liver by acting on the hepatic microenvironment. EMBO Mol Med 8: 155-170

      Chambers AF, Groom AC, MacDonald IC (2002) Dissemination and growth of cancer cells in metastatic sites. Nat Rev Cancer 2: 563-572

      Cortes E, Lachowski D, Robinson B, Sarper M, Teppo JS, Thorpe SD, Lieberthal TJ, Iwamoto K, Lee DA, Okada-Hatakeyama M et al (2019) Tamoxifen mechanically reprograms the tumor microenvironment via HIF-1A and reduces cancer cell survival. EMBO Rep 20

      Deneve E, Riethdorf S, Ramos J, Nocca D, Coffy A, Daures JP, Maudelonde T, Fabre JM, Pantel K, Alix-Panabieres C (2013) Capture of viable circulating tumor cells in the liver of colorectal cancer patients. Clin Chem 59: 1384-1392

      Engstrand J, Stromberg C, Nilsson H, Freedman J, Jonas E (2019) Synchronous and metachronous liver metastases in patients with colorectal cancer-towards a clinically relevant definition. World J Surg Oncol 17: 228

      Guidotti LG, Inverso D, Sironi L, Di Lucia P, Fioravanti J, Ganzer L, Fiocchi A, Vacca M, Aiolfi R, Sammicheli S et al (2015) Immunosurveillance of the liver by intravascular effector CD8(+) T cells. Cell 161: 486-500

      Horowitz M, Neeman E, Sharon E, Ben-Eliyahu S (2015) Exploiting the critical perioperative period to improve long-term cancer outcomes. Nature reviews Clinical oncology 12: 213-226

      Jeon S, Juhn JH, Han S, Lee J, Hong T, Paek J, Yim DS (2013) Saturable human neopterin response to interferon-alpha assessed by a pharmacokinetic-pharmacodynamic model. Journal of translational medicine 11: 240

      Katlinski KV, Gui J, Katlinskaya YV, Ortiz A, Chakraborty R, Bhattacharya S, Carbone CJ, Beiting DP, Girondo MA, Peck AR et al (2017) Inactivation of Interferon Receptor Promotes the Establishment of Immune Privileged Tumor Microenvironment. Cancer cell 31: 194-207

      Katz SC, Pillarisetty VG, Bleier JI, Shah AB, DeMatteo RP (2004) Liver sinusoidal endothelial cells are insufficient to activate T cells. Journal of immunology 173: 230-235

      Kuruppu D, Christophi C, Bertram JF, O'Brien PE (1998) Tamoxifen inhibits colorectal cancer metastases in the liver: a study in a murine model. Journal of gastroenterology and hepatology 13: 521-527

      Lalor PF, Shields P, Grant A, Adams DH (2002) Recruitment of lymphocytes to the human liver. Immunol Cell Biol 80: 52-64

      Lambert AW, Pattabiraman DR, Weinberg RA (2017) Emerging Biological Principles of Metastasis. Cell 168: 670-691

      Pandey E, Nour AS, Harris EN (2020) Prominent Receptors of Liver Sinusoidal Endothelial Cells in Liver Homeostasis and Disease. Front Physiol 11: 873

      Sallusto F, Geginat J, Lanzavecchia A (2004) Central memory and effector memory T cell subsets: function, generation, and maintenance. Annu Rev Immunol 22: 745-763

      Schreiber G (2017) The molecular basis for differential type I interferon signaling. J Biol Chem 292: 7285-7294

      Soares KC, Foley K, Olino K, Leubner A, Mayo SC, Jain A, Jaffee E, Schulick RD, Yoshimura K, Edil B et al (2014) A preclinical murine model of hepatic metastases. J Vis Exp: 51677

      Sorensen KK, Smedsrod, B. (2020) The Liver Sinusoidal Endothelial Cell: Basic Biology and Pathobiology. In: The Liver: Biology and Pathobiology, Sixth Edition pp. 422-434. John Wiley & Sons Ltd. :

      Stone JD, Chervin AS, Kranz DM (2009) T-cell receptor binding affinities and kinetics: impact on T-cell activity and specificity. Immunology 126: 165-176

      Su T, Yang Y, Lai S, Jeong J, Jung Y, McConnell M, Utsumi T, Iwakiri Y (2021) Single-Cell Transcriptomics Reveals Zone-Specific Alterations of Liver Sinusoidal Endothelial Cells in Cirrhosis. Cell Mol Gastroenterol Hepatol 11: 1139-1161

      Theeuwes F, Yum SI (1976) Principles of the design and operation of generic osmotic pumps for the delivery of semisolid or liquid drug formulations. Ann Biomed Eng 4: 343- 353

      Van Cutsem E, Cervantes A, Adam R, Sobrero A, Van Krieken JH, Aderka D, Aranda Aguilar E, Bardelli A, Benson A, Bodoky G et al (2016) ESMO consensus guidelines for the management of patients with metastatic colorectal cancer. Ann Oncol 27: 1386-1422

      van Gestel YR, de Hingh IH, van Herk-Sukel MP, van Erning FN, Beerepoot LV, Wijsman JH, Slooter GD, Rutten HJ, Creemers GJ, Lemmens VE (2014) Patterns of metachronous metastases after curative treatment of colorectal cancer. Cancer Epidemiol 38: 448-454

      Vidal-Vanaclocha F (2008) The prometastatic microenvironment of the liver. Cancer microenvironment : official journal of the International Cancer Microenvironment Society 1: 113-129

      Wang DS, Ohdo S, Koyanagi S, Takane H, Aramaki H, Yukawa E, Higuchi S (2001) Effect of dosing schedule on pharmacokinetics of alpha interferon and anti-alpha interferon neutralizing antibody in mice. Antimicrob Agents Chemother 45: 176-180

      Wohlfeil SA, Hafele V, Dietsch B, Schledzewski K, Winkler M, Zierow J, Leibing T, Mohammadi MM, Heineke J, Sticht C et al (2019) Hepatic Endothelial Notch Activation Protects against Liver Metastasis by Regulating Endothelial-Tumor Cell Adhesion Independent of Angiocrine Signaling. Cancer research 79: 598-610

      Yu X, Chen L, Liu J, Dai B, Xu G, Shen G, Luo Q, Zhang Z (2019) Immune modulation of liver sinusoidal endothelial cells by melittin nanoparticles suppresses liver metastasis. Nat Commun 10: 574

      Zhu Y, Karakhanova S, Huang X, Deng SP, Werner J, Bazhin AV (2014) Influence of interferon-alpha on the expression of the cancer stem cell markers in pancreatic carcinoma cells. Exp Cell Res 324: 146-156

      Ziv Y, Gupta MK, Milsom JW, Vladisavljevic A, Brand M, Fazio VW (1994) The effect of tamoxifen and fenretinimide on human colorectal cancer cell lines in vitro. Anticancer Res 14: 2005-2009

      Reviewer #2 (Significance):

      • Since liver metastases of various tumor are tremendously hard to treat and mediates therapy resistance, the authors focus on a very important field of research - prevention of liver metastasis formation.
      • This study adds insights into the mechanisms of action of IFNα1 in the hepatic microenvironment. It extends previous findings of Toyoshima who described anti-tumoral effects of IFNα1 released by dendritic cells in the liver.
      • The study is well designed and will be of great interest for the scientific community. Besides, it will be appreciated by physicians, However, as mentioned in the discussion, further clinical studies by physicians are needed to translate its findings into the clinic.
      • The author of this review works as physician and often deals with liver metastasis. It is one field of focus of her/his research.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study Dr. Sitia's group investigated the effect of IFNα1 as perioperative agent preventing liver metastasis formation of colorectal carcinoma (CRC). To this end, various mouse models were used such as liver colonization models, i.e. intrasplenic and mesenterial injections of MC38 and CT26 CRC cell lines. Besides, spontaneous metastasis of CRC was analyzed by orthotopic injection of MC38 into the cecum. To study the influence of IFNα1 in these settings mini-osmotic pumps releasing IFNα1 were used. Moreover, conditional mouse models with a cell-type specific deficiency of Ifnar1 were compared. Altogether, the application of IFNα1 led to a reduction in liver colonization of CRC in all models studied. This was ascribed to decreased trans-sinusoidal migration of CRC and increased cross-priming by LSEC entailing in T cell activation.

      Major comments:

      Overall the study is well performed and the major conclusions seem to be drawn well. However, there are certain points I like to address:

      • First, the authors started their experiments with MC38 and CT26 CRC cell lines. At the end they just applied MC38. The rational behind this should be clearly stated. Second, as in their previous publication (Catarinella et al, 2016) F1 hybrids of C57BL/6 x BALB/c mice were used for the experiments. However, I believe that the genetic heterogeneity might be strongly increased by this approach which might lead to difficult reproducibility of the results.
      • At page 16 the authors conclude that "patients suffering from chronic liver fibrotic disease... display lower incidence of hepatic metastases". In the community there is contradictory data (see Kondo et al, BJC, 2016, https://www.nature.com/articles/bjc2016155). This should be precisely discussed, otherwise this claim should be removed.
      • In the discussion section the interplay of other cell types within the hepatic niche should be stated. For example, in Toyoshima's study a direct anti-tumoral effect of dendritic cells releasing IFNα1 was demonstrated (see Toyoshima et al, Cancer Immunol Res, 2019, https://aacrjournals.org/cancerimmunolres/article/7/12/1944/469540/IL6-Modulates-the-Immune-Status-of-the-Tumor). This further strengthens your data.
      • Last, multiple times the authors write about data that is "not shown". Please either include these data in the manuscript or delete corresponding phrases because it is not possible for the reader to scrutinize it.
      • Besides, I suggest additional experiments further substantiating the study:
      • To see if this effect of IFNα1 is cell type-specific liver metastasis of other solid tumors such as breast cancer or melanoma should be investigated.
      • The authors applied a broad range of cell type-specific mice. However, a thorough characterization of the deletion of Ifnar1 in the corresponding cell types is missing. This is crucial for the manuscript.
      • The capillarization of the hepatic vascular niche is a crucial point in this story. I believe that the hepatic endothelium should be further characterized by additional vascular markers.
      • Last, the data and methods appear adequately presented and experiments seem to be reproducible. Just in Figure 4 the exact number of mice and replicates are not clearly presented. Otherwise, everything is fine.

      Minor comments:

      Overall the text and figures are accurately presented. However, I would like to add further minor comments:

      • In Fig. 1 you present the IFNα dosing regimen. How do you explain the decrease in serum IFNα after day 2? Besides, the data points at day 0 should be excluded since measuring startet from day 2! Why did you decide to treat for seven days until the start of the experiment? One could think 2 days might already be enough.
      • Fig. 2: Did you check for metastases in other organs than the liver at the timepoint of euthanization, e.g. lungs. In the discussion section you talk about a potential influence of IFNα1 on other organs. Therefore, I think that the mice should be thoroughly analyzed and the data presented. The manuscript will benefit from it.
      • Overall, MRI pictures and pictures of IHC or IF are sometimes too small to see. Please provide pictures with larger magnification or enlarge the images.
      • Fig. 3 F, G: immune cell infiltration in the liver was analyzed. Please compare it to untreated, tumor-free wildtype liver tissue.
      • Fig. 6: the graphs are too small to be read, especially the volcano plot and the gene names of the heatmap.
      • Fig. S6: Pictures of co-immunofluorescences are presented. For the reader it is really hard to distinguish the stainings and to identify colocalized areas. Please provide pictures with one channel to better compare the marker expression.
      • From page 8 onwards (section about transgenic mice) LSEC was used as kind of synonym for hepatic endothelial cells. Since there is still no LSEC-specific driver mouse, it should be stated "hepatic endothelial cells" instead.
      • P. 11: there is a typo: Fig. Fig. S6G,H
      • P. 13: the authors describe Gata4 as inhibitor of subendothelial matrix deposition. This should be precisely written, since Gata4 originally is described as master-regulator of liver sinusoidal differentiation which leads to liver fibrosis development upon loss of Gata4. Besides, I came across a study of the same group that investigated the role of Notch signaling in hepatic CRC and melanoma metastasis (Wohlfeil et al, Cancer Res, 2019, https://aacrjournals.org/cancerres/article/79/3/598/638600/Hepatic-Endothelial-Notch-Activation-Protects). Similar to your study they tie the reduction in hepatic metastasis to capillarization of the hepatic microvasculature.
      • The discussion reads like paraphrasing the results section. The manuscript would clearly benefit if the discussion section had been rewritten short and concisely.

      Significance

      • Since liver metastases of various tumor are tremendously hard to treat and mediates therapy resistance, the authors focus on a very important field of research - prevention of liver metastasis formation.
      • This study adds insights into the mechanisms of action of IFNα1 in the hepatic microenvironment. It extends previous findings of Toyoshima who described anti-tumoral effects of IFNα1 released by dendritic cells in the liver.
      • The study is well designed and will be of great interest for the scientific community. Besides, it will be appreciated by physicians, However, as mentioned in the discussion, further clinical studies by physicians are needed to translate its findings into the clinic.
      • The author of this review works as physician and often deals with liver metastasis. It is one field of focus of her/his research.
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Tran et al. describes the mechanism by which IFNa treatment prevents the development of liver CRC metastasis in several mouse models. They show how continuous administration of IFNa strength liver vascular barrier by a direct effect on endothelial cells and avoids the trans-sinusoidal migration of tumour cells.

      Major points:

      1. Authors use an elegant orthotopic model of liver metastasis to confirm the effect of continuous IFNa on hepatic colonization (Fig.3). Although they extensively characterize the metastatic lesions, they do not show data on the potential impact of IFNa treatment in the primary caecum tumour. Authors should clarify if the described effects are taken place in the liver or/and in the caecum. It would be interesting to show if IFNa affects the primary tumour size, the extravasation of cancer cells and the immune infiltration since all these factors could have an impact in the number of liver lesions.
      2. Figure 3f right shows liver images without any obvious metastatic lesion. Since authors are analysing the effect of IFNa treatment in proliferation, vascularization and immune composition in liver tumours, they may show and quantify images with metastatic lesions and restrict the analysis to the tumour area.
      3. Authors analyse the recombination efficiency of different mouse CRE lines by non-quantitative methods (PCR of hepatic genomic DNA and GFP expression by immunofluorescence in healthy liver). Since PDGFRβ-Cre/ERT2 and CD11c-Cre lines are used to exclude a role of IFNa on the targeted cells, authors should provide stronger evidences to support this. They may consider studding the ablation of Ifnar1 in FACS sorted fibroblasts and myeloid cells. Moreover, it would be important showing the proportion of GFP+ cells in the sorted populations to understand how broadly these stromal populations are targeted.
      4. Ifnar1 ablation in VeCad+ cells prevents the effect of IFNa on tumour growth (Fig. 4d), suggesting the existence of anti-tumour mechanisms beyond the effects on hepatic colonization. Authors may consider checking proliferation, vascularization and immune infiltration in these tumours to enhance their conclusion.
      5. Immune properties of LSECs are analysed in vivo by using a mouse CRE line that targets all endothelial cells, including those ones located in lymphoid organs, and evaluating T cell composition in the spleen. I found difficult to conclude that these properties are exerted directly by LSECs and not by other endothelial cells in vivo. To clarify the local effect of LSECs in modulating anti-tumour immunity, T cell composition and activation should be checked in tumours shortly after tamoxifen administration.

      Significance

      The conclusions of this study are consistent with previously published literature and the biological insights are potentially useful to the cancer biology community.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors use soft X-ray tomography to examine cell structure following infection by herpes simplex virus-1 (HSV-1). This imaging method can provide 3D images of cryo-preserved intact cells without chemical fixation or staining. The authors find several morphological differences between uninfected and infected cells, including changes in the number and size of vesicles and in the size and shape of mitochondria.

      This is a well-done study with careful and extensive analysis that in general produces convincing images to support the authors' conclusions. The procedures are clearly described and reproducible, and the authors have examined an impressive number of images and have performed appropriate statistical analyses.

      I had two comments / suggestions regarding the findings about changes in morphology after infection. First, in the Discussion, the authors consider the possibility of Golgi fragmentation. Can the authors test this by counting Golgi before and after fragmentation? Second, in the Results the authors report that they did not observe a change in lipid droplets after infection. However, the late-stage image in Fig. 5A seems to show such a change, with the lipid droplets becoming larger and darker relative to the early stage or uninfected cells. Maybe this is just the particular image that was selected, but perhaps it is worth looking at more images by eye just in case the segmentation procedure somehow missed this change.

      Minor comments:

      Line 127 - As I understand it, the alignment by fiducial markers corrects primarily for small inaccuracies in tilting of the stage. Hopefully there are not significant vibrations in the microscope because this would also lead to loss of resolution during the exposure of each tilt angle.

      Line 145 - "electron light" Is this common usage? To me it seems more accurate to just say electrons because light to me means photons.

      Line 390 - detection OF ("of" is missing)

      Line 564 - Fig. 2 legend. "partial retention in the nucleus of U2OS cells". I am not sure where the nucleus is in the images. To me, it looks like there is almost no stain for ICP0 in hTERT at stage 1 and stage 3, and then cytoplasmic stain at stage 2 and stage 4. In contrast, for U2OS, the stain looks mostly nuclear until stage 4 when it is partially cytoplasmic. This all needs to be better explained, and perhaps arrows added to the images such that the reader does not have to guess.

      Line 585 - The authors could consider rotating the images by 180{degree sign} in panel A (late) in order to maintain the same orientation of nucleus and cytoplasm. This would make it easier for readers to see the point.

      Line 614 - I could not find the length of the scale bar in the legend.

      Significance

      The significance of the study is two-fold. First, it is a nice technical demonstration of what can be accomplished using soft X-ray tomography. I am qualified to evaluate this, since my expertise is in biological applications of this technique. The second significant aspect of the study is the demonstration of morphological changes in mitochondria and vesicles. I am not a virologist, so I do not know the literature on this point with regard to virus infection, but I find it interesting that the authors were able to detect such changes.

      I believe the authors should cite a couple of papers:

      10.1016/j.cell.2015.11.029 which looks at HSV infection and reports viral particles between the inner and outer nuclear membrane. 10.1016/j.jsb.2011.11.025 which also reports nuclear membrane separations or bulges by soft X-ray tomography.

      Regarding these nuclear membrane bulges, there are a number of papers that show they can also arise from mutations in nuclear-lamin associated proteins like nesprin and SUN (see for example https://doi.org/10.1093/hmg/ddm338). This is perhaps something interesting for the authors to think about, but not necessary for the current manuscript.

    2. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewers for their careful and constructive analysis of our work. Our manuscript aims to exemplify the use of cryo-soft-X-ray tomography (cryoSXT) as a technique to study the dynamic changes to host-cell morphology that accompanies virus infection. This emerging method has several strengths when compared to other ultrastructural analysis techniques. Specifically, cryoSXT does not require the addition of contrast agents and therefore samples can be prepared via plunge cryopreservation alone, allowing us to capture them in a near-native state. Furthermore, the penetrating power of soft X rays and large field of view in cryoSXT allow rapid data acquisition, facilitating quantitative analysis of 10s to 100s of individual cells. We combined high-throughput cryoSXT data collection with semi-automated tomogram segmentation and fluorescence cryo-microscopy to study a recombinant herpes simplex virus (HSV)-1 that produces a pattern of fluorescence indicative of the stage of the infection in a single cell (‘timestamp’ HSV-1) and quantitatively monitored changes in lipid droplet, vesicle and mitochondrial morphology as HSV-1 infection progresses. In response to the reviewers’ comments, we have expanded our analysis of lipid droplet morphology, identifying a transient increase in the size of lipid droplets at early stages of HSV-1 infection, and completed additional fluorescence microscopy analysis to support our statements about the changes to microtubule, mitochondrial and Golgi morphology that accompany infection. Furthermore, we have included additional discussion on the relative merits of cryoSXT versus other ultrastructural analysis techniques like transmission electron microscopy, electron cryo-microscopy and electron cryotomography. We believe that our study serves as a powerful example of how cryoSXT can be used for quantitative cell biology and will be of broad interest to an audience of cell biologists and colleagues who study infection processes.

      1. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary

      The authors have performed an explorative study, investigating morphological changes that occur in cells upon infection with Herpes Simplex Virus 1 (HSV-1) by the use of cryo soft X-ray tomography (cryoSXT). cryoSXT is an emerging technique for imaging of biological material, that allows for 3D imaging of significant volumes of cells under near-native conditions, without the need for sectioning or sample preparation other than rapid freezing. Reference (Groen et al. 2019) provides a nice list of examples from various biological samples. By the use of cryoSXT, the authors confirm findings that they have previously published by use of light and expansion microscopy (ref 16 from manuscript), namely an enrichment of small vesicles close to the nucleus and elongation and branching of mitochondria into interconnected networks in infected cells.

      Infection experiments were done in two different cell types in this study (HFF and U2OS), and a timestamp reporter virus that allows to distinguish between early and late stages of infection was used to provide more context to the observed morphological changes in the cells.

      Major comments

      It is a bit difficult to follow the main message throughout the manuscript, as the topics brought up in the introduction, results and discussion sections are not very coherent. The introduction gives some background on the virus and the timestamp reporter system, and further focuses on cryoSXT as a method and how this can overcome sample preparation artefacts that might be introduced by chemical fixation and sample processing. The results do not contain any direct comparisons between cryoSXT and other methods or sample preparations (light microscopy or EM-based), and the discussion only to a small extent comes back to the advantages brought by cryoSXT compared to other methods. Rather the discussion largely revolves around the possible involvement of microtubules in generating the observed morphological changes, and the possible meaning of elongated mitochondria in infected cells. Both of these topics are barely introduced, and not at all experimentally interrogated in the case of microtubules. There is also some discussion about Golgi fragmentation, although this is also not directly interrogated by cryoSXT in the current manuscript.

      We thank the reviewer for these comments. We have: - Updated the introduction to enunciate more clearly the aims of our study - Included a substantial comparison of the relative merits of cryoSXT versus other ultrastructural analysis techniques (TEM, cryoEM and cryoET) in the discussion - Updated the introduction to introduce the concepts of microtubule and mitochondrial morphology changes during infection that are covered in depth in the discussion - Included additional microscopy experiments, including super-resolution structured illumination microscopy (SIM), to demonstrate the changes in Golgi (Figures 6 and 7), microtubule (Figure 8) and mitochondrial (Suppl. Figure 4) morphology that accompany HSV-1 infection. These additional experiments support the hypotheses presented in the submitted manuscript, namely that microtubule organising centres are disrupted, Golgi membranes dispersed, and mitochondria redistributed as HSV-1 infection progresses.

      The authors perform imaging with a 40nm or a 25nm zone plate, where the 25nm zone plate provides improved resolution of a smaller volume compared to the 40nm zone plate. The authors do not really make use of the improved resolution offered by the 25nm zone plate in the results, so the motivation for turning to this (and therefor also changing cell line) is a bit unclear. The reason for the U2OS cell line to better preserved during X ray imaging is also not discussed, maybe it has to do with the thickness of the cells (as the U2OS cells are very flat). Furthermore, images from the 25 nm zone plate are not compared side by side to neither the 40nm zone plate nor standard TEM, which makes it hard to judge what the increased resolution really brings.

      Only one zone plate can be installed at any one time in the microscope and altering the zone plates requires extensive hardware changes that are outside the control of beamline users. We agree that this was not clearly discussed in the text. We have included additional text in the results (lines 207–208) and methods (lines 633–638) explaining this operational limitation and clarifying which zone plate was used for which experiment. In this study we observed that tomograms acquired with the 25 nm zone plate did not provide significantly more biological information than with the 40 nm zone plate, and thus both are suitable for characterisation of overarching cellular ultrastructural changes that accompany infection. We have added a sentence to this effect to the discussion (lines 410–412). Like U2OS cells, HFF-hTERT cells are also very flat. They appear more robust compared to HFFs when used for protracted exposures to soft X-rays and less likely to suffer from heat deposition after an extensive data collection round. We can speculate at this point that this could conceivably be due to the particular chemical composition of the intracellular environment in different cell lineages but it is impossible to offer anything other than speculation and therefore we have refrained from commenting further on this in the manuscript.

      The switch from a 40 to a 25nm zone plate required a switch in the model system, as mentioned above. The chosen cell types are not linked to biological relevance however (neurons and epithelial cells are mentioned as relevant cell types in the introduction), and it is therefor a bit unclear what the relevance is of keeping results from both cell types and comparing the two, rather than sticking to the one that works with cryoSXT. The results from the U2OS cells could still be compared by LM to the HFF cells if this contributes to the aim of the study.

      U2OS cells were chosen because they have been used previously for studies of HSV-1 infection (references 55–56) and are known to be well suited to cryoSXT analysis (references 32–33). We have added a sentence to this effect to the results (lines 208–211).

      The distribution of the viral proteins of the timestamp reporter virus is used to categorize infected HFF cells into 4 infection stages. In the U2OS cells the protein distribution is a bit different, which only allows them to be categorized into early (stage 1+2) and late (stage 3+4) stage of infection. Although this is what the authors state in the text, all 4 stages are included in Fig.2 for the U2OS cells, so it is not clear how this subdivision is performed and it does not seem like an accurate representation of the data. Furthermore, the uninfected population is not included in the timecourse, and there is not really a gradual change in infection states over the different timepoints as one could have expected. Therefor it is a bit hard to see the relevance of the timecourse. In the paper where the reporter virus is published (ref 16), shorter infection times were used, which leads to a more gradual change in infection stages.

      We thank the reviewer for pointing out these omissions. We have updated Figure 2A to only show the categories early (stage 1+2) and late (stage 3+4) for the U2OS cells. Furthermore, we have repeated the infection time course experiment, quantitating uninfected cells in addition to infected cells and including additional time points (2-, 4- and 6-hours post-infection). This new data (Figure 2B) demonstrates that the temporal profiles of infection progression are similar in HFF-hTERT and U2OS cells. Furthermore, it supports our choice of 9 hours post-infection as a suitable time point for plunge freezing of samples in order to obtain a mixture of cells at early and late stages of infection.

      There is a lot of importance given to the morphological changes of mitochondrial networks in infected cells. However, the quantification represented in Fig.5B is a bit unclear. The mitochondria are classified into different groups, but there is no specific description of the definition and cutoff values of each group. The name of some groups is also confusing, such as "short and long" mitochondria. Furthermore, there are large differences between replicates (suppl. fig. 2). The authors state that some mitochondria are swollen, which they interpret as a sign of apoptosis. They find these swollen mitochondria in 75% of the tomograms of uninfected cells in replicate number 3. If this is indeed cell death this replicate is not healthy.

      We apologise that the categorisation of mitochondria was not sufficiently clear in the submitted manuscript. The categories were percentage of tomograms that had the different mitochondrial morphologies present, not percentages of mitochondria. Thus, tomograms with both short and long mitochondria were classified as “short and long”. We have re-generated Figure 5C and Suppl. Figure 2C as a Venn diagram to illustrate this point more clearly. We have also updated the legend of Figure 5C (lines 845–850) to state clearly that the diagram shows percentage of tomograms with the relevant mitochondrial morphologies. The categorisation was performed manually and we have included examples of each category in Figure 5A. Manual classification can be subjective but, given the large number of tomograms analysed and the clear distinction between morphology in uninfected vs early- and late-stage infected cells, we are confident that our results are robust. We note that we have deposited all of the source tomograms in the Apollo repository at the University of Cambridge (https://doi.org/10.17863/CAM.78593); the data we used for this analysis are thus freely available for inspection and re-analysis by interested colleagues. We note that the swollen mitochondria were observed in multiple samples of uninfected and infected cells. This suggests that, regardless of infection, this is a common phenotype of U2OS cells. Others have observed this morphology by EM in the context of apoptosis and suggest it may represent porous mitochondria (reference 61). Although the proportion of tomograms containing these swollen mitochondria were higher in the uninfected sample of replicate 3, the other 25% contained typical mitochondrial morphologies that we could include in our analysis. The presence of inter-cell morphological variability such as this highlights the importance of imaging multiple cells within a population and performing several distinct biological replicates, as we have done in this study, to ensure project-relevant information is captured and delineated from the background structural variability inherent within a cell population. Previous cryoSXT studies had observed (but did not specifically comment on) a similar swollen mitochondrial morphology (reference 59). However, out of an abundance of caution we excluded all tomograms with swollen mitochondria from our analysis of mitochondrial branching (Figure 5C). Moreover, Tukey tests were performed per replicate for each pair of conditions in Figure 5C and statistical significance was reported only if it was observed independently in all three replicates. We are thus confident that any sampling error in replicate 3 that may arise from excluding tomograms will not have meaningfully altered our conclusions.

      Minor comments

      Results section 1, line 115-117: Where the authors state that it is unclear whether "naked" HSV-1 capsids would be visible by cryoSXT, it would be useful to refer to literature where these are observed by TEM, or to compare to TEM in their own experiments.

      We have included references to previous TEM studies in the results (lines 128–129), as requested. However, we note that TEM and cryoSXT are fundamentally different as TEM uses contrast agents whereas contrast in cryoSXT arises from differential elemental densities (in particular the density of oxygen versus carbon or phosphorous). We have updated the results (lines 129–131) to clarify this point.

      Results line 143: The authors state that it's hard to observe the perinuclear viruses with TEM, but there are several examples of this in the literature that could be referenced, e.g. (Skepper et al. 2001; Leuzinger et al. 2005; Baines et al. 2007; Johnson and Baines 2011), although this does not mean that they are not hard to find or that 3D is not advantegous.

      We thank the reviewer for these references and we have added them to the manuscript.

      Fig.4: It is unclear why all the vesicles are open-ended

      This is due to the differential path-length of carbon rich (and thus high contrast) membrane traversed by the X-rays for the membranes normal or parallel to the incident X-ray beam. We have clarified this point in the results (lines 290–301).

      Some places in the manuscript PFU per cell is used, other places MOI

      Thank you for pointing this out. For consistency, we have changed all instances of PFU per cell to MOI.

      If some specific adjustments to the methods had to be implemented for bio safely reasons (virus work), this should be stated in the methods.

      We have added a section on biosafety measures to the methods (lines 562–568).

      Access to the synchrotron should also be described

      We have expanded the synchrotron access attribution the Acknowledgments section (lines 737– 738).

      Discussion line 320: "consistent with previous research" - there is a reference missing.

      Thank you for spotting this. We have now added the reference.

      The quantifications are based on a limited number of tomograms, but there is no statement as to how the specific tomograms were selected. With a variability between replicates and tomograms, a random selection is important.

      We included all tomograms collected for the relevant experimental condition in all our analyses unless otherwise stated. For the vesicle segmentation we chose four reconstructed tomograms from each condition at random (lines 690–691). For lipid droplet volume analysis and mitochondrial branching analysis we included all tomograms that matched our quality-control criteria. We have added a few sentences to the Segmentation and Graphs and Statistics sections of the methods (lines 691–694 and 724–733) describing our selection criteria for the lipid droplet, vesicle and mitochondrial branching analysis, respectively.

      If gold fiducials are visible in the tomograms it could be useful to indicate, as they can look similar to lipid droplets to a non-expert reader.

      We have indicated gold fiducials Figure 1 H, the only figure in which they are visible, with a gold star as requested.

      Suppl. Fig.2: For clarity it would be good not to use the same color arrows to indicate different things in A and B.

      Suppl. Figure 2B has been removed in response to another reviewer request.

      Reviewer #1 (Significance):

      The authors of this study demonstrate that cells infected by HSV-1 virus can be investigated by the use of cryoSXT, and use this to show that infected cells have more elongated and interconnected mitochondria, and an enrichment of small vesicles close to the nucleus. They thereby also show that cryoSXT offers a nice resolution for characterizing morphological changes in significant volumes of near native-state cells, and that the method offers a promising throughput for screening of large amounts of cells. However, the study does not really present new biological or technical advances compared to previously published literature, see e.g. Müller et.al. 2012, Duke et.al 2014, Perez Berna et.al. 2016, Groen et.al. 2019, Weinhardt et.al. 2020, Loconte et.al. 2021 (not cryo but demonstrates the advantage of capillaries), Kounatidis et.al. 2020, Scherer 2021 (ref 16 from paper), some of which are also referenced in the current study. The study could thus have profited from a more defined focus and possibly further experiments (live-cell imaging, CLEM, TEM, microtubules or more mechanistically focused) depending on the main interest of the authors. The advantage with the current broad focus (assuming that the main concerns are addressed) is that the study could interest a larger audience, ranging from virology, cell biology and immunology to microscopy and methods development.

      We thank the reviewer for recognising the broad audience that will be interested in our manuscript. We believe that our analysis highlights the broad applicability of cryoSXT for analysing cell ultrastructure and changes that occur in response to infection. Furthermore, we think that our use of robust numerical analysis to quantitate the phenotypes we observe highlights the strength of cryoSXT as a high throughput technique for ultrastructural analysis. Our study is the first to investigate HSV-1 infection using cryoSXT and, in addition to confirming previous ultrastructural changes observed using other methods, we present new biological insight in organelle architecture and distribution such as that lipid droplets undergo a transient size increase during early stages of infection. We believe that we have demonstrated the robust utility of cryoSXT as a tool to study ultrastructural changes in response to insults, such as infection by intracellular pathogens, and hope that our manuscript will act as inspiration for others seeking to use cryoSXT to image cellular ultrastructure.

      Reviewer #2 (Evidence, reproducibility and clarity):

      The authors use soft X-ray tomography to examine cell structure following infection by herpes simplex virus-1 (HSV-1). This imaging method can provide 3D images of cryo-preserved intact cells without chemical fixation or staining. The authors find several morphological differences between uninfected and infected cells, including changes in the number and size of vesicles and in the size and shape of mitochondria.

      This is a well-done study with careful and extensive analysis that in general produces convincing images to support the authors' conclusions. The procedures are clearly described and reproducible, and the authors have examined an impressive number of images and have performed appropriate statistical analyses.

      We thank the reviewer for their positive comments.

      I had two comments / suggestions regarding the findings about changes in morphology after infection. First, in the Discussion, the authors consider the possibility of Golgi fragmentation. Can the authors test this by counting Golgi before and after fragmentation?

      We did not frequently observe well-defined Golgi apparatuses in our tomograms, consistent with previous cryoSXT studies (reference 61). We therefore performed new experiments using SIM microscopy to demonstrate the disruption of Golgi apparatus and trans-Golgi network in fixed U2OS cells stained with the markers GM130 and TGN46, respectively. These new results are presented in Figures 6 and 7 and in the results (lines 342–355).

      Second, in the Results the authors report that they did not observe a change in lipid droplets after infection. However, the late-stage image in Fig. 5A seems to show such a change, with the lipid droplets becoming larger and darker relative to the early stage or uninfected cells. Maybe this is just the particular image that was selected, but perhaps it is worth looking at more images by eye just in case the segmentation procedure somehow missed this change.

      We thank the reviewer for suggesting we re-visit the properties of lipid droplets. Based on this suggestion we segmented the lipid droplets from 94 tomograms and found a robust change in the median volume of lipid droplets at early stages of infection. We have included this new data in Figure 4C, Suppl Figure 2 and the text of the results (lines 302–312). The observation that lipid droplet volumes change is particularly interesting as another group recently observed similar changes in lipid droplets in response to HSV-1 infection of astrocytes and they postulate that this may modulate the cellular immune response (reference 85). Our data support and extend their conclusions, as described in the discussion (lines 476–494).

      Minor comments:

      Line 127 - As I understand it, the alignment by fiducial markers corrects primarily for small inaccuracies in tilting of the stage. Hopefully there are not significant vibrations in the microscope because this would also lead to loss of resolution during the exposure of each tilt angle.

      Thank you, we have corrected “vibrations” to “small inaccuracies in tilting of the microscope stage”.

      Line 145 - "electron light" Is this common usage? To me it seems more accurate to just say electrons because light to me means photons.

      Thank you, we have corrected “electron light” to “electrons”.

      Line 390 - detection OF ("of" is missing)

      Thank you, we have made the correction.

      Line 564 - Fig. 2 legend. "partial retention in the nucleus of U2OS cells". I am not sure where the nucleus is in the images. To me, it looks like there is almost no stain for ICP0 in hTERT at stage 1 and stage 3, and then cytoplasmic stain at stage 2 and stage 4. In contrast, for U2OS, the stain looks mostly nuclear until stage 4 when it is partially cytoplasmic. This all needs to be better explained, and perhaps arrows added to the images such that the reader does not have to guess.

      We agree and have added a silhouette around each nuclei in Figure 2 to make this clearer. We have also added arrows to indicate the gC-mCherry enriched juxtanuclear compartment in cells at stage 3 (HFF-hTERT) or a late stage (U2OS) of infection.

      Line 585 - The authors could consider rotating the images by 180{degree sign} in panel A (late) in order to maintain the same orientation of nucleus and cytoplasm. This would make it easier for readers to see the point.

      Done as requested.

      Line 614 - I could not find the length of the scale bar in the legend.

      We apologise for omitting this – is has now been added.

      Reviewer #2 (Significance):

      The significance of the study is two-fold. First, it is a nice technical demonstration of what can be accomplished using soft X-ray tomography. I am qualified to evaluate this, since my expertise is in biological applications of this technique. The second significant aspect of the study is the demonstration of morphological changes in mitochondria and vesicles. I am not a virologist, so I do not know the literature on this point with regard to virus infection, but I find it interesting that the authors were able to detect such changes.

      We thank the reviewer for their positive assessment of our work.

      I believe the authors should cite a couple of papers:

      10.1016/j.cell.2015.11.029 which looks at HSV infection and reports viral particles between the inner and outer nuclear membrane.

      We have included a citation to this work as requested (lines 162–165).

      10.1016/j.jsb.2011.11.025 which also reports nuclear membrane separations or bulges by soft X-ray tomography.

      We have elaborated on this section and incorporated the reference as requested (lines 265– 276).

      Regarding these nuclear membrane bulges, there are a number of papers that show they can also arise from mutations in nuclear-lamin associated proteins like nesprin and SUN (see for example https://doi.org/10.1093/hmg/ddm338). This is perhaps something interesting for the authors to think about, but not necessary for the current manuscript.

      Thank you for this comment. We did consider studying the breakdown of the nuclear lamina during HSV-1 infection, as this has been shown in previous studies [e.g. 10.1101/2021.06.02.446771]. However, we could not robustly resolve the nuclear lamina from the nuclear envelope in uninfected cells. The nuclear lamina is quite thin (30–100 nm in width) and this may have confounded its identification.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary:

      The manuscript by Nahas et al. describes the structural studies performed in U2OS cells infected with a recombinant HSV-1 virus that enables tracing the stage of the infection using fluorescent markers. This system was used to determine major structural changes in HSV-1 infected cells using cryo-soft X ray tomography (cryo-SXT) on near native-state samples. The data presented complement previous studies (particularly ref.16) using similar reagents but different microscopy techniques. While the data are generally well presented and discussed, they do not provide any substantially novel information on the structural changes in HSV-1. Nevetheless, they constitute an interesting technical achievement.

      We thank the reviewer for supporting the technical quality of the analysis. In response to the comments of another reviewer we have extended our analysis and documented new biological information for this system relating to lipid droplet re-shaping and distribution in response to HSV-1 infection; all our new findings are included in the updated manuscript.

      Major comments:

      There are no major concerns on the data, although some of the statements could be revised for a more realistic interpretation of the results.

      • In Figure 1F and lines 152-156 it is stated that a bulging of the nuclear envelope occurs around some of the putative particles, while in lines 243-244 and lines 625-628, it is stated that bulging occurs both in mock and infected cells. This should be clarified to avoid confusion. It is possible that authors differentiate both situations and this should be more clearly stated.

      Many thanks for identifying a possible area of confusion. We have updated the results to clearly distinguish the expansion of the perinuclear space that accompanies virus nuclear egress (lines 160–175) from the bulges of the nuclear envelope that are observed in uninfected and infected cells (lines 265–276).

      • The statistical tests are different for different hypothesis testing throughout the manuscript. The authors should justify in the methods section the use of one or another test. This will contribute to clarity in the hypothesis that is being test and will clarify the reason for the selected test.

      We have significantly expanded the Graphs and Statistics section of the methods (lines 703– 734) to further justify the statistical tests used throughout our study.

      • Sentence: "Our observation..." in lines 349-352. Even though the sentence is in the Discussion it is wildly speculative. The authors could use different approaches to tackle experimentally the question of whether active fusion or faulty fission is involved, but this is not the main subject the manuscript. Please revise the sentence or address experimentally, this would provide new insight into the impact of HSV-1 infection on mitochondrial network morphology. This sentence could be qualified as "speculative".

      We agree that this section of the discussion strayed into speculative territory and have removed it from the updated manuscript.

      • Although ref.16 provides evidence supporting Golgi fragmentation and mitochondrial elongation after HSV-1_timestamp virus infection in HFF cells, it would be important to show confocal microscopy data in U2OS cells, which were used for cryo-SXT, particularly since the authors refer differential virus kinetics and subcellular distribution of viral antigens in these cells. These would greatly contribute to support the statements regarding these two phenomena. It is very likely that the authors already have the data and could easily show them.

      We have included new microscopy experiments to demonstrate changes in mitochondrial (Suppl. Figure 4) and Golgi (Figures 6 and 7) morphology that accompany HSV-1 infection, and these new experiments are now included in the results (lines 335–310 and 342–355).

      -Line 269: Apposition of lipid droplets and mitochondria is not thoroughly described. This statement requires quantitation. Optimally, confocal imaging using Mitotracker and bodipy493/503 or superresolution imaging using specific antibodies may also contribute to strengthen the statement.

      We agree with the reviewer that we do not at this stage have adequate data to support this assertion and have therefore removed it from the manuscript.

      • It would be of great interest to document the budding events observed by cryo-SXT using higher resolution techniques and the kinetic resolution provided by the fluorescent infection fiducials. This would confirm the nature of the particles (using immunogold) and would demonstrate the the usefulness of the cryo-SXT data. This by itself would justify the use of cryo-SXT to temporally locate events that are difficult to visualize otherwise (as stated by the authors).

      We agree with the reviewer that a correlative imaging strategy involving cryoSXT and fluorescence microscopy could aid in identifying features of infection, and have highlighted this interesting future direction in the discussion (line 406–409). However, performing such analysis will be a substantial experimental commitment in its own and is outside the scope of our current manuscript.

      Minor comments:

      • Given that the software used for segmentation (Contour) is not published, a minimal comparative description between manual and semi-automated segmentation may be shown in the supplementary, to illustrate the robustness of the new method and the reliability of the measurements.

      We have now published a preprint (recently accepted in the journal Biological Imaging) that describes Contour in detail, which we have referenced in the updated manuscript: Nahas, K. L., Ferreira Fernandes, J., Crump, C., Graham, S. C. & Harkiolaki, M. (2021) Contour, a semi-automated segmentation and quantitation tool for cryo-soft-X-ray tomography. http://biorxiv.org/lookup/doi/10.1101/2021.12.03.470962

      • Lines 278-280: statistical test and p value are not shown.

      We have updated the text to include details of the statistical test and p value as requested (lines 326–330 of the updated manuscript).

      • After line 376: It would be interesting to mention that transient elongation of mitochondria is observed during dengue virus infection (https://doi.org/10.1016/j.chom.2016.07.008) and that this has also consequences for innate immunity against viruses.

      We thank the reviewer for this suggestion, which we have incorporated into the discussion (lines 522–523).

      • Given that HSV-1 is a BSL-2 level virus and that a recombinant version (GMO) has been used in the study, the authors should describe the biosafety measures taken to image non-inactivated infectious samples by cryo-SXT. The authors should state that a biosafety committee has reviewed these activities.

      We have included a Biosafety Measures section to the methods (lines 562–568) that details the biosafety measures used and their approval by the relevant committees.

      Reviewer #3 (Significance):

      This study constitutes an incremental technical advance in the study of HSV-1 infection. The broad context and the quasi-native structure of the cells enables documenting events that are difficult to observe thin sections for TEM.

      This study is one of the few examples of the use of cryo-SXT for infected cell imaging. Other examples of the literature are cited as well as previous structural studies performed with higher resolution techniques.

      The manuscript may be suitable for HSV-1 specialists and cell biologists interested in using near-native samples for gross cellular imaging and documentation of low-resolution maps revealing alterations in large subcellular structures.

      We thank the reviewer for highlighting that ours is one of only a few comprehensive studies using cryoSXT, illustrating how it can be used to image cellular processes that are hard to ‘catch’ using techniques that require ultra-thin sectioning, and as such that it will be of interest to cell biologists studying infection processes in cellulo.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Nahas et al. describes the structural studies performed in U2OS cells infected with a recombinant HSV-1 virus that enables tracing the stage of the infection using fluorescent markers. This system was used to determine major structural changes in HSV-1 infected cells using cryo-soft X ray tomography (cryo-SXT) on near native-state samples. The data presented complement previous studies (particularly ref.16) using similar reagents but different microscopy techniques. While the data are generally well presented and discussed, they do not provide any substantially novel information on the structural changes in HSV-1. Nevetheless, they constitute an interesting technical achievement.

      Major comments:

      There are no major concerns on the data, although some of the statements could be revised for a more realistic interpretation of the results.

      • In Figure 1F and lines 152-156 it is stated that a bulging of the nuclear envelope occurs around some of the putative particles, while in lines 243-244 and lines 625-628, it is stated that bulging occurs both in mock and infected cells. This should be clarified to avoid confusion. It is possible that authors differentiate both situations and this should be more clearly stated.
      • The statistical tests are different for different hypothesis testing throughout the manuscript. The authors should justify in the methods section the use of one or another test. This will contribute to clarity in the hypothesis that is being test and will clarify the reason for the selected test.
      • Sentence: "Our observation..." in lines 349-352. Even though the sentence is in the Discussion it is wildly speculative. The authors could use different approaches to tackle experimentally the question of whether active fusion or faulty fission is involved, but this is not the main subject the manuscript. Please revise the sentence or address experimentally, this would provide new insight into the impact of HSV-1 infection on mitochondrial network morphology. This sentence could be qualified as "speculative".
      • Although ref.16 provides evidence supporting Golgi fragmentation and mitochondrial elongation after HSV-1_timestamp virus infection in HFF cells, it would be important to show confocal microscopy data in U2OS cells, which were used for cryo-SXT, particularly since the authors refer differential virus kinetics and subcellular distribution of viral antigens in these cells. These would greatly contribute to support the statements regarding these two phenomena. It is very likely that the authors already have the data and could easily show them. -Line 269: Apposition of lipid droplets and mitochondria is not thoroughly described. This statement requires quantitation. Optimally, confocal imaging using Mitotracker and bodipy493/503 or superresolution imaging using specific antibodies may also contribute to strengthen the statement.
      • It would be of great interest to document the budding events observed by cryo-SXT using higher resolution techniques and the kinetic resolution provided by the fluorescent infection fiducials. This would confirm the nature of the particles (using immunogold) and would demonstrate the the usefulness of the cryo-SXT data. This by itself would justify the use of cryo-SXT to temporally locate events that are difficult to visualize otherwise (as stated by the authors).

      Minor comments:

      • Given that the software used for segmentation (Contour) is not published, a minimal comparative description between manual and semi-automated segmentation may be shown in the supplementary, to illustrate the robustness of the new method and the reliability of the measurements.
      • Lines 278-280: statistical test and p value are not shown.
      • After line 376: It would be interesting to mention that transient elongation of mitochondria is observed during dengue virus infection (https://doi.org/10.1016/j.chom.2016.07.008) and that this has also consequences for innate immunity against viruses.
      • Given that HSV-1 is a BSL-2 level virus and that a recombinant version (GMO) has been used in the study, the authors should describe the biosafety measures taken to image non-inactivated infectious samples by cryo-SXT. The authors should state that a biosafety committee has reviewed these activities.

      Significance

      This study constitutes an incremental technical advance in the study of HSV-1 infection. The broad context and the quasi-native structure of the cells enables documenting events that are difficult to observe thin sections for TEM.

      This study is one of the few examples of the use of cryo-SXT for infected cell imaging. Other examples of the literature are cited as well as previous structural studies performed with higher resolution techniques.

      The manuscript may be suitable for HSV-1 specialists and cell biologists interested in using near-native samples for gross cellular imaging and documentation of low-resolution maps revealing alterations in large subcellular structures.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The authors have performed an explorative study, investigating morphological changes that occur in cells upon infection with Herpes Simplex Virus 1 (HSV-1) by the use of cryo soft X-ray tomography (cryoSXT). cryoSXT is an emerging technique for imaging of biological material, that allows for 3D imaging of significant volumes of cells under near-native conditions, without the need for sectioning or sample preparation other than rapid freezing. Reference (Groen et al. 2019) provides a nice list of examples from various biological samples. By the use of cryoSXT, the authors confirm findings that they have previously published by use of light and expansion microscopy (ref 16 from manuscript), namely an enrichment of small vesicles close to the nucleus and elongation and branching of mitochondria into interconnected networks in infected cells.

      Infection experiments were done in two different cell types in this study (HFF and U2OS), and a timestamp reporter virus that allows to distinguish between early and late stages of infection was used to provide more context to the observed morphological changes in the cells.

      Major comments

      It is a bit difficult to follow the main message throughout the manuscript, as the topics brought up in the introduction, results and discussion sections are not very coherent. The introduction gives some background on the virus and the timestamp reporter system, and further focuses on cryoSXT as a method and how this can overcome sample preparation artefacts that might be introduced by chemical fixation and sample processing. The results do not contain any direct comparisons between cryoSXT and other methods or sample preparations (light microscopy or EM-based), and the discussion only to a small extent comes back to the advantages brought by cryoSXT compared to other methods. Rather the discussion largely revolves around the possible involvement of microtubules in generating the observed morphological changes, and the possible meaning of elongated mitochondria in infected cells. Both of these topics are barely introduced, and not at all experimentally interrogated in the case of microtubules. There is also some discussion about Golgi fragmentation, although this is also not directly interrogated by cryoSXT in the current manuscript.

      The authors perform imaging with a 40nm or a 25nm zone plate, where the 25nm zone plate provides improved resolution of a smaller volume compared to the 40nm zone plate. The authors do not really make use of the improved resolution offered by the 25nm zone plate in the results, so the motivation for turning to this (and therefor also changing cell line) is a bit unclear. The reason for the U2OS cell line to better preserved during X ray imaging is also not discussed, maybe it has to do with the thickness of the cells (as the U2OS cells are very flat). Furthermore, images from the 25 nm zone plate are not compared side by side to neither the 40nm zone plate nor standard TEM, which makes it hard to judge what the increased resolution really brings.

      The switch from a 40 to a 25nm zone plate required a switch in the model system, as mentioned above. The chosen cell types are not linked to biological relevance however (neurons and epithelial cells are mentioned as relevant cell types in the introduction), and it is therefor a bit unclear what the relevance is of keeping results from both cell types and comparing the two, rather than sticking to the one that works with cryoSXT. The results from the U2OS cells could still be compared by LM to the HFF cells if this contributes to the aim of the study.

      The distribution of the viral proteins of the timestamp reporter virus is used to categorize infected HFF cells into 4 infection stages. In the U2OS cells the protein distribution is a bit different, which only allows them to be categorized into early (stage 1+2) and late (stage 3+4) stage of infection. Although this is what the authors state in the text, all 4 stages are included in Fig.2 for the U2OS cells, so it is not clear how this subdivision is performed and it does not seem like an accurate representation of the data. Furthermore, the uninfected population is not included in the timecourse, and there is not really a gradual change in infection states over the different timepoints as one could have expected. Therefor it is a bit hard to see the relevance of the timecourse. In the paper where the reporter virus is published (ref 16), shorter infection times were used, which leads to a more gradual change in infection stages.

      There is a lot of importance given to the morphological changes of mitochondrial networks in infected cells. However, the quantification represented in Fig.5B is a bit unclear. The mitochondria are classified into different groups, but there is no specific description of the definition and cutoff values of each group. The name of some groups is also confusing, such as "short and long" mitochondria. Furthermore, there are large differences between replicates (suppl. fig. 2). The authors state that some mitochondria are swollen, which they interpret as a sign of apoptosis. They find these swollen mitochondria in 75% of the tomograms of uninfected cells in replicate number 3. If this is indeed cell death this replicate is not healthy.

      Minor comments

      Results section 1, line 115-117: Where the authors state that it is unclear whether "naked" HSV-1 capsids would be visible by cryoSXT, it would be useful to refer to literature where these are observed by TEM, or to compare to TEM in their own experiments.

      Results line 143: The authors state that it's hard to observe the perinuclear viruses with TEM, but there are several examples of this in the literature that could be referenced, e.g. (Skepper et al. 2001; Leuzinger et al. 2005; Baines et al. 2007; Johnson and Baines 2011), although this does not mean that they are not hard to find or that 3D is not advantegous.

      Fig.4: It is unclear why all the vesicles are open-ended

      Some places in the manuscript PFU per cell is used, other places MOI

      If some specific adjustments to the methods had to be implemented for bio safely reasons (virus work), this should be stated in the methods.

      Access to the synchrotron should also be described

      Discussion line 320: "consistent with previous research" - there is a reference missing.

      The quantifications are based on a limited number of tomograms, but there is no statement as to how the specific tomograms were selected. With a variability between replicates and tomograms, a random selection is important.

      If gold fiducials are visible in the tomograms it could be useful to indicate, as they can look similar to lipid droplets to a non-expert reader.

      Suppl. Fig.2: For clarity it would be good not to use the same color arrows to indicate different things in A and B.

      Significance

      The authors of this study demonstrate that cells infected by HSV-1 virus can be investigated by the use of cryoSXT, and use this to show that infected cells have more elongated and interconnected mitochondria, and an enrichment of small vesicles close to the nucleus. They thereby also show that cryoSXT offers a nice resolution for characterizing morphological changes in significant volumes of near native-state cells, and that the method offers a promising throughput for screening of large amounts of cells. However, the study does not really present new biological or technical advances compared to previously published literature, see e.g. Müller et.al. 2012, Duke et.al 2014, Perez Berna et.al. 2016, Groen et.al. 2019, Weinhardt et.al. 2020, Loconte et.al. 2021 (not cryo but demonstrates the advantage of capillaries), Kounatidis et.al. 2020, Scherer 2021 (ref 16 from paper), some of which are also referenced in the current study. The study could thus have profited from a more defined focus and possibly further experiments (live-cell imaging, CLEM, TEM, microtubules or more mechanistically focused) depending on the main interest of the authors. The advantage with the current broad focus (assuming that the main concerns are addressed) is that the study could interest a larger audience, ranging from virology, cell biology and immunology to microscopy and methods development.

      Reviewers expertise

      Electron microscopy, volume EM, CLEM, light microscopy, host-pathogen interactions

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We would like to thank the reviewers for their helpful and constructive comments.

      2. Point-by-point description of the revisions

      Reviewer #1

      This reviewer thought our findings would be of interest to a broad range of scientists from both the centrosome and mitosis fields, but noted some important aspects for improvements.

      Additional Experiments (we number these points for ease of discussion).

        • Figure 3. The reviewer points out that because our analysis of Ana2-∆CC and Ana2-∆STAN mutant proteins was conducted in the presence of endogenous WT protein, we should be more cautious in our interpretation.* We agree and apologise for overstating these findings. We have now rewritten the title and text of this section to be more cautious (p11, para.2)
      1. Figure 5A. The reviewer wonders whether the reduced recruitment of Sas-6 in the presence of Ana2(12A) is due to reduced binding, and they request we test this biochemically. This is our favoured interpretation, but we have been unable to test this biochemically for two reasons. First, although we have successfully purified several recombinant Sas-6 and/or Ana2 fragments (Cottee et al., eLife, 2015), the full-length proteins are poorly behaved (tending to precipitate, likely due to their inherent ability to self-oligomerise). Thus, we have been unable to reconstitute their interaction in vitro*. Second, as we show here, the proteins are normally expressed in embryos at surprisingly low concentrations (~5-20nM), and we can detect no interaction between them in coimmunoprecipitation experiments from embryo extracts (not shown). Indeed, this concentration is so low that Sas-6 does not even appear to form a homo-dimer in the embryo, even though Sas-6 clearly functions as a homo-dimer in centriole assembly (new Figure S4A). We now explain these points, and state that our favoured hypothesis that Ana2(12A) has reduced affinity for Sas-6 (or other core duplication proteins) remains to be tested (p22, para.2).

      2. The Reviewer wonders if all 12 of the potential Cdk1 phosphorylation sites that we mutate in Ana2(12A) are important in vivo, and whether we have tested whether mutating fewer sites (e.g. the two sites [S284/T301] that we show are phosphorylated by Cdk1/Cyclin B in vitro) might be sufficient to recapitulate the Ana2(12A) phenotype. *We have now tested this by mutating just the S284/T301 sites to Alanine [Ana2(2A)], but the results were not very informative (Reviewer Figure 1 [RF1]). Whereas Ana2(12A) is recruited to centrioles for a longer period and to higher levels than WT Ana2 (Figure 4A), Ana2(2A) is recruited to centrioles for a normal period but to lower levels (RF1A,B). The interpretation of this result is complicated because western blots show that Ana2(2A) is also present at lower-levels than normal (RF1B). Thus, it is clear that Ana2(2A) does not recapitulate well the behaviour of Ana2(12A). We have decided not to present this data as it is difficult to interpret and it does not change any of our conclusions.

      3. Figure 6. The reviewer asks whether the 12A mutations impair the interaction with Plk4, influence Plk4’s kinase activity or the ability of Plk4 to phosphorylate Ana2. These are excellent questions but, for the same reasons described in point 2 above, we cannot address them biochemically as we cannot purify well-behaved recombinant full-length Ana2 or active Plk4 in vitro, and both proteins are present at such low levels in the embryo that we cannot detect any interaction between them in embryo extracts. We are working hard to reconstitute in vitro* systems to probe these important points, but it may be sometime before we are able to do so.

      4. Figure 7. The reviewer suggests that the 12D/E phosphomimetic substitutions introduce more negative charge than the putative phosphorylation of Ser/Thr residues and they ask if the Ana2(2D/E) [stated as Ana2(3D/E)] is, like the Ana2(12D/E) mutant, not efficiently recruited to centrioles.* This is a fair comment, but we have not analysed an Ana2(2D/E) mutant because, as described in point 3 above, the Ana2(2A) mutant did not recapitulate well the Ana2(12A) phenotype.

      Minor comments

        • Figure S1. The reviewer requests that we show that the mNG tag on its own is not recruited to centrioles.* We do not show this (as it would create a lot of white space in this Figure), but now state that mNG and dNG do not detectably localise to centrioles (p7, para.1).
        • Figure S4C.* We have included the missing error bars (now Figure S4B).
        • Figure S5A. The reviewer asks about the expression levels of the Ana2(12A) mutant, which are not shown in this Figure. They also state that the expression levels of the transgenes shown in Figure 5A are not similar.* The expression level of Ana2(12A) is shown in Figure S9, as this data was analysed independently of the other mutant proteins shown in Figure S5. We agree that it was overly simplifying the situation to state that the expression levels of WT Ana2-mNG, eAna2(∆CC)-mNG and eAna2(∆STAN)-mNG were “similar” (Figure S5), and we now specifically mention the differences between them (p11, para.3). Reviewer #2

      This reviewer found this a rigorous study that advances our understanding of the regulation of centriole duplication, but raised some minor points.

      Minor Points

      The reviewer requests that we mention the literature describing how Ana2/STIL can influence the abundance and centriolar localisation of Plk4. We apologise for this omission, and have amended our description of this literature in the Introduction to include this point (p3, para.2).

      The reviewer notes that we interpret the ability of the Ana2(12A) mutant to keep incorporating into the centrioles for a longer period as being consistent with our idea that rising levels of Cdk activity during S-phase normally reduce the ability of WT Ana2 to bind to the centriole. They ask us to show how Cdk activity increases over this time-course, and to test whether dampening Cdk has the same effect on Ana2 recruitment (i.e. allows Ana2 to be recruited for a longer period). The time-course of Cdk activation in these embryos has been reported previously (Deneke et al., Dev. Cell, 2016; we present the relevant data from this paper in RF#2A [black line]). This reveals how Cdk activity rises throughout S-phase, which is crucial for our model. To assess the effect of dampening Cdk activity in these embryos we have now analysed the effect of halving the genetic dose of Cyclin B (RF#2B). This perturbation extends S-phase length, but has a complicated effect on the recruitment dynamics of Ana2 (RF#2B). As we would predict, Ana2 is recruited to centrioles for a longer period in these embryos, but it is also recruited more slowly (so it accumulates to lower levels). This is consistent with our hypothesis that Cdk1 activity might first stimulate and then ultimately inhibit the centriolar recruitment of Ana2. The interpretation of this experiment is not straightforward, however, as dampening Cdk1 activity alters Ana2 recruitment dynamics (and many other processes in the embryo) in complicated ways, so we have decided not to include it in the manuscript.

      The reviewer suggests that it would be valuable to show that all 12 of the potential Cdk1 phosphorylation sites in Ana2 can be phosphorylated by Cdk1 in vitro. We think this would not be particularly informative as our hypothesis does not rely on all 12 sites being phosphorylated to generate the Ana2(12A) phenotype. We simply mutate all 12 sites because we don’t know which, if any, are relevant. Thus, showing that some/all of the 12 sites can/cannot be phosphorylated in vitro does not test any hypothesis and would not change any of our conclusions. We now explain our thinking on this in more detail (p12, para.2)

      Other points

      Figure 3. We have corrected the amino-acid numbering mistakes.

      Figure 5Aii. We have changed the x-axis (time) labelling in this and all other Figures.

      Figure Legends. We have tried to eliminate the typos from the Figure legends, and apologise that these errors made it through to the final submitted version of our manuscript.

      Reviewer #3

      This reviewer thought our manuscript would be of great interest to not only the centrosome field but also to cell biologists more generally. Although they had no major concerns, they made a number of suggestions for improvements.

      1. As the reviewer suggests, we now explicitly state that although the Ana2(12A) mutant appears to be largely functional, the overall conformation of the protein may be altered, changing its function in ways we do not appreciate (p21, para.2).

      2. The reviewer suggests we include a multiple sequence alignment of Ana2/STIL proteins to provide more context about the distribution and conservation of the 12 S/T-P sites mutated in Ana2(12A).* This is an excellent idea, and we now include this in a new Figure S6, where we also provide more information about which of these sites have been shown to be phosphorylated in embryo or S2-cell extracts

      3. The reviewer is confused as to why the 12A and 12D/E mutants rescue the ana2-/- mutant flies so well, which suggests that the mechanism we propose here cannot be essential for centriole duplication. We understand this confusion and we now make this point more clearly and explain why we think this occurs in more detail (e.g. p22, para.1). We propose that Cdk normally phosphorylates Ana2 to inhibit its ability to promote centriole duplication, but this phosphorylation does not entirely block this function. So, if all other elements of the system are functional, Ana2(12A) is recruited to centrioles for longer than normal, but this does not dramatically perturb centriole duplication because the many other factors that regulate centriole duplication (such as the pulse of Plk4 recruitment to centrioles [Aydogan et al., Cell, 2020]) still occur normally and are sufficient to ensure that centrioles still duplicate normally. When Ana2 phosphorylation is mimicked [Ana2(12D/E)], the ability of Ana2 to promote centriole duplication is perturbed (but not abolished). This perturbation is lethal in the early embryo—where the centrioles must duplicate in just a few minutes to keep pace with the rapid nuclear divisions. In somatic cells S-phase is much longer, so these cells can still duplicate their centrioles (as we observe) even though Ana2(12D/E) does not function efficiently. As we now explain, this phenotype (being lethal in the early embryo, but not in somatic cells) is a common feature of mutations that influence the efficiency* of centriole and centrosome assembly (p17, para.2).

      4A. The reviewer asks us to comment in more detail on why centrioles do not seem to be elongated in the Ana2(12A) mutant wing disc cells (now Figure S8C), even though we show that Ana2(12A) (Figure 4A), and also Sas-6 (Figure 5), are recruited to centrioles for an abnormally long period. This is an excellent question and, although we do not know the answer, we now discuss this interesting point in more detail (p16, para.1). We think this is likely due to the “homeostatic” nature of centriole growth: in our hands, almost any perturbation that makes centrioles grow for a longer/shorter period, also makes them grow more slowly/quickly, so that they tend to grow to a similar size (Aydogan et al., JCB, 2018; Cell, 2020). This is fascinating, but poorly understood. When we perturb the system by expressing Ana2(12A), both Ana2(12A) and Sas-6 incorporate into centrioles for a longer period, as we predict (Figure 4A and 5A). Unexpectedly, however, Sas-6 is also recruited to centrioles much more slowly. Thus, as so often happens, when we perturb the system so the centrioles grow for a longer time, the centrioles “adapt” by growing more slowly. We do not currently understand why this occurs (although we speculate that Ana2 may also be regulated by Cdk/Cyclins to help recruit Sas-6 to centrioles in early S-phase). In the embryo, where S-phase is very short, this homeostatic compensation is not perfect, and the centrioles appear to actually be shorter than normal. In somatic wing-disc cells, where S-phase is much longer, we suspect that there is more scope for homeostatic compensation and so the centrioles grow to the correct size.

      4B. In this point (also labelled [4] by the reviewer, so we have retained this numbering but labelled the points A and B) the reviewer asks why levels of Ana2(12A) eventually decline at centrioles once the embryos actually enter mitosis. The reviewer notes our rheostat theory, but suggests a discussion of other mechanisms might be interesting. This is a good point, and we agree that the observation that Ana2(12A) levels ultimately still decline at centrioles during mitosis is likely to be important in explaining why centriole duplication is not more dramatically perturbed by Ana2(12A). We now expand our discussion of this point, highlighting that other mechanisms must help to ensure that Ana2 is not recruited to centrioles during M-phase, and discussing the possibility that the receptors that recruit Ana2 to centrioles are themselves inactivated during mitosis by high levels of Cdk activity (p15, para.1). In such a model, the rapid drop in WT Ana2 centriolar levels is due to a combination of switching off Ana2’s ability to bind to centrioles (as we propose here) and switching off the ability of the centrioles to recruit Ana2. For Ana2(12A), only the latter mechanism would operate, so Ana2(12A) levels would start to drop later in the cycle (as the inflexion point at which Ana2 recruitment and loss balances out would be moved to later in the cycle), and these levels would drop more slowly—as we observe.

      • The reviewer is confused to how the Ana2(12D/E) mutant can rescue the mutant phenotype when it is recruited to centrioles so poorly. Ana2(12D/E) is indeed recruited very poorly to centrioles in the experiment shown in Figure 7. However, this experiment had to be conducted in the presence of WT untagged Ana2—as the embryos do not develop in the presence of only Ana2(12D/E). We would predict that WT Ana2 would bind more efficiently to centrioles than Ana2(12D/E) (which appears to behave as if it has been phosphorylated by Cdk/Cyclins, and so cannot be recruited to centrioles efficiently). Thus, in the experiment we show in Figure 7, the Ana2(12D/E) protein is probably being “outcompeted” for binding to the centriole by the WT protein. In somatic cells expressing only* Ana2(12D/E) presumably sufficient mutant protein can be recruited to centrioles to support normal centriole duplication (as it no longer has to compete with the WT protein). We now explain our thinking on this point (p18, para.1).

      • The reviewer wonders whether Ana2(12D/E) may be unable to homo-oligomerize, and this may explain why the protein is not recruited to centrioles efficiently even in the presence of WT protein. This is indeed a possibility, but we think it unlikely as it is widely believed that Ana2/STIL proteins must multimerize to be functional (Arquint et al., eLife, 2015; Cottee et al., eLife, 2015; Rogala et al., eLife, 2015; David et al., Sci. Rep., 2016). As Ana2(12D/E) strongly restores centriole duplication in ana2-/-* mutant somatic cells, it seems unlikely that it cannot multimerize. Nevertheless, we now specifically highlight that the 12D/E (and 12A) mutations might alter the ability of Ana2 to multimerise (p21, para.2).

      We thank the reviewers again for their thoughtful and constructive comments. We hope they will agree that the revised manuscript is now improved and would be appropriate for publication in The Journal of Cell Biology.

      With best wishes,

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript entitled "Centriole growth is not limited by a finite pool of components, but is limited by the Cdk1/Cyclin-dependent phosphorylation of Ana2/STIL" by Steinacker et al. nicely demonstrates that centriole growth in Drosophila embryos is not limited by a finite pool of core centriole components as in other systems. In contrast, they unveiled a specific elevated cytoplasmic diffusion rate of Ana2/STIL towards the end of the S-phase, correlating with the rise of Cdk1/Cyclin activity, that they hypothesize is important for the abrupt stop of centriole growth before mitosis (end of S phase). They found using an Ana2 mutant (12A) that cannot be phosphorylated by Cdk1/Cyclin that this elevated diffusion rate is abrogated, demonstrating that this kinase is involved in this process. The authors further conclude that daughter centrioles grow at a slower rate for an extended period (as followed by SAS-6 incorporation at centrioles in the context of the 12A mutant). Thus, the authors conclude that this novel mechanism ensures that daughter centrioles stop growing at the correct time and propose that it could be part of the explanation why centriole duplication does not occur during mitosis.

      Overall, this is a solid study that is well written and easy to follow. The text and figures are well presented and the quality of the data is convincing. This manuscript would be of great interest not only to the centrosome field but also more generally to cell biologists.

      I do not have major concerns regarding the experiments. However, I would like to propose some minor comments/clarification in order to further improve the manuscript.

      Suggestions for additional improvements:

      My main comments are related to the phosphorylated mutants of Ana2 (12A) and (12D/E).

      1. To study the impact of Cdk1 on Ana2, the authors generated a mutant where 12 potential Cdk1 sites have been replaced by Alanine (12A). Although I acknowledge that all controls were properly done on this mutant and that the 12A protein is functional since it rescues the ana2-/- mutant phenotype, one can still wonder whether this could not affect somehow the overall protein conformation, or structure. Maybe this could simply be stated somewhere in the manuscript.
      2. The authors mentioned that there is evidence that 10 Cdk1 sites in Ana2 are phosphorylated in vivo and they further demonstrate convincingly that 2 of the most conserved sites can be phosphorylated in vitro by Cdk1/CyclinB (Figure S6). Could the authors include the alignment showing the potential phosphorylation residues and highlight the 12 that were mutated and show the overall conservation of these sites? It would be easier to find the residues as from the scheme of Fig. 3A it is not easy to find which residues are mutated (although the information can be found in the method section p. 32).
      3. My main confusion regarding the phosphorylation mutants 12A or 12D/E comes from the fact that both can rescue the ana2-/- mutant phenotype, which indicates that the mutant protein is functional and that somehow these sites are not fully important for centriole duplication or are not solely responsible for this type of regulation. Is this interpretation correct, this is somehow what I take from the end of the discussion p.21? if true maybe it should be a bit more emphasized.
      4. Moreover, I would have expected since centriole growth does not stop abruptly (one could talk about "prolonged" centriole growth) in the 12A mutant that centrioles would be longer. However, this is not the case as shown in Figure S7. One possible explanation would be that even though the centriole growth is extended (looking at SAS6 as a proxy), the slope/rate of incorporation is lower. Could you please comment on this more? I think this is an important point of discussion/interpretation of the results.
      5. The authors nicely show that the 12A mutant, despite similar expression levels as the taggedAna2WT, continued to accumulate at centrioles till NEBD (consistent with the hypothesis that Cdk1 cannot phosphorylate it and thus stops its recruitment). But how can the 12A levels decline at centrioles in mitosis where Cdk1 activity is the highest? This would mean that Cdk1 activity/level regulates Ana2 differentially over time or that other mechanisms might be at play. The authors mention in the discussion the attractive hypothesis of the "rheostat" (p. 20) but maybe a further discussion on an alternative mechanism could be also interesting. Could the fact that the 12A level decreases in mitosis also explain the lack of centriole phenotype if we would imagine that levels at centrioles would stay high? Could the authors comment on this? They mention it briefly (p.14 and p.21) but if they could expand a bit would be great.
      6. I was a bit confused about how the 12D/E mutant that is not recruited efficiently to centrioles could rescue the ana2-/- mutant centrioles? Could the authors comment on this, please?
      7. p.16 still about the 12D/E that is not properly recruited to centrioles even in presence of one WT copy of Ana2 (untagged): The authors conclude that "phosphorylation at one or more of these S/T sites inhibits, but does not completely block, Ana2 recruitment to and/or maintenance at centrioles". Could the mutation also prevent Ana2 homo-oligomerization? In other words, could this result suggest that the 12D/E cannot interact with untagged Ana2WT and be recruited to centrioles? Is it a possibility?

      Significance

      In this manuscript, the authors address two major fundamental questions:

      1. the mechanism that restricts strict cell cycle regulation of centriole duplication
      2. How daughter centrioles grow to the correct size. These questions are very important and this study provides some clues on the mechanisms that can be at play, among which Cdk1/cyclin seems to be involved.

      In addition, this paper raises an interesting point in showing that the core centriole duplication components concentration is as low in human cells as in fast-dividing Drosophila embryos in the range of 5-20nM. This is very interesting as it was commonly thought that embryos would have a stockpile of core components to ensure fast and numerous centriole duplication cycles. Furthermore, they found that these concentrations remained constant using FCS or Pecos, demonstrating that core centriole components concentrations are not rate-limiting for centriole duplication (over time) in this system. Instead, they propose an alternative hypothesis whereby Cdk1/Cyclin phosphorylation of Ana2/STIL would be important to regulate centriole growth and ensured timely duplication (ie no duplication in mitosis, when Cdk1 activity is high).

      In this context, this study would certainly have a broad interest and impact on cell biologists.

      Reviewer's expertise: Centrioles, microtubules, microscopy, cell biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Centriole growth is not limited by a finite pool of components, but is limited by the Cdk1/Cyclin-dependent phosphorylation of Ana2/STIL

      Authors: Thomas L. Steinacker, Siu-Shing Wong, Zsofia A. Novak, Saroj Saurya, Lisa Gartenmann, Eline J.H. van Houtum, Judith R. Sayers, B. Christoffer Lagerholm, Jordan W. Raff

      Centriole biogenesis is a tightly regulated process that occurs once per cell cycle. Defects in this process can lead to the acquisition of abnormal centriole numbers which has been linked to several human diseases. Centriole duplication starts with the assembly of a procentriole on the mother centriole in early S-phase followed by procentriole growth during G2 phase. A big question in the centrosome field is how new procentrioles assemble at the right time and acquire the correct final size.

      In this manuscript, Wong et al. analyse whether the cytoplasmic concentration of several proteins changes during centriole assembly (Asl, Plk4, Ana2, Sas-6, and Sas-4). The authors show that the cytoplasmic concentration of these proteins remains constant during centriole duplication, indicating they are not limiting components for procentriole assembly. Nevertheless, the authors found that Ana2/STIL's cytoplasmic diffusion rate increases before the onset of mitosis, concurrent with an increase in Cdk1/Cyclin activity. Mutation of 10 putative phosphorylation sites in Ana2 prevented the diffusion rate change and enabled centrioles to grow for a longer period. This suggests that phosphorylation of Ana2/STIL by Cdk1/Cyclin could control the period of centriole growth.

      Minor points:

      In the introduction, the authors describe how PLK4 is required to recruit STIL and Sas-6 to promote the formation of the cartwheel during centriole duplication. However, there is also literature describing a role for STIL in regulating PLK4 abundance and localization pattern (i.e ring or dot) at the centriole.

      The authors note that the levels of the Ana2(12A) mutant keep increasing until the onset of mitosis. The authors claim that this phenotype is consistent with the timing of increased Cdk1 activity. It would be interesting to show the increase in Cdk1 kinase activity over the same time-course and test whether dampening Cdk1 has the same effect on Ana2 recruitment.

      While I appreciate detecting in vivo phosphorylation sites can be very challenging, It would be valuable to show the 10 Ana2 phosphorylation sites can be phosphorylated by Cdk1, at least in vitro.

      Other points:

      Figure 3: Amino acids numbers for CC domain are not the same in the figure and in the figure legend.

      Figure 5Aii, the x-axis should be changed to minutes for easier comparison with other figures.

      There are some typos in the figure legends.

      Significance

      This study attempts to address a central question in the centrosome field: how centriole growth is controlled. Although the paper does not provide a detailed mechanistic advance, the authors do provide some evidence against a limited pool of centriole components controlling centriole length, and they are careful not to overstate conclusions. The manuscript is well written and easy to follow. While it is not clear at present how phosphorylation of Ana2 alters its diffusion rate or limits centriole growth, I feel the study will be of interest to members of the centriole community and will stimulate new lines of investigation. Given that the cartwheel stops elongating in S phase in mammalian systems, it is not clear if the mechanism proposed would be conserved. That notwithstanding, I found this to be a rigorous study that advances our understanding of the regulation of the centriole duplication.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Centriole duplication is a conserved pathway that need to be tightly regulated. The key enzyme of centriole assembly is Plk4 which is recruited to the centrioles and undergoes dynamic re-localization from a ring-like pattern around a centriole to a dot-like morphology at the daughter centriole assembly site. This event is central for inducing centriole biogenesis. Plk4 then phosphorylates Ana2/STIL which allows recruitment of Sas-6 to form the cartwheel structure for centriole assembly.

      In the present study, Steinacker, Wong et al. monitor how cytoplasmic concentrations of the key proteins in centriole assembly, Plk4, Asl/Cep152, Ana2/STIL, Sas-6 and Sas-4/CPAP change during the centriole assembly process in the Drosophila embryo by using fluorescence correlation spectroscopy (FCS) and Peak Counting Spectroscopy (PeCoS). They find that their concentrations remain constant with exception of Ana2/STIL of which cytoplasmic diffusion rate increased at the end of S-phase and is dependent on phosphorylation by Cdk1/CyclinB. Phosphorylated Ana2/STIL blocks centriole duplication thus preventing premature initiation of centriole duplication in mitosis.

      Major comments

      The manuscript is interesting and very well written. Most of the experiments are carefully performed. However, there are some important aspects for improvements that are listed below

      Additional experiments:

      • Figure 3: the transgenic flies that were generated here, CC and STAN, still contain wild-type Ana2. So, the authors therefore need remove or dampen their claim that the change in Ana2's cytoplasmic diffusion does not depend on its interaction with Sas-6 (page 11).
      • Figure 5A: is the observed reduced recruitment of Sas-6 by Ana2(12A) due to a decrease in binding affinity? This should also be shown by analyzing protein-protein interactions between Ana2(12A) and Sas-6 biochemically.
      • The authors use an Ana2(12A) mutant which comprises putative Cdk1 phosphorylation sites that have been identified in Mc Lamarrah et al. JCB 2018. However, only three of them were phosphorylated by Cdk1/cyclin B in vitro (Fig. S6). Are all these 12 putative Cdk1 phosphorylation sites important in vivo? Did the authors generate the Ana2(3A) or the S284A/T301A mutants to see whether it can rescue the ana2-/- mutant phenotype similar to the 12A mutant? These might be sufficient to observe the phenotype.
      • Figure 6: is the interaction between Plk4 and Ana2(12A) impaired? Similarly, Plk4 activity and phosphorylation of Ana2(12A) by Plk4
      • Figure 7: Phosphomimetics, in this case 12 amino acid changes, have the disadvantage of introducing more negative charge than the phosphorylated residue. The Ana2/(12D/E)-mNG is not efficiently recruited to centrioles. Is effect also observed for the Ana2/(3D/E) mutant?

      Minor comments

      Figure S1: only mNG-tagged centriolar proteins are shown. An empty mNGtag or an mNG-tagged non-centriolar protein should be shown to exclude that the tag by itself shows centriolar localization or somehow affects the localization

      S4C: Sas6-mNG CPM error bars are missing for the 10min time point

      S5A: What are the expression levels of the Ana2(12A) mutant? The expression levels shown in this Figure are not similar.

      Significance

      Centriole duplication normally begins at the G1/S phase transition. An important question in the field is how premature centriole duplication in mitosis is prevented. The authors used fluorescence correlation spectroscopy (FCS) and Peak Counting Spectroscopy (PeCoS) to study the major conserved proteins in the centriole assembly pathwayq and found that only Ana2/STIL's cytoplasmic diffusion increases at the end of S-phase. It is known from the literature that Cdk1 prevent Plk4-STIL complex assembly in centriole biogenesis by directly competing with Plk4 for the CC domain of Ana2/STIL (Zitouni et al. Curr Biol 26, 1127-1137 (2016). However, Ana2/STIL can also bind to Plk4 via its conserved C-terminal region of STIL (Ohta et al., Cell Reports 11, 2018; McLamarrah et al., J Cell Biol 2018, 217, 1217-1231). The work by Steinacker, Wong et al. suggest that at least in fly embryos, growth of the daughter centriole is regulated though phosphorylation of Ana2 by Cdk1/CyclinB rather than binding. The findings described in this manuscript are interesting for a broad range of scientists from both the centrosome and mitosis fields

      Expertise of the reviewer: centriole biogenesis, structural and numerical centrosomal aberrations in disease

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Point-by-point description of the revisions

      Black: Comments from reviewers

      Green: Answers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Yamamoto and colleagues have investigated the interplay between microtubules (MTs) and actin in positioning the MTOC at "the cell centre". They have developed a novel experimental setup akin to a synthetic cell to study this question. Essentially a cell-sized (15 µm) microwell that is coated in lipid and then tubulin/actin added and the positioning of a MTOC proxy is studied by microscopy. This is a well executed study. These complicated biochemical reconstitutions are the hallmark of Blanchoin and Théry's group, but even so, it's clear that the exact conditions (e.g. tubulin concentration) are fiddly and critical for these experiments to work. The data are clear, well analysed and presented. In brief, the conditions for centring a cytoskeletal network and decentring/polarising it are recapitulated. This is a short, straightforward paper and I found the results to be clear and the authors' interpretation to be well supported by the data.

      Two questions occurred to me as I read the paper: 1. While the setup is reminiscent of a cell, I suspect that the edge/wall of the microwell is much stiffer than the plasma membrane. So a MT that encounters the wall may behave differently in the cell. This would affect the non-actin conditions but possible also the conditions where an actin mesh is present. Maybe my intuition is not even correct, but I think this issue should be discussed in the paper as a potential limitation of the system.

      Author response: We thank the reviewer for this wise comment. Indeed, the deformation of the container may impact the organization of the MT network, the force balance and the final position of the MTOC. We commented this limitation in the revised discussion (page 10 line 31). However, it should be noted that in the presence of a cortical actin network, MTs are much less capable of deforming the cell than in a vesicle or a in cell treated with actin drugs, so our conditions with a cortical actin network are physiologically relevant although the container can not be deformed.

      1. The graphs in 3C and 4G (lesser extent Fig 1) show nicely that the aMTOC position has apparently rested at a steady state. Some representative trajectories are shown in some figures, but not mentioned much in the text. How does the pathlength (cumulative distance) over time compare to the "distance to centre" measurement? Is there more or less travel under the different conditions? From the supplementary videos it looks like there is a difference. An apparent resting position may still represent significant motion, e.g. circling the centre. What does an analysis of tracklength tell us, if anything?

      Author response: We appreciated reviewer’s comment and followed his/her advice. We measured the pathlength (cumulative distance moved) based on the data shown in Figure 3C and 4G. The analysis confirmed that the MTOC was static in the presence of bulk actin network (shown in the new Supplementary Figure 6B). Interestingly, it also showed that the final position adopted by the MTOC in conditions where it could move more freely was also static, as revealed by the saturation of the pathlength after 1 hour. These analyses are shown in the new Supplementary Figure 6B for the centering in the absence of cortical actin, for the non-centering with long microtubules in Supplementary Figure 7E and for the centering with long MTs and a cortical actin network in Supplementary Figure 7E.

      Very minor clerical point: - the first two sentences of the abstract could be clearer. "The position of centrosome, the main microtubule-organizing center (MTOC), is instrumental in the definition of cell polarity. It is defined by the balance of tension and pressure forces in the network of microtubules (MTs)." In the second sentence, "it" and "defined" are confusing. Are you talking about the position of the centrosome or cell polarity?

      Author response: We thank the reviewer for this comment. As the reviewer suggested, this was a confusing description. Accordingly, we corrected the sentence in the abstract for :

      The orientation of cell polarity depends on the position of the centrosome, the main microtubule-organizing center (MTOC). It is determined by the balance of tension and pressure forces in the network of microtubules (MTs).

      Reviewer #1 (Significance (Required)):

      As I see it, the main advance here is in novel experimental setup which has real potential in the field. Existing methods such as MTs inside lipid bubbles are limited, whereas as the microwell method with fabrication methods allows the shape of the "synthetic cell" to be carefully modulated. Tying the results together with cytosim simulations is also a powerful combination. There is a lot of interest in bottom-up reconstitution of cell biological phenomena, especially those that underlie specialised cell processes, e.g. polarity. My expertise: microtubules in a cellular context with limited experience of MT reconstitution assays.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: This manuscript describes the use of an elegant in vitro reconstitution system to study the effect of variations in the organization of the actin network on the positioning of a microtubule organizing center (MTOC) within the cell. By using a reconstituted system the authors are able to specifically study the contribution of the "pushing" forces generated by microtubule (MT) growth, without the confounding influence of other factors, like pulling forces from MT motors. The authors find that a bulk actin networks at sufficient density can impair MTOC displacement, likely a result of the large viscous drag of the MTOC. Next they show that MTOC centering more resilient to changes in microtubule length. Finally they show that an asymmetric actin network can cause asymmetric positioning of the MTOC.

      Major comments: 1) The model the authors put forth is that the growth of long MTs leads to decentering as a result of the MTs slipping along the well edge. The presence of a cortical actin mesh prevents this slipping. Their argument would be strengthened with and analysis of the MT behaviors in the various conditions. For example when discussing MTOC in well without actin...

      "As they grew, they first ensured a proper centering but after an hour, MT elongation and slippage along microwell edges broke the network symmetry and MTs pushed aMTOC away from the center (Figure 1I, J and Supplementary Movie 2)"

      In this movie I don't see evidence of MTs hitting the cortex and sliding on the "short" side of the well relative to the MTOC. An analysis of the behavior of MTs in various circumstances would help link the behavior of MTs to the movement of the MTOC for all of their conditions. What fraction of MTs hit the cortex and remain relatively motionless, what fraction slide, what fraction catastrophe, what fraction turn and follow the curve of the well? And how does this behavior change for microtubules that end up on the short side vs. the long side of the MTOC? This type of analysis would solidify their model for how centering/decentering occurs in the various conditions they test.

      Author response: This is a fair criticism. The possibility to perform fine analysis of MT dynamics is technically limited by the fluorescent background due to free tubulin dimers. It is the reason why classical in vitro assays are monitored in TIRF microscopy, which is not possible here since MTOCs move in 3D in the microwells. In addition, working with higher laser power to increase the signal to noise ratio generates severe photodamages on MTs. Nevertheless, we could visualize MT dynamics and displacements near the edge of the microwells and describe their behavior more precisely than in the previous version of our manuscript. New images and tracking of MT behavior are now reported in the new Figure 4E, 4F and 5G, as well as the new supplementary Figure 4C, 4D, 7B, and 7C. We also replaced the supplementary movie 2 and Figure 1I in order to show more clearly MTs hitting and slipping along the well boundary. In addition, we also characterized the pivoting of MTs around the MTOC and near the edge of the microwell in order to better characterize the effect of cortical actin. This is now shown in the new Figure 4G and 4H as well as in the new Supplementary Figure 7C-D). We found that the changes in MT orientation and position, at the centrosome and at the contact with the microwell, were clearly prevented by the presence of cortical actin.

      2) The authors use simulations to support their in vitro findings. However, their simulations have many more microtubules emanating from the MTOC than their experiment (Looks like about 50 in the cytosim and they state they are aiming for 15-20 in the aMTOCs). Do the simulations still reproduce the behavior of the in vitro system with a similar number of MTs?

      Author response: This is another fair criticism. We addressed this point by performing simulations with 10~30 microtubules (the number of MTs is variable because of MT dynamics) which are more similar to the number of MTs that we obtained in our experimental conditions. Results were consistent with previous simulations with higher number of MTs and are now shown in the new supplementary figures 6E-F, 7G and 8I).

      3) When the actin networks are asymmetric, the authors see decentering of the MTOC towards the side with less actin. However there is still actin on the side where the MTOC will move to and in some of their images it looks pretty think. Is the actin on that side not dense enough to prevent MT sliding along the "cortex"? If so, can they generate less dense, but uniform actin networks on the "cortex", where MTs can slide. Again descriptions of MT behaviors would be useful in understanding what is happening.

      Author response: We thank the reviewer for asking this important question. We followed reviewer’s advice and generated homogeneous and less dense cortex by working at lower concentration of actin (0.5 mM). In such conditions, we could not see the centering effect that was observed with dense cortex. These new data are now shown in the new Supplementary Figure 7I. This effect was also tested with numerical simulations (new Supplementary Figure 7J) which were consistent with the key role played by actin network density for MT network positioning by cortical friction.

      Minor Comments: 1)Title - the current title implies that actin is balancing the forces generated by the MTs. I'm not sure this is a good description of what is shown in the paper.

      Author response: We thank the reviewer for pointing at this issue. We revised the title to:

      Reconstitution of centrosome positioning by the production of pushing forces in microtubules growing against the actin network.

      2)The discussion would benefit from more explanation about how the results of this paper relate to the classic examples of MTOC positioning they cite. How do they envision the actin and MTs interacting in these systems and what new insight have we gained from the experiments in this manuscript.

      Author response: This is a good suggestion. We added some comments in our discussion about the actin network asymmetry in several classical examples of cell polarization and explained how our observations suggest some new interpretation on the role of this asymmetry in the reorganization of forces in the MT network and on the consequential peripheral positioning of the MTOC.

      Reviewer #2 (Significance (Required)):

      Overall, this work is a significant advance in our understanding of the potential mechanisms of MTOC movement in cells via pushing by MT growth. The experimental system they have developed is powerful advance, allowing meaningful MTOC reconstitution experiments to be performed in chambers of approximately cellular size. This is an important contribution to understanding the interaction between microtubule pushing and the actin cortex.

      Reviewer expertise: Cell biology of MTOC assembly and positioning. I do not have the expertise to assess the parameters used to generate their cytosim models.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Review of "The architecture of the actin network can balance the pushing forces produced by growing microtubules" by Yamamoto et al.

      The means by which cells maintain their characteristic cytoskeletal architectures is not well understood. This is in part because there is considerable variation in such architectures with, for example, fibroblasts, neurons, and epithelial cells. It is also in part because the microtubule, actin and intermediate filaments engage in a wide range of mechanical and signaling crosstalk mediated by a wealth of proteins and signaling networks, which further complicates the picture.

      In the current study, Yamamoto take the welcome step of developing a simplified system for assessing the mutual contributions of microtubules and F-actin for general cytoskeletal organization in vitro (specifically, in lipid-lined microwells). This allows them to define basic principles of microtubule-F-actin interactions in the absence of the various confounding factors alluded to above. Using their model, they show that artificial MTOCs (aMTOCs) alone will center but as a complex function of microtubule length (controlled by varying tubulin concentrations). That is, the aMTOCs are randomly positioned with short microtubules, stably centered with intermediate length microtubules, and randomly oriented with very long microtubules (following symmetry breaking).

      They then assess the contributions of F-actin to the centering process. In low concentrations of "bulk" F-actin (ie F-actin distributed throughout the droplet) there is no effect on centering whereas at higher concentrations of bulk F-actin, centering is impaired as is the translocation of the aMTOCs. In the presence of uniform peripheral F-actin, in contrast, aMTOC centering is enhanced, and rendered less sensitive to variations in microtubule length. Finally, when the authors contrive a situation in which the peripheral F-actin is non-uniform (by lowering the concentration of actin and adding alpha-actinin, which creates a peripheral ring of F-actin with (I think) relatively less F-actin within the ring), the aMTOCs position themselves within the ring.

      Finally, the authors extend their results with simulations that indicate that the various behaviors can be explained by a combination of friction, pushing and slippage.

      This study is fascinating and will be of general interest to anyone who seeks to understand the contributions of mechanical forces to cytoskeletal organization in a minimal system. I have only minor concerns; these are listed below.

      1. Some of the terminology was a little confusing. The authors introduce the term "inner zone" (pg. 8) without defining it. From the context, it seems like they are talking about the approximate center of the ring of peripheral F-actin. If so, why not just do away with the term "inner zone" and refer to the ring center. If it isn't the ring center, then more explanation is needed as to what the inner zone actually is.

      Author response: We apologize for this confusion and appreciate reviewer’s comment. We coined earlier the term “actin inner zone” to define the central cytoplasmic region in cells that is devoid of actin filament (Jimenez et al., Current Biology, 2021). Because it was a confusing point, we clarified this in the revised version of the manuscript (Page 8, Line 20). What we would like to call the “inner zone” is the region inside of the actin cortex. The definition of this zone and of its geometrical reference points were also pictured more precisely in the new Supplementary Figure 9B.

      1. It is not clear from the text or the images if the region within the F-actin ring has less F-actin, more F-actin, or the same amount of F-actin as the region outside the F-actin ring. This point should be clarified, as it makes a big difference in the interpretation of the findings.

      Author response: We apologize for this lack of clarity. In the revised version of our manuscript, we plotted a line scan intensity profile of the actin fluorescence (new Supplementary Figure 9B). It showed that the region within the actin inner zone contained much less actin than in the cortex. This is consistent with our interpretation of a region-selective pattern of friction acting on microtubules.

      1. Ideally, the authors would include manipulations in which the high concentration of peripheral F-actin is combined with alpha-actinin because, as currently presented, the authors are drawing conclusions from changing two variables at once (ie going from a high concentration of peripheral F-actin to a lower concentration with added alpha-actinin). Thus, the authors cannot cleanly distinguish between effects that arise from F-actin asymmetry versus the presence of an F-actin crosslinker. Since the crosslinking is likely to change the mechanical properties of the peripheral F-actin network, this point should at least be addressed in the text, if not by experiments.

      Author response: We are not sure to fully understand the reviewer’s point. We don’t understand how the crosslinking of a symmetric actin network could break the symmetry of the MT network and force its off-centering. The opposite is clearer to us. A homogeneous and loose actin network can allow MT gliding and MTOC off-centering (like in in Supplementary Figure 7J). The mechanical reinforcement of this network by crosslinkers could indeed resist gliding. But the consequence of this resistance would be similar to the consequence of a dense network: a more robust centering (like in Figure 4). So we don’t understand how the crosslinking by alpha-actinin, rather than the asymmetry of the actin network, could be at the origin of the off-centering we observed. In addition the off-centering of the MTOC was systematically aligned with the asymmetry of the actin network, so both parameters were clearly connected.

      Reviewer #3 (Significance (Required)):

      This is an elegant, well-designed study that provides a clear description of how basic mechanical forces can contribute to cytoskeletal organization in a simplified model system.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Review of "The architecture of the actin network can balance the pushing forces produced by growing microtubules" by Yamamoto et al.

      The means by which cells maintain their characteristic cytoskeletal architectures is not well understood. This is in part because there is considerable variation in such architectures with, for example, fibroblasts, neurons, and epithelial cells. It is also in part because the microtubule, actin and intermediate filaments engage in a wide range of mechanical and signaling crosstalk mediated by a wealth of proteins and signaling networks, which further complicates the picture.

      In the current study, Yamamoto take the welcome step of developing a simplified system for assessing the mutual contributions of microtubules and F-actin for general cytoskeletal organization in vitro (specifically, in lipid-lined microwells). This allows them to define basic principles of microtubule-F-actin interactions in the absence of the various confounding factors alluded to above. Using their model, they show that artificial MTOCs (aMTOCs) alone will center but as a complex function of microtubule length (controlled by varying tubulin concentrations). That is, the aMTOCs are randomly positioned with short microtubules, stably centered with intermediate length microtubules, and randomly oriented with very long microtubules (following symmetry breaking).

      They then assess the contributions of F-actin to the centering process. In low concentrations of "bulk" F-actin (ie F-actin distributed throughout the droplet) there is no effect on centering whereas at higher concentrations of bulk F-actin, centering is impaired as is the translocation of the aMTOCs. In the presence of uniform peripheral F-actin, in contrast, aMTOC centering is enhanced, and rendered less sensitive to variations in microtubule length. Finally, when the authors contrive a situation in which the peripheral F-actin is non-uniform (by lowering the concentration of actin and adding alpha-actinin, which creates a peripheral ring of F-actin with (I think) relatively less F-actin within the ring), the aMTOCs position themselves within the ring.

      Finally, the authors extend their results with simulations that indicate that the various behaviors can be explained by a combination of friction, pushing and slippage.

      This study is fascinating and will be of general interest to anyone who seeks to understand the contributions of mechanical forces to cytoskeletal organization in a minimal system. I have only minor concerns; these are listed below.

      1) Some of the terminology was a little confusing. The authors introduce the term "inner zone" (pg. 8) without defining it. From the context, it seems like they are talking about the approximate center of the ring of peripheral F-actin. If so, why not just do away with the term "inner zone" and refer to the ring center. If it isn't the ring center, then more explanation is needed as to what the inner zone actually is.

      2) It is not clear from the text or the images if the region within the F-actin ring has less F-actin, more F-actin, or the same amount of F-actin as the region outside the F-actin ring. This point should be clarified, as it makes a big difference in the interpretation of the findings.

      3) Ideally, the authors would include manipulations in which the high concentration of peripheral F-actin is combined with alpha-actinin because, as currently presented, the authors are drawing conclusions from changing two variables at once (ie going from a high concentration of peripheral F-actin to a lower concentration with added alpha-actinin). Thus, the authors cannot cleanly distinguish between effects that arise from F-actin asymmetry versus the presence of an F-actin crosslinker. Since the crosslinking is likely to change the mechanical properties of the peripheral F-actin network, this point should at least be addressed in the text, if not by experiments.

      Significance

      This is an elegant, well-designed study that provides a clear description of how basic mechanical forces can contribute to cytoskeletal organization in a simplified model system.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This manuscript describes the use of an elegant in vitro reconstitution system to study the effect of variations in the organization of the actin network on the positioning of a microtubule organizing center (MTOC) within the cell. By using a reconstituted system the authors are able to specifically study the contribution of the "pushing" forces generated by microtubule (MT) growth, without the confounding influence of other factors, like pulling forces from MT motors. The authors find that a bulk actin networks at sufficient density can impair MTOC displacement, likely a result of the large viscous drag of the MTOC. Next they show that MTOC centering more resilient to changes in microtubule length. Finally they show that an asymmetric actin network can cause asymmetric positioning of the MTOC.

      Major comments:

      1) The model the authors put forth is that the growth of long MTs leads to decentering as a result of the MTs slipping along the well edge. The presence of a cortical actin mesh prevents this slipping. Their argument would be strengthened with and analysis of the MT behaviors in the various conditions. For example when discussing MTOC in well without actin...

      "As they grew, they first ensured a proper centering but after an hour, MT elongation and slippage along microwell edges broke the network symmetry and MTs pushed aMTOC away from the center (Figure 1I, J and Supplementary Movie 2)"

      In this movie I don't see evidence of MTs hitting the cortex and sliding on the "short" side of the well relative to the MTOC. An analysis of the behavior of MTs in various circumstances would help link the behavior of MTs to the movement of the MTOC for all of their conditions. What fraction of MTs hit the cortex and remain relatively motionless, what fraction slide, what fraction catastrophe, what fraction turn and follow the curve of the well? And how does this behavior change for microtubules that end up on the short side vs. the long side of the MTOC? This type of analysis would solidify their model for how centering/decentering occurs in the various conditions they test.

      2) The authors use simulations to support their in vitro findings. However, their simulations have many more microtubules emanating from the MTOC than their experiment (Looks like about 50 in the cytosim and they state they are aiming for 15-20 in the aMTOCs). Do the simulations still reproduce the behavior of the in vitro system with a similar number of MTs?

      3) When the actin networks are asymmetric, the authors see decentering of the MTOC towards the side with less actin. However there is still actin on the side where the MTOC will move to and in some of their images it looks pretty think. Is the actin on that side not dense enough to prevent MT sliding along the "cortex"? If so, can they generate less dense, but uniform actin networks on the "cortex", where MTs can slide. Again descriptions of MT behaviors would be useful in understanding what is happening.

      Minor Comments:

      1) Title - the current title implies that actin is balancing the forces generated by the MTs. I'm not sure this is a good description of what is shown in the paper.

      2) The discussion would benefit from more explanation about how the results of this paper relate to the classic examples of MTOC positioning they cite. How do they envision the actin and MTs interacting in these systems and what new insight have we gained from the experiments in this manuscript.

      Significance

      Overall, this work is a significant advance in our understanding of the potential mechanisms of MTOC movement in cells via pushing by MT growth. The experimental system they have developed is powerful advance, allowing meaningful MTOC reconstitution experiments to be performed in chambers of approximately cellular size. This is an important contribution to understanding the interaction between microtubule pushing and the actin cortex.

      Reviewer expertise: Cell biology of MTOC assembly and positioning. I do not have the expertise to assess the parameters used to generate their cytosim models.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Yamamoto and colleagues have investigated the interplay between microtubules (MTs) and actin in positioning the MTOC at "the cell centre". They have developed a novel experimental setup akin to a synthetic cell to study this question. Essentially a cell-sized (15 µm) microwell that is coated in lipid and then tubulin/actin added and the positioning of a MTOC proxy is studied by microscopy. This is a well executed study. These complicated biochemical reconstitutions are the hallmark of Blanchoin and Théry's group, but even so, it's clear that the exact conditions (e.g. tubulin concentration) are fiddly and critical for these experiments to work. The data are clear, well analysed and presented. In brief, the conditions for centring a cytoskeletal network and decentring/polarising it are recapitulated. This is a short, straightforward paper and I found the results to be clear and the authors' interpretation to be well supported by the data.

      Two questions occurred to me as I read the paper: * While the setup is reminiscent of a cell, I suspect that the edge/wall of the microwell is much stiffer than the plasma membrane. So a MT that encounters the wall may behave differently in the cell. This would affect the non-actin conditions but possible also the conditions where an actin mesh is present. Maybe my intuition is not even correct, but I think this issue should be discussed in the paper as a potential limitation of the system. * The graphs in 3C and 4G (lesser extent Fig 1) show nicely that the aMTOC position has apparently rested at a steady state. Some representative trajectories are shown in some figures, but not mentioned much in the text. How does the pathlength (cumulative distance) over time compare to the "distance to centre" measurement? Is there more or less travel under the different conditions? From the supplementary videos it looks like there is a difference. An apparent resting position may still represent significant motion, e.g. circling the centre. What does an analysis of tracklength tell us, if anything?

      Very minor clerical point: * the first two sentences of the abstract could be clearer. "The position of centrosome, the main microtubule-organizing center (MTOC), is instrumental in the definition of cell polarity. It is defined by the balance of tension and pressure forces in the network of microtubules (MTs)." In the second sentence, "it" and "defined" are confusing. Are you talking about the position of the centrosome or cell polarity?

      Significance

      As I see it, the main advance here is in novel experimental setup which has real potential in the field. Existing methods such as MTs inside lipid bubbles are limited, whereas as the microwell method with fabrication methods allows the shape of the "synthetic cell" to be carefully modulated. Tying the results together with cytosim simulations is also a powerful combination. There is a lot of interest in bottom-up reconstitution of cell biological phenomena, especially those that underlie specialised cell processes, e.g. polarity.

      My expertise: microtubules in a cellular context with limited experience of MT reconstitution assays.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Our response to the reviewers comments as well as our revision plan has been included as a separate file in the submission.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      This reviewer did not leave any comments

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Dantas and colleagues use mechanical confinement assays to demonstrate that both mitotic entry and the timing of prophase are sensitive to mechanical perturbations. They identify a novel mechanism that fine tunes the dynamics of cyclin B1 nuclear import during prophase whereby acto-myosin contractility leads to nuclear membrane unfolding, cPLA2 recruitment and cyclinB1 nuclear import. They show how mechanical confinement can accelerate this mechanism by independently inducing nuclear unfolding, and that this can go on to induce defects in mitotic spindle assembly and chromosome segregation.

      Major

      This work contains an impressive amount of data including some technically challenging experiments. The conclusions are convincing and for the most part well supported by experimental evidence (for exceptions see below). Appropriate controls are presented and statistical analysis is adequate. The methods are mostly described well but some important details are omitted (see below). The methods and figure legends would benefit from expansion, particularly in describing how the images presented relate to quantification in graphs. Although generally the manuscript is well written, there are parts when both the experimental logic and conclusions are hard to follow, particularly in the description of figures 1 and 5 (see below for details). With a large amount of data, including important experiments relegated to supplementary figures, this work would benefit from expansion into a longer article format to allow for more clarity. Particularly:

      • Figure 1A-C: here the authors show that non-adherent cells only enter mitosis when confined. There is some key information lacking here, including the experimental timeframe. How long were the cells plated on pll-peg before imaging and for how long were they imaged? In 1C, 80% of confined cells enter mitosis, which implies that cells were filmed for a relatively long time (given an average cell cycle length of 20-24 hours). Unless of course cells were previously synchronised in G2 but the authors do not state that this is the case. In the legend it states that images were acquired every 20s. Imaging cells for 20+ hours every 20s with multiple zs is likely to have a very deleterious effect on cells and to disrupt mitotic entry itself. The authors need to explicitly explain the experimental set-up used to generate the graphs in figure 1. In 1C, it would also be good to see the equivalent adherent control included in the graph (ie % cells that enter mitosis on fibronectin in the same timeframe). The authors use the data in 1A-C to claim that 'the G2-M transition requires contact with external stimuli'. However they haven't shown this, only that non adherent cells don't enter mitosis. To show that the G2/M transition is affected, they need to look at the cell cycle phase of cells on PLL-PEG and show that cells become arrested specifically in G2.
      • Figure 5: The explanation of the conclusions here was hard to follow. It's not immediately clear why a faster prophase would lead to chromosome attachment delays in metaphase or segregation errors in anaphase since these events occur only after NEP. I think the authors' hypothesis is that a faster prophase results in less time for centrosome separation and that this is responsible for later spindle defects but this is not very clearly stated. If this is the case, then one might expect cells in which centrosome separation is delayed to also be the cells with lagging chromosomes. Did the authors observe such a correlation? It's also not clear why the authors expected confinement to rescue the spindle defects imposed by STLC treatment (supp figure 5). An alternative hypothesis that the authors neglect to mention is that faster cyclinB1 entry into the nucleus could also induce defects through changes to nuclear events such as chromosome condensation? Did they also see any changes to the rate of chromosome condensation in the confined prophase? Either way, the authors should explain more clearly in the text what they think is happening here.

      Minor

      • No reference is cited for the endogenous tagged CyclinB1 RPE1 line nor are any details about its construction given. Has this cell line been previously published by the Pines lab? Are one or both alleles tagged? N or C terminus?
      • Figure 1C: presumably n in this case is number of experiments, not cells. How many cells were analysed in each case?
      • Figure 1H. Why do the graphs have different scales on the x axis? Where does 101+-12s for confined cyclin B translocation mentioned in the text come from? From the graph, it looks longer than this?
      • Figure 3 J, K. Confinement is able to rescue the effect of Y27 on cyclin B dynamics but not shROCK1. Why this difference? The authors should discuss this discrepancy in the text.

      Significance

      This work identifies a novel mechanical mechanism that regulates the timing of cyclin B1 nuclear import in early mitosis. The role of nuclear unfolding in controlling cyclinB1 import is particularly interesting. How important this new mechanism will be in controlling the duration of prophase or mitotic fidelity in a 'normal' mitosis within a tissue is not yet clear. However, it raises many intriguing questions about how cells' mechanical environment could impact mitotic entry, which could be relevant to disease situations where mechanics is altered such as fibrosis or cancer. The work is likely to be of interest to a wide range of cell and molecular biologists including those interested in cell cycle, mitosis, mechano-biology and nuclear biology.

      I am a cell biologist working on mitosis and the cell cycle.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Dantas and colleagues report that confinement is sufficient to restore G2/M transition in cells than can't adhere to their matrix. Exploring further the mechanisms involved, they show that confinement (dynamic cell compression) stimulates nuclear import of cyclin B1 and nuclear envelope permeability using cells in 2D culture. The authors observed that actomyosin contractility increases NE tension in cells preparing for prophase, leading to an increase in nuclear translocation of cyclin B1. However, a few inconsistencies between the data and the conclusion make the current report too preliminary for publication. It may require significant additional work to consolidate the authors' model.

      • The specific contribution of Nuclear Envelope tension. The authors conclude that confinement acts through increasing NE tension, although confinement may affect cytoplasmic signaling, which could contribute to G2/M transition. The authors should test whether compressing the nucleus versus compressing the cytoplasm have distinct effects on cyclin B1 nuclear translocation and G2/M, as it has been done by others when addressing nuclear mechanosensitive mechanisms (Elosegui-Artola et al. or Lomakin et al.). To consolidate their model, the authors should also test whether decreasing NE tension (independently of actomyosin tension) has opposite effect on G2/M (for example using LBR overexpression). Increase in nuclear membrane tension has been shown to trigger cPLA2 recruitment to the NE (Enyeidi et al, 2013; Lomakin et al. 2020), although the authors show here that confinement does not induce cPLA2 recruitment (but still increases NE tension figure 4G) in the absence of Rock activity or when the LINC complex is disrupted. This is surprising considering that confinement should increase NE tension independently of actomyosin contractility and should increase cPLA2 recruitment at the NE, unless in this case cPLA2 recruitment is not mediated by an increase in NE tension.
      • NPC transport versus NE permeability. The authors suggest that confinement increases cyclin B1 transport via NPC-mediated transport and rule out that confinement may affect NE permeability based on the absence of NE rupture using the INM marker lap2. However, the sample size for this observation is missing and NE permeability could be altered even in the absence of major INM rupture observed by confocal. The authors should use a reporter of nuclear permeability (fluorescent cytoplasmic marker or nuclear marker as previously used by Denais et al or, 2016 or Raab et al., 2016) to make sure that NE permeability is not affected by confinement. In addition, NPC function should be tested in parallel with other fluorescent reporter (such as NLS-GFP constructs) to test whether global NPC-mediated transport is changed during prophase (with or without confinement).
      • Effect of confinement on cyclin B transport (NEP) in adherent cells. In figure 1D, we can see that confinement enhances cyclin B1 nuclear translocation in cells adhering on fibronectin. Although it is unclear whether confinement has a significant effect in other figures, for example in figure 2F: DMSO is not significantly different from confiner+CDKi (same thing in 3i and 3j with Rock inhibitor and Kash construct). In these figures the untreated+confiner (or control in 3j) is missing, and the absence of difference between treated+confiner and control is puzzling. Either there is no difference between confiner and CDKi+confiner and it means there is no difference between control and confiner (surprising considering figure 1D); or there is a difference between CDKi+confiner and confiner, indicating that CDK inhibition affects confinement-induced cyclin B import. Both possibilities suggest that the authors should significantly revisit their model. In any case, all control (untreated, treated +/- confiner should be in all figures to avoid any misunderstanding).
      • Consequences of cPLA2 recruitment at the NE. The authors state that "Active cPLA2 then stimulates actomyosin contractility creating a positive feedback loop" But the NE is already unfolded and distance between NPR is increased before cPLA2 recruitment. Does PLA2 inhibition affect nuclear irregularity (or distance between NPC)? Or does cPLA2 impact cyclin B1 transport via a distinct mechanism? Did the author analyze CDK1 phosphorylation in presence of PLA2 inhibitor?
      • Robustness of the main observation. On page 4, the authors report that cells enter mitosis after 140 sec (+/- 80 sec) of confinement, although in the example showed in figure 1b, the cell enters at least 420 sec min after confinement, as we can see that the cell is already confined -420 sec (compressed shape) and NEP occurs at 0. Did the author showed a cell that was not included in their statistics? This would be very surprising considering the very low sample size used for this experiment (n=6 and 10). In addition, many observations have been made on small sample size (n=6 for figure 1) or/and not from independent experiments. The authors should increase their sample size and compare results from independent experiments to consolidate their model.
      • 2h shows nuclear signal (cyclin in grayscale), while 2e does not, why?
      • starting point to quantify cyclin entry is the lowest intensity, which may depend on many factors (and could be affected by experimental design). It would be necessary to have synchronized cells to homogenize the starting point of these experiments.
      • DN-KASH have been transiently transfected for single cell experiments, how does the authors unsure that cell observed are transfected? Does it have a fluorescent tag, if so which one?
      • "requires contact with external stimuli" or "that mechanical confinement is sufficient to overcome the lack of external stimuli." (page 4): external stimuli is vague here and it could be better to replace it with a more specific description

      Significance

      While the physiological relevance of these findings remain to be determined, the authors report an interesting observation that could have a significant impact in the field. The authors do not comment the potential overlap of their findings with other reports involving the LINC complex (Booth et al., ELife) or CDK-mediated actin remodeling (Ramanathan et al., NCB 2015) during prophase.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Unlike other cell organelles, mitochondria contain a small fraction of their genetic information. However, most of the genetic information about mitochondrial proteins is still in the cell's nucleus and the localization of the respective proteins to mitochondria is facilitated by localized translation of their mRNAs. In turn, the mRNA localization to the mitochondria is partly due to the co-translational association, via the mitochondrial target sequence (MTS) of the nascent peptide.

      The manuscript "Mitochondrial mRNA localization is governed by translation kinetics and spatial transport" investigates the mechanisms of mRNA transport and attachment to mitochondria. Concerning mitochondria-localized mRNAs, two types of mRNAs have been distinguished before: mRNAs that are always attached to the mitochondrium (called "constitutively binding" by the authors) and mRNAs that become "sticky" only under certain conditions (called "conditionally binding" by the authors). Modeling the corresponding cellular processes biophysically, the authors infer that yeast cells exercise control over the localization of mRNA (and consequently over their metabolism) in two ways: via varying the mitochondrial volume fraction, and via varying the speed of translation elongation. Data from previously published genome-wide measurements of mRNAs that localize constitutively and conditionally via their MTS in budding yeast S. cerevisiae were used to investigate these mechanisms.

      The manuscript is very well written and the analysis is of high quality. It starts with an introduction that thoroughly reviews many facets around the conducted research and briefly, but self-consistently, summarizes the current knowledge regarding mitochondrial localization of mRNAs. Next, the consequences of the modeling work (presented in the "methods"-section) are explored in the "Results"-section, which contains meaningful and instructive figures and explanations. The manuscript concludes with a comprehensive evaluation of the consequences of the conducted research. All in all, there are only very few minor changes that could be considered.

      Content-wise, we suggest:

      The modeling of translation kinetics is pretty coarse-grained, using only an average elongation rate per amino acid. Much work in this field was done using totally antisymmetric exclusion principle (TASEP)-based models (e.g. MacDonald, J.H. Gibbs, A.C. Pipkin: Kinetics of biopolymerization on nucleic acid templates; Duc, Saleem, Song: Theoretical analysis of the distribution of isolated particles in totally asymmetric exclusion processes: Application to mRNA translation rate estimation). Perhaps this work can be mentioned, and furthermore, the consequences of inhomogeneity of elongation rate for different codons and amino acids could be explored or at least discussed. In particular, this could shed light into the question if ribosome interference and tRNA charging times have any impact on mitochondrial mRNA localization.

      Thank you to the reviewer for pointing us to these relevant papers. As suggested, we have added a paragraph to our Discussion that mentions this work and discusses the possible implications of inhomogeneous elongation along mRNA sequences. We find this suggestion (and the similar one made by the other reviewer) to explore inhomogeneous elongation particularly encouraging, because we are in the early stages of actively pursuing such work. We feel that beyond discussion, exploring the consequences of inhomogeneous elongation is beyond the scope of this work because significant further experimental work would be needed to quantify the impact of specific sequences on translation progress.

      To our Discussion, we have added the following paragraph.

      "In this work our quantitative model assumed uniform ribosome elongation rates along mRNA transcripts. In the presence of ribosome interactions, such dynamics can lead to both uniform and non-uniform ribosome densities and effective elongation rates along the transcript (MacDonald et al., 1968; Duc et al., 2018). With these uniform ribosome elongation rates, previous theoretical results suggest that collisions will be rare (Duc et al., 2018). However, elongation may not be homogeneous along an mRNA transcript, due to factors such as tRNA availability (Varenne et al., 1984), boundaries between protein regions (Thanaraj and Argos, 1996), amino acid charge (Charneski and Hurst, 2013), and short peptide sequences related to ribosome stalling (Sabi and Tuller, 2017). We have found that slow (homogeneous) elongation facilitates mitochondrial mRNA localization, by providing time for MTS maturation, diffusive search, and to maintain binding-competent MTS-mediated mRNA binding to mitochondria. We expect that inhomogeneities in elongation rate along mRNA could either enhance or reduce mitochondrial mRNA localization, controlled by whether slower elongation is in regions that favor longer MTS exposure. For example, a ribosome stall site following full MTS translation could provide more time for MTS maturation and facilitate mitochondrial localization. Future experimental work could identify such stalling sequences and point towards how modeling can improve understanding of sequence impact on localization."

      Ribosome occupancy data from Arava used to infer translation parameters. But there are more recent data sets based on ribosome profiling. Any reason for not using the more recent data?

      We thank the reviewer for bringing up this important point. Our text describing the origin of data for ribosome occupancy in the inset of Figure 2A lacked a citation to the dataset used, and we agree that more recent ribosome occupancy datasets are more appropriate. For the cumulative distributions of ribosome occupancy shown in the inset of Figure 2A, we used the ribosome occupancy data from Zid and O'Shea from 2014. The Arava data from 2003 was used for the cumulative distributions of Figure S1, to show that the similarity between conditional and constitutive genes in the inset of Figure 2A was present in more than a single dataset.

      We have clarified the origin of the ribosome occupancy data in the text.

      In the text description of the inset of Figure 2A, we now include a direct citation of Zid and O'Shea from 2014.

      "These measurements (Zid and O'Shea, 2014) indicate that conditional and constitutive genes have similar distributions of ribosome occupancy (Fig. 2A, inset; see Fig. S1 for similar distributions of conditional and constitutive gene ribosome occupancy derived from (Arava et al., 2003))."

      We also added a citation of Zid and O'Shea to the caption describing the inset of Figure 2A.

      "Inset is cumulative distribution of ribosome occupancy (Zid and O’Shea, 2014), showing ribosome occupancy and β have similar distributions. "

      To determine the translation parameters in our quantitative model, we applied the datasets of Couvillion et al from 2016 for relative protein per mRNA measurements and Zid and O'Shea from 2014 for ribosome occupancy measurements, combined with individual measurements from Morgenstern et al from 2016 and Riba et al from 2019. How these datasets and measurements are used is described in the Methods subsection “Calculation of translation rates”. In addition to the citations in the methods, we have added citations to the briefer description in the Results section.

      "Using protein per mRNA and ribosome occupancy data (Couvillion et al., 2016; Morgenstern et al., 2017; Zid and O’Shea, 2014; Riba et al., 2019), we estimated the gene specific initiation rate kinit and elongation rate kelong for 52 conditional and 70 constitutive genes (see Methods)."

      The effect of the mitochondrial volume fraction on mRNA localization is investigated with a diffusive model. However, the authors make a two dimensional Ansatz for the cell and mitochondrion while it would seem more natural to assume diffusion in three spatial dimensions, as the cell and mitochondria are both three dimensional objects and diffusion strongly depends on the number of dimensions it occurs in. Why was that Ansatz made and why is it justified?

      Our diffusion model is in fact three-dimensional, rather than two dimensional. Specifically, we treat the search process as occurring in a three-dimensional cylinder, whose cross-section is shown in Figure 1D. We have added to Figure 1D to further describe how three-dimensional cylinders represent the mitochondrial proximity in the cell.

      In the Results, we now write:

      “Specifically, we treat the geometry as a sequence of concentric three-dimensional cylinders, each representing an effective region surrounding a tubule of the mitochondrial network. Figure 1D shows a two-dimensional cross-sectional view of these cylinders. The innermost cylinder represents a mitochondrial tubule…”

      We have also clarified the caption of Figure 1D to include:

      "Schematic of mRNA diffusion in spatial model, shown in cross-section. The cytoplasmic space is treated as a cylinder centered on a mitochondrial cylinder: the three dimensional volume extends along the cylinder axis (not shown)."

      The range of variability in the localized fraction +/- CHX is smaller in the experiment compared to the model (Fig. 4B, C). What could be the rationale?

      We agree that the variability in localized fraction from applying CHX is smaller in the experiment (Figure 4C) in comparison to the model (Figure 4B). Our model uses translation parameters (initiation and elongation rates) that are derived from experimental measurements that are expected to be quite noisy. We expect that this noise in the model parameters will expand the range of localization changes predicted by the model for CHX application.

      In l. 417, the authors remark that "constitutively localized mRNAs are on average longer [...] than conditionally localized mRNAs." Yet constitutively localized mRNAs seem to have higher localized fraction than conditionally localized mRNAs. This is somewhat surprising. While it's clear that a higher diffusivity would be compatible with a faster response time of shorter, conditionally-localized mRNAs, it is not clear how the longer, less diffusive mRNAs would have a higher localization fraction. Perhaps the authors can clarify this point.

      The reviewer is correct that experimental measurements show that constitutively-localized genes are, on average, longer than conditionally-localized genes. In our quantitative model, we assume the mRNA of all genes have the same diffusivity. We have used the same diffusivity for different genes because experimental measurements suggest that mRNA length and the number of translating ribosomes on an mRNA do not substantially impact mRNA diffusivity. In our Methods section, we have added citations to papers indicating lack of dependence of mRNA diffusivity on mRNA length.

      "Simulated mRNA have a diffusivity of 0.1 𝜇m2/s. This diffusivity remains constant across genes and mRNA states, consistent with experimental measurements showing little dependence of mRNA diffusivity on mRNA length (Calderwood et al., 2016) or number of translating ribosomes (Wang et al., 2016)."

      We have additionally clarified the part of our Discussion where we explain the distinction of our results from proposals based on differential mRNA diffusion speed.

      "Lower occupancy was proposed to drive mRNA localization through increased mRNA mobility of a poorly loaded mRNA (Poulsen et al., 2019), as more mobile mRNA could more quickly find mitochondria when binding competent, increasing the localization of these mRNA. By contrast, our results imply an alternate prediction – that translational kinetics lead to enhanced localization of longer mRNAs, due to the increased number of loaded ribosomes bearing a binding-competent MTS. Indeed, constitutively localized mRNAs are on average longer than conditionally localized mRNAs."

      Minor formal changes would be:

      Setting the expressions of the fraction in the binding-competent state in l. 118 and the faction of the mRNA-accessible volume in l. 123 in normal math-environments instead of the inline-environment since they are of key importance to the following discussion.

      These two equations (now equations (1) and (2)) are set as distinct equations that are now referred to by their equation numbers later in the manuscript.

      l. 414 contains the verb "vary" twice

      Thank you to the reviewer for pointing out this redundancy, the sentence now reads

      "Translation kinetics can widely vary between genes ... "

      l. 438 lacks an "h" in the word mitochondria

      Thank you to the reviewer for pointing this out, this spelling error has been corrected. The sentence now reads "all mRNA transcripts studied would be highly localized to mitochondria in all conditions."

      Reviewer #1 (Significance (Required)):

      All in all, this is a strong manuscript that contains solid, simple but meaningful and by no means oversimplified models with impactful consequences on the understanding of mitochondrial mRNA localization. Furthermore, it is likely that the approach applies to other cellular compartments like the ER. The research is explained in a remarkably clear and focussed style which makes it easy to follow and meanwhile succeeds in not omitting any details.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Arceo et al. have developed a stochastic, quantitative model of mitochondrial targeting sequence (MTS)-mediated mRNA localization to mitochondria in yeast. They use this model to investigate the role of translation- and diffusion kinetics in controlling mitochondrial mRNA localization of conditional as well as constitutional genes.

      Most importantly, they find that neither mRNA diffusivity nor ribosome density alone are sufficient to account for the differences in localization that were experimentally observed for the two types of genes. Therefore, they implement an MTS maturation time into their model and find that they can now predict gene specific localization rates. Based on these observations, the authors conclude that yeast cells can regulate the localization of mRNAs to mitochondria through (controlling mitochondrial volume fractions and) differences in translation kinetics, which adjust the exposure time and numbers of mature MTSs that are presented on the mRNP and convey binding-competence.

      Major comments:

      Overall, the manuscript is well written and the conclusions are convincing. The underlying assumptions of the model make sense, but I have no background in modelling and can therefore only comment on the RNA biology aspects and general comprehensibility of the work.

      • The authors calculate gene-specific translation initiation and elongation rates to model localization on different transcript classes. In this context,

      (i) They use a single decay rate to estimate trajectory lifetime and this decay rate is such (1 nt / 600 s) that it would take the average yeast mRNA (~ 1400 nt; Smith et al., JCB, 2015) 10 days to be turned over. This is not consistent with physiological decay rates and as a consequence, they are essentially not accounting for mRNA turnover. This should be explained in the Methods.

      The reviewer has highlighted a lack of clarity in our model description. The mRNA decay rate in the model is (1/600) inverse seconds per entire mRNA molecule, rather than (1/600) inverse seconds per nucleotide. This leads the typical mRNA lifetime to be 600 seconds. The sentence in the Methods section describing the decay timescale now reads "The mRNA decay rate is set to kdecay = 0.0017 s-1 per mRNA molecule, such that the typical decay time for an mRNA molecule is 600 s. This decay time is consistent with measured average yeast mRNA decay times ranging from 4.8 minutes (Chan et al., 2018) to 22 minutes (Chia and McLaughlin, 1979)."

      (ii) Translation and decay are intrinsically linked and translation machinery also recruits decay enzymes. What is more, decay rates differ greatly for different mRNA transcripts. I cannot judge how feasible this is, but it might benefit the model if variable decay rates (i.e. modelled based on translation efficiency?) could be included.

      We appreciate this suggestion from the reviewer. We have added a supplemental figure (Figure S4) to explore how mRNA decay rate can impact mitochondrial localization of mRNA. While longer decay rates have little impact on localization, if the decay rate is sufficiently high, the mRNA will have limited opportunity for translation to initiate and a binding-competent MTS to develop, substantially reducing localization. This analysis does not consider how the mRNA lifetime might be coupled with translational effects (such as ribosome stalling). Accounting for the impact of such more complex decay mechanisms would require substantial expansion of the model and extensive additional experiments to parameterize the coupling effects; we believe this extension would be beyond the scope of this manuscript.

      To our Discussion, we have added

      "While we have focused on how variation in translational kinetics between genes can impact mitochondrial mRNA localization, there is also significant variation in mRNA decay timescales (Chia and McLaughlin, 1979; Chan et al., 2018). Our model suggests (see Fig. S4) that the mRNA decay timescale has a limited effect on mitochondrial mRNA localization, unless the decay time is sufficiently short to compete with the timescale for a newly-synthesized mRNA to first gain binding competence. We leave specific factors thought to modulate mRNA decay, such as ribosome stalling (Mishima et al., 2022), as a topic of future study."

      (iii) Along the same lines: Rare codons as well as specific stalling sequences, are known to slow down translation elongation on many transcripts (and will effectively increase MTS exposure time). Can the authors identify transcripts with such signal sequences (on a global scale, apart from TIM50) and incorporate in their model?

      We find this suggestion (and the similar one made by the other reviewer) to explore stalling sequences particularly encouraging, because we are in the early stages of actively pursuing such work. We feel that beyond discussion, exploring the consequences of inhomogeneous elongation is beyond the scope of this work because significant further experimental work would be needed to quantify the impact of specific sequences on translation progress.

      To our Discussion, we have added the following paragraph.

      "In this work our quantitative model has applied uniform ribosome elongation rates along mRNA transcripts, which with ribosome interactions can lead to both uniform and non-uniform ribosome densities and effective elongation rates along the transcript (MacDonald et al., 1968; Duc et al., 2018). With these uniform ribosome elongation rates, previous theoretical results suggest that collisions will be rare (Duc et al., 2018). However, elongation may not be homogeneous along an mRNA transcript, due to factors such as tRNA availability (Varenne et al., 1984), boundaries between protein regions (Thanaraj and Argos, 1996), amino acid charge (Charneski and Hurst, 2013), and short peptide sequences related to ribosome stalling (Sabi and Tuller, 2017). We have found that slow (homogeneous) elongation facilitates mitochondrial mRNA localization, by providing time for MTS maturation, diffusive search, and maintains a binding-competent MTS-mediated mRNA binding to mitochondria. We expect that inhomogeneities in elongation rate along mRNA could either enhance or reduce mitochondrial mRNA localization, controlled by whether slower elongation is in regions that favor longer MTS exposure. For example, a ribosome stall site after the MTS is fully translated could provide more time for MTS maturation and facilitate mitochondrial localization. Future experimental work could identify such stalling sequences and point towards how modeling can improve understanding of sequence impact on localization."

      • Reduced mature MTS exposure time is presented as one of the determining factors that regulate mitochondrial localization of conditionally localized transcripts. For my background, the underlying mechanisms that determine MTS maturation are insufficiently explained. I understand how chaperone recruitment can contribute to MTS maturation. However, it is not obvious to me how receptor binding would account for such long maturation times as the 40 s used here (Fig. 3, 4). I would appreciate if the authors could elaborate and possibly point to directions that their model could be used to study those.

      We agree with the reviewer that the diffusive search time for a chaperone to find a newly-synthesized MTS would be very short (a small fraction of the proposed 40-second MTS maturation time), and we expect that this maturation period is largely controlled by chaperone and co-chaperone interaction timescales. There is a wide range of timescales for newly-synthesized (or misfolded) proteins to productively interact with a chaperone, and the literature provides examples of timescales comparable to 40 seconds, which we now cite.

      To our Discussion, we have added

      "While the diffusive search for a newly-synthesized MTS by chaperones is expected be very fast ( 100 seconds for human chaperone-mediated folding (Wu et al., 2020)."

      We feel that modeling chaperone facilitation of MTS folding, to determine the timescale of this process, is very distinct from the topics covered in our manuscript, and thus beyond the scope of this work.

      • One of the two main conclusions (at least according to the abstract) from the work is that yeast cells modulate mitochondrial volume fractions to regulate mRNA localization to mitochondria. This is a fact, not a novel finding. The other main conclusion, which is that cells use different translation dynamics to control mRNA localization, is intriguing and deserves more attention. It would be great if the authors could suggest/discuss an experimental approach (i.e. a single mRNA imaging experiment quantifying mitochondrial co-localization and translation kinetics of different reporter constructs) to test this hypothesis.

      We appreciate the reviewer raising the point that yeast cells modulate mitochondrial volume fraction to regulate mitochondrial mRNA localization. While we previously showed this relationship between mitochondrial volume fraction and localization, we used experimental techniques (mutations, nutrient sources) that changed many other factors beyond mitochondrial volume fraction. In this work we have used a quantitative model, lacking those extraneous factors, to demonstrate that a change to mitochondrial volume fraction alone can lead to a change in mitochondrial mRNA localization. This work supports our interpretation of those previous experimental results.

      To our Discussion we have added the sentence

      "Previous experimental work suggested that changing mitochondrial volume fraction could control mitochondrial mRNA localization (Tsuboi et al., 2020) --- our quantitative modeling work provides further support for this mechanism of regulating mRNA localization."

      The reviewer also requests a discussion of an experimental approach to test how cells use translational dynamics to control mRNA localization. With the advent of combined mRNA imaging and live translational imaging it would be interesting to directly measure translation in live cells to correlate localization with a time delay. Unfortunately there are currently no published live translational imaging studies in yeast, and thus such a measurement would require the development of the technique in yeast.

      To our Discussion, we have added

      "Experimentally testing our proposal for translation-controlled localization would involve using combined mRNA and live translational imaging (as yet undeveloped in yeast), to directly measure translation and correlate localization with a time delay, presenting a fruitful pathway for future study."

      Minor comments:

      • Figure 1: X axis labels between panel E and F are not consistent. Inset in panel F is mainly and first discussed in text. Please do not show data as tiny inset but as separate panel.

      We have changed the axis label of Figure 1E to match the axis label of Figure 1G (previously Figure 1F). The inset of the old Figure 1F is now the new Figure 1F, and the old Figure 1F is now the new Figure 1G. We have adjusted the Figure 1 caption and the text description of Figure 1 to match these changes.

      Elongation rates of 250 aa per second are not physiological. In mammalian cells elongation has been quantified to proceed between 1 and app. 20 aa per second (Wang et al, 2016; Wu et al., 2016; Yan et al., 2016; Morisaki et al., 2016).

      The reviewer is correct that the elongation rates of 50/s and 250/s too large to be physiological. These large values have been deliberately selected to probe the nonequilibrium behavior of the quantitative model to test the prediction of the simpler four-state model, rather than represent physiological behavior.

      To the text in the Results section discussing Figure 1F, we have added the following sentence.

      "We include unphysiologically high elongation rates to compare to the expected behavior from the 4-state model."

      Panel E: elongation rate range does not match Fig 1F nor median in Fig 3A.

      The reviewer is correct that the elongation rate parameter range of Figure 1E does not match the elongation rates of Figure 1F or the median in Figure 3A. In Figure 1E, we aimed to show that the physiological range of translation parameters can produce a wide range of both MTSs per mRNA and mRNA binding competence for mitochondria.

      We have expanded the description of Figure 1E in the text.

      "By exploring the physiological range of translation parameters, many orders of magnitude of the mean number of translated MTSs per mRNA (β, see Eq. 5) are covered, which also covers the full range of mRNA binding competence (Fig.1E). We find that, for any set of physiological translation parameters, the number of binding-competent MTS sequences (β) is predictive of the fraction of time (fs) that each mRNA spends in the binding competent state (Fig.1E)."

      • Figure 2A and S1: Please explain how ribosome occupancy is defined here and why it is so different between figures

      We have inserted a citation for Zid 2014, to distinguish that the ribosome occupancy measurements in Figure 2A (Zid and O’Shea) and Figure S1 (Arava et al) come from two different techniques. Zid and O’Shea used ribosome profiling to obtain a relative, rather than absolute measurement. While Arava used a technique where they fractioned mRNAs based on the absolute number of ribosomes loaded across 14 fractions of a sucrose gradient, and measured the relative amount of mRNA in each fraction by microarray. So while ribosome occupancy in each paper was calculated in a very distinct manner, the comparison between conditional and constitutively localized mRNAs shows a very similar trend without significant differences in ribosome occupancy between these two classes of mRNAs with either measurement of ribosome occupancy.

      To the caption of Figure S1, we have added

      "These ribosome occupancy values cover a distinct range, in comparison to those of Fig. 2A, due to distinct experimental measurement techniques."

      • Figure 2C: please show experimental data along with model prediction (in the same graph) so that conclusion becomes immediately apparent from figure not just main text. Label clearly (in figure) when experimental and when model data is shown (maybe by using consistent color scheme?)

      We have added experimental data to Figure 2C. Throughout the manuscript, we have kept a consistent color scheme for data for mitochondrial localization for ATP3, TIM50, conditional, and constitutive mRNA, whether from model or experimental data. We have applied distinct line types (e.g. solid for model vs. dot-dashed with circles for experimental).

      • Figure 4B and C: clearly indicate in figure which are experimental and which are modelled data

      In Figures 4B and 4C, we have clarified which data is experimental and which is modeled by adding to the labels for each violin plot. Violin plot labels for model data now read "Model Conditional" or "Model Constitutive" and labels for experimental data now read "Expt Conditional" or "Expt Constitutive".

      • Figure 4D: show experimental vs. model data in same graph (at same axis scaling) for comparability

      We have added the experimental data, previously in the inset of Figure 4D, to the main part of Figure 4D.

      • Line 305: "constitutive" mRNA

      Thank you to the reviewer for pointing out this redundancy, the sentence now reads

      "Figure 3C shows how the localization for the prototypical conditional and constitutive mRNA varies with the maturation time."

      • Line 334: "other changes, such as diffusivity, are unable to separate the two gene groups" - what other changes? The authors only show diffusivity (Fig S3).

      Thank you to the reviewer for pointing this out. We have revised this sentence to only refer to diffusivity changes.

      "While introduction of this maturation time distinguishes the mitochondrial localization of conditional and constitutive gene groups (Fig. 4A vs Fig. 2B), changes to diffusivity are unable to separate the two gene groups (Fig. S3)."

      • Line 403-405: maybe useful to argue against lower ribosome occupancies as drivers of nascent chain complex mobilities: Wang at el, Cell, 2016; single translation site imaging experiments indicating that ribosome occupancy is not the main determinant of mRNP mobility.

      We thank the reviewer for the direction to this paper, which indeed indicates that ribosome occupancy has limited impact on mRNA diffusivity.

      We now cite this paper in our Methods section.

      "Simulated mRNA have a diffusivity of 0.1𝜇m2/s. This diffusivity remains constant across genes and mRNA states, consistent with experimental measurements showing little dependence of mRNA diffusivity on mRNA length (Calderwood et al., 2016) or number of translating ribosomes (Wang et al., 2016)."

      • Line 601-607: include experimental references to explain how measures (25 nm vs 250 nm) were determined/selected.

      The reviewer raises a valuable point, as it is important to motivate these lengthscales used in the model.

      Microscopy with visible light has a lateral resolution limit of approximately 250 nm, often known as the Abbe limit. Accordingly, we assume that mRNA within 250 nm of mitochondria will be measured as adjacent to mitochondria. To the Methods section, we now include a short explanation and a citation.

      Unlike the 250-nm diffraction limit, there is no widely-used reaction range for mRNA binding to intracellular substrates, nor a measurement of the required proximity for an MTS-bearing mRNA to bind to mitochondria. We estimate the 25-nm distance for mRNA binding to mitochondria from the following contributions:

      • The yeast ribosome is 25 - 28 nm in diameter, or 13 - 14 nm in radius.
      • Yeast MTSs have a length of up to 70 amino acids, with 20 estimated yeast MTS lengths having a mean of 31 amino acids. The MTS forms an amphipathic helix (an alpha helix), which has a pitch of 0.54 nm and 3.6 amino acids per turn, so the 31 amino acids will be approximately 5 nm long
      • The MTS will be attached to the ribosome/mRNA by other peptide regions, expected to typically be a few nanometers in length So overall we estimate a 25 nm range for an MTS-bearing mRNA to bind to mitochondria.

      To our methods, we have added this reasoning and accompanying citations.

      "We estimate the 25-nm binding distance by combining several contributions. The yeast ribosome has a radius of 13 - 14 nm (Verschoor et al, 1998). The MTS region, up to 70 amino acids long, forms an amphipathic helix (Bacman et al., 2020) a form of alpha helix. With an alpha helical pitch of 0.54 nm and 3.6 amino acids per turn, a 31 amino acid MTS (the mean of 20 yeast MTS lengths (Dong et al., 2021)) is approximately 5 nm in length. An additional few nanometers of other peptide regions bridging the MTS to the ribosome provides an estimate of 25 nm for the range of an MTS-bearing mRNA to bind mitochondria. The 250-nm imaging distance is based on the Abbe limit to resolution with visible light (Georgiades et al., 2016)."

      Reviewer #2 (Significance (Required)):

      My field of expertise is the development of single mRNA imaging methods to quantify translation/decay dynamics in living mammalians systems. Thus, I cannot judge the significance of this work with respect to the modelling that is presented here.

      However, I do appreciate that one of the main conclusions of this work, which is that cells might use different translation dynamics to control mRNA localization, is truly exciting and could be applied to other types of transcripts (this is exactly what SRP does for ER-targeted mRNAs) as well. Because mechanisms that regulate translation in a transcript-specific manner and in different subcellular localizations have only been described for a handful of cases, I think that this observation is worth following up on and should be appreciated by a broad scientific audience.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Arceo et al. have developed a stochastic, quantitative model of mitochondrial targeting sequence (MTS)-mediated mRNA localization to mitochondria in yeast. They use this model to investigate the role of translation- and diffusion kinetics in controlling mitochondrial mRNA localization of conditional as well as constitutional genes.

      Most importantly, they find that neither mRNA diffusivity nor ribosome density alone are sufficient to account for the differences in localization that were experimentally observed for the two types of genes. Therefore, they implement an MTS maturation time into their model and find that they can now predict gene specific localization rates. Based on these observations, the authors conclude that yeast cells can regulate the localization of mRNAs to mitochondria through (controlling mitochondrial volume fractions and) differences in translation kinetics, which adjust the exposure time and numbers of mature MTSs that are presented on the mRNP and convey binding-competence.

      Major comments:

      Overall, the manuscript is well written and the conclusions are convincing. The underlying assumptions of the model make sense, but I have no background in modelling and can therefore only comment on the RNA biology aspects and general comprehensibility of the work.

      • The authors calculate gene-specific translation initiation and elongation rates to model localization on different transcript classes. In this context,
        • (i) They use a single decay rate to estimate trajectory lifetime and this decay rate is such (1 nt / 600 s) that it would take the average yeast mRNA (~ 1400 nt; Smith et al., JCB, 2015) 10 days to be turned over. This is not consistent with physiological decay rates and as a consequence, they are essentially not accounting for mRNA turnover. This should be explained in the Methods.
        • (ii) Translation and decay are intrinsically linked and translation machinery also recruits decay enzymes. What is more, decay rates differ greatly for different mRNA transcripts. I cannot judge how feasible this is, but it might benefit the model if variable decay rates (i.e. modelled based on translation efficiency?) could be included.
        • (iii) Along the same lines: Rare codons as well as specific stalling sequences, are known to slow down translation elongation on many transcripts (and will effectively increase MTS exposure time). Can the authors identify transcripts with such signal sequences (on a global scale, apart from TIM50) and incorporate in their model?
      • Reduced mature MTS exposure time is presented as one of the determining factors that regulate mitochondrial localization of conditionally localized transcripts. For my background, the underlying mechanisms that determine MTS maturation are insufficiently explained. I understand how chaperone recruitment can contribute to MTS maturation. However, it is not obvious to me how receptor binding would account for such long maturation times as the 40 s used here (Fig. 3, 4). I would appreciate if the authors could elaborate and possibly point to directions that their model could be used to study those.
      • One of the two main conclusions (at least according to the abstract) from the work is that yeast cells modulate mitochondrial volume fractions to regulate mRNA localization to mitochondria. This is a fact, not a novel finding. The other main conclusion, which is that cells use different translation dynamics to control mRNA localization, is intriguing and deserves more attention. It would be great if the authors could suggest/discuss an experimental approach (i.e. a single mRNA imaging experiment quantifying mitochondrial co-localization and translation kinetics of different reporter constructs) to test this hypothesis.

      Minor comments:

      • Figure 1: X axis labels between panel E and F are not consistent. Inset in panel F is mainly and first discussed in text. Please do not show data as tiny inset but as separate panel. Elongation rates of 250 aa per second are not physiological. In mammalian cells elongation has been quantified to proceed between 1 and app. 20 aa per second (Wang et al, 2016; Wu et al., 2016; Yan et al., 2016; Morisaki et al., 2016). Panel E: elongation rate range does not match Fig 1F nor median in Fig 3A.
      • Figure 2A and S1: Please explain how ribosome occupancy is defined here and why it is so different between figures
      • Figure 2C: please show experimental data along with model prediction (in the same graph) so that conclusion becomes immediately apparent from figure not just main text. Label clearly (in figure) when experimental and when model data is shown (maybe by using consistent color scheme?)
      • Figure 4B and C: clearly indicate in figure which are experimental and which are modelled data
      • Figure 4D: show experimental vs. model data in same graph (at same axis scaling) for comparability
      • Line 305: "constitutive" mRNA
      • Line 334: "other changes, such as diffusivity, are unable to separate the two gene groups" - what other changes? The authors only show diffusivity (Fig S3).
      • Line 403-405: maybe useful to argue against lower ribosome occupancies as drivers of nascent chain complex mobilities: Wang at el, Cell, 2016; single translation site imaging experiments indicating that ribosome occupancy is not the main determinant of mRNP mobility.
      • Line 601-607: include experimental references to explain how measures (25 nm vs 250 nm) were determined/selected.

      Significance

      My field of expertise is the development of single mRNA imaging methods to quantify translation/decay dynamics in living mammalians systems. Thus, I cannot judge the significance of this work with respect to the modelling that is presented here. However, I do appreciate that one of the main conclusions of this work, which is that cells might use different translation dynamics to control mRNA localization, is truly exciting and could be applied to other types of transcripts (this is exactly what SRP does for ER-targeted mRNAs) as well. Because mechanisms that regulate translation in a transcript-specific manner and in different subcellular localizations have only been described for a handful of cases, I think that this observation is worth following up on and should be appreciated by a broad scientific audience.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Unlike other cell organelles, mitochondria contain a small fraction of their genetic information. However, most of the genetic information about mitochondrial proteins is still in the cell's nucleus and the localization of the respective proteins to mitochondria is facilitated by localized translation of their mRNAs. In turn, the mRNA localization to the mitochondria is partly due to the co-translational association, via the mitochondrial target sequence (MTS) of the nascent peptide.

      The manuscript "Mitochondrial mRNA localization is governed by translation kinetics and spatial transport" investigates the mechanisms of mRNA transport and attachment to mitochondria. Concerning mitochondria-localized mRNAs, two types of mRNAs have been distinguished before: mRNAs that are always attached to the mitochondrium (called "constitutively binding" by the authors) and mRNAs that become "sticky" only under certain conditions (called "conditionally binding" by the authors). Modeling the corresponding cellular processes biophysically, the authors infer that yeast cells exercise control over the localization of mRNA (and consequently over their metabolism) in two ways: via varying the mitochondrial volume fraction, and via varying the speed of translation elongation. Data from previously published genome-wide measurements of mRNAs that localize constitutively and conditionally via their MTS in budding yeast S. cerevisiae were used to investigate these mechanisms.

      The manuscript is very well written and the analysis is of high quality. It starts with an introduction that thoroughly reviews many facets around the conducted research and briefly, but self-consistently, summarizes the current knowledge regarding mitochondrial localization of mRNAs. Next, the consequences of the modeling work (presented in the "methods"-section) are explored in the "Results"-section, which contains meaningful and instructive figures and explanations. The manuscript concludes with a comprehensive evaluation of the consequences of the conducted research. All in all, there are only very few minor changes that could be considered.

      Content-wise, we suggest:

      The modeling of translation kinetics is pretty coarse-grained, using only an average elongation rate per amino acid. Much work in this field was done using totally antisymmetric exclusion principle (TASEP)-based models (e.g. MacDonald, J.H. Gibbs, A.C. Pipkin: Kinetics of biopolymerization on nucleic acid templates; Duc, Saleem, Song: Theoretical analysis of the distribution of isolated particles in totally asymmetric exclusion processes: Application to mRNA translation rate estimation). Perhaps this work can be mentioned, and furthermore, the consequences of inhomogeneity of elongation rate for different codons and amino acids could be explored or at least discussed. In particular, this could shed light into the question if ribosome interference and tRNA charging times have any impact on mitochondrial mRNA localization.

      Ribosome occupancy data from Arava used to infer translation parameters. But there are more recent data sets based on ribosome profiling. Any reason for not using the more recent data?

      The effect of the mitochondrial volume fraction on mRNA localization is investigated with a diffusive model. However, the authors make a two dimensional Ansatz for the cell and mitochondrion while it would seem more natural to assume diffusion in three spatial dimensions, as the cell and mitochondria are both three dimensional objects and diffusion strongly depends on the number of dimensions it occurs in. Why was that Ansatz made and why is it justified?

      The range of variability in the localized fraction +/- CHX is smaller in the experiment compared to the model (Fig. 4B, C). What could be the rationale?

      In l. 417, the authors remark that "constitutively localized mRNAs are on average longer [...] than conditionally localized mRNAs." Yet constitutively localized mRNAs seem to have higher localized fraction than conditionally localized mRNAs. This is somewhat surprising. While it's clear that a higher diffusivity would be compatible with a faster response time of shorter, conditionally-localized mRNAs, it is not clear how the longer, less diffusive mRNAs would have a higher localization fraction. Perhaps the authors can clarify this point.

      Minor formal changes would be:

      Setting the expressions of the fraction in the binding-competent state in l. 118 and the faction of the mRNA-accessible volume in l. 123 in normal math-environments instead of the inline-environment since they are of key importance to the following discussion.

      l. 414 contains the verb "vary" twice

      l. 438 lacks an "h" in the word mitochondria

      Significance

      All in all, this is a strong manuscript that contains solid, simple but meaningful and by no means oversimplified models with impactful consequences on the understanding of mitochondrial mRNA localization. Furthermore, it is likely that the approach applies to other cellular compartments like the ER. The research is explained in a remarkably clear and focussed style which makes it easy to follow and meanwhile succeeds in not omitting any details.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      REVIEWER #1

      Summary:

      The manuscript titled "the transcription factor RUNX2 promotes the development of human tissue-resident NK cells" addressed a new role of RUNX2 on human NK cell development and phenotypes. It also provides a new insight into how RUNX2 affects human NK cells switch between the circulatory and tissue-resident NK cells

      Major comments:

      The authors described that IL-2Rb and other NK cell receptors are not affected in the RUNX2 shRNA system. However, the evidence is not enough to conclude that RUNX2 controls human NK cell development by direct induction of IL-2Rb expression, for the knockdown system did affect the Granzyme B and perforin expression without affecting EOMES and T-bet. More direct evidence should be provided to address the knockdown or knockout system that has a direct induction of IL-2Rb on NK cell development.

      You are indeed correct that our conclusion regarding the direct RUNX2-mediated regulation of IL-2Rβ was overstated and that additional experiments needed to be performed in order to validate our strong claims. We tried to demonstrate the direct induction of IL-2Rβ by RUNX2 using the IL-2Rβ reporter construct of Genecopoeia (HPRM30810-LvPM03 with IL-2Rβ promoter sequence, 1585 bp in length, 269bp downstream of TSS). We introduced both this IL-2Rβ reporter vector and the RUNX2-I or empty control LZRS vector in the RUNX2low IL-2Rβlow ALL-SIL cell line to examine whether RUNX2 is able to activate the promoter of IL-2Rβ. Unfortunately, we were unable to generate conclusive results. Since we are unable to definitively demonstrate the direct induction of IL-2Rβ by RUNX2, we have nuanced our statements in the manuscript. Our data still show that RUNX2 affects IL-2Rβ expression in human NK cell development in vitro. We adjusted the text in the manuscript as follows:

      • Abstract (page 2 line 9-10)
      • Results (page 6 line 110/132-133)
      • Discussion (page 10 line 219/page 11 line 273-275).

        In Fig 1e, n is 4-12 for the experiments. Could the authors provide the reason for that? This may increase the bias to use sample number 4 vs. 12. For example, shRNA system d14 of NK cell result may not reflect the truth by 4 vs. 12

      The data shown in Figure 1e are the result of multiple experiments using NK cell differentiation cultures, which were performed with transduced umbilical cord blood-derived hematopoietic stem cells (HSC). Whereas the number of samples was indeed different at the different analysis timepoints, the data of the paired control and RUNX2 shRNA/RUNX2-I graphs contain equal sample numbers. So, e.g. both control and RUNX2 shRNA HSC: n=4, while both control and RUNX2 shRNA NK d21: n=12. To illustrate this, I show all individual donor data in the figure as information for the reviewer (see separate uploaded file).

      Minor comments:

      Fig 1a shows the protein expression level of RUNX 2. Fig 1d shows the qPCR level of RUNX2-1. Could more explanation be provided that ST1 and ST2 have lower mRNA levels but higher protein levels than the d0?

      You are indeed correct that the expression levels of RUNX2 in human NK cell developmental stages differ between protein and mRNA levels. However, mRNA levels do not always mirror protein levels, as has also been frequently found in several other studies. Possible causes for this phenomenon include different translation efficiency, protein stability and/or posttranslational modification in different subpopulations of cells. I have included this explanation in the discussion (page 10 line 239-243).

      Please check the CCR7 flow data in Fig4a circulatory. The histogram did not reflect the bar plot result. It looks like the expression of CCR4 between Ctrl and RUNX2-1 is different.

      Thank you for this remark. I re-examined the data concerning CCR7 and included a more representative biological replicate in the revised figure 4a (see separate figure file).

      Please provide more discussion or explanation that the EOMES and T-bet level are not changed in the knockdown system, but the Granzyme B and perforin expression is changed there.

      You are indeed correct that despite the unaltered expression T-BET and EOMES, two transcriptional regulators of granzyme B and perforin, RUNX2 knockdown results in the upregulation of these cytotoxic effector molecules. However, as shown by the RUNX2-specific ChIP-seq, granzyme B and perforin are direct targets of RUNX2, which indicates that the expression of these cytotoxic effector molecules can be directly regulated by RUNX2, independent of T-BET and EOMES. I have included this insight in the discussion (page 13 line 338-343).

      Fig 5 only showed CD107a+ percentage of NK; how about the CD107a expression level? Based on the current data, CD107a expression did not match with GZMB and PRF. Any explanation for that? PRF and GZMB levels changed in the shRNA system, but K562 cells cannot be killed. Any explanation for the way that happened. How about other target cells or viral infection?

      While the percentage of CD107a is significantly reduced when RUNX2 was silenced (Fig. 5b), the expression level of CD107a (MFI) is not significantly altered, neither by knockdown nor by overexpression of RUNX2(-I). Please see the graphs in uploaded file with reviewer comments as information for the reviewer.

      Cytoplasmic effector molecule expression in ‘resting’ NK cells and degranulation (cell membrane CD107a expression) upon target recognition are two different processes that are not necessarily similarly regulated. It is possible that NK cells increase expression of cytotoxic effector molecules, while degranulation is reduced. It is known that RUNX2 can act as a transcriptional activator as well as a repressor, depending on the location of the consensus sequence or the type of binding partners in the transcriptional complex. Our hypothesis is that RUNX2 activates expression of cytotoxic effector molecules but reduces degranulation.

      One possible explanation for the unaltered cytotoxic potential of RUNX2-silenced NK cells is that, while granzyme B and perforin expression are increased, degranulation is reduced, and these opposite effects might counterbalance each other. Although NK cells with RUNX2 knockdown did not exhibit changes in cytotoxicity towards K562 cells, it does not mean that the same is true for other target cells. However, we did not perform experiments with other target cells or viral infection.

      Check the period in line 31. The color is different.

      This is adjusted now (see track changes).

      Check line 108; it should be "RUNX2 controls human NK cell development."

      As indicated in our response to your first major comment, we were not able to demonstrate direct induction of IL-2Rβ by RUNX2, but our data show that RUNX2 affects IL-2Rβ expression in human NK cell development in vitro. We therefore suggest the following title of the paragraph: “RUNX2 controls human NK development possibly by regulating IL-2Rβ expression”

      Significance:

      Defining the role of Runx2 in NK cell development and functions.

      Audience:

      Researchers working on transcription factors and NK cell biologists.

      REVIEWER #2

      Summary:

      This is an interesting study that adds new information regarding a role for RUNX2 in NK cell development. Wahlen et al. present very interesting findings highlighting the role for RUNX2 in the acquisition of a tissue-resident phenotype in differentiating NK cells. The authors demonstrate that RUNX2-I isoform is predominantly expressed in specific human NK cells subsets and that RUNX2 upregulates IL-2Rβ expression in NK cell-committed progenitors. Interesting results integrating CHIP-seq and RNAseq data and basic functional studies show that RUNX2 regulates several genes associated with NK cell tissue homing and recirculation. The authors postulate that RUNX2 regulates the acquisition of a tolerogenic tissue-resident phenotype in human NK cells. There are a number of intriguing observations that should be of interest in the field.

      Major comments:

      While the results obtained are interesting and scientifically sound, the manuscript does not rigorously prove that RUNX2 is involved in NK cell differentiation and development. The results were obtained using human cells ex vivo and in vitro human HSC-based cultures for NK cell differentiation and development. The authors would need a relevant model in vivo to fully characterize phenotypic and functional features of NK cells in the absence or presence of RUNX2. Such studies would be essential, in particular for the acquisition of a tissue-specific resident phenotype in human NK cells in distinct microenvironments. Furthermore, the authors should also modify extensively the title and several statements across the manuscript regarding the role for RUNX2 in NK cell differentiation and development.

      Thank you for your very positive comments. We fully agree that an in vivo model is required to study the role of RUNX2 in differentiation of human NK cells and in their acquisition of a tissue-specific resident phenotype in distinct organs. We therefore now performed an in vivo experiment in which we humanised lethally irradiated NSG-huIL-15 mice by intravenous injection of bulk CD34+ CB-derived HSC, which were transduced with either control or RUNX2 shRNA lentivirus. After 6-7 weeks, we analysed the presence of human eGFP+ NK cells (CD45+CD56+CD94+) as well as the frequency of tissue-resident (CD69+CD49e-) and circulating (CD69-CD49e+) human NK cells in the lungs, liver, spleen, bone marrow and lamina propria of the intestine (Figure 6A; Supplementary Figure 5). We found that the absolute number of human NK cells was drastically reduced in all organs of mice injected with RUNX2-silenced HPC compared to those injected with control HPC (Figure 6B). In addition, RUNX2 silencing significantly reduced the frequency of trNK cells in the bone marrow and lamina propria fraction, while it increased the percentage of circNK cells (Figure 6C). These data show that 1) also in vivo RUNX2 is an important transcription factor for human NK cell development and that 2) RUNX2 is involved in human NK cell tissue residency, at least in the bone marrow and in lamina propria of the intestine. With regard to the other examined organs, the frequency of tr- and circNK cells was unaffected by RUNX2 knockdown, except for the spleen where circNK cells were decreased. This suggests that either RUNX2-mediated regulation of NK cell tissue residency is tissue-specific (bone marrow and lamina propria) or that, at least for some organs, this mouse model is not representative for human biology. We have incorporated these new findings in the manuscript as follows:

      • Results section (page 8 line 199-212)
      • Figure 6 legend (page 30 line 882-891)
      • Supplementary Figure 5 (Supplementary materials),
      • Discussion (page 13-14 line 354-371).
      • Ma____terials and methods (page 21-22 line 570-584). With the new insights that we gained after the additional experiments, we changed the title of the manuscript to ‘The transcription factor RUNX2 drives the generation of human NK cells and promotes tissue residency’. As the reviewer suggested, we also re-evaluated several statements across the manuscript regarding the role of RUNX2 in NK cell differentiation via regulation of IL-2Rβ expression and in promoting tissue residency.

      Additional evidence for the direct regulation of IL-2Rβ expression by RUNX2 would be helpful

      We fully agree with your comment, which is why we tried to confirm the direct influence of RUNX2 on IL-2Rβ expression using an IL-2Rβ reporter assay. In this assay, we introduced both the IL-2Rβ reporter vector (Genecopoeia vector HPRM30810-LvPM03 with IL-2Rβ promoter sequence, 1585 bp in length, 269bp downstream of TSS), and the LZRS vector with RUNX2-I in a RUNX2low IL-2Rβlow ALL-SIL cell line. Instead of the RUNX2-I vector, control ALL-SIL cells received the empty LZRS vector together with the IL-2Rβ reporter construct. However, due to practical issues, we were unable to generate any conclusive results. Since we are unable to definitively demonstrate the direct induction of IL-2Rβ by RUNX2, we have nuanced our statements in the manuscript. Our data still show that RUNX2 affects IL-2Rβ expression in human NK cell development in vitro. We adjusted the text in the manuscript as follows:

      • Abstract (page 2 line 9-10)
      • Results (page 6 line 110/132-133)
      • Discussion (page 10 line 219/ page 11 line 273-275)

        The studies showing that RUNX2 negatively regulates granzyme B, perforin expression and IFN-γ and TNF-α secretion are intriguing and could be better explored.

      Although the role of Runx3 in the transcriptional regulation of granzyme B, perforin, IFN-γ and TNF-α in murine CD8+ T and Th1 cells has been demonstrated, a similar role for Runx2 has not been described yet. For example, a study performed by Olesin et al. (PMID 30264035) showed that although Runx2 plays a role in the generation of murine CD8+ memory T cells, there was no impact on the expression of effector molecules and recall response. Furthermore, the regulation of these NK cell effector molecules by RUNX proteins in human NK cells remains unidentified. I agree that the underlying molecular mechanism of RUNX2-mediated regulation of effector molecule expression is certainly an interesting topic that should be thoroughly investigated. Since there are many mechanisms involved in regulation of the expression and secretion of effector molecules, it is almost a topic on its own and therefore part of a follow-up study.

      Minor comments:

      "Taken together, we deduce from the data that RUNX2 promotes NK cell development by inducing IL-2Rβ expression and thereby enabling NK lineage commitment" - this is an overstatement

      We fully agree with your comment. We have therefore adjusted our statement by concluding that RUNX2 promotes NK cell development in part by regulating IL-2Rβ expression and thereby promoting NK lineage commitment. You can find this adjustment in the discussion on page 11 line 273-274).

      "It is well-known that RUNX2 by itself is a relatively weak transcription factor ...." - this statement should be modified.

      Our statement that RUNX2 is a weak transcription factor may indeed cause misconceptions. As RUNX2 by itself has a low affinity for DNA, it needs to form a complex with other co-factors such as CBFβ to increase the stability and affinity of the interaction with DNA. You can find the adjusted statement in the discussion (page 11 line 277-278).

      Significance:

      To date, the role of RUNX2 in NK cell development has not been investigated in mice nor in humans. The findings will contribute to a better understanding of NK cell biology and may help in in improving existing NK cell-based therapies in the future. However, lack of relevant in vivo studies diminishes the importance of this work. Further studies are warranted.

      Audience:

      Immunologists

      Expertise in leukemia pathobiology and immunotherapies.

      REVIEWER #3

      Summary:

      In this study, Wahlen et al. interrogated the role of RUNX2 in regulating human NK cell development through knockdown and overexpression studies. In agreement with previous work, the authors observed high RUNX2 expression in NK cell progenitors and a decline in expression levels in mature subsets. RUNX2 knockdown and overexpression was performed through viral transduction of cord blood-derived HPCs that were subsequently differentiated into NK cells in vitro. The authors found that RUNX2 knockdown led to a reduction in the numbers of mature NK cells, while overexpression had the opposite effect. They also provide data suggesting that RUNX2 may directly promote expression of the beta chain of the IL-2 receptor during NK cell development. The authors also performed RNA-seq on sorted RUNX2 knockdown and overexpressing cells and compared this data to RNA-seq datasets that were generated using tissue-resident NK cells from liver and bone marrow. They identified Gene Set Enrichment signatures that were similar between tissue-resident NK cells and NK cells overexpressing RUNX2. Changes in the expression of several genes associated with circulation and residency were confirmed by flow cytometry. Finally, the authors performed function assays and found that manipulation of RUNX2 did not affect cytotoxicity, but overexpression reduced inflammatory cytokine production.

      Major comments:

      The title of the paper (The transcription factor RUNX2 promotes the development of human tissue-resident NK cells) presents too strong of a conclusion that is not sufficiently supported by the data. While the NK cells differentiated in vitro with RUNX2 overexpression do appear to share a transcriptional signature with tissue-resident subsets and express receptors associated with tissue residency, the authors did not perform any adoptive transfer experiments showing that RUNX2 overexpressing NK cells are actually tissue resident. Such experiments would be necessary to support the conclusion stated in the title.

      Indeed, since the original manuscript contained only data obtained from in vitro differentiation cultures, the concept of NK cell tissue residency required validation in an in vivo system. We therefore performed an in vivo experiment, in which we humanised lethally irradiated NSG-huIL-15 mice by intravenous injection of bulk CD34+ CB-derived HSC, which were transduced with either control or RUNX2 shRNA lentivirus. After 6-7 weeks, we analysed the presence of human eGFP+ NK cells (CD45+CD56+CD94+) as well as the frequency of tissue-resident (CD69+CD49e-) and circulating (CD69-CD49e+) human NK cells in the lungs, liver, spleen, bone marrow and lamina propria of the intestine (Figure 6A; Supplemental Figure 5). We found that the absolute number of human NK cells was drastically reduced in all organs of mice injected with RUNX2-silenced HPC compared to those injected with control HPC (Figure 6B). In addition, RUNX2 silencing significantly reduced the frequency of trNK cells in the bone marrow and lamina propria fraction, while it increased the percentage of circNK cells (Figure 6C). These data show that 1) also in vivo RUNX2 is an important transcription factor for human NK cell development and that 2) RUNX2 is involved in human NK cell tissue residency, at least in the bone marrow and in lamina propria of the intestine. With regard to the other examined organs, the frequency of tr- and circNK cells was unaffected by RUNX2 knockdown, except for the spleen where circNK cells were decreased. This suggests that either RUNX2-mediated regulation of NK cell tissue residency is tissue-specific (bone marrow and lamina propria) or that, at least for some organs, this mouse model is not representative for human biology. We have incorporated these new findings in the manuscript as follows:

      • Results section (page 8 line 199-212)
      • Figure 6 legend (page 30 line 882-891)
      • Supplementary Figure 5 (Supplementary materials),
      • Discussion (page 13-14 line 354-371).
      • Ma____terials and methods (page 21-22 line 570-584). With the new insights that we gained after the additional experiments, we changed the title of the manuscript to ‘The transcription factor RUNX2 drives the generation of human NK cells and promotes tissue residency’. As the reviewer suggested, we also re-evaluated several statements across the manuscript regarding the role of RUNX2 in NK cell differentiation via regulation of IL-2Rβ expression and in promoting tissue residency.

      The authors used RNA-seq data from tissue-resident NK cells in comparisons with their RUNX2 knockdown and overexpressing NK cells. Do they see elevated RUNX2 transcript levels in tissue-resident NK cells? I don't know if matched circulating NK cell data is available, but such a finding would further strengthen the connection between the tissue residency profile and RUNX2.

      Thank you for your very valid comment. We re-investigated the public RNA-seq data of tissue-resident and circulatory NK cells of liver (Cuff et al.) and bone marrow (Melsen et al.). These data sets show that in the liver and bone marrow, RUNX2 transcript levels are indeed elevated in tissue-resident NK cells compared to circulatory NK cells. The fold changes of RUNX2 in liver and bone marrow tissue-resident versus circulatory NK cells are 25 and 13, respectively. This supports our hypothesis and we have included this in the results on page 7 line 159-163

      The evidence that RUNX2 controls human NK cell development by direct induction of IL-2Rbeta expression is fairly weak. In Figure 2a, it appears as though RUNX2 knockdown didn't significantly affect IL-2Rbeta expression, and RUNX2 overexpression only affected IL-2Rbeta expression at day 7 (not day 14). Furthermore, in Figure 2b, RUNX2 knockdown did not impact IL-2Rbeta expression in YTS cells. The authors speculate that residual RUNX2 may be sufficient to drive IL-2Rbeta expression. This could be tested by knocking out RUNX2 with a CRISPR-Cas9 system. The authors also provide some ChIP-seq data showing that RUNX2 binds to the promoter region of IL2RB in human PBNK cells but do not provide any ChIP data showing enhanced enrichment of RUNX2 within IL2RB in their RUNX2 overexpressing cells.

      Indeed, RUNX2 knockdown did not result in significant changes of IL-2Rβ expression in NK cell progenitors or in the YTS cell line, which we attributed to residual RUNX2 expression. As suggested by the reviewer, we attempted to knockout RUNX2 in the YTS cell line using the CRISPR-Cas9 system (Synthego) to investigate the effect on IL-2Rβ expression. However, we were unsuccessful to obtain cells that lacked RUNX2 expression. So at this point we cannot state that RUNX2 is essential for IL-2Rβ expression in human NK cells or their progenitors.

      The reviewer is also correct that the percentage of IL-2Rβ+ stage 3 cells upon RUNX overexpression in HSC was significantly increased on day 7, whereas this was no longer the case on day 14. However, this is not a counterargument for a regulating role of RUNX2 in IL-2Rβ expression as the subpopulation of stage 3 cells that expresses IL-2Rβ+ are NK cell-committed progenitors, that will probably have differentiated into stage 4 or stage 5 NK cells on day 14. This is in agreement with the increased absolute cell numbers of stage 4 and stage 5 NK cells in the RUNX2 overexpression cultures on day 14. We could not perform ChIP-seq on RUNX2 overexpressing stage 3 cells as this technique requires large cell numbers, which are not feasible to generate in vitro.

      Thus, although we do not state that RUNX2 is essential for IL-2Rβ expression, our data from the RUNX2-I overexpression model and RUNX2-specific ChIP-seq do provide evidence for a certain degree of RUNX2-mediated regulation of IL-2Rβ expression during human NK cell development. We therefore downgraded our statements regarding this matter in the manuscript as follows:

      • Abstract (page 2 line 9-10)
      • Results (page 6 line 110/132-133)
      • Discussion (page 10 line 219/page 11 line 264-275). Minor comments:

      No minor comments

      Significance:

      As an NK cell biologist suggest returning for major revision and re-evaluation.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Wahlen et al. interrogated the role of RUNX2 in regulating human NK cell development through knockdown and overexpression studies. In agreement with previous work, the authors observed high RUNX2 expression in NK cell progenitors and a decline in expression levels in mature subsets. RUNX2 knockdown and overexpression was performed through viral transduction of cord blood-derived HPCs that were subsequently differentiated into NK cells in vitro. The authors found that RUNX2 knockdown led to a reduction in the numbers of mature NK cells, while overexpression had the opposite effect. They also provide data suggesting that RUNX2 may directly promote expression of the beta chain of the IL-2 receptor during NK cell development. The authors also performed RNA-seq on sorted RUNX2 knockdown and overexpressing cells and compared this data to RNA-seq datasets that were generated using tissue-resident NK cells from liver and bone marrow. They identified Gene Set Enrichment signatures that were similar between tissue-resident NK cells and NK cells overexpressing RUNX2. Changes in the expression of several genes associated with circulation and residency were confirmed by flow cytometry. Finally, the authors performed function assays and found that manipulation of RUNX2 did not affect cytotoxicity, but overexpression reduced inflammatory cytokine production.

      Major comments:

      1.The title of the paper (The transcription factor RUNX2 promotes the development of human tissue-resident NK cells) presents too strong of a conclusion that is not sufficiently supported by the data. While the NK cells differentiated in vitro with RUNX2 overexpression do appear to share a transcriptional signature with tissue-resident subsets and express receptors associated with tissue residency, the authors did not perform any adoptive transfer experiments showing that RUNX2 overexpressing NK cells are actually tissue resident. Such experiments would be necessary to support the conclusion stated in the title.

      2.The authors used RNA-seq data from tissue-resident NK cells in comparisons with their RUNX2 knockdown and overexpressing NK cells. Do they see elevated RUNX2 transcript levels in tissue-resident NK cells? I don't know if matched circulating NK cell data is available, but such a finding would further strengthen the connection between the tissue residency profile and RUNX2.

      3.The evidence that RUNX2 controls human NK cell development by direct induction of IL-2Rbeta expression is fairly weak. In Figure 2a, it appears as though RUNX2 knockdown didn't significantly affect IL-2Rbeta expression, and RUNX2 overexpression only affected IL-2Rbeta expression at day 7 (not day 14). Furthermore, in Figure 2b, RUNX2 knockdown did not impact IL-2Rbeta expression in YTS cells. The authors speculate that residual RUNX2 may be sufficient to drive IL-2Rbeta expression. This could be tested by knocking out RUNX2 with a CRISPR-Cas9 system. The authors also provide some ChIP-seq data showing that RUNX2 binds to the promoter region of IL2RB in human PBNK cells but do not provide any ChIP data showing enhanced enrichment of RUNX2 within IL2RB in their RUNX2 overexpressing cells.

      Significance

      As an Nk cell biologist suggest returning for major revision and re-evaluation.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This is an interesting study that adds new information regarding a role for RUNX2 in NK cell development. Wahlen et al. present very interesting findings highlighting the role for RUNX2 in the acquisition of a tissue-resident phenotype in differentiating NK cells. The authors demonstrate that RUNX2-I isoform is predominantly expressed in specific human NK cells subsets and that RUNX2 upregulates IL-2Rβ expression in NK cell-committed progenitors. Interesting results integrating CHIP-seq and RNAseq data and basic functional studies show that RUNX2 regulates several genes associated with NK cell tissue homing and recirculation. The authors postulate that RUNX2 regulates the acquisition of a tolerogenic tissue-resident phenotype in human NK cells. There are a number of intriguing observations that should be of interest in the field.

      Major comments:

      -While the results obtained are interesting and scientifically sound, the manuscript does not rigorously prove that RUNX2 is involved in NK cell differentiation and development. The results were obtained using human cells ex vivo and in vitro human HSC-based cultures for NK cell differentiation and development. The authors would need a relevant model in vivo to fully characterize phenotypic and functional features of NK cells in the absence or presence of RUNX2. Such studies would be essential, in particular for the acquisition of a tissue-specific resident phenotype in human NK cells in distinct microenvironments. Furthermore, the authors should also modify extensively the title and several statements across the manuscript regarding the role for RUNX2 in NK cell differentiation and development.

      -Additional evidence for the direct regulation of IL-2Rβ expression by RUNX2 would be helpful

      -The studies showing that RUNX2 negatively regulates granzyme B, perforin expression and IFN-γ and TNF-α secretion are intriguing and could be better explored.

      Minor comments:

      -"Taken together, we deduce from the data that 238 RUNX2 promotes NK cell development by inducing IL-2Rβ expression and thereby enabling 239 NK lineage commitmen" - this is an overstatement

      -"It is well-known that RUNX2 by itself is a relatively weak transcription factor ...." - this statement should be modified.

      Significance

      To date, the role of RUNX2 in NK cell development has not been investigated in mice nor in humans. The findings will contribute to a better understanding of NK cell biology and may help in in improving existing NK cell based therapies in the future. However, lack of relevant in vivo studies diminishes the importance of this work. Further studies are warranted.

      Audience: immunologists

      Expertise in leukemia pathobiology and immunotherapies.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The manuscript titled "the transcription factor RUNX2 promotes the development of human tissue-resident NK cells" addressed a new role of RUNX2 on human NK cell development and phenotypes. It also provides a new insight into how RUNX2 affects human NK cells switch between the circulatory and tissue-resident NK cells.

      Major comments:

      1.The authors described that IL-2Rb and other NK cell receptors are not affected in the RUNX2 shRNA system. However, the evidence is not enough to conclude that RUNX2 controls human NK cell development by direct induction of IL-2Rb expression, for the knockdown system did affect the Granzyme B and perforin expression without affecting EOMES and T-bet. More direct evidence should be provided to address the knockdown or knockout system that has a direct induction of IL-2Rb on NK cell development.

      2.In Fig 1e, n is 4-12 for the experiments. Could the authors provide the reason for that? This may increase the bias to use sample number 4 vs. 12. For example, shRNA system d14 of NK cell result may not reflect the truth by 4 vs. 12.

      Minor comments:

      1.Fig 1a shows the protein expression level of RUNX 2. Fig 1d shows the qPCR level of RUNX2-1. Could more explanation be provided that ST1 and ST2 have lower mRNA levels but higher protein levels than the d0?

      2.Please check the CCR7 flow data in Fig4a circulatory. The histogram did not reflect the bar plot result. It looks like the expression of CCR4 between Ctrl and RUNX2-1 is different.

      3.Please provide more discussion or explanation that the EOMES and T-bet level are not changed in the knockdown system, but the Granzyme B and perforin expression is changed there. Fig 5 only showed CD107a+ percentage of NK; how about the CD107a expression level? Based on the current data, CD 107a expression did not match with GZMB and PRF. Any explanation for that? PRF and GZMB levels changed in the shRNA system, but K562 cells cannot be killed. Any explanation for the way that happened. How about other target cells or viral infection?

      4.Check the period in line 31. The color is different.

      5.Check line 108; it should be "RUNX2 controls human NK cell development."

      Significance

      Defining the role of Runx2 in NK cell development and functions.

      Audience: Researchers working on transcription factors and NK cell biologists.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank the reviewers for their positive and constructive reviews. We have already addressed their major concerns by including additional screen and control data, especially in the new Figure 7, supplemental figures 3 and 4 and new Table S1. We also detail the planned experiments that we propose to perform to address their remaining comments. Some points are mentioned in both section 2 and 3 of the revision plan.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer #1

      • Fig 4 and 5. ARL-8 is localized to lysosomes, phagolysosomes and early endosomes. How about TORC and BORC subunits?*

      We plan to acquire images of our endogenously tagged BORC subunit SAM-4::mSc on the polar body phagolysosome. We also crossed SAM-4::mSc to a GFP::RAB-5 early endosomal marker and plan to cross it to a CTNS-1::mCit lysosomal marker and will characterize their colocalization.

      To look for the localization of TORC1, we requested a recently published strain with a single-copy transgene encoding a fluorescently tagged DAF-15/RAPTOR (AK Sewell et al., iScience 2022), but there is no fluorescence visible in live embryos. We are currently staining these strains to test for any embryonic expression, but it is not uncommon for transgenes to undergo germline silencing, even single-copy transgenes. To observe TORC1 localization in early embryos, it may be necessary to create a knock-in strain which would take a couple of months to generate and another month to cross to various reporters and analyze.

      We predict that TORC1 and BORC will localize to lysosomes and phagolysosomes, consistent with previous literature (Sancak et al. Cell 2010, Pu et al. Dev Cell 2015).

      • Figure 6 *

      *Authors need to establish knockout cell lines by picking up single drug-resistant colonies and characterize each line by western blot or immunofluorescent microscopy. *

      We have been successful in creating stable mutant cell lines for Myrlysin and Lyspersin using CRISPR/Cas9 and are currently characterizing these lines before performing direct assays for phagolysosomal vesiculation.

      1. b) While the authors monitored cell growth every 3 hours for 4 days, the authors showed the result of day 4 only. Time lapse data would be useful.

      We plan to replace the RBC-overfeeding experiment with more direct evidence for phagolysosomal vesiculation (see 3c below).

      1. *c) The effect of KO may be because of defects in phagolysosomes. However, the authors cannot conclude "phagolysosomal vesiculation is affected in mammalian cells" or not until they directly observe the phenomena. *

      We plan to feed our newly isolated Myrlysin and Lyspersin mutant cell lines with RBCs and use a stage-based analysis at defined time points to test how the size and shape of the phagolysosomes change over time, similar to Fig. 1a-c in R Levin-Konigsberg et al. Nature Cell Biol 2019. This experiment will assess the effect of these genes directly on phagolysosomal vesiculation rather than general phagolysosome function.

      Reviewer #1 (Significance):

      This study shows the mechanism [involvement of TORC-BORC-ARL8 pathway] is conserved in phagolysosomes as well in worms. As described above, the involvement of these molecules in the mammalian phagolysosomes is not convincing at this stage.

      We plan to provide direct evidence for the involvement of BORC by using stable mutant cell lines and directly monitoring phagolysosome vesiculation, as described above.

      **Referees cross-commenting**

      *I fully agree with the reviewer #2 that identification of arl8 effectors will make this paper more attractive as I described in my original peer review. Moreover, I agree with the reviewer that genetic data in C. elegans is convincing. *

      As RNAi targeting UNC-116 disrupts embryonic development, including the birth of the polar body during meiosis, we generated a ZF1 degron-tagged UNC-116 allele to test the role of Kinesin-1 in phagolysosomal vesiculation. We are currently characterizing this strain and crossing it to polar body markers. We predict that degron-mediated degradation of UNC-116 will start during the 4-cell stage, which is after polar body birth but well before the onset of phagolysosome vesiculation. This new unc-116::mCitrine::ZF1 will provide us with a tool to more specifically test the role of UNC-116 in phagolysosome vesiculation in the context of a developing embryo.

      * While reviewer #2 have not concerned about the quality of experiments, I'm still not convinced by their mammalian cell experiments. I don't have any more concerns if the authors (1) remove the mammalian study or (2) improve the quality of mammalian data. *

      We hope that our planned experiments with the isolated mutant cell lines will address the reviewer’s concern. Otherwise, we can remove the mammalian experiments.

      Reviewer #2

      **Referees cross-commenting**

      The mammalian work … assays general phagolysosome function rather than directly addressing vesiculation.

      We hope that our planned experiments with the isolated mutant cell lines will directly demonstrate a role in phagolysosomal vesiculation.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty. *

      Reviewer #1

        • It is very difficult to understand which molecules are TORC, BORC and/or BLOC subunits, which molecules are required and which are not required. * It is very helpful if the authors include a table showing the summary of RNAi and mutant phenotypes. We have added Table S1, summarizing our findings with different protein complexes and previous findings regarding lysosome and synaptic vesicle precursor trafficking.
      1. Figure 6 ** a) There are many concerns in mammalian cell experiments. It is not clear whether the knockout procedures really work or not. … *

      We performed genotyping for the pooled lines and found high editing efficiency leading to frame-shifts in Arl8B (>96%) and BLOC1S1 (>85%), as well as Cathepsin B (>67%). These data are now included in supplementary figure S4.

      As the downstream of ARL8 remains elusive in this study, it is unclear how phagolysosomes are tubulated (Fig 6D model).

      ARL8 has been shown to interact with kinesins, dyneins, and the HOPS complex directly or through bridging molecules such as PLEKHM or RUFY proteins.

      Using the reference allele e1265 of the KIF1 ortholog UNC-104, which causes a severe paralysis phenotype, we observed no effect on phagolysosomal vesiculation (new Fig. S3A) or polar body degradation (new Fig. S3B).

      We next used the partial loss-of-function wy270 allele of the KIF5 ortholog UNC-116 but observed no effect on phagolysosomal vesiculation (new Fig. S3A). There was however a mild delay in polar body degradation (new Fig. S3B), leading us to develop a degron-tagged UNC-116 allele to be able to better analyze the role of UNC-116 in phagolysosomal biology (described in section 2).

      We have also included data using RNAi and mutant alleles to test a role for PLEKHM family proteins CUP-14 and RUB-1. We observed no effect on phagolysosomal vesiculation (new Fig. S3A) or polar body degradation (new Fig. S3B). In preliminary data (n=3), knocking down rub-1 in a cup-14 mutant background had no effect on phagolysosomal vesiculation (new Fig. S3A), but sped up polar body degradation (new Fig. S3B). These data are inconsistent with PLEKHM proteins having a role in phagolysosomal vesiculation.

      BLAST searches revealed no clear homologs for RUFY proteins in C. elegans.

      Based on reviewer #2’s suggestion, we also examined a role for the HOPS complex by knocking down the HOPS subunit VPS-41. Human VPS41 has been shown to bind ARL8b (Khatter et al JCS 2015). Treating worms with vps-41 RNAi resulted in normal phagolysosomal vesiculation (new Fig. 7A) but did result in a significant delay in polar body degradation (new Fig. 7B-C). Interestingly, disappearance of small phagolysosomal vesicles was significantly delayed (new Fig. 7D), while corpse membrane breakdown within the large phagolysosome was unaffected (new Fig. 7E). As corpse membrane breakdown depends on RAB-7-mediated fusion of lysosomes (Fazeli et al., Cell Rep 2018) and HOPS promotes lysosomal fusion (JA Nguyen & RM Yates, Front Immunol 2021), these data suggest that VPS-41 and HOPS are preferentially required for lysosome fusion to small phagolysosomal vesicles and are not necessary for lysosome fusion to the large phagolysosome.

      Thus, we screened through the known ARL-8 effectors and the identity of the downstream effector(s) of ARL-8 involved in phagolysosomal vesiculation remain elusive. As the kinesin and PLEKHM data were negative, we had opted not to include it in the original manuscript, but now discuss it in the main text and present it in Fig. 7 and Fig. S3.

      Reviewer #2

      It would be good if the authors could speculate further as to why mTORC1 is required for Arl8 activity but not recruitment, and if there are further experiments that could augment this conclusion they might be helpful.

      We added a new hypothesis to the discussion of how TORC1 might affect a downstream effector of ARL-8. Unfortunately, we have not yet been able to identify any ARL-8 effectors involved in phagolysosome vesiculation to be able to test this hypothesis experimentally.

      *The authors could consider making the study more extensive by applying their simple but elegant system to further possible players in the pathway, and in particular to test some possible Arl8 effectors. Although there is no clear orthologue of PLEKHM2 in C. elegans, both the HOPS complex and PLEKHM1 have been reported to bind to Arl8. *

      As potential SKIP/PLEKHM-related proteins that link ARL8 to kinesins, we screened two RUN and PH domain-containing proteins, CUP-14 and RUB-1, for their role in phagolysosome resolution. Phagolysosome vesiculation was normal and resolution was not delayed in cup-14(cd32) or rub-1(RNAi) mutants or after treating cup-14 mutants with rub-1 RNAi (Fig. S3A-B). These data are now discussed in the main text and included as a supplementary figure.

      To test a role for the HOPS complex, best known for its role in lysosome fusion, we tested whether RNAi knockdown of the HOPS-specific subunit VPS-41 disrupted or delayed phagolysosome vesiculation. Vps41 has been shown to bind ARL8b (Khatter et al. JCS 2015). HOPS knockdown did not affect phagolysosome vesiculation (new Fig. 7A), but significantly delayed polar body degradation (Fig. 7B-C). In particular, vps-41 knockdown delayed the degradation of phagolysosomal vesicles (Fig. 7D), probably by affecting fusion of these small vesicles to additional lysosomes. These data are now discussed in the main text and included as a supplementary figure.

      **Referees cross-commenting**

      *One approach might to drop the mammalian studies, and instead add an investigation of more Arl8 effectors in C. elegans. Hopefully, few people would doubt the relevance to mammals of studies in C. elegans on the function of well conserved proteins. *

      We hope to strengthen the relevance of our findings across species to increase the impact of our work. Therefore, we plan to replace the RBC-overfeeding experiment in Fig. 6 with direct evidence for phagolysosomal vesiculation using established cell culture assays similar to Fig. 1a-c in R Levin-Konigsberg et al. Nature Cell Biol 2019 and newly isolated BORC mutant cell lines.

      We added details on our screen for ARL-8 effectors to the manuscript, including new figures (Fig. S3 and Table S1). While the effectors involved in phagolysosomal vesiculation remain a mystery, we were able to distinguish different requirements for HOPS during corpse membrane breakdown and phagolysosomal vesicle resolution, suggesting that HOPS is critical for the lysosomal fusion of small phagolysosomal vesicles, but not the large cell corpse phagosome.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors have previously used C. elegans early development to establish an elegant model system with which to investigate process by which the content of phagolysosomes is degraded and the structure resolved. This phagocytosis plays a central role in the clearance of dead cells and pathogens and so the work is likely to be of widespread interest to those working in both C. elegans and mammalian phagocytic cells like macrophages. The authors have previously shown that the tubulation and fragmentation of the phagolysosome is important for both degradation of contents and resolution of the phagolysosome, and have identified the small GTPase Arl8 and the mTor kinase as being required. This study investigates the relationship between these components and also adds further players to the pathway. In summary, they show that there is a pathway starting with amino acid release and going through mTORC1 and then the BORC complex that is known to activate Arl8. They show that only some subunits of BORC are required (and not the related BLOC complex with which is shares some subunits), and that Arl8 needs to cycle between its GDP- and GTP-bound states to exert its effects. They also examine the membrane recruitment of Arl8 and make two interesting findings. Firstly, mTORC1 is required for Arl8 activity but not its localisation, possibly suggesting that mTORC1 is required for the activity of a critical Arl8 effector. Secondly, they find that when BORC is removed, Arl8 is recruited to endosomes, which implies the existence of second, as yet unknown, GEF for Arl8 that acts on endosomes.

      Significance

      Overall the data are clear and convincing, with many conclusions based on genetic mutations, and the results carefully quantified. There seems little required to be done to improve the experiments that are presented, although it would be good if the authors could speculate further as to why mTORC1 is required for Arl8 activity but not recruitment, and if there are further experiments that could augment this conclusion they might be helpful.

      My only substantial suggestion, is that the authors could consider making the study more extensive by applying their simple but elegant system to further possible players in the pathway, and in particular to test some possible Arl8 effectors. Although there is no clear orthologue of PLEKHM2 in C. elegans, both the HOPS complex and PLEKHM1 have been reported to bind to Arl8. Thus it would be interesting to see if either of these is required in this process and at what step. If suitable mutants are available, then hopefully these experiement would take c 2-3 months and be relatively inexpensive as they would involve worm breeding and fluorescent-microscopy.

      My expertise includes membrane traffic and small GTPases but not phagocytosis or C.elegans.

      Referees cross-commenting

      The mammalian work is rather brief but it does seem to have involved collaborators with relevant experience. Nonetheless, it assays general phagolysosome function rather than directly addressing vesiculation. One approach might to drop the mammalian studies, and instead add an investigation of more Arl8 effectors in C. elegans. Hopefully, few people would doubt the relevance to mammals of studies in C. elegans on the function of well conserved proteins.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Phagocytosis is fundamental to innate immunity and tissue homeostasis. C. elegans is a good system to study phagolysosomal dynamics. Here, Fazeli et al. study the molecular mechanism of phagosysosomal tubulation and vesiculation using C. elegans. The authors show the involvement of TORC in the vesiculation of phagolysosomes (Fig 1). Upstream regulatior of TORC would be amino acid release because knockdown of slc-36.1 decreased the number of fission events. BORC is also required for phagolysosome vesiculation (Fig 2). Interestingly, essential subunits are different from synaptic vesicle transport and lysosomal transport. ARL-8, a small GTPase downstream of BORC, is also essential for phagolysosome vesiculation (Fig 3). The nucleotide cycle of ARL-8 is essential for the localization of ARL-8 on phagolysosomes (Fig 4). BORC is required for the vesicular localization of ARL-8 (Fig 5). Finally the authors performed an experiment to show this pathway is conserved in mammalian macrophages (Fig 6).

      Genetic experiments in worms are solid while mammalian cell experiments are not. This reviewer thinks the authors need to improve the quality of cell line experiments.

      Major comments

      1. A lot of components are analyzed in this paper. Then, it is very difficult to understand which molecules are TORC, BORC and/or BLOC subunits, which molecules are required and which are not required. It is very helpful if the authors include a table showing the summary of RNAi and mutant phenotypes. It is very helpful if the table include lysosomal phenotypes and synaptic vesicle phenotypes as well.
      2. Fig 4 and 5. ARL-8 is localized to lysosomes, phagolysosomes and early endosomes. How about TORC and BORC subunits? Are there differences in the localization of essential subunits and non-essential subunits?
      3. Figure 6
        • a) There are many concerns in mammalian cell experiments. It is not clear whether the knockout procedures really work or not. Methods section says the authors pooled puromycin resistant cells. This is not general protocol to establish knockout cell lines. Generally, some drug resistant cells are not always complete KO. Authors need to establish knockout cell lines by picking up single drug-resistant colonies and characterize each line by western blot or immunofluorescent microscopy.
        • b) While the authors monitored cell growth every 3 hours for 4 days, the authors showed the result of day 4 only. Time lapse data would be useful.
        • c) The effect of KO may be because of defects in phagolysosomes. However the authors cannnot conclude "phagolysosomal vesiculation is affected in mammalian cells" or not until they directly observe the phenomena.

      Significance

      TORC-BORC-ARL8 pathway has been shown in lysosomal transport in mammalian cells (Pu et al, 2015, 2017). It has been shown that BORC is required for the localization of ARL8 in the case of worm synaptic vesicles and mammalian lysosomes (Pu et al, 2015; Niwa et al., 2016). This study shows the mechanism is conserved in phagolysosomes as well in worms. As described above, the involvement of these molecules in the mammalian phagolysosomes is not convincing at this stage.

      In the case of lysosomes and synaptic vesicles, the effector of ARL8 (downstream motors) has been shown (Pu et al., 2015; Niwa et al., 2016). It is interesting observation that the nucleotide cycle of ARL8 is essential for the phagolysosomal fission. However, as the downstream of ARL8 remains elusive in this study, it is unclear how phagolysosomes are tubulated (Fig 6D model).

      Referees cross-commenting

      I fully agree with the reviewer #2 that identification of arl8 effectors will make this paper more attractive as I described in my original peer review. Moreover, I agree with the reviewer that genetic data in C. elegans is convincing. While reviewer #2 have not concerned about the quality of experiments, I'm still not convinced by their mammalian cell experiments. I don't have any more concerns if the authors (1) remove the mammalian study or (2) improve the quality of mammalian data.

  4. Apr 2022